WEBVTT

00:00:00.080 --> 00:00:08.160
in an absolutely unprecedented move Intel has given me access to the

00:00:04.960 --> 00:00:10.960
engineers in their soc validation lab

00:00:08.160 --> 00:00:14.960
wow that is so cool i absolutely love it okay gotta go bye

00:00:12.800 --> 00:00:19.039
this is an opportunity to talk to some of the smartest people in the world

00:00:17.199 --> 00:00:24.240
so i immediately did what anyone would do i asked them

00:00:21.199 --> 00:00:25.840
what came first the chicken or the egg

00:00:24.240 --> 00:00:29.519
that's simple you must build both the chicken and an egg in parallel to make

00:00:27.279 --> 00:00:33.200
sure they're compatible with each other an

00:00:30.439 --> 00:00:37.280
unconventional answer but one that makes sense when you consider the cutting edge

00:00:35.200 --> 00:00:42.239
work that they do inside these walls take pci express gen 5 for example

00:00:40.320 --> 00:00:46.320
how can you build a compatible CPU and motherboard when there are no devices to

00:00:44.320 --> 00:00:50.320
test them with well you create one from scratch

00:00:49.120 --> 00:00:54.399
so cool just like i created this segway to our

00:00:52.160 --> 00:00:58.000
sponsor i fix it for repairs on the go ifixit has you covered find out more

00:00:56.239 --> 00:01:03.359
about the ultra portable minnow and moire sets and how they can make your

00:00:59.680 --> 00:01:03.359
repairs easier at the end of the video

00:01:10.400 --> 00:01:15.040
i was so blown away by the insane gear

00:01:13.600 --> 00:01:19.040
that they have in the validation lab when i was shooting my general tour of

00:01:16.880 --> 00:01:23.840
the israel development center that i immediately cancelled my return flight

00:01:21.439 --> 00:01:28.400
so that i could come back another day and get you guys a closer look at it

00:01:25.680 --> 00:01:34.320
this lab's goal is twofold first they need to test every possible function of

00:01:30.799 --> 00:01:37.280
the soc from logic to memory to

00:01:34.320 --> 00:01:41.680
integrated GPU to connectivity and second they need to test at scale to

00:01:40.400 --> 00:01:46.720
ensure that their results are going to be applicable across thousands hundreds

00:01:44.240 --> 00:01:50.640
of thousands or even millions of units of processors because it's the nature of

00:01:48.960 --> 00:01:55.759
the silicon business that there's going to be a spectrum of behavior from the

00:01:53.280 --> 00:02:00.960
silicon parts and you've got a test at scale to ensure that this whole spectrum

00:01:58.320 --> 00:02:04.960
is validated that's why this place is absolutely packed to the gills with test

00:02:03.920 --> 00:02:10.879
benches most of the ones we're looking at in here have the same core components a

00:02:08.640 --> 00:02:16.000
12th gen alder light processor on top of one of Intel's rvp or reference

00:02:13.599 --> 00:02:21.040
validation platform motherboards which man these things include some really

00:02:18.959 --> 00:02:24.400
cool functionality for like remote control and testing and all kinds of

00:02:22.800 --> 00:02:28.959
stuff i'm actually planning another follow-up video focusing just on this

00:02:26.560 --> 00:02:31.840
bad boy including a look at the liquid nitrogen cooled board that they've got

00:02:30.400 --> 00:02:35.760
in the overclocking lab so get subscribed to make sure you don't miss it

00:02:33.760 --> 00:02:39.519
beyond the base platform though there is a broad spectrum of different functions

00:02:37.920 --> 00:02:42.800
that the lab might want to look at that require different specialized equipment

00:02:42.000 --> 00:02:49.599
so let's start over here pci express gen 5 is one of the key

00:02:47.040 --> 00:02:53.440
selling points of 12th gen an Intel knew going into it that they were going to be

00:02:51.360 --> 00:02:58.080
well ahead of any consumer need for this kind of bandwidth it's twice as fast as

00:02:55.920 --> 00:03:02.239
gen 4 lane for lane and even ahead of market availability of any compatible

00:03:00.080 --> 00:03:06.319
devices but they did it anyway because there are advantages to being the market

00:03:04.080 --> 00:03:10.560
leader on new technologies not only does this kind of future compatibility give

00:03:08.319 --> 00:03:15.440
Intel a valuable marketing bullet point for their platform but it's also very

00:03:12.800 --> 00:03:20.000
likely that as NVIDIA or even AMD are developing their next-gen pci express

00:03:17.519 --> 00:03:24.080
gen 5 gpus that they're going to end up using Intel's alder lake platform for

00:03:22.000 --> 00:03:28.000
their own internal validation making it much more likely that any early quirks

00:03:26.400 --> 00:03:32.080
are going to be accounted for before those products ship to consumers

00:03:30.159 --> 00:03:35.519
of course living on the bleeding edge like that means that you've got to get

00:03:33.840 --> 00:03:40.640
creative sometimes okay this is a bit of an aside but this

00:03:37.920 --> 00:03:44.640
is my first hands-on with a pci express gen 5 device i mean not just me this is

00:03:43.040 --> 00:03:50.560
almost anyone's first hand on with the pci express gen 5 device this is so cool

00:03:48.480 --> 00:03:56.720
and immediately there are some noticeable physical changes compared to

00:03:52.959 --> 00:03:58.560
pci express gen 4. for example in gen 4

00:03:56.720 --> 00:04:02.319
here at the front of the connector only the presence pins were shorter so these

00:04:01.120 --> 00:04:07.760
tell the motherboard that there's a device installed but in gen 5 the

00:04:05.120 --> 00:04:11.760
presence pins and the power pins are shorter as well and this seems to be a

00:04:09.840 --> 00:04:17.600
result of what we've learned about hot plugging from years of Thunderbolt and

00:04:14.480 --> 00:04:19.759
NVMe storage devices so this change

00:04:17.600 --> 00:04:24.720
ensures that all the data connections are complete before the board powers up

00:04:22.880 --> 00:04:28.720
there are also some obvious changes to the data pin layout presumably to

00:04:26.720 --> 00:04:33.199
improve signal integrity because when we double our speed it means that we are

00:04:30.800 --> 00:04:37.120
far more sensitive to signaling issues you can actually see more detail about

00:04:34.960 --> 00:04:42.479
this in our HDMI cable tester video showing how the eye works and

00:04:40.000 --> 00:04:44.880
what happens if the eye gets too small you must be thinking though

00:04:44.080 --> 00:04:50.479
okay cool card but what does it do

00:04:48.240 --> 00:04:55.520
the answer is nothing but also

00:04:51.840 --> 00:04:57.680
everything there's no GPU or storage or

00:04:55.520 --> 00:05:02.800
network controller on here nothing like that the card is non-functioning in that

00:05:00.000 --> 00:05:09.199
sense but what it does do is generate and monitor traffic or errors and it can

00:05:05.840 --> 00:05:11.520
do so at every layer link physical

00:05:09.199 --> 00:05:14.639
transactional giving Intel's engineers almost full control of what they want to

00:05:13.520 --> 00:05:20.560
do with it most of these functions can be programmed in band so that is to say

00:05:18.240 --> 00:05:24.400
over the pci express bus and then the test can be executed but some of them

00:05:22.560 --> 00:05:29.600
are actually handled with dip switches that can do things like limit the speed

00:05:27.199 --> 00:05:34.000
the pci express generation the card runs at uh limit the number of lanes that are

00:05:31.759 --> 00:05:39.199
available i mean obviously you don't want to go build a whole bunch of

00:05:35.680 --> 00:05:41.039
different physical cards for 1x 4x 8x

00:05:39.199 --> 00:05:45.840
and 16x right another thing you don't want to build is custom silicon every

00:05:43.440 --> 00:05:50.560
time you find a hardware bug that's why this card uses a field programmable gate

00:05:48.320 --> 00:05:55.280
array or an fpga fpgas are prohibitively expensive

00:05:52.800 --> 00:05:59.840
compared to asics or normal computer chips so you'll only rarely see them in

00:05:57.520 --> 00:06:03.840
consumer products apple's afterburner accelerator card is an example but when

00:06:02.320 --> 00:06:08.479
you're developing a product that is meant to follow a specification that

00:06:06.400 --> 00:06:12.400
ain't finalized yet fpgas are a must

00:06:10.720 --> 00:06:17.160
because like it says in the name rather than being locked into a particular

00:06:14.319 --> 00:06:22.560
configuration they're actually hardware reprogrammable in the field allowing the

00:06:19.600 --> 00:06:27.120
engineers to make changes to the chip to address any shortcomings without wasting

00:06:24.639 --> 00:06:31.840
months designing and manufacturing a new asic every single time

00:06:29.440 --> 00:06:35.680
now i asked about the 8-pin power connector here and it turns out that

00:06:33.600 --> 00:06:40.720
it's not actually a requirement for the onboard fpga but rather it's there to

00:06:38.400 --> 00:06:45.520
validate external power functionality and as for the RAM slot they were

00:06:42.800 --> 00:06:49.840
extremely cagey about the RAM slot something something another mode we

00:06:47.360 --> 00:06:54.000
don't use i had kind of hoped it was for something

00:06:51.520 --> 00:06:56.400
like testing throughput to a known high bandwidth device or something like that

00:06:55.680 --> 00:07:00.080
but Intel has super high performance fpgas

00:06:59.120 --> 00:07:06.000
so obviously they're using their own so they wouldn't really need that

00:07:04.400 --> 00:07:11.840
i don't know this is really cool one of the big

00:07:08.880 --> 00:07:17.280
validation challenges today is usb type-c with all the different power

00:07:14.319 --> 00:07:21.919
envelopes and protocols DisplayPort native usb Thunderbolt not to mention

00:07:20.000 --> 00:07:26.960
daisy chaining with the latter we've reached a point of exponential possible

00:07:24.880 --> 00:07:30.800
configuration growth to the point where it is

00:07:27.919 --> 00:07:34.000
practically speaking impossible to test every single combination but if you've

00:07:32.479 --> 00:07:39.680
got one of these you can do an awful lot it doesn't

00:07:36.639 --> 00:07:41.680
really have a name but this usbc test

00:07:39.680 --> 00:07:45.599
rig is super cool each one of these wings hanging off of

00:07:43.599 --> 00:07:49.599
the test bench here has a variety of devices hanging off of it and they're

00:07:47.360 --> 00:07:55.360
both internally designed ones and commercially available ones i see some

00:07:51.840 --> 00:07:57.360
storage cards i see what looks like

00:07:55.360 --> 00:08:00.720
well i see more storage cards actually i don't recognize most of them then up

00:07:58.879 --> 00:08:04.639
here is the brains this thing is really cool typically what

00:08:02.800 --> 00:08:10.000
the engineer would do is they would plug this port here into the motherboard and

00:08:07.199 --> 00:08:14.080
then they connect anything they want to all of these ports up here and

00:08:12.080 --> 00:08:19.680
everything else is remotely configurable through software so you want to change

00:08:16.240 --> 00:08:21.039
from usb to Thunderbolt mode done

00:08:19.680 --> 00:08:26.560
want to connect this guy right here specifically

00:08:23.280 --> 00:08:28.879
easy how about this one and this one no

00:08:26.560 --> 00:08:32.719
problem what if you want to simulate the effects of someone plugging in and

00:08:31.120 --> 00:08:36.959
unplugging a device a thousand times

00:08:34.560 --> 00:08:41.760
well you could make your new hires do it or if you're Intel you could create a

00:08:39.200 --> 00:08:47.440
robot for that but the lab folks are all about working smarter not harder right

00:08:44.640 --> 00:08:51.519
guys yeah yeah that's what i thought now of course the main point of this kind of

00:08:49.440 --> 00:08:56.959
automation is to allow the text to run a much broader array of configurations and

00:08:54.560 --> 00:09:01.760
much more quickly which you need because the real world is full of scenarios that

00:08:59.279 --> 00:09:06.320
are far more challenging than plugging in and unplugging a device from a

00:09:03.920 --> 00:09:10.800
powered system like let's say a machine goes to sleep with one device

00:09:08.560 --> 00:09:15.200
and then wakes up with another like you're at work plugged into your usb

00:09:12.800 --> 00:09:20.160
dock you shut the lid on your laptop unplug throw it in your bag go home then

00:09:17.680 --> 00:09:23.200
you plug in an external display and fire up your machine

00:09:21.360 --> 00:09:28.800
well the system has to be able to figure out what the heck just happened and who it

00:09:26.880 --> 00:09:31.760
just woke up with i call this one the hangover scenario

00:09:30.480 --> 00:09:34.000
what do you think is that nickname gonna stick

00:09:34.240 --> 00:09:40.880
of course you guys might have noticed i talked about displays but we're not

00:09:38.880 --> 00:09:46.399
actually plugged into any type c displays

00:09:42.320 --> 00:09:49.040
that is where this little guy comes in

00:09:46.399 --> 00:09:53.440
welcome to times square get it because there's displays

00:09:50.800 --> 00:09:57.360
everywhere so clearly they do have parts in the lab that are dedicated to display

00:09:55.519 --> 00:10:01.600
testing full of all kinds of different monitors so they can ensure that you

00:09:59.120 --> 00:10:04.640
don't get flicker on a high resolution monitor when you plug in the onboard GPU

00:10:03.519 --> 00:10:09.120
for example but for what i think are fairly obvious

00:10:06.959 --> 00:10:15.279
reasons it's not practical to fill the whole lab with 65-inch tvs just so you

00:10:12.480 --> 00:10:20.560
can make sure that HDMI 2.1 works over on your usb test bench right so what

00:10:17.600 --> 00:10:24.000
they do is they use this little dongle called the dock

00:10:22.440 --> 00:10:29.519
d-o-k and what it does is it rips the edit

00:10:26.480 --> 00:10:31.760
information off of any display the e-did

00:10:29.519 --> 00:10:35.360
is kind of like the identifier of that particular model that reports its

00:10:33.519 --> 00:10:40.720
capabilities back to the connected device then what it does is it emulates it

00:10:38.720 --> 00:10:45.839
making your device think that that monitor is connected

00:10:42.800 --> 00:10:47.760
definitely a space saver there are some

00:10:45.839 --> 00:10:51.200
areas where you just can't take shortcuts though

00:10:49.040 --> 00:10:55.839
part of silicon behavior is influenced by its thermal environment so part of

00:10:53.519 --> 00:11:00.399
their testing methodology is of course to alter that environment i call this

00:10:58.240 --> 00:11:03.920
guy the thermal doodad and honestly

00:11:02.240 --> 00:11:08.160
i don't know much about it because the expert on this didn't happen to be here

00:11:06.000 --> 00:11:12.560
but what i do know is that it's a hydropneumatic cooling system which

00:11:10.240 --> 00:11:16.560
means other than that those words mean water

00:11:14.959 --> 00:11:20.320
and air i'm not sure and you can actually see

00:11:18.720 --> 00:11:27.040
that it's hooked up to the building's supply of both water and air i mean it's

00:11:24.240 --> 00:11:31.120
no LTT Store.com water bottle but i guess it's pretty cool

00:11:29.200 --> 00:11:36.160
and the control panel over here can either be adjusted manually or remotely

00:11:33.920 --> 00:11:41.040
and it'll actually change the parameters of the Intel control mechanism which is

00:11:38.560 --> 00:11:45.279
over here behind the test bench so this guy can raise and lower coolant

00:11:43.279 --> 00:11:50.720
temperatures on command allowing the techs to simulate a wide variety of

00:11:47.920 --> 00:11:55.360
different real world use cases let's say you turn on your machine outside on a

00:11:53.120 --> 00:12:01.279
freezing cold day so they've got to have the CPU be able to start cold and then

00:11:58.720 --> 00:12:05.120
get hot or maybe you take your machine out of the sweltering heat into an

00:12:03.279 --> 00:12:09.120
air-conditioned room well they've got to build a profile for the silicon's

00:12:06.720 --> 00:12:12.320
behavior under those conditions too and the thermal head

00:12:10.560 --> 00:12:18.880
that is such a boring engineer name by the way guys i'm gonna stick with the the doodad

00:12:16.079 --> 00:12:23.200
allows them to do just that which surely makes you wonder

00:12:20.959 --> 00:12:28.240
what do they do when they find these kinds of problems that must exist in the

00:12:25.519 --> 00:12:34.399
hardware i mean most of the chips in the validation lab here are going to be es1

00:12:30.720 --> 00:12:36.240
or es2 not final specs so bugs and

00:12:34.399 --> 00:12:42.000
undesirable behavior are part of the territory i am so glad you asked because

00:12:39.839 --> 00:12:46.399
i saved one of the coolest parts of the full tour for this follow-up video and

00:12:44.399 --> 00:12:51.760
we're gonna run down right now to the debug lab

00:12:48.959 --> 00:12:55.040
wait experts showed up ah okay so the red tubes are carrying water that could

00:12:53.440 --> 00:13:00.160
be as cold as like below zero okay way below the ambient

00:12:57.760 --> 00:13:05.279
temperature in this room so the air in the blue tubes here is actually to blow

00:13:02.800 --> 00:13:08.959
around the CPU socket and prevent any ice from forming or condensation or

00:13:07.600 --> 00:13:13.680
anything like that when they're going well below ambient temperature

00:13:11.360 --> 00:13:17.440
super cool okay now we're leaving when you need to dig as deep as possible

00:13:15.920 --> 00:13:23.839
into the silicon the debug lab is the place to go why

00:13:20.880 --> 00:13:28.800
because it's the home of the lada the laser assisted

00:13:26.160 --> 00:13:33.680
device alteration i don't remember that the name is not what's important what

00:13:30.720 --> 00:13:39.920
matters is how freaking cool this thing is it fires high powered lasers into the

00:13:37.200 --> 00:13:44.000
silicon like into the pnp junction so you can actually change the properties

00:13:41.920 --> 00:13:48.639
of the chip say for example you want to change the timing of an individual

00:13:46.079 --> 00:13:53.040
transistor or the voltage required to open it if the engineers want to see how

00:13:51.360 --> 00:13:57.600
the chip will behave in a way that's different from how they originally

00:13:54.560 --> 00:13:58.480
designed and fabbed it this can do it

00:13:57.600 --> 00:14:05.680
what like this bears repeating because it's basically science fiction you take a

00:14:02.480 --> 00:14:08.639
chip that's already fabbed you fire a

00:14:05.680 --> 00:14:12.639
freaking laser beam into it and it acts differently allowing you to test the

00:14:10.399 --> 00:14:18.079
outcome of a potential design change before you fab a whole new design

00:14:15.760 --> 00:14:23.040
absolutely unreal this next one's almost as fun

00:14:20.079 --> 00:14:27.360
unfortunately i'm not quite allowed to show it to you so we've got a wall of

00:14:24.959 --> 00:14:32.560
Intel employees in front of it but i'll describe it the irem is kind of like a

00:14:29.839 --> 00:14:36.720
flear thermal camera but for electrical activity so you want to see a gate open

00:14:35.279 --> 00:14:41.360
or close you want to monitor the activity of that gate when i say gate

00:14:38.959 --> 00:14:45.279
i'm talking about a single individual transistor one of billions on a chip

00:14:44.000 --> 00:14:50.240
like alder lake this is where you do it so the engineers

00:14:47.519 --> 00:14:54.160
can get these images of how the gates behave allowing them to say take an

00:14:52.560 --> 00:14:59.600
issue that comes up in the validation lab isolate it sometimes to the

00:14:57.199 --> 00:15:04.240
transitory that's problematic then fix it and move on with the debug and get

00:15:01.760 --> 00:15:09.360
closer to delivery and after everything we've seen i think the thing that blows

00:15:06.560 --> 00:15:15.199
my mind the most is that Intel has spent so many years talking about anything

00:15:12.320 --> 00:15:20.079
other than how cool this stuff is in my humble opinion their biggest pr

00:15:18.240 --> 00:15:24.800
problem is that they've let business people do too much of the talking while

00:15:22.320 --> 00:15:28.480
the gear heads have been quite literally locked away in the basement

00:15:26.560 --> 00:15:34.480
but i'm hoping that after these videos go up those days will be firmly behind

00:15:32.079 --> 00:15:39.600
us i mean honestly i don't see it going down any other way because literally

00:15:37.519 --> 00:15:44.639
every single person i met here was so enthusiastic and passionate about what

00:15:41.839 --> 00:15:48.959
they do that it's it's infectious i mean take dennis my camera operator he

00:15:46.959 --> 00:15:51.680
doesn't give too hot it's about cpus and he still managed to have a blast in this

00:15:50.800 --> 00:15:57.279
place the pride that these folks take in their work is inspiring and it's absolutely

00:15:55.920 --> 00:16:00.880
justified i mean in spite of the challenges of covet 19 they managed to

00:15:59.279 --> 00:16:05.839
hit their delivery estimate on alder lake to the week that they predicted

00:16:03.839 --> 00:16:09.600
it's mind-blowing especially now that you've seen all the steps involved am i

00:16:07.440 --> 00:16:14.959
right just like my sponsor segways blow your mind i fix it you break it i fix it

00:16:12.560 --> 00:16:18.399
not me of course but i fix it moire and minnow kits are the tool kits for the

00:16:16.399 --> 00:16:23.199
tinkerer on the go the pocket size minnow driver kit is only 14.99 with an

00:16:20.800 --> 00:16:27.759
easy to open magnetized case a built-in sorting tray 16 different bits and a

00:16:25.440 --> 00:16:31.680
handle with a built-in e-sim eject tool pretty fancy for something slightly

00:16:29.440 --> 00:16:35.920
bigger and longer the more a driver kit is only 1999 and comes with 32 different

00:16:34.160 --> 00:16:39.360
bits with extended reach next for digging into those hard-to-reach nooks

00:16:37.600 --> 00:16:42.240
and crannies and all ifixit kits come with a lifetime warranty as well so

00:16:40.880 --> 00:16:45.600
you're sure to end up in a landfill somewhere before your ifixit kit does so

00:16:44.320 --> 00:16:51.040
check out our links in the description to get yours today if you guys enjoyed this video all i got to do is point you

00:16:49.279 --> 00:16:56.880
to some of the other videos we've done maybe the fab tour go check it out it's

00:16:53.839 --> 00:16:56.880
gonna blow your mind
