1
00:00:00,000 --> 00:00:05,520
How many gigabytes does your hard drive or SSD hold? Are you sure about that?

2
00:00:05,520 --> 00:00:10,320
We've already discussed how over-provisioning and file system overhead often mean that the usable

3
00:00:10,320 --> 00:00:14,720
capacity of your drives is less than what's advertised, and you can check out this video

4
00:00:14,720 --> 00:00:19,120
to learn more. But it turns out there's another way that you can be easily fooled,

5
00:00:19,760 --> 00:00:23,440
the fact that the term Gigabyte itself can be ambiguous.

6
00:00:23,440 --> 00:00:27,600
Many of us think of a Gigabyte as exactly 1 billion bytes,

7
00:00:27,680 --> 00:00:32,560
however, it's a commonly accepted practice in the industry to instead use Gigabyte to mean

8
00:00:33,120 --> 00:00:38,160
this many bytes. If you look closely, that's a difference of about 74 megabytes,

9
00:00:38,160 --> 00:00:43,680
which is not an insignificant gap. So why does the industry have to make things so complicated?

10
00:00:43,680 --> 00:00:48,560
To get the answers, we have to look way back to the early days of modern computing.

11
00:00:48,560 --> 00:00:54,560
As early as the 1960s, binary addressing was already the standard for a working memory or RAM.

12
00:00:54,560 --> 00:00:58,480
You see, when a computer puts something into memory, that data is assigned to a particular

13
00:00:58,480 --> 00:01:03,520
address so the system can recall where it's stored. Binary addressing means that each

14
00:01:03,520 --> 00:01:09,680
memory address is expressed in binary digits, that is ones and zeros. So for example, if you have a

15
00:01:09,680 --> 00:01:18,000
10-bit address, there are two to the 10th power or 1024 possible addresses, not exactly 1000.

16
00:01:18,000 --> 00:01:24,960
It therefore became convenient to refer to 1024 bits as a kilo bit, or the same number of bytes

17
00:01:24,960 --> 00:01:30,800
as a kilo byte, even though those uses didn't strictly line up with the meaning of the kilo

18
00:01:30,800 --> 00:01:36,560
prefix as it's used in fields outside of computer science. We've also seen this at times with mass

19
00:01:36,560 --> 00:01:42,240
storage devices. During the 1980s, when the IBM PC platform was becoming popular, the industry

20
00:01:42,240 --> 00:01:48,000
needed to decide on a standard sector size. And if you didn't know, a sector is the smallest

21
00:01:48,000 --> 00:01:54,080
amount of data that can be stored at one time on a drive. The powers that be settled on making 512

22
00:01:54,080 --> 00:02:00,720
bytes the default size for one sector. And if you put two sectors together, you have 1024 bytes,

23
00:02:00,720 --> 00:02:04,960
which matched the binary addressing definition of kilo byte we discussed earlier.

24
00:02:05,760 --> 00:02:10,960
Even today, when we're using much larger drive sizes, sectors are still defined by sizes and

25
00:02:10,960 --> 00:02:17,920
powers of two, with the sector today usually coming in at 4096 bytes. However, the storage

26
00:02:17,920 --> 00:02:24,400
industry today typically uses the more common exactly one billion bytes definition of Gigabyte,

27
00:02:24,400 --> 00:02:29,760
since this definition allows drive makers to put larger capacities on their packaging and claim

28
00:02:29,760 --> 00:02:34,640
slightly faster speeds on their spec sheets. But wouldn't it be nice to know exactly what you're

29
00:02:34,640 --> 00:02:40,400
getting when you go out and buy a drive or memory card? Thankfully, more precise terms were published

30
00:02:40,400 --> 00:02:47,520
in 1999 with words like megabyte and gibby byte that unambiguously refer to units that are powers

31
00:02:47,520 --> 00:02:53,600
of two with the by or bi in the middle standing for binary. If you want to write that out as

32
00:02:53,600 --> 00:02:59,120
abbreviation, sticking a lowercase i between the capital G and the capital B is the correct way to

33
00:02:59,120 --> 00:03:04,640
do it. However, this obviously hasn't solved the problem completely. The more common words like

34
00:03:04,720 --> 00:03:11,280
megabyte and Gigabyte can refer to data units in either system, and the words with by or b in

35
00:03:11,280 --> 00:03:16,240
the middle haven't gained a whole lot of traction outside of tech circles. In fact, there have even

36
00:03:16,240 --> 00:03:22,000
been lawsuits over how these terms are used, with customers claiming that they got less storage than

37
00:03:22,000 --> 00:03:26,880
they paid for in certain circumstances. And as storage continues to grow and grow, this could

38
00:03:26,880 --> 00:03:32,720
become a more significant issue. For example, there's about a 125 terabyte difference between

39
00:03:32,720 --> 00:03:38,640
a petabyte and a pebby byte, but maybe we'll just constantly be storing such large files on

40
00:03:38,640 --> 00:03:42,880
our devices that these differences won't matter very much. Personally, I'm looking forward to

41
00:03:42,880 --> 00:03:48,000
sharing food photos on Instagram in 16k with encoded smell data. So thanks for watching guys,

42
00:03:48,000 --> 00:03:52,080
if you liked this video, give us a like, hey, hit subscribe, why not? And be sure to hit us up in

43
00:03:52,080 --> 00:03:56,000
the comments section with your suggestions or topics that we should cover in the future.
