NVIDIA Fixes Blackwell: A Swift Response to the GPU Issue

12:57, 24.10.2024

Article Content

  • Chip Improvements and TSMC’s Role
  • Mass Production of the Updated Chips

NVIDIA CEO Jensen Huang has acknowledged a design flaw in the Blackwell-series GPUs that delayed shipments of the company's AI chips. The flaw was a functional defect that resulted in a low yield of working chips. According to Huang, the fault lay entirely with NVIDIA, not with its manufacturing partner TSMC, as some sources had suggested. He emphasized that TSMC did not cause the problem and in fact played an active role in helping to fix it.

Chip Improvements and TSMC’s Role

The issue was resolved by modifying the upper metal layers and the bumps of the GPU silicon, which improved the yield of working chips. The fix required significant effort because seven different types of chips had to be developed from scratch and brought into production at the same time. The main challenges were associated with the CoWoS-L packaging technology, which combines an RDL interposer, LSI (local silicon interconnect) bridges, and the GPU chiplets. These components expand at different rates when heated, and the mismatch deformed the assembled package. Such fixes typically take around 10 cycles, but NVIDIA and TSMC managed to resolve the issue in record time.

Mass Production of the Updated Chips

The updated Blackwell B100 and B200 GPUs are set to enter mass production by the end of October 2024, with shipments expected to begin in early 2025. Even as production of the revised chips ramps up, NVIDIA still anticipates a shortage of high-performance GPUs in 2024, particularly for major cloud providers such as AWS, Google, and Microsoft.
