Four years ago, we split GA100 into two halves that communicate through an interconnect. It was a big move - and yet barely anyone noticed, thanks to amazing work from CUDA and the GPU team.
Today, that work comes to fruition with the Blackwell launch. Two dies. One awesome GPU.
Look closely & you'll see that GH100 (and GA100) are built from two halves that communicate through a split L2. This improves scalability, although I worried it would be hard to program.
The transition was smoother than I expected, thanks to some SW and HW magic! 🧙♀️🧙♂️🪄