Nvidia’s Grace CPU knows no rivals
Powered by Arm Neoverse N2 cores, the Grace CPU will be used in Nvidia’s Superchips with both CPU+CPU and CPU+GPU versions. The company had showcased the GH200 GPU, which has the world’s fastest HBM3e memory, in the past months and positioned it in the Grace Hopper Superchip solution. Another pillar of this platform will be the Grace CPU, as the name suggests.
Some of Grace’s highlights are as follows:
- High performance CPU for HPC and cloud computing
- Superchip design with up to 144 Arm v9 CPU cores
- World’s first LPDDR5x with ECC memory and 1TB/s total bandwidth
- 900GB/s compatible interface, 7x faster than PCIe Gen 5
- Twice the packing density of DIMM-based solutions
- 2x more performance per watt than today’s leading CPU
- Support for all Nvidia software stacks and platforms, including RTX, HPC, AI and Omniverse
We know the features of the Grace CPU from the Grace Superchip configuration. Accordingly, Nvidia’s solution offers 144 (72 Arm Neoverse V2 per chip) cores, up to 1TB/s raw bandwidth, up to 960GB of LPDDR5X memory and 500W power consumption. Additional features include 117MB of L3 cache, 58 PCIe Gen5 lanes, and the TSMC 4N manufacturing process.
Intel is far behind
However, the solutions get even more interesting when compared to a true large-scale data center application. A 5MW Data Center throughput benchmark shown shows that Nvidia’s Grace Superchips are capable of delivering up to 2.5x the performance while being hugely efficient in the same benchmarks. For data center and server customers investing in these workloads, Grace CPUs could be a big game changer, just as Nvidia’s Tensor Core GPUs dominate the HPC and AI space. And finally, let’s mention that these comparisons are made by Nvidia.