Take this with a massive grain of salt, I think, since the info is unverified and comes straight from the manufacturer…
Huawei’s official presentation claims that its CloudMatrix 384 supercomputer delivers 300 PFLOPS of computing power, 269 TB/s of network bandwidth, and 1,229 TB/s of total memory bandwidth. It also claims 55 percent model FLOPs utilization (MFU) during training workloads and 2.8 Tbps of inter-card bandwidth, heavily emphasizing the system's strength in networking.

| Spec | NVL72 (Nvidia) | CloudMatrix 384 (Huawei) | Huawei advantage (%) |
|------------------------|----------------|--------------------------|----------------------|
| Total compute | 180 PFLOPS | 300 PFLOPS | 67% |
| Total network bandwidth | 130 TB/s | 269 TB/s | 107% |
| Total memory bandwidth | 576 TB/s | 1,229 TB/s | 113% |

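
For anyone who wants to check the percentage column, here is a quick sketch of the arithmetic in Python. The numbers are copied from the table (so they are still vendor claims, not measurements), and the `specs` dictionary and `percent_better` helper are names made up for this example.

```python
# Vendor-claimed figures copied from the table: (NVL72, CloudMatrix 384).
# These are unverified; the script only checks the percentage arithmetic.
specs = {
    "Total compute (PFLOPS)": (180, 300),
    "Total network bandwidth (TB/s)": (130, 269),
    "Total memory bandwidth (TB/s)": (576, 1229),
}


def percent_better(baseline: float, candidate: float) -> float:
    """Relative gain of candidate over baseline, in percent."""
    return (candidate / baseline - 1) * 100


# Round to whole percentages, as the table does.
advantage = {
    name: round(percent_better(nvl72, cm384))
    for name, (nvl72, cm384) in specs.items()
}

for name, pct in advantage.items():
    print(f"{name}: {pct}%")
```

The percentages are relative gains over the NVL72 baseline, so 67% means 1.67x the compute, not 67% of it.
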
And it costs a yearly subscription, and random features get removed every month.