
SemiAnalysis said planned per-rack memory capacity and rack cost for the AI server cluster have been cut, while institutions later said the change affects only CPU-side pluggable memory, not demand for GPU-linked HBM.
Nvidia’s next-generation Rubin NVL72 AI server cluster remained in focus after SemiAnalysis said planned per-rack memory capacity would be reduced to 28TB from 55TB, with most systems using 96GB SOCAMM modules instead of the planned 192GB. The report also said the change would lower rack cost to $6.8 million from $7.6 million. The report triggered a global pullback in storage-related stocks. Institutions later said the reduction applies only to CPU-side pluggable memory, while demand for high-bandwidth memory (HBM) tied to GPU computing remains intact, easing concerns about a broader hit to AI-memory demand.