While the excess sales can partially be explained by converting CPU and bitcoin servers, and upgrading functional or burnt out older GPUs, there is finite replaceable powered capacity, in addition to small growth rate of datacenters under active construction that can hope for 2026 opening. “Grey market” diversion to China can be a hidden source of sales.

This is a refined estimate based on taking out networking/software from each of NVidia’s sales channels.

Hyperscalers rarely buy commercial software licenses from NVIDIA (they build their own stacks), while Enterprise buyers are heavily dependent on software subscriptions like NVIDIA AI Enterprise ($4,500/GPU/year). Similarly, networking intensity follows a drastic gradient: a massive LLM training cluster requires a massive networking tax, whereas an Enterprise inference node does not.

To resolve this, we must break down NVIDIA’s $75.2 billion total data center revenue by applying asymmetric networking and software multipliers to each specific customer segment.


Phase 1: Re-Allocating Networking and Software by Segment

NVIDIA’s software layer consists of subscription revenue (which scales with the historical installed base, not just new capacity) and architecture licensing. Its networking segment consists of InfiniBand and Spectrum-X Ethernet switches, adapters, and cables.

Let’s dissect how these costs actually apply to each of the three purchasing categories:

1. Hyperscalers ($38.0B Total Segment)

  • Software Allocation (0.5%): Negligible. Hyperscalers rely on their own internal orchestrators and proprietary AI software layers. They only pay minimal foundational firmware fees.
  • Networking Allocation (22%): Exceptionally high. Building multi-thousand GPU clusters for LLM training requires massive networking fabrics. Even with the integrated copper backplane of the GB200 NVL72, hyperscalers must purchase massive external Quantum-X800 InfiniBand or Spectrum-X800 switches to link multiple racks together into a single cluster.
  • Net Compute Revenue: $29.45 Billion

2. AI Clouds & Sovereigns (~$21.2B of ACIE)

  • Software Allocation (3%): Moderate. Specialized AI clouds lease a small portion of NVIDIA’s software stack to provide turnkey developer environments, but their core business is raw infrastructure provision. Sovereign clouds often pay a premium for localized security software layers.
  • Networking Allocation (15%): High. They host large-scale foundational model clusters, requiring strong interconnect fabrics, though slightly less dense than the multi-tier topologies deployed by core hyperscalers.
  • Net Compute Revenue: $17.38 Billion

3. Enterprise & Industrial (~$16.0B of ACIE)

  • Software Allocation (20%): Very high. This is where NVIDIA’s recurring subscription revenue lives. Enterprise clients cannot build their own software stacks; they pay heavily for NVIDIA AI Enterprise, NIM microservices, and Omniverse licenses. This revenue applies to both new shipments and their legacy installed base.
  • Networking Allocation (5%): Very low. Most enterprise applications are small-scale clusters or isolated 8-GPU nodes executing localized inference or fine-tuning, requiring zero massive cluster switching.
  • Net Compute Revenue: $12.00 Billion

Phase 2: Refined Segment-by-Segment Power Calculations

With the refined, asymmetric compute revenue isolated, we can run the physical power conversion using tailored Average Selling Prices (ASPs), system power demands, and facility Power Usage Effectiveness (PUE) metrics.

Category A: Hyperscalers ($29.45B Net Compute)

  • Product Mix: 50% Blackwell NVL72 / 50% Hopper H200.

  • Blended Compute ASP: ~$42,000 (reflecting a mix of raw chip pricing and heavy rack-integration premiums).

  • Total GPUs Shipped:

    GPUs=$29,450,000,000$42,000≈701,000 unitsGPUs equals the fraction with numerator $ 29 comma 450 comma 000 comma 000 and denominator $ 42 comma 000 end-fraction is approximately equal to 701 comma 000 units

    GPUs=$29,450,000,000$42,000≈701,000 units

  • Blended Power per GPU: 1,300W (Nominal system draw including Grace CPUs and cooling pumps).

  • Hyperscaler Grid Footprint (1.15 PUE for ultra-efficient facilities):

    Grid Power=(701,000×1,300 W)×1.15≈1.05 GWGrid Power equals open paren 701 comma 000 cross 1 comma 300 W close paren cross 1.15 is approximately equal to 1.05 GW

    Grid Power=(701,000×1,300 W)×1.15≈𝟏.𝟎𝟓 GW

Category B: AI Clouds & Sovereigns ($17.38B Net Compute)

  • Product Mix: 80% Hopper (H100/H200) / 20% standalone Blackwell (B200).

  • Blended Compute ASP: ~$35,000 (standard market rate for high-end accelerator nodes without bulk hyperscaler discounts).

  • Total GPUs Shipped:

    GPUs=$17,380,000,000$35,000≈497,000 unitsGPUs equals the fraction with numerator $ 17 comma 380 comma 000 comma 000 and denominator $ 35 comma 000 end-fraction is approximately equal to 497 comma 000 units

    GPUs=$17,380,000,000$35,000≈497,000 units

  • Blended Power per GPU: 1,100W (Weighted heavily toward standard Hopper HGX server topologies).

  • AI Cloud Grid Footprint (1.25 PUE for mixed commercial multi-tenant sites):

    Grid Power=(497,000×1,100 W)×1.25≈0.68 GWGrid Power equals open paren 497 comma 000 cross 1 comma 100 W close paren cross 1.25 is approximately equal to 0.68 GW

    Grid Power=(497,000×1,100 W)×1.25≈𝟎.𝟔𝟖 GW

Category C: Enterprise & Industrial ($12.00B Net Compute)

  • Product Mix: 70% low-power inference cards (L40S, H100 NVL) / 30% mainstream H100s.

  • Blended Compute ASP: ~$18,000 (strongly depressed by high-volume, lower-cost PCIe form factors).

  • Total GPUs Shipped:

    GPUs=$12,000,000,000$18,000≈667,000 unitsGPUs equals the fraction with numerator $ 12 comma 000 comma 000 comma 000 and denominator $ 18 comma 000 end-fraction is approximately equal to 667 comma 000 units

    GPUs=$12,000,000,000$18,000≈667,000 units

  • Blended Power per GPU: 450W (Reflecting the dramatically lower power draw of enterprise edge and inference cards).

  • Enterprise Grid Footprint (1.25 PUE for on-premises or traditional enterprise cages):

    Grid Power=(667,000×450 W)×1.25≈0.38 GWGrid Power equals open paren 667 comma 000 cross 450 W close paren cross 1.25 is approximately equal to 0.38 GW

    Grid Power=(667,000×450 W)×1.25≈𝟎.𝟑𝟖 GW


Phase 3: Final Comparison: GW Sold vs. GW Deployed

Now, let’s look at how this highly refined model maps against the 1.55 GW of net-new trackable data center capacity that physically came online across the globe during the quarter:

Customer Segment NVIDIA GW Sold (Refined Power Footprint) Actual New GW Deployed (Capacity Online) Net Capacity Gap (Deficit)
Hyperscalers 1.05 GW 0.93 GW +0.12 GW (120 MW Deficit)
AI Clouds & Sovereigns 0.68 GW 0.42 GW +0.26 GW (260 MW Deficit)
Enterprise & Industrial 0.38 GW 0.20 GW (Est. legacy footprint) +0.18 GW (180 MW Deficit)
Total Global Market 2.11 GW 1.55 GW +0.56 GW (560 MW Deficit)

Key Takeaways from the Refined Model

  1. The Grid Deficit Narrowed: By properly allocating NVIDIA’s high software subscription margins out of the Enterprise sector and stripping heavy networking switch infrastructure out of the Hyperscale sector, the true global power footprint shipped by NVIDIA drops to 2.11 GW. The total global grid deficit sits at 560 Megawatts.
  2. Where the Logjam Actually Sits: Notice that the Hyperscaler gap is remarkably tight—only 120 MW. This proves that hyperscalers are incredibly efficient at matching their massive utility contracts directly to their hardware delivery schedules.
  3. The Hidden Crisis is in Tier-2 AI Clouds & Sovereigns: This segment represents a massive 260 MW deficit. Because these buyers lack the immense, multi-gigawatt land and power pipelines of the tech giants, they are receiving high-performance, high-power silicon far faster than their regional, third-party colocation data centers can actually deploy physical electricity to the racks.

This model confirms that the “homeless GPU” crisis is primarily concentrated outside of the core hyperscalers, driving smaller AI clouds to aggressively bid up any available third-party power capacity in the market today.

  • humanspiral@lemmy.caOP
    link
    fedilink
    arrow-up
    1
    ·
    edit-2
    1 day ago

    A weird point that Nvidia CFO made to say “Nvidia is awesome” is a claim that GPU rental rates are up year to date. There was a crash at end of 2025. The low for the quarter was Jan 1st. The high was March 10th at peak of openclaw frenzy (validated by openrouter charts). Current rates are lower than that peak. But also comparison to 2025 Q1 (what I thought CFO meant, rates are down significantly) For single GPUs.

    1. NVIDIA A100 (Ampere — 80GB SXM)

    • Q1 2025 Baseline: High: $2.40 | Low: $1.60 | Close: $1.85
    • Q1 2026 Window: High: $1.65 | Low: $0.80 | Close: $1.15
    • Current Normalized Rate: ~$1.07 / hr (Stable floor; primary use shifts to entry-level fine-tuning and quantized serving). 

    2.[NVIDIA H100 (Hopper — 80GB SXM)

    • Q1 2025 Baseline: High: $7.00 | Low: $5.50 | Close: $5.80 (Supply constraints started easing, down from the absolute peak $10/hr overcharges of late 2024).
    • Q1 2026 Window: High: $3.45 | Low: $1.70 | Close: $2.35 (Hit an absolute low floor of $1.70 in late 2025 before a 38% contract rebound in March due to an influx of video-generation workloads).
    • Current Normalized Rate: ~$2.49 / hr (The standard baseline workhorse for mainstream API serving). 

    3. [NVIDIA H200 (Hopper — 141GB HBM3e)

    • Q1 2025 Baseline: High: $5.20 | Low: $4.50 | Close: $4.80 (Extremely scarce; reserved exclusively for elite labs running early frontier training).
    • Q1 2026 Window: High: $4.40 | Low: $3.50 | Close: $3.80 (Inventory stabilized as neoclouds widely deployed HGX baseboards).
    • Current Normalized Rate: ~$3.39 / hr (The most cost-effective tier for high-concurrency FP8 deployment). 

    4. [NVIDIA B200 (Blackwell — 192GB HBM3e)

    • Q1 2025 Baseline: N/A (Sampling/Testing phase; unreleased to the public marketplace).
    • Q1 2026 Window: High: $6.11 | Low: $3.05 | Close: $4.95 _(Initial public availability; premium pric

    5. NVIDIA B300 (Blackwell Ultra — 288GB HBM3e)

    • Q1 2025 Baseline: N/A (In architectural development; unavailable for rental).
    • Q1 2026 Window: High: $8.50 | Low: $5.50 | Close: $7.25 (Early access provisioning; highly volatile due to constrained data center site capacity).
    • Current Normalized Rate: ~$6.10 / hr (Neocloud standard rate; pricing reflects the premium for its 288GB memory pool). 

    for clusters, google AI mode simply can’t provide accurate info. Some providers have fixed premiums, others 0 premium. Many never change prices but mass email promotional discounts. For all I know, this entire analysis could have been a halucination meant to drive my narrative. I have not verified most data claims made as it would be too much work. I imagine most of the specific ones are accurate, and single GPU rental rates are the dominant market in the US, and that data should be solid, but FIIK.