Nivida keeps selling more GPUs than the datacenter capacity that came online. estimate of 560mw for Q1 2026. Up significantly from 1270mw surplus over entire 2025.

humanspiral@lemmy.ca · 2 months ago

Nivida keeps selling more GPUs than the datacenter capacity that came online. estimate of 560mw for Q1 2026. Up significantly from 1270mw surplus over entire 2025.

humanspiral@lemmy.ca · edit-2 2 months ago

A weird point that Nvidia CFO made to say “Nvidia is awesome” is a claim that GPU rental rates are up year to date. There was a crash at end of 2025. The low for the quarter was Jan 1st. The high was March 10th at peak of openclaw frenzy (validated by openrouter charts). Current rates are lower than that peak. But also comparison to 2025 Q1 (what I thought CFO meant, rates are down significantly) For single GPUs.

1. NVIDIA A100 (Ampere — 80GB SXM)

Q1 2025 Baseline: High: $2.40 | Low: $1.60 | Close: $1.85
Q1 2026 Window: High: $1.65 | Low: $0.80 | Close: $1.15
Current Normalized Rate: ~$1.07 / hr (Stable floor; primary use shifts to entry-level fine-tuning and quantized serving).

2.[NVIDIA H100 (Hopper — 80GB SXM)

Q1 2025 Baseline: High: $7.00 | Low: $5.50 | Close: $5.80 (Supply constraints started easing, down from the absolute peak $10/hr overcharges of late 2024).
Q1 2026 Window: High: $3.45 | Low: $1.70 | Close: $2.35 (Hit an absolute low floor of $1.70 in late 2025 before a 38% contract rebound in March due to an influx of video-generation workloads).
Current Normalized Rate: ~$2.49 / hr (The standard baseline workhorse for mainstream API serving).

3. [NVIDIA H200 (Hopper — 141GB HBM3e)

Q1 2025 Baseline: High: $5.20 | Low: $4.50 | Close: $4.80 (Extremely scarce; reserved exclusively for elite labs running early frontier training).
Q1 2026 Window: High: $4.40 | Low: $3.50 | Close: $3.80 (Inventory stabilized as neoclouds widely deployed HGX baseboards).
Current Normalized Rate: ~$3.39 / hr (The most cost-effective tier for high-concurrency FP8 deployment).

4. [NVIDIA B200 (Blackwell — 192GB HBM3e)

Q1 2025 Baseline: N/A (Sampling/Testing phase; unreleased to the public marketplace).
Q1 2026 Window: High: $6.11 | Low: $3.05 | Close: $4.95 _(Initial public availability; premium pric

5. NVIDIA B300 (Blackwell Ultra — 288GB HBM3e)

Q1 2025 Baseline: N/A (In architectural development; unavailable for rental).
Q1 2026 Window: High: $8.50 | Low: $5.50 | Close: $7.25 (Early access provisioning; highly volatile due to constrained data center site capacity).
Current Normalized Rate: ~$6.10 / hr (Neocloud standard rate; pricing reflects the premium for its 288GB memory pool).

for clusters, google AI mode simply can’t provide accurate info. Some providers have fixed premiums, others 0 premium. Many never change prices but mass email promotional discounts. For all I know, this entire analysis could have been a halucination meant to drive my narrative. I have not verified most data claims made as it would be too much work. I imagine most of the specific ones are accurate, and single GPU rental rates are the dominant market in the US, and that data should be solid, but FIIK.

humanspiral@lemmy.ca · 2 months ago

More precise pricing trends from premium tier 2 networks, show demand has drastically fallen over the quarter. H200 very close to its bare runcosts. Theory is that Anthropic’s overpayment for Collosus 1 (xAI) capacity has drastically reduced utilization at cloud rental service.

NVIDIA B200 Blackwell (192GB HBM3e)

Tier-1 Spot Mechanic: AWS introduced B200 spot availability (p6 family baseline). However, because Tier-1 spot pools are subject to extreme automated reclamation, they command a rigid premium.
The Rebound: B200 spot experienced a localized surge around March 13th due to massive API volume peaks.

Week Ending (2026)	Tier-1 Spot (AWS/Azure)	Tier-2 Spot (CoreWeave/Nebius)	Tier-2 On-Demand (Nebius Menu)	The True Market State
Jan 2	$6.40 / hr	$4.50 / hr	$7.80 / hr	Initial launch window; hardware access highly constrained.
Jan 16	$6.40 / hr	$4.20 / hr	$7.50 / hr	Data center pipelines face massive backlog queues.
Jan 30	$6.40 / hr	$4.00 / hr	$7.50 / hr	Tier-2 unallocated floor space begins opening up.
Feb 13	$6.12 / hr	$3.85 / hr	$7.20 / hr	Multi-agent frameworks consume near-term supply.
Feb 27	$6.12 / hr	$3.50 / hr	$6.80 / hr	Influx of new Blackwell nodes flattens spot markup.
Mar 13	$6.12 / hr	$4.10 / hr	$6.50 / hr	The Agentic Peak: Spot surges via automated bidding.
Mar 27	$5.90 / hr	$2.95 / hr	$6.00 / hr	High supply volumes trigger a localized margin correction.
Apr 10	$5.90 / hr	$2.40 / hr	$5.50 / hr	Nebius and Lambda drop public on-demand baseline rates.
Apr 24	$5.56 / hr	$2.06 / hr	$5.50 / hr	The System Bottom: Spot drops to its absolute low.
May 8	$5.56 / hr	$2.40 / hr	$5.50 / hr	Short-term enterprise fine-tuning contracts absorb space.
May 22 (Current)	$5.56 / hr	$2.90 / hr	$5.50 / hr	Current Rebound: Spot firms ahead of June 1 price hikes.

NVIDIA H200 Hopper (141GB HBM3e)

The Clearance Reality: Unlike the B200, the H200 has failed to rebound. Because its FP8 processing layout lacks Blackwell’s native execution upgrades, developers are abandoning the H200 for production serving.
Below Cost: Tier-2 spot has flatlined at $1.45/hr, forcing providers to eat losses relative to raw facility overhead just to prevent expensive liquid-cooled spaces from sitting entirely dark.

Week Ending (2026)	Tier-1 Spot (AWS `p5e` baseline)	Tier-2 Spot (CoreWeave/Nebius)	Tier-2 On-Demand (Lambda/Nebius Menu)	The True Market State
Jan 2	$4.20 / hr	$2.50 / hr	$4.40 / hr	Highly stable; utilized as the core long-context architecture.
Jan 16	$4.20 / hr	$2.30 / hr	$4.25 / hr	AWS implements dynamic Capacity Block adjustments.
Jan 30	$4.20 / hr	$2.10 / hr	$4.00 / hr	Early enterprise teams migration toward Blackwell blocks.
Feb 13	$3.90 / hr	$1.95 / hr	$3.95 / hr	Minor spot stabilization as alternative backends fill up.
Feb 27	$3.90 / hr	$1.80 / hr	$3.80 / hr	Shift to newer precision matrices devalues older stock.
Mar 13	$3.90 / hr	$1.95 / hr	$3.80 / hr	Minor agent-driven peak provides short-term support.
Mar 27	$3.85 / hr	$1.70 / hr	$3.65 / hr	Massive bulk capacity deployments flood European hubs.
Apr 10	$3.85 / hr	$1.55 / hr	$3.50 / hr	Market signals show severe oversupply on legacy nodes.
Apr 24	$3.83 / hr	$1.45 / hr	$3.50 / hr	The Floor: Prices slide below break-even run costs.
May 8	$3.83 / hr	$1.45 / hr	$3.50 / hr	Capacity remains completely unallocated across major nodes.
May 22 (Current)	$3.83 / hr	$1.45 / hr	$3.50 / hr	Current Stagnation: Zero rebound; structural value tier.

Customer Segment	NVIDIA GW Sold (Refined Power Footprint)	Actual New GW Deployed (Capacity Online)	Net Capacity Gap (Deficit)
Hyperscalers	1.05 GW	0.93 GW	+0.12 GW (120 MW Deficit)
AI Clouds & Sovereigns	0.68 GW	0.42 GW	+0.26 GW (260 MW Deficit)
Enterprise & Industrial	0.38 GW	0.20 GW (Est. legacy footprint)	+0.18 GW (180 MW Deficit)
Total Global Market	2.11 GW	1.55 GW	+0.56 GW (560 MW Deficit)