bitdepth.co
HomePillarsTopicsLong-readsAbout
Search⌘KSubscribe
LIVESat · Jun 13 · 2026, 10:16 UTC
HBM3E demand +47% q/qNVDA inventory lead time −9dSK Hynix HBM4 risk prod 2026Q3TSMC CoWoS capex +$3.8B
↳ The AI memory bottleneck/Software
Published Apr 15 · 8 min read

The FP8 → INT4 quantization roadmap.

Inference vendors are racing from FP8 to INT4 as the next lever on compute efficiency. We map the roadmap across frameworks, the accuracy tradeoffs that actually matter in production, and which memory vendors benefit most.

PublishedApr 15, 2026
Length2,000 words · 8 min
Part of pillar
The AI memory bottleneck

Continuously-updated topic hub. Edition 14, updated 2h ago. 9 sub-articles.

Open the full pillar →
Reading
8 min · 2,000 wd
Tagged
SoftwareAI
Related — within this pillar
Supply chain
Samsung's HBM3E qualification, finally. The full timeline and what it means for NVIDIA's 2026 allocation strategy.
May 12
Supply chain
Why every AI training run is now a packaging negotiation.
May 09
Solutions
Cerebras WSE-4 is generally available. We ran the benchmarks. The numbers are real.
May 08
Supply chain
Why Samsung's HBM gap is closing.
May 05
↳ Part of pillar

This is one entry in The AI memory bottleneck.

Thirty more sub-articles, six tracked solution paths, a weekly-updated timeline, and a live aggregated feed — all on the pillar page.

Open the pillar →
More from this pillar

More in The AI memory bottleneck

↳ Supply chain · 9 min

Samsung's HBM3E qualification, finally. The full timeline and what it means for NVIDIA's 2026 allocation strategy.

May 12 · 2,400 words
↳ Supply chain · 14 min

Why every AI training run is now a packaging negotiation.

May 09 · 4,100 words
↳ Solutions · 11 min

Cerebras WSE-4 is generally available. We ran the benchmarks. The numbers are real.

May 08 · 3,200 words
bitdepth.co

Independent technology journalism, organized by what matters — not by what's new. Pillars are my long-running, continuously-updated topic hubs. A one-person publication.

RSSXMastodonAPI
Pillars
  • The AI memory bottleneck
  • The advanced packaging race
  • Export controls & the new silicon geopolitics
  • The power wall
  • See all 6 →
Read
  • Homepage
  • Topics
  • Long-reads
  • Search
  • Archive
About
  • My method
  • Editor
  • Newsletter
  • Sources policy
  • Contact
© 2026 bitdepth.co · Independent · Reader-supportedMade for builders · v3.0 "Pillar"

We use cookies

We use cookies to ensure you get the best experience on our website. For more information on how we use cookies, please see our cookie policy.