Post
Google has unveiled Ironwood, its seventh-generation Tensor Processing Unit (TPU), at Cloud Next 2025 — a major leap in AI inference technology. With a peak performance of 4,614 teraflops, up to 192GB of HBM per chip, and bandwidth reaching 7.2 Tbps, Ironwood delivers double the performance per watt compared to its predecessor, Trillium. Offered in 256 and 9,216 chip configurations, it’s designed to scale based on cloud AI workload needs. Scheduled for release in late 2025, Ironwood strengthens Google’s push for AI infrastructure dominance while reducing dependency on third-party hardware.