Tekmono
Google Unveils Ironwood TPU for AI Workloads

by Tekmono Editorial Team
08/09/2025
in News

Google presented its seventh-generation Tensor Processing Unit (TPU), dubbed Ironwood, at Hot Chips 2025, building on the chip’s initial announcement at Google Cloud Next ’25 in April.

The Ironwood TPU is designed specifically for large-scale inference workloads, a shift from previous generations that focused on training. Each chip combines two compute dies and delivers 4,614 TFLOPs of FP8 performance. Eight stacks of HBM3e provide 192 GB of memory per chip at 7.3 TB/s of bandwidth. The architecture scales to 9,216 chips per pod without glue logic, using 1.2 TB/s of I/O bandwidth, for a total of 42.5 exaflops of peak performance per pod (9,216 × 4,614 TFLOPs).
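The pod-level figure can be checked with simple arithmetic. The sketch below uses only the per-chip numbers quoted above; the function names are illustrative, and the FLOPs-per-byte ratio is a derived back-of-the-envelope value, not a number Google has published.

```python
# Per-chip figures quoted in the article.
FP8_TFLOPS_PER_CHIP = 4_614
HBM_BANDWIDTH_TBPS = 7.3
CHIPS_PER_POD = 9_216


def pod_peak_exaflops(chips=CHIPS_PER_POD, tflops_per_chip=FP8_TFLOPS_PER_CHIP):
    """Peak FP8 throughput of a full pod in exaflops (1 exaflop = 1e6 TFLOPs)."""
    return chips * tflops_per_chip / 1_000_000


def flops_per_hbm_byte(tflops=FP8_TFLOPS_PER_CHIP, tbps=HBM_BANDWIDTH_TBPS):
    """Peak FP8 operations available per byte read from HBM on one chip."""
    return tflops / tbps


if __name__ == "__main__":
    print(f"{pod_peak_exaflops():.1f} exaflops per pod")
    print(f"{flops_per_hbm_byte():.0f} FLOPs per HBM byte")
```

Both values are theoretical peaks; sustained throughput on real inference workloads would depend on factors the article does not cover.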

Ironwood’s memory capacity is a notable highlight. A single pod provides 1.77 PB of directly addressable HBM (9,216 chips × 192 GB), which Google claims is a new world record for shared-memory supercomputers. This is made possible by optical circuit switches that link the racks together, so that chips in different racks can communicate with one another.
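The pod memory total follows from the per-chip HBM capacity quoted earlier in the article. A minimal sketch, assuming decimal units (1 PB = 1,000,000 GB); the function names are illustrative only.

```python
# Per-chip figures quoted in the article.
HBM_GB_PER_CHIP = 192
HBM_STACKS_PER_CHIP = 8
CHIPS_PER_POD = 9_216


def pod_hbm_petabytes(chips=CHIPS_PER_POD, gb_per_chip=HBM_GB_PER_CHIP):
    """Total HBM across a pod in petabytes, using decimal units."""
    return chips * gb_per_chip / 1_000_000


def hbm_gb_per_stack(gb_per_chip=HBM_GB_PER_CHIP, stacks=HBM_STACKS_PER_CHIP):
    """Capacity of one HBM3e stack implied by the per-chip total."""
    return gb_per_chip / stacks


if __name__ == "__main__":
    print(f"{pod_hbm_petabytes():.2f} PB of HBM per pod")
    print(f"{hbm_gb_per_stack():.0f} GB per HBM3e stack")
```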

The Ironwood TPU also prioritizes reliability and resilience. The system can automatically reconfigure around failed nodes and restore workloads from checkpoints. Additional features include an on-chip root of trust, built-in self-test functions, silent data corruption mitigation, and logic repair functions that improve manufacturing yield. According to Google, RAS (reliability, availability, and serviceability) is emphasized throughout the architecture.
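The general idea of restoring a workload from a checkpoint after a node failure can be illustrated with a toy simulation. This is a generic sketch of the technique, not Google’s implementation: `NodeFailure`, `run_with_checkpoints`, and the `fail_at` parameter are all hypothetical names invented for this example, and the "work" is a simple running sum.

```python
import copy


class NodeFailure(Exception):
    """Raised by the simulated job when a node drops out (hypothetical)."""


def run_with_checkpoints(total_steps, checkpoint_every, fail_at=frozenset()):
    """Run a toy job, rolling back to the last checkpoint on each failure.

    Returns (final_state, steps_executed_including_replays).
    """
    state = {"step": 0, "value": 0}
    checkpoint = copy.deepcopy(state)
    pending_failures = set(fail_at)
    executed = 0

    while state["step"] < total_steps:
        try:
            next_step = state["step"] + 1
            if next_step in pending_failures:
                pending_failures.discard(next_step)  # each failure fires once
                raise NodeFailure(f"node lost during step {next_step}")
            state["value"] += next_step  # stand-in for real work
            state["step"] = next_step
            executed += 1
            if state["step"] % checkpoint_every == 0:
                checkpoint = copy.deepcopy(state)  # save progress
        except NodeFailure:
            # A real system would also remap the job onto healthy nodes here.
            state = copy.deepcopy(checkpoint)

    return state, executed


if __name__ == "__main__":
    final_state, executed = run_with_checkpoints(10, 3, fail_at={5})
    print(final_state, executed)
```

In this toy run the failure at step 5 rolls the job back to the checkpoint taken at step 3, so step 4 is replayed once. More frequent checkpoints reduce replayed work at the cost of more time spent saving state.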

Ironwood is cooled by a cold-plate solution integrated with Google’s third-generation liquid-cooling infrastructure. Google claims that Ironwood achieves a twofold improvement in performance per watt compared to its predecessor, Trillium. Dynamic voltage and frequency scaling further improves efficiency as workloads vary.
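Why dynamic voltage and frequency scaling saves energy can be seen from the textbook approximation for CMOS dynamic power, P ≈ C·V²·f. The sketch below applies that general formula with made-up scaling factors; it ignores static (leakage) power, and none of the numbers are Ironwood measurements.

```python
def relative_dynamic_power(voltage_scale, frequency_scale):
    """Dynamic power relative to nominal, from P ~ C * V^2 * f (C held fixed)."""
    return voltage_scale ** 2 * frequency_scale


def relative_perf_per_watt(voltage_scale, frequency_scale):
    """Throughput per watt relative to nominal, assuming throughput tracks f."""
    return frequency_scale / relative_dynamic_power(voltage_scale, frequency_scale)


if __name__ == "__main__":
    # Illustrative operating point: 90% of nominal voltage, 80% of nominal clock.
    v, f = 0.9, 0.8
    print(f"relative power: {relative_dynamic_power(v, f):.3f}")
    print(f"relative perf/W: {relative_perf_per_watt(v, f):.3f}")
```

Under this model, lowering the clock alone leaves performance per watt unchanged; the gain comes from the lower voltage that a reduced clock permits.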

Google also used AI techniques in designing Ironwood, applying them to optimize ALU circuits and floor plans. A fourth-generation SparseCore has been added to accelerate embeddings and collective operations, supporting workloads such as recommendation engines.

Ironwood deployment is currently underway at hyperscale within Google Cloud data centers. However, the TPU remains an internal platform and is not directly available to Google Cloud customers. The development and deployment of Ironwood reflect Google’s long-term investment in AI compute infrastructure.

Ryan Smith of ServeTheHome commented on Google’s presentation at Hot Chips 2025, stating, “This was an awesome presentation. Google saw the need to create high‑end AI compute many generations ago. Now the company is innovating at every level from the chips, to the interconnects, and to the physical infrastructure. Even as the last Hot Chips 2025 presentation this had the audience glued to the stage at what Google was showing.”

Tekmono is a Linkmedya brand. © 2015.