Habana Gaudi2 Triples Performance

The latest chip from the Intel subsidiary offers a big performance jump from the initial Gaudi, putting it into the same class as Nvidia’s new Hopper GPU.

Linley Gwennap

With its recent Hopper announcement, Nvidia dramatically raised the bar to compete in the data-center AI market, more than doubling its lead relative to other vendors. Intel’s Habana subsidiary is the first to reach this new height, delivering its second-generation AI accelerator with performance and power efficiency similar to Hopper’s. Habana expects Gaudi2 systems to arrive by the end of this year, only a few months after Hopper. It also previewed its next-generation low-power accelerator, Greco.

Taking advantage of a shrink to 7nm manufacturing, Gaudi2 delivers huge improvements over the first-generation design. It triples the core count and features 48MB of on-chip SRAM, twice as much as Gaudi. The new accelerator triples the amount of High Bandwidth Memory (HBM) to 96GB and provides 2.5x the memory bandwidth. Gaudi2 also adds features such as video decoding and support for emerging FP8 data types. The greater compute and memory capacity triples Gaudi’s ResNet performance but also raises the TDP to 600W, which is still less than Hopper’s.
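The ratios above also imply the first-generation figures, which the text does not state directly. The following is a minimal sketch of that arithmetic; the function and variable names are illustrative, and the only inputs are the Gaudi2 values and multipliers quoted in the paragraph.

```python
# Gaudi2 figures and generation-over-generation ratios quoted in the article.
GAUDI2_SRAM_MB = 48   # "twice as much as Gaudi"
GAUDI2_HBM_GB = 96    # "triples the amount of HBM"
SRAM_RATIO = 2
HBM_RATIO = 3


def implied_first_gen(gaudi2_value, ratio):
    """Back out the first-generation figure from the Gaudi2 value and its ratio."""
    return gaudi2_value / ratio


gaudi_sram_mb = implied_first_gen(GAUDI2_SRAM_MB, SRAM_RATIO)
gaudi_hbm_gb = implied_first_gen(GAUDI2_HBM_GB, HBM_RATIO)

print(f"Implied Gaudi SRAM: {gaudi_sram_mb:.0f}MB")
print(f"Implied Gaudi HBM: {gaudi_hbm_gb:.0f}GB")
```

The bandwidth and core-count ratios are left out because the paragraph gives only the multipliers, not an absolute Gaudi2 value to divide.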

Habana announced the original Gaudi in 2019, just before its acquisition by Intel, but the 16nm training chip didn’t reach the market until last year; Amazon now offers it in an AWS instance. Gaudi outperforms Nvidia’s 12nm V100 but falls well behind the 7nm A100. Yet the A100 uses more power than Gaudi and, on the basis of AWS pricing, costs more than twice as much, giving Habana the lead in performance per dollar.
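The performance-per-dollar claim reduces to a simple comparison: Gaudi keeps the lead as long as the A100's speed advantage is smaller than its price premium. The sketch below illustrates that reasoning in normalized units (Gaudi throughput and price both set to 1.0). The speedup values passed in are hypothetical placeholders, not measured results, and the 2.0 price ratio is only the floor implied by "more than twice as much".

```python
def perf_per_dollar(throughput, price):
    """Throughput delivered per unit of cost (any consistent units)."""
    return throughput / price


def gaudi_leads(a100_speedup, a100_price_ratio):
    """Return True if Gaudi has better performance per dollar than the A100.

    Both arguments are relative to Gaudi, which is normalized to
    throughput 1.0 and price 1.0.
    """
    gaudi = perf_per_dollar(1.0, 1.0)
    a100 = perf_per_dollar(a100_speedup, a100_price_ratio)
    return gaudi > a100


# Hypothetical speedups, for illustration only.
for speedup in (1.5, 2.5):
    print(f"A100 {speedup}x faster at 2.0x the price: "
          f"Gaudi leads = {gaudi_leads(speedup, 2.0)}")
```

Under these assumptions, Gaudi leads whenever the A100 is less than twice as fast, which is consistent with a chip that "falls well behind" yet still wins on cost.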

Gaudi2 is now sampling. According to Habana’s initial tests, the new chip doubles the A100’s training throughput on ResNet-50. This increase should put Gaudi2 within range of the Hopper H100. Habana will sell Gaudi2 on an OAM module; it also offers a server baseboard that can hold eight accelerators.

Habana Gaudi2 block diagram

Habana Gaudi2 baseboard
