MLPerf Benchmarks GPT Training Times

The MLPerf benchmark added tests to assess GPT-training performance. AI processors from Nvidia and Intel (Habana) both exhibit good scaling.

Joseph Byrne

GPT3 has entered the MLPerf Training arena, and only two chip companies have ventured to grapple with this benchmark. Nvidia, working with data-center operator CoreWeave, posted the fastest training time. Intel’s Habana Labs doesn’t match Nvidia’s performance but stands out as the only business to challenge the AI juggernaut.

MLCommons, the consortium of AI companies behind the MLPerf benchmarks, has released results for MLPerf Training v3.0, the biggest update since the benchmark’s inception. In addition to evaluating large-language-model performance by testing performance on the huge GPT3 model (the basis for the popular ChatGPT), it also updates the recommender model to the more-complex DLRM-DCNv2 to better represent models in use. Other tests carry over from the prior version, v2.1.

As with past releases, the newest release has submissions for only a few AI processors. Despite many companies promising to take on Nvidia, only Intel reported even a partial v3.0 results set. In addition to submissions for Habana’s Gaudi2, Intel also provided updated results for the Xeon 8480+ (Sapphire Rapids) operating without an add-on accelerator.

As before, Nvidia’s newest big AI chip, the H100 (Hopper), delivered the most performance per chip. As its software has improved, Gaudi2’s performance per chip has inched up. It now tops Nvidia’s A100 results on the three tasks for which scores are available for both chips and even approaches the H100 on ResNet 50.

Free Newsletter

Get the latest analysis of new developments in semiconductor market and research analysis.

Subscribers can view the full article in the TechInsights Platform.

Subscriber Login

You must be a subscriber to access the Manufacturing Analysis reports & services.

If you are not a subscriber, you should be! Enter your email below to contact us about access.

Manufacturing Analysis

Subscriber Login

Analysis Insights

June 23, 2025

Huawei Matebook Fold Uses Kirin X90 Built on SMIC’s 7nm (N+2) Node

TechInsights confirms Huawei's Matebook Fold | Ultimate Design features the Kirin X90 SoC built on SMIC’s 7nm (N+2) process—debunking rumors of a breakthrough 5nm node.

Learn More

June 20, 2025

Chip Observer June 2025

Stay informed on the latest shifts in semiconductor policy, AI, packaging, and market dynamics in the June 2025 Chip Observer, featuring insights on Qualcomm, OpenAI, Huawei, and more.

Learn More

June 18, 2025

US Tariffs and Taiwan Curbs Reshape Semiconductor Supply

Trump’s tariff threat and Taiwan’s export curbs on Huawei and SMIC add new risks and delays to global semiconductor supply chains and chip manufacturing plans.

Learn More