NeuPro-M Enhances BF16, FP8 Support
Ceva’s updated NeuPro-M artificial-intelligence accelerator better handles floating-point and INT4 data, enabling developers to mix data types across neural-network layers to improve performance while minimizing the loss of accuracy.
Ceva has updated its licensable NeuPro-M design to better handle Transformers, the neural networks underpinning ChatGPT and Dall-E AI software and now finding application at the edge in computer vision. Architectural changes to NeuPro-M improve its power efficiency and can increase performance sevenfold on models that exploit the new features. Chief among these are the addition of BF16 and FP8 support to more NeuPro-M function units and improved handling of sparse data.
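To illustrate why these formats matter, the sketch below emulates BF16 and FP8 (E4M3) rounding in plain Python. It is not Ceva code; it is a minimal illustration of the precision each format retains, ignoring exponent saturation and subnormals for brevity (real hardware also typically uses round-to-nearest-even rather than the truncation shown for BF16).

```python
import math
import struct

def to_bf16(x: float) -> float:
    """Emulate BF16 by truncating a float32 to its top 16 bits
    (1 sign + 8 exponent + 7 mantissa bits)."""
    bits = struct.unpack("<I", struct.pack("<f", x))[0]
    return struct.unpack("<f", struct.pack("<I", bits & 0xFFFF0000))[0]

def to_fp8_e4m3(x: float) -> float:
    """Emulate FP8 E4M3 mantissa rounding (3 explicit mantissa bits);
    exponent clamping and subnormals are omitted for brevity."""
    if x == 0.0:
        return 0.0
    m, e = math.frexp(x)        # x = m * 2**e with 0.5 <= |m| < 1
    m = round(m * 16) / 16      # keep 4 significant bits (1 implicit + 3)
    return math.ldexp(m, e)

# Pi survives BF16 with ~2 decimal digits, FP8 with barely one:
print(to_bf16(math.pi))      # 3.140625
print(to_fp8_e4m3(math.pi))  # 3.25
```

A developer might keep accuracy-sensitive first and last layers in BF16 while quantizing the bulk of the network to FP8 or INT4, which is the kind of per-layer mixing the article describes.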
Targeting AI inferencing tasks in embedded sensing applications like driver assistance, robotics, and surveillance cameras, the NeuPro-M architecture comprises a single “common” subsystem and one or more engines. The former interfaces to a host CPU and real-time peripherals such as an image signal processor (ISP), and it performs control, safety, and security functions as well as compressing/decompressing data and weights. The latter handles most of the processing, and Ceva scales the number of engines to address different performance levels.
The intellectual-property (IP) vendor offers NeuPro-M versions with one, two, four, or eight engines for up to 256 TOPS of raw performance. The two- and eight-engine models are new. The single-engine NPM11 is fully verified and has been delivered to its first customers. Ceva plans for the final RTL for the other configurations (NPM12, NPM14, and NPM18) to be available by the end of the year.
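The raw-performance scaling across configurations follows directly from the engine count. The snippet below infers a per-engine figure from the article's "eight engines for up to 256 TOPS"; the 32 TOPS/engine value is an inference for illustration, not a datasheet number.

```python
# Infer per-engine throughput from the eight-engine figure cited above
# (256 TOPS / 8 engines); treat this as an illustration, not a spec.
TOPS_PER_ENGINE = 256 // 8  # 32

# Engine counts for the named NeuPro-M configurations.
configs = {"NPM11": 1, "NPM12": 2, "NPM14": 4, "NPM18": 8}
raw_tops = {name: n * TOPS_PER_ENGINE for name, n in configs.items()}

print(raw_tops)  # {'NPM11': 32, 'NPM12': 64, 'NPM14': 128, 'NPM18': 256}
```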
Subscribers can view the full article in the TechInsights Platform.