Friday, May 1, 2026
  • About Web3Wire
  • Web3Wire NFTs
  • .w3w TLD
  • $W3W Token
  • Web3Wire DAO
  • Media Network
  • RSS Feed
  • Contact Us
Web3Wire
No Result
View All Result
  • Home
  • Web3
    • Latest
    • AI
    • Business
    • Blockchain
    • Cryptocurrencies
    • Decentralized Finance
    • Metaverse
    • Non-Fungible Token
    • Press Release
  • Technology
    • Consumer Tech
    • Digital Fashion
    • Editor’s Choice
    • Guides
    • Stories
  • Coins
    • Top 10 Coins
    • Top 50 Coins
    • Top 100 Coins
    • All Coins
  • Exchanges
    • Top 10 Crypto Exchanges
    • Top 50 Crypto Exchanges
    • Top 100 Crypto Exchanges
    • All Crypto Exchanges
  • Stocks
    • Blockchain Stocks
    • NFT Stocks
    • Metaverse Stocks
    • Artificial Intelligence Stocks
  • Events
  • News
    • Latest Crypto News
    • Latest DeFi News
    • Latest Web3 News
  • Home
  • Web3
    • Latest
    • AI
    • Business
    • Blockchain
    • Cryptocurrencies
    • Decentralized Finance
    • Metaverse
    • Non-Fungible Token
    • Press Release
  • Technology
    • Consumer Tech
    • Digital Fashion
    • Editor’s Choice
    • Guides
    • Stories
  • Coins
    • Top 10 Coins
    • Top 50 Coins
    • Top 100 Coins
    • All Coins
  • Exchanges
    • Top 10 Crypto Exchanges
    • Top 50 Crypto Exchanges
    • Top 100 Crypto Exchanges
    • All Crypto Exchanges
  • Stocks
    • Blockchain Stocks
    • NFT Stocks
    • Metaverse Stocks
    • Artificial Intelligence Stocks
  • Events
  • News
    • Latest Crypto News
    • Latest DeFi News
    • Latest Web3 News
No Result
View All Result
Web3Wire
No Result
View All Result
Home Artificial Intelligence

MLCommons Releases New MLPerf Inference v5.1 Benchmark Results

September 9, 2025
in Artificial Intelligence, GlobeNewswire, Web3
Reading Time: 10 mins read
5
SHARES
244
VIEWS
Share on TwitterShare on LinkedInShare on Facebook

SAN FRANCISCO, Sept. 09, 2025 (GLOBE NEWSWIRE) — Today, MLCommons® announced new results for its industry-standard MLPerf® Inference v5.1 benchmark suite, tracking the relentless forward momentum of the AI community and its new capabilities, new models, and new hardware and software systems.

The MLPerf Inference benchmark suite is designed to measure how quickly systems can run AI models across a variety of workloads. The open-source and peer-reviewed suite performs system performance benchmarking in an architecture-neutral, representative, and reproducible manner, creating a level playing field for competition that drives innovation, performance, and energy efficiency for the entire industry. It provides critical technical information for customers who are procuring and tuning AI systems.

This round of MLPerf Inference results sets a record for the number of participants submitting systems for benchmarking at 27. Those submissions include systems using five newly-available processors and improved versions of AI software frameworks. The v5.1 suite introduces three new benchmarks that further challenge AI systems to perform at their peak against modern workloads.

“The pace of innovation in AI is breathtaking,” said Scott Wasson, Director of Product Management at MLCommons. “The MLPerf Inference working group has aggressively built new benchmarks to keep pace with this progress. As a result, Inference 5.1 features several new benchmark tests, including DeepSeek-R1 with reasoning, and interactive scenarios with tighter latency requirements for some LLM-based tests. Meanwhile, the submitters to MLPerf Inference 5.1 yet again have produced results demonstrating substantial performance gains over prior rounds.”

Llama 2 70B generative AI test establishes the trendlines

The Llama 2 70B benchmark continues to be the most popular benchmark in the suite, with 24 submitters in this round.

CAPTION

It also gives a clear picture of overall performance improvement in AI systems over time. In some scenarios, the best performing systems improved by as much as 50% over the best system in the 5.0 release just six months ago. This round saw another first: a submission of a heterogeneous system that used software to load-balance an inference workload across different types of accelerators.

CAPTION

In response to demand from the community, this round expands the interactive scenario introduced in the previous version, which tests performance under lower latency constraints as required for agentic and other applications of LLMs. The interactive scenarios, now tested for multiple models, saw robust participation from submitters in version 5.1.

Three new tests introduced

MLPerf Inference v5.1 introduces three new benchmarks to the suite: DeepSeek-R1; Llama 3.1 8B; and Whisper Large V3.

DeepSeek R1 is the first “reasoning model” to be added to the suite. Reasoning models are designed to tackle challenging tasks, using a multi-step process to break down problems into smaller pieces in order to produce higher quality responses. The workload in the test incorporates prompts from five datasets covering mathematics problem-solving, general question answering, and code generation.

“Reasoning models are an emerging and important area for AI models, with their own unique pattern of processing,” said Miro Hodak, MLPerf Inference working group co-chair. “It’s important to have real data to understand how reasoning models perform on existing and new systems, and MLCommons is stepping up to provide that data. And it’s equally important to thoroughly stress-test the current systems so that we learn their limits; DeepSeek R1 increases the difficulty level of the benchmark suite, giving us new and valuable information.”

More information on the DeepSeek R1 benchmark can be found here.

Llama 3.1 8B is a smaller LLM useful for tasks such as text summarization in both datacenter and edge scenarios. With the Inference 5.1 release, this model is replacing an older one (GPT-J) but retaining the same dataset, performing the same benchmark task but with a more contemporary model that better reflects the current state of the art. Llama 3.1 8B uses a large context length of 128,000 tokens, whereas GPT-J only used 2048. The test uses the CNN-DailyMail dataset, among the most popular publicly available for text summarization tasks. The Llama 3.1 8B benchmark supports both datacenter and edge systems, with custom workloads for each.

More information on the Llama 3.1 8B benchmark can be found here.

Whisper Large V3 is an open-source speech recognition model built on a transformer-based encoder-decoder architecture. It features high accuracy and multilingual capabilities across a wide range of tasks, including transcription and translation. For the benchmark test it is paired with a modified version of the Librispeech audio dataset. The benchmark supports both datacenter and edge systems.

“MLPerf Inference benchmarks are live and designed to capture the state of AI deployment across the industry,” said Frank Han, co-chair of the MLPerf Inference Working Group. “This round adds a speech-to-text model, reflecting the need to benchmark beyond large language models. Speech recognition combines language modeling with additional stages like acoustic feature extraction and segmentation, broadening the performance profile and stressing system aspects such as memory bandwidth, latency, and throughput. By including such workloads, MLPerf Inference offers a more holistic and realistic view of AI inference challenges.”

More information on the Whisper Large V3 benchmark can be found here.

The momentum builds for AI… and for MLPerf benchmarks

The MLPerf Inference 5.1 benchmark received submissions from a total of 27 participating organizations: AMD, ASUSTek, Azure, Broadcom, Cisco, Coreweave, Dell, GATEOverflow, GigaComputing, Google, Hewlett Packard Enterprise, Intel, KRAI, Lambda, Lenovo, MangoBoost, MiTac, Nebius, NVIDIA, Oracle, Quanta Cloud Technology, Red Hat Inc, Single Submitter: Amitash Nanda, Supermicro, TheStage AI, University of Florida, and Vultr.

The results included tests for five newly-available accelerators:

  • AMD Instinct MI355X
  • Intel Arc Pro B60 48GB Turbo
  • NVIDIA GB300
  • NVIDIA RTX 4000 Ada-PCIe-20GB
  • NVIDIA RTX Pro 6000 Blackwell Server Edition

“This is such an exciting time to be working in the AI community,” said David Kanter, head of MLPerf at MLCommons. “Between the breathtaking pace of innovation and the robust flow of new entrants, stakeholders who are procuring systems have more choices than ever. Our mission with the MLPerf Inference benchmark is to help them make well-informed choices, using trustworthy, relevant performance data for the workloads they care about the most. The field of AI is certainly a moving target, but that makes our work – and our effort to stay on the cutting edge – even more essential.”

Kanter continued, “We would like to welcome our new submitters for version 5.1: MiTac, Nebius, Single Submitter: Amitash Nanda, TheStage AI, University of Florida, and Vultr. And I would particularly like to highlight our two participants from academia: Amitash Nanda, and the team from the University of Florida. Both academia and industry have important roles to play in efforts such as ours to advance open, transparent, trustworthy benchmarks. In this round we also received two power submissions, a data center submission from Lenovo and an edge submission from GATEOverflow. MLPerf Power results combine performance results with power measurements to offer a true indication of power-efficient computing. We commend these participants for their submissions and invite broader MLPerf Power participation from the community going forward.”

View the results

To view the results for MLPerf Inference v5.1, please visit the Datacenter and Edge benchmark results pages.

About MLCommons

MLCommons is the world’s leader in AI benchmarking. An open engineering consortium supported by over 125 members and affiliates, MLCommons has a proven record of bringing together academia, industry, and civil society to measure and improve AI. The foundation for MLCommons began with the MLPerf benchmarks in 2018, which rapidly scaled as a set of industry metrics to measure machine learning performance and promote transparency of machine learning techniques. Since then, MLCommons has continued to use collective engineering to build the benchmarks and metrics required for better AI – ultimately helping to evaluate and improve the accuracy, safety, speed, and efficiency of AI technologies.

For additional information on MLCommons and details on becoming a member, please visit MLCommons.org or email participation@mlcommons.org.

Press Inquiries: contact press@mlcommons.org

Photos accompanying this announcement are available at

https://www.globenewswire.com/NewsRoom/AttachmentNg/e7e63ecd-ed99-4ceb-bfe2-847abc32d2e6

https://www.globenewswire.com/NewsRoom/AttachmentNg/1057c0ca-e973-409d-9969-42f725cc70d9

About Web3Wire
Web3Wire – Information, news, press releases, events and research articles about Web3, Metaverse, Blockchain, Artificial Intelligence, Cryptocurrencies, Decentralized Finance, NFTs and Gaming.
Visit Web3Wire for Web3 News and Events, Block3Wire for the latest Blockchain news and Meta3Wire to stay updated with Metaverse News.

ShareTweet1ShareSendShare2
Previous Post

Trust Stamp joins South Korea’s K-Startup Grand Challenge 2025

Next Post

Tai and Navix Announce Partnership to Simplify Freight Audit and Accelerate Cash Flow for Brokers and 3PLs

Related Posts

Datavault AI and CyberCatch Announce Signing of Binding Letter of Intent for Datavault AI to Acquire CyberCatch to Accelerate AI-Driven, Quantum-Resistant Cyber Risk Mitigation Solutions

Strategic acquisition is anticipated to position Datavault AI to bring CyberCatch's AI-enabled cyber risk mitigation solution into Datavault AI's SanQtum-secured edge Graphics Processing Unit ecosystem, addressing a global information security market projected to reach $240 billion in 2026 (Gartner) CyberCatch's post-quantum cryptography conversion plan is also expected to position the...

Read moreDetails

Mind Well Solutions Launches Prova — The World’s First AI Synthetic Focus Group Platform at insightroom.io

Mind Well Solutions Launches Prova -- The World's First AI Synthetic Focus Group Platform at insightroom.ioRevolutionary SaaS platform replaces $10,000+ traditional focus groups with AI-powered synthetic audience simulation in under 2 minutes PORTSMOUTH, NH, May 01, 2026 /24-7PressRelease/ -- Mind Well Solutions today announced the launch of Prova, the world's...

Read moreDetails

Natixis CIB Expands in India with the Establishment of GIFT City Branch

MUMBAI, India, May 1, 2026 /PRNewswire/ -- Natixis Corporate & Investment Banking, (Natixis CIB) today announced that it has opened a branch in Gujarat International Finance Tec-City (GIFT City). This marks a significant milestone in the bank's long-term strategy to strengthen its presence in India and the wider Asia Pacific...

Read moreDetails

Inspira Secures $596,000 AME System Order from Leading Irish Technological University

RA'ANANA, Israel, May 01, 2026 (GLOBE NEWSWIRE) -- Inspira Technologies Oxy B.H.N. Ltd. (Nasdaq: IINN, IINNW) ("Inspira" or the "Company") today announced that it has secured a $596,000 purchase order for an Additively Manufactured Electronics ("AME") system from a leading Irish technological research university. The order is structured with a...

Read moreDetails

Inuvo Announces Nomination of Sanja Partalo to Board as Company Prioritizes IntentKey AI Commercial Integrations

LITTLE ROCK, Ark., May 01, 2026 (GLOBE NEWSWIRE) -- Inuvo, Inc. (NYSE American: INUV) (the “Company”), a leader in artificial intelligence-driven advertising technology, today announced that Adtech authority Sanja Partalo has been nominated for election to the Company’s Board of Directors at the 2026 annual meeting of shareholders. Partalo is...

Read moreDetails

Bel Fuse Announces Upcoming Conference Schedule for May 2026

WEST ORANGE, N.J., May 01, 2026 (GLOBE NEWSWIRE) -- Bel Fuse Inc. (Nasdaq: BELFA and BELFB), a global designer, manufacturer, and provider of critical electronic components, systems and solutions for customers in aerospace, defense, industrial, and data-driven markets, today announced its investor conference schedule for May 2026: Oppenheimer 21st Annual Industrial Growth...

Read moreDetails

reAlpha (Nasdaq: AIRE) CEO and CFO to Present Company’s Vertically Integrated Homebuying Vision at Two New York Conferences

DUBLIN, Ohio, May 01, 2026 (GLOBE NEWSWIRE) -- reAlpha Tech Corp. (Nasdaq: AIRE) (the “Company” or “reAlpha”), an AI-powered real estate technology company, today announced Mike Logozzo, Chief Executive Officer, and Thomas Kutzman, Chief Financial Officer, will present at The Market Movers Investor Summit and the D. Boral Global Conference....

Read moreDetails

Rezolve Ai’s SQD Token Goes Live on Revolut, Opening Access to 70 Million+ Users Globally

NEW YORK, May 01, 2026 (GLOBE NEWSWIRE) -- Rezolve Ai (NASDAQ: RZLV), a global leader in Agentic Commerce and AI-powered retail infrastructure, today announced that the native token of its decentralized data layer, SQD, is now officially listed on Revolut, Europe’s leading financial super-app. The listing makes the $SQD token...

Read moreDetails

WRAP® Expands Domestic Adoption of Non-Lethal Response™ Solutions with Purchase Order from Carolina Beach Police Department

MIAMI, May 01, 2026 (GLOBE NEWSWIRE) -- Wrap Technologies, Inc. (NASDAQ: WRAP) (“Wrap” or the “Company”), global leader Non-Lethal Response and public safety technology, today announced that the Carolina Beach Police Department (the “Department”) in North Carolina has issued a purchase order for BolaWrap® devices as part of the Department’s investment...

Read moreDetails

Tower Rush Game UK 2026: 1Win Launched Tower Rush Game App

New York City, NY, May 01, 2026 (GLOBE NEWSWIRE) -- Tower Rush game UK searches have been increasing as more users explore instant-play casino formats that focus on speed, timing, and decision-making rather than traditional gameplay cycles. Unlike slot-based systems, these games allow players to interact continuously, making each session...

Read moreDetails
Web3Wire NFTs - The Web3 Collective

Web3Wire, $W3W Token and .w3w tld Whitepaper

Web3Wire, $W3W Token and .w3w tld Whitepaper

Claim your space in Web3 with .w3w Domain!

Web3Wire

Trending on Web3Wire

  • Top Cross-Chain DeFi Solutions to Watch by 2025

    87 shares
    Share 35 Tweet 22
  • Unifying Blockchain Ecosystems: 2024 Guide to Cross-Chain Interoperability

    160 shares
    Share 64 Tweet 40
  • 74Software completes refinancing of its Term Loans and Revolving Credit Facility

    6 shares
    Share 2 Tweet 2
  • Discover 2025’s Top 5 Promising Low-Cap Crypto Gems

    98 shares
    Share 39 Tweet 25
  • Top 5 Wallets for Seamless Multi-Chain Trading in 2025

    83 shares
    Share 33 Tweet 21
Join our Web3Wire Community!

Our newsletters are only twice a month, reaching around 10000+ Blockchain Companies, 800 Web3 VCs, 600 Blockchain Journalists and Media Houses.


* We wont pass your details on to anyone else and we hate spam as much as you do. By clicking the signup button you agree to our Terms of Use and Privacy Policy.

Web3Wire Podcasts

Upcoming Events

There are currently no events.

Latest on Web3Wire

  • Datavault AI and CyberCatch Announce Signing of Binding Letter of Intent for Datavault AI to Acquire CyberCatch to Accelerate AI-Driven, Quantum-Resistant Cyber Risk Mitigation Solutions
  • Moveon Technologies Appoints Industry Veteran Desmond Lim as Chief Executive Officer to Lead Global Expansion in Advanced Precision Optical Solutions
  • Mind Well Solutions Launches Prova — The World’s First AI Synthetic Focus Group Platform at insightroom.io
  • Brian Armstrong’s Regulatory Playbook Is Working. He’s Not the Only One Who Noticed
  • Natixis CIB Expands in India with the Establishment of GIFT City Branch

RSS Latest on Block3Wire

  • The Algorithmic Monographs: A Five-Volume Civil Code for the Age of Autonomous Intelligence
  • Ali Sadhik Shaik: Practitioner, Scholar, and Author – Focused on the Governance of Intelligent Systems
  • The Klyrox Protocol: A Decentralized Framework to Close the AI Accountability Gap
  • Covo Finance: Revolutionary Crypto Leverage Trading Platform
  • WorldStrides and HEX Announce Partnership to Offer High School and University Students Innovative Courses Designed to Improve Their Outlook in the Digital Age

RSS Latest on Meta3Wire

  • The Algorithmic Monographs: A Five-Volume Civil Code for the Age of Autonomous Intelligence
  • Ali Sadhik Shaik: Practitioner, Scholar, and Author – Focused on the Governance of Intelligent Systems
  • The Klyrox Protocol: A Decentralized Framework to Close the AI Accountability Gap
  • Thumbtack Honored as a 2023 Transform Awards Winner
  • Accenture Invests in Looking Glass to Accelerate Shift from 2D to 3D
Web3Wire

Web3Wire is your go-to source for the latest insights and updates in Web3, Metaverse, Blockchain, AI, Cryptocurrencies, DeFi, NFTs, and Gaming. We provide comprehensive coverage through news, press releases, event updates, and research articles, keeping you informed about the rapidly evolving digital world.

  • About Web3Wire
  • Founder’s Note
  • Web3Wire NFTs – The Web3 Collective
  • .w3w TLD
  • $W3W Token
  • Web3Wire DAO
  • Event Partners
  • Community Partners
  • Our Media Network
  • Media Kit
  • RSS Feeds
  • Contact Us

Crypto Coins

  • Top 10 Coins
  • Top 50 Coins
  • Top 100 Coins
  • All Coins – Marketcap
  • Crypto Coins Heatmap

Crypto Exchanges

  • Top 10 Exchanges
  • Top 50 Exchanges
  • Top 100 Exchanges
  • All Crypto Exchanges

Crypto Stocks

  • Blockchain Stocks
  • NFT Stocks
  • Metaverse Stocks
  • Artificial Intelligence Stocks

Web3Wire Whitepaper | Tokenomics

Web3 Resources

  • Top Web3 and Crypto Youtube Channels
  • Latest Crypto News
  • Latest DeFi News
  • Latest Web3 News

Blockchain Resources

  • Blockchain and Web3 Resources
  • Decentralized Finance (DeFi) – Research Reports
  • All Crypto Whitepapers

Metaverse Resources

  • AR VR and Metaverse Resources
  • Metaverse Courses
Claim your space in Web3 with .w3w!

The Klyrox Protocol | The Algorithmic Monographs

Top 50 Web3 Blogs and Websites
Web3Wire Podcast on Spotify Web3Wire Podcast on Amazon Music 
Web3Wire - Web3 and Blockchain - News, Events and Press Releases | Product Hunt
Web3Wire on Google News

Media Portfolio: Block3Wire | Meta3Wire

  • Privacy Policy
  • Terms of Use
  • Disclaimer
  • Sitemap
  • For Search Engines
  • Crypto Sitemap
  • Exchanges Sitemap

© 2024 Web3Wire. We strongly recommend our readers to DYOR, before investing in any cryptocurrencies, blockchain projects, or ICOs, particularly those that guarantee profits.

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In

Add New Playlist

No Result
View All Result
  • Coins
    • Top 10 Cryptocurrencies
    • Top 50 Cryptocurrencies
    • Top 100 Cryptocurrencies
    • All Coins
  • Exchanges
    • Top 10 Cryptocurrency Exchanges
    • Top 50 Cryptocurrency Exchanges
    • Top 100 Cryptocurrency Exchanges
    • All Crypto Exchanges
  • Stocks
    • Blockchain Stocks
    • NFT Stocks
    • Metaverse Stocks
    • Artificial Intelligence Stocks

© 2024 Web3Wire. We strongly recommend our readers to DYOR, before investing in any cryptocurrencies, blockchain projects, or ICOs, particularly those that guarantee profits.

This website uses cookies. By continuing to use this website you are giving consent to cookies being used. Visit our Privacy and Cookie Policy.