Web3Wire

MLCommons Releases New MLPerf Inference v5.1 Benchmark Results

September 9, 2025
in Artificial Intelligence, GlobeNewswire, Web3

SAN FRANCISCO, Sept. 09, 2025 (GLOBE NEWSWIRE) — Today, MLCommons® announced new results for its industry-standard MLPerf® Inference v5.1 benchmark suite, tracking the relentless forward momentum of the AI community and its new capabilities, new models, and new hardware and software systems.

The MLPerf Inference benchmark suite is designed to measure how quickly systems can run AI models across a variety of workloads. The open-source and peer-reviewed suite performs system performance benchmarking in an architecture-neutral, representative, and reproducible manner, creating a level playing field for competition that drives innovation, performance, and energy efficiency for the entire industry. It provides critical technical information for customers who are procuring and tuning AI systems.

This round of MLPerf Inference results sets a record, with 27 participants submitting systems for benchmarking. Those submissions include systems using five newly available processors and improved versions of AI software frameworks. The v5.1 suite introduces three new benchmarks that further challenge AI systems to perform at their peak against modern workloads.

“The pace of innovation in AI is breathtaking,” said Scott Wasson, Director of Product Management at MLCommons. “The MLPerf Inference working group has aggressively built new benchmarks to keep pace with this progress. As a result, Inference 5.1 features several new benchmark tests, including DeepSeek-R1 with reasoning, and interactive scenarios with tighter latency requirements for some LLM-based tests. Meanwhile, the submitters to MLPerf Inference 5.1 yet again have produced results demonstrating substantial performance gains over prior rounds.”

Llama 2 70B generative AI test establishes the trendlines

The Llama 2 70B benchmark continues to be the most popular benchmark in the suite, with 24 submitters in this round.

It also gives a clear picture of overall performance improvement in AI systems over time. In some scenarios, the best performing systems improved by as much as 50% over the best system in the 5.0 release just six months ago. This round saw another first: a submission of a heterogeneous system that used software to load-balance an inference workload across different types of accelerators.

In response to demand from the community, this round expands the interactive scenario introduced in the previous version, which tests performance under lower latency constraints as required for agentic and other applications of LLMs. The interactive scenarios, now tested for multiple models, saw robust participation from submitters in version 5.1.
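As a rough illustration of what a latency constraint means in practice, the sketch below checks whether a high-percentile latency stays within a limit. The announcement does not state the actual MLPerf latency metrics or limits, so the percentile choice, the limit, and the sample values here are all hypothetical.

```python
import math


def percentile(samples, pct):
    """Nearest-rank percentile of a non-empty list of numbers."""
    if not samples:
        raise ValueError("samples must be non-empty")
    ordered = sorted(samples)
    rank = math.ceil(pct / 100 * len(ordered))  # 1-based nearest rank
    return ordered[max(rank, 1) - 1]


def meets_latency_constraint(latencies_ms, limit_ms, pct=99):
    """Return True if the pct-th percentile latency is within limit_ms."""
    return percentile(latencies_ms, pct) <= limit_ms


# Hypothetical per-request latencies in milliseconds (not MLPerf data).
sample_latencies_ms = [180, 210, 195, 250, 230, 205, 400, 190, 220, 215]
```

A tighter limit turns a single slow request into a failed run, which is why interactive scenarios are harder to satisfy than throughput-only ones.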

Three new tests introduced

MLPerf Inference v5.1 introduces three new benchmarks to the suite: DeepSeek-R1; Llama 3.1 8B; and Whisper Large V3.

DeepSeek-R1 is the first “reasoning model” to be added to the suite. Reasoning models are designed to tackle challenging tasks, using a multi-step process to break down problems into smaller pieces in order to produce higher quality responses. The workload in the test incorporates prompts from five datasets covering mathematics problem-solving, general question answering, and code generation.

“Reasoning models are an emerging and important area for AI models, with their own unique pattern of processing,” said Miro Hodak, MLPerf Inference working group co-chair. “It’s important to have real data to understand how reasoning models perform on existing and new systems, and MLCommons is stepping up to provide that data. And it’s equally important to thoroughly stress-test the current systems so that we learn their limits; DeepSeek R1 increases the difficulty level of the benchmark suite, giving us new and valuable information.”

More information on the DeepSeek-R1 benchmark can be found here.

Llama 3.1 8B is a smaller LLM useful for tasks such as text summarization in both datacenter and edge scenarios. With the Inference 5.1 release, this model replaces an older one (GPT-J) while retaining the same dataset, so the benchmark task is unchanged but runs on a more contemporary model that better reflects the current state of the art. Llama 3.1 8B uses a large context length of 128,000 tokens, whereas GPT-J used only 2,048. The test uses the CNN-DailyMail dataset, among the most popular publicly available datasets for text summarization tasks. The Llama 3.1 8B benchmark supports both datacenter and edge systems, with custom workloads for each.

More information on the Llama 3.1 8B benchmark can be found here.

Whisper Large V3 is an open-source speech recognition model built on a transformer-based encoder-decoder architecture. It features high accuracy and multilingual capabilities across a wide range of tasks, including transcription and translation. For the benchmark test it is paired with a modified version of the Librispeech audio dataset. The benchmark supports both datacenter and edge systems.

“MLPerf Inference benchmarks are live and designed to capture the state of AI deployment across the industry,” said Frank Han, co-chair of the MLPerf Inference Working Group. “This round adds a speech-to-text model, reflecting the need to benchmark beyond large language models. Speech recognition combines language modeling with additional stages like acoustic feature extraction and segmentation, broadening the performance profile and stressing system aspects such as memory bandwidth, latency, and throughput. By including such workloads, MLPerf Inference offers a more holistic and realistic view of AI inference challenges.”

More information on the Whisper Large V3 benchmark can be found here.

The momentum builds for AI… and for MLPerf benchmarks

The MLPerf Inference 5.1 benchmark received submissions from a total of 27 participating organizations: AMD, ASUSTek, Azure, Broadcom, Cisco, Coreweave, Dell, GATEOverflow, GigaComputing, Google, Hewlett Packard Enterprise, Intel, KRAI, Lambda, Lenovo, MangoBoost, MiTac, Nebius, NVIDIA, Oracle, Quanta Cloud Technology, Red Hat Inc, Single Submitter: Amitash Nanda, Supermicro, TheStage AI, University of Florida, and Vultr.

The results included tests for five newly available accelerators:

  • AMD Instinct MI355X
  • Intel Arc Pro B60 48GB Turbo
  • NVIDIA GB300
  • NVIDIA RTX 4000 Ada-PCIe-20GB
  • NVIDIA RTX Pro 6000 Blackwell Server Edition

“This is such an exciting time to be working in the AI community,” said David Kanter, head of MLPerf at MLCommons. “Between the breathtaking pace of innovation and the robust flow of new entrants, stakeholders who are procuring systems have more choices than ever. Our mission with the MLPerf Inference benchmark is to help them make well-informed choices, using trustworthy, relevant performance data for the workloads they care about the most. The field of AI is certainly a moving target, but that makes our work – and our effort to stay on the cutting edge – even more essential.”

Kanter continued, “We would like to welcome our new submitters for version 5.1: MiTac, Nebius, Single Submitter: Amitash Nanda, TheStage AI, University of Florida, and Vultr. And I would particularly like to highlight our two participants from academia: Amitash Nanda, and the team from the University of Florida. Both academia and industry have important roles to play in efforts such as ours to advance open, transparent, trustworthy benchmarks. In this round we also received two power submissions, a data center submission from Lenovo and an edge submission from GATEOverflow. MLPerf Power results combine performance results with power measurements to offer a true indication of power-efficient computing. We commend these participants for their submissions and invite broader MLPerf Power participation from the community going forward.”
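One common way to combine a performance result with a power measurement is performance per watt. The sketch below assumes that simple ratio; the announcement does not describe the MLPerf Power methodology, and the function name and numbers are hypothetical.

```python
def perf_per_watt(throughput, avg_power_watts):
    """Throughput (e.g. samples or tokens per second) per watt of average power."""
    if avg_power_watts <= 0:
        raise ValueError("avg_power_watts must be positive")
    return throughput / avg_power_watts


# Hypothetical example: 12,000 tokens/s at an average draw of 4,000 W.
example_efficiency = perf_per_watt(12000.0, 4000.0)
```

Under this ratio, a system with lower raw throughput can still rank higher on efficiency if it draws proportionally less power.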

View the results

To view the results for MLPerf Inference v5.1, please visit the Datacenter and Edge benchmark results pages.

About MLCommons

MLCommons is the world’s leader in AI benchmarking. An open engineering consortium supported by over 125 members and affiliates, MLCommons has a proven record of bringing together academia, industry, and civil society to measure and improve AI. The foundation for MLCommons began with the MLPerf benchmarks in 2018, which rapidly scaled as a set of industry metrics to measure machine learning performance and promote transparency of machine learning techniques. Since then, MLCommons has continued to use collective engineering to build the benchmarks and metrics required for better AI – ultimately helping to evaluate and improve the accuracy, safety, speed, and efficiency of AI technologies.

For additional information on MLCommons and details on becoming a member, please visit MLCommons.org or email participation@mlcommons.org.

Press Inquiries: contact press@mlcommons.org

Photos accompanying this announcement are available at

https://www.globenewswire.com/NewsRoom/AttachmentNg/e7e63ecd-ed99-4ceb-bfe2-847abc32d2e6

https://www.globenewswire.com/NewsRoom/AttachmentNg/1057c0ca-e973-409d-9969-42f725cc70d9

About Web3Wire
Web3Wire – Information, news, press releases, events and research articles about Web3, Metaverse, Blockchain, Artificial Intelligence, Cryptocurrencies, Decentralized Finance, NFTs and Gaming.
Visit Web3Wire for Web3 News and Events, Block3Wire for the latest Blockchain news and Meta3Wire to stay updated with Metaverse News.


© 2024 Web3Wire. We strongly recommend our readers to DYOR, before investing in any cryptocurrencies, blockchain projects, or ICOs, particularly those that guarantee profits.
