
Industry’s First-to-Market Supermicro NVIDIA HGX™ B200 Systems Demonstrate AI Performance Leadership on MLPerf® Inference v5.0 Results

April 4, 2025
in Artificial Intelligence, PRNewswire, Web3

Latest Benchmarks Show Supermicro Systems with the NVIDIA B200 Outperformed the Previous Generation of Systems with 3X the Token Generation Per Second

SAN JOSE, Calif., April 3, 2025 /PRNewswire/ -- Super Micro Computer, Inc. (SMCI), a Total IT Solution Provider for AI/ML, HPC, Cloud, Storage, and 5G/Edge, is announcing first-to-market, industry-leading performance on several MLPerf Inference v5.0 benchmarks using the NVIDIA HGX™ B200 8-GPU. The 4U liquid-cooled and 10U air-cooled systems achieved the best performance in select benchmarks. Supermicro demonstrated more than 3 times the token generation rate (tokens/s) on the Llama2-70B and Llama3.1-405B benchmarks compared to H200 8-GPU systems.

NVIDIA HGX B200 Systems

“Supermicro remains a leader in the AI industry, as evidenced by the first new benchmarks released by MLCommons in 2025,” said Charles Liang, president and CEO of Supermicro. “Our building block architecture enables us to be first-to-market with a diverse range of systems optimized for various workloads. We continue to collaborate closely with NVIDIA to fine-tune our systems and secure a leadership position in AI workloads.”

Learn more about the new MLPerf v5.0 Inference benchmarks at: https://mlcommons.org/benchmarks/inference-datacenter/

Supermicro is the only system vendor publishing record MLPerf inference performance (on select benchmarks) for both the air-cooled and liquid-cooled NVIDIA HGX™ B200 8-GPU systems. Both air-cooled and liquid-cooled systems were operational before the MLCommons benchmark start date. Supermicro engineers optimized the systems and software to showcase the impressive performance. Within the operating margin, the Supermicro air-cooled B200 system exhibited the same level of performance as the liquid-cooled B200 system. Supermicro has been delivering these systems to customers while conducting the benchmarks.

MLCommons emphasizes that all results must be reproducible, that the products must be available, and that the results can be audited by other MLCommons members. Supermicro engineers optimized the systems and software, as allowed by the MLCommons rules.

The SYS-421GE-NBRT-LCC (8x NVIDIA B200-SXM-180GB) and SYS-A21GE-NBRT (8x NVIDIA B200-SXM-180GB) showed performance leadership on the Mixtral 8x7B Mixture of Experts inference benchmarks, at about 129,000 tokens/second. The Supermicro air-cooled and liquid-cooled NVIDIA B200-based systems delivered over 1,000 tokens/second of inference throughput for the large Llama3.1-405B model, whereas previous generations of GPU systems achieved much lower results. For smaller inferencing tasks, using the Llama2-70B benchmark, a Supermicro system with the NVIDIA B200-SXM-180GB installed shows the highest performance from a Tier 1 system supplier.

Specifically:

  • Stable Diffusion XL (Server)
    SYS-A21GE-NBRT (8x B200-SXM-180GB)

    #1 queries/s, 28.92

  • llama2-70b-interactive-99 (Server)
    SYS-A21GE-NBRT (8x B200-SXM-180GB)

    #1 Tokens/s, 62,265.70

  • Llama3.1-405b (Offline)
    SYS-421GE-NBRT-LCC (8x B200-SXM-180GB)

    #1 Tokens/s, 1,521.74

  • Llama3.1-405b (Server)
    SYS-A21GE-NBRT (8x B200-SXM-180GB)

    #1 Tokens/s, 1,080.31 (for an 8-GPU node)

  • mixtral-8x7b (Server)
    SYS-421GE-NBRT-LCC (8x B200-SXM-180GB)

    #1 Tokens/s, 129,047.00

  • mixtral-8x7b (Offline)
    SYS-421GE-NBRT-LCC (8x B200-SXM-180GB)

    #1 Tokens/s, 128,795.00
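
The figures listed above are node-level results for 8-GPU systems. As a rough illustration only, the Python sketch below divides each one by eight to give an average per-GPU rate. The even split across GPUs is an assumption made here for illustration; it is not a metric reported by MLPerf or Supermicro, and the names `RESULTS` and `per_gpu` are hypothetical.

```python
# Node-level results copied from the list above:
# (benchmark, scenario) -> (metric, value for the whole 8-GPU node).
GPUS_PER_NODE = 8  # each listed system carries 8x B200-SXM-180GB

RESULTS = {
    ("Stable Diffusion XL", "Server"): ("queries/s", 28.92),
    ("llama2-70b-interactive-99", "Server"): ("tokens/s", 62265.70),
    ("Llama3.1-405b", "Offline"): ("tokens/s", 1521.74),
    ("Llama3.1-405b", "Server"): ("tokens/s", 1080.31),
    ("mixtral-8x7b", "Server"): ("tokens/s", 129047.00),
    ("mixtral-8x7b", "Offline"): ("tokens/s", 128795.00),
}


def per_gpu(value, gpus=GPUS_PER_NODE):
    """Average rate per GPU, assuming (hypothetically) an even split."""
    return value / gpus


if __name__ == "__main__":
    for (benchmark, scenario), (metric, value) in RESULTS.items():
        print(f"{benchmark} ({scenario}): {per_gpu(value):,.2f} {metric} per GPU")
```

Under that assumption, the mixtral-8x7b Server result works out to roughly 16,000 tokens/s per GPU, and the Llama3.1-405b Server result to roughly 135 tokens/s per GPU.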

“MLCommons congratulates Supermicro on their submission to the MLPerf Inference v5.0 benchmark. We are pleased to see their results showcasing significant performance gains compared to earlier generations of systems,” said David Kanter, Head of MLPerf at MLCommons. “Customers will be pleased by the performance improvements achieved which are validated by the neutral, representative and reproducible MLPerf results.”

Supermicro offers a comprehensive AI portfolio with over 100 GPU-optimized systems, both air-cooled and liquid-cooled options, with a choice of CPUs, ranging from single-socket optimized systems to 8-way multiprocessor systems. Supermicro rack-scale systems include computing, storage, and network components, which reduce the time required to install them once they are delivered to a customer site.

Supermicro’s NVIDIA HGX B200 8-GPU systems utilize next-generation liquid-cooling and air-cooling technology. The newly developed cold plates and the new 250kW coolant distribution unit (CDU) more than double the cooling capacity of the previous generation in the same 4U form factor. Available in 42U, 48U, or 52U configurations, the rack-scale design with the new vertical coolant distribution manifolds (CDM) no longer occupies valuable rack units. This enables eight systems, comprising 64 NVIDIA Blackwell GPUs in a 42U rack, and up to 12 systems with 96 NVIDIA Blackwell GPUs in a 52U rack.
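
The rack densities quoted above follow from simple arithmetic, sketched below in Python: each liquid-cooled system is 4U and holds 8 GPUs. The function name `rack_summary` is hypothetical, and the release does not say how the remaining rack units are used, so the `remaining_u` figure is only what is left over after the GPU systems.

```python
GPUS_PER_SYSTEM = 8   # NVIDIA HGX B200 8-GPU
SYSTEM_HEIGHT_U = 4   # liquid-cooled system form factor, per the release


def rack_summary(rack_height_u, systems):
    """Return GPU count and rack-unit usage for a given number of 4U systems."""
    used_u = systems * SYSTEM_HEIGHT_U
    if used_u > rack_height_u:
        raise ValueError("systems do not fit in the rack")
    return {
        "gpus": systems * GPUS_PER_SYSTEM,
        "used_u": used_u,
        "remaining_u": rack_height_u - used_u,
    }


if __name__ == "__main__":
    # Configurations stated in the release: 8 systems in 42U, 12 systems in 52U.
    for height, systems in [(42, 8), (52, 12)]:
        print(height, rack_summary(height, systems))
```

For the two configurations named in the release, this gives 64 GPUs occupying 32U of a 42U rack and 96 GPUs occupying 48U of a 52U rack.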

The new air-cooled 10U NVIDIA HGX B200 system features a redesigned chassis with expanded thermal headroom to accommodate eight 1000W TDP Blackwell GPUs. Up to 4 of the new 10U air-cooled systems can be installed and fully integrated in a rack, the same density as the previous generation, while providing up to 15x inference and 3x training performance.
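
For a sense of scale, the short Python sketch below totals the GPU count and the GPU thermal design power for a rack of air-cooled systems, using only the figures stated above (8 GPUs per system, 1000W TDP per GPU, up to 4 systems per rack). It counts GPUs only; CPUs, fans, memory, and networking are not covered by the release and are left out, so this is not a full rack power estimate. The function name `air_cooled_rack` is hypothetical.

```python
GPUS_PER_SYSTEM = 8     # HGX B200 8-GPU
GPU_TDP_WATTS = 1000    # per-GPU TDP stated in the release
SYSTEM_HEIGHT_U = 10    # air-cooled system form factor


def air_cooled_rack(systems):
    """GPU count, GPU-only TDP in kW, and rack units for air-cooled systems."""
    gpus = systems * GPUS_PER_SYSTEM
    return {
        "gpus": gpus,
        "gpu_tdp_kw": gpus * GPU_TDP_WATTS / 1000,
        "used_u": systems * SYSTEM_HEIGHT_U,
    }


if __name__ == "__main__":
    print(air_cooled_rack(1))  # a single 10U system
    print(air_cooled_rack(4))  # the maximum per rack stated in the release
```

With four systems, the rack holds 32 GPUs with a combined GPU TDP of 32 kW across 40U.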

About Super Micro Computer, Inc.

Supermicro (NASDAQ: SMCI) is a global leader in Application-Optimized Total IT Solutions. Founded and operating in San Jose, California, Supermicro is committed to delivering first-to-market innovation for Enterprise, Cloud, AI, and 5G Telco/Edge IT Infrastructure. We are a Total IT Solutions provider with server, AI, storage, IoT, switch systems, software, and support services. Supermicro’s motherboard, power, and chassis design expertise further enables our development and production, enabling next-generation innovation from cloud to edge for our global customers. Our products are designed and manufactured in-house (in the US, Taiwan, and the Netherlands), leveraging global operations for scale and efficiency and optimized to improve TCO and reduce environmental impact (Green Computing). The award-winning portfolio of Server Building Block Solutions® allows customers to optimize for their exact workload and application by selecting from a broad family of systems built from our flexible and reusable building blocks that support a comprehensive set of form factors, processors, memory, GPUs, storage, networking, power, and cooling solutions (air-conditioned, free air cooling or liquid cooling).

Supermicro, Server Building Block Solutions, and We Keep IT Green are trademarks and/or registered trademarks of Super Micro Computer, Inc.

All other brands, names, and trademarks are the property of their respective owners.

Photo – https://web3wire.org/wp-content/uploads/2025/04/Super_Micro_Computer_MLPerf.jpg

Logo – https://web3wire.org/wp-content/uploads/2025/04/Supermicro_Logo.jpg

View original content: https://www.prnewswire.co.uk/news-releases/industrys-first-to-market-supermicro-nvidia-hgx-b200-systems-demonstrate-ai-performance-leadership-on-mlperf-inference-v5-0-results-302419125.html

About Web3Wire
Web3Wire – Information, news, press releases, events and research articles about Web3, Metaverse, Blockchain, Artificial Intelligence, Cryptocurrencies, Decentralized Finance, NFTs and Gaming.
Visit Web3Wire for Web3 News and Events, Block3Wire for the latest Blockchain news and Meta3Wire to stay updated with Metaverse News.