Tuesday, June 30, 2026
  • About Web3Wire
  • Web3Wire NFTs
  • .w3w TLD
  • $W3W Token
  • Web3Wire DAO
  • Media Network
  • RSS Feed
  • Contact Us
Web3Wire
No Result
View All Result
  • Home
  • Web3
    • Latest
    • AI
    • Business
    • Blockchain
    • Cryptocurrencies
    • Decentralized Finance
    • Metaverse
    • Non-Fungible Token
    • Press Release
  • Technology
    • Consumer Tech
    • Digital Fashion
    • Editor’s Choice
    • Guides
    • Stories
  • Coins
    • Top 10 Coins
    • Top 50 Coins
    • Top 100 Coins
    • All Coins
  • Exchanges
    • Top 10 Crypto Exchanges
    • Top 50 Crypto Exchanges
    • Top 100 Crypto Exchanges
    • All Crypto Exchanges
  • Stocks
    • Blockchain Stocks
    • NFT Stocks
    • Metaverse Stocks
    • Artificial Intelligence Stocks
  • Events
  • News
    • Latest Crypto News
    • Latest DeFi News
    • Latest Web3 News
  • Home
  • Web3
    • Latest
    • AI
    • Business
    • Blockchain
    • Cryptocurrencies
    • Decentralized Finance
    • Metaverse
    • Non-Fungible Token
    • Press Release
  • Technology
    • Consumer Tech
    • Digital Fashion
    • Editor’s Choice
    • Guides
    • Stories
  • Coins
    • Top 10 Coins
    • Top 50 Coins
    • Top 100 Coins
    • All Coins
  • Exchanges
    • Top 10 Crypto Exchanges
    • Top 50 Crypto Exchanges
    • Top 100 Crypto Exchanges
    • All Crypto Exchanges
  • Stocks
    • Blockchain Stocks
    • NFT Stocks
    • Metaverse Stocks
    • Artificial Intelligence Stocks
  • Events
  • News
    • Latest Crypto News
    • Latest DeFi News
    • Latest Web3 News
No Result
View All Result
Web3Wire
No Result
View All Result
Home Press Release Accesswire

Atlas Cloud Launches High-Efficiency AI Inference Platform, Outperforming DeepSeek

May 28, 2025
in Accesswire, Artificial Intelligence, Web3
Reading Time: 6 mins read
5
SHARES
256
VIEWS
Share on TwitterShare on LinkedInShare on Facebook

Developed with SGLang, Atlas Inference surpasses leading AI companies in throughput and cost, running DeepSeek V3 & R1 faster than DeepSeek themselves.

NEW YORK CITY, NEW YORK / ACCESS Newswire / May 28, 2025 / Atlas Cloud, the all-in-one AI competency center for training and deploying AI models, today announced the launch of Atlas Inference, an AI inference platform that dramatically reduces GPU and server requirements, enabling faster, more cost-effective deployment of large language models (LLMs).

Atlas Cloud Logo
Atlas Cloud Logo
Atlas Cloud logo

Atlas Inference, co-developed with SGLang, an AI inference engine, maximizes GPU efficiency by processing more tokens faster and with less hardware. When comparing DeepSeek’s published performance results, Atlas Inference’s 12-node H100 cluster outperformed DeepSeek’s reference implementation of their DeepSeek-V3 model while using two-thirds of the servers. Atlas’ platform reduces infrastructure requirements and operational costs while addressing hardware costs, which represent up to 80% of AI operational expenses.

“We built Atlas Inference to fundamentally break down the economics of AI deployment,” said Jerry Tang, Atlas CEO. “Our platform’s ability to process 54,500 input tokens and 22,500 output tokens per second per node means businesses can finally make high-volume LLM services profitable instead of merely break-even. I believe this will have a significant ripple effect throughout the industry. Simply put, we’re surpassing industry standards set by hyperscalers by delivering superior throughput with fewer resources.”

Atlas Inference’s performance also exceeds major players like Amazon, NVIDIA and Microsoft, delivering up to 2.1 times greater throughput using 12 nodes compared to competitors’ larger setups. It maintains sub-5-second first-token latency and 100-millisecond inter-token latency with more than 10,000 concurrent sessions, ensuring a scaled, superior experience. The platform’s performance is driven by four key innovations:

  • Prefill/Decode Disaggregation: Separates compute-intensive operations from memory-bound processes to optimize efficiency

  • DeepExpert (DeepEP) Parallelism with Load Balancers: Ensures over 90% GPU utilization

  • Two-Batch OverlapTechnology: Increases throughput by enabling larger batches and utilization of both compute and communication phases simultaneously

  • DisposableTensor Memory Models: Prevents crashes during long sequences for reliable operation

“This platform represents a significant leap forward for AI inference,” said Yineng Zhang, Core Developer at SGLang. “What we built here may become the new standard for GPU utilization and latency management. We believe this will unlock capabilities previously out of reach for the majority of the industry regarding throughput and efficiency.”

Combined with a lower cost per token, linear scaling behavior, and reduced emissions compared to leading vendors, Atlas Inference provides a cost-efficient and scalable AI deployment.

Atlas Inference works with standard hardware and supports custom models, giving customers complete flexibility. Teams can upload fine-tuned models and keep them isolated on dedicated GPUs, making the platform ideal for organizations requiring brand-specific voice or domain expertise.

The platform is available immediately for enterprise customers and early-stage startups.

About Atlas Cloud

Atlas Cloud is your all-in-one AI competency center, powering leading AI teams with safe, simple, and scalable infrastructure for training and deploying models. Atlas Cloud also offers an on-demand GPU platform that delivers fast, serverless compute. Backed by Dell, HPE, and Supermicro, Atlas delivers near instant access to up to 5,000 GPUs across a global SuperCloud fabric with 99% uptime and baked-in compliance. Learn more at atlascloud.ai.

SOURCE: Atlas Cloud

About Web3Wire
Web3Wire – Information, news, press releases, events and research articles about Web3, Metaverse, Blockchain, Artificial Intelligence, Cryptocurrencies, Decentralized Finance, NFTs and Gaming.
Visit Web3Wire for Web3 News and Events, Block3Wire for the latest Blockchain news and Meta3Wire to stay updated with Metaverse News.
ShareTweet1ShareSendShare2
Previous Post

In Response to the Rise of AI-Assisted Document Fraud, Certidox Offers a Patented, Open-Source, Tamper-Proof Technology – Exclusive Presentation in Chicago on May 29

Next Post

Samsung Knox Support Now Live in AirDroid Business

Related Posts

Intrusion Inc. Announces Acquisition of VigilAigent to Create an AI-Native Cybersecurity Platform

Acquisitio imp oves Compa y's top li e by addi g app oximately $3.5 millio i a ual ecu i g eve ue f om multi-yea co t acts I teg ates a established comme cial etwo k of eselle pa t e s a d custome s Compa y...

Read moreDetails

AIPOCH Launches MedSkillAudit, an AI Audit Framework to Evaluate Medical AI Agent Skills Before Deployment

SINGAPORE, Ju e 29, 2026 (GLOBE NEWSWIRE) -- AIPOCH, i collabo atio with the Depa tme t of Pathology at Zho gsha Hospital, Fuda U ive sity, today u veiled MedSkillAudit, a p e-deployme t domai -specific audit f amewo k desig ed to ide tify scie tifically u eliable...

Read moreDetails

Solulu Tech Expands Stablecoin Infrastructure to Support Cross-Border Payments and Multi-Currency Settlement

New Yo k, NY, Ju e 29, 2026 (GLOBE NEWSWIRE) -- Solulu Tech, a U.S.-based stablecoi i f ast uctu e compa y, today a ou ced the co ti ued expa sio of its global i f ast uctu e desig ed to suppo t complia t stablecoi payme...

Read moreDetails

Autheo Introduces the Internet Operating System: A Decentralized Coordination Layer for the Web, Blockchain, and AI

SHERIDAN, Wyo., Ju e 29, 2026 (GLOBE NEWSWIRE) -- Autheo today lau ched the Mai et of its dece t alized ope ati g system — a coo di atio laye e abli g the Web, Web3, AI age ts, a d c ypto applicatio s to i te ope...

Read moreDetails

ADAM Awarded 20-Year GSA IDIQ Contract for Trident AI, Establishing First Federal AI Data Trust Layer Contract in the United States

Milwaukee, WI, Ju e 29, 2026 --(PR.com)-- ADAM, a Milwaukee-based tech ology compa y buildi g ext-ge e atio data ve ificatio a d AI accou tability i f ast uctu e, today a ou ced that it has bee awa ded a 20-yea I defi ite Delive y/I defi ite...

Read moreDetails

Dr. John Spencer Ellis Expands AI Search Visibility Services for Medical Doctors Seeking Sustainable Patient Acquisition

Las Vegas, NV, Ju e 29, 2026 --(PR.com)-- Reputatio Retu I t oduces Comp ehe sive Solutio s Helpi g Physicia s Mai tai P omi e ce as Patie t Discove y Shifts to A tificial I tellige ce Platfo msD . Joh Spe ce Ellis, fou de of Reputatio...

Read moreDetails

Appy Pie Launches Course Builder Enabling Educators to Deliver Guided Courses Without Code

Ou custome s we e al eady teachi g with quizzes, videos, a d flashca ds i side thei Appy Pie apps. Cou se Builde fi ally lets them co ect those pieces i to a eal cu iculum that lea e s ca follow step by step. A academy,...

Read moreDetails

Physical AI & Mission-Critical Networks to Drive $6.6 Billion Private 5G Market, Says SNS Telecom & IT

P ivate 5G Spe di g to Reach $6.6 Billio P ivate cellula etwo ks la gely emai ed a f i ge solutio i the 2G a d 3G e as, although GSM-R etwo ks fo ailway commu icatio s a e still ope atio al ahead of a...

Read moreDetails

eSIM Intel Launches as an Independent Research Resource to Help Travelers Navigate the Fast Growing eSIM Market

A ew compa iso platfo m b i gs cou t y-by-cou t y cla ity to the fast-g owi g but co fusi g eSIM ma ket. The way t avele s buy mobile data is cha gi g fast. Acco di g to GSMA I tellige ce, global...

Read moreDetails

Artificial Intelligence and GEO: A New Era of Brand Awareness

PHA is a GEO Age cy i İsta bul, Tu key I a e a of apid digital t a sfo matio , a tificial i tellige ce is fu dame tally eshapi g commu icatio a d ma keti g st ategies. New AI-powe ed i fo matio a...

Read moreDetails
Web3Wire NFTs - The Web3 Collective

Web3Wire, $W3W Token and .w3w tld Whitepaper

Web3Wire, $W3W Token and .w3w tld Whitepaper

Claim your space in Web3 with .w3w Domain!

Web3Wire

Trending on Web3Wire

  • Top Cross-Chain DeFi Solutions to Watch by 2025

    151 shares
    Share 60 Tweet 38
  • GENISOM AI Debuts at ICRA 2026 with Full-Stack Embodied Intelligence System

    44 shares
    Share 18 Tweet 11
  • Top Layer 1 Crypto Projects to Watch in 2025

    21 shares
    Share 8 Tweet 5
  • Understanding Soulbound Tokens SBT Their Definition and Significance

    69 shares
    Share 28 Tweet 17
  • Unifying Blockchain Ecosystems: 2024 Guide to Cross-Chain Interoperability

    174 shares
    Share 70 Tweet 44
Join our Web3Wire Community!

Our newsletters are only twice a month, reaching around 10000+ Blockchain Companies, 800 Web3 VCs, 600 Blockchain Journalists and Media Houses.


* We wont pass your details on to anyone else and we hate spam as much as you do. By clicking the signup button you agree to our Terms of Use and Privacy Policy.

Web3Wire Podcasts

Upcoming Events

There are currently no events.

Latest on Web3Wire

  • Intrusion Inc. Announces Acquisition of VigilAigent to Create an AI-Native Cybersecurity Platform
  • AIPOCH Launches MedSkillAudit, an AI Audit Framework to Evaluate Medical AI Agent Skills Before Deployment
  • Solulu Tech Expands Stablecoin Infrastructure to Support Cross-Border Payments and Multi-Currency Settlement
  • Autheo Introduces the Internet Operating System: A Decentralized Coordination Layer for the Web, Blockchain, and AI
  • ADAM Awarded 20-Year GSA IDIQ Contract for Trident AI, Establishing First Federal AI Data Trust Layer Contract in the United States

RSS Latest on Block3Wire

  • The Algorithmic Monographs: A Five-Volume Civil Code for the Age of Autonomous Intelligence
  • Ali Sadhik Shaik: Practitioner, Scholar, and Author – Focused on the Governance of Intelligent Systems
  • The Klyrox Protocol: A Decentralized Framework to Close the AI Accountability Gap
  • Covo Finance: Revolutionary Crypto Leverage Trading Platform
  • WorldStrides and HEX Announce Partnership to Offer High School and University Students Innovative Courses Designed to Improve Their Outlook in the Digital Age

RSS Latest on Meta3Wire

  • The Algorithmic Monographs: A Five-Volume Civil Code for the Age of Autonomous Intelligence
  • Ali Sadhik Shaik: Practitioner, Scholar, and Author – Focused on the Governance of Intelligent Systems
  • The Klyrox Protocol: A Decentralized Framework to Close the AI Accountability Gap
  • Thumbtack Honored as a 2023 Transform Awards Winner
  • Accenture Invests in Looking Glass to Accelerate Shift from 2D to 3D
Web3Wire

Web3Wire is your go-to source for the latest insights and updates in Web3, Metaverse, Blockchain, AI, Cryptocurrencies, DeFi, NFTs, and Gaming. We provide comprehensive coverage through news, press releases, event updates, and research articles, keeping you informed about the rapidly evolving digital world.

  • About Web3Wire
  • Founder’s Note
  • Web3Wire NFTs – The Web3 Collective
  • .w3w TLD
  • $W3W Token
  • Web3Wire DAO
  • Event Partners
  • Community Partners
  • Our Media Network
  • Media Kit
  • RSS Feeds
  • Contact Us

Crypto Coins

  • Top 10 Coins
  • Top 50 Coins
  • Top 100 Coins
  • All Coins – Marketcap
  • Crypto Coins Heatmap

Crypto Exchanges

  • Top 10 Exchanges
  • Top 50 Exchanges
  • Top 100 Exchanges
  • All Crypto Exchanges

Crypto Stocks

  • Blockchain Stocks
  • NFT Stocks
  • Metaverse Stocks
  • Artificial Intelligence Stocks

Web3Wire Whitepaper | Tokenomics

Web3 Resources

  • Top Web3 and Crypto Youtube Channels
  • Latest Crypto News
  • Latest DeFi News
  • Latest Web3 News

Blockchain Resources

  • Blockchain and Web3 Resources
  • Decentralized Finance (DeFi) – Research Reports
  • All Crypto Whitepapers

Metaverse Resources

  • AR VR and Metaverse Resources
  • Metaverse Courses
Claim your space in Web3 with .w3w!

The Klyrox Protocol | The Algorithmic Monographs

Top 50 Web3 Blogs and Websites
Web3Wire Podcast on Spotify Web3Wire Podcast on Amazon Music 
Web3Wire - Web3 and Blockchain - News, Events and Press Releases | Product Hunt
Web3Wire on Google News

Media Portfolio: Block3Wire | Meta3Wire

  • Privacy Policy
  • Terms of Use
  • Disclaimer
  • Sitemap
  • For Search Engines
  • Crypto Sitemap
  • Exchanges Sitemap

© 2024 Web3Wire. We strongly recommend our readers to DYOR, before investing in any cryptocurrencies, blockchain projects, or ICOs, particularly those that guarantee profits.

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In

Add New Playlist

No Result
View All Result
  • Coins
    • Top 10 Cryptocurrencies
    • Top 50 Cryptocurrencies
    • Top 100 Cryptocurrencies
    • All Coins
  • Exchanges
    • Top 10 Cryptocurrency Exchanges
    • Top 50 Cryptocurrency Exchanges
    • Top 100 Cryptocurrency Exchanges
    • All Crypto Exchanges
  • Stocks
    • Blockchain Stocks
    • NFT Stocks
    • Metaverse Stocks
    • Artificial Intelligence Stocks

© 2024 Web3Wire. We strongly recommend our readers to DYOR, before investing in any cryptocurrencies, blockchain projects, or ICOs, particularly those that guarantee profits.

This website uses cookies. By continuing to use this website you are giving consent to cookies being used. Visit our Privacy and Cookie Policy.