Tuesday, June 2, 2026
Web3Wire

Cloudera Unveils AI Inference Service with Embedded NVIDIA NIM Microservices to Accelerate GenAI Development and Deployment

October 8, 2024
in GlobeNewswire

Cloudera’s AI Inference service boosts LLM performance by up to 36x using NVIDIA accelerated computing and NVIDIA NIM microservices, providing enhanced performance, robust security, and scalable flexibility for enterprises

Combined capability brings together companies’ differentiators in a single offering: Cloudera’s trusted data as the foundation for trusted AI with NVIDIA accelerated computing and the NVIDIA AI Enterprise software platform to deploy secure and performant AI applications privately on Cloudera

SANTA CLARA, Calif. and NEW YORK, Oct. 08, 2024 (GLOBE NEWSWIRE) -- Cloudera, the only true hybrid platform for data, analytics, and AI, today launched Cloudera AI Inference powered by NVIDIA NIM microservices, part of the NVIDIA AI Enterprise platform. As one of the industry’s first AI inference services to provide embedded NIM microservice capability, Cloudera AI Inference uniquely streamlines the deployment and management of large-scale AI models, allowing enterprises to harness their data’s true potential to advance GenAI from pilot phases to full production.

Recent data from Deloitte reveals the biggest barriers to GenAI adoption for enterprises are compliance risks and governance concerns, yet adoption of GenAI is progressing at a rapid pace, with over two-thirds of organizations increasing their GenAI budgets in Q3 this year. To mitigate these concerns, businesses must turn to running AI models and applications privately – whether on premises or in public clouds. This shift requires secure and scalable solutions that avoid complex, do-it-yourself approaches.

Cloudera AI Inference protects sensitive data from leaking to non-private, vendor-hosted AI model services by providing secure development and deployment within enterprise control. Powered by NVIDIA technology, the service helps to build trusted data for trusted AI with high-performance speeds, enabling the efficient development of AI-driven chatbots, virtual assistants, and agentic applications impacting both productivity and new business growth.

The launch of Cloudera AI Inference comes on the heels of the company’s collaboration with NVIDIA, reinforcing Cloudera’s commitment to driving enterprise AI innovation at a critical moment, as industries navigate the complexities of digital transformation and AI integration.

Developers can build, customize, and deploy enterprise-grade LLMs with up to 36x faster performance using NVIDIA Tensor Core GPUs and nearly 4x throughput compared with CPUs. The seamless user experience integrates UI and APIs directly with NVIDIA NIM microservice containers, eliminating the need for command-line interfaces (CLI) and separate monitoring systems. The service integration with Cloudera’s AI Model Registry also enhances security and governance by managing access controls for both model endpoints and operations. Users benefit from a unified platform where all models—whether LLM deployments or traditional models—are seamlessly managed under a single service.

Additional key features of Cloudera AI Inference include:

  • Advanced AI Capabilities: Utilize NVIDIA NIM microservices to optimize open-source LLMs, including Llama and Mistral, for cutting-edge advancements in natural language processing (NLP), computer vision, and other AI domains.
  • Hybrid Cloud & Privacy: Run workloads on premises or in the cloud, with VPC deployments for enhanced security and regulatory compliance.
  • Scalability & Monitoring: Rely on auto-scaling, high availability (HA), and real-time performance tracking to detect and correct issues, and deliver efficient resource management.
  • Open APIs & CI/CD Integration: Access standards-compliant APIs for model deployment, management, and monitoring for seamless integration with CI/CD pipelines and MLOps workflows.
  • Enterprise Security: Enforce model access with Service Accounts, Access Control, Lineage, and Auditing features.
  • Risk-Managed Deployment: Conduct A/B testing and canary rollouts for controlled model updates.
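
As an illustration of the canary rollouts mentioned in the list above, the following is a minimal sketch of percentage-based traffic splitting between a stable model and a canary model. It is not Cloudera's or NVIDIA's API: the function names `pick_variant` and `split_counts`, the `request_id` input, and the `canary_percent` parameter are all hypothetical and chosen only for this example.

```python
import hashlib


def pick_variant(request_id: str, canary_percent: int) -> str:
    """Return "canary" or "stable" for a request.

    Hypothetical helper for illustration only; it is not part of any
    Cloudera or NVIDIA interface.
    """
    if not 0 <= canary_percent <= 100:
        raise ValueError("canary_percent must be between 0 and 100")
    # Hash the ID so the same request ID always lands in the same bucket (0-99).
    digest = hashlib.sha256(request_id.encode("utf-8")).digest()
    bucket = int.from_bytes(digest[:8], "big") % 100
    # Buckets below the threshold go to the canary model.
    return "canary" if bucket < canary_percent else "stable"


def split_counts(request_ids, canary_percent: int) -> dict:
    """Count how many of the given request IDs each variant would receive."""
    counts = {"stable": 0, "canary": 0}
    for rid in request_ids:
        counts[pick_variant(rid, canary_percent)] += 1
    return counts
```

Hashing the request ID rather than drawing a random number keeps routing repeatable, so a given ID is always served by the same variant while the canary percentage is gradually raised.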

“Enterprises are eager to invest in GenAI, but it requires not only scalable data but also secure, compliant, and well-governed data,” said industry analyst Sanjeev Mohan. “Productionizing AI at scale privately introduces complexity that DIY approaches struggle to address. Cloudera AI Inference bridges this gap by integrating advanced data management with NVIDIA’s AI expertise, unlocking data’s full potential while safeguarding it. With enterprise-grade security features like service accounts, access control, and audit, organizations can confidently protect their data and run workloads on prem or in the cloud, deploying AI models efficiently with the necessary flexibility and governance.”

“We are excited to collaborate with NVIDIA to bring Cloudera AI Inference to market, providing a single AI/ML platform that supports nearly all models and use cases so enterprises can both create powerful AI apps with our software and then run those performant AI apps in Cloudera as well,” said Dipto Chakravarty, Chief Product Officer at Cloudera. “With the integration of NVIDIA AI, which facilitates smarter decision-making through advanced performance, Cloudera is innovating on behalf of its customers by building trusted AI apps with trusted data at scale.”

“Enterprises today need to seamlessly integrate generative AI with their existing data infrastructure to drive business outcomes,” said Kari Briski, vice president of AI software, models and services at NVIDIA. “By incorporating NVIDIA NIM microservices into Cloudera’s AI Inference platform, we’re empowering developers to easily create trustworthy generative AI applications while fostering a self-sustaining AI data flywheel.”

These new capabilities will be unveiled at Cloudera’s premier AI and data conference, Cloudera EVOLVE NY, taking place Oct. 10. Click here to learn more about how these latest updates deepen Cloudera’s commitment, elevating enterprise data from pilot to production with GenAI.

About Cloudera
Cloudera is the only true hybrid platform for data, analytics, and AI. With 100x more data under management than other cloud-only vendors, Cloudera empowers global enterprises to transform data of all types, on any public or private cloud, into valuable, trusted insights. Our open data lakehouse delivers scalable and secure data management with portable cloud-native analytics, enabling customers to bring GenAI models to their data while maintaining privacy and ensuring responsible, reliable AI deployments. The world’s largest brands in financial services, insurance, media, manufacturing, and government rely on Cloudera to use their data to solve what seemed impossible—today and in the future.

To learn more, visit Cloudera.com and follow us on LinkedIn and X. Cloudera and associated marks are trademarks or registered trademarks of Cloudera, Inc. All other company and product names may be trademarks of their respective owners.

Contact

Jess Hohn-Cabana
cloudera@v2comms.com

About Web3Wire
Web3Wire – Information, news, press releases, events and research articles about Web3, Metaverse, Blockchain, Artificial Intelligence, Cryptocurrencies, Decentralized Finance, NFTs and Gaming.
Visit Web3Wire for Web3 News and Events, Block3Wire for the latest Blockchain news and Meta3Wire to stay updated with Metaverse News.
© 2024 Web3Wire. We strongly recommend our readers to DYOR, before investing in any cryptocurrencies, blockchain projects, or ICOs, particularly those that guarantee profits.
