Categories: Web and IT News

Gcore integrates NVIDIA Dynamo to deliver high-performance, cost-efficient AI inference as a fully managed service

One-click deployment of NVIDIA’s open-source inference framework across public, private, hybrid, and on-prem environments

Gcore, the global infrastructure and software provider for AI, cloud, network, and security solutions, announced the integration of NVIDIA Dynamo into its AI inference solutions. The integration delivers significant GPU efficiency gains—up to 6x higher throughput and 2x lower latency—as a fully managed, one-click deployment. Dynamo is available now on Gcore Everywhere Inference and Gcore Everywhere AI.

NVIDIA Dynamo is an open-source inference framework, specifically designed to accelerate and optimize large-scale generative AI and inference models. Dynamo addresses the core challenges that businesses experience when running inference at scale: GPU underutilization, static resource allocation, memory bottlenecks, and data transfer inefficiency.

Marketing Technology News: MarTech Interview with Haley Trost, Group Product Marketing Manager @ Braze

Gcore is delivering Dynamo as a fully managed solution, pre-optimized for popular inference models. Customers can activate Dynamo with a single click within the Gcore Customer Portal, without managing routing, KV cache logic, or GPU scheduling. This builds on Gcore’s commitment to simplifying AI deployment through its intuitive, easy-to-use platform. The Dynamo integration is supported across private cloud, hybrid, and on-premises inference environments on Gcore Everywhere AI and Everywhere Inference.

Seva Vayner, Product Director of Edge Cloud and AI at Gcore, comments: “Modern inference isn’t just ‘run a model’—it’s batching, routing, dynamic workloads, longer contexts, and tight SLOs. In that reality, small scheduling and utilization losses become big performance and cost penalties. By integrating Dynamo as a managed service in Gcore, we bring advanced GPU optimization directly into the runtime path so customers see higher effective throughput and steadier tail latency, without operating the complexity themselves.”

Marketing Technology News: Cross-Department Collaboration with Marketing Workflow Automation: Enhancing Alignment Between Sales, Customer Service, and Marketing Teams

Beyond performance gains, NVIDIA Dynamo delivers meaningful cost optimization by increasing GPU utilization and reducing wasted cycles during decode and cache recomputation. By disaggregating prefill and decode, applying KV cache-aware routing, and leveraging NIXL for efficient inter-node communication, Dynamo ensures more requests are processed on the same hardware. This lowers cost per token and improves overall ROI. Gcore makes it particularly easy to access these efficiencies at scale.

The post Gcore integrates NVIDIA Dynamo to deliver high-performance, cost-efficient AI inference as a fully managed service first appeared on PressReleaseCC.

Gcore integrates NVIDIA Dynamo to deliver high-performance, cost-efficient AI inference as a fully managed service first appeared on Web and IT News.

awnewsor

Recent Posts

The Quiet Death of the Dumb Terminal: Why Claude’s New Computer Use Is the Real AI Interface War

Anthropic just made its AI agent permanently resident on your desktop. Not as a chatbot…

8 hours ago

The Billionaire Who Says Your Kids Should Learn to Code Like They Learn to Read — And Why Wall Street Should Listen

Jack Clark thinks coding is the new literacy. Not in the vague, aspirational way that…

8 hours ago

Your AI Chatbot Is Flattering You — And It’s Making Its Answers Worse

Ask a chatbot a question and you’ll get an answer. But the answer you get…

8 hours ago

Google Photos Finally Fixes Its Most Annoying Editing Flaw — And It’s About Time

For years, cropping a photo in Google Photos has been an exercise in quiet frustration.…

8 hours ago

The Squeeze Is On: How U.S. Sanctions, OPEC Politics, and a Shadow War Are Reshaping Global Oil Markets

OPEC’s crude oil production dropped sharply in May, and the reasons stretch far beyond the…

8 hours ago

Google’s Gemini Is About to Know You Better Than You Know Yourself — And That’s the Whole Point

Google is making its biggest bet yet on the idea that artificial intelligence should be…

8 hours ago

This website uses cookies.