Cloud infrastructure company Vultr is delivering an optimized inference stack on the NVIDIA Rubin platform and adopting NVIDIA Dynamo and NVIDIA Nemotron.
NVIDIA and Vultr continue a long-standing partnership
These moves represent a milestone in NVIDIA and Vultr’s long-standing collaboration, providing tokenomics to support enterprises with ready-to-deploy composable cloud infrastructure that leverages NVIDIA-optimized open-source model and inference frameworks.
By adopting NVIDIA Dynamo and Nemotron, Vultr will accelerate AI outcomes and targeted use cases. These open-source resources enable higher throughput and seamless scaling of inference workloads.
NVIDIA Dynamo and Nemotron will combine with Vultr’s high-performance infrastructure to accelerate deployment while reducing inference costs.
Vultr cloud enables building at scale in fast-paced AI innovation cycles
As a Preferred NVIDIA Cloud Partner, Vultr customers can build once and deploy widely to drive scale and reduce time-to-value for AI applications.
This new enterprise inference stack can be deployed on public, private, and sovereign clouds.
“This rise of agentic AI demands powerful, reliable AI infrastructure, and a production-ready full stack to accelerate innovation,” said J.J. Kardwell, CEO of Vultr.
“With NVIDIA and our software partners, we are delivering an integrated AI environment that enables enterprises to deploy next-generation models efficiently and at scale on NVIDIA’s Rubin Platform.”
NemoClaw gains secure environment boost
Vultr and NVIDIA are also working together on NVIDIA NemoClaw, an open-source stack that simplifies running OpenClaw always-on assistants more safely with a single command.
It is part of the NVIDIA Agent Toolkit, installing the NVIDIA OpenShell runtime – a secure environment for running autonomous agents and open source models like NVIDIA Nemotron.
“Vultr’s global reach and hyperscaler-level capacity make them a powerful partner in this next evolution of the AI era,” said Dave Salvator, director of accelerated computing products at NVIDIA.
“Innovating with Vultr allows us to optimize our robust open-source portfolio for enterprise AI workloads, propelling advancements in agentic AI and reinventing the economics of inference. Unlocking NVIDIA Vera Rubin systems means unlocking the future of the enterprise, where AI takes productivity, efficiency, and quality of service to new heights.”
Vultr partners with NetApp on data estate
Additionally, Vultr has partnered with NetApp to deliver a resilient, high-performance foundation required for an AI-ready data estate.
NetApp’s AFX, a disaggregated data management platform, delivers the performance and scale needed for building modern AI-driven business solutions.
Combining NetApp’s AI Data Engine, built on the NVIDIA AI Data Platform reference design, enables AI services accelerated with AI-ready data transformed in-place, secured, and performant for enterprise-scale inference, driving agentic AI workflows.
“Our collaboration with Vultr was founded on a shared mission to help enterprises navigate today’s data management challenges and push the boundaries of AI,” said Syam Nair, chief product officer at NetApp.
“In bringing an enterprise-grade data platform delivering GPU-saturating performance with built-in security to this optimized stack for the next generation of AI infrastructure, we’re helping customers leverage AI with agility and deliver business outcomes without compromises.”
Vultr has announced the immediate availability of full-stack NVIDIA AI Enterprise inference solutions through partners WWT and NetApp, with planned support for NVIDIA Vera Rubin in Q4 2026.
NVIDIA has been on a roll expanding collaborations with partners in 2026. Read more about their expanded partnership with Red Hat to deliver rack-scale, enterprise AI.





