Ted Hisokawa
March 19, 2025 06:22
NVIDIA unveils DGX Cloud Serverless Inference, a new AI solution that enables seamless deployment across cloud environments with improved scalability and flexibility, aimed at independent software vendors (ISVs).
NVIDIA has announced the launch of DGX Cloud Serverless Inference, an auto-scaling AI inference solution designed to streamline application deployment across various cloud environments. The platform aims to simplify the complexities that independent software vendors (ISVs) face when deploying AI applications worldwide, according to the NVIDIA official blog.
Revolutionizing AI deployment
Powered by NVIDIA Cloud Functions (NVCF), DGX Cloud Serverless Inference abstracts away multi-cluster infrastructure configurations, allowing seamless scaling across multi-cloud and on-premises environments. The platform provides a unified approach to deploying AI workloads, high-performance computing (HPC), and containerized applications, allowing ISVs to extend their reach without managing complex infrastructure.
Advantages for independent software vendors
The serverless inference solution offers several key advantages for ISVs:
- Reduced operational complexity: ISVs can deploy applications closer to customer infrastructure with a single unified service, regardless of the cloud provider.
- Increased agility: The platform allows rapid scaling to accommodate burst or short-term workloads.
- Flexible integration: Existing compute setups can be integrated using bring-your-own (BYO) compute capabilities.
- Exploratory freedom: ISVs can test new geographies and providers without committing to long-term investments, supporting use cases such as data sovereignty and low-latency requirements.
Supporting diverse workloads
DGX Cloud Serverless Inference is equipped to handle a variety of workloads, including AI and graphics workloads. It excels at running large language models (LLMs), object detection, and image generation tasks. The platform is also optimized for graphics workloads such as digital twins and simulations, drawing on NVIDIA's expertise in graphics computing.
How it works
ISVs can get started with DGX Cloud Serverless Inference using NVIDIA NIM microservices and blueprints. The platform supports custom containers, allowing autoscaling and global load balancing across multiple compute targets. This setup lets ISVs deploy applications efficiently, using a single API endpoint to manage requests.
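To make the idea of one endpoint balancing load across several compute targets more concrete, here is a minimal Python sketch. It does not use the NVCF API or reflect how NVIDIA implements routing; the names `ComputeTarget` and `SingleEndpointRouter`, the target names, and the least-utilization policy are all assumptions chosen for illustration.

```python
from dataclasses import dataclass


@dataclass
class ComputeTarget:
    """A hypothetical compute target (for example, one cluster in one cloud)."""
    name: str
    capacity: int        # max concurrent requests this target can serve
    in_flight: int = 0   # requests currently being served

    @property
    def utilization(self) -> float:
        return self.in_flight / self.capacity


class SingleEndpointRouter:
    """Toy stand-in for a single API endpoint in front of many targets."""

    def __init__(self, targets: list[ComputeTarget]) -> None:
        self.targets = targets

    def route(self, request_id: str) -> str:
        # Consider only targets that still have spare capacity.
        available = [t for t in self.targets if t.in_flight < t.capacity]
        if not available:
            raise RuntimeError(f"no capacity for request {request_id}")
        # Pick the least-utilized target; ties go to the first one listed.
        target = min(available, key=lambda t: t.utilization)
        target.in_flight += 1
        return target.name


# Illustrative targets only; the names and capacities are made up.
router = SingleEndpointRouter([
    ComputeTarget("cloud-a", capacity=4),
    ComputeTarget("cloud-b", capacity=2),
    ComputeTarget("on-prem", capacity=2),
])
placements = [router.route(f"req-{i}") for i in range(6)]
print(placements)
```

In this toy model the caller only ever talks to `router.route`, which mirrors the single-endpoint idea: which target serves a given request is decided behind that one entry point.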
Pioneering use cases
Several ISVs have already adopted DGX Cloud Serverless Inference, showcasing its potential to transform various industries. Companies such as Aible and Bria are using the platform to enhance their AI-powered solutions, demonstrating significant improvements in cost efficiency and scalability.
As NVIDIA continues to innovate in AI and cloud computing, DGX Cloud Serverless Inference represents a significant step toward enabling ISVs to harness the full potential of AI technologies with ease and efficiency.