Google Cloud & NVIDIA Unite to Bring Secure Agentic AI to On-Premises with Gemini Models
April 9, 2025
Google Cloud and NVIDIA have formed a partnership to enable the on-premises deployment of agentic AI capabilities using Gemini models, addressing security concerns for enterprises.
This collaboration integrates Google’s Gemini large language models with NVIDIA’s Blackwell infrastructure and Confidential Computing, enhancing performance while securing sensitive data.
Agentic AI systems are designed to reason, plan, and take autonomous actions, which can significantly enhance enterprise applications such as IT support and fraud detection in finance.
Unlike traditional AI, which performs basic tasks, agentic AI can autonomously diagnose IT issues, execute fixes, and adjust fraud detection rules in real time.
This development is particularly significant for regulated industries like healthcare, finance, and government, which have faced challenges with cloud-only deployment models.
With local deployment options, industries can now innovate with AI while adhering to strict data residency laws, thus maintaining compliance.
The deployment allows enterprises to benefit from public cloud AI capabilities while retaining control and security through local infrastructure.
Developers can leverage AI inferencing through the Gemini API without the burden of infrastructure or model management, streamlining the development process.
Google Cloud has introduced the GKE Inference Gateway to optimize AI inference workloads, enhancing performance and reducing costs while ensuring model security.
This announcement positions Google Cloud as a leading provider of confidential computing capabilities for agentic AI workloads across cloud and hybrid environments.
The public preview of Gemini on Google Distributed Cloud is set to begin in the third quarter of 2025, in collaboration with NVIDIA, which will supply Blackwell systems for on-premises deployment.
The partnership is particularly relevant for industries that require strict compliance and data sovereignty, allowing them to utilize high-quality AI without transferring data to external clouds.
Summary based on 6 sources
Get a daily email with more AI stories
Sources

NVIDIA Blog • Apr 9, 2025
NVIDIA Brings Agentic AI Reasoning to Enterprises With Google Cloud
Interesting Engineering • Apr 9, 2025
NVIDIA and Google enable local deployment of Gemini AI models
Fierce Network • Apr 9, 2025
Nvidia, Google Cloud team to deliver secure, on-prem agentic AI for enterprise workloads
Techzine Global • Apr 9, 2025
Google brings Gemini to on-premises data centers with Distributed Cloud