How to Use the NVIDIA NIM MCP in Pydantic AI
Run type-safe GPU monitoring and scaling operations inside your Pydantic AI agent workflows.
Connect NVIDIA NIM MCP to Pydantic AI
Create your Vinkius account to connect NVIDIA NIM to Pydantic AI and route execution through our secure gateway. The platform manages server hosting, runtime updates, and security layers. Configuration requires no manual server provisioning.
Validate GPU status using Pydantic AI schemas
Calling `nim_get_gpu_status` reports physical GPU topology limits and active memory figures. Because this runs inside Pydantic AI, every returned VRAM value is strictly validated against your Python type models.
If the hardware reports an unexpected memory format, the agent raises a validation error immediately. This prevents corrupted hardware data from breaking downstream routing logic.
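A minimal sketch of the validation step described above, using plain Pydantic. The field names (`index`, `vram_total_mib`, `vram_used_mib`) and the payload shape are assumptions made for illustration; the actual output of `nim_get_gpu_status` may look different.

```python
from pydantic import BaseModel, Field, ValidationError


class GpuStatus(BaseModel):
    """Hypothetical shape of one GPU entry; the real payload may differ."""

    index: int = Field(ge=0)
    vram_total_mib: int = Field(gt=0)
    vram_used_mib: int = Field(ge=0)


def parse_gpu_status(raw: dict) -> GpuStatus:
    """Validate a raw tool payload, raising ValidationError on bad data."""
    return GpuStatus(**raw)


# A well-formed (made-up) payload passes validation.
good = parse_gpu_status(
    {"index": 0, "vram_total_mib": 81920, "vram_used_mib": 40960}
)
print(f"GPU {good.index}: {good.vram_used_mib}/{good.vram_total_mib} MiB used")

try:
    # An unexpected memory format such as "40 GiB" cannot be read as an int.
    parse_gpu_status(
        {"index": 0, "vram_total_mib": 81920, "vram_used_mib": "40 GiB"}
    )
except ValidationError as exc:
    print(f"rejected payload: {len(exc.errors())} invalid field(s)")
```

Failing at the parsing boundary keeps malformed values out of any later routing decision, which is the behavior the section describes.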
Type-safe replica scaling with this MCP Server
`nim_scale_replicas` updates the active container count to handle shifting user demand. Your agent executes this operation with strict integer bounds validation, so no invalid replica count can be sent to the cluster.
The agent then tracks scaling progress with `nim_check_health_ready` to ensure new instances are fully initialized before routing traffic to them.
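The bounds check and the readiness wait could be sketched as follows. The limits `MIN_REPLICAS` and `MAX_REPLICAS`, the `ScaleRequest` model, and the `wait_until_ready` helper are all hypothetical names chosen for this example; the readiness check is a stub standing in for a call to `nim_check_health_ready`.

```python
import time
from typing import Callable

from pydantic import BaseModel, Field, ValidationError

MIN_REPLICAS = 1  # assumed lower bound, adjust to your cluster
MAX_REPLICAS = 8  # assumed upper bound, adjust to your cluster


class ScaleRequest(BaseModel):
    """Hypothetical arguments for a replica scaling call."""

    replicas: int = Field(ge=MIN_REPLICAS, le=MAX_REPLICAS)


def wait_until_ready(
    check: Callable[[], bool], attempts: int = 5, delay_s: float = 0.0
) -> bool:
    """Poll a readiness check until it passes or the attempts run out."""
    for _ in range(attempts):
        if check():
            return True
        time.sleep(delay_s)
    return False


request = ScaleRequest(replicas=4)  # within bounds, accepted

try:
    ScaleRequest(replicas=50)  # above MAX_REPLICAS, rejected before any call
except ValidationError:
    print("replica count out of bounds")

# Stubbed readiness results: not ready twice, then ready.
responses = iter([False, False, True])
ready = wait_until_ready(lambda: next(responses))
print(f"replicas={request.replicas} ready={ready}")
```

Passing the check as a callable keeps the polling loop independent of how readiness is actually queried.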
Audit running models and metadata
Executing `nim_list_models` returns a verified list of active LLMs running on your local backend. The agent checks this list to ensure target models are online before initiating user sessions.
To confirm the exact configuration of those models, `nim_get_metadata` pulls engine execution bounds. Your agent uses this data to verify that max sequence lengths match your application requirements.
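The audit described above amounts to two checks: is the model online, and does its sequence limit cover what the application needs. The sketch below assumes the model list arrives as a list of names and the metadata as a dict keyed by model name with a `max_sequence_length` entry; both shapes, and the model name `example-llm`, are illustrative rather than taken from the real tool output.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelRequirement:
    """What the application needs from one model (illustrative)."""

    name: str
    min_sequence_length: int


def audit_model(
    requirement: ModelRequirement,
    online_models: list[str],
    metadata: dict[str, dict],
) -> list[str]:
    """Return a list of problems; an empty list means the model passes."""
    if requirement.name not in online_models:
        return [f"{requirement.name} is not online"]

    max_len = metadata.get(requirement.name, {}).get("max_sequence_length")
    if max_len is None:
        return [f"{requirement.name} has no max_sequence_length in metadata"]
    if max_len < requirement.min_sequence_length:
        return [
            f"{requirement.name} supports {max_len} tokens, "
            f"{requirement.min_sequence_length} required"
        ]
    return []


# Made-up data standing in for the two tool results.
online = ["example-llm"]
meta = {"example-llm": {"max_sequence_length": 4096}}

print(audit_model(ModelRequirement("example-llm", 2048), online, meta))
print(audit_model(ModelRequirement("example-llm", 8192), online, meta))
print(audit_model(ModelRequirement("missing-llm", 2048), online, meta))
```

Returning a list of problems instead of a boolean lets the agent report why a session should not start.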
Set up NVIDIA NIM MCP in Pydantic AI
Prerequisites
- Python 3.10+ installed
- `pydantic-ai-slim[fastmcp]` package
- Active Vinkius subscription with a valid endpoint token
1. Install Pydantic AI with FastMCP
Run `pip install "pydantic-ai-slim[fastmcp]"`. The FastMCP toolset replaces the deprecated `MCPServerHTTP` class with full protocol support.
2. Configure the FastMCPToolset
Pass a JSON-style config dict to `FastMCPToolset` with your Vinkius URL. Replace `[YOUR_TOKEN_HERE]` with your token from cloud.vinkius.com. Supports Streamable HTTP, SSE, and Stdio transports.
3. Create and run your agent
Pass the toolset to `Agent(toolsets=[toolset])` and call `agent.run()`. Swap `openai:gpt-4o` for any supported model, such as Anthropic, Google, Mistral, or Groq.
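To avoid hard-coding the token from step 2, the config dict can be built from an environment variable. This is a small sketch: the helper `build_mcp_config` and the variable name `VINKIUS_TOKEN` are assumptions made for this example, while the URL pattern and the `nvidia-nim-mcp` server key follow the sample in this guide.

```python
import os


def build_mcp_config(token: str, server_name: str = "nvidia-nim-mcp") -> dict:
    """Build the JSON-style MCP config dict for a Vinkius endpoint token."""
    if not token or token == "[YOUR_TOKEN_HERE]":
        raise ValueError("set a real Vinkius token before connecting")
    return {
        "mcpServers": {
            server_name: {"url": f"https://edge.vinkius.com/{token}/mcp"}
        }
    }


# VINKIUS_TOKEN is an assumed variable name; "example-token" is a stand-in
# so the sketch runs without any environment setup.
config = build_mcp_config(os.environ.get("VINKIUS_TOKEN", "example-token"))
print(list(config["mcpServers"]))  # server names only, never the token
```

The returned dict has the same shape as the one passed to `FastMCPToolset` in this guide.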
import asyncio

from pydantic_ai import Agent
from pydantic_ai.toolsets.fastmcp import FastMCPToolset

# MCP config dict pointing at the hosted Vinkius endpoint.
toolset = FastMCPToolset({
    "mcpServers": {
        "nvidia-nim-mcp": {
            "url": "https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp"
        }
    }
})

agent = Agent(
    "openai:gpt-4o",
    toolsets=[toolset],
    system_prompt="You have access to NVIDIA NIM tools.",
)

async def main() -> None:
    # agent.run() is a coroutine, so it has to be awaited inside an event loop.
    result = await agent.run("List the models currently running on NVIDIA NIM")
    print(result.output)

asyncio.run(main())
Independent Platform Disclaimer: Vinkius is an independent platform and is not affiliated with, endorsed by, sponsored by, verified by, or otherwise authorized by NVIDIA NIM. All third-party trademarks, logos, and brand names are the property of their respective owners. Their use on this website is strictly for informational purposes to identify service compatibility and interoperability.
Why Choose Vinkius
Vinkius connects your tools to AI with real-time monitoring and automatic cost savings — all from one dashboard.
Real-time monitoring
Live visibility into every interaction
Connect your favorite tools to your AI and see exactly what's happening: every request, every response, in real time.
Built-in savings
60% lower AI costs
Vinkius compresses data between your apps and your AI automatically. Lower bills every month, with no configuration required.
Single dashboard
One place for every integration
Every tool your AI connects to, managed from a single screen. One account, complete control.
Use it with your favorite AI tools
Connect this server to Cursor, Claude, VS Code, and more.
Start using the NVIDIA NIM MCP today
We host it, we monitor it, we maintain it. You just paste one token.