4,500+ servers built on MCP Fusion
Vinkius
NVIDIA NIM logo
Vinkius
Claude Code logo

How to Use the NVIDIA NIM MCP in Claude Code

Claude Code manages NVIDIA NIM infrastructure from your terminal using native MCP tools for scaling and telemetry.

See Vinkius in Action

Works with every AI agent you already use

…and any MCP-compatible client

NVIDIA NIM MCP on Cursor AI Code Editor MCP Client NVIDIA NIM MCP on Claude Desktop App MCP Integration NVIDIA NIM MCP on OpenAI Agents SDK MCP Compatible NVIDIA NIM MCP on Visual Studio Code MCP Extension Client NVIDIA NIM MCP on GitHub Copilot AI Agent MCP Integration NVIDIA NIM MCP on Google Gemini AI MCP Integration NVIDIA NIM MCP on Lovable AI Development MCP Client NVIDIA NIM MCP on Mistral AI Agents MCP Compatible NVIDIA NIM MCP on Amazon AWS Bedrock MCP Support
MCP Servers - Free for Subscribers
Claude Code

Connect NVIDIA NIM MCP to Claude Code

Create your Vinkius account to connect NVIDIA NIM to Claude Code and route execution through our secure gateway. The platform manages server hosting, runtime updates, and security layers. Configuration requires no manual server provisioning.

GDPR Free for Subscribers

Headless NVIDIA NIM monitoring

Claude Code runs `nim_get_gpu_status` to pull hardware metrics directly into your shell. You can pipe this output into scripts for automated monitoring. It also uses `nim_get_metrics` to extract scaling data. This is perfect for CI/CD pipelines that need to verify throughput before proceeding.

Terminal-based deployment control

You can trigger `nim_scale_replicas` from the command line to adjust your inference capacity. It is a direct way to handle spikes without a GUI. It uses `nim_list_models` to inventory your environment. You get a clean list of inference targets printed right in your terminal window.

Automated health checks for NVIDIA NIM

Claude Code calls `nim_check_health_live` to confirm orchestrator responsiveness in your cron jobs. It ensures your infrastructure is ready before executing tasks. It verifies readiness with `nim_check_health_ready` to ensure model artifacts are loaded. It prevents pipeline failures by checking the state of your GPU layers first.

Setup guide

Set up NVIDIA NIM MCP in Claude Code

Prerequisites

  • Claude Code CLI installed (npm install -g @anthropic-ai/claude-code)
  • Active Vinkius subscription with a valid endpoint token
  1. 1

    Run the add command

    Open your terminal and run the command shown on the right. Replace [YOUR_TOKEN_HERE] with your endpoint token from cloud.vinkius.com. Use --scope user to make it available across all projects.

  2. 2

    Verify the connection

    Start a Claude Code session and type /mcp to list connected servers. You should see nvidia-nim-mcp with a green status indicator.

  3. 3

    Start using tools

    Ask Claude Code something like "Check my latest NVIDIA NIM transactions." It will automatically discover and invoke the available NVIDIA NIM tools.

Terminal
claude mcp add --transport http nvidia-nim-mcp https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp

Why Choose Vinkius

Vinkius connects your tools to AI with real-time monitoring and automatic cost savings — all from one dashboard.

Real-time monitoring

Live

visibility into every interaction

Connect your favorite tools to your AI and see exactly what's happening — every request, every response, in real time.

Built-in savings

60%

lower AI costs

Vinkius compresses data between your apps and your AI automatically. Lower bills every month — no configuration required.

Single dashboard

One

place for every integration

Every tool your AI connects to, managed from a single screen. One account, complete control.

Common questions about NVIDIA NIM MCP in Claude Code

Use `nim_get_container_logs` to pull the output into your terminal. You can then use standard tools like grep to search for specific errors in the log stream.
Yes. You can include `nim_scale_replicas` in your scripts to adjust hardware assignments dynamically during your build process.
Run the health check tools directly from your CLI. Claude Code returns the status of your inference layers immediately after the probe finishes.
Your connection details are managed through your local config file. Claude Code uses these to talk to the server without exposing your tokens in the shell history.
It accesses GPU memory stats, model metadata, and container logs. This is purely read/write telemetry intended for operational management of your local inference stack.

Start using the NVIDIA NIM MCP today

We host it, we monitor it, we maintain it. You just paste one token.

Built & Managed by Vinkius 30s setup 8 tools

We've already built the connector for NVIDIA NIM. Just plug in your AI agents and start using Vinkius.

No hosting. No infrastructure. No complex setup.
All 8 tools are live and waiting. You're up and running in seconds.

Claude Claude
ChatGPT ChatGPT
Cursor Cursor
Gemini Gemini
Windsurf Windsurf
VS Code VS Code
JetBrains JetBrains
Vercel Vercel
+ other MCP clients

Vinkius gives your AI agents access to the full catalog of app connectors, all fully managed, secure, and enterprise-ready. One subscription, every tool you need.

Zero hosting required Full MCP catalog included Enterprise-grade security Auto-updated by Vinkius

Built, hosted, and secured by Vinkius. You just connect and go.