How to Use the NVIDIA NIM MCP in Cline

Cline controls your NVIDIA NIM containers by executing hardware scaling and status checks through the MCP server.

Works with every AI agent you already use: Cursor, Claude Desktop, OpenAI Agents SDK, Visual Studio Code, GitHub Copilot, Google Gemini, Lovable, Mistral AI Agents, Amazon Bedrock, and any other MCP-compatible client.

Connect NVIDIA NIM MCP to Cline

Create your Vinkius account to connect NVIDIA NIM to Cline and route execution through our secure gateway. The platform manages server hosting, runtime updates, and security layers. Configuration requires no manual server provisioning.

Cline manages GPU inference states

Cline calls `nim_get_gpu_status` to read memory usage and hardware constraints before it starts a new task. Checking the hardware first helps it avoid out-of-memory (OOM) errors. It also uses `nim_get_metadata` to map out the current configuration, so you get precise data on what is running and how it is tuned.
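The pre-flight check described above can be sketched as a small filter over per-GPU memory figures. This is only an illustration: the `GpuStatus` fields, the 10% safety margin, and the sample numbers are assumptions, since the actual response shape of `nim_get_gpu_status` is not documented here.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class GpuStatus:
    """Hypothetical shape of one GPU entry in a nim_get_gpu_status result."""
    index: int
    memory_total_mib: int
    memory_used_mib: int

    @property
    def memory_free_mib(self) -> int:
        return self.memory_total_mib - self.memory_used_mib


def gpus_with_headroom(gpus: List[GpuStatus], required_mib: int,
                       safety_margin: float = 0.10) -> List[int]:
    """Return indices of GPUs whose free memory covers the request plus a margin."""
    needed = required_mib * (1 + safety_margin)
    return [g.index for g in gpus if g.memory_free_mib >= needed]


# Sample values are made up for illustration.
status = [
    GpuStatus(index=0, memory_total_mib=81920, memory_used_mib=78000),
    GpuStatus(index=1, memory_total_mib=81920, memory_used_mib=20000),
]
candidates = gpus_with_headroom(status, required_mib=40000)
print(candidates)
```

An empty result would be the signal to hold the task rather than start it and risk an OOM failure.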

Cline executes container operations

You can tell Cline to fix a deployment using `nim_scale_replicas`, which adjusts the replica count on the fly to meet demand. Cline also uses `nim_get_container_logs` to look through execution errors. It reads the output and suggests fixes based on the actual logs.
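As a rough illustration of the log triage step, the sketch below counts distinct error messages in raw log text. The log line format and the severity keywords are assumptions; real output from `nim_get_container_logs` may look different, and Cline's own analysis is not limited to pattern matching.

```python
import re
from collections import Counter
from typing import List, Tuple

# Assumed severity keywords; adjust to the actual log format.
ERROR_PATTERN = re.compile(r"\b(ERROR|CRITICAL|FATAL)\b[:\s]+(.*)")


def summarize_errors(log_text: str, top_n: int = 3) -> List[Tuple[str, int]]:
    """Count distinct error messages in raw log text, most frequent first."""
    counts = Counter()
    for line in log_text.splitlines():
        match = ERROR_PATTERN.search(line)
        if match:
            counts[match.group(2).strip()] += 1
    return counts.most_common(top_n)


# Made-up sample log for illustration.
sample_log = """\
2024-05-01T12:00:00Z INFO Starting server
2024-05-01T12:00:05Z ERROR: CUDA out of memory
2024-05-01T12:00:06Z ERROR: CUDA out of memory
2024-05-01T12:00:07Z WARNING slow response
2024-05-01T12:00:08Z ERROR: model not found
"""
summary = summarize_errors(sample_log)
for message, occurrences in summary:
    print(f"{occurrences}x {message}")
```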

Hardware-aware task execution

Cline runs `nim_check_health_live` to ensure the orchestrator is up. It won't try to deploy models if the host is unresponsive. It checks `nim_check_health_ready` to ensure your model artifacts are fully loaded in GPU memory. This makes your agent aware of the actual state of your inference backend.
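The two health checks combine into three backend states. The sketch below shows that gating logic under the assumption that each check reduces to a boolean; the state names and the `classify_backend` helper are hypothetical, not part of the tool set.

```python
from enum import Enum


class BackendState(Enum):
    DOWN = "down"        # liveness check failed: host is unresponsive
    LOADING = "loading"  # live, but model artifacts are not loaded yet
    READY = "ready"      # live and ready to serve inference


def classify_backend(live: bool, ready: bool) -> BackendState:
    """Map the results of the live and ready checks to a single state."""
    if not live:
        # Readiness is meaningless if the host does not respond at all.
        return BackendState.DOWN
    return BackendState.READY if ready else BackendState.LOADING


def can_deploy(state: BackendState) -> bool:
    """Deployment only makes sense when the host responds."""
    return state is not BackendState.DOWN


def can_serve(state: BackendState) -> bool:
    """Inference requires the model to be fully loaded."""
    return state is BackendState.READY


state = classify_backend(live=True, ready=False)
print(state.value, can_deploy(state), can_serve(state))
```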

Setup guide

Set up NVIDIA NIM MCP in Cline

Prerequisites

  • VS Code with Cline extension installed
  • Active Vinkius subscription with a valid endpoint token
  1. Open Cline MCP settings

    Click the Cline icon in the VS Code sidebar to open the Cline panel. Then click the MCP Servers icon (server stack) at the top-right corner of the panel.

  2. Add a remote server

    Click "Remote Servers" at the top, then click "Add Remote MCP". In the Name field, type nvidia-nim-mcp. In the URL field, paste your Vinkius endpoint: https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp. Get your token from cloud.vinkius.com.

  3. Enable the server

    After saving, the server appears in the Cline MCP panel. Toggle the switch to enable it. The status indicator turns green when the connection is live.

  4. Start using tools

    Return to the Cline chat and ask: "Check the GPU status of my NVIDIA NIM deployment." Cline will discover the available tools and request your approval before invoking each one, giving you full control over every action.
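Under the hood, tool discovery and invocation in MCP are JSON-RPC 2.0 requests using the `tools/list` and `tools/call` methods. Cline sends these for you; the sketch below only builds the request bodies to show their shape. The initialization handshake is omitted, and the empty `arguments` object is an assumption, since each tool's real input schema is returned by `tools/list`.

```python
import json
from itertools import count
from typing import Optional

_ids = count(1)  # simple incrementing request ids


def jsonrpc_request(method: str, params: Optional[dict] = None) -> dict:
    """Build a JSON-RPC 2.0 request object of the kind MCP clients send."""
    request = {"jsonrpc": "2.0", "id": next(_ids), "method": method}
    if params is not None:
        request["params"] = params
    return request


# Discovery: ask the server which tools it exposes.
list_tools = jsonrpc_request("tools/list")

# Invocation: call one tool by name. The empty arguments object is an
# assumption; the actual schema comes from the tools/list response.
call_tool = jsonrpc_request(
    "tools/call",
    {"name": "nim_get_gpu_status", "arguments": {}},
)

print(json.dumps(list_tools))
print(json.dumps(call_tool, indent=2))
```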

Cline MCP Settings
{
  "mcpServers": {
    "nvidia-nim-mcp": {
      "url": "https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp"
    }
  }
}
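
If you generate this settings entry from a script, a small helper can catch the common mistake of leaving the `[YOUR_TOKEN_HERE]` placeholder in the URL. This is a sketch: `build_cline_config` is a hypothetical helper, and the token character rule is an assumption, as the real token format may differ.

```python
import json
import re

ENDPOINT_TEMPLATE = "https://edge.vinkius.com/{token}/mcp"
# Assumption: tokens consist of URL-safe characters only.
TOKEN_PATTERN = re.compile(r"^[A-Za-z0-9._~-]+$")


def build_cline_config(token: str, name: str = "nvidia-nim-mcp") -> dict:
    """Return the mcpServers entry for a given endpoint token."""
    if not TOKEN_PATTERN.match(token):
        # Rejects empty strings and the bracketed placeholder alike.
        raise ValueError("token is empty or contains non-URL-safe characters")
    return {"mcpServers": {name: {"url": ENDPOINT_TEMPLATE.format(token=token)}}}


# "example-token-123" is a made-up value for illustration.
config = build_cline_config("example-token-123")
print(json.dumps(config, indent=2))
```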

Independent Platform Disclaimer: Vinkius is an independent platform and is not affiliated with, endorsed by, sponsored by, verified by, or otherwise authorized by NVIDIA NIM. All third-party trademarks, logos, and brand names are the property of their respective owners. Their use on this website is strictly for informational purposes to identify service compatibility and interoperability.

Why Choose Vinkius

Vinkius connects your tools to AI with real-time monitoring and automatic cost savings — all from one dashboard.

Real-time monitoring

Live visibility into every interaction

Connect your favorite tools to your AI and see exactly what's happening — every request, every response, in real time.

Built-in savings

60% lower AI costs

Vinkius compresses data between your apps and your AI automatically. Lower bills every month — no configuration required.

Single dashboard

One place for every integration

Every tool your AI connects to, managed from a single screen. One account, complete control.

Common questions about NVIDIA NIM MCP in Cline

Cline invokes `nim_get_metrics` to grab raw Prometheus data. It uses these numbers to decide if it should scale your replicas or optimize your current model load.
Cline uses `nim_list_models` to see exactly what is deployed. It then writes code or config files based on the models it finds.
The connection relies on your local endpoint. Cline interacts only with the tools you explicitly authorize through the MCP interface.
No. Cline fetches the logs via `nim_get_container_logs` only when you ask it to debug an issue. It does not persist your container data outside of the current session.
Cline reports the error from `nim_scale_replicas` directly in the chat. You can see the failure reason and decide if you want to retry or manually intervene.
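
The metrics-based scaling decision mentioned in the first answer above can be sketched as follows. The parser handles only simple Prometheus text-format samples without timestamps, and the metric name `example_requests_waiting`, the per-replica target, and the replica cap are all hypothetical values chosen for illustration, not actual `nim_get_metrics` output.

```python
import math
from typing import Dict


def parse_prometheus_text(text: str) -> Dict[str, float]:
    """Parse samples of the form 'name{labels} value' (no timestamps)."""
    samples = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and HELP/TYPE comments
        name, _, value = line.rpartition(" ")
        samples[name] = float(value)
    return samples


def recommend_replicas(queue_depth: float, per_replica_target: float,
                       max_replicas: int) -> int:
    """Replicas needed to keep the queue at the target, clamped to [1, max]."""
    desired = math.ceil(queue_depth / per_replica_target)
    return max(1, min(max_replicas, desired))


# Made-up metrics payload for illustration.
raw = """\
# HELP example_requests_waiting Requests waiting in queue.
# TYPE example_requests_waiting gauge
example_requests_waiting 42
example_gpu_utilization{gpu="0"} 0.93
"""
metrics = parse_prometheus_text(raw)
replicas = recommend_replicas(metrics["example_requests_waiting"],
                              per_replica_target=10, max_replicas=8)
print(replicas)
```

The result is a suggested argument for a scaling call; the approval step in Cline still decides whether it is applied.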

Start using the NVIDIA NIM MCP today

We host it, we monitor it, we maintain it. You just paste one token.

  • Built & Managed by Vinkius
  • 30s setup
  • 8 tools

We've already built the connector for NVIDIA NIM. Just plug in your AI agents and start using Vinkius.

No hosting. No infrastructure. No complex setup.
All 8 tools are live and waiting. You're up and running in seconds.

  • Claude
  • ChatGPT
  • Cursor
  • Gemini
  • Windsurf
  • VS Code
  • JetBrains
  • Vercel
  • + other MCP clients

Vinkius gives your AI agents access to the full catalog of app connectors, all fully managed, secure, and enterprise-ready. One subscription, every tool you need.

  • Zero hosting required
  • Full MCP catalog included
  • Enterprise-grade security
  • Auto-updated by Vinkius

Built, hosted, and secured by Vinkius. You just connect and go.