4,500+ servers built on MCP Fusion
Vinkius
NVIDIA API Catalog logo
Vinkius
Claude Desktop logo

How to Use the NVIDIA API Catalog MCP in Claude

Get raw NVIDIA API Catalog model power straight inside Claude Desktop to run heavy LLM inference without writing code.

See Vinkius in Action

Works with every AI agent you already use

…and any MCP-compatible client

NVIDIA API Catalog MCP on Cursor AI Code Editor MCP Client NVIDIA API Catalog MCP on Claude Desktop App MCP Integration NVIDIA API Catalog MCP on OpenAI Agents SDK MCP Compatible NVIDIA API Catalog MCP on Visual Studio Code MCP Extension Client NVIDIA API Catalog MCP on GitHub Copilot AI Agent MCP Integration NVIDIA API Catalog MCP on Google Gemini AI MCP Integration NVIDIA API Catalog MCP on Lovable AI Development MCP Client NVIDIA API Catalog MCP on Mistral AI Agents MCP Compatible NVIDIA API Catalog MCP on Amazon AWS Bedrock MCP Support
MCP Servers - Free for Subscribers
Claude Desktop

Connect NVIDIA API Catalog MCP to Claude Desktop

Create your Vinkius account to connect NVIDIA API Catalog to Claude Desktop and route execution through our secure gateway. The platform manages server hosting, runtime updates, and security layers. Configuration requires no manual server provisioning.

GDPR Free for Subscribers

Run Llama3 and Nemotron Models in Claude Desktop

The NVIDIA API Catalog MCP server connects Claude Desktop directly to NVIDIA's hosted models through `nvidia_chat_completion` and `nvidia_list_foundation_models`. You get immediate access to Nemotron-4 and Llama-3-70b-Instruct right within your Claude Desktop chat sidebar, bypassing the usual API setup. Just ask Claude to run a prompt against a specific NVIDIA model path. The Claude Desktop client calls the tool, executes the inference on NVIDIA's GPU cloud, and drops the output straight into your current workspace.

Analyze Images Natively with Llama-Vision

The `nvidia_vision_inference` tool enables Claude Desktop to analyze image data using NVIDIA's hosted vision models. Drag a mockup or UI screenshot directly into your Claude chat to identify structural layout issues without running local PyTorch environments. Claude parses the visual layout, matches the elements against your design requirements, and writes the corrected layout code directly back into your chat history.

Track NVIDIA Token Quotas and Cloud Status

The `nvidia_check_token_quota` tool monitors your active developer account balance directly inside Claude Desktop via this MCP integration. You will know exactly when you are running low on credits before starting a massive inference run. If model responses seem sluggish, ask Claude to check endpoints using `nvidia_get_cloud_status`. It pings the NVIDIA endpoints to report real-time latencies right in your sidebar.

Setup guide

Set up NVIDIA API Catalog MCP in Claude Web or Desktop

  1. 1

    Open Claude Settings

    Go to claude.ai, click your profile icon, then navigate to Customize → Connectors.

  2. 2

    Add Custom Connector

    Click the "+" button and select Add custom connector. Paste your Vinkius endpoint URL: https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp Replace [YOUR_TOKEN_HERE] with your token from cloud.vinkius.com. For OAuth-protected servers, expand Advanced settings to add credentials.

  3. 3

    Start a conversation

    Open a new chat. The NVIDIA API Catalog MCP tools are available immediately — no restart needed.

Endpoint URL

https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp

No configuration file needed — paste the URL directly in the Claude web interface.

Available on Free (1 connector), Pro, Max, Team, and Enterprise plans.

Why Choose Vinkius

Vinkius connects your tools to AI with real-time monitoring and automatic cost savings — all from one dashboard.

Real-time monitoring

Live

visibility into every interaction

Connect your favorite tools to your AI and see exactly what's happening — every request, every response, in real time.

Built-in savings

60%

lower AI costs

Vinkius compresses data between your apps and your AI automatically. Lower bills every month — no configuration required.

Single dashboard

One

place for every integration

Every tool your AI connects to, managed from a single screen. One account, complete control.

Common questions about NVIDIA API Catalog MCP in Claude Desktop

Add the server configuration to your local `claude_desktop_config.json` file. Provide your NVIDIA API key as an environment variable, then restart the Claude Desktop app to initialize the tools.
Yes, Claude Desktop uses the `nvidia_vision_inference` tool to pass image payloads directly to NVIDIA's hosted vision models and displays the analysis in your chat.
The client uses `nvidia_check_token_quota` to query your remaining API credits. You can ask Claude to check your balance anytime to prevent unexpected inference interruptions.
Yes. The server exposes the `nvidia_list_lora_adapters` tool, which lets Claude Desktop discover and apply your fine-tuned overrides directly to active chat sessions.
All your prompt text, images, and embeddings processed by `nvidia_chat_completion` go directly to NVIDIA's API endpoints. Vinkius runs the MCP server in an isolated sandbox, meaning your raw inputs are never logged or stored locally on our platform.

Start using the NVIDIA API Catalog MCP today

We host it, we monitor it, we maintain it. You just paste one token.

Built & Managed by Vinkius 30s setup 8 tools

We've already built the connector for NVIDIA API Catalog. Just plug in your AI agents and start using Vinkius.

No hosting. No infrastructure. No complex setup.
All 8 tools are live and waiting. You're up and running in seconds.

Claude Claude
ChatGPT ChatGPT
Cursor Cursor
Gemini Gemini
Windsurf Windsurf
VS Code VS Code
JetBrains JetBrains
Vercel Vercel
+ other MCP clients

Vinkius gives your AI agents access to the full catalog of app connectors, all fully managed, secure, and enterprise-ready. One subscription, every tool you need.

Zero hosting required Full MCP catalog included Enterprise-grade security Auto-updated by Vinkius

Built, hosted, and secured by Vinkius. You just connect and go.