How to Use the NVIDIA API Catalog MCP in Claude
Get raw NVIDIA API Catalog model power straight inside Claude Desktop to run heavy LLM inference without writing code.
Works with every AI agent you already use
…and any MCP-compatible client
Connect NVIDIA API Catalog MCP to Claude Desktop
Create your Vinkius account to connect NVIDIA API Catalog to Claude Desktop and route execution through our secure gateway. The platform manages server hosting, runtime updates, and security layers. Configuration requires no manual server provisioning.
Run Llama3 and Nemotron Models in Claude Desktop
The NVIDIA API Catalog MCP server connects Claude Desktop directly to NVIDIA's hosted models through `nvidia_chat_completion` and `nvidia_list_foundation_models`. You get immediate access to Nemotron-4 and Llama-3-70b-Instruct right within your Claude Desktop chat sidebar, bypassing the usual API setup. Just ask Claude to run a prompt against a specific NVIDIA model path. The Claude Desktop client calls the tool, executes the inference on NVIDIA's GPU cloud, and drops the output straight into your current workspace.
Analyze Images Natively with Llama-Vision
The `nvidia_vision_inference` tool enables Claude Desktop to analyze image data using NVIDIA's hosted vision models. Drag a mockup or UI screenshot directly into your Claude chat to identify structural layout issues without running local PyTorch environments. Claude parses the visual layout, matches the elements against your design requirements, and writes the corrected layout code directly back into your chat history.
Track NVIDIA Token Quotas and Cloud Status
The `nvidia_check_token_quota` tool monitors your active developer account balance directly inside Claude Desktop via this MCP integration. You will know exactly when you are running low on credits before starting a massive inference run. If model responses seem sluggish, ask Claude to check endpoints using `nvidia_get_cloud_status`. It pings the NVIDIA endpoints to report real-time latencies right in your sidebar.
Set up NVIDIA API Catalog MCP in Claude Web or Desktop
- 1
Open Claude Settings
Go to claude.ai, click your profile icon, then navigate to Customize → Connectors.
- 2
Add Custom Connector
Click the "+" button and select Add custom connector. Paste your Vinkius endpoint URL:
https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcpReplace[YOUR_TOKEN_HERE]with your token from cloud.vinkius.com. For OAuth-protected servers, expand Advanced settings to add credentials. - 3
Start a conversation
Open a new chat. The NVIDIA API Catalog MCP tools are available immediately — no restart needed.
Endpoint URL
https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp No configuration file needed — paste the URL directly in the Claude web interface.
Available on Free (1 connector), Pro, Max, Team, and Enterprise plans.
Why Choose Vinkius
Vinkius connects your tools to AI with real-time monitoring and automatic cost savings — all from one dashboard.
Real-time monitoring
Live
visibility into every interaction
Connect your favorite tools to your AI and see exactly what's happening — every request, every response, in real time.
Built-in savings
60%
lower AI costs
Vinkius compresses data between your apps and your AI automatically. Lower bills every month — no configuration required.
Single dashboard
One
place for every integration
Every tool your AI connects to, managed from a single screen. One account, complete control.
Common questions about NVIDIA API Catalog MCP in Claude Desktop
Use it with your favorite AI tools
Connect this server to Cursor, Claude, VS Code, and more.
Start using the NVIDIA API Catalog MCP today
We host it, we monitor it, we maintain it. You just paste one token.