How to Use the NVIDIA API Catalog MCP in Windsurf
Connect the NVIDIA API Catalog to Windsurf and let Cascade run inference tasks directly from your codebase.
Works with every AI agent you already use
…and any MCP-compatible client
Connect NVIDIA API Catalog MCP to Windsurf
Create your Vinkius account to connect NVIDIA API Catalog to Windsurf and route execution through our secure gateway. The platform manages server hosting, runtime updates, and security layers. Configuration requires no manual server provisioning.
Execute model inference in Windsurf
Cascade pulls foundation models directly from the catalog to run completions. You trigger `nvidia_chat_completion` to process logic without leaving the IDE. Windsurf chains these calls to handle complex reasoning. It feeds the output into your active files so you see the code changes as they happen.
Monitor NVIDIA API Catalog status
Keep tabs on your usage limits and system health within your editor. You use `nvidia_check_token_quota` to prevent unexpected budget spikes during long sessions. `nvidia_get_cloud_status` pings the endpoints so you know the service is up before Cascade starts a multi-step task. It removes the guesswork from your workflow.
Run vision and summarization tools
Feed graphical data into your agent using `nvidia_vision_inference` to analyze UI mockups or diagrams. It processes the visual input and returns structured data for your project. Use `nvidia_summarize_content` when you need to condense large files or logs quickly. Windsurf pipes the results into your context window for immediate review.
Set up NVIDIA API Catalog MCP in Windsurf
Prerequisites
- Windsurf IDE installed (macOS, Windows, or Linux)
- Active Vinkius subscription with a valid endpoint token
- 1
Open MCP configuration
Click the Cascade assistant icon in the sidebar, then click the hammer icon (🔨) at the top of the panel. Select "Configure" to open
~/.codeium/windsurf/mcp_config.json. - 2
Add the NVIDIA API Catalog MCP
Paste the JSON snippet shown on the right into the
mcpServersobject. Replace[YOUR_TOKEN_HERE]with your endpoint token from cloud.vinkius.com. - 3
Refresh MCPs
Go back to the hammer icon (🔨) in Cascade and click "Refresh". Windsurf will detect the new server. No full restart is needed — the connection is hot-reloaded.
- 4
Verify in Cascade
Start a new Cascade conversation and ask something like "Show my NVIDIA API Catalog payment history." If connected, Cascade will call the NVIDIA API Catalog tools directly. You will see a green dot next to the server name in the MCP panel.
{
"mcpServers": {
"nvidia-api-catalog-mcp": {
"url": "https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp"
}
}
} Independent Platform Disclaimer: Vinkius is an independent platform and is not affiliated with, endorsed by, sponsored by, verified by, or otherwise authorized by NVIDIA API Catalog. All third-party trademarks, logos, and brand names are the property of their respective owners. Their use on this website is strictly for informational purposes to identify service compatibility and interoperability.
Why Choose Vinkius
Vinkius connects your tools to AI with real-time monitoring and automatic cost savings — all from one dashboard.
Real-time monitoring
Live
visibility into every interaction
Connect your favorite tools to your AI and see exactly what's happening — every request, every response, in real time.
Built-in savings
60%
lower AI costs
Vinkius compresses data between your apps and your AI automatically. Lower bills every month — no configuration required.
Single dashboard
One
place for every integration
Every tool your AI connects to, managed from a single screen. One account, complete control.
Common questions about NVIDIA API Catalog MCP in Windsurf
Use it with your favorite AI tools
Connect this server to Cursor, Claude, VS Code, and more.
Start using the NVIDIA API Catalog MCP today
We host it, we monitor it, we maintain it. You just paste one token.