How to Use the Modal (Serverless AI Infrastructure) MCP in Claude
Spin up and monitor serverless GPU deployments on Modal right from your Claude Desktop chat interface.
Works with every AI agent you already use
…and any MCP-compatible client
Connect Modal (Serverless AI Infrastructure) MCP to Claude Desktop
Create your Vinkius account to connect Modal (Serverless AI Infrastructure) to Claude Desktop and route execution through our secure gateway. The platform manages server hosting, runtime updates, and security layers. Configuration requires no manual server provisioning.
Audit Modal Serverless Apps directly in Claude Desktop
The `list_apps` tool fetches your active and historical serverless application runs directly inside your Claude Desktop window. You don't have to jump to the web console or run CLI commands to see what's currently executing on Modal's infrastructure. If a run hangs or consumes too many resources, your agent can use `stop_app` to kill the execution immediately. This direct control keeps your compute bills predictable without requiring you to switch windows.
Inspect deployments and track GPU configurations
The `list_deployments` tool retrieves your active, promoted production deployments from the platform. Your agent reads this list to verify which models are live and what endpoints are exposed. When you need deep technical details, `get_deployment` pulls the exact configuration of a specific run. Claude Desktop displays these raw specifications so you can verify GPU allocations or cold-start settings on the fly.
Audit storage and secrets using your MCP Server
The `list_volumes` tool scans your persisted network block volumes to verify disk mounts and storage allocations. Your agent matches these volumes to your serverless containers to make sure your model weights are mapped correctly. For environment configurations, `list_secrets` pulls the names of your active secret dictionaries. Claude Desktop displays these references without exposing the raw values, keeping your production credentials safe while debugging.
Set up Modal (Serverless AI Infrastructure) MCP in Claude Web or Desktop
- 1
Open Claude Settings
Go to claude.ai, click your profile icon, then navigate to Customize → Connectors.
- 2
Add Custom Connector
Click the "+" button and select Add custom connector. Paste your Vinkius endpoint URL:
https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcpReplace[YOUR_TOKEN_HERE]with your token from cloud.vinkius.com. For OAuth-protected servers, expand Advanced settings to add credentials. - 3
Start a conversation
Open a new chat. The Modal (Serverless AI Infrastructure) MCP tools are available immediately — no restart needed.
Endpoint URL
https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp No configuration file needed — paste the URL directly in the Claude web interface.
Available on Free (1 connector), Pro, Max, Team, and Enterprise plans.
Why Choose Vinkius
Vinkius connects your tools to AI with real-time monitoring and automatic cost savings — all from one dashboard.
Real-time monitoring
Live
visibility into every interaction
Connect your favorite tools to your AI and see exactly what's happening — every request, every response, in real time.
Built-in savings
60%
lower AI costs
Vinkius compresses data between your apps and your AI automatically. Lower bills every month — no configuration required.
Single dashboard
One
place for every integration
Every tool your AI connects to, managed from a single screen. One account, complete control.
Common questions about Modal (Serverless AI Infrastructure) MCP in Claude Desktop
Use it with your favorite AI tools
Connect this server to Cursor, Claude, VS Code, and more.
Start using the Modal (Serverless AI Infrastructure) MCP today
We host it, we monitor it, we maintain it. You just paste one token.