4,500+ servers built on MCP Fusion
Vinkius
Baseten logo
Vinkius
CrewAI logo

How to Use the Baseten MCP in CrewAI

Delegate serverless model routing and deployment checks to autonomous agent teams using CrewAI and this MCP Server.

See Vinkius in Action

Works with every AI agent you already use

…and any MCP-compatible client

Baseten MCP on Cursor AI Code Editor MCP Client Baseten MCP on Claude Desktop App MCP Integration Baseten MCP on OpenAI Agents SDK MCP Compatible Baseten MCP on Visual Studio Code MCP Extension Client Baseten MCP on GitHub Copilot AI Agent MCP Integration Baseten MCP on Google Gemini AI MCP Integration Baseten MCP on Lovable AI Development MCP Client Baseten MCP on Mistral AI Agents MCP Compatible Baseten MCP on Amazon AWS Bedrock MCP Support
MCP Servers - Free for Subscribers
CrewAI

Connect Baseten MCP to CrewAI

Create your Vinkius account to connect Baseten to CrewAI and route execution through our secure gateway. The platform manages server hosting, runtime updates, and security layers. Configuration requires no manual server provisioning.

GDPR Free for Subscribers

Let CrewAI agents run serverless predictions

The `predict` tool lets your agents send input tensors directly to Baseten models for instant serverless inference. It handles formatting the payload to match the model's expected shape. In a CrewAI setup, a researcher agent gathers raw text, while a separate analyst agent uses this tool to run sentiment predictions. The crew passes the results between themselves to complete complex analytical pipelines.

Let agents search models using this MCP Server

The `list_models` tool retrieves all active machine learning models managed in your Baseten workspace. It returns their IDs, configurations, and current statuses. A CrewAI coordinator agent calls this tool to discover which models are available. It then matches the task requirements to the best model before handing the execution over to a specialized sub-agent.

Validate active deployments across your crew

The `get_deployment` tool fetches the exact runtime details of a specific Baseten model deployment. It tells your agent whether the endpoint is active, scaling, or idle. Before running a massive batch prediction, a monitor agent in your crew runs this check. If the deployment is cold, the agent waits or triggers a warm-up call to prevent the main crew from stalling.

Setup guide

Set up Baseten MCP in CrewAI

Prerequisites

  • Python 3.10+ installed
  • crewai package (pip install crewai)
  • Active Vinkius subscription with a valid endpoint token
  1. 1

    Install CrewAI

    Run pip install crewai to install the framework. MCP support is built-in via the mcps parameter.

  2. 2

    Add the MCP URL to your agent

    Pass your Vinkius endpoint directly to the mcps list. Replace [YOUR_TOKEN_HERE] with your token from cloud.vinkius.com. CrewAI handles tool discovery and caching automatically.

  3. 3

    Kick off your crew

    Create a Crew with your agent and tasks. Call crew.kickoff() — the agent will automatically invoke Baseten tools as needed.

crew.py
from crewai import Agent, Task, Crew

agent = Agent(
    role="Baseten Analyst",
    goal="Access and analyze Baseten data via MCP.",
    backstory="Expert analyst with direct Baseten access.",
    mcps=[
        "https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp"
    ],
)

task = Task(
    description="List recent Baseten transactions",
    agent=agent,
    expected_output="A summary of recent activity",
)

crew = Crew(agents=[agent], tasks=[task])
result = crew.kickoff()
print(result)

Why Choose Vinkius

Vinkius connects your tools to AI with real-time monitoring and automatic cost savings — all from one dashboard.

Real-time monitoring

Live

visibility into every interaction

Connect your favorite tools to your AI and see exactly what's happening — every request, every response, in real time.

Built-in savings

60%

lower AI costs

Vinkius compresses data between your apps and your AI automatically. Lower bills every month — no configuration required.

Single dashboard

One

place for every integration

Every tool your AI connects to, managed from a single screen. One account, complete control.

Common questions about Baseten MCP in CrewAI

Pass the Baseten MCP Server URL directly into the `mcps` list when initializing your CrewAI agent. The framework automatically registers tools like `predict` and `list_models`.
Yes, a manager agent can call `list_models` to find available options. It then instructs a worker agent to use `get_model` and run predictions on the chosen deployment.
If the `predict` tool returns an error, the CrewAI agent reads the traceback. The agent then rewrites the input tensor dictionary to match the model's expected schema and retries the call.
Yes, you can use the `MCPServerHTTP` class from `crewai.mcp` along with a `tool_filter` to only expose read tools like `list_models` while blocking prediction tools.
All communications between CrewAI and Baseten run through secure, authenticated HTTPS endpoints. The Vinkius MCP host uses ephemeral V8 sandboxes that do not store your tensor inputs, model configurations, or secret names.

Start using the Baseten MCP today

We host it, we monitor it, we maintain it. You just paste one token.

Built & Managed by Vinkius 30s setup 6 tools

We've already built the connector for Baseten. Just plug in your AI agents and start using Vinkius.

No hosting. No infrastructure. No complex setup.
All 6 tools are live and waiting. You're up and running in seconds.

Claude Claude
ChatGPT ChatGPT
Cursor Cursor
Gemini Gemini
Windsurf Windsurf
VS Code VS Code
JetBrains JetBrains
Vercel Vercel
+ other MCP clients

Vinkius gives your AI agents access to the full catalog of app connectors, all fully managed, secure, and enterprise-ready. One subscription, every tool you need.

Zero hosting required Full MCP catalog included Enterprise-grade security Auto-updated by Vinkius

Built, hosted, and secured by Vinkius. You just connect and go.