Baseten MCP Server for Mastra AI
6 tools, connect in under 2 minutes

Built by Vinkius | GDPR | 6 Tools | SDK

Mastra AI is a TypeScript-native agent framework built for modern web stacks. Connect Baseten through Vinkius and Mastra agents discover all tools automatically. The result is type-safe, streaming-ready, and deployable anywhere Node.js runs.

Vinkius supports streamable HTTP and SSE.

```typescript
import { Agent } from "@mastra/core/agent";
// Recent @mastra/mcp releases export the MCPClient class
import { MCPClient } from "@mastra/mcp";
import { openai } from "@ai-sdk/openai";

async function main() {
  // Your Vinkius token. Get it at cloud.vinkius.com
  const mcpClient = new MCPClient({
    servers: {
      baseten: {
        url: new URL("https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp"),
      },
    },
  });

  // Discover the tools exposed by the Baseten MCP server
  const tools = await mcpClient.getTools();
  const agent = new Agent({
    name: "Baseten Agent",
    instructions: "You help users interact with Baseten using 6 tools.",
    model: openai("gpt-4o"),
    tools,
  });

  const result = await agent.generate("What can I do with Baseten?");
  console.log(result.text);

  // Close the MCP connection so the process can exit
  await mcpClient.disconnect();
}

main().catch(console.error);
```
Fully Managed: Vinkius Servers
60%: Token savings
High Security: Enterprise-grade
IAM: Access control
EU AI Act: Compliant
DLP: Data protection
V8 Isolate: Sandboxed
Ed25519: Audit chain
<40ms: Kill switch
Stream every event to Splunk, Datadog, or your own webhook in real time.

* Every MCP server runs on Vinkius-managed infrastructure inside AWS: a purpose-built runtime with per-request V8 isolates, Ed25519-signed audit chains, and sub-40ms cold starts optimized for native MCP execution.

About Baseten MCP Server

Connect your Baseten account to any AI agent and track, deploy, and execute your machine learning models through natural conversation.

Mastra's agent abstraction provides a clean separation between LLM logic and Baseten tool infrastructure. Connect 6 tools through Vinkius and use Mastra's built-in workflow engine to chain tool calls with conditional logic, retries, and parallel execution. The resulting agent is deployable to any Node.js host in one command.

What you can do

  • Model Management: list managed models, fetch their configurations, and see how traffic is routed to them
  • Serverless Deployments: inspect replica states, autoscaling configurations, and deployment versions
  • Inference Execution: run predictions (predict) by sending tensor or JSON payloads directly to a deployed model
  • Workspace Secrets: list the names of the secrets configured in your workspace, without exposing their values

How it works

1. Subscribe to this server
2. Enter your Baseten API Key
3. Gain complete ML-Ops control over your active inference nodes using Claude, Cursor, or your preferred agent

Scale unified AI infrastructure without bouncing between terminal windows. Your agent becomes a capable Machine Learning Operator tracking your GPU lifecycle.

Who is it for?

  • ML Engineers: send test payloads to deployments without spinning up local Python notebooks
  • DevOps/SREs: audit running deployment resources and verify replica states from your IDE
  • AI Researchers: inspect deployment versions and model configurations quickly

The Baseten MCP Server exposes 6 tools through Vinkius. Connect it to Mastra AI in under two minutes, with no API keys to rotate, no infrastructure to provision, and no vendor lock-in. Your configuration, your data, your control.

How to Connect Baseten to Mastra AI via MCP

Follow these steps to integrate the Baseten MCP Server with Mastra AI.

01

Install dependencies

Run npm install @mastra/core @mastra/mcp @ai-sdk/openai

02

Replace the token

Replace [YOUR_TOKEN_HERE] with your Vinkius token

03

Run the agent

Save to agent.ts and run with npx tsx agent.ts

04

Explore tools

Mastra discovers 6 tools from Baseten via MCP
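
Pasting the token straight into agent.ts (step 02) makes it easy to commit by accident. The following is a minimal sketch of an alternative, assuming the token is kept in an environment variable. The helper name buildVinkiusUrl and the variable name VINKIUS_TOKEN are hypothetical; only the URL shape comes from the quick-start snippet.

```typescript
// Hypothetical helper: builds the Vinkius MCP endpoint string from a token.
// The URL shape is copied from the quick-start snippet; everything else is an assumption.
function buildVinkiusUrl(token: string | undefined): string {
  const trimmed = token ? token.trim() : "";
  if (trimmed === "" || trimmed === "[YOUR_TOKEN_HERE]") {
    // Fail early instead of sending requests with a placeholder token
    throw new Error("Set a real Vinkius token before running the agent");
  }
  return "https://edge.vinkius.com/" + encodeURIComponent(trimmed) + "/mcp";
}

// Usage (assumed env var name):
// const url = buildVinkiusUrl(process.env.VINKIUS_TOKEN);
```

The returned string can then be used wherever the quick-start hard-codes the endpoint.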

Why Use Mastra AI with the Baseten MCP Server

Mastra AI provides unique advantages when paired with Baseten through the Model Context Protocol.

01

Mastra's agent abstraction provides a clean separation between LLM logic and tool infrastructure, so you can add Baseten without touching business code

02

Built-in workflow engine chains MCP tool calls with conditional logic, retries, and parallel execution for complex automation

03

TypeScript-native: full type inference for every Baseten tool response with IDE autocomplete and compile-time checks

04

One-command deployment to any Node.js host: Vercel, Railway, Fly.io, or your own infrastructure
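
Point 02 mentions retries and parallel execution. To make the idea concrete, here is a plain TypeScript sketch of that pattern. It is not Mastra's workflow API: withRetry, collectInventory, and the two stub functions are hypothetical stand-ins for MCP tool calls such as list_models and list_secrets.

```typescript
// Plain TypeScript sketch of "retries + parallel execution".
// NOT Mastra's workflow API; all names here are hypothetical.
async function withRetry<T>(
  task: () => Promise<T>,
  maxAttempts = 3,
  delayMs = 100,
): Promise<T> {
  let lastError: unknown;
  for (let attempt = 1; attempt <= maxAttempts; attempt++) {
    try {
      return await task();
    } catch (error) {
      lastError = error;
      if (attempt < maxAttempts) {
        // Linear backoff between attempts
        await new Promise<void>((resolve) => setTimeout(resolve, delayMs * attempt));
      }
    }
  }
  throw lastError;
}

// Stand-in for a tool call that fails once before succeeding.
let flakyCalls = 0;
async function listModelsStub(): Promise<string[]> {
  flakyCalls += 1;
  if (flakyCalls < 2) throw new Error("transient failure");
  return ["model-a", "model-b"]; // placeholder data
}

// Stand-in for a tool call that always succeeds.
async function listSecretsStub(): Promise<string[]> {
  return ["SECRET_NAME"]; // placeholder data
}

// Run both calls in parallel, each with its own retry budget.
async function collectInventory(): Promise<{ models: string[]; secrets: string[] }> {
  const [models, secrets] = await Promise.all([
    withRetry(listModelsStub, 3, 10),
    withRetry(listSecretsStub, 3, 10),
  ]);
  return { models, secrets };
}
```

In a real agent the stubs would be replaced by calls to the discovered tools; a workflow engine adds persistence and branching on top of this basic shape.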

Baseten + Mastra AI Use Cases

Practical scenarios where Mastra AI combined with the Baseten MCP Server delivers measurable value.

01

Automated workflows: build multi-step agents that query Baseten, process results, and trigger downstream actions in a typed pipeline

02

SaaS integrations: embed Baseten as a first-class tool in your product's AI features with Mastra's clean agent API

03

Background jobs: schedule Mastra agents to query Baseten on a cron and store results in your database automatically

04

Multi-agent systems: create specialist agents that collaborate using Baseten tools alongside other MCP servers

Baseten MCP Tools for Mastra AI (6)

These 6 tools become available when you connect Baseten to Mastra AI via MCP:

01

get_deployment

Get the details of a running deployment

02

get_model

Get a specific Baseten model

03

list_deployments

List the active deployments of a specific model

04

list_models

List Baseten managed models

05

list_secrets

List securely managed workspace secrets without showing values

06

predict

Invoke a serverless model inference prediction. The input tensors or dictionaries must match exactly what the deployed model expects
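
The six tool names above can be captured as a typed constant, which helps when you want to restrict what an agent may call. The sketch below is illustrative: the helper names are hypothetical, and treating every tool except predict as read-only is an assumption inferred from the descriptions, not something the server documents.

```typescript
// Tool names as listed in this section.
const BASETEN_TOOLS = [
  "get_deployment",
  "get_model",
  "list_deployments",
  "list_models",
  "list_secrets",
  "predict",
] as const;

type BasetenToolName = (typeof BASETEN_TOOLS)[number];

// Type guard: narrows an arbitrary string to a known tool name.
function isBasetenTool(name: string): name is BasetenToolName {
  return (BASETEN_TOOLS as readonly string[]).indexOf(name) !== -1;
}

// Assumption: predict is the only tool that executes work; the rest only read state.
function isReadOnlyTool(name: BasetenToolName): boolean {
  return name !== "predict";
}

// Hypothetical use: keep only the read-only tools from a discovered tool map.
function pickReadOnly<T>(tools: Record<string, T>): Record<string, T> {
  const picked: Record<string, T> = {};
  for (const name of Object.keys(tools)) {
    if (isBasetenTool(name) && isReadOnlyTool(name)) {
      picked[name] = tools[name];
    }
  }
  return picked;
}
```

If your MCP client namespaces tool keys (for example with a server-name prefix), strip the prefix before matching against these names.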

Example Prompts for Baseten in Mastra AI

Ready-to-use prompts you can give your Mastra AI agent to start working with Baseten immediately.

01

"List standard machine learning models we currently host on Baseten."

02

"Run a prediction against the Sentiment model ID 12345 using this text input: 'The new feature completely broke my workflow.'"

03

"Check if our Baseten project has a secret scoped as 'OPENAI_API_KEY_FALLBACK'."

Troubleshooting Baseten MCP Server with Mastra AI

Common issues when connecting Baseten to Mastra AI through Vinkius, and how to resolve them.

01

createMCPClient not exported

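Install the package: npm install @mastra/mcp. If the error persists, check which client your installed version exports; recent releases expose an MCPClient class.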

Baseten + Mastra AI FAQ

Common questions about integrating Baseten MCP Server with Mastra AI.

01

How does Mastra AI connect to MCP servers?

Create an MCPClient with the server URL, call getTools(), and pass the returned tools to your agent. Mastra discovers all tools and makes them available with full TypeScript types.

02

Can Mastra agents use tools from multiple servers?

Yes. Register several servers in the client's servers map and pass the combined tools to the agent. Mastra merges all tool schemas and the agent can call any tool from any server.

03

Does Mastra support workflow orchestration?

Yes. Mastra has a built-in workflow engine that lets you chain MCP tool calls with branching logic, error handling, and parallel execution.

Connect Baseten to Mastra AI

Get your token, paste the configuration, and start using 6 tools in under 2 minutes. No API key management needed.