Cartesia (Voice AI) MCP Server for Vercel AI SDK
Give Vercel AI SDK instant access to 20 tools to Clone Voice, Create Pronunciation Dict, Delete Pronunciation Dict, and more
The Vercel AI SDK is the TypeScript toolkit for building AI-powered applications. Connect Cartesia (Voice AI) through Vinkius and every tool is available as a typed function, ready for React Server Components, API routes, or any Node.js backend.
The Cartesia (Voice AI) MCP Server for Vercel AI SDK belongs to the AI Frontier category and gives your AI agent 20 tools to work with from day one.
Vinkius delivers Streamable HTTP and SSE to any MCP client
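import { createMCPClient } from "@ai-sdk/mcp";
import { generateText, stepCountIs } from "ai";
import { openai } from "@ai-sdk/openai";

async function main() {
  // Connect to the Vinkius-hosted MCP server over HTTP.
  const mcpClient = await createMCPClient({
    transport: {
      type: "http",
      // Your Vinkius token. Get it at cloud.vinkius.com
      url: "https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp",
    },
  });

  try {
    // Discover every tool the server exposes.
    const tools = await mcpClient.tools();

    const { text } = await generateText({
      model: openai("gpt-4o"),
      tools,
      // Allow follow-up steps so the model can answer in text after a tool call.
      stopWhen: stepCountIs(5),
      prompt: "Using Cartesia (Voice AI), list all available capabilities.",
    });
    console.log(text);
  } finally {
    // Always release the MCP connection, even if generation fails.
    await mcpClient.close();
  }
}

main().catch(console.error);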
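The example hard-codes the token into the URL. A minimal sketch of building that URL from a value supplied at runtime is shown below. The helper name `buildVinkiusUrl` is hypothetical, and URL-encoding the token is a defensive assumption rather than a documented Vinkius requirement; only the `https://edge.vinkius.com/<token>/mcp` pattern comes from the example.

```typescript
// Base URL taken from the example above; the token segment is the only variable part.
const VINKIUS_EDGE = "https://edge.vinkius.com";

/**
 * Build the MCP endpoint URL for a Vinkius token (hypothetical helper).
 * Throws if the token is missing or is still the documentation placeholder.
 */
function buildVinkiusUrl(token: string | undefined): string {
  const trimmed = (token ?? "").trim();
  if (trimmed === "" || trimmed === "[YOUR_TOKEN_HERE]") {
    throw new Error("Missing Vinkius token: set it before connecting.");
  }
  // Assumption: encode the token so unexpected characters cannot alter the path.
  return `${VINKIUS_EDGE}/${encodeURIComponent(trimmed)}/mcp`;
}
```

In the script, the result could be passed as the transport `url`, for example `buildVinkiusUrl(process.env.VINKIUS_TOKEN)`, where `VINKIUS_TOKEN` is an environment variable name chosen here for illustration.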
* Every MCP server runs on Vinkius-managed infrastructure inside AWS - a purpose-built runtime with per-request V8 isolates, Ed25519 signed audit chains, and sub-40ms cold starts optimized for native MCP execution.
About Cartesia (Voice AI) MCP Server
Connect Cartesia to your AI agent to unlock high-performance voice synthesis and speech recognition. Cartesia's Sonic models provide industry-leading latency and quality for real-time applications.
The Vercel AI SDK gives every Cartesia (Voice AI) tool full TypeScript type inference, IDE autocomplete, and compile-time error checking. Connect 20 tools through Vinkius and stream results progressively to React, Svelte, or Vue components. The SDK works on Edge Functions, Cloudflare Workers, and any Node.js runtime.
What you can do
- Text-to-Speech (TTS) — Generate high-fidelity audio bytes or stream via SSE using models like Sonic 3.5 and Sonic 3.
- Speech-to-Text (STT) — Transcribe audio files into text using the Ink Whisper model with multi-language support.
- Voice Cloning — Create custom voice models from as little as 5 seconds of audio input.
- Voice Management — List, retrieve, and update voices, or use the Voice Changer to transform existing audio.
- Pronunciation Control — Manage custom pronunciation dictionaries for specialized terminology or accents.
- Agent Orchestration — List and manage AI agents and monitor call logs and usage credits.
The Cartesia (Voice AI) MCP Server exposes 20 tools through Vinkius. Connect it to Vercel AI SDK in under two minutes: credentials are fully managed, with no infrastructure to provision and no vendor lock-in. Your configuration, your data, your control.
All 20 Cartesia (Voice AI) tools available for Vercel AI SDK
When Vercel AI SDK connects to Cartesia (Voice AI) through Vinkius, your AI agent gets direct access to every tool listed below — spanning text-to-speech, speech-to-text, voice-synthesis, and more. Every call runs in a secure, isolated environment with full audit visibility. Beyond a simple connection, you get real-time monitoring of agent activity, enterprise governance, and optimized token usage.
- Clone voice: Clone a voice from a 5-second audio clip
- Create pronunciation dict: Create a new pronunciation dictionary
- Delete pronunciation dict: Delete a pronunciation dictionary
- Delete voice: Delete a voice
- Generate access token: Generate a short-lived access token for client-side requests
- Get agent: Get details for a specific voice agent
- Get usage credits: Get credit usage statistics
- Get voice: Get details for a specific voice
- Infill bytes: Generate audio to smoothly connect two existing segments
- List agent calls: List calls and transcripts for a specific agent
- List agents: List all voice agents
- List pronunciation dicts: List pronunciation dictionaries
- List voices: List available voices
- Localize voice: Adapt a voice to a new language/dialect
- STT batch: Transcribe audio file to text (Batch STT)
- TTS bytes: Generate text-to-speech audio bytes
- TTS SSE: Generate text-to-speech via Server-Sent Events
- Update pronunciation dict: Update a pronunciation dictionary
- Update voice: Update voice metadata
- Voice changer bytes: Change voice of an audio clip while preserving intonation
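Several of the tools above delete or modify data, so an agent may not need all 20. Since `mcpClient.tools()` returns an object keyed by tool name, one possible approach is to filter that object before handing it to the model. The sketch below assumes snake_case names such as `list_voices`; these identifiers are guesses for illustration, and the real ones should be read from `Object.keys(tools)`.

```typescript
/**
 * Return a copy of `tools` containing only the allowed names.
 * Names that are not present are skipped and reported in `missing`.
 */
function pickTools<T>(
  tools: Record<string, T>,
  allowed: readonly string[],
): { selected: Record<string, T>; missing: string[] } {
  const selected: Record<string, T> = {};
  const missing: string[] = [];
  for (const name of allowed) {
    if (Object.prototype.hasOwnProperty.call(tools, name)) {
      selected[name] = tools[name] as T;
    } else {
      missing.push(name);
    }
  }
  return { selected, missing };
}

// Hypothetical read-only subset; verify the names against the connected server.
const READ_ONLY_TOOLS = ["list_voices", "get_voice", "get_usage_credits"];
```

Passing `pickTools(tools, READ_ONLY_TOOLS).selected` as the `tools` option would keep destructive operations out of the model's reach, and a non-empty `missing` array signals that a guessed name does not match the server.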
Connect Cartesia (Voice AI) to Vercel AI SDK via MCP
Follow these steps to wire Cartesia (Voice AI) into Vercel AI SDK. The entire setup takes under two minutes — your credentials stay safe behind Vinkius.
Install dependencies
Run npm install @ai-sdk/mcp ai @ai-sdk/openai
Replace the token
Replace [YOUR_TOKEN_HERE] with your Vinkius token
Run the script
Save the code to agent.ts and run it with npx tsx agent.ts
Explore tools
Why Use Vercel AI SDK with the Cartesia (Voice AI) MCP Server
Vercel AI SDK provides unique advantages when paired with Cartesia (Voice AI) through the Model Context Protocol.
- TypeScript-first: every MCP tool gets full type inference, IDE autocomplete, and compile-time error checking out of the box
- Framework-agnostic: the core works with Next.js, Nuxt, SvelteKit, or any Node.js runtime, so the same Cartesia (Voice AI) integration runs everywhere
- Built-in streaming: UI primitives let you display Cartesia (Voice AI) tool results progressively in React, Svelte, or Vue components
- Edge-compatible: the AI SDK runs on Vercel Edge Functions, Cloudflare Workers, and other edge runtimes for minimal latency
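Progressive display comes down to consuming text chunks as they arrive. In the AI SDK, the result of `streamText` exposes a `textStream` that can be iterated with `for await`. The sketch below shows that consumption pattern in isolation; `renderProgressively` and `fakeStream` are hypothetical names, and the fake stream stands in for a real model call.

```typescript
/**
 * Consume a stream of text chunks, calling `onChunk` with the text
 * accumulated so far after each chunk, and return the full text.
 */
async function renderProgressively(
  chunks: AsyncIterable<string>,
  onChunk: (textSoFar: string) => void,
): Promise<string> {
  let text = "";
  for await (const chunk of chunks) {
    text += chunk;
    onChunk(text);
  }
  return text;
}

// Stand-in stream for trying the helper without a model call.
async function* fakeStream(): AsyncGenerator<string> {
  yield "Found ";
  yield "3 voices.";
}
```

With a real call, the first argument would be the `textStream` of a `streamText` result, and `onChunk` would update component state or write to the response.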
Cartesia (Voice AI) + Vercel AI SDK Use Cases
Practical scenarios where Vercel AI SDK combined with the Cartesia (Voice AI) MCP Server delivers measurable value.
- AI-powered web apps: build dashboards that query Cartesia (Voice AI) in real time and stream results to the UI as they arrive
- API backends: create serverless endpoints that orchestrate Cartesia (Voice AI) tools and return structured JSON responses to any frontend
- Chatbots with tool use: embed Cartesia (Voice AI) capabilities into conversational interfaces with streaming responses and tool call visibility
- Internal tools: build admin panels where team members interact with Cartesia (Voice AI) through natural language queries
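For the API backend case, the core of an endpoint can be kept independent of any framework: validate the request body, run the agent, and return a status plus a JSON-serializable body. The sketch below is one way to structure this under stated assumptions. The names `handlePrompt`, `AgentRunner`, and `ApiResult`, the `prompt` field, and the 400/502 status choices are all illustrative, not part of the AI SDK or Vinkius.

```typescript
// The agent call is injected so the handler can be exercised without a model.
type AgentRunner = (prompt: string) => Promise<string>;

interface ApiResult {
  status: number;
  body: { text: string } | { error: string };
}

/** Validate a parsed JSON body and run the agent with its `prompt` field. */
async function handlePrompt(body: unknown, runAgent: AgentRunner): Promise<ApiResult> {
  const prompt =
    typeof body === "object" && body !== null
      ? (body as { prompt?: unknown }).prompt
      : undefined;
  if (typeof prompt !== "string" || prompt.trim() === "") {
    return { status: 400, body: { error: "Expected a non-empty string field `prompt`." } };
  }
  try {
    return { status: 200, body: { text: await runAgent(prompt) } };
  } catch (err) {
    // Treat agent failures as upstream errors rather than crashing the endpoint.
    const message = err instanceof Error ? err.message : String(err);
    return { status: 502, body: { error: message } };
  }
}
```

A route handler in Next.js, Nuxt, or SvelteKit would parse the incoming JSON, call `handlePrompt` with a runner that wraps `generateText`, and serialize the returned `body` with the given `status`.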
Example Prompts for Cartesia (Voice AI) in Vercel AI SDK
Ready-to-use prompts you can give your Vercel AI SDK agent to start working with Cartesia (Voice AI) immediately.
"List all available voices in my Cartesia account."
"Generate a WAV audio file saying 'Welcome to the future of AI' using voice ID 79a045e3-a621-4923-b05c-8029db0dffca."
"Check my current usage credits on Cartesia."
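When prompts like these are generated in code rather than typed by hand, it can help to validate inputs first. The sketch below builds the text-to-speech prompt from a sentence and a voice ID. It assumes voice IDs follow the UUID layout seen in the example above, which may not hold for every voice; `buildTtsPrompt` is a hypothetical helper.

```typescript
// Matches the 8-4-4-4-12 hexadecimal layout of the voice ID in the example prompt.
const UUID_PATTERN = /^[0-9a-f]{8}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{12}$/i;

/** Build a text-to-speech prompt, rejecting voice IDs that are not UUID-shaped. */
function buildTtsPrompt(sentence: string, voiceId: string): string {
  if (!UUID_PATTERN.test(voiceId)) {
    throw new Error(`Voice ID does not look like a UUID: ${voiceId}`);
  }
  // JSON.stringify wraps the sentence in quotes and escapes any embedded quotes.
  return `Generate a WAV audio file saying ${JSON.stringify(sentence)} using voice ID ${voiceId}.`;
}
```

The returned string can be passed as the `prompt` option of `generateText`.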
Troubleshooting Cartesia (Voice AI) MCP Server with Vercel AI SDK
Common issues when connecting Cartesia (Voice AI) to Vercel AI SDK through Vinkius, and how to resolve them.
createMCPClient is not a function
Install the MCP package: npm install @ai-sdk/mcp
Cartesia (Voice AI) + Vercel AI SDK FAQ
Common questions about integrating Cartesia (Voice AI) MCP Server with Vercel AI SDK.
How does the Vercel AI SDK connect to MCP servers?
Import createMCPClient from @ai-sdk/mcp and pass the server URL. The SDK discovers all tools and provides typed TypeScript interfaces for each one.
Can I use MCP tools in Edge Functions?
Does it support streaming tool results?
Yes. The AI SDK provides useChat and streamText, which handle tool calls and display results progressively in the UI.