Cartesia (Voice AI) MCP Server for CursorGive Cursor instant access to 20 tools to Clone Voice, Create Pronunciation Dict, Delete Pronunciation Dict, and more
Cursor is an AI-first code editor built on VS Code that integrates LLM-powered coding assistance directly into the development workflow. Its Agent mode enables autonomous multi-step coding tasks, and MCP support lets agents access external data sources and APIs during code generation.
Ask AI about this MCP Server for Cursor
The Cartesia (Voice AI) MCP Server for Cursor is a standout in the Ai Frontier category — giving your AI agent 20 tools to work with, ready to go from day one.
Vinkius delivers Streamable HTTP and SSE to any MCP client
{
"mcpServers": {
"cartesia-voice-ai": {
"url": "https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp"
}
}
}Vinkius Desktop App
The modern way to manage MCP Servers — no config files, no terminal commands. Install Cartesia (Voice AI) and 4,000+ MCP Servers from a single visual interface.





* Every MCP server runs on Vinkius-managed infrastructure inside AWS - a purpose-built runtime with per-request V8 isolates, Ed25519 signed audit chains, and sub-40ms cold starts optimized for native MCP execution. See our infrastructure
About Cartesia (Voice AI) MCP Server
Connect Cartesia to your AI agent to unlock high-performance voice synthesis and speech recognition. Cartesia's Sonic models provide industry-leading latency and quality for real-time applications.
Cursor's Agent mode turns Cartesia (Voice AI) into an in-editor superpower. Ask Cursor to generate code using live data from Cartesia (Voice AI) and it fetches, processes, and writes. all in a single agentic loop. 20 tools appear alongside file editing and terminal access, creating a unified development environment grounded in real-time information.
What you can do
- Text-to-Speech (TTS) — Generate high-fidelity audio bytes or stream via SSE using models like Sonic 3.5 and Sonic 3.
- Speech-to-Text (STT) — Transcribe audio files into text using the Ink Whisper model with multi-language support.
- Voice Cloning — Create custom voice models from as little as 5 seconds of audio input.
- Voice Management — List, retrieve, and update voices, or use the Voice Changer to transform existing audio.
- Pronunciation Control — Manage custom pronunciation dictionaries for specialized terminology or accents.
- Agent Orchestration — List and manage AI agents and monitor call logs and usage credits.
The Cartesia (Voice AI) MCP Server exposes 20 tools through the Vinkius. Connect it to Cursor in under two minutes — credentials fully managed, no infrastructure to provision, no vendor lock-in. Your configuration, your data, your control.
All 20 Cartesia (Voice AI) tools available for Cursor
When Cursor connects to Cartesia (Voice AI) through Vinkius, your AI agent gets direct access to every tool listed below — spanning text-to-speech, speech-to-text, voice-synthesis, and more. Every call runs in a secure, isolated environment with full audit visibility. Beyond a simple connection, you get real-time monitoring of agent activity, enterprise governance, and optimized token usage.
Clone voice on Cartesia (Voice AI)
Clone a voice from a 5s audio clip
Create pronunciation dict on Cartesia (Voice AI)
Create a new pronunciation dictionary
Delete pronunciation dict on Cartesia (Voice AI)
Delete a pronunciation dictionary
Delete voice on Cartesia (Voice AI)
Delete a voice
Generate access token on Cartesia (Voice AI)
Generate a short-lived access token for client-side requests
Get agent on Cartesia (Voice AI)
Get details for a specific voice agent
Get usage credits on Cartesia (Voice AI)
Get credit usage statistics
Get voice on Cartesia (Voice AI)
Get details for a specific voice
Infill bytes on Cartesia (Voice AI)
Generate audio to smoothly connect two existing segments
List agent calls on Cartesia (Voice AI)
List calls and transcripts for a specific agent
List agents on Cartesia (Voice AI)
List all voice agents
List pronunciation dicts on Cartesia (Voice AI)
List pronunciation dictionaries
List voices on Cartesia (Voice AI)
List available voices
Localize voice on Cartesia (Voice AI)
Adapt a voice to a new language/dialect
Stt batch on Cartesia (Voice AI)
Transcribe audio file to text (Batch STT)
Tts bytes on Cartesia (Voice AI)
Generate text-to-speech audio bytes
Tts sse on Cartesia (Voice AI)
Generate text-to-speech via Server-Sent Events
Update pronunciation dict on Cartesia (Voice AI)
Update a pronunciation dictionary
Update voice on Cartesia (Voice AI)
Update voice metadata
Voice changer bytes on Cartesia (Voice AI)
Change voice of an audio clip while preserving intonation
Connect Cartesia (Voice AI) to Cursor via MCP
Follow these steps to wire Cartesia (Voice AI) into Cursor. The entire setup takes under two minutes — your credentials stay safe behind Vinkius.
Open MCP Settings
Cmd+Shift+P (macOS) or Ctrl+Shift+P (Windows/Linux) → search "MCP Settings"Add the server config
mcp.json file that opensSave the file
Start using Cartesia (Voice AI)
Why Use Cursor with the Cartesia (Voice AI) MCP Server
Cursor AI Code Editor provides unique advantages when paired with Cartesia (Voice AI) through the Model Context Protocol.
Agent mode turns Cursor into an autonomous coding assistant that can read files, run commands, and call MCP tools without switching context
Cursor's Composer feature can generate entire files using real-time data fetched through MCP. no copy-pasting from external dashboards
MCP tools appear alongside built-in tools like file reading and terminal access, creating a unified agentic environment
VS Code extension compatibility means your existing workflow, keybindings, and extensions all work alongside MCP tools
Cartesia (Voice AI) + Cursor Use Cases
Practical scenarios where Cursor combined with the Cartesia (Voice AI) MCP Server delivers measurable value.
Code generation with live data: ask Cursor to generate a security report module using live DNS and subdomain data fetched through MCP
Automated documentation: have Cursor query your API's tool schemas and generate TypeScript interfaces or OpenAPI specs automatically
Infrastructure-as-code: Cursor can fetch domain configurations and generate corresponding Terraform or CloudFormation templates
Test scaffolding: ask Cursor to pull real API responses via MCP and generate unit test fixtures from actual data
Example Prompts for Cartesia (Voice AI) in Cursor
Ready-to-use prompts you can give your Cursor agent to start working with Cartesia (Voice AI) immediately.
"List all available voices in my Cartesia account."
"Generate a WAV audio file saying 'Welcome to the future of AI' using voice ID 79a045e3-a621-4923-b05c-8029db0dffca."
"Check my current usage credits on Cartesia."
Troubleshooting Cartesia (Voice AI) MCP Server with Cursor
Common issues when connecting Cartesia (Voice AI) to Cursor through Vinkius, and how to resolve them.
Tools not appearing in Cursor
Server shows as disconnected
Cartesia (Voice AI) + Cursor FAQ
Common questions about integrating Cartesia (Voice AI) MCP Server with Cursor.
What is Agent mode and why does it matter for MCP?
Where does Cursor store MCP configuration?
mcp.json file. You can configure servers at the project level (.cursor/mcp.json in your project root) or globally (~/.cursor/mcp.json). Project-level configs take precedence.Can Cursor use MCP tools in inline edits?
How do I verify MCP tools are loaded?
Explore More MCP Servers
View all →
DEV.to
38 toolsManage your DEV.to presence — publish articles, fetch latest posts, and update content directly from your AI agent.

Tana
10 toolsConnect your AI to Tana. Build intelligent knowledge graphs, define supertags, and capture dynamic nested nodes directly from the prompt.

AT&T IoT
10 toolsIoT Control Center -- Manage SIM devices, activation, data pools, shared plans, and connectivity diagnostics via AT&T IoT API.

NewsAPI
10 toolsSearch breaking news and historical articles from 150,000+ sources via NewsAPI.org.
