Cartesia (Voice AI) MCP Server for Google ADKGive Google ADK instant access to 20 tools to Clone Voice, Create Pronunciation Dict, Delete Pronunciation Dict, and more
Google Agent Development Kit (ADK) is Google's framework for building production AI agents. Add Cartesia (Voice AI) as an MCP tool provider through Vinkius and your ADK agents can call every tool with full schema introspection.
Ask AI about this MCP Server for Google ADK
The Cartesia (Voice AI) MCP Server for Google ADK is a standout in the Ai Frontier category — giving your AI agent 20 tools to work with, ready to go from day one.
Vinkius delivers Streamable HTTP and SSE to any MCP client
from google.adk.agents import Agent
from google.adk.tools.mcp_tool import McpToolset
from google.adk.tools.mcp_tool.mcp_session_manager import (
StreamableHTTPConnectionParams,
)
# Your Vinkius token. get it at cloud.vinkius.com
mcp_tools = McpToolset(
connection_params=StreamableHTTPConnectionParams(
url="https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp",
)
)
agent = Agent(
model="gemini-2.5-pro",
name="cartesia_voice_ai_agent",
instruction=(
"You help users interact with Cartesia (Voice AI) "
"using 20 available tools."
),
tools=[mcp_tools],
)
* Every MCP server runs on Vinkius-managed infrastructure inside AWS - a purpose-built runtime with per-request V8 isolates, Ed25519 signed audit chains, and sub-40ms cold starts optimized for native MCP execution. See our infrastructure
About Cartesia (Voice AI) MCP Server
Connect Cartesia to your AI agent to unlock high-performance voice synthesis and speech recognition. Cartesia's Sonic models provide industry-leading latency and quality for real-time applications.
Google ADK natively supports Cartesia (Voice AI) as an MCP tool provider. declare Vinkius Edge URL and the framework handles discovery, validation, and execution automatically. Combine 20 tools with Gemini's long-context reasoning for complex multi-tool workflows, with production-ready session management and evaluation built in.
What you can do
- Text-to-Speech (TTS) — Generate high-fidelity audio bytes or stream via SSE using models like Sonic 3.5 and Sonic 3.
- Speech-to-Text (STT) — Transcribe audio files into text using the Ink Whisper model with multi-language support.
- Voice Cloning — Create custom voice models from as little as 5 seconds of audio input.
- Voice Management — List, retrieve, and update voices, or use the Voice Changer to transform existing audio.
- Pronunciation Control — Manage custom pronunciation dictionaries for specialized terminology or accents.
- Agent Orchestration — List and manage AI agents and monitor call logs and usage credits.
The Cartesia (Voice AI) MCP Server exposes 20 tools through the Vinkius. Connect it to Google ADK in under two minutes — credentials fully managed, no infrastructure to provision, no vendor lock-in. Your configuration, your data, your control.
All 20 Cartesia (Voice AI) tools available for Google ADK
When Google ADK connects to Cartesia (Voice AI) through Vinkius, your AI agent gets direct access to every tool listed below — spanning text-to-speech, speech-to-text, voice-synthesis, and more. Every call runs in a secure, isolated environment with full audit visibility. Beyond a simple connection, you get real-time monitoring of agent activity, enterprise governance, and optimized token usage.
Clone voice on Cartesia (Voice AI)
Clone a voice from a 5s audio clip
Create pronunciation dict on Cartesia (Voice AI)
Create a new pronunciation dictionary
Delete pronunciation dict on Cartesia (Voice AI)
Delete a pronunciation dictionary
Delete voice on Cartesia (Voice AI)
Delete a voice
Generate access token on Cartesia (Voice AI)
Generate a short-lived access token for client-side requests
Get agent on Cartesia (Voice AI)
Get details for a specific voice agent
Get usage credits on Cartesia (Voice AI)
Get credit usage statistics
Get voice on Cartesia (Voice AI)
Get details for a specific voice
Infill bytes on Cartesia (Voice AI)
Generate audio to smoothly connect two existing segments
List agent calls on Cartesia (Voice AI)
List calls and transcripts for a specific agent
List agents on Cartesia (Voice AI)
List all voice agents
List pronunciation dicts on Cartesia (Voice AI)
List pronunciation dictionaries
List voices on Cartesia (Voice AI)
List available voices
Localize voice on Cartesia (Voice AI)
Adapt a voice to a new language/dialect
Stt batch on Cartesia (Voice AI)
Transcribe audio file to text (Batch STT)
Tts bytes on Cartesia (Voice AI)
Generate text-to-speech audio bytes
Tts sse on Cartesia (Voice AI)
Generate text-to-speech via Server-Sent Events
Update pronunciation dict on Cartesia (Voice AI)
Update a pronunciation dictionary
Update voice on Cartesia (Voice AI)
Update voice metadata
Voice changer bytes on Cartesia (Voice AI)
Change voice of an audio clip while preserving intonation
Connect Cartesia (Voice AI) to Google ADK via MCP
Follow these steps to wire Cartesia (Voice AI) into Google ADK. The entire setup takes under two minutes — your credentials stay safe behind Vinkius.
Install Google ADK
pip install google-adkReplace the token
[YOUR_TOKEN_HERE] with your Vinkius tokenCreate the agent
Explore tools
Why Use Google ADK with the Cartesia (Voice AI) MCP Server
Google ADK provides unique advantages when paired with Cartesia (Voice AI) through the Model Context Protocol.
Google ADK natively supports MCP tool servers. declare a tool provider and the framework handles discovery, validation, and execution
Built on Gemini models, ADK provides long-context reasoning ideal for complex multi-tool workflows with Cartesia (Voice AI)
Production-ready features like session management, evaluation, and deployment come built-in. not bolted on
Seamless integration with Google Cloud services means you can combine Cartesia (Voice AI) tools with BigQuery, Vertex AI, and Cloud Functions
Cartesia (Voice AI) + Google ADK Use Cases
Practical scenarios where Google ADK combined with the Cartesia (Voice AI) MCP Server delivers measurable value.
Enterprise data agents: ADK agents query Cartesia (Voice AI) and cross-reference results with internal databases for comprehensive analysis
Multi-modal workflows: combine Cartesia (Voice AI) tool responses with Gemini's vision and language capabilities in a single agent
Automated compliance checks: schedule ADK agents to query Cartesia (Voice AI) regularly and flag policy violations or configuration drift
Internal tool platforms: build self-service agent platforms where teams connect their own MCP servers including Cartesia (Voice AI)
Example Prompts for Cartesia (Voice AI) in Google ADK
Ready-to-use prompts you can give your Google ADK agent to start working with Cartesia (Voice AI) immediately.
"List all available voices in my Cartesia account."
"Generate a WAV audio file saying 'Welcome to the future of AI' using voice ID 79a045e3-a621-4923-b05c-8029db0dffca."
"Check my current usage credits on Cartesia."
Troubleshooting Cartesia (Voice AI) MCP Server with Google ADK
Common issues when connecting Cartesia (Voice AI) to Google ADK through Vinkius, and how to resolve them.
McpToolset not found
pip install --upgrade google-adkCartesia (Voice AI) + Google ADK FAQ
Common questions about integrating Cartesia (Voice AI) MCP Server with Google ADK.
How does Google ADK connect to MCP servers?
Can ADK agents use multiple MCP servers?
Which Gemini models work best with MCP tools?
Explore More MCP Servers
View all →
Novu
39 toolsAutomate multi-channel notifications via Novu — trigger workflows, manage subscribers, and handle preferences directly from any AI agent.

ncScale
10 toolsMonitor and observe your no-code stack via ncScale — track logs, alerts, and tickets directly from your AI agent.

EZO Asset Intelligence
10 toolsEquip your AI agent to manage fixed assets, track inventory, and monitor checkouts via the EZO.io (EZOfficeInventory) API.

API-Futebol (Brazilian Football)
12 toolsThe definitive server for Brazilian football — track Brasileirão, Copa do Brasil, and State Leagues via AI.
