Play.ht (Voice Cloning) MCP Server with 2 Tools for Claude, Cursor, and AI Agents
Generate ultra-realistic speech and clone voices instantly using Play.ht's advanced AI voice engines directly from your agent. Vinkius routes your AI agents directly to Play.ht (Voice Cloning) through a governed connection. 2 tools ready to use with Claude, ChatGPT, Cursor, or any AI agent — no hosting, no setup, connect in 30 seconds.
Ask AI about this server
Compatible with every major AI agent and IDE

* Every MCP server runs on Vinkius-managed infrastructure inside AWS - a purpose-built runtime with per-request V8 isolates, Ed25519 signed audit chains, and sub-40ms cold starts optimized for native MCP execution. See our infrastructure
What is the Play.ht MCP Server?
The Play.ht MCP Server routes AI agents like Claude, ChatGPT, and Cursor directly to Play.ht via 2 tools. Generate ultra-realistic speech and clone voices instantly using Play.ht's advanced AI voice engines directly from your agent. Powered by Vinkius — your credentials stay on your side of the connection, every request is auditable. Connect in under 2 minutes.
Built-in capabilities (2)
Tools for your AI Agents to operate Play.ht
Ask your AI agent "Generate an MP3 of 'Hello, how are you today?' using voice ID 's3://voice-cloning-zero-shot/...' and the Play3.0-mini engine." and get the answer without opening a single dashboard. With 2 tools connected to real Play.ht data, your agents reason over live information, cross-reference it with other MCP servers, and deliver insights you would spend hours assembling manually.
Works with Claude, ChatGPT, Cursor, and any MCP-compatible client. Powered by Vinkius — your credentials never touch the AI model, every request is auditable. Connect in under two minutes.
Why teams choose Vinkius
One subscription gives you the infrastructure to connect your AI agents to thousands of MCP servers — and deploy your own to the Vinkius Edge. Your credentials stay yours. Your data flows directly between your agent and the API. DLP blocks sensitive information from ever reaching the model, kill switch for instant shutdown, and up to 60% token savings. Enterprise-grade routing and governance, zero maintenance.
Build your own MCP Server with our secure development framework →The Play.ht (Voice Cloning) App Connector works with every AI agent you already use
…and any MCP-compatible client


















Use all 2 Play.ht (Voice Cloning) tools with your AI agents right now
Vinkius routes your AI agents to Play.ht (Voice Cloning) through a governed proxy. Beyond a simple connection, you get full visibility into every action your agents perform, with enterprise-grade security and up to 60% savings on AI costs.
Create instant voice clone on Play.ht (Voice Cloning)
Provide the audio file as a base64 encoded string. Create an instant voice clone from an audio sample
Generate tts stream on Play.ht (Voice Cloning)
Generate audio from text using Play.ht TTS
What the Play.ht (Voice Cloning) MCP Server unlocks
Connect Play.ht to your AI agent to generate high-quality Text-to-Speech (TTS) and create instant voice clones through natural conversation.
What you can do
- Text-to-Speech Generation — Convert text into lifelike audio using various engines like Play3.0-mini and PlayHT2.0-turbo.
- Instant Voice Cloning — Create a digital twin of any voice by simply providing a short audio sample.
- Fine-grained Control — Adjust speed, quality, emotion, and temperature to get the perfect vocal performance.
- Multi-language Support — Generate speech in multiple languages including English, French, Spanish, and more.
How it works
1. Subscribe to this server
2. Enter your Play.ht User ID and API Key
3. Start generating audio and cloning voices from Claude, Cursor, or any MCP-compatible client
Who is this for?
- Content Creators — automate voiceovers for videos and podcasts without manual recording
- Developers — integrate realistic voice interactions into applications and games
- Marketers — create personalized audio messages and localized content at scale
Frequently asked questions about the Play.ht (Voice Cloning) MCP Server
How can I generate audio from text using a specific voice?
Use the generate_tts_stream tool. Provide the text and the unique voice ID. You can also customize the voice_engine (like Play3.0-mini) and emotion for better results.
Can I create a new voice clone with this server?
Yes! Use the create_instant_voice_clone tool. You'll need to provide a name for the voice and a base64 encoded audio sample of the voice you want to clone.
What audio formats are supported for generation?
When using generate_tts_stream, you can specify the output_format as mp3, wav, ogg, flac, or mulaw.
More in this category

Qdrant
7 toolsEmpower your AI to interact directly with your Qdrant vector database — query clusters, perform similarity searches, and manage collections effortlessly.

Perplexity AI Alternative
8 toolsAccess Perplexity's AI search and chat models — get web-grounded answers with citations, search the web and run AI conversations from any AI agent.

DataRobot
6 toolsManage AutoML via DataRobot — monitor projects and models, track deployments, and audit ML datasets directly from any AI agent.

Exa
3 toolsSemantic search engine built for AI — find conceptually relevant web content, not just keyword matches. Powered by neural search technology.
You might also like

Vercel
11 toolsDeploy frontend applications instantly with a platform optimized for Next.js, serverless functions, and edge computing globally.

Mindbody
15 toolsManage classes, appointments, clients, staff, and sales from your Mindbody-powered fitness studio, spa, or wellness business through natural conversation.

AT&T Messaging
9 toolsCPaaS Messaging -- Send SMS/MMS, manage shortcodes, track delivery status, and run bulk campaigns via AT&T Messaging API.

Nylas
10 toolsEquip your AI agent to manage emails, calendars, and contacts across all providers (Gmail, Outlook) through a single unified interaction.
We built the connector to Play.ht (Voice Cloning). Now put your agents to work. Fully governed.
Vinkius is the AI Gateway with managed hosting. Stop building connectors. Every connection runs inside eight layers of security.
Hosted, sandboxed, and live on AWS. You don't provision anything. You don't maintain anything. You connect.
Every tool call, every token, every response. Logged and auditable. Data flows direct from Play.ht (Voice Cloning) to your agent. Nothing is stored on our side. Ever.
Eight governance layers on every request. Sensitive data redacted before it reaches the model. Kill switch if anything goes sideways. Always on.
