Compatible with every major AI agent and IDE
What is the Play.ht (Voice Cloning) MCP Server?
Connect Play.ht to your AI agent to generate high-quality Text-to-Speech (TTS) and create instant voice clones through natural conversation.
What you can do
- Text-to-Speech Generation — Convert text into lifelike audio using various engines like Play3.0-mini and PlayHT2.0-turbo.
- Instant Voice Cloning — Create a digital twin of any voice by simply providing a short audio sample.
- Fine-grained Control — Adjust speed, quality, emotion, and temperature to get the perfect vocal performance.
- Multi-language Support — Generate speech in multiple languages including English, French, Spanish, and more.
How it works
- Subscribe to this server
- Enter your Play.ht User ID and API Key
- Start generating audio and cloning voices from Claude, Cursor, or any MCP-compatible client
Who is this for?
- Content Creators — automate voiceovers for videos and podcasts without manual recording
- Developers — integrate realistic voice interactions into applications and games
- Marketers — create personalized audio messages and localized content at scale
Built-in capabilities (2)
Provide the audio file as a base64 encoded string. Create an instant voice clone from an audio sample
Generate audio from text using Play.ht TTS
Why Vercel AI SDK?
The Vercel AI SDK gives every Play.ht (Voice Cloning) tool full TypeScript type inference, IDE autocomplete, and compile-time error checking. Connect 2 tools through Vinkius and stream results progressively to React, Svelte, or Vue components. works on Edge Functions, Cloudflare Workers, and any Node.js runtime.
- —
TypeScript-first: every MCP tool gets full type inference, IDE autocomplete, and compile-time error checking out of the box
- —
Framework-agnostic core works with Next.js, Nuxt, SvelteKit, or any Node.js runtime. same Play.ht (Voice Cloning) integration everywhere
- —
Built-in streaming UI primitives let you display Play.ht (Voice Cloning) tool results progressively in React, Svelte, or Vue components
- —
Edge-compatible: the AI SDK runs on Vercel Edge Functions, Cloudflare Workers, and other edge runtimes for minimal latency
Play.ht (Voice Cloning) in Vercel AI SDK
Play.ht (Voice Cloning) and 4,000+ other MCP servers. One platform. One governance layer.
Teams that connect Play.ht (Voice Cloning) to Vercel AI SDK through Vinkius don't need to source, host, or maintain individual MCP servers. Every tool call runs inside a hardened runtime with credential isolation, DLP, and a signed audit chain.
Raw MCP | Vinkius | |
|---|---|---|
| Server catalog | Find and host yourself | 4,000+ managed |
| Infrastructure | Self-hosted | Sandboxed V8 isolates |
| Credential handling | Plaintext in config | Vault + runtime injection |
| Data loss prevention | None | Configurable DLP policies |
| Kill switch | None | Global instant shutdown |
| Financial circuit breakers | None | Per-server limits + alerts |
| Audit trail | None | Ed25519 signed logs |
| SIEM log streaming | None | Splunk, Datadog, Webhook |
| Honeytokens | None | Canary alerts on leak |
| Custom domains | Not applicable | DNS challenge verified |
| GDPR compliance | Manual effort | Automated purge + export |
Why teams choose Vinkius for Play.ht (Voice Cloning) in Vercel AI SDK
The Play.ht (Voice Cloning) MCP Server runs on Vinkius-managed infrastructure inside AWS — a purpose-built runtime with per-request V8 isolates, Ed25519 signed audit chains, and sub-40ms cold starts. All 2 tools execute in hardened sandboxes optimized for native MCP execution.
Your AI agents in Vercel AI SDK only access the data you authorize, with DLP that blocks sensitive information from ever reaching the model, kill switch for instant shutdown, and up to 60% token savings. Enterprise-grade infrastructure, zero maintenance.

* Every MCP server runs on Vinkius-managed infrastructure inside AWS - a purpose-built runtime with per-request V8 isolates, Ed25519 signed audit chains, and sub-40ms cold starts optimized for native MCP execution. See our infrastructure
How Vinkius secures
Play.ht (Voice Cloning) for Vercel AI SDK
Every tool call from Vercel AI SDK to the Play.ht (Voice Cloning) MCP Server is protected by DLP redaction, cryptographic audit chains, V8 sandbox isolation, kill switch, and financial circuit breakers.
Frequently asked questions
How can I generate audio from text using a specific voice?
Use the generate_tts_stream tool. Provide the text and the unique voice ID. You can also customize the voice_engine (like Play3.0-mini) and emotion for better results.
Can I create a new voice clone with this server?
Yes! Use the create_instant_voice_clone tool. You'll need to provide a name for the voice and a base64 encoded audio sample of the voice you want to clone.
What audio formats are supported for generation?
When using generate_tts_stream, you can specify the output_format as mp3, wav, ogg, flac, or mulaw.
How does the Vercel AI SDK connect to MCP servers?
Import createMCPClient from @ai-sdk/mcp and pass the server URL. The SDK discovers all tools and provides typed TypeScript interfaces for each one.
Can I use MCP tools in Edge Functions?
Yes. The AI SDK is fully edge-compatible. MCP connections work on Vercel Edge Functions, Cloudflare Workers, and similar runtimes.
Does it support streaming tool results?
Yes. The SDK provides streaming primitives like useChat and streamText that handle tool calls and display results progressively in the UI.
createMCPClient is not a function
Install: npm install @ai-sdk/mcp
Explore More MCP Servers
View all →
Airbnb
12 toolsSearch and manage Airbnb listings, experiences, reservations, and pricing directly from any AI agent.

Sinch
10 toolsEnable your AI agent to send SMS messages, monitor delivery reports, and manage contact groups via the Sinch API.

Flodesk
10 toolsDesign gorgeous email campaigns with intuitive templates that grow your audience and reflect your brand without design skills.

HTMLCSSToImage
10 toolsGenerate high-quality images and PDFs from HTML/CSS or URLs directly from your AI agent.
