Coqui TTS MCP for AI. Turn Text Into Studio-Quality Speech.

Q: How can I check which voice models are currently installed on my server?

You can use the listmodels tool. Your agent will query the Coqui server and return a list of all available TTS models ready for synthesis.

Q: Is it possible to generate audio files from a text string directly?

Yes! Use the synthesizespeech tool by providing the text you want to convert. The agent will process it through Coqui and return the audio metadata.

Q: What do I need to provide to connect my local Coqui instance?

You only need to provide the COQUISERVERURL. This is the base address where your Coqui Speech Studio API is reachable (e.g., http://localhost:5002).

Claude

ChatGPT

Cursor

Gemini

Windsurf

VS Code

JetBrains

Vercel

See Vinkius in Action

Works with every AI agent you already use

…and any MCP-compatible client

Connect to your AI in seconds.

Coqui TTS (Open Source Speech Studio API) instantly converts text into high-quality speech audio. This MCP connects your AI client to self-hosted or cloud Coqui models, letting you list available voices and generate accurate voiceovers directly from an agent conversation.

It’s perfect for developers who need reliable, open-source Text-to-Speech output without leaving their code editor.

What your AI can do

List models

Finds and reports the full list of all text-to-speech models currently running on your Coqui server.

Synthesize speech

Generates an actual audio file based on a text input using one of your available TTS models.

Check available voices

You ask what models are ready, and it returns a list of all TTS voices currently loaded on your Coqui server.

Generate audio from text

It takes any block of text you provide and immediately converts it into synthesized speech.

Ask an AI about this

Included with Plan

Waiting for input…

AI Agent

Coqui TTS (Open Source Speech Studio API) with 2 Tools

These two tools let you manage available voices and then use them to convert any text input into synthesized speech audio.

Make your AI actually useful.

Add this MCP to Claude, Cursor, or Windsurf and your AI stops guessing. It gets real tools to look things up, take action, and handle the stuff you keep doing by hand.

Start using Coqui TTS (Open Source Speech Studio API) on Vinkius

List Models

Finds and reports the full list of all text-to-speech models currently running on your Coqui server.

Synthesize Speech

Generates an actual audio file based on a text input using one of your available TTS...

Security and governance baked right in.

Pick your AI client below to get set up. Just create a Vinkius account, subscribe, and you're instantly up and running. We handle the entire backend infrastructure, delivering out-of-the-box support for HTTPS Streamable, SSE, and OAuth2—zero messy routing required.

Claude AI

Open Claude Settings

Go to claude.ai, click your profile icon, then navigate to Customize → Connectors.

Add Custom Connector

Click the "+" button and select Add custom connector. Paste your Vinkius endpoint URL:

https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp

Replace [YOUR_TOKEN_HERE] with your token from cloud.vinkius.com. For OAuth-protected servers, expand Advanced settings to add credentials.

Start a conversation

Open a new chat. The Coqui TTS integration is available immediately — no restart needed.

Antigravity

Configure Agent Environment

Open your Antigravity agent's workspace configuration or mcp-servers.json file.

Bind the Endpoint

Add the Vinkius endpoint URL to your agent's MCP connections list:

"mcp_servers": {
  "coqui-tts-open-source-speech-studio-api": {
    "serverUrl": "https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp"
  }
}

Provide your secure token in place of [YOUR_TOKEN_HERE] to ensure your agent requests are authenticated.

Execute

Start your Antigravity session. The agent will autonomously discover and utilize the Coqui TTS tools with full Vinkius guardrails applied.

VS Code Copilot

⚡

One-Click Install (Recommended)

In your Vinkius Dashboard, simply click the Add to VS Code button for this server. We'll automatically configure your local workspace.

Or configure manually

Open MCP Settings

Open VS Code, press Ctrl/Cmd + Shift + P, and search for GitHub Copilot: MCP Servers.

Add Server Config

Add the Vinkius endpoint configuration to your mcp-servers.json file:

"coqui-tts-open-source-speech-studio-api": {
  "url": "https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp"
}

Ensure you replace [YOUR_TOKEN_HERE] with your token from cloud.vinkius.com.

LangChain

Install Dependencies

Install the LangChain MCP adapters for your environment:

pip install langchain-mcp-adapters

Connect the Server

Use the SSEClient in LangChain to connect to the Vinkius managed endpoint:

from langchain_mcp_adapters.client import SSEClient

# Connect to Vinkius
client = SSEClient(url="https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp")
tools = client.get_tools()

CrewAI

Define the Tool

Load the Vinkius MCP tools into your CrewAI agents:

from crewai import Agent
from mcp_crewai import MCPTool

# Connect securely to Vinkius
vinkius_tools = MCPTool(url="https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp")

# Assign to Agent
researcher = Agent(
    role='Data Researcher',
    tools=vinkius_tools.get_all()
)

Execute Task

Run your CrewAI process. The agent will autonomously route tasks to the Vinkius managed server.

Choose How to Get Started

Build a custom MCP for your own tools, or connect a ready-made integration from our catalog.

Build Your Own

Turn any API into an MCP. Import a spec, define Agent Skills, or deploy with MCPFusion.

Import from OpenAPI, Swagger, or YAML specs
Create Agent Skills with progressive disclosure
Deploy to edge with MCPFusion framework
Built in DLP, auth, and compliance on every call
Real time usage dashboard and cost metering
Publish to catalog or keep private

Start building

Make Your AI Do More

Start with Coqui TTS (Open Source Speech Studio API), then connect any of our 5,100+ other servers whenever your AI needs more. One click, no limits.

Use this MCP plus 5,100+ others, all in one place
Add new capabilities to your AI anytime you want
Every connection is secured and compliant automatically
Track usage and costs across all your servers
Works with Claude, ChatGPT, Cursor, and more
New servers added to the catalog every week

Independent Platform Disclaimer: Vinkius is an independent platform and is not affiliated with, endorsed by, sponsored by, verified by, or otherwise authorized by Coqui TTS. All third-party trademarks, logos, and brand names are the property of their respective owners. Their use on this website is strictly for informational purposes to identify service compatibility and interoperability.

VINKIUS INFRASTRUCTURE

Cloud Hosted

Managed infra

V8 Isolated

Sandboxed per request

Zero-Trust Proxy

No stored credentials

DLP Enforced

Policy on every call

GDPR Compliant

EU data residency

Token Compression

~60% cost reduction

Your data is protected. See how we built it.

Works with Claude, ChatGPT, Cursor, and more

The Model Context Protocol standardizes how applications expose capabilities to LLMs. Instead of operating in isolation, your AI gains direct access to external platforms, live data, and real-world actions through secure, standardized connections.

This connection provides 2 powerful capabilities that interface natively with Claude, ChatGPT, Cursor, and other compatible AI platforms. No middleware. No custom integration required.

The manual process of creating voice samples is slow and expensive.

Right now, if you need a new script read aloud for testing, you either record it yourself (taking up hours) or hire a freelancer. This means copy-pasting the text into an external service's web form, waiting minutes for processing, and then downloading individual MP3 files—a tedious loop of copy/paste and file management.

With this MCP connected to your agent, you skip the UI entirely. You just tell your agent what needs saying. It handles connecting to your Coqui API, confirms the model is ready, runs `synthesize_speech`, and gives you the audio metadata instantly—all without leaving your chat window.

Synthesize Speech with Coqui TTS

The biggest win is eliminating the need for manual API scripting. You don't have to write, 'First call Model A; then pass text X.' Your agent handles that orchestration automatically when you use `synthesize_speech`.

It’s a massive difference now. Voice generation isn't a multi-step coding project; it's just another conversation prompt.

Support 24/7 support@vinkius.com ↗

Security Vinkius Trust Center ↗

SLA Service Level Agreement ↗

Report Listing Send Report ↗

What your AI can actually do with this

Need to turn written text into speech? This MCP connects your AI client to a Coqui Speech Studio API endpoint. You can use this connection through Vinkius to get high-quality voice synthesis from models you manage yourself. It lets your agent discover all the voices available on your server and then synthesize audio based on natural conversation.

Whether you're building an app or just making sample voiceovers, you send text, and it comes back as spoken audio. You don't have to write separate scripts; your agent handles the whole process. This is how developers build features that actually talk.

Built · Hosted · Managed by Vinkius Coqui TTS MCP - Synthesize Audio from Text

Server ID 019e5d0c-1be5-727d-8d4b-17b201c1c2ff

Vinkius Inspector

Compliance Grade A+

Score 100/100

Report View Report ↗

What Changes When You Connect

You get reliable, open-source voice generation. You don't rely on proprietary APIs with usage caps or unpredictable costs.

Using the list_models tool lets you see exactly which voices are active on your server before you write a single line of code.

The synthesis process is streamlined. Instead of writing boilerplate API calls, your agent handles the text-to-speech conversion for you.

It’s built for developers who need voice output in their application logic. You just tell the agent to synthesize speech using synthesize_speech.

You keep control of your models. Since this connects to your self-hosted Coqui API, you manage the infrastructure and data.

See it in action

01 01

Creating a product tour walkthrough

A technical writer needs to demonstrate how a new feature works. Instead of recording three separate voice tracks, they use their agent to run list_models first, pick an English model, and then call synthesize_speech repeatedly for each step. The result is a cohesive audio guide.

02 02

Testing localization models

A global product manager wants to see if their new Chinese language model works correctly. They use the agent, which calls list_models, confirms the correct locale ID is available, and then uses synthesize_speech to test a sample phrase.

03 03

Building an automated notification system

A developer builds a CI/CD pipeline that needs to read error logs aloud for quick review. They connect the MCP, confirming model availability with list_models, and then pass the log text to synthesize_speech.

04 04

Generating sample voiceovers quickly

A content creator has 50 lines of script for a podcast trailer. Using the agent, they batch-feed the text into synthesize_speech after confirming model health with list_models, generating all audio files in minutes.

The honest tradeoffs

Assuming generic voice quality

Anti-pattern

The developer just sends random text to a general TTS API and gets an unusable, robotic sound that doesn't match the brand tone.

The Fix

First, use list_models to find specific models (like XTTS) known for better quality. Then, pass the text to synthesize_speech using that model ID. This gives you control over the voice.

Skipping initial model discovery

Anti-pattern

The developer writes code assuming a specific English model exists, but due to server changes or deployment issues, the call fails immediately.

The Fix

Always run list_models first. This confirms your current setup and prevents runtime failures when calling synthesize_speech.

Using TTS for complex audio

Anti-pattern

Trying to synthesize a sound effect or music track using the text-to-speech tool.

The Fix

This MCP is strictly for speech. For non-speech sounds, you need dedicated audio libraries, not synthesize_speech.

Questions you might have

How can I check which voice models are currently installed on my server? +

You can use the list_models tool. Your agent will query the Coqui server and return a list of all available TTS models ready for synthesis.

Is it possible to generate audio files from a text string directly? +

Yes! Use the synthesize_speech tool by providing the text you want to convert. The agent will process it through Coqui and return the audio metadata.

What do I need to provide to connect my local Coqui instance? +

You only need to provide the COQUI_SERVER_URL. This is the base address where your Coqui Speech Studio API is reachable (e.g., http://localhost:5002).

When I use list_models, how do I determine if a model supports a specific language? +

The model name itself indicates compatibility. Look for standard prefixes like 'en' for English or 'multilingual' for broad dialect support. This helps you select the right voice profile upfront.

After calling synthesize_speech, how do I retrieve detailed information about the generated audio file? +

The system returns comprehensive metadata immediately after synthesis. You get details on the file ID, model configuration used, and storage location for easy retrieval.

What happens if my API connection fails during synthesize_speech? +

If the service encounters an issue, the agent returns a specific HTTP status code along with an error message. This allows you to quickly debug whether it's a connectivity or input problem.

Are there any rate limits when I use synthesize_speech? +

Rate limiting depends entirely on your self-hosted Coqui setup. Your API provider manages the throttling, and the agent will pass those specific error codes back to you for handling.

What file formats can I expect after running synthesize_speech? +

The API handles standard audio formats like WAV and MP3. For definitive proof of supported output types, consult the official Coqui documentation or use list_models to check capabilities.

Connect to your AI in seconds.

List models

Synthesize speech

Coqui TTS (Open Source Speech Studio API) with 2 Tools

Make your AI actually useful.

List Models

Synthesize Speech

Security and governance baked right in.

Claude AI

Open Claude Settings

Add Custom Connector

Start a conversation

Claude Code

Open your terminal

Add the MCP Server

Start coding

Cursor

One-Click Install (Recommended)

Open Cursor Settings

Add New Server

Use in Composer

Antigravity

Configure Agent Environment

Bind the Endpoint

Execute

VS Code Copilot

One-Click Install (Recommended)

Open MCP Settings

Add Server Config

Windsurf

One-Click Install (Recommended)

Open Windsurf Settings

Add Server Endpoint

LangChain

Install Dependencies

Connect the Server

CrewAI

Define the Tool

Execute Task

Choose How to Get Started

Build Your Own

Make Your AI Do More

Works with Claude, ChatGPT, Cursor, and more

The manual process of creating voice samples is slow and expensive.

Synthesize Speech with Coqui TTS

What your AI can actually do with this

Here's how it actually works

Who is this actually for?

What Changes When You Connect

See it in action

Creating a product tour walkthrough

Testing localization models

Building an automated notification system

Generating sample voiceovers quickly

The honest tradeoffs

Assuming generic voice quality

Skipping initial model discovery

Using TTS for complex audio

When It Fits, When It Doesn't

Questions you might have