Vinkius

Speechmatics MCP. Transcribe Audio and Generate Voiceovers, Automatically.

Speechmatics brings high-accuracy audio processing to your agent. Transcribe large batches of audio files, whether podcasts or meeting recordings, into structured text. You can also convert any written script into natural-sounding speech using a choice of voices (such as Sarah, Theo, and Megan). It handles everything from batch transcription to job management, giving you full control over your audio pipelines.

Speechmatics MCP is compatible with Claude, ChatGPT, Cursor, Gemini, Windsurf, VS Code, JetBrains, and Vercel.

See Vinkius in Action

Give Claude and any AI agent real-world access

Transcribe audio files

Submit large audio recordings and receive highly accurate written transcripts.

Generate synthetic speech

Turn plain text into high-quality, natural-sounding voice audio using multiple character voices.

Manage transcription jobs

Keep track of every processing task, listing recent activity and checking the status of ongoing jobs.

Retrieve completed transcripts

Pull finished transcriptions in various formats like JSON or plain text for immediate use.

What AI agents can do with Speechmatics: 8 tools

These eight tools let you manage every step of advanced audio processing: submitting jobs, tracking status, generating voiceovers, and retrieving clean transcripts.

Make your AI actually useful.

Add this MCP to Claude, Cursor, or Windsurf and your AI stops guessing. It gets real tools to look things up, take action, and handle the stuff you keep doing by hand.

Create Job

Starts a new process to transcribe an audio file from a provided source.

Create Temp Key

Generates secure, temporary API keys for client-side access management.

Delete Job

Removes a transcription job from the system if it was started accidentally or is no...

Generate TTS

Converts specified text into an audio file using high-quality, natural speech voices.

Get Job

Retrieves the current status and specific details for a single transcription job ID.

Get Transcript

Pulls the final, completed text or subtitle file associated with a finished job.

Get Usage

Checks your current billing consumption and usage statistics for the service.

List Jobs

Shows a list of all recent transcription jobs you have submitted to the system.

Security and governance baked right in.

Pick your AI client below to get set up. Create a Vinkius account, subscribe, and you're up and running. We handle the backend infrastructure, with out-of-the-box support for Streamable HTTP, SSE, and OAuth2, so you have no routing to configure.

Claude AI

1. Open Claude Settings

Go to claude.ai, click your profile icon, then navigate to Customize → Connectors.

2. Add Custom Connector

Click the "+" button and select Add custom connector. Paste your Vinkius endpoint URL:

https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp

Replace [YOUR_TOKEN_HERE] with your token from cloud.vinkius.com. For OAuth-protected servers, expand Advanced settings to add credentials.
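
If you keep the token in a script or config instead of pasting it by hand, the endpoint can be assembled along these lines. This is only a sketch of the URL pattern shown above. The function name build_endpoint and the placeholder check are illustrative assumptions, not part of any Vinkius SDK.

```python
from urllib.parse import quote

# Host and path layout copied from the endpoint pattern shown on this page.
VINKIUS_EDGE = "https://edge.vinkius.com"


def build_endpoint(token: str) -> str:
    """Return the MCP endpoint URL for a Vinkius token (hypothetical helper)."""
    token = token.strip()
    # Catch the common mistake of leaving the placeholder (or nothing) in place.
    if not token or token == "[YOUR_TOKEN_HERE]":
        raise ValueError("replace the placeholder with your real token")
    # Percent-encode the token so it stays a single path segment.
    return f"{VINKIUS_EDGE}/{quote(token, safe='')}/mcp"
```

Calling build_endpoint with your token returns the string you would paste into the custom connector field.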

3. Start a conversation

Open a new chat. The Speechmatics integration is available immediately — no restart needed.

Choose How to Get Started

Build a custom MCP for your own tools, or connect a ready-made integration from our catalog.

Build Your Own

Turn any API into an MCP. Import a spec, define Agent Skills, or deploy with MCPFusion.

  • Import from OpenAPI, Swagger, or YAML specs
  • Create Agent Skills with progressive disclosure
  • Deploy to edge with MCPFusion framework
  • Built-in DLP, auth, and compliance on each call
  • Real-time usage dashboard and cost metering
  • Publish to catalog or keep private

Make Your AI Do More

Start with Speechmatics, then connect any of our 5,200+ other servers whenever your AI needs more. One click, no limits.

  • Use this MCP plus 5,200+ others, all in one place
  • Add new capabilities to your AI anytime you want
  • Connections are secured and governed automatically
  • Track usage and costs across all your servers
  • Works with Claude, ChatGPT, Cursor, and more
  • New servers added to the catalog weekly

Independent Platform Disclaimer: Vinkius is an independent platform and is not affiliated with, endorsed by, sponsored by, verified by, or otherwise authorized by Speechmatics. All third-party trademarks, logos, and brand names are the property of their respective owners. Their use on this website is strictly for informational purposes to identify service compatibility and interoperability.

VINKIUS CLOUD

  • Cloud Hosted: Managed infra
  • V8 Isolated: Sandboxed per request
  • Zero-Trust Proxy: No stored credentials
  • DLP Enforced: Policy on each call
  • GDPR Compliant: EU data residency
  • Token Compression: ~60% cost reduction

Your data is protected. See how we built it.

Sifting Through Hours of Raw Audio Is a Full-Time Job

Today, getting usable text from audio means switching to a separate transcription service. You upload the file, wait an unknown amount of time for processing, then log back in to download a messy export. The cycle means copying data out of one system, pasting it into another for cleanup, and hoping the timestamps still line up with your script.

With this MCP, audio processing happens inside your agent's conversation flow. You submit the source file, the job runs in the background, and the finished text or subtitles come back to you with a single get_transcript call.

Get Natural Speech with generate_tts

Writing scripts and then having a voice artist record them is slow, expensive, and requires scheduling. You also have to manually edit the audio file and sync it perfectly to your video timeline.

Now you give your agent the text and the voice you want, and it calls generate_tts. It delivers the finished, ready-to-use audio file in seconds. The friction of waiting and manual labor is gone.
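
Under the hood, an MCP client sends a JSON-RPC 2.0 tools/call message for each tool invocation. A rough sketch of what a generate_tts call might look like follows. The envelope comes from the MCP protocol, but the argument keys "text" and "voice" and the capitalized voice spelling are assumptions, since this page does not publish the tool's schema.

```python
# Voice names mentioned on this page; the real tool may accept others.
KNOWN_VOICES = {"Sarah", "Theo", "Megan"}


def tts_request(text: str, voice: str, request_id: int = 1) -> dict:
    """Build a JSON-RPC 2.0 tools/call message for generate_tts.

    The argument keys "text" and "voice" are hypothetical placeholders.
    """
    if not text.strip():
        raise ValueError("text must not be empty")
    # Normalize "theo" / "THEO" to the spelling used on this page.
    canonical = voice.strip().capitalize()
    if canonical not in KNOWN_VOICES:
        raise ValueError(f"unknown voice: {voice!r}")
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {
            "name": "generate_tts",
            "arguments": {"text": text, "voice": canonical},
        },
    }
```

In practice your agent builds this message for you. The sketch is only meant to show which pieces of information the tool needs.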

What Speechmatics MCP does for your AI

Dealing with raw audio is a massive headache for any workflow. Before this MCP, turning hours of recorded conversation or video content into usable text required specialized software and tedious manual exports. Now, your agent connects directly to Speechmatics through Vinkius, letting you handle advanced audio processing as part of a natural conversation.

You can feed it an audio file—via URL or base64—and quickly start a batch transcription job. Need voiceovers for training videos? Just give it the text and tell it which high-quality voice to use. The system manages all the background work, monitoring your jobs until the transcript is ready for you to pull out in JSON or SRT format.
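
To make the "URL or base64" choice concrete, here is a minimal sketch of preparing create_job arguments from either source. The keys "url" and "audio_base64" are made-up placeholders. Check the tool's actual input schema in your client before relying on them.

```python
import base64
from typing import Optional


def create_job_arguments(url: Optional[str] = None,
                         data: Optional[bytes] = None) -> dict:
    """Build create_job arguments from exactly one audio source.

    The keys "url" and "audio_base64" are hypothetical placeholders.
    """
    # Exactly one of the two sources must be supplied.
    if (url is None) == (data is None):
        raise ValueError("provide exactly one of url or data")
    if url is not None:
        return {"url": url}
    # base64 output is plain ASCII, so it can travel inside a JSON string.
    return {"audio_base64": base64.b64encode(data).decode("ascii")}
```

A URL is usually the better choice for long recordings, because base64 encoding inflates the payload by roughly a third.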

Built · Hosted · Managed by Vinkius
Speechmatics-MCP: Transcribe Audio & Generate Voice
Server ID 019e38f0-fe67-71c5-b58a-73494de646c4
Vinkius Inspector
Compliance Grade A+
Score 100/100

Frequently asked questions about Speechmatics MCP

How do I transcribe a large podcast episode with Speechmatics MCP?

Start with create_job, passing the audio as a URL or as base64-encoded data. Your agent monitors the job's status until it completes, then uses get_transcript to pull the final text.
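
The three steps above can be sketched as a small polling loop. Here call_tool stands for whatever function your MCP client exposes for invoking a tool. The field names ("url", "job_id", "status", "format") and the status values "done" and "rejected" are assumptions for illustration, not a documented schema.

```python
import time
from typing import Any, Callable, Dict

# Signature of a generic "invoke this tool with these arguments" function.
ToolCaller = Callable[[str, Dict[str, Any]], Dict[str, Any]]


def transcribe(call_tool: ToolCaller, audio_url: str,
               poll_seconds: float = 5.0, max_polls: int = 120) -> Dict[str, Any]:
    """Run create_job, poll get_job, then fetch the result with get_transcript."""
    job = call_tool("create_job", {"url": audio_url})
    job_id = job["job_id"]  # assumed response field
    for _ in range(max_polls):
        status = call_tool("get_job", {"job_id": job_id})["status"]
        if status == "done":
            return call_tool("get_transcript", {"job_id": job_id, "format": "json"})
        if status == "rejected":
            raise RuntimeError(f"job {job_id} was rejected")
        # Still processing: wait before asking again.
        time.sleep(poll_seconds)
    raise TimeoutError(f"job {job_id} did not finish after {max_polls} polls")
```

Passing the tool caller in as a parameter keeps the loop independent of any particular client library, and makes it easy to try out with a stub.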

Can I generate subtitles with Speechmatics MCP?

Yes. Once a job started with create_job has finished, call get_transcript and specify SRT as the format to receive a subtitle file.
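
If you have not worked with SRT before, the sketch below shows the general shape of such a file: numbered cues, each with a start and end timestamp and a line of text. It is a generic illustration of the SRT format, not the server's own code. The helper names and the (start, end, text) tuple layout are invented for the example.

```python
def srt_timestamp(seconds: float) -> str:
    """Format seconds as HH:MM:SS,mmm, the timestamp style SRT uses."""
    millis = round(seconds * 1000)
    hours, millis = divmod(millis, 3_600_000)
    minutes, millis = divmod(millis, 60_000)
    secs, millis = divmod(millis, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{millis:03d}"


def to_srt(cues) -> str:
    """Render (start_seconds, end_seconds, text) tuples as numbered SRT cues."""
    blocks = []
    for index, (start, end, text) in enumerate(cues, start=1):
        blocks.append(
            f"{index}\n{srt_timestamp(start)} --> {srt_timestamp(end)}\n{text}\n"
        )
    # Cues are separated by a blank line.
    return "\n".join(blocks)
```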

Is there a way to track my spending on Speechmatics MCP?

Yes. Call the get_usage tool at any time to check your account's consumption statistics without leaving your current workflow.

What is the difference between list_jobs and get_job in Speechmatics MCP?

list_jobs shows a summary of all recent jobs you've run. Use get_job when you know the specific ID of one job and need detailed status updates on it.
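
As a rough sketch of that rule of thumb, a helper could pick the tool based on whether a job ID is known. call_tool is a stand-in for your MCP client's tool-invocation function, and the "job_id" argument key is an assumption.

```python
from typing import Any, Callable, Dict, Optional

ToolCaller = Callable[[str, Dict[str, Any]], Any]


def job_lookup(call_tool: ToolCaller, job_id: Optional[str] = None) -> Any:
    """Use get_job when an ID is known, otherwise fall back to list_jobs."""
    if job_id is None:
        # No ID yet: ask for the summary of recent jobs.
        return call_tool("list_jobs", {})
    # Known ID: ask for that job's detailed status.
    return call_tool("get_job", {"job_id": job_id})
```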

Do I need to manage API keys for Speechmatics MCP?

Yes, but it’s easy. You can use create_temp_key to generate temporary credentials, keeping your main key secure while allowing controlled access for testing or specific integrations.