Deepgram MCP Server
Power audio AI via Deepgram — perform high-speed speech-to-text, generate lifelike text-to-speech, track usage, and manage API keys directly from any AI agent.
Vinkius supports streamable HTTP and SSE.

* Every MCP server runs on Vinkius-managed infrastructure inside AWS - a purpose-built runtime with per-request V8 isolates, Ed25519 signed audit chains, and sub-40ms cold starts optimized for native MCP execution.
What is the Deepgram MCP Server?
The Deepgram MCP Server gives AI agents like Claude, ChatGPT, and Cursor direct access to Deepgram via 10 tools. It covers high-speed speech-to-text, lifelike text-to-speech, usage tracking, and API key management. Powered by Vinkius: no API keys exposed to the model, no infrastructure to run, and setup in under 2 minutes.
Built-in capabilities (10)
Tools for your AI Agents to operate Deepgram
Ask your AI agent "Transcribe this audio: https://example.com/recording.mp3 using nova-2" and get the answer without opening a single dashboard. With 10 tools connected to real Deepgram data, your agents reason over live information, cross-reference it with other MCP servers, and deliver insights you would spend hours assembling manually.
Works with Claude, ChatGPT, Cursor, and any MCP-compatible client. Powered by Vinkius: your credentials never touch the AI model, and every request is auditable. Connect in under two minutes.
Why teams choose Vinkius
One subscription gives you access to thousands of MCP servers, and you can deploy your own to the Vinkius Edge. Your AI agents only access the data you authorize, with DLP that blocks sensitive information from ever reaching the model, a kill switch for instant shutdown, and up to 60% token savings. Enterprise-grade infrastructure and security, zero maintenance.
Build your own MCP Server with our secure development framework →
Vinkius works with every AI agent you already use
…and any MCP-compatible client
Deepgram MCP Server capabilities
10 tools
Identify precise active arrays spanning native Gateway auth
Inspect deep internal arrays mitigating specific Plan Math
Retrieve explicit Cloud logging tracing explicit Vault limits
Perform structural extraction of properties driving active Account logic
Provision a highly-available JSON Payload generating hard Customer bindings
Dispatch an automated validation check routing explicit Gateway history
Identify bounded CRM records inside the Headless Deepgram Platform
Identify precise active arrays spanning native Hold parsing
Enumerate explicitly attached structured rules exporting active Billing
Irreversibly vaporize explicit validations extracting rich Churn flags
What the Deepgram MCP Server unlocks
Connect your Deepgram account to any AI agent and take full control of your speech-to-text (STT) and text-to-speech (TTS) workflows through natural conversation.
What you can do
- Speech-to-Text (STT): Transcribe remote audio URLs (WAV or MP3) with the Nova-2 model
- Text-to-Speech (TTS): Generate high-fidelity audio from raw text using Aura voices, returned as a binary audio stream directly in your chat
- Usage Monitoring: Query /usage to review API transcription time and TTS byte usage
- Project & Key Management: List and create Deepgram API keys, and isolate the projects where audio AI billing is enforced
- Wallet Oversight: Retrieve project balances and check them against wallet thresholds so pipelines never drop
- Identity & Invites: List project members and send team invites scoped to specific project UUIDs
How it works
1. Subscribe to this server
2. Enter your Deepgram API Key (found in the Deepgram Console under Settings > API Keys)
3. Start managing your audio AI workflows from Claude, Cursor, or any MCP-compatible client
Who is this for?
- AI Developers — test STT/TTS models and manage API keys without leaving the development environment
- Product Teams — monitor audio AI usage and verify transcription accuracy in real-time
- Data Engineers — audit transcription volumes and manage project-wide audio pipelines using natural language
- Ops Teams — track wallet balances and manage team access across multiple Deepgram projects
Frequently asked questions about the Deepgram MCP Server
Can my agent transcribe an audio file from a public URL?
Yes. Use the 'transcribe_url' tool. Provide the public URL of the audio file (WAV, MP3, etc.) and specify the model (e.g., 'nova-2'). The agent will dispatch the request to Deepgram and return the transcribed text instantly.
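For readers curious what the agent sends, an MCP client invokes a tool with a JSON-RPC 2.0 tools/call request. The sketch below builds such a payload for 'transcribe_url'. The argument names url and model are assumptions for illustration, since this page does not list the tool's input schema; the helper build_tool_call is likewise hypothetical.

```python
import json


def build_tool_call(request_id, tool_name, arguments):
    """Build an MCP tools/call request as a JSON-RPC 2.0 message."""
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": tool_name, "arguments": arguments},
    }


# Assumed argument names ("url", "model"); confirm them against the
# schema the server reports via tools/list before relying on them.
transcribe_request = build_tool_call(
    1,
    "transcribe_url",
    {"url": "https://example.com/recording.mp3", "model": "nova-2"},
)

print(json.dumps(transcribe_request, indent=2))
```

In practice the MCP client constructs this message for you; the sketch only makes the shape of the call visible.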
How do I generate speech from text using the agent?
Use the 'speak_text' tool. Provide the text script and the target voice model (e.g., 'aura-asteria-en'). Your agent will trigger the high-fidelity Aura voice engine and return the binary audio stream data.
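How the binary audio arrives depends on the server. The MCP specification defines an audio content item carrying base64-encoded data and a mimeType, and the sketch below assumes 'speak_text' uses that shape. That is an assumption, not something this page confirms, and the sample result is fabricated purely to exercise the code.

```python
import base64


def extract_audio(tool_result):
    """Return (mime_type, raw_bytes) for the first audio item, or None."""
    for item in tool_result.get("content", []):
        if item.get("type") == "audio":
            return item.get("mimeType"), base64.b64decode(item["data"])
    return None


# Hypothetical tool result; the real speak_text output may be shaped differently.
sample_result = {
    "content": [
        {
            "type": "audio",
            "mimeType": "audio/mpeg",
            "data": base64.b64encode(b"fake-audio-bytes").decode("ascii"),
        }
    ]
}

mime_type, audio_bytes = extract_audio(sample_result)
print(mime_type, len(audio_bytes))
```

The decoded bytes can then be written to a file or handed to an audio player.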
Can I monitor my remaining project balance via chat?
Absolutely. Use the 'get_balances' tool with your project ID. The agent will retrieve your current wallet thresholds and funding limits directly from Deepgram to ensure your audio pipelines stay active.
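Once balances are retrieved, a pipeline can flag any that fall below a floor you choose. The following is a minimal sketch under stated assumptions: the field names balance_id, amount, and units, and the sample values, are illustrative and not taken from this page, so adjust them to whatever 'get_balances' actually returns.

```python
def low_balances(balances, minimum):
    """Return the balance entries whose amount is below `minimum`."""
    return [entry for entry in balances if entry["amount"] < minimum]


# Hypothetical payload; field names and values are assumed for illustration.
sample = {
    "balances": [
        {"balance_id": "bal-1", "amount": 12.5, "units": "usd"},
        {"balance_id": "bal-2", "amount": 250.0, "units": "usd"},
    ]
}

flagged = low_balances(sample["balances"], minimum=50.0)
print([entry["balance_id"] for entry in flagged])
```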
Connect Deepgram with your favorite client
Step-by-step setup guides for every MCP-compatible client and framework:
Anthropic's native desktop app for Claude with built-in MCP support.
AI-first code editor with integrated LLM-powered coding assistance.
GitHub Copilot in VS Code with Agent mode and MCP support.
Purpose-built IDE for agentic AI coding workflows.
Autonomous AI coding agent that runs inside VS Code.
Anthropic's agentic CLI for terminal-first development.
Python SDK for building production-grade OpenAI agent workflows.
Google's framework for building production AI agents.
Type-safe agent development for Python with first-class MCP support.
TypeScript toolkit for building AI-powered web applications.
TypeScript-native agent framework for modern web stacks.
Python framework for orchestrating collaborative AI agent crews.
Leading Python framework for composable LLM applications.
Data-aware AI agent framework for structured and unstructured sources.
Microsoft's framework for multi-agent collaborative conversations.
Give your AI agents the power of Deepgram MCP Server
Production-grade Deepgram MCP Server. Verified, monitored, and maintained by Vinkius. Ready for your AI agents — connect and start using immediately.






