3,400+ MCP servers ready to use
Vinkius

Bring Speech To Text to LlamaIndex

Learn how to connect Deepgram to LlamaIndex and start using 6 AI agent tools in minutes. Fully managed, enterprise secure, and ready to use without writing a single line of code.

Convert Text To Speech · Get Project Usage · List API Keys · List Available Models · List Deepgram Projects · Transcribe Audio URL

What is the Deepgram MCP Server?

Connect your Deepgram account to any AI agent and take full control of your speech-to-text (STT) and text-to-speech (TTS) workflows through natural conversation.

What you can do

  • Transcription Orchestration: Convert speech from public audio or video URLs into text using Deepgram's Nova-3 models, with smart formatting and speaker diarization
  • Neural Speech Synthesis: Generate natural-sounding audio from text with the high-speed Aura engine for voice-enabled interfaces
  • Model Discovery: List the STT and TTS models Deepgram supports so you can choose the accuracy and latency trade-off that suits your content
  • Project & Usage Monitoring: Track API utilization, minutes consumed, and request counts across multiple projects for operational reporting
  • Credential Lifecycle: Retrieve identifiers for the active API keys associated with your projects to support security reviews

How it works

1. Subscribe to this server
2. Retrieve your API Key from the Deepgram Console
3. Start transcribing and synthesizing audio from Claude, Cursor, or any MCP client

No more manual file uploading or complex latency tuning in the portal. Your AI acts as your dedicated audio engineer and media production coordinator.

Who is this for?

  • Developers & Engineers — instantly transcribe recorded meetings and integrate high-speed TTS into applications using natural language commands
  • Content Creators — automate the generation of voiceovers and subtitles for global video assets without leaving your workspace
  • Research Teams — scale the processing of interview recordings and monitor usage limits through simple AI queries

Built-in capabilities (6)

convert_text_to_speech

Generate audio from text (TTS)

get_project_usage

Check API usage and limits

list_api_keys

List active API keys

list_available_models

List available STT and TTS models

list_deepgram_projects

List your Deepgram projects

transcribe_audio_url

Transcribe an audio file via URL
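
As a rough illustration of what an agent sends when it invokes one of the tools listed above, the sketch below builds an MCP `tools/call` request as a JSON-RPC 2.0 message. The tool name comes from this page; the argument names (`url`, `model`) and the example audio URL are assumptions, since the server's actual input schema is not documented here.

```python
import json
from itertools import count

# JSON-RPC 2.0 requests each carry an id; a simple counter is enough for a sketch.
_request_ids = count(1)


def build_tool_call(tool_name: str, arguments: dict) -> dict:
    """Build an MCP `tools/call` request as a JSON-RPC 2.0 message."""
    return {
        "jsonrpc": "2.0",
        "id": next(_request_ids),
        "method": "tools/call",
        "params": {"name": tool_name, "arguments": arguments},
    }


# Assumption: "url" and "model" are hypothetical argument names, not a confirmed schema.
request = build_tool_call(
    "transcribe_audio_url",
    {"url": "https://example.com/meeting.mp3", "model": "nova-3"},
)
print(json.dumps(request, indent=2))
```

In practice an MCP client library builds and sends this message for you; the sketch only shows the shape of the call.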

Why LlamaIndex?

LlamaIndex agents combine Deepgram tool responses with indexed documents for comprehensive, grounded answers. Connect 6 tools through Vinkius and query live data alongside vector stores and SQL databases in a single turn. This setup is ideal for hybrid search, data enrichment, and analytical workflows.

  • Data-first architecture: LlamaIndex agents combine Deepgram tool responses with indexed documents for comprehensive, grounded answers

  • Query pipeline framework lets you chain Deepgram tool calls with transformations, filters, and re-rankers in a typed pipeline

  • Multi-source reasoning: agents can query Deepgram, a vector store, and a SQL database in a single turn and synthesize results

  • Observability integrations show exactly what Deepgram tools were called, what data was returned, and how it influenced the final answer

High Security·Kill Switch·Plug and Play
Why Vinkius

Deepgram and 3,400+ other MCP servers. One platform. One governance layer.

Teams that connect Deepgram to LlamaIndex through Vinkius don't need to source, host, or maintain individual MCP servers. Every tool call runs inside a hardened runtime with credential isolation, DLP, and a signed audit chain.

3,400+ MCP servers ready
<40ms cold start
60% token savings
Feature                    | Raw MCP                | Vinkius
Server catalog             | Find and host yourself | 3,400+ managed
Infrastructure             | Self-hosted            | Sandboxed V8 isolates
Credential handling        | Plaintext in config    | Vault + runtime injection
Data loss prevention       | None                   | Configurable DLP policies
Kill switch                | None                   | Global instant shutdown
Financial circuit breakers | None                   | Per-server limits + alerts
Audit trail                | None                   | Ed25519 signed logs
SIEM log streaming         | None                   | Splunk, Datadog, Webhook
Honeytokens                | None                   | Canary alerts on leak
Custom domains             | Not applicable         | DNS challenge verified
GDPR compliance            | Manual effort          | Automated purge + export
Enterprise Security

Why teams choose Vinkius for Deepgram in LlamaIndex

The Deepgram MCP Server runs on Vinkius-managed infrastructure inside AWS — a purpose-built runtime with per-request V8 isolates, Ed25519 signed audit chains, and sub-40ms cold starts. All 6 tools execute in hardened sandboxes optimized for native MCP execution.

Your AI agents in LlamaIndex only access the data you authorize, with DLP that blocks sensitive information from ever reaching the model, a kill switch for instant shutdown, and up to 60% token savings. Enterprise-grade infrastructure, zero maintenance.

Deepgram
Fully Managed: Vinkius Servers
60%: Token savings
High Security: Enterprise-grade
IAM: Access control
EU AI Act: Compliant
DLP: Data protection
V8 Isolate: Sandboxed
Ed25519: Audit chain
<40ms: Kill switch
Stream every event to Splunk, Datadog, or your own webhook in real-time


The Vinkius Advantage

How Vinkius secures Deepgram for LlamaIndex

Every tool call from LlamaIndex to the Deepgram MCP Server is protected by DLP redaction, cryptographic audit chains, V8 sandbox isolation, kill switch, and financial circuit breakers.

<40ms cold start
Ed25519 signed audit chain
60% token savings
FAQ

Frequently asked questions

01

How do I get a Deepgram API Key?

Log in to the Deepgram Console, navigate to the API Keys section, and create a new key with the necessary permissions.

02

What is the Nova-3 model?

Nova-3 is Deepgram's latest transcription model, built for fast, accurate transcription of real-world audio.

03

Can I synthesize speech in different voices?

Yes! The convert_text_to_speech tool allows you to specify models like aura-asteria-en or aura-orion-en for different vocal profiles.
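
A minimal sketch of choosing a voice before calling the tool might look like the following. The two model names are the ones mentioned in the answer above; the argument keys (`text`, `model`) and the helper itself are hypothetical, as the tool's real input schema is not shown on this page.

```python
# Voice models named in the answer above; other models may exist.
KNOWN_VOICES = ("aura-asteria-en", "aura-orion-en")


def tts_arguments(text: str, voice: str = "aura-asteria-en") -> dict:
    """Return arguments for convert_text_to_speech (keys are assumed names)."""
    if not text.strip():
        raise ValueError("text must not be empty")
    if voice not in KNOWN_VOICES:
        raise ValueError(f"unknown voice {voice!r}; expected one of {KNOWN_VOICES}")
    # Assumption: the tool accepts "text" and "model" as argument keys.
    return {"text": text, "model": voice}


args = tts_arguments("Welcome to the weekly sync.", voice="aura-orion-en")
print(args)
```

Validating the voice name locally gives a clearer error than a failed tool call, at the cost of keeping the list in sync with what the server accepts.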

04

How does LlamaIndex connect to MCP servers?

Use the MCP client adapter (BasicMCPClient from the llama-index-tools-mcp package) to create a connection. LlamaIndex discovers the server's tools and wraps them as tool objects that any LlamaIndex agent can call.
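
The connection step could be sketched as below, assuming the `llama-index-tools-mcp` package is installed. `BasicMCPClient` and `McpToolSpec` come from that package; the server URL is a placeholder, because the real endpoint depends on your Vinkius subscription and is not given on this page.

```python
import asyncio

# Placeholder: substitute the endpoint shown for your own subscription.
MCP_SERVER_URL = "https://example.invalid/mcp"


async def load_deepgram_tools(server_url: str = MCP_SERVER_URL) -> list:
    """Connect to an MCP server and return its tools as LlamaIndex tools."""
    # Imported lazily so this module still loads before the package is installed.
    from llama_index.tools.mcp import BasicMCPClient, McpToolSpec

    client = BasicMCPClient(server_url)
    tool_spec = McpToolSpec(client=client)
    # Asks the server for its tool list and wraps each entry for agent use.
    return await tool_spec.to_tool_list_async()


async def main() -> None:
    tools = await load_deepgram_tools()
    for tool in tools:
        print(tool.metadata.name)

# To try it against a real endpoint: asyncio.run(main())
```

The returned list can then be passed to a LlamaIndex agent alongside any other tools, such as a query engine over a vector index.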

05

Can I combine MCP tools with vector stores?

Yes. LlamaIndex agents can query Deepgram tools and vector store indexes in the same turn, combining real-time and embedded data for grounded responses.

06

Does LlamaIndex support async MCP calls?

Yes. LlamaIndex's async agent framework supports concurrent MCP tool calls for high-throughput data processing pipelines.
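
To illustrate the concurrency pattern, the sketch below fans out several transcription calls with `asyncio.gather` and caps parallelism with a semaphore. The `call_tool` function is a stand-in that only simulates latency; it is not part of LlamaIndex or Vinkius, and the `url` argument name is an assumption.

```python
import asyncio


async def call_tool(name: str, arguments: dict) -> dict:
    """Stand-in for a real MCP tool call; replace with your client's call."""
    await asyncio.sleep(0.01)  # simulates network latency
    return {"tool": name, "arguments": arguments}


async def transcribe_many(urls: list[str], max_concurrency: int = 4) -> list[dict]:
    """Transcribe several URLs concurrently, capped by a semaphore."""
    limit = asyncio.Semaphore(max_concurrency)

    async def one(url: str) -> dict:
        async with limit:
            # Assumption: "url" is a hypothetical argument name.
            return await call_tool("transcribe_audio_url", {"url": url})

    # gather returns results in the same order as the input URLs.
    return await asyncio.gather(*(one(u) for u in urls))


results = asyncio.run(transcribe_many([
    "https://example.com/a.mp3",
    "https://example.com/b.mp3",
]))
print(len(results))
```

The semaphore is a design choice to keep a large batch from exceeding whatever request limits apply to your account.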

07

Why do I get a "BasicMCPClient not found" error?

The MCP tools package is not installed. Install it with: pip install llama-index-tools-mcp
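
One way to check for the missing dependency before running an agent is sketched below. It only tests whether the `llama_index.tools.mcp` module can be located; the helper name is hypothetical.

```python
import importlib.util


def mcp_tools_installed() -> bool:
    """Return True if the llama_index.tools.mcp module can be located."""
    try:
        return importlib.util.find_spec("llama_index.tools.mcp") is not None
    except ImportError:
        # Raised when a parent package (llama_index) is missing or fails to import.
        return False


if not mcp_tools_installed():
    print("Missing dependency. Run: pip install llama-index-tools-mcp")
```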