D-ID MCP Server
Create AI videos via D-ID — generate talking avatars from text or audio, list stock presenters, and monitor credit balance directly from any AI agent.
Vinkius AI Gateway supports streamable HTTP and SSE.

Works with every AI agent you already use
…and any MCP-compatible client


















D-ID MCP Server: see your AI Agent in action
Built-in capabilities (10)
create_clip
Create a D-ID clip using a stock presenter (no image needed). Pre-built digital humans with backgrounds
create_talk
Create a talking avatar video using D-ID. An AI avatar speaks your text with lip-sync and natural expressions
create_talk_audio
Create a D-ID talking avatar from a pre-recorded audio file. Avatar lip-syncs to your audio
delete_talk
Delete a D-ID talk
get_clip
Get status of a D-ID clip. Returns status and result URL
get_credits
Get current D-ID credit balance and plan info
get_talk
Wait for creation to finish. Get status of a D-ID talk. Returns status (created/started/done/error), result_url when done
list_presenters
List available D-ID presenters. Returns presenter IDs, names, and preview images
list_talks
List all D-ID talks. Returns talk IDs, statuses, and creation timestamps
upload_image
Upload a face image to D-ID for use as avatar source. Returns image URL
What this connector unlocks
Connect your D-ID account to any AI agent and take full control of your AI video generation and digital human workflows through natural conversation.
What you can do
- Talking Avatar Generation — Create high-quality videos where an AI avatar speaks your text with precise lip-sync and natural expressions using Microsoft or Amazon TTS
- Audio-to-Video Sync — Provision talking avatars from pre-recorded audio files, allowing the agent to generate videos that match your literal audio boundaries
- Digital Presenters — List and retrieve available D-ID stock presenters including pre-built digital humans and background options for fast video production
- Media Uploads — Upload face images to D-ID servers to use them as custom avatar sources for your personalized video content
- Run Monitoring — Track the status of your talks and clips (created, started, done, error) and retrieve final result URLs when processing is complete
- Credit Auditing — Get your current D-ID credit balance and plan info to verify remaining limits and manage video generation quotas
How it works
1. Subscribe to this server
2. Enter your D-ID API Key (found in D-ID Studio > Account Settings)
3. Start generating AI videos from Claude, Cursor, or any MCP-compatible client
Who is this for?
- Content Creators — generate personalized video messages and social media content without manual editing
- Marketers — create localized talking head videos for different markets using various TTS providers and voices
- Product Teams — quickly prototype digital human interactions and verify video outputs through natural language
- Developers — test and debug D-ID video generation pipelines and presenter mappings directly from the chat
Frequently asked questions
Give your AI agents the power of D-ID
Access D-ID and 2,000+ MCP servers — ready for your agents to use, right now. No glue code. No custom integrations. Just plug Vinkius AI Gateway and let your agents work.
More in this category

Lindy (Autonomous AI Employees)
10 toolsManage autonomous AI employees via Lindy — trigger task runs, monitor reasoning logs, and audit app integrations.

Vald
6 toolsPower your agent with Vald — query, insert, and manage dense vectors on a highly scalable, distributed nearest-neighbor engine.

Fireworks AI
6 toolsEmpower LLM applications via Fireworks AI — perform ultra-fast chat completions, generate embeddings and images, and transcribe audio directly from any AI agent.
You might also like

Front
12 toolsManage shared inboxes, track conversations, and collaborate on emails via AI agents with Front.

Adobe Acrobat Sign
10 toolsSend, track, and manage e-signatures via Adobe Acrobat Sign — create agreements, check signing status, and access audit trails from any AI agent.
Deep Talk
10 toolsEquip your AI agent to analyze conversation datasets, extract topics, and monitor sentiment via the Deep Talk API.
