D-ID MCP Server
Create AI videos via D-ID — generate talking avatars from text or audio, list stock presenters, and monitor credit balance directly from any AI agent.
Ask AI about this MCP Server
Vinkius supports streamable HTTP and SSE.

* Every MCP server runs on Vinkius-managed infrastructure inside AWS - a purpose-built runtime with per-request V8 isolates, Ed25519 signed audit chains, and sub-40ms cold starts optimized for native MCP execution. See our infrastructure
What is the D-ID MCP Server?
The D-ID MCP Server gives AI agents like Claude, ChatGPT, and Cursor direct access to D-ID via 10 tools. Create AI videos via D-ID — generate talking avatars from text or audio, list stock presenters, and monitor credit balance directly from any AI agent. Powered by the Vinkius - no API keys, no infrastructure, connect in under 2 minutes.
Built-in capabilities (10)
Tools for your AI Agents to operate D-ID
Ask your AI agent "Create a talking avatar video with script: 'Welcome to our new platform!'" and get the answer without opening a single dashboard. With 10 tools connected to real D-ID data, your agents reason over live information, cross-reference it with other MCP servers, and deliver insights you would spend hours assembling manually.
Works with Claude, ChatGPT, Cursor, and any MCP-compatible client. Powered by the Vinkius - your credentials never touch the AI model, every request is auditable. Connect in under two minutes.
Why teams choose Vinkius
One subscription gives you access to thousands of MCP servers - and you can deploy your own to the Vinkius Edge. Your AI agents only access the data you authorize, with DLP that blocks sensitive information from ever reaching the model, kill switch for instant shutdown, and up to 60% token savings. Enterprise-grade infrastructure and security, zero maintenance.
Build your own MCP Server with our secure development framework →Vinkius works with every AI agent you already use
…and any MCP-compatible client


















D-ID MCP Server capabilities
10 toolsCreate a D-ID clip using a stock presenter (no image needed). Pre-built digital humans with backgrounds
Create a talking avatar video using D-ID. An AI avatar speaks your text with lip-sync and natural expressions
Create a D-ID talking avatar from a pre-recorded audio file. Avatar lip-syncs to your audio
Delete a D-ID talk
Get status of a D-ID clip. Returns status and result URL
Get current D-ID credit balance and plan info
Wait for creation to finish. Get status of a D-ID talk. Returns status (created/started/done/error), result_url when done
List available D-ID presenters. Returns presenter IDs, names, and preview images
List all D-ID talks. Returns talk IDs, statuses, and creation timestamps
Upload a face image to D-ID for use as avatar source. Returns image URL
What the D-ID MCP Server unlocks
Connect your D-ID account to any AI agent and take full control of your AI video generation and digital human workflows through natural conversation.
What you can do
- Talking Avatar Generation — Create high-quality videos where an AI avatar speaks your text with precise lip-sync and natural expressions using Microsoft or Amazon TTS
- Audio-to-Video Sync — Provision talking avatars from pre-recorded audio files, allowing the agent to generate videos that match your literal audio boundaries
- Digital Presenters — List and retrieve available D-ID stock presenters including pre-built digital humans and background options for fast video production
- Media Uploads — Upload face images to D-ID servers to use them as custom avatar sources for your personalized video content
- Run Monitoring — Track the status of your talks and clips (created, started, done, error) and retrieve final result URLs when processing is complete
- Credit Auditing — Get your current D-ID credit balance and plan info to verify remaining limits and manage video generation quotas
How it works
1. Subscribe to this server
2. Enter your D-ID API Key (found in D-ID Studio > Account Settings)
3. Start generating AI videos from Claude, Cursor, or any MCP-compatible client
Who is this for?
- Content Creators — generate personalized video messages and social media content without manual editing
- Marketers — create localized talking head videos for different markets using various TTS providers and voices
- Product Teams — quickly prototype digital human interactions and verify video outputs through natural language
- Developers — test and debug D-ID video generation pipelines and presenter mappings directly from the chat
Frequently asked questions about the D-ID MCP Server
Can my agent create a talking avatar using a custom voice ID?
Yes. Use the 'create_talk' tool and specify the TTS provider (microsoft or amazon) and the exact voice ID. The agent will orchestrate the request to generate an avatar that speaks your script with that specific vocal identity.
How do I use a custom image as the source for my talking avatar?
First, use the 'upload_image' tool with a publicly accessible URL of your face image. The agent will upload it to D-ID and return a new internal URL which you can then pass as the 'source_url' in 'create_talk'.
Can I check my remaining D-ID credits through the agent?
Absolutely. Use the 'get_credits' tool. Your agent will pull your current balance and plan info directly from D-ID, helping you manage your video generation limits and quotas through natural conversation.
More in this category
You might also like
Connect D-ID with your favorite client
Step-by-step setup guides for every MCP-compatible client and framework:
Anthropic's native desktop app for Claude with built-in MCP support.
AI-first code editor with integrated LLM-powered coding assistance.
GitHub Copilot in VS Code with Agent mode and MCP support.
Purpose-built IDE for agentic AI coding workflows.
Autonomous AI coding agent that runs inside VS Code.
Anthropic's agentic CLI for terminal-first development.
Python SDK for building production-grade OpenAI agent workflows.
Google's framework for building production AI agents.
Type-safe agent development for Python with first-class MCP support.
TypeScript toolkit for building AI-powered web applications.
TypeScript-native agent framework for modern web stacks.
Python framework for orchestrating collaborative AI agent crews.
Leading Python framework for composable LLM applications.
Data-aware AI agent framework for structured and unstructured sources.
Microsoft's framework for multi-agent collaborative conversations.
Give your AI agents the power of D-ID MCP Server
Production-grade D-ID MCP Server. Verified, monitored, and maintained by Vinkius. Ready for your AI agents — connect and start using immediately.






