Datadog AI LLM Observability MCP for AI Agents. Monitor token usage and track model performance metrics in production systems

Datadog AI (LLM Observability) MCP allows you to monitor, audit, and track performance metrics for your LLMs in real-time. It lets your agent pull high-precision data on token usage, latency spikes, prompt content, and overall infrastructure health directly from your existing Datadog setup.

Claude

ChatGPT

Cursor

Gemini

Windsurf

VS Code

JetBrains

Vercel

See Vinkius in Action

Give Claude and any AI agent real-world access

Querying Token and Latency Metrics

Find the average token usage, peak consumption times, and overall latency for your models over specific periods.

Auditing Prompt Content and Model Spans

Retrieve detailed records of literal prompts and response traces, helping you debug exactly what inputs caused performance issues.

Checking for Active Service Outages

Monitor your infrastructure to detect real-time service disruptions or active outages blocking agent workflows.

Creating Performance Alerts

Set up monitors that alert you when AI responses drop below expected performance levels or hit resource limits.

Analyzing Global AI Infrastructure Spending

Enumerate widgets that graph total global spending and usage across different LLM providers, aiding budget planning.

Ask an AI about this

Waiting for input…

AI Agent

What AI agents can do with Datadog AI LLM Observability: 10 Tools for Model Performance Auditing

These tools let your agent perform deep checks on performance metrics, track service incidents, list available dashboards, and audit detailed usage spans.

Make your AI actually useful.

Add this MCP to Claude, Cursor, or Windsurf and your AI stops guessing. It gets real tools to look things up, take action, and handle the stuff you keep doing by hand.

Start using Datadog AI (LLM Observability) MCP

Create Event

Inspects deep internal arrays related to plan math calculations for debugging purposes.

Create Monitor

Creates explicit validation checks, allowing you to monitor specific metrics or...

List Dashboards

Retrieves a list of structured rules attached to billing accounts for monitoring...

List Events

Identifies precise active arrays spanning native gateway authentication records.

List Incidents

Dispatches an automated validation check to route explicit historical service outage...

Search Llm Spans

Searches for detailed JSON payload contents, providing hard customer usage bindings and context.

List Ai Monitors

Retrieves explicit cloud logging information that traces resource limits associated with AI models.

Query Metrics

Queries core LLM observability metrics, such as token count and latency, from the...

Submit Series

Performs structural extraction of properties that drive active account logic changes.

List Service Accounts

Identifies precise active arrays spanning native hold parsing records for service...

Security and governance baked right in.

Pick your AI client below to get set up. Just create a Vinkius account, subscribe, and you're instantly up and running. We handle the entire backend infrastructure, delivering out-of-the-box support for HTTPS Streamable, SSE, and OAuth2—zero messy routing required.

Datadog AI LLM Observability MCP for AI Agents MCP is compatible with Claude

Claude AI

Open Claude Settings

Go to claude.ai, click your profile icon, then navigate to Customize → Connectors.

Add Custom Connector

Click the "+" button and select Add custom connector. Paste your Vinkius endpoint URL:

https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp

Replace [YOUR_TOKEN_HERE] with your token from cloud.vinkius.com. For OAuth-protected servers, expand Advanced settings to add credentials.

Start a conversation

Open a new chat. The Datadog AI LLM Observability MCP for AI Agents integration is available immediately — no restart needed.

Antigravity

Configure Agent Environment

Open your Antigravity agent's workspace configuration or mcp-servers.json file.

Bind the Endpoint

Add the Vinkius endpoint URL to your agent's MCP connections list:

"mcp_servers": {
  "datadog-ai-llm-observability": {
    "serverUrl": "https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp"
  }
}

Provide your secure token in place of [YOUR_TOKEN_HERE] to ensure your agent requests are authenticated.

Execute

Start your Antigravity session. The agent will autonomously discover and utilize the Datadog AI LLM Observability MCP for AI Agents tools with full Vinkius guardrails applied.

Datadog AI LLM Observability MCP for AI Agents MCP is compatible with VS Code

VS Code Copilot

⚡

One-Click Install (Recommended)

In your Vinkius Dashboard, simply click the Add to VS Code button for this server. We'll automatically configure your local workspace.

Or configure manually

Open MCP Settings

Open VS Code, press Ctrl/Cmd + Shift + P, and search for GitHub Copilot: MCP Servers.

Add Server Config

Add the Vinkius endpoint configuration to your mcp-servers.json file:

"datadog-ai-llm-observability": {
  "url": "https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp"
}

Ensure you replace [YOUR_TOKEN_HERE] with your token from cloud.vinkius.com.

LangChain

Install Dependencies

Install the LangChain MCP adapters for your environment:

pip install langchain-mcp-adapters

Connect the Server

Use the SSEClient in LangChain to connect to the Vinkius managed endpoint:

from langchain_mcp_adapters.client import SSEClient

# Connect to Vinkius
client = SSEClient(url="https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp")
tools = client.get_tools()

CrewAI

Define the Tool

Load the Vinkius MCP tools into your CrewAI agents:

from crewai import Agent
from mcp_crewai import MCPTool

# Connect securely to Vinkius
vinkius_tools = MCPTool(url="https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp")

# Assign to Agent
researcher = Agent(
    role='Data Researcher',
    tools=vinkius_tools.get_all()
)

Execute Task

Run your CrewAI process. The agent will autonomously route tasks to the Vinkius managed server.

Choose How to Get Started

Build a custom MCP for your own tools, or connect a ready-made integration from our catalog.

Build Your Own

Turn any API into an MCP. Import a spec, define Agent Skills, or deploy with MCPFusion.

Import from OpenAPI, Swagger, or YAML specs
Create Agent Skills with progressive disclosure
Deploy to edge with MCPFusion framework
Built in DLP, auth, and compliance on each call
Real time usage dashboard and cost metering
Publish to catalog or keep private

Start building

Make Your AI Do More

Start with Datadog AI (LLM Observability), then connect any of our 5,200+ other servers whenever your AI needs more. One click, no limits.

Use this MCP plus 5,200+ others, all in one place
Add new capabilities to your AI anytime you want
Connections are secured and governed automatically
Track usage and costs across all your servers
Works with Claude, ChatGPT, Cursor, and more
New servers added to the catalog weekly

Datadog AI LLM Observability MCP for AI Agents MCP server cover

Independent Platform Disclaimer: Vinkius is an independent platform and is not affiliated with, endorsed by, sponsored by, verified by, or otherwise authorized by Datadog. All third-party trademarks, logos, and brand names are the property of their respective owners. Their use on this website is strictly for informational purposes to identify service compatibility and interoperability.

VINKIUS CLOUD

Cloud Hosted

Managed infra

V8 Isolated

Sandboxed per request

Zero-Trust Proxy

No stored credentials

DLP Enforced

Policy on each call

GDPR Compliant

EU data residency

Token Compression

~60% cost reduction

Your data is protected. See how we built it.

Datadog AI LLM Observability: Auditing Prompt Logs for Model Debugging

Manually debugging an LLM pipeline is a nightmare. You spend hours switching between the application logs, the metrics dashboard, and the billing console. You try to find out why latency spiked on Tuesday morning or which prompt caused that massive token burst—it’s a painful copy-paste cycle across multiple tabs.

With this MCP, you ask your agent, 'Show me all prompts run by Agent X between 9 AM and 10 AM.' The tool uses `search_llm_spans` to pull the full JSON payload content directly. You get the exact prompt logic, the response trace, and the associated metrics in a single chat output.

Datadog AI LLM Observability: Tracking Infrastructure Costs and Alerts

Before connecting this MCP, figuring out your total cost meant running separate reports for OpenAI, Anthropic, and internal compute. You had to piece together usage patterns from various billing systems, making it impossible to see the global picture.

Now, you can ask the agent, 'What is our projected spend next month if we increase volume?' The MCP pulls dashboard insights, giving you a unified graph of AI expenses across providers instantly. It shifts cost management from reactive auditing to proactive planning.

Support 24/7 support@vinkius.com ↗

Security Vinkius Trust Center ↗

SLA Service Level Agreement ↗

Report Listing Send Report ↗

llm-observability

token-usage

prompt-monitoring

ai-performance

telemetry

model-auditing

What Datadog AI LLM Observability MCP for AI Agents MCP does for your AI

Running models is complex; tracking their cost and performance shouldn't be. This MCP connects your AI client to your Datadog account so you can manage LLM observability through natural conversation. Instead of hopping between dashboards and logs, your agent handles the deep dive. You can query metrics for specific things like token counts or latency timeseries, pull full prompt logs, and even check active outages that might be blocking multi-agent workflows.

It also lets you view widgets graphing global AI expenses across providers like OpenAI and Anthropic.

When you connect this MCP via Vinkius, your agent gets immediate visibility into every part of your model stack—from simple usage tracking to complex incident reporting. You'll know exactly when a dynamic LLM model was switched out or if performance is starting to drop below established thresholds.

Built · Hosted · Managed by Vinkius Datadog AI LLM Observability MCP for AI Agents — Model Performance Tracking

Server ID 019d7581-a5af-72b6-a2cf-684e1f80d513

Vinkius Inspector

Compliance Grade A+

Score 100/100

Report View Report ↗

Benefits of connecting Datadog AI LLM Observability MCP for AI Agents MCP

Track actual resource consumption by querying specific metrics, like average tokens per request or latency spikes, using query_metrics.

Never miss an outage. Use list_incidents to get real-time updates on service disruptions that could halt your agentic workflows.

Manage performance automatically by calling create_monitor, setting alerts for when model responses fall below acceptable thresholds.

Keep a clean audit trail of every interaction. Utilize search_llm_spans to retrieve the exact prompt and response payload contents needed for debugging.

Control your costs proactively. You can view global spending patterns by using list_dashboards, giving you financial oversight across all model providers.

Datadog AI LLM Observability MCP for AI Agents MCP use cases

01 01

Debugging a sudden spike in costs

A developer noticed their monthly LLM bill was spiking. They asked their agent to check the logs, which used search_llm_spans to retrieve specific payloads and pinpointed that a single unoptimized prompt loop was causing excessive token usage.

02 02

Verifying model stability after an update

The MLOps team just rolled out Model v2. They used list_ai_monitors to check if all their existing performance monitors were still tracking correctly and confirmed that the new version maintained low latency metrics.

03 03

Diagnosing agent failure during peak hours

When an automated workflow failed, the SRE used list_incidents to check for active service disruptions. The report showed a temporary gateway authentication failure that was blocking multi-agent orchestration.

04 04

Optimizing cloud spending across multiple services

The FinOps team needed an overall view of AI spend. They used the MCP to enumerate global expenses, allowing them to compare usage patterns between OpenAI and Anthropic in one place.

Datadog AI LLM Observability MCP for AI Agents MCP tradeoffs

What to watch out for, and the recommended way to handle each one.

Checking logs manually

Avoid

Having to jump into Datadog's dashboard, filter by 'LLM,' then search for specific time ranges just to find out why latency spiked yesterday afternoon.

Instead

Just ask your agent. Use query_metrics and specify the exact timeframe you care about. The MCP handles all the filtering and data retrieval in one go.

Missing a service outage

Avoid

Assuming that because the application seems fine, nothing is wrong. You might miss a subtle background failure or an active incident blocking core functionality.

Instead

Always run list_incidents first. It checks for known and active outages before you start debugging specific model issues.

Guessing the root cause of high tokens

Avoid

Seeing a massive token count but not knowing which prompt or which user interaction caused it, leading to wasted time and inaccurate cost reports.

Instead

Run search_llm_spans. This gives you the full, literal payload context—the exact input and output that drove the high usage.

When to use Datadog AI LLM Observability MCP for AI Agents MCP

Use this MCP if your primary pain point is visibility into LLM performance, cost, or infrastructure health. If you need to know why a model was slow or expensive, this tool's access to detailed metrics and spans is critical. Don't use it if all you need is basic usage counting; simple billing APIs might suffice. However, if your issue involves diagnosing the root cause—like linking a spike in token counts back to a specific prompt structure—you absolutely need search_llm_spans. If you are only interested in general system status and not LLM-specific metrics, another type of infrastructure monitoring tool will work better.

Frequently asked questions about Datadog AI LLM Observability MCP for AI Agents MCP

How does the Datadog AI LLM Observability MCP help me track costs? +

It provides a unified view of your spending. Instead of checking separate billing portals for every provider, you can ask the agent to graph global expenses and see exactly which models are driving your highest costs.

I need to debug a failed LLM workflow; what should I use with this MCP? +

Use the tool that searches for LLM spans. It lets you pull the full prompt payload and response traces, showing you exactly which input caused the failure or poor output.

Can this MCP tell me if my AI services are currently down? +

Yes. By listing incidents, your agent checks for active outages and service disruptions across your entire infrastructure, ensuring that a simple background failure won't break your workflow.

How do I set up alerts for poor model performance using the Datadog AI LLM Observability MCP? +

You can use the capability to create monitors. You tell the agent what threshold you care about, and it sets up an alert that notifies you when the latency or token usage gets too high.

Is this Datadog AI LLM Observability MCP better than just checking raw logs? +

It's much better. Instead of drowning in raw, unstructured data, the MCP interprets those logs and presents you with actionable metrics—like average usage or specific failure points—in plain language.

Give Claude and any AI agent real-world access

What AI agents can do with Datadog AI LLM Observability: 10 Tools for Model Performance Auditing

Create Event

Inspects deep internal arrays related to plan math calculations for debugging purposes.

Create Monitor

Creates explicit validation checks, allowing you to monitor specific metrics or...

List Dashboards

Retrieves a list of structured rules attached to billing accounts for monitoring...

List Events

Identifies precise active arrays spanning native gateway authentication records.

List Incidents

Dispatches an automated validation check to route explicit historical service outage...

Search Llm Spans

Searches for detailed JSON payload contents, providing hard customer usage bindings and context.

List Ai Monitors

Retrieves explicit cloud logging information that traces resource limits associated with AI models.

Query Metrics

Queries core LLM observability metrics, such as token count and latency, from the...

Submit Series

Performs structural extraction of properties that drive active account logic changes.

List Service Accounts

Identifies precise active arrays spanning native hold parsing records for service...

Security and governance baked right in.

Claude AI

Open Claude Settings

Add Custom Connector

Start a conversation

Claude Code

Open your terminal

Add the MCP Server

Start coding

Cursor

One-Click Install (Recommended)

Open Cursor Settings

Add New Server

Use in Composer

Antigravity

Configure Agent Environment

Bind the Endpoint

Execute

VS Code Copilot

One-Click Install (Recommended)

Open MCP Settings

Add Server Config

Windsurf

One-Click Install (Recommended)

Open Windsurf Settings

Add Server Endpoint

LangChain

Install Dependencies

Connect the Server

CrewAI

Define the Tool

Execute Task

Choose How to Get Started

Build Your Own

Make Your AI Do More

Datadog AI LLM Observability: Auditing Prompt Logs for Model Debugging

Datadog AI LLM Observability: Tracking Infrastructure Costs and Alerts

llm-observability

token-usage

prompt-monitoring

ai-performance

telemetry

model-auditing

What Datadog AI LLM Observability MCP for AI Agents MCP does for your AI

How to set up Datadog AI LLM Observability MCP for AI Agents MCP

Who uses Datadog AI LLM Observability MCP for AI Agents MCP

Benefits of connecting Datadog AI LLM Observability MCP for AI Agents MCP

Datadog AI LLM Observability MCP for AI Agents MCP use cases

Debugging a sudden spike in costs

Verifying model stability after an update

Diagnosing agent failure during peak hours

Optimizing cloud spending across multiple services

Datadog AI LLM Observability MCP for AI Agents MCP tradeoffs

Checking logs manually

Missing a service outage

Guessing the root cause of high tokens

When to use Datadog AI LLM Observability MCP for AI Agents MCP

Frequently asked questions about Datadog AI LLM Observability MCP for AI Agents MCP