Datadog MCP for AI Agents. Query metrics and logs with natural conversation.

Q: How do I find specific errors using Datadog MCP?

You use the searchlogs tool. You just tell your agent what you're looking for, like '500 Internal Server Error from yesterday,' and it pulls structured data directly.

Q: Can I check if my service meets its goals with Datadog MCP?

Yes. You run listslos to see all defined Service Level Objectives, which instantly tells you the target percentage and your current compliance status for any monitored metric.

Q: What is the purpose of the querymetrics tool?

querymetrics retrieves time-series data. This lets you visualize performance trends, like CPU usage or request count, over a specific period to spot gradual degradation.

Q: Does Datadog MCP help with scheduled maintenance?

Yes, the listdowntimes tool checks for planned maintenance periods. This prevents you from wasting time troubleshooting an outage that was simply expected downtime.

Q: How do I see all available monitors quickly?

Use the listmonitors function to get a filtered list of every active monitor, letting you check their type, query definition, and current alert status instantly.

Datadog MCP connects your AI agent directly to infrastructure monitoring and log management data. Query performance metrics, search application logs for specific errors, and manage alert monitors without leaving your chat window or IDE. Monitor everything from service level objectives to host health using natural language commands.

Claude

ChatGPT

Cursor

Gemini

Windsurf

VS Code

JetBrains

Vercel

See Vinkius in Action

Give Claude and any AI agent real-world access

Querying Performance Metrics

Get time-series data for specific infrastructure or application metrics within a defined date range.

Searching Application Logs

Pull structured log entries to find traces and status codes related to errors or bottlenecks across services.

Managing Alerts and Monitors

View, list, and modify monitor configurations, checking current alert statuses or muting active alerts temporarily.

Inspecting Service Health Objectives (SLOs)

Retrieve the definitions of service level agreements, including target percentages and current compliance status for a given metric or monitor.

Reviewing Infrastructure Assets

List all connected hosts, view dashboard layouts, or identify scheduled maintenance periods to plan around.

Ask an AI about this

Waiting for input…

AI Agent

What AI agents can do with Datadog: 11 Tools for Observability

These tools allow you to query metrics, analyze log entries, check monitor states, and retrieve infrastructure metadata directly through your AI agent.

Make your AI actually useful.

Add this MCP to Claude, Cursor, or Windsurf and your AI stops guessing. It gets real tools to look things up, take action, and handle the stuff you keep doing by hand.

Start using Datadog MCP

List Dashboards

Lists all available monitoring dashboards and provides their titles, layout types, and direct URLs.

Query Metrics

Retrieves time-series data points for infrastructure or application metrics within a...

List Downtimes

Identifies planned maintenance windows by listing scheduled downtime periods and...

List Slos

Retrieves all Service Level Objective definitions, showing target percentages and...

Search Logs

Searches the log storage to find entries matching a query syntax, including...

List Monitors

Filters and returns metadata for all configured monitors, allowing you to check their type, current status (alert, ok), or query definition.

Get Monitor

Fetches detailed information about a specific monitor, including its thresholds, notification settings, and historical status changes.

Mute Monitor

Silences an active alert monitor for a set period of time to prevent unnecessary...

List Events

Provides a collection of system events, such as alerts or deployment actions...

Get Dashboard

Retrieves the full configuration details for a specific dashboard, including widget...

List Hosts

Lists all connected infrastructure hosts, showing their agent version, associated...

Security and governance baked right in.

Pick your AI client below to get set up. Just create a Vinkius account, subscribe, and you're instantly up and running. We handle the entire backend infrastructure, delivering out-of-the-box support for HTTPS Streamable, SSE, and OAuth2—zero messy routing required.

Claude AI

Open Claude Settings

Go to claude.ai, click your profile icon, then navigate to Customize → Connectors.

Add Custom Connector

Click the "+" button and select Add custom connector. Paste your Vinkius endpoint URL:

https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp

Replace [YOUR_TOKEN_HERE] with your token from cloud.vinkius.com. For OAuth-protected servers, expand Advanced settings to add credentials.

Start a conversation

Open a new chat. The Datadog integration is available immediately — no restart needed.

Antigravity

Configure Agent Environment

Open your Antigravity agent's workspace configuration or mcp-servers.json file.

Bind the Endpoint

Add the Vinkius endpoint URL to your agent's MCP connections list:

"mcp_servers": {
  "datadog": {
    "serverUrl": "https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp"
  }
}

Provide your secure token in place of [YOUR_TOKEN_HERE] to ensure your agent requests are authenticated.

Execute

Start your Antigravity session. The agent will autonomously discover and utilize the Datadog tools with full Vinkius guardrails applied.

VS Code Copilot

⚡

One-Click Install (Recommended)

In your Vinkius Dashboard, simply click the Add to VS Code button for this server. We'll automatically configure your local workspace.

Or configure manually

Open MCP Settings

Open VS Code, press Ctrl/Cmd + Shift + P, and search for GitHub Copilot: MCP Servers.

Add Server Config

Add the Vinkius endpoint configuration to your mcp-servers.json file:

"datadog": {
  "url": "https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp"
}

Ensure you replace [YOUR_TOKEN_HERE] with your token from cloud.vinkius.com.

LangChain

Install Dependencies

Install the LangChain MCP adapters for your environment:

pip install langchain-mcp-adapters

Connect the Server

Use the SSEClient in LangChain to connect to the Vinkius managed endpoint:

from langchain_mcp_adapters.client import SSEClient

# Connect to Vinkius
client = SSEClient(url="https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp")
tools = client.get_tools()

CrewAI

Define the Tool

Load the Vinkius MCP tools into your CrewAI agents:

from crewai import Agent
from mcp_crewai import MCPTool

# Connect securely to Vinkius
vinkius_tools = MCPTool(url="https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp")

# Assign to Agent
researcher = Agent(
    role='Data Researcher',
    tools=vinkius_tools.get_all()
)

Execute Task

Run your CrewAI process. The agent will autonomously route tasks to the Vinkius managed server.

Choose How to Get Started

Build a custom MCP for your own tools, or connect a ready-made integration from our catalog.

Build Your Own

Turn any API into an MCP. Import a spec, define Agent Skills, or deploy with MCPFusion.

Import from OpenAPI, Swagger, or YAML specs
Create Agent Skills with progressive disclosure
Deploy to edge with MCPFusion framework
Built in DLP, auth, and compliance on each call
Real time usage dashboard and cost metering
Publish to catalog or keep private

Start building

Make Your AI Do More

Start with Datadog, then connect any of our 5,200+ other servers whenever your AI needs more. One click, no limits.

Use this MCP plus 5,200+ others, all in one place
Add new capabilities to your AI anytime you want
Connections are secured and governed automatically
Track usage and costs across all your servers
Works with Claude, ChatGPT, Cursor, and more
New servers added to the catalog weekly

Independent Platform Disclaimer: Vinkius is an independent platform and is not affiliated with, endorsed by, sponsored by, verified by, or otherwise authorized by Datadog. All third-party trademarks, logos, and brand names are the property of their respective owners. Their use on this website is strictly for informational purposes to identify service compatibility and interoperability.

VINKIUS INFRASTRUCTURE

Cloud Hosted

Managed infra

V8 Isolated

Sandboxed per request

Zero-Trust Proxy

No stored credentials

DLP Enforced

Policy on each call

GDPR Compliant

EU data residency

Token Compression

~60% cost reduction

Your data is protected. See how we built it.

The Pain of Context Switching in Observability Solved with Vinkius AI Gateway

Today, figuring out why an application is slow means jumping between four different places. You start on the main dashboard to see if CPU usage spiked, then you switch tabs to look at recent alerts for warnings. Next, you copy a timestamp and paste it into the log viewer to find the corresponding error message. If nothing sticks, you have to open another tool just to check what Service Level Objectives are supposed to be.

With this MCP, that process collapses into one conversation. You ask your agent about the performance issue—for example, 'Why was latency high an hour ago?' The agent automatically runs `query_metrics` to establish the trend, then uses `search_logs` on that time window to find specific stack traces, and finally checks the relevant monitor status using `list_monitors`. You get a single, comprehensive answer.

Datadog MCP Provides Full Observability Control

You eliminate manual tasks like cross-referencing timestamps across multiple systems. No more copying IDs from the dashboard to manually look up details in a separate monitor list.

Now, you talk to your infrastructure. You use this MCP to manage alerts—you can even `mute_monitor` if the team is already aware of an issue, or check planned maintenance using `list_downtimes`. It gives you command over your system status without leaving your primary workflow.

Support 24/7 support@vinkius.com ↗

Security Vinkius Trust Center ↗

SLA Service Level Agreement ↗

Report Listing Send Report ↗

infrastructure-monitoring

log-analysis

performance-metrics

cloud-observability

alerting

real-time-monitoring

What your AI can actually do with this

Connecting Datadog via this MCP lets you take full command of complex cloud infrastructure monitoring right through a simple conversation with your AI agent. Instead of jumping between dashboards, sifting through raw logs, and cross-referencing alert status pages, you just ask. You can query time-series metrics to track performance trends over specific time periods or use the tool to search application logs for structural traces matching known errors.

Need to know if a service is healthy? Check active monitors by state or list out all your configured Service Level Objectives (SLOs) to see compliance status at a glance. Everything you need—from checking host metadata to identifying planned downtime—is available via natural language interaction, making troubleshooting faster and less painful.

Since this MCP lives on Vinkius, you connect once from any AI-compatible client and gain immediate access to top-tier observability tools like this one.

Built · Hosted · Managed by Vinkius Datadog MCP - Monitor Infrastructure Metrics & Logs

Server ID 019d7581-c015-7220-b99a-6852b938fd83

Vinkius Inspector

Compliance Grade A+

Score 100/100

Report View Report ↗

What Changes When You Connect

Stop context switching. Instead of jumping between the dashboard, log viewer, and alert list, your agent handles all three steps in one chat interaction.

Get immediate visibility into service health by querying Service Level Objectives (SLOs), which shows exactly how close or far a metric is from its compliance target.

Save time during incidents. Use list_monitors to quickly find every active alert and then use get_monitor to check if it needs muting before calling the team.

Pinpoint failures fast. You can search_logs for specific error codes across massive log volumes, instantly narrowing down bottlenecks without writing complex regex filters.

Understand your infrastructure deeply. Use list_hosts or get_dashboard to get metadata on every asset connected, including agent versions and cloud provider details.

See it in action

01 01

Investigating a sudden spike in latency

The developer asks the agent: 'Show me performance metrics for API latency last hour.' The agent uses query_metrics to find the time series data. They then use search_logs around that peak time, finding 503 errors, and finally check all active monitors using list_monitors to see if an alert was triggered.

02 02

Auditing a flaky service

The SRE needs assurance the system is stable. They ask the agent to list SLOs (list_slos). If compliance looks good, they check the dashboard details using get_dashboard and then use query_metrics on key resource usage to verify stability.

03 03

Handling planned downtime

A team member needs to schedule maintenance. They ask the agent to list scheduled downtimes (list_downtimes). This confirms if the window is clear, and they can use list_hosts afterward to ensure all target infrastructure nodes are accounted for.

04 04

Onboarding a new team member

A junior engineer needs to understand the system boundaries. They ask to list all dashboards (list_dashboards) and check which hosts are connected (list_hosts), giving them a clear map of the operational scope.

The honest tradeoffs

What to watch out for, and the recommended way to handle each one.

Treating logs like simple text searches

Avoid

Typing 'find all errors' into a generic chat client and hoping it gives structured data. This often results in unformatted, unusable log dumps.

Instead

Use search_logs with specific query syntax to retrieve entries matching status levels and structured attributes. Specify the time frame so you get actionable intelligence, not just noise.

Over-relying on dashboard visuals only

Avoid

Seeing a metric dip on a dashboard but having no idea why it dipped or what services were involved.

Instead

After seeing the trend from query_metrics, immediately cross-reference by using list_monitors to check if any specific alert rule was triggered, or use get_monitor for details.

Ignoring service agreements

Avoid

Assuming that because the system is running, it meets all performance goals. This fails when compliance slowly degrades over time.

Instead

Always confirm health by calling list_slos first. This shows you the official Service Level Objective definition and your current compliance status against defined targets.

When It Fits, When It Doesn't

Use this MCP if your primary need is deep operational observability, meaning you are analyzing performance metrics, debugging code failures, or managing alerts in a live system. If you need to know when something broke, search_logs and query_metrics are essential.

Don't use it if you are trying to write new application code (use an IDE-focused tool instead). Don't use it if your goal is purely strategic planning that doesn't rely on real-time data. If you need to know which tools exist, use list_dashboards; this MCP handles the content of those dashboards. This MCP assumes you already have a connected cloud infrastructure; if you are just starting setup, you might need a different discovery tool.

Questions you might have

How do I find specific errors using Datadog MCP? +

You use the search_logs tool. You just tell your agent what you're looking for, like '500 Internal Server Error from yesterday,' and it pulls structured data directly.

Can I check if my service meets its goals with Datadog MCP? +

Yes. You run list_slos to see all defined Service Level Objectives, which instantly tells you the target percentage and your current compliance status for any monitored metric.

What is the purpose of the `query_metrics` tool? +

query_metrics retrieves time-series data. This lets you visualize performance trends, like CPU usage or request count, over a specific period to spot gradual degradation.

Does Datadog MCP help with scheduled maintenance? +

Yes, the list_downtimes tool checks for planned maintenance periods. This prevents you from wasting time troubleshooting an outage that was simply expected downtime.

How do I see all available monitors quickly? +

Use the list_monitors function to get a filtered list of every active monitor, letting you check their type, query definition, and current alert status instantly.

View all recipes →

View all recipes

Give Claude and any AI agent real-world access

What AI agents can do with Datadog: 11 Tools for Observability

List Dashboards

Lists all available monitoring dashboards and provides their titles, layout types, and direct URLs.

Query Metrics

Retrieves time-series data points for infrastructure or application metrics within a...

List Downtimes

Identifies planned maintenance windows by listing scheduled downtime periods and...

List Slos

Retrieves all Service Level Objective definitions, showing target percentages and...

Search Logs

Searches the log storage to find entries matching a query syntax, including...

List Monitors

Filters and returns metadata for all configured monitors, allowing you to check their type, current status (alert, ok), or query definition.

Get Monitor

Fetches detailed information about a specific monitor, including its thresholds, notification settings, and historical status changes.

Mute Monitor

Silences an active alert monitor for a set period of time to prevent unnecessary...

List Events

Provides a collection of system events, such as alerts or deployment actions...

Get Dashboard

Retrieves the full configuration details for a specific dashboard, including widget...

List Hosts

Lists all connected infrastructure hosts, showing their agent version, associated...

Security and governance baked right in.

Claude AI

Open Claude Settings

Add Custom Connector

Start a conversation

Claude Code

Open your terminal

Add the MCP Server

Start coding

Cursor

One-Click Install (Recommended)

Open Cursor Settings

Add New Server

Use in Composer

Antigravity

Configure Agent Environment

Bind the Endpoint

Execute

VS Code Copilot

One-Click Install (Recommended)

Open MCP Settings

Add Server Config

Windsurf

One-Click Install (Recommended)

Open Windsurf Settings

Add Server Endpoint

LangChain

Install Dependencies

Connect the Server

CrewAI

Define the Tool

Execute Task

Choose How to Get Started

Build Your Own

Make Your AI Do More

The Pain of Context Switching in Observability Solved with Vinkius AI Gateway

Datadog MCP Provides Full Observability Control

infrastructure-monitoring

log-analysis

performance-metrics

cloud-observability

alerting

real-time-monitoring

What your AI can actually do with this

Here's how it actually works

Who is this actually for?

What Changes When You Connect

Investigating a sudden spike in latency

Auditing a flaky service

Handling planned downtime

Onboarding a new team member

The honest tradeoffs

Treating logs like simple text searches

Over-relying on dashboard visuals only

Ignoring service agreements

When It Fits, When It Doesn't