Observability & Monitoring for AI Agents

Sentry. Grafana. Datadog. New Relic. PagerDuty. The telemetry layers that actually matter. Direct access to logs, traces, and metrics: no dashboard fatigue, just raw signal.

Curated by the Vinkius team: 5 MCP servers reviewed, tested, and ready to connect. Create a free account and start in seconds, with no infrastructure or code needed.

01 MCP Server

Sentry MCP Server

Raw exception tracking. Stack traces and crash contexts without the noise.

Unhandled exceptions kill reliability. This MCP Server gives your agents raw execution context from your Sentry projects. They can pull stack traces, map failing commits, and resolve issues without touching the UI. Signal through the noise: your AI triages the exact line of failing code, the way an incident commander would.

Issue queries with stack trace context

Release health & regression tracking

Error resolution & assignment workflows
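
For a sense of what "pulling issues without touching the UI" looks like on the wire, here is a minimal sketch of the JSON-RPC 2.0 message an MCP client sends to invoke a tool. The `tools/call` method and the `name`/`arguments` fields come from the MCP specification; the tool name `list_issues`, the argument keys, and the project slug are assumptions made up for illustration, not this server's documented interface. The query string uses Sentry's issue search syntax.

```python
import json
from itertools import count

# JSON-RPC request ids should be unique within a session.
_request_ids = count(1)


def build_tool_call(tool_name: str, arguments: dict) -> str:
    """Serialize an MCP `tools/call` request as a JSON-RPC 2.0 message."""
    request = {
        "jsonrpc": "2.0",
        "id": next(_request_ids),
        "method": "tools/call",
        "params": {"name": tool_name, "arguments": arguments},
    }
    return json.dumps(request)


# Hypothetical tool name and argument keys (not taken from the catalog).
message = build_tool_call(
    "list_issues",
    {"project": "checkout-api", "query": "is:unresolved level:error"},
)
print(message)
```

In practice an MCP client library builds this envelope for you; the sketch only shows that a tool call is a named function plus a JSON argument object.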

Rollbar MCP Server

Rollbar

Real-time error tracking & deployment analysis

BugSnag MCP Server

BugSnag

Application stability monitoring & crash reports

Honeybadger MCP Server

Honeybadger

Error & uptime monitoring for developers

Checkly

Synthetic monitoring & API checks as code

02 MCP Server

Grafana MCP Server

LogQL, traces, and metrics. Raw querying across your entire telemetry stack.

When an outage hits, dashboard fatigue is real. This MCP Server drops your agents into Grafana so they can run LogQL queries and pull Tempo traces directly. They don't just look at graphs; they retrieve the underlying raw telemetry. If your agent needs to correlate a latency spike with a specific pod, this is how you automate the investigation.

Dashboard queries & metric visualization

LogQL log queries & Tempo tracing

Alert rule management & annotations
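
Narrowing logs to one pod is a matter of writing the right LogQL stream selector. The sketch below builds one from a label map and an optional line filter. The selector and `|=` syntax are standard LogQL; the helper name `logql_query` and the label values are hypothetical, and how the query is handed to the server depends on tool names this page does not list.

```python
from typing import Dict, Optional


def _escape(value: str) -> str:
    """Escape backslashes and double quotes for a LogQL string literal."""
    return value.replace("\\", "\\\\").replace('"', '\\"')


def logql_query(labels: Dict[str, str], contains: Optional[str] = None) -> str:
    """Build a LogQL stream selector with an optional `|=` line filter."""
    if not labels:
        raise ValueError("LogQL requires at least one label matcher")
    matchers = ", ".join(
        f'{name}="{_escape(value)}"' for name, value in sorted(labels.items())
    )
    query = "{" + matchers + "}"
    if contains:
        query += f' |= "{_escape(contains)}"'
    return query


# Hypothetical label values for a pod showing a latency spike.
print(logql_query({"namespace": "prod", "pod": "api-7d9f"}, contains="timeout"))
# prints: {namespace="prod", pod="api-7d9f"} |= "timeout"
```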

Incident.io MCP Server

Incident.io

Modern incident response & on-call platform

Opsgenie MCP Server

Opsgenie

Alert routing & on-call by Atlassian

FireHydrant MCP Server

FireHydrant

Incident lifecycle management for DevOps

Better Stack MCP Server

Better Stack

Uptime, logs & incident management in one

03 MCP Server

Datadog MCP Server

Infrastructure signals and APM traces without the UI friction.

Context switching during an incident wastes precious minutes. This MCP Server connects your agents to Datadog so they can pull host metrics and parse faceted APM traces. We're talking immediate access to the trace waterfalls that explain why a request failed. If your agent is just summarizing alerts without reading the underlying trace, it's not actually helping. Get the raw data.

Infrastructure metrics & host monitoring

Log queries with faceted search

APM traces & alert monitor management
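
Faceted search comes down to a query string. In Datadog's log search syntax, reserved attributes such as `service` and `status` are written as `key:value`, and facets on custom attributes take an `@` prefix. The sketch below assembles such a query; the helper name `datadog_log_query` and the example values are hypothetical, and it makes no assumption about how the server accepts the string.

```python
from typing import Any, Dict


def _term(key: str, value: Any) -> str:
    """Render one key:value search term, quoting values that contain whitespace."""
    text = str(value)
    if any(ch.isspace() for ch in text):
        text = f'"{text}"'
    return f"{key}:{text}"


def datadog_log_query(reserved: Dict[str, Any], facets: Dict[str, Any]) -> str:
    """Join reserved attributes and @-prefixed facets into one search query."""
    terms = [_term(key, value) for key, value in reserved.items()]
    terms += [_term("@" + key, value) for key, value in facets.items()]
    return " ".join(terms)


# Hypothetical service name and facet values.
print(datadog_log_query(
    {"service": "checkout", "status": "error"},
    {"http.status_code": 500},
))
# prints: service:checkout status:error @http.status_code:500
```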

UptimeRobot MCP Server

UptimeRobot

Free uptime monitoring for 2M+ monitors

Pingdom MCP Server

Pingdom

Website performance & uptime monitoring

HetrixTools MCP Server

HetrixTools

Server & blacklist monitoring infrastructure

Honeycomb MCP Server

Honeycomb

High-cardinality observability & distributed tracing

04 MCP Server

New Relic MCP Server

NRQL execution. Raw transaction tracing and infrastructure health.

Your telemetry is useless if the agent can't query it. This MCP Server opens up New Relic's NRQL engine to your AI. Your agents can run heavy metric aggregations across global services and pull error rate changes instantly. Keep the analysis close to the telemetry, and let the agent retrieve exactly the spans it needs to identify the bottleneck.

NRQL queries across all telemetry

Application performance & error rate monitoring

Transaction trace analysis & infrastructure health
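
An error-rate check is a one-line NRQL query. The sketch below builds one with `percentage(count(*), WHERE error IS true)` over the `Transaction` event type, faceted per application. The NRQL clauses are standard; the helper name `error_rate_nrql` and the default facet are assumptions for illustration, and the time window is validated so nothing unexpected is interpolated into the query.

```python
def error_rate_nrql(since_minutes: int, facet: str = "appName") -> str:
    """Build an NRQL query for the error percentage per facet value."""
    if since_minutes <= 0:
        raise ValueError("since_minutes must be positive")
    if not facet.isidentifier():
        raise ValueError("facet must be a plain attribute name")
    return (
        "SELECT percentage(count(*), WHERE error IS true) "
        f"FROM Transaction FACET {facet} SINCE {since_minutes} minutes ago"
    )


print(error_rate_nrql(30))
# prints: SELECT percentage(count(*), WHERE error IS true) FROM Transaction FACET appName SINCE 30 minutes ago
```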

Datadog Cloud SIEM MCP Server

Datadog Cloud SIEM

Cloud-native SIEM with real-time threat detection

Elastic Security MCP Server

Elastic Security

SIEM, endpoint & cloud security in one

Sumo Logic MCP Server

Sumo Logic

Cloud-native machine data analytics & SIEM

Metaplane MCP Server

Metaplane

Data observability & anomaly detection

05 MCP Server

PagerDuty MCP Server

Incident routing and on-call state management.

At 3 AM, nobody wants to figure out who is on call. This MCP Server gives your agents read/write access to PagerDuty's incident layer. They can acknowledge pages, route alerts to the correct escalation policy, and append diagnostic context to the timeline. Manually updating incident state is toil; let the agent handle the orchestration so engineers can actually fix the outage.

Incident creation, acknowledgment & resolution

On-call schedule & escalation policy management

Service routing & real-time incident timelines
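
What this server sends upstream is not described on this page. As one illustration of what "acknowledge" means at the API level, PagerDuty's Events API v2 takes a small JSON body with `routing_key`, `event_action`, and `dedup_key`. The sketch below builds that body for acknowledge and resolve actions; the helper name `incident_event` and the key values are placeholders, and using the Events API here (rather than whatever the MCP server calls internally) is an assumption.

```python
from typing import Dict

# A "trigger" event also needs a payload (summary, source, severity),
# so this sketch only covers the two state-change actions.
_STATE_ACTIONS = {"acknowledge", "resolve"}


def incident_event(action: str, routing_key: str, dedup_key: str) -> Dict[str, str]:
    """Build an Events API v2 body that acknowledges or resolves an alert."""
    if action not in _STATE_ACTIONS:
        raise ValueError(f"unsupported action: {action!r}")
    return {
        "routing_key": routing_key,  # integration key of the target service
        "event_action": action,
        "dedup_key": dedup_key,      # identifies the alert being updated
    }


# Placeholder keys for illustration only.
body = incident_event("acknowledge", "ROUTING_KEY_PLACEHOLDER", "checkout-latency-1")
print(body)
```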

Prefect MCP Server

Prefect

Workflow orchestration with built-in observability

Temporal MCP Server

Temporal

Durable execution with workflow visibility

Dagster MCP Server

Dagster

Data pipeline orchestration & asset monitoring

Trigger.dev MCP Server

Trigger.dev

Background jobs with real-time run visibility

Logs. Metrics. Alerts. Traces. Incidents. Ready for AI Agents.

Free to start. Connect these servers to your AI agents in seconds, with no infrastructure to set up and no code to write.

Hosting, security, updates, and uptime are all on us. You just connect and use.
