Vinkius
App Catalog
AI Stack

Observability & Monitoring for AI Agents

Sentry. Grafana. Datadog. New Relic. PagerDuty. The telemetry layers that actually matter. Direct access to logs, traces, and metrics—no dashboard fatigue, just raw signal.

Curated by the Vinkius team -- 5 MCP servers reviewed, tested, and ready to connect. Create a free account and start in seconds -- no infrastructure or code needed.

Sentry MCP Server
01 MCP Server

Sentry MCP Server

Raw exception tracking. Stack traces and crash contexts without the noise.

sentry.io

Look, unhandled exceptions kill reliability. This MCP Server gives your agents raw execution context against your Sentry projects. They can pull stack traces, map failing commits, and resolve issues without touching the UI. —Signal through the noise—your AI triages the exact line of failing code, exactly how an incident commander would.

Issue queries with stack trace context
Release health & regression tracking
Error resolution & assignment workflows
Connect your agent
Grafana MCP Server
02 MCP Server

Grafana MCP Server

LogQL, traces, and metrics. Raw querying across your entire telemetry stack.

grafana.com

When an outage hits, dashboard fatigue is real. This MCP Server drops your agents into Grafana so they can run LogQL queries and pull Tempo traces directly. They don't just look at graphs; they retrieve the underlying raw telemetry. The reality is, if your agent needs to correlate a latency spike to a specific pod, this is how you automate the investigation.

Dashboard queries & metric visualization
LogQL log queries & Tempo tracing
Alert rule management & annotations
Connect your agent
Datadog MCP Server
03 MCP Server

Datadog MCP Server

Infrastructure signals and APM traces without the UI friction.

datadoghq.com

Context switching during an incident wastes precious minutes. This MCP Server connects your agents to Datadog so they can pull host metrics and parse faceted APM traces. We're talking immediate access to the trace waterfalls that explain why a request failed. Let's be clear, if your agent is just summarizing alerts without reading the underlying trace, it's not actually helping. Get the raw data.

Infrastructure metrics & host monitoring
Log queries with faceted search
APM traces & alert monitor management
Connect your agent
New Relic MCP Server
04 MCP Server

New Relic MCP Server

NRQL execution. Raw transaction tracing and infrastructure health.

newrelic.com

Your telemetry is useless if the agent can't query it. This MCP Server opens up New Relic's NRQL engine to your AI. They can run heavy metric aggregations across global services and pull error rate changes instantly. —You know the drill—keep the analysis close to the telemetry, and let the agent retrieve exactly the spans it needs to identify the bottleneck.

NRQL queries across all telemetry
Application performance & error rate monitoring
Transaction trace analysis & infrastructure health
Connect your agent
PagerDuty MCP Server
05 MCP Server

PagerDuty MCP Server

Incident routing and on-call state management.

pagerduty.com

At 3 AM, nobody wants to figure out who is on call. This MCP Server gives your agents read/write access to PagerDuty's incident layer. They can acknowledge pages, route alerts to the correct escalation policy, and append diagnostic context to the timeline. Honestly, manually updating incident state is toil; let the agent handle the orchestration so engineers can actually fix the outage.

Incident creation, acknowledgment & resolution
On-call schedule & escalation policy management
Service routing & real-time incident timelines
Connect your agent

Logs. Metrics. Alerts. Traces. Incidents. Ready for AI Agents.

Free to start. Connect these servers to your AI agents in seconds -- no infrastructure to set up, no code to write.

Hosting, security, updates, and uptime -- all on us. You just connect and use.