Vinkius

New Relic AI (LLM Observability) MCP. Get total cost and performance metrics via conversation.

New Relic AI (LLM Observability) lets you pull performance data, token costs, and user feedback directly from your LLMs using natural conversation. Instead of logging into dashboards to check p95 latency or calculating total USD spend, you ask your agent for the metrics immediately. Track every chat completion, audit model behavior, and verify infrastructure health—all in one place.

New Relic AI (LLM Observability) MCP is compatible with Claude Claude
New Relic AI (LLM Observability) MCP is compatible with ChatGPT ChatGPT
New Relic AI (LLM Observability) MCP is compatible with Cursor Cursor
New Relic AI (LLM Observability) MCP is compatible with Gemini Gemini
New Relic AI (LLM Observability) MCP is compatible with Windsurf Windsurf
New Relic AI (LLM Observability) MCP is compatible with VS Code VS Code
New Relic AI (LLM Observability) MCP is compatible with JetBrains JetBrains
New Relic AI (LLM Observability) MCP is compatible with Vercel Vercel
See Vinkius in Action

Give Claude and any AI agent real-world access

Audit LLM Performance Metrics

Get average response times and the 95th percentile latency data to ensure your models remain fast.

Track Token Expenditure

Calculate precise USD costs for all token usage across your entire AI infrastructure.

Review Model Interactions

Retrieve detailed chat completion messages and original prompts to audit model behavior in real-time.

Measure User Satisfaction

Fetch chronological user feedback and 1-5 rating scores provided by human supervisors.

Execute Custom Queries

Run advanced, read-only queries using the New Relic Query Language (NRQL) against your AI datasets.

Monitor Infrastructure Health

Examine active APM apps, dashboards, and alert policies to check overall system integrity.

Waiting for input…

AI Agent
New Relic AI (LLM Observability)

What AI agents can do with New Relic AI (LLM Observability): 10 Tools

Use these tools to manage everything from calculating precise LLM token costs and checking system latency to auditing user feedback ratings.

Make your AI actually useful.

Add this MCP to Claude, Cursor, or Windsurf and your AI stops guessing. It gets real tools to look things up, take action, and handle the stuff you keep doing by hand.

Start using New Relic AI (LLM Observability) MCP

List Alert Policies

Checks all existing automated alerts configured for the system's plan math.

List Apm Apps

Retrieves a list of currently running APM applications to validate service status.

Custom Nrql

Runs sophisticated, read-only queries using the New Relic Query Language (NRQL) for...

List Dashboards

Finds all active operational dashboards tied to native Gateway authentication.

Query Llm Errors

Identifies and lists specific error logs related to LLM processing.

Query Llm Costs

Calculates the precise monetary cost of tokens used by your agents over a specified period.

Query Llm Events

Retrieves bounded records tracking general activity within the New Relic platform.

Query Llm Feedback

Gathers human-submitted feedback and rating scores associated with LLM outputs.

Query Llm Latency

Measures the speed of your LLMs by retrieving p95 latency matrices and average...

Post Custom Event

Sends custom telemetry rows to track unique internal states or behaviors within your...

Security and governance baked right in.

Pick your AI client below to get set up. Just create a Vinkius account, subscribe, and you're instantly up and running. We handle the entire backend infrastructure, delivering out-of-the-box support for HTTPS Streamable, SSE, and OAuth2—zero messy routing required.

New Relic AI (LLM Observability) MCP is compatible with Claude

Claude AI

1

Open Claude Settings

Go to claude.ai, click your profile icon, then navigate to Customize → Connectors.

2

Add Custom Connector

Click the "+" button and select Add custom connector. Paste your Vinkius endpoint URL:

https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp

Replace [YOUR_TOKEN_HERE] with your token from cloud.vinkius.com. For OAuth-protected servers, expand Advanced settings to add credentials.

3

Start a conversation

Open a new chat. The New Relic AI (LLM Observability) integration is available immediately — no restart needed.

Choose How to Get Started

Build a custom MCP for your own tools, or connect a ready-made integration from our catalog.

Build Your Own

Turn any API into an MCP. Import a spec, define Agent Skills, or deploy with MCPFusion.

  • Import from OpenAPI, Swagger, or YAML specs
  • Create Agent Skills with progressive disclosure
  • Deploy to edge with MCPFusion framework
  • Built in DLP, auth, and compliance on each call
  • Real time usage dashboard and cost metering
  • Publish to catalog or keep private
Start building

Make Your AI Do More

Start with New Relic AI (LLM Observability), then connect any of our 5,200+ other servers whenever your AI needs more. One click, no limits.

  • Use this MCP plus 5,200+ others, all in one place
  • Add new capabilities to your AI anytime you want
  • Connections are secured and governed automatically
  • Track usage and costs across all your servers
  • Works with Claude, ChatGPT, Cursor, and more
  • New servers added to the catalog weekly
New Relic AI (LLM Observability) MCP server cover

Independent Platform Disclaimer: Vinkius is an independent platform and is not affiliated with, endorsed by, sponsored by, verified by, or otherwise authorized by New Relic AI. All third-party trademarks, logos, and brand names are the property of their respective owners. Their use on this website is strictly for informational purposes to identify service compatibility and interoperability.

VINKIUS CLOUD

Cloud Hosted

Managed infra

V8 Isolated

Sandboxed per request

Zero-Trust Proxy

No stored credentials

DLP Enforced

Policy on each call

GDPR Compliant

EU data residency

Token Compression

~60% cost reduction

Your data is protected. See how we built it.

The Visibility Gap: Where AI Costs and Performance Go Missing

Right now, understanding your LLM stack is a nightmare. To figure out why costs spiked or why responses slowed down, you have to jump between New Relic's billing dashboards, the APM console, and raw chat logs. You spend time clicking through tabs, copying metrics, and trying to stitch together one single story: 'It cost X dollars because it was slow.'

With this MCP, that manual process vanishes. You simply ask your agent a question like, 'What's the token usage trend over the last week?' The tool runs `query_llm_costs` and provides the answer immediately in conversation, connecting performance metrics to actual dollars spent.

Get LLM Observability with New Relic AI (LLM Observability)

Manual monitoring requires checking multiple endpoints: logging into the query interface for `query_llm_latency`, going to a separate dashboard tool to check `list_dashboards`, and then manually calculating costs via an external spreadsheet. It's slow, and it's incomplete.

Now, your agent handles all of that complexity. You get instant access to performance data, error logs (`query_llm_errors`), and resource usage checks—all through a single chat interface. Your focus shifts from dashboard maintenance to making the AI better.

What New Relic AI (LLM Observability) MCP does for your AI

You run complex AI agents that use Large Language Models (LLMs). Things break, costs spike unexpectedly, or performance dips when nobody is looking. This MCP connects New Relic AI to your existing agent workflow, giving you full visibility into everything happening under the hood. You can ask for total token usage across all models in dollars and cents.

Need to know why responses slow down? Check the p95 latency metrics instantly. Want to audit model behavior? Review raw chat completion messages to understand exactly what the LLM saw or generated. This access means you don't have to jump between cost dashboards, performance monitoring tools, and logs just to get a complete picture.

By connecting this MCP via Vinkius, your agent becomes an operational detective for your AI stack.

Built · Hosted · Managed by Vinkius New Relic AI (LLM Observability) - Track Costs & Latency
Server ID 019d75dc-e7ba-70bb-8f02-309d5f2787c7
Vinkius Inspector
Compliance Grade A+
Score 100/100
Vinkius Inspector Badge — Score 100/100

Frequently asked questions about New Relic AI (LLM Observability) MCP

How does New Relic AI (LLM Observability) track token costs? +

This MCP uses query_llm_costs to calculate your total LLM token spend. It gives you the exact USD consumption across different models and services, so you never lose money tracking usage.

Can I check my LLM performance latency with this MCP? +

Yes, use query_llm_latency. It pulls p95 latency matrices and average response times, helping you pinpoint exactly when your agent's responses slow down.

What kind of data can I audit with New Relic AI (LLM Observability)? +

You can audit everything: chat completion messages for model behavior, human supervisor feedback using query_llm_feedback, and raw internal agent states via post_custom_event.

Is New Relic AI (LLM Observability) read-only? +

Yes. The tool uses mechanisms like custom_nrql which are strictly read-only queries, meaning you can pull insights without risking any changes to your live infrastructure.

Does this MCP help with general system health checks? +

It does. You can use tools like list_apm_apps and list_alert_policies to check the operational status of your entire environment, not just the LLM component.