3,400+ servers built on vurb.ts
Vinkius
O

Bring Llm Tracing
to OpenAI Agents SDK

Learn how to connect Langfuse (LLM Tracing & Evals) to OpenAI Agents SDK and start using 10 AI agent tools in minutes. Fully managed, enterprise secure, and ready to use without writing a single line of code.

MCP Inspector GDPR Free for Subscribers
Create ObservationCreate ScoreGet Daily MetricsGet ObservationGet TraceList ObservationsList PromptsList ScoresList SessionsList Traces
Langfuse (LLM Tracing & Evals)

What is the Langfuse (LLM Tracing & Evals) MCP Server?

Connect your Langfuse account to any AI agent and take full control of your LLM observability, prompt management, and quality evaluation through natural conversation.

What you can do

  • Trace Orchestration — List and retrieve detailed traces of LLM API sessions, exposing latencies, token counts, and exact chained payloads directly from your agent
  • Prompt Vault Access — Query actively managed prompt templates and versions to inspect system instructions and expected input variables
  • Observation Analysis — Deep-dive into individual spans, events, and generations within a trace to pinpoint failures or performance bottlenecks securely
  • Evaluation & Scoring — Attach structured human feedback or automated evaluation metrics to specific traces to monitor model grounding and accuracy
  • Usage Metrics — Generate aggregated daily reports on USD costs and average latency to track your AI infrastructure spending in real-time
  • Session Monitoring — Extract correlated user sessions to understand multi-turn interaction boundaries and improve long-term agentic workflows

How it works

1. Subscribe to this server
2. Enter your Langfuse API URL, Public Key, and Secret Key
3. Start monitoring your LLM application from Claude, Cursor, or any MCP-compatible client

Who is this for?

  • LLM Engineers — debug complex AI chains and measure exact token latencies through natural conversation without manual dashboard searching
  • Product Owners — monitor daily AI costs and user satisfaction scores across multiple production environments
  • Data Scientists — manage prompt templates and audit evaluation metrics to improve model response quality and grounding efficiently

Built-in capabilities (10)

create_observation

Create a new LLM observation (span, event, generation) inside a trace

create_score

g. 1-5 stars) or automated pipeline metrics bounding exactly onto the specified Trace or Observation. Attach human feedback or evaluation metrics to a trace/observation

get_daily_metrics

Generate rolled-up USD cost and aggregated latency statistics

get_observation

Retrieve explicit span or generation context within a trace

get_trace

Get complete telemetry and nested graph for a single trace

list_observations

List raw observation objects spanning across traces

list_prompts

Extract actively managed prompt templates and versions

list_scores

List all explicit scores mapping quality or cost algorithms

list_sessions

List high-level user session entities encapsulating multiple traces

list_traces

List all traces tracking LLM API sessions

Why OpenAI Agents SDK?

The OpenAI Agents SDK auto-discovers all 10 tools from Langfuse (LLM Tracing & Evals) through native MCP integration. Build agents with built-in guardrails, tracing, and handoff patterns. chain multiple agents where one queries Langfuse (LLM Tracing & Evals), another analyzes results, and a third generates reports, all orchestrated through Vinkius.

  • Native MCP integration via MCPServerSse, pass the URL and the SDK auto-discovers all tools with full type safety

  • Built-in guardrails, tracing, and handoff patterns let you build production-grade agents without reinventing safety infrastructure

  • Lightweight and composable: chain multiple agents and MCP servers in a single pipeline with minimal boilerplate

  • First-party OpenAI support ensures optimal compatibility with GPT models for tool calling and structured output

O
See it in action

Langfuse (LLM Tracing & Evals) in OpenAI Agents SDK

AI AgentVinkius
High Security·Kill Switch·Plug and Play
Why Vinkius

Langfuse (LLM Tracing & Evals) and 3,400+ other MCP servers. One platform. One governance layer.

Teams that connect Langfuse (LLM Tracing & Evals) to OpenAI Agents SDK through Vinkius don't need to source, host, or maintain individual MCP servers. Every tool call runs inside a hardened runtime with credential isolation, DLP, and a signed audit chain.

3,400+MCP Servers ready
<40msCold start
60%Token savings
Raw MCP
Vinkius
Server catalogFind and host yourself3,400+ managed
InfrastructureSelf-hostedSandboxed V8 isolates
Credential handlingPlaintext in configVault + runtime injection
Data loss preventionNoneConfigurable DLP policies
Kill switchNoneGlobal instant shutdown
Financial circuit breakersNonePer-server limits + alerts
Audit trailNoneEd25519 signed logs
SIEM log streamingNoneSplunk, Datadog, Webhook
HoneytokensNoneCanary alerts on leak
Custom domainsNot applicableDNS challenge verified
GDPR complianceManual effortAutomated purge + export
Enterprise Security

Why teams choose Vinkius for Langfuse (LLM Tracing & Evals) in OpenAI Agents SDK

The Langfuse (LLM Tracing & Evals) MCP Server runs on Vinkius-managed infrastructure inside AWS — a purpose-built runtime with per-request V8 isolates, Ed25519 signed audit chains, and sub-40ms cold starts. All 10 tools execute in hardened sandboxes optimized for native MCP execution.

Your AI agents in OpenAI Agents SDK only access the data you authorize, with DLP that blocks sensitive information from ever reaching the model, kill switch for instant shutdown, and up to 60% token savings. Enterprise-grade infrastructure, zero maintenance.

Langfuse (LLM Tracing & Evals)
Fully ManagedVinkius Servers
60%Token savings
High SecurityEnterprise-grade
IAMAccess control
EU AI ActCompliant
DLPData protection
V8 IsolateSandboxed
Ed25519Audit chain
<40msKill switch
Stream every event to Splunk, Datadog, or your own webhook in real-time

* Every MCP server runs on Vinkius-managed infrastructure inside AWS - a purpose-built runtime with per-request V8 isolates, Ed25519 signed audit chains, and sub-40ms cold starts optimized for native MCP execution. See our infrastructure

The Vinkius Advantage

How Vinkius secures Langfuse (LLM Tracing & Evals) for OpenAI Agents SDK

Every tool call from OpenAI Agents SDK to the Langfuse (LLM Tracing & Evals) MCP Server is protected by DLP redaction, cryptographic audit chains, V8 sandbox isolation, kill switch, and financial circuit breakers.

< 40msCold start
Ed25519Signed audit chain
60%Token savings
FAQ

Frequently asked questions

01

Can I see the exact system instruction for a specific prompt version?

Yes. Use the list_prompts tool to browse your managed templates. Your agent can retrieve the exact text and variables for any deployed prompt version, making it easy to audit AI logic through natural conversation.

02

How do I log human feedback for a specific trace?

Use the create_score tool by providing the Trace ID and a JSON payload defining the score name (e.g. 'user-satisfaction') and value. Your agent will attach this structured data directly to the Langfuse record.

03

Can my agent report on my LLM spending for the current day?

Absolutely. The get_daily_metrics tool retrieves aggregated USD costs and average latency metrics from Langfuse. Your agent can summarize these statistics to help you monitor your infrastructure budget in real-time.

04

How does the OpenAI Agents SDK connect to MCP?

Use MCPServerSse(url=...) to create a server connection. The SDK auto-discovers all tools and makes them available to your agent with full type information.

05

Can I use multiple MCP servers in one agent?

Yes. Pass a list of MCPServerSse instances to the agent constructor. The agent can use tools from all connected servers within a single run.

06

Does the SDK support streaming responses?

Yes. The SDK supports SSE and Streamable HTTP transports, both of which work natively with Vinkius.

07

MCPServerStreamableHttp not found

Ensure you have the latest version: pip install --upgrade openai-agents

08

Agent not calling tools

Make sure your prompt explicitly references the task the tools can help with.