Chainlit Observability MCP for AI. Audit model failures and map conversation flow.

Q: How do I find out how many different AI apps I'm running with Chainlit? (listprojects)

Call listprojects. This tool returns a clean list of all the independently tracked projects you have configured in your Cloud instance.

Q: I need to see what users talked about today. Which tool do I use? (listthreads)

Use listthreads. It finds and lists every unique conversational thread within a specific project, giving you the IDs needed for deeper inspection.

Q: What is the difference between getthread and liststeps? (getthread)

getthread gives you the entire conversation payload—the raw chat transcript. liststeps, on the other hand, breaks that down into the machine logic: identifying each specific prompt or tool call made during the session.

Q: How do I check if my chatbot is popular? (getstats)

getstats pulls global metrics. It provides traffic boundaries and usage figures, telling you how many conversations were processed and what resource consumption was measured over time.

Claude

ChatGPT

Cursor

Gemini

Windsurf

VS Code

JetBrains

Vercel

See Vinkius in Action

Works with every AI agent you already use

…and any MCP-compatible client

Connect to your AI in seconds.

Chainlit provides observability for AI applications, letting you audit chat threads and track LLM performance metrics securely. It maps global traffic statistics across your entire AI portfolio; lets you query full chronological conversations from users; and tracks every internal logic jump—identifying prompts, tool executions, and retrieval boundaries used per interaction.

What your AI can do

List projects

Retrieves a list of every configured Chainlit Cloud project space that is actively managing app tracking.

List threads

Identifies all individual conversational threads that occurred within a specified deployed project.

Get thread

Retrieves the full, exact payload for a single conversational thread, mapping its complete topology.

+ 3 more capabilities included

View overall project statistics

Retrieves global traffic data and usage figures across all configured applications.

Inspect full conversation history

Gets the complete, chronological transcript for a specific user interaction thread.

Trace model logic steps

Maps out the internal decision process by listing every prompt and tool execution within a single chat session.

Analyze user ratings and feedback

Collects explicit and implicit user reviews, including thumbs up/down signals, for performance tracking.

List available AI projects

Retrieves a list of all independently tracked Chainlit Cloud project spaces.

Ask an AI about this

Included with Plan

Waiting for input…

AI Agent

Chainlit MCP - 6 Tools

These tools let your agent audit conversational history by listing projects, retrieving thread payloads, tracking performance metrics, and analyzing user feedbacks.

Make your AI actually useful.

Add this MCP to Claude, Cursor, or Windsurf and your AI stops guessing. It gets real tools to look things up, take action, and handle the stuff you keep doing by hand.

Start using Chainlit on Vinkius

List Projects

Retrieves a list of every configured Chainlit Cloud project space that is actively managing app tracking.

List Threads

Identifies all individual conversational threads that occurred within a specified...

Get Thread

Retrieves the full, exact payload for a single conversational thread, mapping its...

List Steps

Lists the raw sequence of programmatic interaction steps, defining the prompts and...

List Feedbacks

Lists all user reviews and feedback signals related to conversational accuracy and...

Get Stats

Pulls explicit analytics statistics detailing traffic volume and resource consumption across designated projects.

Security and governance baked right in.

Pick your AI client below to get set up. Just create a Vinkius account, subscribe, and you're instantly up and running. We handle the entire backend infrastructure, delivering out-of-the-box support for HTTPS Streamable, SSE, and OAuth2—zero messy routing required.

Claude AI

Open Claude Settings

Go to claude.ai, click your profile icon, then navigate to Customize → Connectors.

Add Custom Connector

Click the "+" button and select Add custom connector. Paste your Vinkius endpoint URL:

https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp

Replace [YOUR_TOKEN_HERE] with your token from cloud.vinkius.com. For OAuth-protected servers, expand Advanced settings to add credentials.

Start a conversation

Open a new chat. The Chainlit Observability integration is available immediately — no restart needed.

Antigravity

Configure Agent Environment

Open your Antigravity agent's workspace configuration or mcp-servers.json file.

Bind the Endpoint

Add the Vinkius endpoint URL to your agent's MCP connections list:

"mcp_servers": {
  "chainlit": {
    "serverUrl": "https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp"
  }
}

Provide your secure token in place of [YOUR_TOKEN_HERE] to ensure your agent requests are authenticated.

Execute

Start your Antigravity session. The agent will autonomously discover and utilize the Chainlit Observability tools with full Vinkius guardrails applied.

VS Code Copilot

⚡

One-Click Install (Recommended)

In your Vinkius Dashboard, simply click the Add to VS Code button for this server. We'll automatically configure your local workspace.

Or configure manually

Open MCP Settings

Open VS Code, press Ctrl/Cmd + Shift + P, and search for GitHub Copilot: MCP Servers.

Add Server Config

Add the Vinkius endpoint configuration to your mcp-servers.json file:

"chainlit": {
  "url": "https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp"
}

Ensure you replace [YOUR_TOKEN_HERE] with your token from cloud.vinkius.com.

LangChain

Install Dependencies

Install the LangChain MCP adapters for your environment:

pip install langchain-mcp-adapters

Connect the Server

Use the SSEClient in LangChain to connect to the Vinkius managed endpoint:

from langchain_mcp_adapters.client import SSEClient

# Connect to Vinkius
client = SSEClient(url="https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp")
tools = client.get_tools()

CrewAI

Define the Tool

Load the Vinkius MCP tools into your CrewAI agents:

from crewai import Agent
from mcp_crewai import MCPTool

# Connect securely to Vinkius
vinkius_tools = MCPTool(url="https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp")

# Assign to Agent
researcher = Agent(
    role='Data Researcher',
    tools=vinkius_tools.get_all()
)

Execute Task

Run your CrewAI process. The agent will autonomously route tasks to the Vinkius managed server.

Choose How to Get Started

Build a custom MCP for your own tools, or connect a ready-made integration from our catalog.

Build Your Own

Turn any API into an MCP. Import a spec, define Agent Skills, or deploy with MCPFusion.

Import from OpenAPI, Swagger, or YAML specs
Create Agent Skills with progressive disclosure
Deploy to edge with MCPFusion framework
Built in DLP, auth, and compliance on every call
Real time usage dashboard and cost metering
Publish to catalog or keep private

Start building

Make Your AI Do More

Start with Chainlit, then connect any of our 5,100+ other servers whenever your AI needs more. One click, no limits.

Use this MCP plus 5,100+ others, all in one place
Add new capabilities to your AI anytime you want
Every connection is secured and compliant automatically
Track usage and costs across all your servers
Works with Claude, ChatGPT, Cursor, and more
New servers added to the catalog every week

Independent Platform Disclaimer: Vinkius is an independent platform and is not affiliated with, endorsed by, sponsored by, verified by, or otherwise authorized by Chainlit. All third-party trademarks, logos, and brand names are the property of their respective owners. Their use on this website is strictly for informational purposes to identify service compatibility and interoperability.

VINKIUS INFRASTRUCTURE

Cloud Hosted

Managed infra

V8 Isolated

Sandboxed per request

Zero-Trust Proxy

No stored credentials

DLP Enforced

Policy on every call

GDPR Compliant

EU data residency

Token Compression

~60% cost reduction

Your data is protected. See how we built it.

Works with Claude, ChatGPT, Cursor, and more

The Model Context Protocol standardizes how applications expose capabilities to LLMs. Instead of operating in isolation, your AI gains direct access to external platforms, live data, and real-world actions through secure, standardized connections.

This connection provides 6 powerful capabilities that interface natively with Claude, ChatGPT, Cursor, and other compatible AI platforms. No middleware. No custom integration required.

Tracking chat performance manually means juggling multiple dashboards.

Today, if your AI agent has a bug or performs poorly, you have to jump between the chat platform's analytics dashboard, the log management system, and the user feedback portal. You copy project IDs here, search for conversations there, and manually correlate timestamps across three different tools just to figure out what went wrong.

With this MCP, your agent does the heavy lifting. It connects directly to the source data. You simply ask it to audit a failure; it pulls the full internal logic path, showing you exactly which prompt or tool execution failed. The result is immediate, structured diagnostics.

The Chainlit MCP gives you visibility into every logical step.

Before this, understanding the internal decision-making process was nearly impossible without deep access to proprietary backend logs. You couldn't tell if a bad response came from poor initial data or a flawed function call; it was a black box issue requiring developer guesswork.

Now, using `list_steps`, your agent opens that black box. It reveals the full stack: every prompt used and every tool executed in chronological order. You finally know exactly where to fix it.

Support 24/7 support@vinkius.com ↗

Security Vinkius Trust Center ↗

SLA Service Level Agreement ↗

Report Listing Send Report ↗

What your AI can actually do with this

This MCP connects to your Chainlit Cloud projects, giving your agent direct access to deep conversation data. Instead of just seeing if the chatbot worked, you can look inside: audit specific interactions or pull global analytics mapping usage across all deployed apps. You'll get explicit records of what happened in every chat session, from listing the full thread payload to tracking user sentiment via collected feedbacks.

It’s crucial for diagnosing failures; your agent finds out exactly which internal prompts and tool calls led to a bad output. For product teams, this means automatically summarizing negative feedback or polling new chats for compliance parameters, all without manually reading logs. Integrating this through Vinkius lets you connect this powerful audit capability directly into your preferred AI client.

Built · Hosted · Managed by Vinkius Chainlit-MCP: Audit Chat History & Performance

Server ID 019d756b-c13a-72ca-83ae-2976210f76cd

Vinkius Inspector

Compliance Grade F

Score 3.6/100

Report View Report ↗

What Changes When You Connect

Diagnose production errors instantly. By calling list_steps, you extract the full logical sequence, pinpointing whether a bad output resulted from an initial prompt or a flawed tool execution.

Track user sentiment systematically. Use list_feedbacks to gather all explicit and implicit ratings across your app—you get a metric of success versus failure immediately.

Monitor total application health. Calling get_stats gives you global traffic counts and usage figures, letting you understand the scale of activity without diving into dashboards.

Review specific conversations easily. You can use list_threads to find recent user interactions, then get_thread to pull the full payload for deep analysis on a single chat session.

Manage multiple apps efficiently. Start by using list_projects to see every independent tracking space you manage before running targeted audits.

See it in action

01 01

Identifying hallucination triggers

An agent needs to know why the model provided incorrect data for a client. It calls list_steps on the affected thread, which returns the exact prompt and internal tool execution that caused the failure, allowing immediate developer correction.

02 02

Calculating overall product value

A PM wants to know if the recent UI update improved user satisfaction. The agent calls list_feedbacks across all projects, grouping and summarizing negative outcomes ('thumbs down') versus positive ones.

03 03

Auditing compliance in chat logs

A QA team member needs to check if the bot is adhering to privacy policies. The agent calls list_threads for a specific project, then uses get_thread to pull transcripts and scan them for unmasked PII.

04 04

Debugging resource bottlenecks

The ops engineer notices slow performance spikes. They use get_stats to check traffic boundaries and consumption rates; if the usage is high, they narrow down the issue by listing all available projects via list_projects.

The honest tradeoffs

Checking data in isolation

Anti-pattern

Running only get_stats and assuming that tells you everything about performance.

The Fix

No. You must combine calls. Use list_projects first to scope the area, then use get_stats to get the high-level view, and finally drill down with list_threads or list_steps to diagnose a specific incident.

Ignoring time boundaries

Anti-pattern

Trying to analyze historical data without knowing the precise date range.

The Fix

Always confirm your scope. Use get_stats by specifying the required traffic boundary, or if you need a specific conversation, use list_threads with defined start and end dates.

Assuming single-source truth

Anti-pattern

Believing that just looking at user feedback is enough to explain model failure.

The Fix

It's not. User ratings are subjective; the actual cause requires list_steps. The combination of a negative rating from list_feedbacks paired with the raw trace data from list_steps gives you the full picture.

When It Fits, When It Doesn't

Use this MCP if your primary need is observability. You need to diagnose why an AI agent behaved a certain way, or measure its real-world impact on users; that's what this connector does best. Don't use it if you just need to send a message or retrieve a static piece of information (use a messaging MCP instead). If your goal is purely data storage or CRUD operations outside of chat logs, then this isn't right for you—you need a dedicated database MCP. However, if you are building a conversational agent and suddenly realize it’s failing silently in production, this MCP is essential; the combination of list_projects to scope the environment and get_stats to measure impact makes it non-negotiable.

Questions you might have

How do I find out how many different AI apps I'm running with Chainlit? (list_projects) +

Call list_projects. This tool returns a clean list of all the independently tracked projects you have configured in your Cloud instance.

I need to see what users talked about today. Which tool do I use? (list_threads) +

Use list_threads. It finds and lists every unique conversational thread within a specific project, giving you the IDs needed for deeper inspection.

What is the difference between `get_thread` and `list_steps`? (get_thread) +

get_thread gives you the entire conversation payload—the raw chat transcript. list_steps, on the other hand, breaks that down into the machine logic: identifying each specific prompt or tool call made during the session.

How do I check if my chatbot is popular? (get_stats) +

get_stats pulls global metrics. It provides traffic boundaries and usage figures, telling you how many conversations were processed and what resource consumption was measured over time.

When I run `list_steps`, what does the raw programmatic interaction step actually show me? +

It reveals the exact sequence of internal logic jumps used in a single chat. You get explicit details on every prompt, the model's output, and which tools were executed during that specific interaction.

How can I use `list_feedbacks` to find all the specific reasons users rated my bot poorly? +

This tool aggregates user review feedbacks, letting you filter by sentiment (like 'thumbs down'). You can read explicit textual complaints and spot recurring issues, such as formatting problems or poor tone.

If I know the ID, how do I use `get_thread` to pull the full payload of one specific chat? +

You provide the unique thread ID, and the tool returns the complete data structure. This is ideal for compliance audits or recreating a session without pulling in surrounding conversations.

Before I use `list_projects`, what credentials do I need to connect my project? +

You must provide your Chainlit Cloud URL along with the associated Project API Key. These two pieces of information authenticate your agent and set the specific scope for all data queries.

Will the AI agent be able to monitor the user interactions and evaluate chat history? +

Yes! The agent can dive into the list_threads and get_thread endpoints to retrieve comprehensive interaction logs from your deployed Chainlit apps. You can essentially command the agent to read past AI chats, summarize usage, or identify edge cases in the user input.

Can it track the individual thought steps and LLM prompt tokens consumed? +

Absolutely. Using the list_steps tool, your agent analyzes the programmatic trace—including specific LLM calls, function blocks, or retrieval events. Thus, identifying hallucinations or latency issues is as easy as typing a prompt.

Is it possible to extract and analyze human feedback scores instantly? +

Yes. The integration provides native capabilities via list_feedbacks to retrieve the explicit thumbs up, down, and textual comments your users left on specific messages, streamlining QA.

Connect to your AI in seconds.

List projects

List threads

Get thread

Chainlit MCP - 6 Tools

Make your AI actually useful.

List Projects

List Threads

Get Thread

List Steps

List Feedbacks

Get Stats

Security and governance baked right in.

Claude AI

Open Claude Settings

Add Custom Connector

Start a conversation

Claude Code

Open your terminal

Add the MCP Server

Start coding

Cursor

One-Click Install (Recommended)

Open Cursor Settings

Add New Server

Use in Composer

Antigravity

Configure Agent Environment

Bind the Endpoint

Execute

VS Code Copilot

One-Click Install (Recommended)

Open MCP Settings

Add Server Config

Windsurf

One-Click Install (Recommended)

Open Windsurf Settings

Add Server Endpoint

LangChain

Install Dependencies

Connect the Server

CrewAI

Define the Tool

Execute Task

Choose How to Get Started

Build Your Own

Make Your AI Do More

Works with Claude, ChatGPT, Cursor, and more

Tracking chat performance manually means juggling multiple dashboards.

The Chainlit MCP gives you visibility into every logical step.

What your AI can actually do with this

Here's how it actually works

Who is this actually for?

What Changes When You Connect

See it in action

Identifying hallucination triggers

Calculating overall product value

Auditing compliance in chat logs

Debugging resource bottlenecks

The honest tradeoffs

Checking data in isolation

Ignoring time boundaries

Assuming single-source truth

When It Fits, When It Doesn't

Questions you might have