Compatible with every major AI agent and IDE
What is the LibreChat MCP Server?
Connect your LibreChat instance to any AI agent and gain programmatic control over your self-hosted AI ecosystem. This server allows you to bridge your custom agents and models with any MCP-compatible client.
What you can do
- Agent Orchestration — List all available agents and models configured in your LibreChat environment.
- Unified Completions — Create chat completions using the Agents API, providing an OpenAI-compatible interface for your custom setups.
- Open Responses — Utilize the Open Responses API specification to generate structured AI outputs.
- Session Management — Authenticate directly via email and password to retrieve access tokens when a static API key is not preferred.
How it works
- Subscribe to this server
- Provide your LibreChat Base URL and API Key (or use the login tool)
- Start interacting with your private LLM agents through Claude, Cursor, or other MCP tools.
Who is this for?
- AI Engineers — integrate self-hosted LibreChat agents into automated workflows and IDEs.
- DevOps Teams — monitor and query available model configurations across different environments.
- Power Users — centralize access to multiple private LLMs through a single, secure interface.
Built-in capabilities (4)
Model corresponds to an Agent ID. Create a chat completion using the Agents API
List available LibreChat models/agents
Login to LibreChat to get access and refresh tokens
Create a response using the Open Responses API
Why LlamaIndex?
LlamaIndex agents combine LibreChat tool responses with indexed documents for comprehensive, grounded answers. Connect 4 tools through Vinkius and query live data alongside vector stores and SQL databases in a single turn. ideal for hybrid search, data enrichment, and analytical workflows.
- —
Data-first architecture: LlamaIndex agents combine LibreChat tool responses with indexed documents for comprehensive, grounded answers
- —
Query pipeline framework lets you chain LibreChat tool calls with transformations, filters, and re-rankers in a typed pipeline
- —
Multi-source reasoning: agents can query LibreChat, a vector store, and a SQL database in a single turn and synthesize results
- —
Observability integrations show exactly what LibreChat tools were called, what data was returned, and how it influenced the final answer
LibreChat in LlamaIndex
LibreChat and 4,000+ other MCP servers. One platform. One governance layer.
Teams that connect LibreChat to LlamaIndex through Vinkius don't need to source, host, or maintain individual MCP servers. Every tool call runs inside a hardened runtime with credential isolation, DLP, and a signed audit chain.
Raw MCP | Vinkius | |
|---|---|---|
| Server catalog | Find and host yourself | 4,000+ managed |
| Infrastructure | Self-hosted | Sandboxed V8 isolates |
| Credential handling | Plaintext in config | Vault + runtime injection |
| Data loss prevention | None | Configurable DLP policies |
| Kill switch | None | Global instant shutdown |
| Financial circuit breakers | None | Per-server limits + alerts |
| Audit trail | None | Ed25519 signed logs |
| SIEM log streaming | None | Splunk, Datadog, Webhook |
| Honeytokens | None | Canary alerts on leak |
| Custom domains | Not applicable | DNS challenge verified |
| GDPR compliance | Manual effort | Automated purge + export |
Why teams choose Vinkius for LibreChat in LlamaIndex
The LibreChat MCP Server runs on Vinkius-managed infrastructure inside AWS — a purpose-built runtime with per-request V8 isolates, Ed25519 signed audit chains, and sub-40ms cold starts. All 4 tools execute in hardened sandboxes optimized for native MCP execution.
Your AI agents in LlamaIndex only access the data you authorize, with DLP that blocks sensitive information from ever reaching the model, kill switch for instant shutdown, and up to 60% token savings. Enterprise-grade infrastructure, zero maintenance.

* Every MCP server runs on Vinkius-managed infrastructure inside AWS - a purpose-built runtime with per-request V8 isolates, Ed25519 signed audit chains, and sub-40ms cold starts optimized for native MCP execution. See our infrastructure
How Vinkius secures
LibreChat for LlamaIndex
Every tool call from LlamaIndex to the LibreChat MCP Server is protected by DLP redaction, cryptographic audit chains, V8 sandbox isolation, kill switch, and financial circuit breakers.
Frequently asked questions
How can I see which agents are currently available in my LibreChat instance?
You can use the list_models tool. It will query your configured LibreChat instance and return a list of all accessible agents and models associated with your credentials.
Can I use this server to chat with a specific agent by its ID?
Yes! Use the chat_completions tool. Simply provide the model (which is the Agent ID) and an array of messages to generate a response from that specific agent.
What should I do if I don't have a static API key for my instance?
You can use the login tool. By providing your email and password, the server will authenticate with LibreChat and retrieve the necessary access tokens for subsequent requests.
How does LlamaIndex connect to MCP servers?
Use the MCP client adapter to create a connection. LlamaIndex discovers all tools and wraps them as query engine tools compatible with any LlamaIndex agent.
Can I combine MCP tools with vector stores?
Yes. LlamaIndex agents can query LibreChat tools and vector store indexes in the same turn, combining real-time and embedded data for grounded responses.
Does LlamaIndex support async MCP calls?
Yes. LlamaIndex's async agent framework supports concurrent MCP tool calls for high-throughput data processing pipelines.
BasicMCPClient not found
Install: pip install llama-index-tools-mcp
Explore More MCP Servers
View all →
Klevu (E-commerce AI Search)
10 toolsPower your e-commerce discovery via Klevu AI — execute keyword searches, manage category merchandising, and retrieve product recommendations.

Mattermost (Secure Team Collaboration)
10 toolsManage team collaboration via Mattermost — send messages, search channels, and audit team activities.

Hotjar (Behavior Analytics)
10 toolsAnalyze user behavior via Hotjar — list sites, retrieve survey responses, and manage feedback widgets.

Payload CMS
10 toolsManage headless content via Payload CMS — manipulate documents, patch JSON states, retrieve singletons, and structure database endpoints using AI.
