4,000+ servers built on vurb.ts
Vinkius
LlamaIndexFramework
LlamaIndex
LibreChat MCP Server

Bring Llm Orchestration
to LlamaIndex

Learn how to connect LibreChat to LlamaIndex and start using 4 AI agent tools in minutes. Fully managed, enterprise secure, and ready to use without writing a single line of code.

MCP Inspector GDPR Free for Subscribers
Chat CompletionsList ModelsLoginOpen Responses

Compatible with every major AI agent and IDE

ClaudeClaude
ChatGPTChatGPT
CursorCursor
GeminiGemini
WindsurfWindsurf
VS CodeVS Code
JetBrainsJetBrains
VercelVercel
+ other MCP clients
LibreChat

What is the LibreChat MCP Server?

Connect your LibreChat instance to any AI agent and gain programmatic control over your self-hosted AI ecosystem. This server allows you to bridge your custom agents and models with any MCP-compatible client.

What you can do

  • Agent Orchestration — List all available agents and models configured in your LibreChat environment.
  • Unified Completions — Create chat completions using the Agents API, providing an OpenAI-compatible interface for your custom setups.
  • Open Responses — Utilize the Open Responses API specification to generate structured AI outputs.
  • Session Management — Authenticate directly via email and password to retrieve access tokens when a static API key is not preferred.

How it works

  1. Subscribe to this server
  2. Provide your LibreChat Base URL and API Key (or use the login tool)
  3. Start interacting with your private LLM agents through Claude, Cursor, or other MCP tools.

Who is this for?

  • AI Engineers — integrate self-hosted LibreChat agents into automated workflows and IDEs.
  • DevOps Teams — monitor and query available model configurations across different environments.
  • Power Users — centralize access to multiple private LLMs through a single, secure interface.

Built-in capabilities (4)

chat_completions

Model corresponds to an Agent ID. Create a chat completion using the Agents API

list_models

List available LibreChat models/agents

login

Login to LibreChat to get access and refresh tokens

open_responses

Create a response using the Open Responses API

Why LlamaIndex?

LlamaIndex agents combine LibreChat tool responses with indexed documents for comprehensive, grounded answers. Connect 4 tools through Vinkius and query live data alongside vector stores and SQL databases in a single turn. ideal for hybrid search, data enrichment, and analytical workflows.

  • Data-first architecture: LlamaIndex agents combine LibreChat tool responses with indexed documents for comprehensive, grounded answers

  • Query pipeline framework lets you chain LibreChat tool calls with transformations, filters, and re-rankers in a typed pipeline

  • Multi-source reasoning: agents can query LibreChat, a vector store, and a SQL database in a single turn and synthesize results

  • Observability integrations show exactly what LibreChat tools were called, what data was returned, and how it influenced the final answer

L
See it in action

LibreChat in LlamaIndex

AI AgentVinkius
High Security·Kill Switch·Plug and Play
Why Vinkius

LibreChat and 4,000+ other MCP servers. One platform. One governance layer.

Teams that connect LibreChat to LlamaIndex through Vinkius don't need to source, host, or maintain individual MCP servers. Every tool call runs inside a hardened runtime with credential isolation, DLP, and a signed audit chain.

4,000+MCP Servers ready
<40msCold start
60%Token savings
Raw MCP
Vinkius
Server catalogFind and host yourself4,000+ managed
InfrastructureSelf-hostedSandboxed V8 isolates
Credential handlingPlaintext in configVault + runtime injection
Data loss preventionNoneConfigurable DLP policies
Kill switchNoneGlobal instant shutdown
Financial circuit breakersNonePer-server limits + alerts
Audit trailNoneEd25519 signed logs
SIEM log streamingNoneSplunk, Datadog, Webhook
HoneytokensNoneCanary alerts on leak
Custom domainsNot applicableDNS challenge verified
GDPR complianceManual effortAutomated purge + export
Enterprise Security

Why teams choose Vinkius for LibreChat in LlamaIndex

The LibreChat MCP Server runs on Vinkius-managed infrastructure inside AWS — a purpose-built runtime with per-request V8 isolates, Ed25519 signed audit chains, and sub-40ms cold starts. All 4 tools execute in hardened sandboxes optimized for native MCP execution.

Your AI agents in LlamaIndex only access the data you authorize, with DLP that blocks sensitive information from ever reaching the model, kill switch for instant shutdown, and up to 60% token savings. Enterprise-grade infrastructure, zero maintenance.

LibreChat
Fully ManagedVinkius Servers
60%Token savings
High SecurityEnterprise-grade
IAMAccess control
EU AI ActCompliant
DLPData protection
V8 IsolateSandboxed
Ed25519Audit chain
<40msKill switch
Stream every event to Splunk, Datadog, or your own webhook in real-time

* Every MCP server runs on Vinkius-managed infrastructure inside AWS - a purpose-built runtime with per-request V8 isolates, Ed25519 signed audit chains, and sub-40ms cold starts optimized for native MCP execution. See our infrastructure

The Vinkius Advantage

How Vinkius secures LibreChat for LlamaIndex

Every tool call from LlamaIndex to the LibreChat MCP Server is protected by DLP redaction, cryptographic audit chains, V8 sandbox isolation, kill switch, and financial circuit breakers.

< 40msCold start
Ed25519Signed audit chain
60%Token savings
FAQ

Frequently asked questions

01

How can I see which agents are currently available in my LibreChat instance?

You can use the list_models tool. It will query your configured LibreChat instance and return a list of all accessible agents and models associated with your credentials.

02

Can I use this server to chat with a specific agent by its ID?

Yes! Use the chat_completions tool. Simply provide the model (which is the Agent ID) and an array of messages to generate a response from that specific agent.

03

What should I do if I don't have a static API key for my instance?

You can use the login tool. By providing your email and password, the server will authenticate with LibreChat and retrieve the necessary access tokens for subsequent requests.

04

How does LlamaIndex connect to MCP servers?

Use the MCP client adapter to create a connection. LlamaIndex discovers all tools and wraps them as query engine tools compatible with any LlamaIndex agent.

05

Can I combine MCP tools with vector stores?

Yes. LlamaIndex agents can query LibreChat tools and vector store indexes in the same turn, combining real-time and embedded data for grounded responses.

06

Does LlamaIndex support async MCP calls?

Yes. LlamaIndex's async agent framework supports concurrent MCP tool calls for high-throughput data processing pipelines.

07

BasicMCPClient not found

Install: pip install llama-index-tools-mcp

Explore More MCP Servers

View all →