Open WebUI MCP Server with 12 Tools for Claude, Cursor, and AI Agents
Manage your Open WebUI instance — list models, handle chat completions, and manage RAG collections directly from any AI agent. Vinkius routes your AI agents directly to Open WebUI through a governed connection. 12 tools ready to use with Claude, ChatGPT, Cursor, or any AI agent — no hosting, no setup, connect in 30 seconds.
Ask AI about this server
Compatible with every major AI agent and IDE

* Every MCP server runs on Vinkius-managed infrastructure inside AWS - a purpose-built runtime with per-request V8 isolates, Ed25519 signed audit chains, and sub-40ms cold starts optimized for native MCP execution. See our infrastructure
What is the Open WebUI MCP Server?
The Open WebUI MCP Server routes AI agents like Claude, ChatGPT, and Cursor directly to Open WebUI via 12 tools. Manage your Open WebUI instance — list models, handle chat completions, and manage RAG collections directly from any AI agent. Powered by Vinkius — your credentials stay on your side of the connection, every request is auditable. Connect in under 2 minutes.
Built-in capabilities (12)
Tools for your AI Agents to operate Open WebUI
Ask your AI agent "List all models available in my Open WebUI instance." and get the answer without opening a single dashboard. With 12 tools connected to real Open WebUI data, your agents reason over live information, cross-reference it with other MCP servers, and deliver insights you would spend hours assembling manually.
Works with Claude, ChatGPT, Cursor, and any MCP-compatible client. Powered by Vinkius — your credentials never touch the AI model, every request is auditable. Connect in under two minutes.
Why teams choose Vinkius
One subscription gives you the infrastructure to connect your AI agents to thousands of MCP servers — and deploy your own to the Vinkius Edge. Your credentials stay yours. Your data flows directly between your agent and the API. DLP blocks sensitive information from ever reaching the model, kill switch for instant shutdown, and up to 60% token savings. Enterprise-grade routing and governance, zero maintenance.
Build your own MCP Server with our secure development framework →The Open WebUI App Connector works with every AI agent you already use
…and any MCP-compatible client


















Use all 12 Open WebUI tools with your AI agents right now
Vinkius routes your AI agents to Open WebUI through a governed proxy. Beyond a simple connection, you get full visibility into every action your agents perform, with enterprise-grade security and up to 60% savings on AI costs.
Add file to collection on Open WebUI
Add a file to a knowledge collection
Chat completed on Open WebUI
Run outlet filters for completed chat
Chat completions on Open WebUI
OpenAI-compatible chat completion
Create new chat on Open WebUI
Must generate UUIDs for message IDs. Create a new chat (Backend-Controlled Flow)
Get file status on Open WebUI
Check file processing status
List models on Open WebUI
Retrieve all models
Ollama embed on Open WebUI
Ollama API Embeddings
Ollama generate on Open WebUI
Ollama API Generate Completion
Ollama tags on Open WebUI
List Ollama models
Process web url on Open WebUI
Process a web URL into a collection
Send message on Open WebUI
Anthropic-compatible message generation
Upload file on Open WebUI
Content is extracted and stored in the vector DB. Provide file content as base64. Upload a file for RAG
What the Open WebUI MCP Server unlocks
Connect your Open WebUI instance to any AI agent and take full control of your local and cloud LLM orchestration through natural conversation.
What you can do
- Model Management — Use
list_modelsto fetch all available models including Ollama, OpenAI, and Open WebUI Functions. - RAG & Knowledge Base — Upload files with
upload_file, process web content viaprocess_web_url, and organize them into collections usingadd_file_to_collection. - Chat Orchestration — Create and manage backend-controlled chats with
create_new_chator use OpenAI/Anthropic compatible endpoints likechat_completionsandsend_message. - Native Ollama Support — Directly interact with the Ollama API using
ollama_generate,ollama_tags, andollama_embedfor local inference tasks. - File Processing — Monitor the status of your document ingestion with
get_file_statusto ensure your RAG context is ready.
How it works
1. Subscribe to this server
2. Enter your Open WebUI Base URL and API Key
3. Start managing your LLM infrastructure from Claude, Cursor, or any MCP-compatible client
Who is this for?
- AI Engineers — automate the testing of different models and RAG configurations without leaving the terminal or IDE.
- Knowledge Managers — quickly ingest documentation and web URLs into Open WebUI collections via simple commands.
- DevOps Teams — monitor local Ollama instances and manage model availability across the organization.
Frequently asked questions about the Open WebUI MCP Server
How can I check if a model is available in my Open WebUI instance?
You can use the list_models tool. It will return a complete list of all configured models, including those from Ollama, OpenAI, and internal Open WebUI functions.
Can I add a website to my RAG collection using just a URL?
Yes! Use the process_web_url tool. Provide the URL and the target collection name, and the server will scrape and index the content for you.
How do I know when my uploaded file is ready for querying?
After using upload_file, you can check the ingestion progress by calling get_file_status with the returned File ID. It will tell you if the status is 'completed' or 'pending'.
More in this category

Hugging Face
15 toolsAccess thousands of pre-trained AI models for NLP, vision, and audio tasks with the largest open-source machine learning hub.

HashiCorp Nomad
10 toolsManage workloads and orchestration via Nomad — track jobs, nodes, and deployments directly from your AI agent.

UUID & ULID Generator
2 toolsStop LLMs from hallucinating fake or repeated IDs. Generate mathematically guaranteed v4 UUIDs and time-sortable ULIDs natively.

PeerTube (YouTube Alternative)
10 toolsInteract with decentralized PeerTube instances — manage video feeds, download content, and handle user registration via AI.
You might also like

HiFlow
12 toolsWorkflow and business process management.

Cloudmersive
12 toolsValidate data, scan files for viruses, and process documents with a suite of utility APIs for security and compliance.

Sentry
10 toolsGrant your AI agent full access to Sentry's Application Performance Monitoring tools to track raw exceptions, resolve error logs, and inspect crash stack traces dynamically.

Telegram Bot Alternative
12 toolsControl and manage your Telegram bots — send messages, photos, and audit chats via AI.
We built the connector to Open WebUI. Now put your agents to work. Fully governed.
Vinkius is the AI Gateway with managed hosting. Stop building connectors. Every connection runs inside eight layers of security.
Hosted, sandboxed, and live on AWS. You don't provision anything. You don't maintain anything. You connect.
Every tool call, every token, every response. Logged and auditable. Data flows direct from Open WebUI to your agent. Nothing is stored on our side. Ever.
Eight governance layers on every request. Sensitive data redacted before it reaches the model. Kill switch if anything goes sideways. Always on.
