How to Use the LocalAI MCP in Claude
Run private LLMs and voice-to-text models directly from your Claude Desktop app using your own hardware.
Works with every AI agent you already use
…and any MCP-compatible client
Connect LocalAI MCP to Claude Desktop
Create your Vinkius account to connect LocalAI to Claude Desktop and route execution through our secure gateway. The platform manages server hosting, runtime updates, and security layers. Configuration requires no manual server provisioning.
Generate images and detect objects inside Claude Desktop
The `detect_objects` and `face_analyze` tools let Claude Desktop process images directly on your local machine. This MCP Server lets you analyze physical objects and facial attributes inside your desktop chat window without your files ever leaving your local drive. Need to build visual assets on the fly? Run `generate_image` to spin up graphics using your local GPU right inside the chat window. The server handles complex prompt structures and negative prompts behind the scenes, outputting files directly to your workspace.
Run local speech-to-text and chat completions
Running `transcribe_audio` and `text_to_speech` lets Claude Desktop handle audio files locally without hitting external cloud APIs. Your desktop agent processes audio payloads on your machine, giving you quick turnarounds for meeting notes or voice synthesis right in your chat. You can also swap backends on the fly. The server uses `chat_completions` and `list_models` to check what open-source models are running on your local rig, letting your agent route text work to the most efficient local model available.
Install models directly from Claude Desktop
The `apply_model` tool lets Claude Desktop manage your local model catalog, turning your local system into an active MCP Server. This removes the need to jump back and forth to your terminal just to download and configure new models from the local gallery. Keep tabs on your hardware resources without leaving your workflow. The `get_system_info` tool lets your agent monitor local VRAM and CPU usage so you don't crash your machine when running heavy inference tasks.
Set up LocalAI MCP in Claude Web or Desktop
- 1
Open Claude Settings
Go to claude.ai, click your profile icon, then navigate to Customize → Connectors.
- 2
Add Custom Connector
Click the "+" button and select Add custom connector. Paste your Vinkius endpoint URL:
https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcpReplace[YOUR_TOKEN_HERE]with your token from cloud.vinkius.com. For OAuth-protected servers, expand Advanced settings to add credentials. - 3
Start a conversation
Open a new chat. The LocalAI MCP tools are available immediately — no restart needed.
Endpoint URL
https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp No configuration file needed — paste the URL directly in the Claude web interface.
Available on Free (1 connector), Pro, Max, Team, and Enterprise plans.
Why Choose Vinkius
Vinkius connects your tools to AI with real-time monitoring and automatic cost savings — all from one dashboard.
Real-time monitoring
Live
visibility into every interaction
Connect your favorite tools to your AI and see exactly what's happening — every request, every response, in real time.
Built-in savings
60%
lower AI costs
Vinkius compresses data between your apps and your AI automatically. Lower bills every month — no configuration required.
Single dashboard
One
place for every integration
Every tool your AI connects to, managed from a single screen. One account, complete control.
Common questions about LocalAI MCP in Claude Desktop
Use it with your favorite AI tools
Connect this server to Cursor, Claude, VS Code, and more.
Start using the LocalAI MCP today
We host it, we monitor it, we maintain it. You just paste one token.