How to Use the Fireworks AI MCP in Claude Code
Run ultra-fast Fireworks AI model inference, embeddings, and transcriptions directly from your Claude Code terminal.
Works with every AI agent you already use
…and any MCP-compatible client
Connect Fireworks AI MCP to Claude Code
Create your Vinkius account to connect Fireworks AI to Claude Code and route execution through our secure gateway. The platform manages server hosting, runtime updates, and security layers. Configuration requires no manual server provisioning.
Headless model discovery for Claude Code
The `list_models` tool queries the Fireworks AI registry directly from your command-line interface. When running automated scripts, Claude Code uses this MCP server to check which open-weights models are online and ready to accept inference requests. This enables dynamic fallback routing in your terminal workflows. If a specific model is offline, your CLI agent automatically selects an alternative from the returned list without crashing your build pipeline.
Fast pipeline execution via this MCP Server
The `chat` tool processes high-speed chat completions directly within your terminal session or CI/CD pipelines. Claude Code runs this tool to analyze logs, generate patch files, and automate code reviews at a fraction of the cost of standard APIs. To handle non-chat text generation, the `completion` tool executes raw prompts without chat template overhead. This is perfect for terminal-based automation where you need to pipe raw text outputs directly into other shell utilities.
Terminal-based asset generation and transcription
The `image` tool lets you generate visual assets directly from a terminal prompt without opening a web browser. Claude Code calls the endpoint, downloads the generated image, and saves it to your specified path in your current directory. For automated audio processing, the `transcribe` tool converts voice recordings into structured text files. Meanwhile, the `embed` tool generates vector embeddings of your files, enabling you to build powerful command-line semantic search tools.
Set up Fireworks AI MCP in Claude Code
Prerequisites
- Claude Code CLI installed (
npm install -g @anthropic-ai/claude-code) - Active Vinkius subscription with a valid endpoint token
- 1
Run the add command
Open your terminal and run the command shown on the right. Replace
[YOUR_TOKEN_HERE]with your endpoint token from cloud.vinkius.com. Use--scope userto make it available across all projects. - 2
Verify the connection
Start a Claude Code session and type
/mcpto list connected servers. You should seefireworks-ai-mcpwith a green status indicator. - 3
Start using tools
Ask Claude Code something like "Check my latest Fireworks AI transactions." It will automatically discover and invoke the available Fireworks AI tools.
claude mcp add --transport http fireworks-ai-mcp https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp Why Choose Vinkius
Vinkius connects your tools to AI with real-time monitoring and automatic cost savings — all from one dashboard.
Real-time monitoring
Live
visibility into every interaction
Connect your favorite tools to your AI and see exactly what's happening — every request, every response, in real time.
Built-in savings
60%
lower AI costs
Vinkius compresses data between your apps and your AI automatically. Lower bills every month — no configuration required.
Single dashboard
One
place for every integration
Every tool your AI connects to, managed from a single screen. One account, complete control.
Common questions about Fireworks AI MCP in Claude Code
Use it with your favorite AI tools
Connect this server to Cursor, Claude, VS Code, and more.
Start using the Fireworks AI MCP today
We host it, we monitor it, we maintain it. You just paste one token.