Anyscale MCP Server
Orchestrate your Anyscale infrastructure — manage LLM queries, vectors, services, and cluster batch jobs directly from your AI agent.
Vinkius AI Gateway supports streamable HTTP and SSE.

Works with every AI agent you already use
…and any MCP-compatible client


















Anyscale MCP Server: see your AI Agent in action
Built-in capabilities (7)
chat_completion
Pass an array of messages with roles (user, assistant, system). Generate conversational responses via Anyscale LLMs
generate_embeddings
Generate semantic vector embeddings for text
get_service
Retrieve details about a specific Anyscale service
list_jobs
List Anyscale batch or training jobs
list_models
g., meta-llama/Llama-2-70b-chat-hf). List available AI models on Anyscale Endpoints
list_services
List Anyscale deployed services
text_completion
Use for foundational instruct generation. Generate text completion using Anyscale generic completion API
What this connector unlocks
Connect your Anyscale environment to your AI agent and manage both AI inference and backend scalable infrastructure natively through natural conversation.
What you can do
- Model Discovery and Querying — List all active foundational models inside your environment and send conversational or zero-shot instruct prompts
- Embeddings Pipeline — Generate semantic vector embeddings for arrays of text inputs directly in-flight
- Services Fleet — Monitor deployed Ray services, fetch cluster states, and map live service endpoint configurations
- Cluster Jobs — Query Ray batch jobs to inspect recent execution statuses and training metrics right from your terminal
How it works
1. Subscribe to this server
2. Provide your Anyscale API Key and Base URL
3. Interface with your models, services, and Ray cluster via Claude, Cursor, or your favorite MCP agent
Scale up your AI operations without opening terminal panes to check Ray cluster status.
Who is this for?
- AI & MLOps Engineers — automate the inspection of deployed models, jobs, and embeddings safely during CI workflows
- Data Scientists — submit rapid completion tasks to specialized LLMs running inside your Anyscale VPC
- Backend Developers — debug service health metrics and endpoint statuses without navigating the heavy cloud dashboard
Frequently asked questions
Give your AI agents the power of Anyscale
Access Anyscale and 2,000+ MCP servers — ready for your agents to use, right now. No glue code. No custom integrations. Just plug Vinkius AI Gateway and let your agents work.
More in this category

Cohere (AI Platform)
7 toolsPower enterprise AI via Cohere — generate text, perform chat completions, reorder documents, and manage embeddings directly from any AI agent.

Playground AI
10 toolsGenerate, inpaint, upscale, and transform images using Playground AI's powerful models via natural language.

MindsDB (AI Database & Predictors)
6 toolsManage AI-powered data via MindsDB — execute SQL predictions, audit ML models, and connect data sources.
You might also like

Impact.com
10 toolsManage partnership campaigns, ads, and affiliate payouts via Impact.com API.

Bridge Data Output
10 toolsAccess standardized real estate data via the Bridge API — browse MLS listings, property details, and agent info directly from any AI agent.

Slack
6 toolsAutomate Slack messaging — send messages, search conversations, list channels and users directly from any AI agent.
