#Embeddings MCP Servers
Discover 29 MCP servers tagged with Embeddings on the Vinkius App Catalog.
OpenAI MCP
10 toolsUse GPT-4o, DALL-E 3, embeddings, fine-tuning, and moderation as tools inside your AI agent workflows.
Mistral AI (Frontier LLMs & Embeddings)
7 toolsManage AI inference via Mistral. Execute chat completions, generate RAG embeddings, and audit frontier models.
NVIDIA AI
9 toolsAccess LLMs, embeddings, code generation, and reasoning via NVIDIA API Catalog.
Cohere (Embed & Rerank)
6 toolsEmpower RAG via Cohere. Generate high-quality text embeddings, rerank documents for better accuracy, and perform AI classification directly from any AI agent.
Elasticsearch Vector
6 toolsEmpower vector search via Elasticsearch. Perform dense vector kNN searches, handle index mappings, and index embedding documents directly from any AI agent.
Jina AI (Search Foundation & LLM Grounding) MCP
6 toolsPower your RAG and search via Jina AI. Generate embeddings, rerank documents, read URLs, and perform semantic web search.
MongoDB Atlas Vector Search MCP Server
6 toolsManage vector storage via MongoDB Atlas. Perform similarity searches, query MQL documents, and audit collections.
OpenSearch Vector
6 toolsRun k-NN vector searches on OpenSearch. Create indexes, upsert embeddings, query similar documents, and manage your vector store from any AI agent.
pgvector (Vector Database) MCP Server
6 toolsRun vector similarity searches, manage embedding tables, and build AI-powered retrieval pipelines. All directly inside your existing PostgreSQL database.
Cohere MCP
6 toolsAccess Cohere AI models via API. Chat with Command models, generate embeddings, rerank documents and tokenize text from any AI agent.
Mistral AI MCP Integration
10 toolsAccess Mistral AI models via API. Chat with Claude alternatives, generate embeddings, moderate content and manage batch jobs from any AI agent.
Mistral AI
10 toolsBuild with European open-weight language models that deliver strong reasoning, multilingual capability, and efficient inference.
Chroma (Vector DB) MCP
7 toolsManage vector embeddings via Chroma. List collections, query embeddings, and audit document counts directly from any AI agent.
Couchbase (Vector & NoSQL)
7 toolsManage vector search and NoSQL via Couchbase. Execute N1QL queries, perform KNN vector searches, and audit documents directly from any AI agent.
Fireworks AI MCP Server
6 toolsEmpower LLM applications via Fireworks AI. Perform ultra-fast chat completions, generate embeddings and images, and transcribe audio directly from any AI agent.
LanceDB (Serverless Vector DB)
6 toolsManage vectorized data via LanceDB. Perform similarity searches, create tables, and manage multi-modal embeddings.
Redis Vector
6 toolsEquip your AI to autonomously manage embeddings, run KNN similarity searches, and administrate vector indexes natively inside your Redis stack.
Supabase Vector MCP Integration
7 toolsConnect your AI to Supabase Vector. Execute pgvector semantic searches, manage embeddings, and run relational database queries directly from your terminal.
Vertex AI Vector Search
6 toolsBring Google's massive vector matching power to your AI agent. Search billions of semantic embeddings and administer Vertex Index endpoints directly in chat.
Exa AI MCP
12 toolsSearch the web with neural embeddings that understand meaning, not just keywords, and return the most relevant results for any query.
Milvus (Open-Source Vector Database) MCP
7 toolsManage vector storage via Milvus. Perform ANN searches, query scalar entities, and audit collections.
Vald
6 toolsPower your agent with Vald. Query, insert, and manage dense vectors on a highly scalable, distributed nearest-neighbor engine.
Baidu Qianfan
6 toolsOrchestrate Baidu Qianfan AI models. Manage chat completions, embeddings, and prompt templates directly from any AI agent.
Lingyi Wanwu MCP Integration
5 toolsOrchestrate Lingyi Wanwu AI models. Manage chat completions, embeddings, and monitor Yi model performance directly from any AI agent.
DeepInfra (Serverless LLM Inference)
4 toolsRun top-tier LLMs, image generation, and embeddings via DeepInfra's serverless infrastructure directly from your AI agent.
Eden AI Alternative
13 toolsAccess 100+ AI models through a single API. Route LLMs, generate embeddings, and execute specialized AI tasks like OCR and translation.
Gradient AI (LLM API & Finetuning)
19 toolsAccess powerful LLMs, fine-tune models on your own data, and generate embeddings directly through your AI agent.
SambaNova (AI Inference)
3 toolsHigh-speed AI inference for Llama 3, DeepSeek, and MiniMax models via SambaNova's ultra-fast SN40L chips.
Voyage AI (AI Embeddings API)
13 toolsGenerate high-quality text, multimodal, and contextualized embeddings, plus high-precision reranking for RAG workflows.