#Llm MCP Servers
Discover 13 MCP servers tagged with Llm on the Vinkius App Catalog.
OpenAI MCP
10 toolsUse GPT-4o, DALL-E 3, embeddings, fine-tuning, and moderation as tools inside your AI agent workflows.
Anthropic Alternative MCP
6 toolsAccess Claude models via Anthropic API. Send messages, count tokens, manage batches and discover models from any AI agent.
Mistral AI (Frontier LLMs & Embeddings)
7 toolsManage AI inference via Mistral. Execute chat completions, generate RAG embeddings, and audit frontier models.
NVIDIA AI
9 toolsAccess LLMs, embeddings, code generation, and reasoning via NVIDIA API Catalog.
Cohere (AI Platform) MCP Server
7 toolsPower enterprise AI via Cohere. Generate text, perform chat completions, reorder documents, and manage embeddings directly from any AI agent.
Cohere MCP
6 toolsAccess Cohere AI models via API. Chat with Command models, generate embeddings, rerank documents and tokenize text from any AI agent.
Mistral AI MCP Integration
10 toolsAccess Mistral AI models via API. Chat with Claude alternatives, generate embeddings, moderate content and manage batch jobs from any AI agent.
DeepSeek MCP
12 toolsAccess powerful open-weight language models for reasoning, code generation, and complex problem solving at competitive cost.
Together AI Alternative
27 toolsAccess 100+ open-source models for chat, image generation, and fine-tuning. Power your AI agents with Llama 3.3, Flux, and more.
Together AI
7 toolsGenerate code, evaluate embeddings, and deploy open-source LLMs instantly from your local agent via Together AI's infrastructure.
Gradient AI (LLM API & Finetuning)
19 toolsAccess powerful LLMs, fine-tune models on your own data, and generate embeddings directly through your AI agent.
Writer (AI Enterprise LLM) MCP
24 toolsAccess Writer's enterprise-grade LLMs and Knowledge Graph capabilities to generate content, manage files, and query RAG-based data.
Forefront MCP Server
10 toolsAccess Forefront AI models directly from your agent. Generate chat completions, manage fine-tuning jobs, and collect LLM outputs with pipelines.