Llm MCP Servers — Browse AI Agent Connectors by Tag

OpenAI MCP

10 tools

Use GPT-4o, DALL-E 3, embeddings, fine-tuning, and moderation as tools inside your AI agent workflows.

Anthropic MCP

6 tools

Access Claude models via Anthropic API. Send messages, count tokens, manage batches and discover models from any AI agent.

Mistral AI (Frontier LLMs & Embeddings) MCP

7 tools

Manage AI inference via Mistral. Execute chat completions, generate RAG embeddings, and audit frontier models.

NVIDIA AI MCP

9 tools

Access LLMs, embeddings, code generation, and reasoning via NVIDIA API Catalog.

Cohere (AI Platform) MCP

5 tools

Power enterprise AI via Cohere. Generate text, perform chat completions, reorder documents, and manage embeddings directly from any AI agent.

Cohere MCP

6 tools

Access Cohere AI models via API. Chat with Command models, generate embeddings, rerank documents and tokenize text from any AI agent.

Mistral AI MCP

10 tools

Access Mistral AI models via API. Chat with Claude alternatives, generate embeddings, moderate content and manage batch jobs from any AI agent.

DeepSeek MCP

12 tools

Access powerful open-weight language models for reasoning, code generation, and complex problem solving at competitive cost.

Together AI MCP

27 tools

Access 100+ open-source models for chat, image generation, and fine-tuning. Power your AI agents with Llama 3.3, Flux, and more.

New

Ollama MCP

12 tools

Run LLM models via Ollama cloud API. Generate completions, chat with multimodal models, create embeddings, and inspect model details from any AI agent.

New

Z.AI MCP

12 tools

Access the full Z.AI platform from any AI agent. Chat completions with GLM models, image and video generation, audio transcription, OCR, web search, and agent tools.

Together AI MCP

7 tools

Generate code, evaluate embeddings, and deploy open-source LLMs instantly from your local agent via Together AI's infrastructure.

Gradient AI (LLM API & Finetuning) MCP

19 tools

Access powerful LLMs, fine-tune models on your own data, and generate embeddings directly through your AI agent.

Writer (AI Enterprise LLM) MCP

24 tools

Access Writer's enterprise-grade LLMs and Knowledge Graph capabilities to generate content, manage files, and query RAG-based data.

Forefront MCP

10 tools

Access Forefront AI models directly from your agent. Generate chat completions, manage fine-tuning jobs, and collect LLM outputs with pipelines.

New

GPU Inference Memory Calculator MCP

4 tools

Estimate GPU VRAM requirements for LLM inference based on model parameters, precision, and batch size.

New

LLM API Cost Calculator MCP

4 tools

Estimate and compare the financial impact of LLM usage across different providers.

New

LLM Context Window Budgeter MCP

0 tools

Monitor and predict LLM context window exhaustion with precision token forecasting.

New

Prompt Injection Pattern Scanner MCP

4 tools

Scans user-supplied text for structural patterns associated with prompt-injection attempts.

New

RAG Chunk Size Optimizer MCP

0 tools

Evaluate RAG chunking strategies by calculating segmentation metrics, embedding costs, and context viability.

New

LLM Fine-Tuning Dataset Validator MCP

5 tools

Verify structural integrity, token distribution, and training costs of JSONL datasets.

#Llm MCP Servers

OpenAI MCP

Anthropic MCP

Mistral AI (Frontier LLMs & Embeddings) MCP

NVIDIA AI MCP

Cohere (AI Platform) MCP

Cohere MCP

Mistral AI MCP

DeepSeek MCP

Together AI MCP

Ollama MCP

Z.AI MCP

Together AI MCP

Gradient AI (LLM API & Finetuning) MCP

Writer (AI Enterprise LLM) MCP

Forefront MCP

GPU Inference Memory Calculator MCP

LLM API Cost Calculator MCP

LLM Context Window Budgeter MCP

Prompt Injection Pattern Scanner MCP

RAG Chunk Size Optimizer MCP

LLM Fine-Tuning Dataset Validator MCP

Subscribe on Vinkius

Configure your credentials

Connect and start building