2,000+ MCP servers ready to useZero-Trust ArchitectureTitanium-grade infrastructure
Vinkius

NVIDIA AI MCP Server

Built by Vinkius GDPR ToolsGratuit

Access LLMs, embeddings, code generation, and reasoning via NVIDIA API Catalog.

Vinkius AI Gateway prend en charge le streamable HTTP et le SSE.

NVIDIA AI

Fonctionne avec tous les agents IA que vous utilisez déjà

…et tout client compatible MCP

CursorClaudeOpenAIVS CodeCopilotGoogleLovableMistralAWSCursorClaudeOpenAIVS CodeCopilotGoogleLovableMistralAWS

NVIDIA MCP Server : voyez votre AI Agent en action

AI AgentVinkiusNVIDIA AI
You

Vinkius AI Gateway
GDPR·High Security·Kill Switch·Ultra-Low Latency·Plug and Play

Capacités intégrées (9)

analyze_sentiment

Analyze the sentiment of a text

ask_question

Optionally provide context for better answers. Ask a question to a powerful reasoning model (405B params)

chat_completion

Use "model" to specify which AI model (e.g., "meta/llama-3.1-70b-instruct", "mistralai/mistral-large"). Messages should be in OpenAI format: [{role: "user", content: "..."}]. Chat with an NVIDIA AI model (Llama, Mistral, etc)

generate_code

Specify language if needed. Generate code from a natural language prompt

get_embeddings

Model: "nvidia/nv-embed-v1". Generate vector embeddings from text

list_models

List all available AI models on the NVIDIA API Catalog

summarize_text

Summarize long text into a concise version

text_to_sql

Convert natural language to SQL query

translate_text

Translate text to another language

Ce que ce connecteur débloque

Connect NVIDIA AI to any AI agent and harness the power of GPU-accelerated foundation models — chat with Llama, generate embeddings, write code with CodeLlama, translate text, and perform complex reasoning through the NVIDIA API Catalog.

What you can do

  • Chat with LLMs — Access Llama 3.1, Mistral, Nemotron, and more via chat completions
  • Generate Embeddings — Create vector embeddings for search and clustering
  • Code Generation — Write code from natural language prompts using CodeLlama
  • Summarization — Condense long documents into concise summaries
  • Translation — Neural translation between dozens of languages
  • Text-to-SQL — Convert natural language questions into SQL queries
  • Sentiment Analysis — Analyze the emotional tone of text
  • Complex Reasoning — Ask questions to the 405B-parameter reasoning model

How it works

1. Subscribe to this server 2. Enter your NVIDIA API Key (from build.nvidia.com) 3. Start running AI models from Claude, Cursor, or any MCP-compatible client

Who is this for?

  • Developers — Prototype AI features without managing GPU infrastructure
  • Data Scientists — Generate embeddings and run NLP tasks at scale
  • Business Analysts — Use text-to-SQL to query databases with natural language

Questions fréquemment posées

Donnez à vos agents IA la puissance de NVIDIA

Accédez à NVIDIA et à plus de 2 000 serveurs MCP — prêts à être utilisés par vos agents, dès maintenant. Pas de code glue. Pas d'intégrations personnalisées. Branchez simplement Vinkius AI Gateway et laissez vos agents travailler.