NVIDIA Vision MCP Server
Generate images, analyze visuals, detect objects, and caption images via NVIDIA Vision APIs.
Vinkius AI Gateway prend en charge le streamable HTTP et le SSE.

Fonctionne avec tous les agents IA que vous utilisez déjà
…et tout client compatible MCP


















NVIDIA MCP Server : voyez votre AI Agent en action
Capacités intégrées (9)
detect_objects
Detect and list all objects in an image
document_qa
Works with scanned documents, forms, receipts. Ask questions about a document image (OCR + understanding)
generate_image
Model options: "stabilityai/stable-diffusion-3-medium", "stabilityai/stable-diffusion-xl-base-1.0". Size format: "1024x1024". Generate an image from a text prompt using Stable Diffusion
image_captioning
Generate a detailed caption for an image
image_segmentation
Segment and identify all objects in an image
list_vision_models
List available vision models on NVIDIA API Catalog
style_transfer
Apply an artistic style to an image
visual_grounding
Locate a specific object or phrase in an image
visual_question_answering
Provide a public image URL. Ask a question about an image
Ce que ce connecteur débloque
Connect NVIDIA Vision to any AI agent and unlock powerful image understanding and generation — create images with Stable Diffusion, analyze visuals with Kosmos-2, answer questions about images, and perform object detection through natural conversation.
What you can do
- Generate Images — Create images from text prompts using Stable Diffusion models
- Visual Q&A — Ask questions about any image and get detailed answers
- Image Captioning — Generate detailed descriptions of image contents
- Object Detection — Identify and list all objects visible in an image
- Document Understanding — Extract information from scanned documents and forms
- Visual Grounding — Locate specific objects or phrases within images
- Style Transfer — Apply artistic styles to existing images
- Image Segmentation — Segment images into distinct object regions
How it works
1. Subscribe to this server 2. Enter your NVIDIA API Key (from build.nvidia.com) 3. Start analyzing and generating images from Claude, Cursor, or any MCP-compatible clientWho is this for?
- Designers — Generate concepts and analyze visual compositions quickly
- Developers — Integrate image understanding into apps without managing GPU infrastructure
- Content Creators — Generate images and apply style transfers for social media
Questions fréquemment posées
Donnez à vos agents IA la puissance de NVIDIA
Accédez à NVIDIA et à plus de 2 000 serveurs MCP — prêts à être utilisés par vos agents, dès maintenant. Pas de code glue. Pas d'intégrations personnalisées. Branchez simplement Vinkius AI Gateway et laissez vos agents travailler.
Plus dans cette catégorie

Bloomberg Law
13 outilsAccess 200M+ court dockets, case law, and legal news via Bloomberg Law Enterprise Dockets API for comprehensive legal research.

Paylocity
10 outilsManage payroll and HR via Paylocity — list employees, track earnings, and audit benefits setup directly from any AI agent.
Google BigQuery
7 outilsEmpower your AI agent to query massive datasets via BigQuery — execute Standard SQL, track active jobs, and inspect table schemas natively.
Vous pourriez aussi aimer

Zapier
6 outilsMonitor automated workflows, audit app connections, and search for Zap templates on Zapier — the leader in AI orchestration.

Beds24
8 outilsManage properties, bookings, rooms, calendar, availability, and pricing for your Beds24 channel manager through natural conversation.

Wix eCommerce
10 outilsManage products, orders, and inventory on Wix — the complete eCommerce platform for growing online businesses.
