4,500+ servers built on MCP Fusion
Vinkius

Qdrant MCP. Run vector searches and manage collections from chat.

Claude Claude
ChatGPT ChatGPT
Cursor Cursor
Gemini Gemini
Windsurf Windsurf
VS Code VS Code
JetBrains JetBrains
Vercel Vercel
See Vinkius in Action

Works with every AI agent you already use

…and any MCP-compatible client

Qdrant MCP on Cursor AI Code Editor MCP Client Qdrant MCP on Claude Desktop App MCP Integration Qdrant MCP on OpenAI Agents SDK MCP Compatible Qdrant MCP on Visual Studio Code MCP Extension Client Qdrant MCP on GitHub Copilot AI Agent MCP Integration Qdrant MCP on Google Gemini AI MCP Integration Qdrant MCP on Lovable AI Development MCP Client Qdrant MCP on Mistral AI Agents MCP Compatible Qdrant MCP on Amazon AWS Bedrock MCP Support

Just plug in your AI agents and start using Vinkius.

Qdrant connects your AI client directly to a vector database cluster. It lets you query embeddings, perform nearest neighbor similarity searches, and manage all collections without writing complex scripts.

Use the `search` tool to find data points matching semantic queries, or use `list_collections` to map out what's stored in your indexes.

What your AI agents can do

Count

Returns a single number: the total count of points in a specified collection.

Delete

Removes specific data points from a collection. Note that this action is irreversible.

Get collection

Retrieves detailed metadata about one specific collection, like its distance metric and vector size.

+ 4 more capabilities included
Find nearest vector matches

You pass a query vector (a JSON array of floats), and the agent returns the most similar data points from your collections.

List all indexes in the cluster

The agent runs list_collections to show you every available collection, giving you a map of your stored data.

Read metadata and metrics for one index

You specify a collection name, and the agent calls get_collection to report on its structure, including distance metrics and total payload points.

Count all indexed records

The agent uses count to give you an immediate total number of points in a specified collection.

Iterate through large datasets

You use the scroll tool to paginate through thousands of data points, allowing you to inspect IDs and payloads sequentially without memory limits.

Retrieve specific records by ID

If you know a record's unique identifier (ID), the agent uses get_points to pull that exact payload immediately.

Supported MCP Clients

Claude Claude
ChatGPT ChatGPT
Cursor Cursor
Gemini Gemini
Windsurf Windsurf
VS Code VS Code
JetBrains JetBrains
Vercel Vercel
+ other MCP clients
Free for Subscribers

Waiting for input…

AI Agent

Qdrant: 7 Tools for Vector Management

Use these seven tools to query your vector clusters, list indexes, perform similarity searches, and manage your embedded data directly from your AI agent.

action019d75fb

count

Returns a single number: the total count of points in a specified collection.

action019d75fb

delete

Removes specific data points from a collection. Note that this action is irreversible.

get019d75fb

get collection

Retrieves detailed metadata about one specific collection, like its distance metric and vector size.

get019d75fb

get points

Pulls the actual data (payloads) for points when you provide their unique IDs.

list019d75fb

list collections

Lists every collection present in your Qdrant database instance.

action019d75fb

scroll

Returns batches of points and their payloads, useful for reading through large amounts of data page by page.

action019d75fb

search

Performs a nearest neighbor search in a collection using a JSON array of floats as the query vector.

Choose How to Get Started

Build a custom MCP for your own tools, or connect a ready-made integration from our catalog.

Build Your Own

Turn any API into an MCP. Import a spec, define Agent Skills, or deploy with MCPFusion.

  • Import from OpenAPI, Swagger, or YAML specs
  • Create Agent Skills with progressive disclosure
  • Deploy to edge with MCPFusion framework
  • Built in DLP, auth, and compliance on every call
  • Real time usage dashboard and cost metering
  • Publish to catalog or keep private
Start building

Make Your AI Do More

Start with Qdrant, then connect any of our 4,700+ other servers whenever your AI needs more. One click, no limits.

  • Use this MCP plus 4,700+ others, all in one place
  • Add new capabilities to your AI anytime you want
  • Every connection is secured and compliant automatically
  • Track usage and costs across all your servers
  • Works with Claude, ChatGPT, Cursor, and more
  • New servers added to the catalog every week

What you can do with this MCP connector

Qdrant hooks your AI client directly into a vector database cluster. You can query embeddings, run nearest neighbor similarity searches, and manage every collection without having to write any complex scripts or boilerplate code. It gives your agent the raw power it needs to interact with structured data.

To start mapping out what's in the system, you use list_collections. This tool runs instantly and shows you a full list of every single collection stored on your Qdrant instance; it’s like getting an index map of all your indexes. Once you know which collections exist, you can dive deeper into any specific one by calling get_collection.

You pass the name of the collection, and the agent spits out detailed metadata about it—you'll get metrics like its configured distance metric (e.g., Cosine or Euclidean) and the expected vector size for all payloads.

When you need to know how big a collection is, use count. You pass a specific collection name, and it returns one single number: the total count of points housed in that index. This gives you an immediate grasp of your dataset's scale without having to pull records. If you suspect something’s wrong or if you need to prune old data, you use delete.

You specify which data points you want gone from a collection; remember, this action is permanent, so double-check what you send.

For finding specific records by their unique ID, the agent uses get_points. If you know the exact identifier (ID) of a payload, you feed that ID to this tool, and it pulls the full data—the actual payload—immediately. This is crucial for debugging or validating specific pieces of information.

When you need to read through massive amounts of data page by page, scroll handles it. You use this tool when your dataset is too big to load into memory all at once. It returns batches of points and their payloads sequentially, allowing your agent to iterate through thousands of records without hitting any memory limits.

This lets you inspect IDs and contents one chunk after the other.

The core function—the search itself—is done with search. You pass a JSON array of floats; that’s your query vector. The tool then performs a nearest neighbor search within a specified collection, returning the data points that match your semantic query best. This is how you find data that means the same thing as your input, even if the words don't match exactly.

To put it all together: If your goal is to analyze data quality or debug embeddings, you combine scroll with get_points. To build a knowledge graph and link related concepts, you rely on search. You can check the overall structure of your environment by running list_collections, then validate that structure using get_collection before querying.

If you need to know how many records are available across all indexes without actually pulling them, use count. The entire workflow—from mapping out what exists with list_collections to reading specific payloads with get_points, and finally performing the heavy lifting of finding semantic matches using search—runs entirely through your agent's ability to invoke these tools, giving you full control over your vector data.

How Qdrant MCP Works

  1. 1 First, subscribe to this server and provide your Qdrant Base URL and API Key.
  2. 2 Next, prompt your AI client with a query. For example: 'List all collections' or 'Find data similar to X vector'.
  3. 3 The agent executes the necessary tool (like list_collections or search), and you get back the structured payload data directly in the chat.

The bottom line is that your AI client treats Qdrant like a native API endpoint, letting you query complex vector math right through natural conversation.

Who Is Qdrant MCP For?

ML Engineers and Data Scientists need this. You’re the one staring at embeddings in a notebook all day, tired of context switching between your code editor, your terminal, and your database GUI. This lets you test RAG pipelines and debug vector payloads directly through chat.

ML Engineer

Uses the server to query embedded spaces while building or testing Retrieval-Augmented Generation (RAG) applications, running search queries for context.

Data Scientist

Inspecting live indices and debugging payload quality. They use scroll and get_points to manually verify data points without launching a full Jupyter session.

Backend Developer

Managing the vector cluster configuration, running count, or using delete to clear out stale or redundant datasets efficiently from the API layer.

What Changes When You Connect

  • Semantic Search on Demand: Skip the boilerplate code. Just ask your agent to search for context, and it runs a nearest neighbor query using complex embeddings—all in one turn.
  • Instant Inventory Check: Need to know what indexes you have? Use list_collections. It gives you an immediate map of every data source without needing to navigate multiple dashboards.
  • Deep Data Debugging: Instead of dumping logs, use the scroll tool. You can paginate through millions of records and inspect IDs and payloads right in your chat window.
  • Efficiency Gains with count: Get a quick total point count using count. It’s much faster than running an aggregation query just to check data volume.
  • Targeted Data Cleanup: Use delete for safe, controlled removal of bad or redundant data points. You don't need a separate admin script for simple cleanup tasks.

Real-World Use Cases

01

Debugging poor embedding quality

A data scientist wants to check if the latest batch of embeddings are actually working. They use list_collections first, then select a collection and run scroll. This lets them inspect hundreds of payloads quickly to verify that the right information is being indexed.

02

Building RAG context retrieval

The engineer needs an answer based on external documentation. They prompt their agent: 'Find documents related to API rate limits.' The agent runs search with the query vector, and returns the top 3 payloads that provide the direct answer.

03

Auditing database health

A backend developer suspects a collection is getting bloated with old data. They use count to check the current volume and then run get_collection to verify the distance metric, helping them decide if they need to clean up.

04

Finding specific records by ID

You know a user's record has ID 1f2a3. Instead of scrolling through millions of results, you use get_points with that exact ID and pull the payload immediately. It’s fast and precise.

The Tradeoffs

Writing manual pagination loops

The developer writes a 50-line Python script using while loops and sleep() calls just to view the first page of results from a collection.

Just ask your agent to run scroll. The tool handles the iteration and pagination for you, allowing you to inspect batches of points without writing any loop logic.

Assuming all data is searchable

Trying to perform a search query on a collection that hasn't been properly indexed with embeddings or using the wrong vector size.

Always check first. Use get_collection and list_collections to confirm the collection exists and has the correct distance metrics before running a costly search.

Over-relying on single tools

Only using count and ignoring the actual data. You know how many points exist, but you can't debug what those points say.

Always pair volume checks with inspection. Use list_collections to find the target, then run scroll or get_points to confirm payload quality.

When It Fits, When It Doesn't

Use this server if your core problem involves semantic similarity: finding data based on meaning (vectors) rather than exact keywords. You need it when you're building RAG systems, testing embeddings, or managing a vector store.

Don't use this if you only need simple relational CRUD operations on non-vector fields (e.g., 'Give me the user by name'). For that, stick to standard SQL or NoSQL APIs. This is for deep data math and indexing. If your goal is merely listing collections, list_collections works; if you need structured schema details, run get_collection. Never assume a collection exists; always check with list_collections first.

Independent Platform Disclaimer: Vinkius is an independent platform and is not affiliated with, endorsed by, sponsored by, verified by, or otherwise authorized by Qdrant. All third-party trademarks, logos, and brand names are the property of their respective owners. Their use on this website is strictly for informational purposes to identify service compatibility and interoperability.

VINKIUS INFRASTRUCTURE

Cloud Hosted

Managed infra

V8 Isolated

Sandboxed per request

Zero-Trust Proxy

No stored credentials

DLP Enforced

Policy on every call

GDPR Compliant

EU data residency

Token Compression

~60% cost reduction

How we secure it →

Works with Claude, ChatGPT, Cursor, and more

The Model Context Protocol standardizes how applications expose capabilities to LLMs. Instead of operating in isolation, your AI gains direct access to external platforms, live data, and real-world actions through secure, standardized connections.

This server provides 7 capabilities that interface natively with Claude, ChatGPT, Cursor, and any MCP client. No middleware. No custom integration required.

Available Capabilities

count delete get_collection get_points list_collections scroll search

Debugging vector payloads shouldn't require running custom scripts.

Right now, if you need to debug your embeddings—say, checking the payload of 50 data points from a collection—you have to write a dedicated script. You gotta set up the connection, run the query, handle the pagination loop manually, and then print the results. It’s tedious and slows down your iteration cycle.

With this MCP server, you just tell your agent what to do: 'Scroll through the first 50 points in my docs-embeddings collection.' The agent handles all the looping and API calls behind the scenes. You get clean, usable data right back into the conversation.

Qdrant MCP Server gives you total control over your vector indexes.

Manual maintenance tasks—like listing all collections or getting a hard count of records—used to require writing multiple, separate API calls just to gather basic metrics. You'd run one tool for the list, another tool for the total, and a third for the structure.

Now you can ask your agent to do it all in one go. The server manages the complex multi-step process so you get structured answers instantly. It cuts out the boilerplate API plumbing.

Common Questions About Qdrant MCP

How do I find my Qdrant URL and API Key? +

For Qdrant Cloud: Go to the Qdrant Cloud Console, select your cluster to open the Cluster Detail Page. The endpoint will be displayed there (e.g., xyz.us-east4-0.gcp.cloud.qdrant.io), and you can generate Database API Keys underneath it (they start with eyJhb). For Self-hosted: Provide your custom URL and the static custom key you defined in your config.yaml.

Can my AI use this for a RAG architecture directly? +

Yes contextually, but practically your agent acts as the database debugger. It can formulate vector arrays to query search_points, retrieving identical payload structures. It's meant for the engineer building the RAG, helping you inspect distances and debug faulty retrieval mechanisms mid-code.

Does it support deleting vectors? +

Yes. If an embedding got corrupted or references dropped articles, use the delete tool. Pass the collection name and the list of specific IDs. Qdrant handles the mutation instantly and updates the index without rebuilding.

What if I have millions of points? +

Instead of overloading your chat context, instruct your agent to use the count tool to grasp the scale, and the scroll tool with a small limit constraint (e.g., 5-10 records at a time). This paginates large bodies cleanly when analyzing index health.

How do I check which collections are available using the `list_collections` tool? +

It lists every vector collection in your Qdrant instance. The output shows the collection name, current point count, and optimization details (like distance metrics or vector size).

What format must my query be when running the `search` tool? +

You must provide a JSON array of floats for the query vector. The search tool then performs nearest neighbor similarity searches against your specified collection.

Using `get_collection`, what specific metadata can I retrieve about an index? +

This tool pulls detailed information on a given collection, including its total payload points and the distance metrics it uses (e.g., Cosine or Dot product).

How does the `scroll` tool help me debug data payloads? +

The scroll tool allows you to traverse a collection sequentially, returning both the point IDs and their full JSON payloads. This is ideal for inspecting data quality without writing complex pagination logic.

More in this category

You might also like

Built & Managed by Vinkius 30s setup 7 tools

We've already built the connector for Qdrant. Just plug in your AI agents and start using Vinkius.

No hosting. No infrastructure. No complex setup.
All 7 tools are live and waiting. You're up and running in seconds.

Claude Claude
ChatGPT ChatGPT
Cursor Cursor
Gemini Gemini
Windsurf Windsurf
VS Code VS Code
JetBrains JetBrains
Vercel Vercel
+ other MCP clients

Vinkius gives your AI agents access to the full catalog of app connectors, all fully managed, secure, and enterprise-ready. One subscription, every tool you need.

Zero hosting required Full MCP catalog included Enterprise-grade security Auto-updated by Vinkius

Built, hosted, and secured by Vinkius. You just connect and go.