Qdrant MCP. Run vector searches and manage collections from chat.
Works with every AI agent you already use
…and any MCP-compatible client
Just plug in your AI agents and start using Vinkius.
Qdrant connects your AI client directly to a vector database cluster. It lets you query embeddings, perform nearest neighbor similarity searches, and manage all collections without writing complex scripts.
Use the `search` tool to find data points matching semantic queries, or use `list_collections` to map out what's stored in your indexes.
What your AI agents can do
Count
Returns a single number: the total count of points in a specified collection.
Delete
Removes specific data points from a collection. Note that this action is irreversible.
Get collection
Retrieves detailed metadata about one specific collection, like its distance metric and vector size.
You pass a query vector (a JSON array of floats), and the agent returns the most similar data points from your collections.
The agent runs list_collections to show you every available collection, giving you a map of your stored data.
You specify a collection name, and the agent calls get_collection to report on its structure, including distance metrics and total payload points.
The agent uses count to give you an immediate total number of points in a specified collection.
You use the scroll tool to paginate through thousands of data points, allowing you to inspect IDs and payloads sequentially without memory limits.
If you know a record's unique identifier (ID), the agent uses get_points to pull that exact payload immediately.
Ask AI about this MCP
Supported MCP Clients
Waiting for input…
Qdrant: 7 Tools for Vector Management
Use these seven tools to query your vector clusters, list indexes, perform similarity searches, and manage your embedded data directly from your AI agent.
019d75fbcount
Returns a single number: the total count of points in a specified collection.
019d75fbdelete
Removes specific data points from a collection. Note that this action is irreversible.
019d75fbget collection
Retrieves detailed metadata about one specific collection, like its distance metric and vector size.
019d75fbget points
Pulls the actual data (payloads) for points when you provide their unique IDs.
019d75fblist collections
Lists every collection present in your Qdrant database instance.
019d75fbscroll
Returns batches of points and their payloads, useful for reading through large amounts of data page by page.
019d75fbsearch
Performs a nearest neighbor search in a collection using a JSON array of floats as the query vector.
Choose How to Get Started
Build a custom MCP for your own tools, or connect a ready-made integration from our catalog.
Build Your Own
Turn any API into an MCP. Import a spec, define Agent Skills, or deploy with MCPFusion.
- Import from OpenAPI, Swagger, or YAML specs
- Create Agent Skills with progressive disclosure
- Deploy to edge with MCPFusion framework
- Built in DLP, auth, and compliance on every call
- Real time usage dashboard and cost metering
- Publish to catalog or keep private
Make Your AI Do More
Start with Qdrant, then connect any of our 4,700+ other servers whenever your AI needs more. One click, no limits.
- Use this MCP plus 4,700+ others, all in one place
- Add new capabilities to your AI anytime you want
- Every connection is secured and compliant automatically
- Track usage and costs across all your servers
- Works with Claude, ChatGPT, Cursor, and more
- New servers added to the catalog every week
What you can do with this MCP connector
Qdrant hooks your AI client directly into a vector database cluster. You can query embeddings, run nearest neighbor similarity searches, and manage every collection without having to write any complex scripts or boilerplate code. It gives your agent the raw power it needs to interact with structured data.
To start mapping out what's in the system, you use list_collections. This tool runs instantly and shows you a full list of every single collection stored on your Qdrant instance; it’s like getting an index map of all your indexes. Once you know which collections exist, you can dive deeper into any specific one by calling get_collection.
You pass the name of the collection, and the agent spits out detailed metadata about it—you'll get metrics like its configured distance metric (e.g., Cosine or Euclidean) and the expected vector size for all payloads.
When you need to know how big a collection is, use count. You pass a specific collection name, and it returns one single number: the total count of points housed in that index. This gives you an immediate grasp of your dataset's scale without having to pull records. If you suspect something’s wrong or if you need to prune old data, you use delete.
You specify which data points you want gone from a collection; remember, this action is permanent, so double-check what you send.
For finding specific records by their unique ID, the agent uses get_points. If you know the exact identifier (ID) of a payload, you feed that ID to this tool, and it pulls the full data—the actual payload—immediately. This is crucial for debugging or validating specific pieces of information.
When you need to read through massive amounts of data page by page, scroll handles it. You use this tool when your dataset is too big to load into memory all at once. It returns batches of points and their payloads sequentially, allowing your agent to iterate through thousands of records without hitting any memory limits.
This lets you inspect IDs and contents one chunk after the other.
The core function—the search itself—is done with search. You pass a JSON array of floats; that’s your query vector. The tool then performs a nearest neighbor search within a specified collection, returning the data points that match your semantic query best. This is how you find data that means the same thing as your input, even if the words don't match exactly.
To put it all together: If your goal is to analyze data quality or debug embeddings, you combine scroll with get_points. To build a knowledge graph and link related concepts, you rely on search. You can check the overall structure of your environment by running list_collections, then validate that structure using get_collection before querying.
If you need to know how many records are available across all indexes without actually pulling them, use count. The entire workflow—from mapping out what exists with list_collections to reading specific payloads with get_points, and finally performing the heavy lifting of finding semantic matches using search—runs entirely through your agent's ability to invoke these tools, giving you full control over your vector data.
How Qdrant MCP Works
- 1 First, subscribe to this server and provide your Qdrant Base URL and API Key.
- 2 Next, prompt your AI client with a query. For example: 'List all collections' or 'Find data similar to X vector'.
- 3 The agent executes the necessary tool (like
list_collectionsorsearch), and you get back the structured payload data directly in the chat.
The bottom line is that your AI client treats Qdrant like a native API endpoint, letting you query complex vector math right through natural conversation.
Who Is Qdrant MCP For?
ML Engineers and Data Scientists need this. You’re the one staring at embeddings in a notebook all day, tired of context switching between your code editor, your terminal, and your database GUI. This lets you test RAG pipelines and debug vector payloads directly through chat.
Uses the server to query embedded spaces while building or testing Retrieval-Augmented Generation (RAG) applications, running search queries for context.
Inspecting live indices and debugging payload quality. They use scroll and get_points to manually verify data points without launching a full Jupyter session.
Managing the vector cluster configuration, running count, or using delete to clear out stale or redundant datasets efficiently from the API layer.
What Changes When You Connect
- Semantic Search on Demand: Skip the boilerplate code. Just ask your agent to
searchfor context, and it runs a nearest neighbor query using complex embeddings—all in one turn. - Instant Inventory Check: Need to know what indexes you have? Use
list_collections. It gives you an immediate map of every data source without needing to navigate multiple dashboards. - Deep Data Debugging: Instead of dumping logs, use the
scrolltool. You can paginate through millions of records and inspect IDs and payloads right in your chat window. - Efficiency Gains with
count: Get a quick total point count usingcount. It’s much faster than running an aggregation query just to check data volume. - Targeted Data Cleanup: Use
deletefor safe, controlled removal of bad or redundant data points. You don't need a separate admin script for simple cleanup tasks.
Real-World Use Cases
Debugging poor embedding quality
A data scientist wants to check if the latest batch of embeddings are actually working. They use list_collections first, then select a collection and run scroll. This lets them inspect hundreds of payloads quickly to verify that the right information is being indexed.
Building RAG context retrieval
The engineer needs an answer based on external documentation. They prompt their agent: 'Find documents related to API rate limits.' The agent runs search with the query vector, and returns the top 3 payloads that provide the direct answer.
Auditing database health
A backend developer suspects a collection is getting bloated with old data. They use count to check the current volume and then run get_collection to verify the distance metric, helping them decide if they need to clean up.
Finding specific records by ID
You know a user's record has ID 1f2a3. Instead of scrolling through millions of results, you use get_points with that exact ID and pull the payload immediately. It’s fast and precise.
The Tradeoffs
Writing manual pagination loops
The developer writes a 50-line Python script using while loops and sleep() calls just to view the first page of results from a collection.
→
Just ask your agent to run scroll. The tool handles the iteration and pagination for you, allowing you to inspect batches of points without writing any loop logic.
Assuming all data is searchable
Trying to perform a search query on a collection that hasn't been properly indexed with embeddings or using the wrong vector size.
→
Always check first. Use get_collection and list_collections to confirm the collection exists and has the correct distance metrics before running a costly search.
Over-relying on single tools
Only using count and ignoring the actual data. You know how many points exist, but you can't debug what those points say.
→
Always pair volume checks with inspection. Use list_collections to find the target, then run scroll or get_points to confirm payload quality.
When It Fits, When It Doesn't
Use this server if your core problem involves semantic similarity: finding data based on meaning (vectors) rather than exact keywords. You need it when you're building RAG systems, testing embeddings, or managing a vector store.
Don't use this if you only need simple relational CRUD operations on non-vector fields (e.g., 'Give me the user by name'). For that, stick to standard SQL or NoSQL APIs. This is for deep data math and indexing. If your goal is merely listing collections, list_collections works; if you need structured schema details, run get_collection. Never assume a collection exists; always check with list_collections first.
Independent Platform Disclaimer: Vinkius is an independent platform and is not affiliated with, endorsed by, sponsored by, verified by, or otherwise authorized by Qdrant. All third-party trademarks, logos, and brand names are the property of their respective owners. Their use on this website is strictly for informational purposes to identify service compatibility and interoperability.
VINKIUS INFRASTRUCTURE
Cloud Hosted
Managed infra
V8 Isolated
Sandboxed per request
Zero-Trust Proxy
No stored credentials
DLP Enforced
Policy on every call
GDPR Compliant
EU data residency
Token Compression
~60% cost reduction
Works with Claude, ChatGPT, Cursor, and more
The Model Context Protocol standardizes how applications expose capabilities to LLMs. Instead of operating in isolation, your AI gains direct access to external platforms, live data, and real-world actions through secure, standardized connections.
This server provides 7 capabilities that interface natively with Claude, ChatGPT, Cursor, and any MCP client. No middleware. No custom integration required.
Available Capabilities
Debugging vector payloads shouldn't require running custom scripts.
Right now, if you need to debug your embeddings—say, checking the payload of 50 data points from a collection—you have to write a dedicated script. You gotta set up the connection, run the query, handle the pagination loop manually, and then print the results. It’s tedious and slows down your iteration cycle.
With this MCP server, you just tell your agent what to do: 'Scroll through the first 50 points in my docs-embeddings collection.' The agent handles all the looping and API calls behind the scenes. You get clean, usable data right back into the conversation.
Qdrant MCP Server gives you total control over your vector indexes.
Manual maintenance tasks—like listing all collections or getting a hard count of records—used to require writing multiple, separate API calls just to gather basic metrics. You'd run one tool for the list, another tool for the total, and a third for the structure.
Now you can ask your agent to do it all in one go. The server manages the complex multi-step process so you get structured answers instantly. It cuts out the boilerplate API plumbing.
Common Questions About Qdrant MCP
How do I find my Qdrant URL and API Key? +
For Qdrant Cloud: Go to the Qdrant Cloud Console, select your cluster to open the Cluster Detail Page. The endpoint will be displayed there (e.g., xyz.us-east4-0.gcp.cloud.qdrant.io), and you can generate Database API Keys underneath it (they start with eyJhb). For Self-hosted: Provide your custom URL and the static custom key you defined in your config.yaml.
Can my AI use this for a RAG architecture directly? +
Yes contextually, but practically your agent acts as the database debugger. It can formulate vector arrays to query search_points, retrieving identical payload structures. It's meant for the engineer building the RAG, helping you inspect distances and debug faulty retrieval mechanisms mid-code.
Does it support deleting vectors? +
Yes. If an embedding got corrupted or references dropped articles, use the delete tool. Pass the collection name and the list of specific IDs. Qdrant handles the mutation instantly and updates the index without rebuilding.
What if I have millions of points? +
Instead of overloading your chat context, instruct your agent to use the count tool to grasp the scale, and the scroll tool with a small limit constraint (e.g., 5-10 records at a time). This paginates large bodies cleanly when analyzing index health.
How do I check which collections are available using the `list_collections` tool? +
It lists every vector collection in your Qdrant instance. The output shows the collection name, current point count, and optimization details (like distance metrics or vector size).
What format must my query be when running the `search` tool? +
You must provide a JSON array of floats for the query vector. The search tool then performs nearest neighbor similarity searches against your specified collection.
Using `get_collection`, what specific metadata can I retrieve about an index? +
This tool pulls detailed information on a given collection, including its total payload points and the distance metrics it uses (e.g., Cosine or Dot product).
How does the `scroll` tool help me debug data payloads? +
The scroll tool allows you to traverse a collection sequentially, returning both the point IDs and their full JSON payloads. This is ideal for inspecting data quality without writing complex pagination logic.
Use it with your favorite AI tools
Connect this server to Cursor, Claude, VS Code, and more.
More in this category
Midjourney
AI image generation — create, upscale, vary, and blend images using Midjourney's Imagine API.
Baseten
Manage your Baseten AI models — orchestrate deployments, list secrets, and run serverless inference predictions autonomously.
Exa AI
Search the web with neural embeddings that understand meaning, not just keywords, and return the most relevant results for any query.
You might also like
BILL Spend & Expense
Manage corporate spend via BILL — list budgets, cards, and transactions directly from any AI agent.
Qichacha / 企查查
Leading enterprise data platform in China — search companies, check industrial info, and monitor risks via AI.
Duolingo
Track language learning progress on Duolingo — view profiles, streaks, XP, leaderboards and dictionary hints.