Haystack (deepset Cloud) MCP. Run RAG pipelines and search indexed knowledge.

Q: How do I use the listworkspaces tool with Haystack (deepset Cloud) MCP Server?

The listworkspaces tool shows you all isolated environments you've set up. You use this first to confirm the specific context (e.g., 'staging' or 'production') before running any search or pipeline operations.

Q: What is the difference between searchdocuments and runpipeline?

searchdocuments executes a raw vector search over the index. runpipeline executes a complete, configured workflow (the pipeline) which includes embedding nodes and retriever logic, making it more comprehensive.

Q: Can I check metadata for a file using getfile?

Yes, the getfile tool lets you retrieve specific metadata for an uploaded file. This is crucial for auditing, as it tells you exactly what information is attached to the source document embedding.

Q: How do I see what pipelines are available? Use listpipelines.

Simply calling listpipelines lists all available NLP topologies in the current workspace. This shows you which workflows (like 'default-rag') are ready to be tested via runpipeline.

Q: How do I check which files I can manage using the listfiles tool?

The listfiles tool shows all files currently uploaded. It lets you see the names and basic status of your document repository.

Q: What is the purpose of the getfile tool?

The getfile tool retrieves specific metadata about a file. You use it when you need details beyond just the file name, like creation dates or authors.

Q: If I want to see all available search methods, should I use listpipelines or searchdocuments?

Use listpipelines first. This shows you the available NLP topologies and search setups. Once you pick a pipeline, you can run the search using runpipeline.

Q: How do I run a specific knowledge search using the searchdocuments tool?

The searchdocuments tool runs a direct vector search against your indexed knowledge. You pass it the query and the target index to get immediate context.

Claude

ChatGPT

Cursor

Gemini

Windsurf

VS Code

JetBrains

Vercel

See Vinkius in Action

Works with every AI agent you already use

…and any MCP-compatible client

Just plug in your AI agents and start using Vinkius.

Haystack (deepset Cloud) MCP Server manages your RAG pipelines and enterprise search context. It lets your AI agent interact with isolated workspaces, run document searches over indexed knowledge, and list metadata for both files and NLP pipelines.

Manage complex, contextual AI workflows without leaving your chat client.

What your AI agents can do

Get file

Retrieves specific metadata for an uploaded file.

Get pipeline

Gets the detailed configuration and structure of a single pipeline.

List files

Lists all files currently uploaded to the system.

+ 4 more capabilities included

Audit Workspaces

List and manage isolated environments where different search contexts are maintained.

Inspect Pipelines

View the structure of NLP topologies, including embedding nodes and retriever logic.

Execute Search Tests

Run a pipeline search by dispatching immediate LLM or Retriever invocations.

Perform Vector Search

Trigger dense or sparse vector searches across your indexed enterprise knowledge.

Check File Metadata

List uploaded files and retrieve specific metadata attached to source document embeddings.

Ask AI about this MCP

Ask ChatGPT

Ask Claude

Ask Perplexity

Supported MCP Clients

Claude

ChatGPT

Cursor

Gemini

Windsurf

VS Code

JetBrains

Vercel

+ other MCP clients

Free for Subscribers

Waiting for input…

AI Agent

Haystack (deepset Cloud) MCP Server: 7 Tools for RAG & Search

These tools allow your agent to list workspaces, manage pipelines, run searches, and inspect document metadata directly within the chat interface.

get019d75ae

get file

Retrieves specific metadata for an uploaded file.

get019d75ae

get pipeline

Gets the detailed configuration and structure of a single pipeline.

list019d75ae

list files

Lists all files currently uploaded to the system.

list019d75ae

list pipelines

Lists all available pipelines across the connected workspace.

list019d75ae

list workspaces

Lists all isolated environments you have set up.

run019d75ae

run pipeline

Executes a complete search run using a specified pipeline configuration.

search019d75ae

search documents

Triggers a dense or sparse vector search against your entire document index.

Choose How to Get Started

Build a custom MCP for your own tools, or connect a ready-made integration from our catalog.

Build Your Own

Turn any API into an MCP. Import a spec, define Agent Skills, or deploy with MCPFusion.

Import from OpenAPI, Swagger, or YAML specs
Create Agent Skills with progressive disclosure
Deploy to edge with MCPFusion framework
Built in DLP, auth, and compliance on every call
Real time usage dashboard and cost metering
Publish to catalog or keep private

Start building

Make Your AI Do More

Start with Haystack (deepset Cloud), then connect any of our 4,700+ other servers whenever your AI needs more. One click, no limits.

Use this MCP plus 4,700+ others, all in one place
Add new capabilities to your AI anytime you want
Every connection is secured and compliant automatically
Track usage and costs across all your servers
Works with Claude, ChatGPT, Cursor, and more
New servers added to the catalog every week

What you can do with this MCP connector

Haystack (deepset Cloud) MCP Server manages your whole RAG pipeline and enterprise search context. It lets your AI agent interact with isolated workspaces, run document searches over indexed knowledge, and list metadata for both files and NLP pipelines. You manage complex, contextual AI workflows without ever leaving your chat client.

Audit Workspaces
You'll list all isolated environments you've set up with list_workspaces and then get detailed info on a specific environment with list_pipelines and get_pipeline. Inspect Pipelines
You can view the structure of any NLP topology, checking out embedding nodes and retriever logic using get_pipeline. Execute Search Tests
To test your setup, you run a search using a specified pipeline configuration with run_pipeline.

You can also trigger a direct, dense or sparse vector search against your entire document index using search_documents. Perform Vector Search
Use search_documents to trigger dense or sparse vector searches across your indexed enterprise knowledge. Check File Metadata
To see what files you've uploaded, run list_files to get a list of everything in the system; you can then check specific source document embedding metadata for any file using get_file.

How Haystack (deepset Cloud) MCP Works

1 Subscribe to the server and provide your deepset Cloud API credentials.
2 Your AI client connects, granting the agent full access to the Haystack tools.
3 You prompt the agent (e.g., 'Check the compliance documents'). The agent sequences list_workspaces -> run_pipeline -> search_documents to deliver the answer.

The bottom line is, you manage your entire RAG pipeline lifecycle—from setup to execution—using only natural conversation.

Who Is Haystack (deepset Cloud) MCP For?

This is for the ML Engineer who needs to test a new knowledge retrieval model without touching the cloud console. It's for the Product Manager who needs to prove that the documents are indexed correctly before launch. It's for the Data Scientist who needs to audit complex pipelines rapidly. It lets you treat your entire data stack like a set of chat commands.

ML Engineer

Tests new embedding models or retriever logic by calling run_pipeline and verifying the search results against source documents.

Product Manager

Verifies the document indexing status and tracks search performance by listing files and workspaces.

Data Scientist

Audits existing RAG pipelines and document sources by calling list_pipelines and search_documents to confirm data coverage.

What Changes When You Connect

Test search logic instantly. Instead of clicking through a dashboard, use run_pipeline to dispatch immediate LLM or Retriever invocations and test your RAG setup.
See your data structure clearly. Use list_pipelines and get_pipeline to visualize NLP topologies, checking every embedding node and retriever logic without needing the cloud console.
Manage your context isolation. list_workspaces lets you list and select separate environments, ensuring your search queries hit the right isolated data set.
Audit source files easily. Use list_files to see what documents are uploaded, and get_file to check the metadata attached to specific embeddings.
Deep search capabilities. search_documents runs vector searches over your whole indexed knowledge base, going beyond simple keyword matching.
End-to-end visibility. You can chain tools—e.g., list_workspaces then run_pipeline—to manage the entire lifecycle of an AI search query.

Real-World Use Cases

Auditing a Production Search

A Product Manager needs to confirm if the 'Q3 Compliance' documents are indexed correctly. They use the agent to run list_workspaces to select the right environment, then list_files to confirm the document count. Finally, they call search_documents to validate that the desired snippets appear.

Testing a New RAG Model

An ML Engineer wants to test a new embedding model. They use the agent to call get_pipeline on the staging workspace to check the current topology. They then use run_pipeline to execute the search with the new model, verifying the results before committing the change.

Debugging Pipeline Failures

A Data Scientist finds a pipeline is giving weird results. They use the agent to call get_pipeline to inspect the exact sequence of nodes (e.g., checking the retriever logic). They then use search_documents to narrow the search scope and pinpoint the data source issue.

Onboarding a New Data Source

A team member has uploaded 50 new PDFs. They use the agent to call list_files to confirm the uploads. They then use run_pipeline to run a preliminary search on the new data, checking if the pipeline handles the new file types correctly.

The Tradeoffs

Manual Console Checks

The developer logs into the deepset console, navigates to the workspace, manually checks the file list, and then runs the search, copying results into a spreadsheet.

→ Use the agent to sequence the process. First, call list_workspaces to select the context. Then, use run_pipeline to execute the search, and finally, get_file to pull metadata for the source IDs. It's all in chat.

Ignoring Context

The agent just runs search_documents without knowing which workspace or pipeline to use, leading to ambiguous or stale results.

→ Always start by calling list_workspaces to confirm the correct isolated environment. This ensures your run_pipeline call executes against the intended, governed context.

Assuming Full Functionality

Thinking that just running search_documents is enough when the data requires specific NLP processing steps.

→ Don't just search. First, use list_pipelines to see what topologies exist. Then, use run_pipeline to ensure the data runs through the required pre-processing steps before the search.

When It Fits, When It Doesn't

Use this MCP Server if your core task is managing, testing, and debugging Retrieval-Augmented Generation (RAG) pipelines using external, indexed knowledge. You need to move the entire lifecycle—from checking file metadata (get_file) to running the full search (run_pipeline)—into a single, conversational interface. Don't use this if you simply need basic file operations or general LLM prompting; other general-purpose agents handle that. You also don't need it if you are only running single, isolated vector searches without needing to audit the pipeline structure. If you only need to see what files exist, list_files is enough, but this server lets you connect the files to the pipelines and the search results.

Independent Platform Disclaimer: Vinkius is an independent platform and is not affiliated with, endorsed by, sponsored by, verified by, or otherwise authorized by deepset Cloud. All third-party trademarks, logos, and brand names are the property of their respective owners. Their use on this website is strictly for informational purposes to identify service compatibility and interoperability.

VINKIUS INFRASTRUCTURE

Cloud Hosted

Managed infra

V8 Isolated

Sandboxed per request

Zero-Trust Proxy

No stored credentials

DLP Enforced

Policy on every call

GDPR Compliant

EU data residency

Token Compression

~60% cost reduction

How we secure it →

Works with Claude, ChatGPT, Cursor, and more

The Model Context Protocol standardizes how applications expose capabilities to LLMs. Instead of operating in isolation, your AI gains direct access to external platforms, live data, and real-world actions through secure, standardized connections.

This server provides 7 capabilities that interface natively with Claude, ChatGPT, Cursor, and any MCP client. No middleware. No custom integration required.

Available Capabilities

get_file get_pipeline list_files list_pipelines list_workspaces run_pipeline search_documents

Manually auditing search pipelines is a nightmare.

Right now, testing a search pipeline means opening the deepset console. You have to switch tabs to check the file list, click into the workspace to see which pipelines are active, and then manually trigger a run. You copy results from one screen, and then you go to another screen to check the source file metadata. It takes a decade and a dozen clicks.

With this server, you just tell your agent, 'Test the compliance search.' The agent coordinates `list_workspaces` and `run_pipeline` behind the scenes. You get the answer, and you can ask it to pull the source file IDs using `get_file`—all in the chat window. It's a single, coordinated operation.

Haystack (deepset Cloud) MCP Server: Search and Validate

You eliminate the need to jump between the deepset console, the file browser, and the pipeline visualization tool. You never have to manually run a search and then manually check the source document metadata in a different view.

This tool lets you treat your entire complex data stack—the pipelines, the workspaces, and the documents—as a single, queryable entity. It's instant validation for your ML teams.

Common Questions About Haystack (deepset Cloud) MCP

How do I use the `list_workspaces` tool with Haystack (deepset Cloud) MCP Server? +

The list_workspaces tool shows you all isolated environments you've set up. You use this first to confirm the specific context (e.g., 'staging' or 'production') before running any search or pipeline operations.

What is the difference between `search_documents` and `run_pipeline`? +

search_documents executes a raw vector search over the index. run_pipeline executes a complete, configured workflow (the pipeline) which includes embedding nodes and retriever logic, making it more comprehensive.

Can I check metadata for a file using `get_file`? +

Yes, the get_file tool lets you retrieve specific metadata for an uploaded file. This is crucial for auditing, as it tells you exactly what information is attached to the source document embedding.

How do I see what pipelines are available? Use `list_pipelines`. +

Simply calling list_pipelines lists all available NLP topologies in the current workspace. This shows you which workflows (like 'default-rag') are ready to be tested via run_pipeline.

How do I check which files I can manage using the `list_files` tool? +

The list_files tool shows all files currently uploaded. It lets you see the names and basic status of your document repository.

What is the purpose of the `get_file` tool? +

The get_file tool retrieves specific metadata about a file. You use it when you need details beyond just the file name, like creation dates or authors.

If I want to see all available search methods, should I use `list_pipelines` or `search_documents`? +

Use list_pipelines first. This shows you the available NLP topologies and search setups. Once you pick a pipeline, you can run the search using run_pipeline.

How do I run a specific knowledge search using the `search_documents` tool? +

The search_documents tool runs a direct vector search against your indexed knowledge. You pass it the query and the target index to get immediate context.

Can I test my RAG pipelines directly via my AI agent? +

Yes. Use the run_pipeline tool to dispatch a query to any registered pipeline in your workspace. Your agent will return the response from the NLP topology, allowing you to verify retriever performance and LLM grounding without leaving your workspace.

How can I audit my document indexing status? +

Ask your agent to list files in your workspace. You can then get specific metadata for any file to ensure embeddings and attributes are correctly attached. This is essential for debugging retrieval issues in production environments.

Is it possible to manage multiple deepset Cloud workspaces? +

Absolutely. The agent provides high-level workspace listing, allowing you to navigate across tenant boundaries and isolation zones easily. You just need to provide the workspace name to any pipeline or search command.

Use it with your favorite AI tools

Connect this server to Cursor, Claude, VS Code, and more.

OpenAI Agents SDK sdk-python

Google ADK sdk-python

Pydantic AI sdk-python

Vercel AI SDK sdk-typescript