Bring Rag
to VS Code Copilot
Create your Vinkius account to connect Vectara to VS Code Copilot and start using all 7 AI tools in minutes. Fully managed, enterprise secure, and ready to use without writing a single line of code. No hosting, no server setup — just connect and start using.
Compatible with every major AI agent and IDE
What is the Vectara MCP Server?
Connect your Vectara environment to any AI agent to unlock enterprise-grade Retrieval-Augmented Generation (RAG) and semantic search directly inside your conversational IDE or workspace.
What you can do
- Semantic Search — Query your indexed private corpora naturally and return highly relevant, grounded documents without traditional keyword matching limitations.
- Conversational RAG — Execute fully-fledged interactive chats leveraging Vectara's backend to provide detailed, cited answers strictly based on your secure documents.
- Corpus Management — List all available data corpora, retrieve unique keys, and discover the shape of your indexed data environment on the fly.
- Document Auditing — Monitor specific document indexes within a corpus, verify correct ingestions, or permanently delete obsolete files avoiding polluted search results.
How it works
- Subscribe to this server
- Enter your Vectara API Key and Customer ID
- Start retrieving knowledge from Claude, Cursor, or any MCP-compatible client
Your AI agent becomes an elite cognitive search gateway to all your internal data.
Who is this for?
- Software Engineers — debug RAG implementation challenges by directly testing
queryresponses via chat instead of writing disposable test scripts. - Data Engineers — securely remove stale database context arrays manually inserted into Vectara via quick conversational text commands.
- Product Leads — ask questions against internal product manuals stored as a Vectara corpus without waiting for the frontend UI development.
- Technical Writers — locate specific passages traversing across thousands of embedded documents effortlessly leveraging contextual semantic queries.
Built-in capabilities (7)
This action is irreversible. Permanently removes a document from a corpus
Provide corpus keys and the user query to get a summarized AI response with citations. Executes a RAG-powered chat completion
Retrieves metadata and configuration for a specific corpus
Lists previous RAG chat sessions
Lists all corpora (searchable datasets) in the Vectara account
Lists all indexed documents within a specific corpus
Provide one or more comma-separated corpus keys and the query text. Executes a semantic search across one or more corpora
Why VS Code Copilot?
GitHub Copilot Agent mode brings Vectara data directly into your VS Code workflow. With a project-scoped config, the entire team shares access to 7 tools. Copilot queries live data, generates typed code, and writes tests from actual API responses, all without leaving the editor.
- —
VS Code is used by over 70% of developers. adding MCP tools to Copilot means your team can leverage external data without leaving their primary editor
- —
Project-scoped MCP configs (
.vscode/mcp.json) let you commit server configurations to your repository, ensuring the entire team shares the same tool access - —
Copilot's Agent mode integrates MCP tools seamlessly with file editing, terminal commands, and workspace search in a single agentic loop
- —
GitHub's enterprise compliance and audit features extend to MCP tool usage, providing visibility into how AI interacts with external services
Vectara in VS Code Copilot
Why run Vectara with Vinkius?
The Vectara connection runs on our fully managed, secure cloud infrastructure. We handle the hosting, maintenance, and security so you don't have to deal with servers or code. All 7 tools are ready to work instantly without any complex setup.
You stay in complete control of your data. Your AI only accesses the information you approve, keeping your sensitive passwords and private details completely safe. Plus, with automatic optimizations, your AI works faster and more efficiently.

* Every connection is hosted and maintained by Vinkius. We handle the security, updates, and infrastructure so you don't have to write code or manage servers. See our infrastructure
Over 4,000 integrations ready for AI agents
Explore a vast library of pre-built integrations, optimized and ready to deploy.
Connect securely in under 30 seconds
Generate tokens to authenticate and link external services in a single step.
Complete visibility into every agent action
Audit live requests, latency, success rates, and active security compliance policies.
Optimize spending and track token ROI
Analyze real-time token consumption and cost metrics detailed by connection.




Explore our live AI Agents Analytics dashboard to see it all working
This dashboard is included when you connect Vectara using Vinkius. You will never be left in the dark about what your AI agents are doing with your tools.
Vectara and 4,000+ other AI tools. No hosting, no code, ready to use.
Professionals who connect Vectara to VS Code Copilot through Vinkius don't need to write code, manage servers, or worry about security. Everything is pre-configured, secure, and runs automatically in the background.
Raw MCP | Vinkius | |
|---|---|---|
| Ready-to-use MCPs | Find and configure each manually | 4,000+ MCPs ready to use |
| Connection Setup | Manual coding & server setup | 1-click instant connection |
| Server Hosting | You host it yourself (needs 24/7 uptime) | 100% hosted & managed by Vinkius |
| Security & Privacy | Stored in plaintext config files | Bank-grade encrypted vault |
| Activity Visibility | Blind execution (no logs or tracking) | Live dashboard with real-time logs |
| Cost Control | Runaway AI token spend risk | Automatic budget limits |
| Revoking Access | Must delete files or code to stop | 1-click disconnect button |
How Vinkius secures
Vectara for VS Code Copilot
Every request between VS Code Copilot and Vectara is protected by our secure gateway. We automatically keep your sensitive data private, prevent unauthorized access, and let you disconnect instantly at any time.
Frequently asked questions
Can I query my internal documents directly using just conversational chat?
Yes. If your data is indexed in a Vectara corpus, simply ask your agent: search the 'employee-handbook' corpus for remote work policies. The agent uses the queryTool to pass your question to Vectara's semantic engine, effortlessly bringing back precisely matching paragraph citations instantly.
How do I remove outdated context files destroying the accuracy of my RAG model?
You don't need to rebuild APIs or use cURL. Tell your AI: delete document ID 'doc-992a' from my Sales corpus. It automatically formats the mutation and wipes the poisoned embedding from Vectara's nodes permanently, restoring high accuracy.
Will the RAG Chat tool provide accurate source citations?
Yes. When you instruct the agent to run execute_rag_chat, Vectara processes the query against its internal LLM and index, returning a synthesized natural language answer appended solidly with exact document citations, proving the AI isn't hallucinating facts.
Which VS Code version supports MCP?
MCP support requires VS Code 1.99 or later with the GitHub Copilot extension. Ensure both are updated to the latest version. Older versions of Copilot may not expose the Agent mode toggle.
How do I switch to Agent mode?
Open the Copilot Chat panel and look for two mode options: "Ask" and "Agent". Click "Agent" to enable autonomous tool calling. In Ask mode, Copilot provides conversational answers but cannot invoke MCP tools.
Can I restrict which MCP tools Copilot can access?
Yes. VS Code shows a tool consent dialog before any MCP tool is invoked for the first time. You can also configure tool access policies at the organization level through GitHub Copilot settings.
Does MCP work in VS Code Remote or Codespaces?
Yes. MCP servers configured via .vscode/mcp.json work in Remote SSH, WSL, and GitHub Codespaces environments. The MCP connection is established from the remote host, so ensure the server URL is accessible from that environment.
MCP tools not available
Ensure you are in Agent mode in Copilot Chat. MCP tools only appear in Agent mode.
Explore More MCP Servers
View all →
Quaderno
10 toolsBring automated tax compliance and invoicing directly into your AI workflow — calculate global taxes, issue invoices, and manage CRM contacts in seconds.

ValueSERP
10 toolsBring real-time Google Search data into your AI agent. Search organically, find images, news, scholars, and related questions without getting blocked.

Skyscanner
6 toolsSearch flights worldwide — compare prices by date, find cheapest days to fly and discover flight routes.

Voiceflow
12 toolsDesign, prototype, and launch conversational AI agents with a visual builder that handles complex dialog flows without code.
