Typesense Cloud MCP. Diagnose search latency and cluster status instantly.
Works with every AI agent you already use
…and any MCP-compatible client
Just plug in your AI agents and start using Vinkius.
Typesense Cloud MCP Server handles full search cluster diagnostics. You use it to check operational health status, fetch real-time performance metrics, list all collections and virtual aliases, audit API keys, or run multiple complex searches in one go.
What your AI agents can do
Execute multi search
Sends a JSON array of search requests to run multiple queries in one API call.
Get cluster health
Checks and reports the current operational health status of the entire Typesense cluster.
Get cluster metrics
Pulls performance data, including search latency, CPU usage, and resource consumption logs.
Determines if all nodes in your Typesense cluster are online and responding normally.
Fetches quantitative data, including search latency (median/99th percentile), CPU usage, and overall resource consumption patterns.
Runs several independent search queries across different collections in a single API call using JSON array input.
Retrieves the names of every searchable collection currently configured on the cluster.
Lists virtual aliases that map to your real data collections, helping you manage public access points cleanly.
Retrieves a list of all active and configured API keys for the cluster for auditing purposes.
Ask AI about this MCP
Supported MCP Clients
Waiting for input…
Typesense Cloud MCP Server: 6 Tools for Search Ops
Use these six tools to diagnose cluster issues, audit configurations, or execute complex searches without writing any code.
019d7617execute multi search
Sends a JSON array of search requests to run multiple queries in one API call.
019d7617get cluster health
Checks and reports the current operational health status of the entire Typesense cluster.
019d7617get cluster metrics
Pulls performance data, including search latency, CPU usage, and resource consumption logs.
019d7617list api keys
Retrieves a full list of all API keys configured for the Typesense cluster.
019d7617list collection aliases
Lists all virtual aliases that map external names to internal collections.
019d7617list collections
Gives a list of every actual searchable data collection hosted on the cluster.
Choose How to Get Started
Build a custom MCP for your own tools, or connect a ready-made integration from our catalog.
Build Your Own
Turn any API into an MCP. Import a spec, define Agent Skills, or deploy with MCPFusion.
- Import from OpenAPI, Swagger, or YAML specs
- Create Agent Skills with progressive disclosure
- Deploy to edge with MCPFusion framework
- Built in DLP, auth, and compliance on every call
- Real time usage dashboard and cost metering
- Publish to catalog or keep private
Make Your AI Do More
Start with Typesense Cloud, then connect any of our 4,700+ other servers whenever your AI needs more. One click, no limits.
- Use this MCP plus 4,700+ others, all in one place
- Add new capabilities to your AI anytime you want
- Every connection is secured and compliant automatically
- Track usage and costs across all your servers
- Works with Claude, ChatGPT, Cursor, and more
- New servers added to the catalog every week
What you can do with this MCP connector
Look, you're running a search cluster. It's complex, right? You don't want to be logged into some terminal just poking around with curl commands trying to figure out why latency spiked last Tuesday. This server lets your agent talk directly to the Typesense backend. Instead of manually checking things, your AI client does it for you.
When you connect this endpoint, you get full diagnostic control over a live search cluster. You're not just getting simple status lights; you're pulling quantitative data so you know exactly what's happening under the hood.
Checking Core Operational Status
To see if everything's running right, your agent uses get_cluster_health. This tool checks all nodes in the cluster to confirm they're online and responding normally. It tells you immediately if any part of the system is down or struggling with basic connectivity issues.
If you need deeper performance intel, you run get_cluster_metrics. This pulls specific numbers: search latency—you get both the median time and the 99th percentile—along with CPU usage reports and logs detailing overall resource consumption. You'll see patterns in how much work the cluster is handling.
Managing Data Structure and Access Points
You gotta know what data you’re searching, and who has keys to it. To get a list of every searchable collection configured on the cluster, call list_collections. This gives you the names of all your actual data sources.
When you need to audit how users access those collections, use list_collection_aliases. This tool lists virtual aliases; these are clean public-facing names that map back to your real, internal data collections. It keeps your API management tidy.
For security, always check the keys. Run list_api_keys to pull a complete list of every active and configured API key for the whole cluster. You'll know exactly who has access and what they can do with it.
Running Complex Searches
The most powerful thing here is running queries efficiently. Instead of sending one search request at a time, you use execute_multi_search. This tool lets your agent send a JSON array containing multiple independent search requests in a single API call. You can query several different collections simultaneously and get all the raw JSON results back without having to juggle ten separate calls.
Think of it like this: Need to check inventory across five regions, and also run a trending topic search on articles, all at once? You bundle those into one request using execute_multi_search. The agent handles the complex payload structure, so you just get clean results. It’s about batching work—getting everything done in one go.
This gives your AI client native control over diagnosis and querying. You don't need to learn a bunch of command-line syntax; you just tell your agent what job needs doing, whether it's checking the 99th percentile latency or running five different searches across three collections. The agent executes the right tool call against the Typesense backend.
How Typesense Cloud MCP Works
- 1 First, subscribe to the server and provide your Typesense Host URL and API Key.
- 2 Second, activate your AI client (Claude, Cursor, etc.) within the MCP environment.
- 3 Third, ask your agent a question—like 'Check cluster health' or 'Get metrics for last hour.' The agent executes the required tool call and presents structured data back to you.
The bottom line is: You stop writing code to check the database and start talking to it.
Who Is Typesense Cloud MCP For?
This server is for DevOps Engineers, Search Architects, and DBAs. If your job involves keeping a distributed search engine running 24/7, you need this. It’s built for the ops engineer who's tired of clicking through ten different dashboards at 2 am just to find out why search latency spiked.
Uses get_cluster_health and get_cluster_metrics to spot performance degradation before users complain, minimizing downtime.
Validates complex query flows by running multi-searches (execute_multi_search) across multiple collections before deployment.
Runs list_api_keys and list_collection_aliases to audit security boundaries and ensure proper data mapping.
What Changes When You Connect
- Stop running manual
curlcommands. By using the agent to runget_cluster_metrics, you get structured, actionable data about latency thresholds without leaving your chat interface. - Audit security quickly. Use
list_api_keyswhenever a new client or integration comes online. This keeps track of every active key for proper governance. - Test complex queries safely. The
execute_multi_searchtool lets you send multiple search requests at once. You test the whole flow before committing code. - Understand your data model. Use
list_collectionsandlist_collection_aliasestogether to map what's visible externally versus what actually exists in the database. - Get immediate status checks. Need to know if the cluster is up? A simple prompt triggers
get_cluster_health, giving you an instant 'ok' or warning.
Real-World Use Cases
Investigating a Latency Spike
A user reports slow search results. Instead of guessing, the agent first runs get_cluster_health to confirm node status. If healthy, it immediately uses get_cluster_metrics to pinpoint if the issue is high CPU or a specific latency spike at the 99th percentile.
Preparing for a New Feature
A new e-commerce catalog needs searching. The architect first runs list_collections to see existing indexes. Then, they use list_collection_aliases to create the public entry point and test it with a multi-search via execute_multi_search.
Security Compliance Audit
Before an external team gets access, the DBA runs list_api_keys. They verify that only necessary keys exist. If they find too many, they know exactly what needs to be revoked and why.
Validating Search Endpoints
A developer wants to validate a complex search logic before deployment. Using execute_multi_search, they simulate the exact query using raw JSON schemas, guaranteeing it works across all targeted collections simultaneously.
The Tradeoffs
Debugging sequentially
Manually checking 'Is it up? -> What are the metrics? -> Which keys exist?' This takes jumping between a dashboard, a CLI, and an API docs page.
→
Use your agent to orchestrate the calls. Start with get_cluster_health, then follow up with get_cluster_metrics in one conversation thread for a complete diagnostic picture.
Assuming collection structure
Running a search query and getting an error, but not knowing if the underlying collection or alias is misspelled.
→
First, run list_collections to see all available indexes. Then, use list_collection_aliases to check what public names map to those physical collections.
Over-querying for a single bug
Running five different diagnostic tools back-to-back in separate sessions because the first tool didn't provide enough detail.
→
Focus your initial diagnosis. If latency is suspected, run get_cluster_metrics. Don't waste calls on keys or aliases until performance metrics point to an authentication failure.
When It Fits, When It Doesn't
Use this server if your problem space involves the operational status, measurable performance (latency/CPU), or structured querying of a Typesense search cluster. You need it when you have credentials and access to the cloud endpoint.
Don't use it if:
1. Your issue is network-level hardware failure (e.g., local ISP outage). This server only deals with the API layer.
2. You are trying to write the search query itself. The tools execute queries; they don't design them. Use execute_multi_search when you have the JSON payload ready.
3. Your problem is unrelated to Typesense (e.g., a bug in your front-end React code). This tool only talks to the search backend.
Independent Platform Disclaimer: Vinkius is an independent platform and is not affiliated with, endorsed by, sponsored by, verified by, or otherwise authorized by Typesense Cloud. All third-party trademarks, logos, and brand names are the property of their respective owners. Their use on this website is strictly for informational purposes to identify service compatibility and interoperability.
VINKIUS INFRASTRUCTURE
Cloud Hosted
Managed infra
V8 Isolated
Sandboxed per request
Zero-Trust Proxy
No stored credentials
DLP Enforced
Policy on every call
GDPR Compliant
EU data residency
Token Compression
~60% cost reduction
Works with Claude, ChatGPT, Cursor, and more
The Model Context Protocol standardizes how applications expose capabilities to LLMs. Instead of operating in isolation, your AI gains direct access to external platforms, live data, and real-world actions through secure, standardized connections.
This server provides 6 capabilities that interface natively with Claude, ChatGPT, Cursor, and any MCP client. No middleware. No custom integration required.
Available Capabilities
Diagnosing cluster failures shouldn't require memorizing CLI commands.
Today, figuring out why search is slow means logging into a terminal. You run `curl` against an endpoint, you check the response headers, and if that fails, you have to switch screens to a dashboard to see resource consumption. It’s a painful cycle of copy-pasting errors and jumping between tabs.
With this MCP server, you just tell your agent: 'Check my search performance.' The agent handles the `get_cluster_health` call, grabs the metrics via `get_cluster_metrics`, and spits out a clean summary. You get the diagnosis in natural language, not raw JSON dumps.
Typesense Cloud MCP Server gives you full control over your search backend.
Before this server, managing collections meant manually confirming names and dependencies across multiple internal systems. If a developer changed an alias name in the UI, it wasn't always clear where that change propagated or what old references were left hanging around.
Now, you can run `list_collection_aliases` to see every virtual mapping and cross-reference it with `list_collections`. It makes managing your entire search structure simple and auditable.
Common Questions About Typesense Cloud MCP
How do I check if my Typesense cluster is running right now using get_cluster_health? +
You simply ask the agent to run get_cluster_health. It checks the operational status and returns a clear 'ok' or an error code, telling you exactly where the connection broke.
What is the difference between list_collections and list_collection_aliases? +
list_collections shows the actual physical data indexes. list_collection_aliases shows the virtual names that point to those physical collections, which is useful for public-facing APIs.
Can I run multiple different searches at once with execute_multi_search? +
Yes. You provide a JSON array of search requests, and execute_multi_search runs them all in one API call. This is much faster than running individual queries sequentially.
How do I see if my current API key has been used? +
You run the list_api_keys tool. This gives you a list of all keys configured and helps track who might have access to the cluster data.
How can I check resource usage details with get_cluster_metrics? +
It reports real-time performance metrics, not just simple status. You get data points like average search latency (median) and node resource consumption patterns, including CPU usage percentages.
What information does list_api_keys provide about my cluster access? +
It lists every API key configured for the Typesense cloud. This allows you to audit your credentials quickly, helping you track which keys are active and manage potential security risks.
Why should I use an alias instead of a collection name when using list_collection_aliases? +
Aliases abstract the real underlying structure. Using them decouples your code from physical data changes, meaning you can update the actual collections without breaking applications that rely on the virtual alias.
If I run multiple searches with execute_multi_search, what happens if one query fails? +
The tool attempts all requests and reports results for every successful search. If a single request fails, it returns an error code for that specific object without stopping the processing of other valid queries.
Can the AI provide an analysis if our search endpoints hit performance degradation? +
Yes. Instructing the agent to run the 'get_cluster_metrics' tool will pull out CPU times, active requests processing, and milliseconds threshold details so the AI inherently explains why the node slows down.
Can it execute a multi-search batch via chat? +
Absolutely. Give it a target array of searches (e.g., search product_A and product_B tables). It formats the payload perfectly to hit the endpoint natively and unpacks both search answers logically back to you.
Does it detect empty aliases mapped incorrectly? +
Yes! When listing all collections and matching them against 'list_aliases', the agent can see if a virtual layer points to a non-existent cluster data structure instantly. Very handy for debugging production releases.
Use it with your favorite AI tools
Connect this server to Cursor, Claude, VS Code, and more.
More in this category
Gitea
Manage self-hosted Git via Gitea — list and manage repositories, track issues and pull requests, handle organizations, and audit branches directly from any AI agent.
Faker Data Generator
Generate realistic fake data in seconds — names, emails, addresses, credit cards, companies, and more. 60+ locales including Brazilian Portuguese. The most complete data generator in the ecosystem, with 5M+ weekly downloads.
Directus
Manage any SQL database via Directus — handle collection items, audit schemas and fields, manage users, and track media storage directly from any AI agent.
You might also like
Emoji Name Resolver
Convert emoji shortcodes to Unicode and back (:rocket: ↔ 🚀). Essential for Slack, Discord, and GitHub cross-platform compatibility.
1msg.io
Official WhatsApp Business API automation — send messages, manage templates, and monitor channel status via AI.
AppTweak
Bring App Store Optimization (ASO) to your AI agent — track app rankings, fetch keyword volumes, and analyze competitor downloads via chat.