Typesense Cloud MCP. Diagnose search latency and cluster status instantly.

Q: How do I check if my Typesense cluster is running right now using getclusterhealth?

You simply ask the agent to run getclusterhealth. It checks the operational status and returns a clear 'ok' or an error code, telling you exactly where the connection broke.

Q: What is the difference between listcollections and listcollectionaliases?

listcollections shows the actual physical data indexes. listcollectionaliases shows the virtual names that point to those physical collections, which is useful for public-facing APIs.

Q: Can I run multiple different searches at once with executemultisearch?

Yes. You provide a JSON array of search requests, and executemultisearch runs them all in one API call. This is much faster than running individual queries sequentially.

Q: How do I see if my current API key has been used?

You run the listapikeys tool. This gives you a list of all keys configured and helps track who might have access to the cluster data.

Claude

ChatGPT

Cursor

Gemini

Windsurf

VS Code

JetBrains

Vercel

See Vinkius in Action

Works with every AI agent you already use

…and any MCP-compatible client

Just plug in your AI agents and start using Vinkius.

Typesense Cloud MCP Server handles full search cluster diagnostics. You use it to check operational health status, fetch real-time performance metrics, list all collections and virtual aliases, audit API keys, or run multiple complex searches in one go.

What your AI agents can do

Execute multi search

Sends a JSON array of search requests to run multiple queries in one API call.

Get cluster health

Checks and reports the current operational health status of the entire Typesense cluster.

Get cluster metrics

Pulls performance data, including search latency, CPU usage, and resource consumption logs.

+ 3 more capabilities included

Check Cluster Health

Determines if all nodes in your Typesense cluster are online and responding normally.

Retrieve Performance Metrics

Fetches quantitative data, including search latency (median/99th percentile), CPU usage, and overall resource consumption patterns.

Execute Multi-Searches

Runs several independent search queries across different collections in a single API call using JSON array input.

List All Collections

Retrieves the names of every searchable collection currently configured on the cluster.

Audit Collection Aliases

Lists virtual aliases that map to your real data collections, helping you manage public access points cleanly.

Manage API Keys

Retrieves a list of all active and configured API keys for the cluster for auditing purposes.

Ask AI about this MCP

Ask ChatGPT

Ask Claude

Ask Perplexity

Supported MCP Clients

Claude

ChatGPT

Cursor

Gemini

Windsurf

VS Code

JetBrains

Vercel

+ other MCP clients

Free for Subscribers

Waiting for input…

AI Agent

Typesense Cloud MCP Server: 6 Tools for Search Ops

Use these six tools to diagnose cluster issues, audit configurations, or execute complex searches without writing any code.

execute019d7617

execute multi search

Sends a JSON array of search requests to run multiple queries in one API call.

get019d7617

get cluster health

Checks and reports the current operational health status of the entire Typesense cluster.

get019d7617

get cluster metrics

Pulls performance data, including search latency, CPU usage, and resource consumption logs.

list019d7617

list api keys

Retrieves a full list of all API keys configured for the Typesense cluster.

list019d7617

list collection aliases

Lists all virtual aliases that map external names to internal collections.

list019d7617

list collections

Gives a list of every actual searchable data collection hosted on the cluster.

Choose How to Get Started

Build a custom MCP for your own tools, or connect a ready-made integration from our catalog.

Build Your Own

Turn any API into an MCP. Import a spec, define Agent Skills, or deploy with MCPFusion.

Import from OpenAPI, Swagger, or YAML specs
Create Agent Skills with progressive disclosure
Deploy to edge with MCPFusion framework
Built in DLP, auth, and compliance on every call
Real time usage dashboard and cost metering
Publish to catalog or keep private

Start building

Make Your AI Do More

Start with Typesense Cloud, then connect any of our 4,700+ other servers whenever your AI needs more. One click, no limits.

Use this MCP plus 4,700+ others, all in one place
Add new capabilities to your AI anytime you want
Every connection is secured and compliant automatically
Track usage and costs across all your servers
Works with Claude, ChatGPT, Cursor, and more
New servers added to the catalog every week

What you can do with this MCP connector

Look, you're running a search cluster. It's complex, right? You don't want to be logged into some terminal just poking around with curl commands trying to figure out why latency spiked last Tuesday. This server lets your agent talk directly to the Typesense backend. Instead of manually checking things, your AI client does it for you.

When you connect this endpoint, you get full diagnostic control over a live search cluster. You're not just getting simple status lights; you're pulling quantitative data so you know exactly what's happening under the hood.

Checking Core Operational Status

To see if everything's running right, your agent uses get_cluster_health. This tool checks all nodes in the cluster to confirm they're online and responding normally. It tells you immediately if any part of the system is down or struggling with basic connectivity issues.

If you need deeper performance intel, you run get_cluster_metrics. This pulls specific numbers: search latency—you get both the median time and the 99th percentile—along with CPU usage reports and logs detailing overall resource consumption. You'll see patterns in how much work the cluster is handling.

Managing Data Structure and Access Points

You gotta know what data you’re searching, and who has keys to it. To get a list of every searchable collection configured on the cluster, call list_collections. This gives you the names of all your actual data sources.

When you need to audit how users access those collections, use list_collection_aliases. This tool lists virtual aliases; these are clean public-facing names that map back to your real, internal data collections. It keeps your API management tidy.

For security, always check the keys. Run list_api_keys to pull a complete list of every active and configured API key for the whole cluster. You'll know exactly who has access and what they can do with it.

Running Complex Searches

The most powerful thing here is running queries efficiently. Instead of sending one search request at a time, you use execute_multi_search. This tool lets your agent send a JSON array containing multiple independent search requests in a single API call. You can query several different collections simultaneously and get all the raw JSON results back without having to juggle ten separate calls.

Think of it like this: Need to check inventory across five regions, and also run a trending topic search on articles, all at once? You bundle those into one request using execute_multi_search. The agent handles the complex payload structure, so you just get clean results. It’s about batching work—getting everything done in one go.

This gives your AI client native control over diagnosis and querying. You don't need to learn a bunch of command-line syntax; you just tell your agent what job needs doing, whether it's checking the 99th percentile latency or running five different searches across three collections. The agent executes the right tool call against the Typesense backend.

How Typesense Cloud MCP Works

1 First, subscribe to the server and provide your Typesense Host URL and API Key.
2 Second, activate your AI client (Claude, Cursor, etc.) within the MCP environment.
3 Third, ask your agent a question—like 'Check cluster health' or 'Get metrics for last hour.' The agent executes the required tool call and presents structured data back to you.

The bottom line is: You stop writing code to check the database and start talking to it.

Who Is Typesense Cloud MCP For?

This server is for DevOps Engineers, Search Architects, and DBAs. If your job involves keeping a distributed search engine running 24/7, you need this. It’s built for the ops engineer who's tired of clicking through ten different dashboards at 2 am just to find out why search latency spiked.

DevOps Engineer

Uses get_cluster_health and get_cluster_metrics to spot performance degradation before users complain, minimizing downtime.

Search Architect

Validates complex query flows by running multi-searches (execute_multi_search) across multiple collections before deployment.

Database Administrator (DBA)

Runs list_api_keys and list_collection_aliases to audit security boundaries and ensure proper data mapping.

What Changes When You Connect

Stop running manual curl commands. By using the agent to run get_cluster_metrics, you get structured, actionable data about latency thresholds without leaving your chat interface.
Audit security quickly. Use list_api_keys whenever a new client or integration comes online. This keeps track of every active key for proper governance.
Test complex queries safely. The execute_multi_search tool lets you send multiple search requests at once. You test the whole flow before committing code.
Understand your data model. Use list_collections and list_collection_aliases together to map what's visible externally versus what actually exists in the database.
Get immediate status checks. Need to know if the cluster is up? A simple prompt triggers get_cluster_health, giving you an instant 'ok' or warning.

Real-World Use Cases

Investigating a Latency Spike

A user reports slow search results. Instead of guessing, the agent first runs get_cluster_health to confirm node status. If healthy, it immediately uses get_cluster_metrics to pinpoint if the issue is high CPU or a specific latency spike at the 99th percentile.

Preparing for a New Feature

A new e-commerce catalog needs searching. The architect first runs list_collections to see existing indexes. Then, they use list_collection_aliases to create the public entry point and test it with a multi-search via execute_multi_search.

Security Compliance Audit

Before an external team gets access, the DBA runs list_api_keys. They verify that only necessary keys exist. If they find too many, they know exactly what needs to be revoked and why.

Validating Search Endpoints

A developer wants to validate a complex search logic before deployment. Using execute_multi_search, they simulate the exact query using raw JSON schemas, guaranteeing it works across all targeted collections simultaneously.

The Tradeoffs

Debugging sequentially

Manually checking 'Is it up? -> What are the metrics? -> Which keys exist?' This takes jumping between a dashboard, a CLI, and an API docs page.

→ Use your agent to orchestrate the calls. Start with get_cluster_health, then follow up with get_cluster_metrics in one conversation thread for a complete diagnostic picture.

Assuming collection structure

Running a search query and getting an error, but not knowing if the underlying collection or alias is misspelled.

→ First, run list_collections to see all available indexes. Then, use list_collection_aliases to check what public names map to those physical collections.

Over-querying for a single bug

Running five different diagnostic tools back-to-back in separate sessions because the first tool didn't provide enough detail.

→ Focus your initial diagnosis. If latency is suspected, run get_cluster_metrics. Don't waste calls on keys or aliases until performance metrics point to an authentication failure.

When It Fits, When It Doesn't

Use this server if your problem space involves the operational status, measurable performance (latency/CPU), or structured querying of a Typesense search cluster. You need it when you have credentials and access to the cloud endpoint.

Don't use it if:
1. Your issue is network-level hardware failure (e.g., local ISP outage). This server only deals with the API layer.
2. You are trying to write the search query itself. The tools execute queries; they don't design them. Use execute_multi_search when you have the JSON payload ready.
3. Your problem is unrelated to Typesense (e.g., a bug in your front-end React code). This tool only talks to the search backend.

Independent Platform Disclaimer: Vinkius is an independent platform and is not affiliated with, endorsed by, sponsored by, verified by, or otherwise authorized by Typesense Cloud. All third-party trademarks, logos, and brand names are the property of their respective owners. Their use on this website is strictly for informational purposes to identify service compatibility and interoperability.

VINKIUS INFRASTRUCTURE

Cloud Hosted

Managed infra

V8 Isolated

Sandboxed per request

Zero-Trust Proxy

No stored credentials

DLP Enforced

Policy on every call

GDPR Compliant

EU data residency

Token Compression

~60% cost reduction

How we secure it →

Works with Claude, ChatGPT, Cursor, and more

The Model Context Protocol standardizes how applications expose capabilities to LLMs. Instead of operating in isolation, your AI gains direct access to external platforms, live data, and real-world actions through secure, standardized connections.

This server provides 6 capabilities that interface natively with Claude, ChatGPT, Cursor, and any MCP client. No middleware. No custom integration required.

Available Capabilities

execute_multi_search get_cluster_health get_cluster_metrics list_api_keys list_collection_aliases list_collections

Diagnosing cluster failures shouldn't require memorizing CLI commands.

Today, figuring out why search is slow means logging into a terminal. You run `curl` against an endpoint, you check the response headers, and if that fails, you have to switch screens to a dashboard to see resource consumption. It’s a painful cycle of copy-pasting errors and jumping between tabs.

With this MCP server, you just tell your agent: 'Check my search performance.' The agent handles the `get_cluster_health` call, grabs the metrics via `get_cluster_metrics`, and spits out a clean summary. You get the diagnosis in natural language, not raw JSON dumps.

Typesense Cloud MCP Server gives you full control over your search backend.

Before this server, managing collections meant manually confirming names and dependencies across multiple internal systems. If a developer changed an alias name in the UI, it wasn't always clear where that change propagated or what old references were left hanging around.

Now, you can run `list_collection_aliases` to see every virtual mapping and cross-reference it with `list_collections`. It makes managing your entire search structure simple and auditable.

Common Questions About Typesense Cloud MCP

How do I check if my Typesense cluster is running right now using get_cluster_health? +

You simply ask the agent to run get_cluster_health. It checks the operational status and returns a clear 'ok' or an error code, telling you exactly where the connection broke.

What is the difference between list_collections and list_collection_aliases? +

list_collections shows the actual physical data indexes. list_collection_aliases shows the virtual names that point to those physical collections, which is useful for public-facing APIs.

Can I run multiple different searches at once with execute_multi_search? +

Yes. You provide a JSON array of search requests, and execute_multi_search runs them all in one API call. This is much faster than running individual queries sequentially.

How do I see if my current API key has been used? +

You run the list_api_keys tool. This gives you a list of all keys configured and helps track who might have access to the cluster data.

How can I check resource usage details with get_cluster_metrics? +

It reports real-time performance metrics, not just simple status. You get data points like average search latency (median) and node resource consumption patterns, including CPU usage percentages.

What information does list_api_keys provide about my cluster access? +

It lists every API key configured for the Typesense cloud. This allows you to audit your credentials quickly, helping you track which keys are active and manage potential security risks.

Why should I use an alias instead of a collection name when using list_collection_aliases? +

Aliases abstract the real underlying structure. Using them decouples your code from physical data changes, meaning you can update the actual collections without breaking applications that rely on the virtual alias.

If I run multiple searches with execute_multi_search, what happens if one query fails? +

The tool attempts all requests and reports results for every successful search. If a single request fails, it returns an error code for that specific object without stopping the processing of other valid queries.

Can the AI provide an analysis if our search endpoints hit performance degradation? +

Yes. Instructing the agent to run the 'get_cluster_metrics' tool will pull out CPU times, active requests processing, and milliseconds threshold details so the AI inherently explains why the node slows down.

Can it execute a multi-search batch via chat? +

Absolutely. Give it a target array of searches (e.g., search product_A and product_B tables). It formats the payload perfectly to hit the endpoint natively and unpacks both search answers logically back to you.

Does it detect empty aliases mapped incorrectly? +

Yes! When listing all collections and matching them against 'list_aliases', the agent can see if a virtual layer points to a non-existent cluster data structure instantly. Very handy for debugging production releases.

Use it with your favorite AI tools

Connect this server to Cursor, Claude, VS Code, and more.

OpenAI Agents SDK sdk-python

Google ADK sdk-python

Pydantic AI sdk-python

Vercel AI SDK sdk-typescript