RudderStack MCP. Audit data flow from source to warehouse.

Q: How do I use listsources to check my data inputs?

Run listsources first. This gives you a complete catalog of all configured data intake points (e.g., 'Web Analytics,' 'Mobile App'). You then use getsource if you need deep metrics on a specific source.

Q: What is the difference between listconnections and listsources?

listsources tells you what data can enter the system. listconnections tells you which sources are actively mapped to send data out to a destination, confirming the actual flow.

Q: Does listaudiences tell me if my segment is working?

It tells you that an audience exists and lists its details. To know if it's actively syncing and healthy, check listconnections to ensure the source powering that audience is connected.

Q: Do I need listtrackingplans for every data update?

Yes. If you are changing or adding event types, running listtrackingplans lets you validate the required schema first. This prevents bad data from hitting your warehouse.

Claude

ChatGPT

Cursor

Gemini

Windsurf

VS Code

JetBrains

Vercel

See Vinkius in Action

Works with every AI agent you already use

…and any MCP-compatible client

Just plug in your AI agents and start using Vinkius.

RudderStack connects your AI agent directly to RudderStack, letting you audit complex marketing data pipelines and customer event tracking. You can list all configured sources, map connections between those sources and destinations, and verify user segment definitions (audiences).

It turns your chat interface into a live data engineering console.

What your AI agents can do

Get destination

Retrieves specific details for one data endpoint (a destination).

Get source

Retrieves specific configuration and metrics for one data source.

List audiences

Returns a list of every defined user audience segment in the CDP.

+ 4 more capabilities included

Map Data Sources

You can list all configured data sources (list_sources) or get specific details for one source using get_source.

Audit Destinations

You retrieve a list of all data endpoints (list_destinations) or get full metrics on a single destination using get_destination.

Verify Data Flow

You check the entire pipeline integrity by listing all connections between sources and destinations (list_connections).

Manage Tracking Schemas

You list and review defined tracking plans (list_tracking_plans) to ensure your data adheres to required schemas.

Review User Audiences

You pull a comprehensive list of all segmented user groups or audiences defined in the CDP (list_audiences).

Ask AI about this MCP

Ask ChatGPT

Ask Claude

Ask Perplexity

Supported MCP Clients

Claude

ChatGPT

Cursor

Gemini

Windsurf

VS Code

JetBrains

Vercel

+ other MCP clients

Free for Subscribers

Waiting for input…

AI Agent

RudderStack MCP Server: 7 Tools for Data Auditing

Use these seven tools to list, get details on, and validate every component of your data pipeline—from the source where events originate to the final destination warehouse.

get019d7600

get destination

Retrieves specific details for one data endpoint (a destination).

get019d7600

get source

Retrieves specific configuration and metrics for one data source.

list019d7600

list audiences

Returns a list of every defined user audience segment in the CDP.

list019d7600

list connections

Lists all active connections between your data sources and destinations.

list019d7600

list destinations

Returns a complete list of all configured data endpoints within RudderStack.

list019d7600

list sources

Lists every active and inactive data source connected to the platform.

list019d7600

list tracking plans

Returns a catalog of all tracking plans, detailing schema rules for your events.

Choose How to Get Started

Build a custom MCP for your own tools, or connect a ready-made integration from our catalog.

Build Your Own

Turn any API into an MCP. Import a spec, define Agent Skills, or deploy with MCPFusion.

Import from OpenAPI, Swagger, or YAML specs
Create Agent Skills with progressive disclosure
Deploy to edge with MCPFusion framework
Built in DLP, auth, and compliance on every call
Real time usage dashboard and cost metering
Publish to catalog or keep private

Start building

Make Your AI Do More

Start with RudderStack, then connect any of our 4,700+ other servers whenever your AI needs more. One click, no limits.

Use this MCP plus 4,700+ others, all in one place
Add new capabilities to your AI anytime you want
Every connection is secured and compliant automatically
Track usage and costs across all your servers
Works with Claude, ChatGPT, Cursor, and more
New servers added to the catalog every week

What you can do with this MCP connector

You gotta treat your data pipeline like it's live on the line—you don't wanna find out about a broken link when the metrics are already wrong. This MCP Server connects your AI agent straight to RudderStack, letting you audit every single piece of customer event tracking and marketing data right from your chat interface.

It lets your agent act like a dedicated Data Engineer sitting next to you, giving you full visibility into how user data moves through the system—from where it enters until it hits its final storage spot. You'll map out everything, check every connection point, and verify all those tricky segment definitions without ever leaving your chat window.

To start auditing sources, your agent can run list_sources to pull a complete roster of every data source connected to the platform, whether it’s active or just gathering dust. If you know exactly which source you're looking at, calling get_source lets you drill down deep, pulling specific configuration details and performance metrics for that single point of entry.

When you need to see all the places your data is headed, running list_destinations gives you a full catalog of every configured endpoint. You can then narrow that down by using get_destination, which pulls granular information about one specific destination—like its connection protocol or schema requirements.

To verify the whole flow, check all the links between your data sources and destinations; running list_connections shows you a list of every active link. This is critical because it confirms that Source A actually talks to Destination B, which saves you from chasing ghosts in broken pipelines.

For governance, your agent can review how your events are structured by executing list_tracking_plans, which returns a catalog detailing all the schema rules enforced on your event types. If you need to know what specific user groups exist within the Customer Data Platform (CDP), running list_audiences pulls every defined segment or audience list available for querying.

Together, these tools let you manage and validate everything: You'll see all sources via list_sources, get deep metrics on a source using get_source; you’ll map out endpoints with list_destinations, check details on one endpoint with get_destination; the full pipeline integrity is verified by checking connections via list_connections; your data structure must adhere to rules defined in list_tracking_plans; and finally, you can pull a list of every user segment using list_audiences.

How RudderStack MCP Works

1 Connect your AI client to the RudderStack MCP Server. You'll need an API access token from your RudderStack account settings.
2 Your agent executes a command like 'List all sources,' triggering the list_sources tool call against your live CDP environment.
3 The server returns structured data—a list of active sources, their IDs, and current status—which your AI client reads and presents to you.

The bottom line is: it lets you talk to your data infrastructure using plain language commands.

Who Is RudderStack MCP For?

This server is for the DevOps Engineer who needs to verify a broken data pipeline at 2 am. It's for Marketing Operations staff needing to prove that event schemas are correct before a campaign launches, and for Data Architects auditing connectivity across dozens of systems. You use this when 'where did my data go?' isn't enough—you need to know why it went there.

Data Engineer

You run list_connections and get_source repeatedly, tracing dependency graphs after an ETL failure. You check the logs for that.

Marketing Operations Manager

You confirm if a new campaign's custom event type is correctly included in all relevant tracking plans using list_tracking_plans before launch day.

Enterprise Administrator

You run full audits, checking that every defined audience (list_audiences) has a valid connection and schema definition across the whole platform.

What Changes When You Connect

Verify Data Integrity: Use list_connections to see if your sources actually link up. This immediately flags broken pipelines before they lose customer event data.
Schema Control: Run list_tracking_plans to check the required schema for a specific event type. You won't send bad data because you checked the plan first.
Full Visibility: Listing all components (list_sources, list_destinations) gives your agent a map of every single node in your entire data ecosystem, even the forgotten ones.
Segment Validation: Use list_audiences to confirm that the user segments you built yesterday are still synced and available for today's ad campaign targeting.
Deep Source Inspection: If one source looks suspicious, get_source lets you drill down into its specific metrics without having to jump through three different dashboards.

Real-World Use Cases

Debugging a Broken Funnel

A user notices that 'purchase' events aren't making it to Snowflake. They ask their agent: 'What connection links the website source to Snowflake?' The agent runs list_connections, finds the broken link ID (conn_xyz), and reports that the flow mapping is incomplete, saving hours of manual debugging.

Pre-Launch Schema Audit

Marketing Ops needs to launch a new feature. They use list_tracking_plans to verify if the required 'feature_usage' event type schema exists and is enforced across all relevant sources, confirming compliance before writing a single line of code.

Identifying Data Leakage

An admin suspects data might be going to an old, abandoned marketing tool. They run list_destinations and compare the output against the current active destination list, spotting an unused endpoint that needs to be decommissioned.

Auditing Audience Syncs

Before running a major retargeting campaign, the agent checks using list_audiences to confirm that all necessary personalized sub-clusters (e.g., 'iOS App Users') are successfully synced and available in the data warehouse for querying.

The Tradeoffs

Querying connections without validation

Asking, 'Is my website sending data to Snowflake?' without first checking if a connection exists. This leads to vague answers or failed tool calls because the agent doesn't know which IDs to check.

→ Always start by running list_connections to get the required ID format and confirm the source-to-destination pair is active before asking about data flow.

Assuming all sources are connected

Thinking that because a source exists (e.g., 'Facebook Pixel'), it must be feeding into every destination listed in your CDP.

→ Never assume connectivity. Use list_connections to see only the established, verified paths between specific sources and destinations.

Confusing schema with audience

Believing that simply listing an audience (list_audiences) means its data is ready for immediate querying. You need to check the underlying connection first.

→ If you pull a list of audiences, follow up by checking list_connections to ensure the source powering those segments can actually talk to your destination.

When It Fits, When It Doesn't

Use this server if your data architecture is complex and involves multiple stages: Source -> Connection -> Tracking Plan -> Destination. If you need to verify how data moves from point A to B, or if you're debugging a specific schema failure, this is mandatory. However, don't use it if all you need is a simple log dump—a basic database client might be faster. Also, if your biggest concern is just reading raw metrics on a single table without worrying about its upstream lineage, get_source alone will only solve half the problem. You need the full orchestration provided by checking connections and plans.

Independent Platform Disclaimer: Vinkius is an independent platform and is not affiliated with, endorsed by, sponsored by, verified by, or otherwise authorized by RudderStack API. All third-party trademarks, logos, and brand names are the property of their respective owners. Their use on this website is strictly for informational purposes to identify service compatibility and interoperability.

VINKIUS INFRASTRUCTURE

Cloud Hosted

Managed infra

V8 Isolated

Sandboxed per request

Zero-Trust Proxy

No stored credentials

DLP Enforced

Policy on every call

GDPR Compliant

EU data residency

Token Compression

~60% cost reduction

How we secure it →

Works with Claude, ChatGPT, Cursor, and more

The Model Context Protocol standardizes how applications expose capabilities to LLMs. Instead of operating in isolation, your AI gains direct access to external platforms, live data, and real-world actions through secure, standardized connections.

This server provides 7 capabilities that interface natively with Claude, ChatGPT, Cursor, and any MCP client. No middleware. No custom integration required.

Available Capabilities

get_destination get_source list_audiences list_connections list_destinations list_sources list_tracking_plans

Tracking data flow across dozens of microservices shouldn't require opening ten different tabs.

Today, figuring out a data path means jumping between the source dashboard to check event metrics, then going into the connection manager to see if it’s enabled, and finally checking the destination logs just to confirm nothing dropped. It's clicking through three different UIs and copying IDs back and forth.

With this MCP server, you ask your agent: 'Show me the path for user signup data.' You get a single, consolidated answer that references the source, verifies the connection status via `list_connections`, and confirms the schema using `list_tracking_plans`. It’s immediate.

RudderStack MCP Server: Audit the Data Lineage

Manually auditing data lineage involves running reports on five different internal systems—the web source, the backend service, the segment database, and two separate analytics warehouses. You spend hours trying to piece together a single truth.

Now, you ask your agent: 'What’s the full path for user ID 123?' The server runs `get_source`, then checks connections, then confirms the final destination, giving you one definitive answer instantly.

Common Questions About RudderStack MCP

How do I use list_sources to check my data inputs? +

Run list_sources first. This gives you a complete catalog of all configured data intake points (e.g., 'Web Analytics,' 'Mobile App'). You then use get_source if you need deep metrics on a specific source.

What is the difference between list_connections and list_sources? +

list_sources tells you what data can enter the system. list_connections tells you which sources are actively mapped to send data out to a destination, confirming the actual flow.

Does list_audiences tell me if my segment is working? +

It tells you that an audience exists and lists its details. To know if it's actively syncing and healthy, check list_connections to ensure the source powering that audience is connected.

Do I need list_tracking_plans for every data update? +

Yes. If you are changing or adding event types, running list_tracking_plans lets you validate the required schema first. This prevents bad data from hitting your warehouse.

When I use `list_sources`, what security protocols govern how my data streams are viewed? +

The connection relies on OAuth or API Key authentication. Your AI client executes the call using credentials you provide, ensuring that only authorized requests can list source details. We never expose raw tokens; your agent interacts with encrypted endpoints.

If I run `list_connections`, how do I identify a connection that is experiencing data drops or latency? +

The output provides real-time metrics on event throughput and error counts. Look for 'dropped events' or high 'latency averages'; these numbers tell you exactly where the pipeline stalls. Zero drops means a healthy flow.

Does running `get_destination` limit how many data points I can audit in one request? +

No, the tool handles large datasets efficiently by paginating results automatically. You don't hit a hard cap; however, remember that massive volume queries might slow down your agent's response time.

When I check `list_tracking_plans`, does it confirm if my source is capturing custom event types? +

It confirms the schema required for tracking. To verify specific custom events, you must cross-reference the listed plan details against your internal data dictionary. It shows what should be captured.

Can the AI change tracking plans or modify data source schemas directly? +

No, this integration limits actions inherently strictly internally seamlessly natively gracefully fully perfectly properly reliably securely precisely solely toward organically gracefully strictly accessing read-only operations effectively smoothly efficiently successfully parsing natively purely locally correctly efficiently safely retrieving data logically effortlessly organically reliably cleanly dynamically dynamically safely correctly reading data purely explicitly properly.

Can the AI list audience segments and their sync status? +

Yes. Use list_audiences to retrieve all configured audience segments, including their names and associated destination syncs. This is useful for verifying remarketing pipelines.

Which destination types does the integration support? +

The integration queries any destination configured in your RudderStack workspace — data warehouses, analytics platforms, marketing tools, and cloud storage. Use list_destinations to see all active endpoints.

Use it with your favorite AI tools

Connect this server to Cursor, Claude, VS Code, and more.

OpenAI Agents SDK sdk-python

Google ADK sdk-python

Pydantic AI sdk-python

Vercel AI SDK sdk-typescript