Azure Synapse Analytics MCP. Manage data pipelines and audit pools without leaving your chat.

Q: How do I check the status of my data pipelines using listpipelines?

You first use listpipelines to get a manifest of all available pipelines. Then, if you suspect a specific job, you run getpipeline with the name to pull its full definition and check its latest run status.

Q: Can I see all my Spark notebooks with listnotebooks?

Yes, listnotebooks gives you a list of all Spark notebooks. This helps you see which code assets exist and who last modified them, letting you quickly find the right starting point for your analysis.

Q: How do I audit all my compute pools?

You run listsqlpools and listsparkpools. These two tools cover the dedicated/serverless SQL capacity and the dedicated Spark compute, giving you a full picture of your compute capacity.

Q: How do I check the type of compute pools using listsqlpools?

listsqlpools lists both dedicated and serverless SQL analytics pools. This lets you immediately see if a pool is designed for high-performance, dedicated workloads or for ad-hoc, serverless queries.

Q: What information does getpipeline provide about a single data flow?

getpipeline returns the full, precise definition of the specified Azure Synapse pipeline. You can use this definition to trace every step, including activity target parameters and logical flow.

Q: Can I view external dependencies using listlinkedservices?

listlinkedservices lists all explicit external connections. It shows dependencies referencing services like Key Vaults or Blob Storages, helping you audit the entire data ecosystem's security perimeter.

Q: How do I find specific Spark notebooks using listnotebooks?

listnotebooks retrieves a list of all stored Spark notebooks. You can then specify notebook metadata or IDs to pinpoint exactly which analysis script you need to review.

Claude

ChatGPT

Cursor

Gemini

Windsurf

VS Code

JetBrains

Vercel

See Vinkius in Action

Works with every AI agent you already use

…and any MCP-compatible client

Just plug in your AI agents and start using Vinkius.

Azure Synapse Analytics MCP Server lets you manage complex data workflows and audit your entire Synapse environment using natural conversation.

You can list and inspect Spark pools, trace data movement via integration pipelines, and check the status of linked services and datasets without opening the Azure portal.

It gives you full control over your enterprise analytics data infrastructure.

What your AI agents can do

Get pipeline

Gets the precise definition and parameters of one specific Azure Synapse pipeline.

List datasets

Lists every explicitly defined dataset target in your Azure Synapse workspace.

List linked services

Lists all external connections (Linked Services) used by your Synapse resources.

+ 4 more capabilities included

Audit Compute Pools

Lists all provisioned compute infrastructure, including dedicated SQL pools, serverless SQL pools, and Apache Spark analytic clusters.

Inspect Data Pipelines

Retrieves the detailed definition of a specific Synapse data integration pipeline.

List Data Assets

Retrieves a list of all defined, explicit datasets within the Synapse workspace.

Trace Data Flow

Lists all data integration pipelines, allowing you to survey the entire set of ETL/ELT workflows.

Map External Connections

Lists all linked services, identifying external dependencies like Key Vaults or Blob Storages.

Browse Spark Notebooks

Lists all global Apache Spark analytics notebooks stored in the workspace.

Check SQL Pool Status

Lists all dedicated and serverless SQL analytics pools available in Synapse.

Ask AI about this MCP

Ask ChatGPT

Ask Claude

Ask Perplexity

Supported MCP Clients

Claude

ChatGPT

Cursor

Gemini

Windsurf

VS Code

JetBrains

Vercel

+ other MCP clients

Free for Subscribers

Waiting for input…

AI Agent

Azure Synapse Analytics MCP Server: 7 Tools for Data Governance

Use these tools to inspect Synapse components, including data pipelines, compute pools, datasets, and external linked services.

get019d7557

get pipeline

Gets the precise definition and parameters of one specific Azure Synapse pipeline.

list019d7557

list datasets

Lists every explicitly defined dataset target in your Azure Synapse workspace.

list019d7557

list linked services

Lists all external connections (Linked Services) used by your Synapse resources.

list019d7557

list notebooks

Lists every Apache Spark notebook stored in the workspace.

list019d7557

list pipelines

Lists all the data integration pipelines that exist in your Synapse workspace.

list019d7557

list spark pools

Lists all pre-provisioned Apache Spark analytic compute pools.

list019d7557

list sql pools

Lists all dedicated and serverless SQL analytics compute pools.

Choose How to Get Started

Build a custom MCP for your own tools, or connect a ready-made integration from our catalog.

Build Your Own

Turn any API into an MCP. Import a spec, define Agent Skills, or deploy with MCPFusion.

Import from OpenAPI, Swagger, or YAML specs
Create Agent Skills with progressive disclosure
Deploy to edge with MCPFusion framework
Built in DLP, auth, and compliance on every call
Real time usage dashboard and cost metering
Publish to catalog or keep private

Start building

Make Your AI Do More

Start with Azure Synapse Analytics, then connect any of our 4,700+ other servers whenever your AI needs more. One click, no limits.

Use this MCP plus 4,700+ others, all in one place
Add new capabilities to your AI anytime you want
Every connection is secured and compliant automatically
Track usage and costs across all your servers
Works with Claude, ChatGPT, Cursor, and more
New servers added to the catalog every week

What you can do with this MCP connector

You wanna manage your whole Synapse setup and audit all your data pipelines just by talking to your AI client. This server lets you poke around your entire analytics environment without ever touching the Azure portal. You get full control over your data infrastructure, straight from your IDE.

Audit Compute Pools

It lists all the compute infrastructure you've got running, including all the dedicated and serverless SQL pools, plus every Apache Spark analytic cluster. You can also check the status of those SQL pools by listing all dedicated and serverless SQL analytics pools available in Synapse.

Trace Data Flow

It lists all the data integration pipelines in your Synapse workspace, letting you survey every single ETL or ELT workflow you've built. You can also get the precise definition and parameters for any specific Azure Synapse pipeline.

List Data Assets

It pulls a list of every defined, explicit dataset target in your Synapse workspace.

Map External Connections

It lists all the external connections—the Linked Services—used by your Synapse resources, showing you dependencies like Key Vaults or Blob Storages.

Browse Spark Notebooks

It lists every Apache Spark notebook stored in your workspace.

Here's the deal with the compute side: You can list all pre-provisioned Apache Spark analytic compute pools and also list all dedicated and serverless SQL analytics compute pools. You can also check the status of all those SQL pools.

To see what data's moving, you can list all the data integration pipelines, and for any one, you can get its full definition and parameters. To check your data storage, it lists all defined datasets. You can also see every external connection used, and it'll list every Spark notebook stored in the workspace.

How Azure Synapse Analytics MCP Works

1 Subscribe to the server and provide your Azure Synapse Workspace URL and an active Access Token.
2 Ask your AI client to perform a specific action, like listing all pipelines or checking a linked service's health.
3 The server executes the query and returns structured data, letting your agent read and summarize the results.

The bottom line is you get a conversational layer over complex Azure data governance commands.

Who Is Azure Synapse Analytics MCP For?

This is for the Data Engineer who gets paid to know exactly how data moves. You need visibility into every connection, every pool, and every step of an ETL process without having to click through the Azure portal. It’s for the Ops Engineer who has to answer, 'Why is the billing spiking?' on demand.

Data Engineer

Uses list_pipelines and get_pipeline to trace failed ETL jobs and debug complex data movement logic.

Data Scientist

Runs list_notebooks and list_datasets to quickly survey available code and check data scope before starting analysis.

Cloud Operations Engineer

Checks list_sql_pools and list_spark_pools to audit compute resource usage and answer scaling questions on-demand.

What Changes When You Connect

See the full lineage of data connections. Instead of manually clicking through service dependencies, running list_linked_services instantly maps out all external mappings (Blob Storage, Key Vault) for a single view.
Pinpoint pipeline failures fast. Use get_pipeline to retrieve a job's full definition. You see the exact logical steps and target parameters that caused the failure, eliminating manual portal investigation.
Audit all compute capacity. Running list_sql_pools and list_spark_pools lets you compare dedicated vs. serverless capacity immediately, giving you a clear picture of your current billing limits.
Survey available code assets. list_notebooks shows every Spark notebook in the workspace. You can quickly scope the analysis by seeing who has created code and where it's stored.
Map all data sources. list_datasets gives you a clear inventory of every dataset defined. This is essential for knowing the exact scope of data available before writing any transformation logic.
Understand the entire workflow. list_pipelines provides a master list of every data integration pipeline. You can then drill down with get_pipeline to understand the full sequence of data movement.

Real-World Use Cases

Investigating a Broken ETL Job

The nightly customer sync fails. Instead of logging into the Azure portal and clicking through, you tell your agent to run get_pipeline for the failed job. The agent pulls the full definition, showing exactly which step failed and why, letting you fix it right from your chat window.

Checking Compute Resource Limits

You need to know if you're hitting your SQL capacity. You run list_sql_pools and list_spark_pools. The agent lists both dedicated and serverless pools, letting you compare usage and identify potential scaling bottlenecks before the billing alarm goes off.

Mapping Data Dependencies

A new service needs to write data. You ask the agent to run list_linked_services. It pulls the list of all connected services, showing you if the required Key Vault or Blob Storage is already mapped and accessible.

Discovering Data Assets

You start a new project and need to know what data exists. You ask the agent to run list_datasets and list_notebooks. It gives you an immediate inventory of all structured data targets and all available Spark code, accelerating your discovery phase.

The Tradeoffs

Trying to view all resources at once

You try to manually run list_datasets, list_linked_services, list_pipelines, and list_sql_pools in sequence. This creates a giant wall of JSON output that is impossible to parse and doesn't tell you the relationship between the assets.

→ Ask your agent to cross-reference the tools. Instead of just listing, ask: 'Show me all pipelines that use a linked service pointing to the 'KeyVault'. The agent uses the underlying logic to synthesize this connectivity, which is much faster than manual cross-referencing.

Assuming a simple list is enough

You see a list of 50 pipelines from list_pipelines and assume they are all active. You waste time manually checking the status of each one, only to find the necessary one is misconfigured.

→ Use get_pipeline on the specific pipeline name. This gives you the detailed, single-source-of-truth definition, including its current status, which is better than just a list.

Ignoring compute types

You see a high usage count and assume it's a data problem. You fail to check the compute layer, wasting time optimizing the wrong component.

→ Run list_sql_pools and list_spark_pools together. This separates the issue: are you running out of dedicated SQL capacity, or do you need to scale up your Spark cluster?

When It Fits, When It Doesn't

Use this if you need to govern, audit, or debug the operational state of your Synapse data environment. This includes tracking down why a pipeline failed, confirming where data is stored, or auditing which external resources are attached.

Don't use it if you are just looking for basic documentation or definitions. If you only need to know the general concept of a Spark notebook, you can read the documentation. If you need to see the 12 specific notebooks in your workspace and their metadata, use list_notebooks.

If you only care about the compute resources (billing, scaling), stick to list_sql_pools and list_spark_pools. If you need to see the data movement (ETL/ELT), use the pipeline tools (list_pipelines, get_pipeline).

Independent Platform Disclaimer: Vinkius is an independent platform and is not affiliated with, endorsed by, sponsored by, verified by, or otherwise authorized by Azure Synapse Analytics. All third-party trademarks, logos, and brand names are the property of their respective owners. Their use on this website is strictly for informational purposes to identify service compatibility and interoperability.

VINKIUS INFRASTRUCTURE

Cloud Hosted

Managed infra

V8 Isolated

Sandboxed per request

Zero-Trust Proxy

No stored credentials

DLP Enforced

Policy on every call

GDPR Compliant

EU data residency

Token Compression

~60% cost reduction

How we secure it →

Works with Claude, ChatGPT, Cursor, and more

The Model Context Protocol standardizes how applications expose capabilities to LLMs. Instead of operating in isolation, your AI gains direct access to external platforms, live data, and real-world actions through secure, standardized connections.

This server provides 7 capabilities that interface natively with Claude, ChatGPT, Cursor, and any MCP client. No middleware. No custom integration required.

Available Capabilities

get_pipeline list_datasets list_linked_services list_notebooks list_pipelines list_spark_pools list_sql_pools

Debugging data pipelines shouldn't require switching between five different Azure portals.

Today, finding a pipeline failure means hopping between the Synapse portal, the Key Vault to check credentials, and the resource group dashboard to check compute limits. You copy IDs, you switch tabs, and you spend 30 minutes just collecting the raw data points before you can even start debugging.

With the Azure Synapse Analytics MCP Server, you keep it all in your chat client. Your agent runs `get_pipeline` and pulls the entire job definition. It also checks `list_linked_services` for credential status. You get the full story, instantly.

Azure Synapse Analytics MCP Server: Get full visibility into your data pools.

You used to run separate commands to check the dedicated SQL pools and then run another set of commands to check the Spark pools. This process was fragmented, making it hard to compare resource types and forecast costs.

Now, simply asking the agent to check pools runs `list_sql_pools` and `list_spark_pools` together. You get a single, unified status report that lets you see compute scale thresholds and billing architecture queries on-demand. It's done.

Common Questions About Azure Synapse Analytics MCP

How do I check the status of my data pipelines using list_pipelines? +

You first use list_pipelines to get a manifest of all available pipelines. Then, if you suspect a specific job, you run get_pipeline with the name to pull its full definition and check its latest run status.

What is the difference between list_datasets and list_linked_services? +

Datasets are the data structures themselves (the tables/views). Linked Services are the connections to those data structures (e.g., the connection string to the Blob Storage where the data lives).

Can I see all my Spark notebooks with list_notebooks? +

Yes, list_notebooks gives you a list of all Spark notebooks. This helps you see which code assets exist and who last modified them, letting you quickly find the right starting point for your analysis.

How do I audit all my compute pools? +

You run list_sql_pools and list_spark_pools. These two tools cover the dedicated/serverless SQL capacity and the dedicated Spark compute, giving you a full picture of your compute capacity.

How do I check the type of compute pools using list_sql_pools? +

list_sql_pools lists both dedicated and serverless SQL analytics pools. This lets you immediately see if a pool is designed for high-performance, dedicated workloads or for ad-hoc, serverless queries.

What information does get_pipeline provide about a single data flow? +

get_pipeline returns the full, precise definition of the specified Azure Synapse pipeline. You can use this definition to trace every step, including activity target parameters and logical flow.

Can I view external dependencies using list_linked_services? +

list_linked_services lists all explicit external connections. It shows dependencies referencing services like Key Vaults or Blob Storages, helping you audit the entire data ecosystem's security perimeter.

How do I find specific Spark notebooks using list_notebooks? +

list_notebooks retrieves a list of all stored Spark notebooks. You can then specify notebook metadata or IDs to pinpoint exactly which analysis script you need to review.

Can I audit completely isolated Spark clusters running heavy tasks? +

Yes. Ask the agent to pull your active or paused list_spark_pools. It exposes pre-provisioned engine nodes along with boundaries explicitly defining core-node dimensions for analytic scaling operations.

Can my AI pinpoint the exact failing task inside an ETL pipeline? +

Using the specific get_pipeline action equipped with the exact target name, the agent unfolds the raw underlying JSON orchestration topology. You can trace its target execution mappings, identifying incorrectly bounded inputs right on your prompt window.

Is it possible to diagnose broken connectivity to external Azure services? +

Yes! The agent can invoke the built-in Linked Services inspector. This extracts all distinct dependencies attached to your current scope, such as mapping paths to a lost Key Vault or orphaned CosmosDB. Finding blind spots has never been faster.

Use it with your favorite AI tools

Connect this server to Cursor, Claude, VS Code, and more.

OpenAI Agents SDK sdk-python

Google ADK sdk-python

Pydantic AI sdk-python

Vercel AI SDK sdk-typescript