data.world MCP. Find, audit, and map your entire data portfolio via chat.

Q: Can I see all the projects I own using listmyprojects?

Yes. Running listmyprojects retrieves summaries of all projects you own, including their creation date and which datasets are linked.

Q: What is the difference between listmydatasets and searchcatalog?

listmydatasets only shows datasets you personally own. searchcatalog searches across the entire platform, finding relevant data even if you don't own it.

Q: Does listprojectinsights show live data?

No, listprojectinsights shows documented findings and data visualizations that were published to the project workspace. It shows what was agreed upon, not what's changing minute by minute.

Q: How do I check my account permissions using the getmyprofile tool?

The getmyprofile tool retrieves your account metadata, including your username and the permissions you hold. This lets you confirm your access level before running sensitive data commands.

Q: If I need to find a dataset, should I use searchcatalog or listmydatasets?

searchcatalog searches the entire data.world index, letting you find anything by title, tags, or description. Use listmydatasets only when you need to see a list of datasets you personally own.

Q: How can I see my recent work using listrecentactivity?

The listrecentactivity tool streams an activity log showing everything that happened in your account. You can track dataset updates, project contributions, or new collections added.

Claude

ChatGPT

Cursor

Gemini

Windsurf

VS Code

JetBrains

Vercel

See Vinkius in Action

Works with every AI agent you already use

…and any MCP-compatible client

Just plug in your AI agents and start using Vinkius.

data.world: Connect your AI agent to the data.world API to manage data assets, projects, and queries. Use the agent to search the catalog, list your datasets, or retrieve detailed project metadata.

It lets you find datasets, monitor project status, and access saved SQL/SPARQL queries directly through conversation, making data governance actionable.

What your AI agents can do

Get dataset details

Gets detailed information for a specific dataset, including tags, file listings, and license info.

Get my profile

Retrieves metadata for the authenticated user, showing your username and account permissions.

Get project details

Gets detailed information for a specific project, including its members and current status.

+ 7 more capabilities included

Search the data catalog

Your agent searches data.world's index across titles, descriptions, and tags to return a ranked list of matching datasets and projects.

Get dataset specifics

The agent retrieves detailed metadata for a dataset, including its file listings, tags, and license information.

List and monitor projects

Your agent lists all projects you manage and retrieves detailed information about members, objectives, and linked resources for any specific project.

Track saved queries and insights

The agent lists all saved SQL or SPARQL queries tied to a dataset, or retrieves published findings and visualizations from a project.

View user and asset metadata

Your agent retrieves your profile attributes or lists all datasets and collections you own on the platform.

Check recent activity

The agent pulls a chronological stream of activity logs covering dataset updates, project changes, and new collection entries.

Ask AI about this MCP

Ask ChatGPT

Ask Claude

Ask Perplexity

Supported MCP Clients

Claude

ChatGPT

Cursor

Gemini

Windsurf

VS Code

JetBrains

Vercel

+ other MCP clients

Free for Subscribers

Waiting for input…

AI Agent

data.world MCP Server: 10 Tools for Data Governance

These tools let your agent interact with the data.world API to search, list, and retrieve deep metadata for datasets, projects, and saved queries.

get019d7582

get dataset details

Gets detailed information for a specific dataset, including tags, file listings, and license info.

get019d7582

get my profile

Retrieves metadata for the authenticated user, showing your username and account permissions.

get019d7582

get project details

Gets detailed information for a specific project, including its members and current status.

list019d7582

list dataset queries

Lists all saved SQL or SPARQL queries associated with a specific dataset.

list019d7582

list my collections

Lists all data collections you own or manage on the platform.

list019d7582

list my datasets

Lists all datasets you own on data.world, showing their titles and last modification dates.

list019d7582

list my projects

Lists all projects you own on data.world, including their creation date and linked datasets.

list019d7582

list project insights

Lists documented findings and data visualizations attached to a specific project workspace.

list019d7582

list recent activity

Lists a stream of recent activity logs, tracking dataset updates and project contributions.

search019d7582

search catalog

Searches across the entire data.world catalog for datasets and projects using full-text search.

Choose How to Get Started

Build a custom MCP for your own tools, or connect a ready-made integration from our catalog.

Build Your Own

Turn any API into an MCP. Import a spec, define Agent Skills, or deploy with MCPFusion.

Import from OpenAPI, Swagger, or YAML specs
Create Agent Skills with progressive disclosure
Deploy to edge with MCPFusion framework
Built in DLP, auth, and compliance on every call
Real time usage dashboard and cost metering
Publish to catalog or keep private

Start building

Make Your AI Do More

Start with data.world, then connect any of our 4,700+ other servers whenever your AI needs more. One click, no limits.

Use this MCP plus 4,700+ others, all in one place
Add new capabilities to your AI anytime you want
Every connection is secured and compliant automatically
Track usage and costs across all your servers
Works with Claude, ChatGPT, Cursor, and more
New servers added to the catalog every week

What you can do with this MCP connector

data.world: Manage Data Assets & Queries

Your agent lets you handle data discovery and asset management right where you are. You don't gotta jump between tabs or tools to check on your data. It hooks up your AI client to the data.world API, giving your agent the power to search the catalog, list your datasets, and pull project details—all through natural conversation.

Searching the Catalog

Your agent searches the whole data.world index. You just need to tell it what you're looking for, and it returns a ranked list of matching datasets and projects based on titles, descriptions, and tags. You can find what you need without digging through endless folders.

Dataset Specifics

When you zero in on a dataset, your agent pulls the full metadata. You'll get the file listings, tags, and license info for that specific dataset. You can also list all the datasets you own using list_my_datasets and check out every collection you manage with list_my_collections.

Project Management

Your agent lists all the projects you own using list_my_projects, showing their creation date and what datasets they're linked to. For any specific project, you can get detailed info about who's on it and its current status using get_project_details. You can also check out all the documented findings and data visualizations attached to a project workspace via list_project_insights.

Queries and Activity

It tracks saved queries and project activity. If you need to see what SQL or SPARQL queries are saved for a dataset, list_dataset_queries shows you those. You can also pull a stream of recent activity logs with list_recent_activity, tracking everything from dataset updates to new project contributions.

User and Asset Metadata

Your agent checks your profile attributes using get_my_profile and lists all the datasets you own using list_my_datasets.

How data.world MCP Works

1 Connect the data.world integration to your AI client and authenticate with your API Token.
2 Tell your agent the data asset you need (e.g., 'Find datasets about climate change').
3 Your agent runs the necessary tools (search_catalog or list_my_datasets) and summarizes the findings for you.

The bottom line is that your agent handles the entire data retrieval process, letting you interact with the catalog using natural language instead of complex API calls.

Who Is data.world MCP For?

Data Stewards, Data Scientists, and Knowledge Managers. If your job involves knowing what data exists, who owns it, or how it's used, this is for you. Stop spending hours clicking through dashboards just to find the correct dataset or project owner.

Data Scientist

Uses the agent to find relevant datasets and pull query definitions for research, skipping manual metadata checks.

Data Steward

Monitors project status and audits data assets across the platform using the agent, checking list_my_datasets and get_project_details.

Knowledge Manager

Explores organizational collections and insights across the enterprise using the agent, documenting data plans with list_my_collections.

What Changes When You Connect

Discover what you need instantly. Instead of manually browsing categories, use search_catalog to find datasets or projects by title, description, or tag. It cuts straight to the results.
Know who owns what. Use list_my_datasets and get_project_details to see which team owns a dataset or project. No more guessing who to email for access.
Audit complex queries. If you're starting a project, use list_dataset_queries to see every saved SQL or SPARQL query for that dataset. You don't have to guess what was done before.
Keep track of changes. list_recent_activity provides a live feed of activity—dataset updates, project contributions, and new collections—so you never miss a governance alert.
See project status at a glance. Use list_project_insights to pull documented findings and visualizations for a project, giving you a high-level status without opening the workspace.
Understand the landscape. list_my_collections lets you see all curated data groups you manage, keeping your data plans organized.

Real-World Use Cases

Need to verify data lineage for a new model

A data scientist is building a new model. Instead of tracking down the source dataset, they ask their agent to run search_catalog for 'customer behavior'. The agent finds the dataset, and the scientist immediately uses get_dataset_details to verify the file listings and tags, confirming the data source is compliant before writing a single line of code.

Figuring out who built the last sales report

A business analyst needs to know the source of a key metric. They ask the agent to check the project 'Q3 Sales Analysis'. The agent calls list_project_insights and presents the documented findings, showing who published the insight and what the core assumptions were.

Quickly listing all owned data sources

A data steward needs a full inventory. They ask the agent to list all datasets and projects. The agent runs list_my_datasets and list_my_projects back-to-back, providing a clean, consolidated list of everything they manage.

Reviewing data access rights for a team

A knowledge manager needs to audit access. They ask the agent to retrieve the profile information via get_my_profile and then list all associated collections using list_my_collections to ensure the team has the right visibility over critical data groups.

Investigating a data quality issue

A data engineer notices weird data. They ask the agent to run list_recent_activity to see who modified the dataset last. The agent provides the activity log, showing the timestamp and the user who made the change, allowing the engineer to immediately contact the right person.

The Tradeoffs

Relying on manual search and clicking

A user manually searches the data.world UI, filters by owner, clicks into the dataset, then navigates to the queries tab, and finally copies the name of the project insight. This takes 5-7 minutes and is prone to human error.

→ Just ask your agent. Say: 'Show me the details and all associated queries for the 'Customer Data' dataset.' Your agent runs get_dataset_details and list_dataset_queries in one go, giving you all the data you need instantly.

Ignoring project context

A user finds a dataset and assumes it's ready for use. They use the data without first checking if it's linked to an official project or if published insights exist.

→ Always check the context first. Run list_my_projects to see if the dataset belongs to an official project, and then use list_project_insights to see documented findings before using the data.

Treating metadata as static

A user assumes the dataset hasn't changed since last week. They run an old query and get unexpected results, but they don't know why.

→ Check the activity log first. Run list_recent_activity to see when the dataset was last updated. If the change is critical, check get_dataset_details for the latest tags.

When It Fits, When It Doesn't

Use this if you need a single pane of glass view of your entire data landscape. If your job requires you to answer questions like, 'Where is the most current, approved data on X?' or 'Who is responsible for monitoring Project Y?', this server is essential. It lets your agent perform complex cross-references—like pairing a dataset from list_my_datasets with a project from list_my_projects—without you having to manually jump between tabs.

Don't use this if you only need to know your own basic profile info, or if you are sure you only need to run one single, specific query. In those cases, a simple client-side data retrieval tool is fine. This is for governance and discovery, not simple execution.

Independent Platform Disclaimer: Vinkius is an independent platform and is not affiliated with, endorsed by, sponsored by, verified by, or otherwise authorized by data.world. All third-party trademarks, logos, and brand names are the property of their respective owners. Their use on this website is strictly for informational purposes to identify service compatibility and interoperability.

VINKIUS INFRASTRUCTURE

Cloud Hosted

Managed infra

V8 Isolated

Sandboxed per request

Zero-Trust Proxy

No stored credentials

DLP Enforced

Policy on every call

GDPR Compliant

EU data residency

Token Compression

~60% cost reduction

How we secure it →

Works with Claude, ChatGPT, Cursor, and more

The Model Context Protocol standardizes how applications expose capabilities to LLMs. Instead of operating in isolation, your AI gains direct access to external platforms, live data, and real-world actions through secure, standardized connections.

This server provides 10 capabilities that interface natively with Claude, ChatGPT, Cursor, and any MCP client. No middleware. No custom integration required.

Available Capabilities

get_dataset_details get_my_profile get_project_details list_dataset_queries list_my_collections list_my_datasets list_my_projects list_project_insights list_recent_activity search_catalog

Sifting through data catalogs shouldn't require five different tabs.

Today, finding a usable dataset means clicking into the catalog, searching by keyword, then clicking 'datasets,' filtering by date, then clicking 'owner,' and finally finding the right project folder. You spend more time navigating the interface than you do actually analyzing the data.

With the data.world MCP Server, your agent handles all that clicking. You just ask: 'Show me all climate datasets owned by the Research Team.' Your agent runs `search_catalog` and returns the precise list, eliminating the manual, multi-step process.

data.world MCP Server: Access everything you need with `list_dataset_queries`.

Before running a query, you usually have to manually navigate to the project, find the associated dataset, and then look for the saved queries tab. If the query isn't saved, you're stuck, or you have to guess which version is correct.

Now, you ask your agent to 'List all saved queries for the Sales dataset.' The agent runs `list_dataset_queries` and gives you the full list and metadata right away. It’s simple. It just works.

Common Questions About data.world MCP

How do I use the `search_catalog` tool with data.world? +

You just tell your agent what you're looking for. The agent interprets your request and runs search_catalog against the entire data.world index. It returns a ranked list of matching datasets and projects.

Can I see all the projects I own using `list_my_projects`? +

Yes. Running list_my_projects retrieves summaries of all projects you own, including their creation date and which datasets are linked.

What is the difference between `list_my_datasets` and `search_catalog`? +

list_my_datasets only shows datasets you personally own. search_catalog searches across the entire platform, finding relevant data even if you don't own it.

How does `get_dataset_details` help me? +

It gives you the deep metadata you need, including file listings, tags, and license info, which is critical for knowing if the data can actually be used for your project.

Does `list_project_insights` show live data? +

No, list_project_insights shows documented findings and data visualizations that were published to the project workspace. It shows what was agreed upon, not what's changing minute by minute.

How do I check my account permissions using the `get_my_profile` tool? +

The get_my_profile tool retrieves your account metadata, including your username and the permissions you hold. This lets you confirm your access level before running sensitive data commands.

If I need to find a dataset, should I use `search_catalog` or `list_my_datasets`? +

search_catalog searches the entire data.world index, letting you find anything by title, tags, or description. Use list_my_datasets only when you need to see a list of datasets you personally own.

How can I see my recent work using `list_recent_activity`? +

The list_recent_activity tool streams an activity log showing everything that happened in your account. You can track dataset updates, project contributions, or new collections added.

How do I get a data.world API Token? +

Log in to data.world, go to your user settings, and navigate to the Advanced > API Tokens section to generate a new token.

Can the agent run SQL queries? +

This integration currently focuses on discovering assets and retrieving saved query definitions. Executing arbitrary SQL queries is managed via the data.world interface or specialized SDKs.

Does the integration support private datasets? +

Yes, as long as the provided API Token has the necessary permissions, the agent can list and retrieve information for any dataset you have access to, including private ones.

Use it with your favorite AI tools

Connect this server to Cursor, Claude, VS Code, and more.

OpenAI Agents SDK sdk-python

Google ADK sdk-python

Pydantic AI sdk-python

Vercel AI SDK sdk-typescript