Data.gov Catalog MCP for AI. Query US Government Data with Precision.

Q: How do I use getlocationgeometry with searchdatasets?

You first run getlocationgeometry using a location ID to pull the specific boundary coordinates. You then pass those exact boundaries into your query when calling searchdatasets. This limits results perfectly.

Q: Can I use getkeywords to find a specific type of dataset?

No, getkeywords only tells you which topics are popular and how many datasets mention them. To actually find those datasets, you must run the results through searchdatasets.

Q: How do I list all available government agencies?

Use getorganizations. It returns a complete list of every publishing organization that contributes data to the catalog. This is your starting point for scoping research.

Claude

ChatGPT

Cursor

Gemini

Windsurf

VS Code

JetBrains

Vercel

See Vinkius in Action

Works with every AI agent you already use

…and any MCP-compatible client

Connect to your AI in seconds.

Data.gov Catalog MCP connects your AI agent directly to the official US Government open data catalog. You can search thousands of datasets, track publishing organizations, and map precise geographic boundaries from agencies like NASA or NOAA using a single query.

This bypasses manual website browsing entirely.

What your AI can do

Get location geometry

Returns the precise GeoJSON boundary coordinates for a given location identifier.

Get harvest record raw

Retrieves the original, unmodified source data payload for inspection.

Get harvest record

Pulls metadata detailing how a specific dataset was originally added to the catalog.

+ 5 more capabilities included

Search and filter datasets

Find specific public datasets by using keywords, organization names, and defined filters.

Identify publishing organizations

Get a complete list of all government agencies that publish data to the catalog.

Map geographic boundaries

Retrieve precise GeoJSON coordinates for any known location ID, allowing you to filter other datasets spatially.

Inspect dataset metadata

View the original source data payload and the processed, structured version of a record.

Analyze data trends

Check which keywords are most common across the entire catalog and how many datasets use them.

Ask an AI about this

Included with Plan

Waiting for input…

AI Agent

Data.gov Catalog: 8 Available Tools

These eight tools give you granular control over finding, inspecting, and mapping every piece of open data available in the US government catalog.

Make your AI actually useful.

Add this MCP to Claude, Cursor, or Windsurf and your AI stops guessing. It gets real tools to look things up, take action, and handle the stuff you keep doing by hand.

Start using Data.gov Catalog on Vinkius

Get Location Geometry

Returns the precise GeoJSON boundary coordinates for a given location identifier.

Get Harvest Record Raw

Retrieves the original, unmodified source data payload for inspection.

Get Harvest Record

Pulls metadata detailing how a specific dataset was originally added to the catalog.

Get Harvest Record Transformed

Gets a cleaned-up version of the record in a standardized format ready for use.

Get Keywords

Lists popular keywords and counts how many datasets reference each term across the...

Get Organizations

Provides a complete list of every publishing organization in the catalog.

Search Locations

Suggests location names and IDs that can be used to accurately narrow down a search area.

Search Datasets

Searches the entire government data catalog using specific keywords, filters, and...

Security and governance baked right in.

Pick your AI client below to get set up. Just create a Vinkius account, subscribe, and you're instantly up and running. We handle the entire backend infrastructure, delivering out-of-the-box support for HTTPS Streamable, SSE, and OAuth2—zero messy routing required.

Claude AI

Open Claude Settings

Go to claude.ai, click your profile icon, then navigate to Customize → Connectors.

Add Custom Connector

Click the "+" button and select Add custom connector. Paste your Vinkius endpoint URL:

https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp

Replace [YOUR_TOKEN_HERE] with your token from cloud.vinkius.com. For OAuth-protected servers, expand Advanced settings to add credentials.

Start a conversation

Open a new chat. The Data.gov Catalog integration is available immediately — no restart needed.

Antigravity

Configure Agent Environment

Open your Antigravity agent's workspace configuration or mcp-servers.json file.

Bind the Endpoint

Add the Vinkius endpoint URL to your agent's MCP connections list:

"mcp_servers": {
  "datagov-catalog": {
    "serverUrl": "https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp"
  }
}

Provide your secure token in place of [YOUR_TOKEN_HERE] to ensure your agent requests are authenticated.

Execute

Start your Antigravity session. The agent will autonomously discover and utilize the Data.gov Catalog tools with full Vinkius guardrails applied.

VS Code Copilot

⚡

One-Click Install (Recommended)

In your Vinkius Dashboard, simply click the Add to VS Code button for this server. We'll automatically configure your local workspace.

Or configure manually

Open MCP Settings

Open VS Code, press Ctrl/Cmd + Shift + P, and search for GitHub Copilot: MCP Servers.

Add Server Config

Add the Vinkius endpoint configuration to your mcp-servers.json file:

"datagov-catalog": {
  "url": "https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp"
}

Ensure you replace [YOUR_TOKEN_HERE] with your token from cloud.vinkius.com.

LangChain

Install Dependencies

Install the LangChain MCP adapters for your environment:

pip install langchain-mcp-adapters

Connect the Server

Use the SSEClient in LangChain to connect to the Vinkius managed endpoint:

from langchain_mcp_adapters.client import SSEClient

# Connect to Vinkius
client = SSEClient(url="https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp")
tools = client.get_tools()

CrewAI

Define the Tool

Load the Vinkius MCP tools into your CrewAI agents:

from crewai import Agent
from mcp_crewai import MCPTool

# Connect securely to Vinkius
vinkius_tools = MCPTool(url="https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp")

# Assign to Agent
researcher = Agent(
    role='Data Researcher',
    tools=vinkius_tools.get_all()
)

Execute Task

Run your CrewAI process. The agent will autonomously route tasks to the Vinkius managed server.

Choose How to Get Started

Build a custom MCP for your own tools, or connect a ready-made integration from our catalog.

Build Your Own

Turn any API into an MCP. Import a spec, define Agent Skills, or deploy with MCPFusion.

Import from OpenAPI, Swagger, or YAML specs
Create Agent Skills with progressive disclosure
Deploy to edge with MCPFusion framework
Built in DLP, auth, and compliance on every call
Real time usage dashboard and cost metering
Publish to catalog or keep private

Start building

Make Your AI Do More

Start with Data.gov Catalog, then connect any of our 5,100+ other servers whenever your AI needs more. One click, no limits.

Use this MCP plus 5,100+ others, all in one place
Add new capabilities to your AI anytime you want
Every connection is secured and compliant automatically
Track usage and costs across all your servers
Works with Claude, ChatGPT, Cursor, and more
New servers added to the catalog every week

Independent Platform Disclaimer: Vinkius is an independent platform and is not affiliated with, endorsed by, sponsored by, verified by, or otherwise authorized by Data.gov. All third-party trademarks, logos, and brand names are the property of their respective owners. Their use on this website is strictly for informational purposes to identify service compatibility and interoperability.

VINKIUS INFRASTRUCTURE

Cloud Hosted

Managed infra

V8 Isolated

Sandboxed per request

Zero-Trust Proxy

No stored credentials

DLP Enforced

Policy on every call

GDPR Compliant

EU data residency

Token Compression

~60% cost reduction

Your data is protected. See how we built it.

Works with Claude, ChatGPT, Cursor, and more

The Model Context Protocol standardizes how applications expose capabilities to LLMs. Instead of operating in isolation, your AI gains direct access to external platforms, live data, and real-world actions through secure, standardized connections.

This connection provides 8 powerful capabilities that interface natively with Claude, ChatGPT, Cursor, and other compatible AI platforms. No middleware. No custom integration required.

Getting data requires jumping through hoops.

Today, finding a specific dataset means opening the Data.gov portal, figuring out which agency has the right info, clicking through multiple filtering layers (like 'State' -> 'City' -> 'Water Quality'), and then downloading CSVs that may not even match what you needed.

With this MCP, the process stops being manual clicks. You just tell your agent exactly what you need—say, all datasets on water quality for a specific city—and it handles the complex chain of searches, filtering, and geometry retrieval automatically.

Get Location Geometry Data

Manually defining search areas is tedious. You'd have to find a location ID, then manually look up its boundary coordinates in a separate GIS tool, and finally paste those complex GeoJSON boundaries into your primary query.

Now, you just ask for the geometry using this MCP. It retrieves the precise coordinate set instantly, letting you immediately pass that structured data into other tools like `search_datasets` without leaving the chat.

Support 24/7 support@vinkius.com ↗

Security Vinkius Trust Center ↗

SLA Service Level Agreement ↗

Report Listing Send Report ↗

What your AI can actually do with this

Need specific government data? This connection lets you talk to the Data.gov Catalog, giving your AI agent access to a massive repository of federal data. You don't have to navigate dozens of agency websites; you just ask for what you need—whether it’s a list of all publishing agencies or datasets related to climate modeling.

For example, if you are building a mapping tool, you can first find the precise geographic boundary for a city using its location ID. Then, you can use that geometry to filter down thousands of available records to only show data relevant to that area. You'll also get deep insight into how any dataset was created by inspecting both the original source files and the cleaned-up versions.

When you connect this MCP through Vinkius, your agent treats the entire government data ecosystem as one searchable pool. This means whether you need a simple keyword count or complex spatial filtering, it's all available right in your chat window.

Built · Hosted · Managed by Vinkius Data.gov Catalog MCP - Search Open Government Data

Server ID 019e3885-580b-73e1-9201-ff0571b05b26

Vinkius Inspector

Compliance Grade A+

Score 98.33/100

Report View Report ↗

What Changes When You Connect

Find relevant data without sifting through thousands of links. Use the search functionality to pull datasets based on keywords or specific organization filters.

Pinpoint exact areas using geometry. You can run a location ID through get_location_geometry and immediately use that boundary to filter your searches, making results hyper-local.

Understand data lineage. Don't just take the metadata; check the raw source payload or the transformed version of a record to verify data integrity.

Build knowledge maps quickly. Run get_organizations to get an exhaustive list of agencies, letting you target your research scope immediately.

Spot trends in government focus. Use get_keywords to see which topics are generating the most open data records right now.

See it in action

01 01

Mapping a specific environmental issue

A researcher needs all water quality reports for Chicago, Illinois. They first run search_locations to get the location ID, then use get_location_geometry with that ID. Finally, they pass the resulting GeoJSON boundary into search_datasets to filter only relevant datasets.

02 02

Auditing data source reliability

A developer needs to know if a dataset's metadata is complete. They find a promising record and use get_harvest_record_raw to check the original, unmodified payload before building their application logic.

03 03

Comparing federal focus areas

A policy analyst wants to know if climate change is more frequently discussed than economic development. They run get_keywords and then compare the resulting dataset counts for 'climate' versus 'economy'.

04 04

Listing all data providers

Someone building a directory of government open resources needs to know who publishes what. They simply call get_organizations to get an instant list of every contributing agency.

The honest tradeoffs

Searching by general topic only

Anti-pattern

Asking the agent, 'Tell me about data for water quality.' This yields too many results because it lacks geographic or source constraints.

The Fix

Instead, first use search_locations to get a specific boundary ID. Then, combine that with get_location_geometry before running search_datasets. That locks the search down properly.

Assuming data structure is consistent

Anti-pattern

Building an app based on only the 'transformed' record without checking the original source, which might contain critical caveats.

The Fix

Always check get_harvest_record_raw alongside get_harvest_record_transformed. The raw payload tells you what the data actually was.

Using a single keyword search

Anti-pattern

Simply searching for 'NASA' results in thousands of records, making it impossible to find the right dataset quickly.

The Fix

First use get_keywords to narrow down related terms (e.g., finding both 'climate' and 'global'). Then, run a targeted search using those specific keywords with search_datasets.

Questions you might have

How do I use get_location_geometry with search_datasets? +

You first run get_location_geometry using a location ID to pull the specific boundary coordinates. You then pass those exact boundaries into your query when calling search_datasets. This limits results perfectly.

What is the difference between get_harvest_record and get_harvest_record_raw? +

The raw record gives you the original, untouched source data payload. The standard harvest record provides metadata about how that dataset was initially ingested into the catalog.

Can I use get_keywords to find a specific type of dataset? +

No, get_keywords only tells you which topics are popular and how many datasets mention them. To actually find those datasets, you must run the results through search_datasets.

How do I list all available government agencies? +

Use get_organizations. It returns a complete list of every publishing organization that contributes data to the catalog. This is your starting point for scoping research.

When using search_datasets, what do I need regarding API keys or authentication? +

You must provide an API key if your proxy requires it. The process is simple: connect the MCP via Vinkius and supply your required credentials at the connection step. This ensures your agent can access the full US Government repository.

If I run get_location_geometry and receive an error, what does that usually mean? +

An error typically means the provided location ID is invalid or hasn't been fully indexed. Double-check the ID against the output of search_locations first. If the ID is correct, you might be hitting a temporary service limit.

What structure does get_harvest_record_transformed provide for my data? +

It returns a standardized DCAT-US payload structure. This transformed format makes it easy to parse common metadata fields like publication date and spatial bounding boxes, regardless of the original source schema.

How can I filter search_datasets using multiple criteria simultaneously? +

You combine filters directly in your query prompt. For instance, you can specify both a keyword AND an organization slug. The MCP handles prioritizing these combined parameters to narrow down results efficiently.

Can I search for datasets within a specific geographic area? +

Yes! Use search_locations to find a location ID, then get_location_geometry to get the GeoJSON. Finally, pass that to search_datasets with the spatial_geometry parameter.

How do I find datasets from a specific agency like NASA? +

Use the search_datasets tool and provide 'nasa' in the org_slug parameter. You can combine this with a search query q for more specific results.

What is the difference between raw and transformed harvest records? +

The get_harvest_record_raw tool returns the original metadata from the source agency, while get_harvest_record_transformed returns the data mapped to the standard DCAT-US schema used by Data.gov.

Connect to your AI in seconds.

Get location geometry

Get harvest record raw

Get harvest record

Data.gov Catalog: 8 Available Tools

Make your AI actually useful.

Get Location Geometry

Get Harvest Record Raw

Get Harvest Record

Get Harvest Record Transformed

Get Keywords

Get Organizations

Search Locations

Search Datasets

Security and governance baked right in.

Claude AI

Open Claude Settings

Add Custom Connector

Start a conversation

Claude Code

Open your terminal

Add the MCP Server

Start coding

Cursor

One-Click Install (Recommended)

Open Cursor Settings

Add New Server

Use in Composer

Antigravity

Configure Agent Environment

Bind the Endpoint

Execute

VS Code Copilot

One-Click Install (Recommended)

Open MCP Settings

Add Server Config

Windsurf

One-Click Install (Recommended)

Open Windsurf Settings

Add Server Endpoint

LangChain

Install Dependencies

Connect the Server

CrewAI

Define the Tool

Execute Task

Choose How to Get Started

Build Your Own

Make Your AI Do More

Works with Claude, ChatGPT, Cursor, and more

Getting data requires jumping through hoops.

Get Location Geometry Data

What your AI can actually do with this

Here's how it actually works

Who is this actually for?

What Changes When You Connect

See it in action

Mapping a specific environmental issue

Auditing data source reliability

Comparing federal focus areas

Listing all data providers

The honest tradeoffs

Searching by general topic only

Assuming data structure is consistent

Using a single keyword search

When It Fits, When It Doesn't

Questions you might have