ParseHub MCP. Control Web Scraping Runs via Chat Conversation

Q: How do I start a scrape if I want to use different pages?

You use the runprojectwithurl tool. This lets you target custom URLs while keeping all of your project's original scraping rules and template definitions intact.

Q: Can ParseHub MCP list what projects I already have?

Yes, use the listprojects tool. It shows every web scraping project you’ve set up, giving you the unique tokens needed for subsequent commands.

Q: What if my scrape job fails? Can I stop it?

You can monitor the status using getrundetails. If it's stalled or taking too long, use the cancelrun tool to safely stop the operation and free up resources.

Q: How do I get data from a run that finished yesterday?

First, you should listruns to find the specific ID. Once you have the ID for a completed job, use getrundata to pull down the structured JSON payload.

ParseHub connects advanced cloud scraping jobs directly into your AI workflow. List configured projects, dispatch headless runs, check crawler status in real time, and pull structured datasets via chat commands. Stop managing web scrapers through separate dashboards; control complex data collection right where you write.

Claude

ChatGPT

Cursor

Gemini

Windsurf

VS Code

JetBrains

Vercel

See Vinkius in Action

Give Claude and any AI agent real-world access

List configured projects

View every web scraping project saved in your account, including their unique tokens and template details.

Start a data extraction run

Tell the MCP to trigger a new headless scrape job for any specified project.

Target custom URLs

Start a scraping run that focuses on specific pages, bypassing the default starting URL for a project.

Check run status and progress

Get real-time updates on whether a scheduled scrape is queued, running, or if it has completed successfully.

Download extracted data payload

Retrieve the final structured JSON data from any completed scraping run for immediate use.

Ask an AI about this

Waiting for input…

AI Agent

What AI agents can do with ParseHub with 10 Tools

These tools let you manage the entire lifecycle of web scraping: listing projects, starting runs, tracking progress, and retrieving final, clean data payloads.

Make your AI actually useful.

Add this MCP to Claude, Cursor, or Windsurf and your AI stops guessing. It gets real tools to look things up, take action, and handle the stuff you keep doing by hand.

Start using ParseHub MCP

Cancel Run

Stops a running or queued scrape job to free up cloud resources and prevent unnecessary charges.

Delete Run

Permanently removes old scraping run history and associated data, helping you clean...

Get Project

Retrieves the full configuration details for a specific web scraping project token.

Get Run Data

Downloads the final, structured JSON payload from a run only after it has been...

Get Run Details

Checks the current status of a specific scrape job to determine if it's waiting in...

Get Last Ready Data

Immediately fetches the latest completed data for a project without needing to track individual run tokens first.

List Projects

Lists all available web scraping projects in your account, providing unique tokens and status information.

List Runs

Provides a historical record of every run for a project, useful for auditing or...

Run Project

Initiates a new scrape job using the default start URL and template configured in an...

Run Project With Url

Starts a scraping run targeting a specific, custom web address while maintaining all...

Security and governance baked right in.

Pick your AI client below to get set up. Just create a Vinkius account, subscribe, and you're instantly up and running. We handle the entire backend infrastructure, delivering out-of-the-box support for HTTPS Streamable, SSE, and OAuth2—zero messy routing required.

Claude AI

Open Claude Settings

Go to claude.ai, click your profile icon, then navigate to Customize → Connectors.

Add Custom Connector

Click the "+" button and select Add custom connector. Paste your Vinkius endpoint URL:

https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp

Replace [YOUR_TOKEN_HERE] with your token from cloud.vinkius.com. For OAuth-protected servers, expand Advanced settings to add credentials.

Start a conversation

Open a new chat. The ParseHub integration is available immediately — no restart needed.

Antigravity

Configure Agent Environment

Open your Antigravity agent's workspace configuration or mcp-servers.json file.

Bind the Endpoint

Add the Vinkius endpoint URL to your agent's MCP connections list:

"mcp_servers": {
  "parsehub": {
    "serverUrl": "https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp"
  }
}

Provide your secure token in place of [YOUR_TOKEN_HERE] to ensure your agent requests are authenticated.

Execute

Start your Antigravity session. The agent will autonomously discover and utilize the ParseHub tools with full Vinkius guardrails applied.

VS Code Copilot

⚡

One-Click Install (Recommended)

In your Vinkius Dashboard, simply click the Add to VS Code button for this server. We'll automatically configure your local workspace.

Or configure manually

Open MCP Settings

Open VS Code, press Ctrl/Cmd + Shift + P, and search for GitHub Copilot: MCP Servers.

Add Server Config

Add the Vinkius endpoint configuration to your mcp-servers.json file:

"parsehub": {
  "url": "https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp"
}

Ensure you replace [YOUR_TOKEN_HERE] with your token from cloud.vinkius.com.

LangChain

Install Dependencies

Install the LangChain MCP adapters for your environment:

pip install langchain-mcp-adapters

Connect the Server

Use the SSEClient in LangChain to connect to the Vinkius managed endpoint:

from langchain_mcp_adapters.client import SSEClient

# Connect to Vinkius
client = SSEClient(url="https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp")
tools = client.get_tools()

CrewAI

Define the Tool

Load the Vinkius MCP tools into your CrewAI agents:

from crewai import Agent
from mcp_crewai import MCPTool

# Connect securely to Vinkius
vinkius_tools = MCPTool(url="https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp")

# Assign to Agent
researcher = Agent(
    role='Data Researcher',
    tools=vinkius_tools.get_all()
)

Execute Task

Run your CrewAI process. The agent will autonomously route tasks to the Vinkius managed server.

Choose How to Get Started

Build a custom MCP for your own tools, or connect a ready-made integration from our catalog.

Build Your Own

Turn any API into an MCP. Import a spec, define Agent Skills, or deploy with MCPFusion.

Import from OpenAPI, Swagger, or YAML specs
Create Agent Skills with progressive disclosure
Deploy to edge with MCPFusion framework
Built in DLP, auth, and compliance on each call
Real time usage dashboard and cost metering
Publish to catalog or keep private

Start building

Make Your AI Do More

Start with ParseHub, then connect any of our 5,200+ other servers whenever your AI needs more. One click, no limits.

Use this MCP plus 5,200+ others, all in one place
Add new capabilities to your AI anytime you want
Connections are secured and governed automatically
Track usage and costs across all your servers
Works with Claude, ChatGPT, Cursor, and more
New servers added to the catalog weekly

Independent Platform Disclaimer: Vinkius is an independent platform and is not affiliated with, endorsed by, sponsored by, verified by, or otherwise authorized by ParseHub. All third-party trademarks, logos, and brand names are the property of their respective owners. Their use on this website is strictly for informational purposes to identify service compatibility and interoperability.

VINKIUS CLOUD

Cloud Hosted

Managed infra

V8 Isolated

Sandboxed per request

Zero-Trust Proxy

No stored credentials

DLP Enforced

Policy on each call

GDPR Compliant

EU data residency

Token Compression

~60% cost reduction

Your data is protected. See how we built it.

The Grind: Web Scraping Used To Be a Juggling Act

Today, getting structured data from a website means opening the scraping dashboard in one tab. You have to manually configure the starting URL, hit run, then switch tabs every ten minutes to check if it's still running. If you need to change the target site, you restart the whole process and repeat those clicks.

With this MCP, your agent handles the entire cycle. You tell it what you need, and it manages the complex headless browser automation in the background. The result? Clean, structured data payloads appear directly for your agent to use—no dashboard refreshing required.

ParseHub: Structured Data Extraction Via Conversation

Manual steps that disappear include navigating project tokens, monitoring status codes across different UIs, and manually downloading ZIP files just to get a JSON array. These are all abstracted away.

Now you simply command the action. You use `run_project` to kick off the job, check it with `get_run_details`, and when ready, pull the exact data using `get_run_data`. It's one simple conversation.

Support 24/7 support@vinkius.com ↗

Security Vinkius Trust Center ↗

SLA Service Level Agreement ↗

Report Listing Send Report ↗

data-extraction

headless-browser

web-crawling

json-output

cloud-scraping

automation-workflows

What ParseHub MCP does for your AI

Web scraping used to mean logging into a dedicated dashboard, setting up parameters, hitting 'run,' then waiting for emails or refreshing pages until the data finally appeared. Now, you can manage that entire process inside your chat agent. This MCP lets you treat web crawling like any other function call.

You can list all your existing projects—including their start URLs and templates. Need new data? Just dispatch a run job on command, specifying which project to use or even overriding the default starting URL. The system tracks everything, telling you if the job is queued or running. When it’s done, you don't just get a 'Success' message; you pull down secure, structured JSON arrays containing all the scraped payloads, ready for your agent to process.

Built · Hosted · Managed by Vinkius ParseHub - Web Scraping & Data Extraction MCP

Server ID 019d75ef-66f9-73b1-9183-e89e904e7d83

Vinkius Inspector

Compliance Grade A+

Score 98.33/100

Report View Report ↗

Benefits of connecting ParseHub MCP

You don't have to switch between the ParseHub dashboard and your agent. You trigger, monitor, and retrieve data—all within one chat session.

Need fresh data fast? Use get_last_ready_data to grab the absolute latest payload without having to track a specific run token first.

When you need to scrape different pages using the same template (like product categories), use run_project_with_url. It changes only the start page, not your extraction rules.

The system keeps track of everything. Use get_run_details to check if a job is queued or running without needing to refresh an external web app.

You can clean up old jobs and manage costs by using tools like cancel_run or permanently removing data with delete_run.

ParseHub MCP use cases

01 01

Monitoring Competitor Pricing Changes

A market analyst needs to know if a competitor changed its pricing structure. They ask the agent to run an extractor on the main product page, wait for get_run_details to confirm completion, and then use get_run_data to pull the structured JSON of all price points.

02 02

Processing a Batch of Articles

A research team has 50 articles on different websites. Instead of running 50 jobs manually, they ask the agent to use run_project_with_url for each unique URL, then collect all the resulting structured data into one payload.

03 03

Auditing Historical Scrapes

A data engineer needs proof of what was scraped last month. They ask the agent to list_runs, find a specific run ID, and confirm its contents using get_run_data before moving on.

04 04

Stopping an Overdue Job

A job gets stuck in an infinite loop. The user uses the agent to check the status via get_run_details, determines it's stalled, and immediately calls cancel_run to free up resources.

ParseHub MCP tradeoffs

What to watch out for, and the recommended way to handle each one.

Assuming data is ready.

Avoid

The user asks the agent for the final JSON payload right after running a job. The agent fails because the run status is still 'queued' or 'running', and get_run_data cannot be called yet.

Instead

Always check the progress first. Use get_run_details to monitor the job until the system confirms it is complete. Only then should you use get_run_data.

Ignoring project scope.

Avoid

The user wants to scrape data from a new site but uses the default run command, which only targets the original project's starting URL and template.

Instead

If you need to target a completely different page or set of pages while keeping the same scraping rules, use run_project_with_url. This overrides the default start address.

Overwriting data accidentally.

Avoid

A user repeatedly runs jobs without cleaning up old results, leading to a massive storage quota bill and confusion about which data is current.

Instead

Use list_runs first to identify the specific historical run you need. When finished with an old job, use delete_run to permanently free up that stored payload.

When to use ParseHub MCP

Use this MCP if your primary goal is automated, multi-step web data extraction and structured JSON output. You're working with content on the public internet—like product pages, competitor sites, or academic journals—and you need to run complex, headless browser scraping jobs without leaving your AI chat interface. This is a full lifecycle tool: it lets you list projects, manage runs, check status (get_run_details), and finally pull the data (get_run_data).

Don't use this if:
1. You are extracting data from a database (use a dedicated SQL/NoSQL connector).
2. You just need to send a simple message or write text (use a messaging MCP).
3. You only need to validate the format of data you already have in memory. For pure schema validation, use a type-safe tool like Pydantic AI instead.

Frequently asked questions about ParseHub MCP

How do I start a scrape if I want to use different pages? +

You use the run_project_with_url tool. This lets you target custom URLs while keeping all of your project's original scraping rules and template definitions intact.

Can ParseHub MCP list what projects I already have? +

Yes, use the list_projects tool. It shows every web scraping project you’ve set up, giving you the unique tokens needed for subsequent commands.

What if my scrape job fails? Can I stop it? +

You can monitor the status using get_run_details. If it's stalled or taking too long, use the cancel_run tool to safely stop the operation and free up resources.

How do I get data from a run that finished yesterday? +

First, you should list_runs to find the specific ID. Once you have the ID for a completed job, use get_run_data to pull down the structured JSON payload.

Do I need an API key for ParseHub MCP? +

Yep. You must subscribe and provide your ParseHub API Key during setup so the agent can authenticate and manage cloud scraping jobs on your behalf.

Give Claude and any AI agent real-world access

What AI agents can do with ParseHub with 10 Tools

Cancel Run

Stops a running or queued scrape job to free up cloud resources and prevent unnecessary charges.

Delete Run

Permanently removes old scraping run history and associated data, helping you clean...

Get Project

Retrieves the full configuration details for a specific web scraping project token.

Get Run Data

Downloads the final, structured JSON payload from a run only after it has been...

Get Run Details

Checks the current status of a specific scrape job to determine if it's waiting in...

Get Last Ready Data

Immediately fetches the latest completed data for a project without needing to track individual run tokens first.

List Projects

Lists all available web scraping projects in your account, providing unique tokens and status information.

List Runs

Provides a historical record of every run for a project, useful for auditing or...

Run Project

Initiates a new scrape job using the default start URL and template configured in an...

Run Project With Url

Starts a scraping run targeting a specific, custom web address while maintaining all...

Security and governance baked right in.

Claude AI

Open Claude Settings

Add Custom Connector

Start a conversation

Claude Code

Open your terminal

Add the MCP Server

Start coding

Cursor

One-Click Install (Recommended)

Open Cursor Settings

Add New Server

Use in Composer

Antigravity

Configure Agent Environment

Bind the Endpoint

Execute

VS Code Copilot

One-Click Install (Recommended)

Open MCP Settings

Add Server Config

Windsurf

One-Click Install (Recommended)

Open Windsurf Settings

Add Server Endpoint

LangChain

Install Dependencies

Connect the Server

CrewAI

Define the Tool

Execute Task

Choose How to Get Started

Build Your Own

Make Your AI Do More

The Grind: Web Scraping Used To Be a Juggling Act

ParseHub: Structured Data Extraction Via Conversation

data-extraction

headless-browser

web-crawling

json-output

cloud-scraping

automation-workflows

What ParseHub MCP does for your AI

How to set up ParseHub MCP

Who uses ParseHub MCP

Benefits of connecting ParseHub MCP

ParseHub MCP use cases

Monitoring Competitor Pricing Changes

Processing a Batch of Articles

Auditing Historical Scrapes

Stopping an Overdue Job

ParseHub MCP tradeoffs

Assuming data is ready.

Ignoring project scope.

Overwriting data accidentally.

When to use ParseHub MCP

Frequently asked questions about ParseHub MCP