ParseHub MCP Server
Control advanced cloud scraping projects via ParseHub — list targets, dispatch headless runs, trace crawler status, and fetch extracted datasets directly via AI.
Ask AI about this MCP Server
Vinkius supports streamable HTTP and SSE.

* Every MCP server runs on Vinkius-managed infrastructure inside AWS - a purpose-built runtime with per-request V8 isolates, Ed25519 signed audit chains, and sub-40ms cold starts optimized for native MCP execution. See our infrastructure
What is the ParseHub MCP Server?
The ParseHub MCP Server gives AI agents like Claude, ChatGPT, and Cursor direct access to ParseHub via 10 tools. Control advanced cloud scraping projects via ParseHub — list targets, dispatch headless runs, trace crawler status, and fetch extracted datasets directly via AI. Powered by the Vinkius - no API keys, no infrastructure, connect in under 2 minutes.
Built-in capabilities (10)
Tools for your AI Agents to operate ParseHub
Ask your AI agent "Fetch the list of scrape projects I have on my ParseHub account." and get the answer without opening a single dashboard. With 10 tools connected to real ParseHub data, your agents reason over live information, cross-reference it with other MCP servers, and deliver insights you would spend hours assembling manually.
Works with Claude, ChatGPT, Cursor, and any MCP-compatible client. Powered by the Vinkius - your credentials never touch the AI model, every request is auditable. Connect in under two minutes.
Why teams choose Vinkius
One subscription gives you access to thousands of MCP servers - and you can deploy your own to the Vinkius Edge. Your AI agents only access the data you authorize, with DLP that blocks sensitive information from ever reaching the model, kill switch for instant shutdown, and up to 60% token savings. Enterprise-grade infrastructure and security, zero maintenance.
Build your own MCP Server with our secure development framework →Vinkius works with every AI agent you already use
…and any MCP-compatible client


















ParseHub MCP Server capabilities
10 toolsIf the run was already scraping pages, partial data may be available. Data from already-scraped pages is preserved and can be retrieved with get_run_data. Use this to stop long-running scrapes or free up queue slots. Cancel a queued or actively running ParseHub run
Cannot be undone. Use this to clean up old runs and free up storage quota on your account. Permanently delete a ParseHub run and its extracted data
Ideal for dashboards or integrations that always want the freshest available data without managing individual run tokens. Instantly get the latest completed data for a ParseHub project
The project_token can be found via list_projects or in the ParseHub desktop client settings tab. Get detailed configuration of a specific ParseHub project
Only works when the run status is "complete" and data_ready is true. The JSON structure mirrors the template selection configuration set up in the ParseHub desktop client. Download the raw JSON data extracted from a completed ParseHub run
Poll this endpoint to wait for a run to complete before fetching data. Check the status of a specific ParseHub run
Each project includes a project_token (unique identifier), title, last_run timestamp, and template configuration. Use the project_token for all subsequent run management operations. List all ParseHub web scraping projects
Useful for auditing or finding a specific completed run to fetch data from. Get the history of all runs for a ParseHub project
Returns a run_token for tracking progress. The run enters a queue and begins processing within seconds. Use get_run to monitor and get_run_data to retrieve results once complete. Start a new ParseHub scraping run for a project
Perfect for scraping different pages with the same template (e.g., different product categories). The template extraction rules still apply unchanged — only the starting page changes. Start a ParseHub run targeting a custom URL instead of the project default
What the ParseHub MCP Server unlocks
Bring ParseHub Cloud Scraping directly into your AI workflows. Manage pre-configured web scraping targets natively and orchestrate complex headless browser automation directly from chat. Dispatch run jobs on command, query execution status limits, and extract final parsed payloads securely.
What you can do
- Project Navigation — Inspect and list configured ParseHub projects, determining start URLs, templates, and total crawler pages attached
- Execution Dispatch — Command remote servers to trigger specific headless data extraction jobs
run_projectoptionally overriding starting URLs natively - Observability Tracing — Monitor exactly where a
Runobject is (queued, initialized, running, complete) without checking the desktop app - Payload Extraction — Pull down structured arrays containing the scraped payloads securely via
get_run_datamatching explicit datasets
How it works
1. Subscribe to this server
2. Enter your ParseHub API Key
3. Start orchestrating complex scraping runs natively from Claude, Cursor, or your MCP UI
Who is this for?
- Data Engineers — trigger cloud scraping logic securely and pipeline extracted datasets straight into processing tools natively
- Marketing Intel — fetch completed scrapers watching competitor pricing logic tracking JSON arrays automatically
- Research Analysts — kick off academic paper extractors via chat and digest the results upon completion
Frequently asked questions about the ParseHub MCP Server
Do I need the ParseHub Desktop tool running to use this?
No. This integration operates completely natively via ParseHub's Cloud API endpoints. You only need the desktop app to build the templates originally. All executions mapped here happen on their cloud scaling servers.
Can I provide a different Start URL when running a project?
Yes. The run_project_with_url command allows you to explicitly provide a start_url query property. This instructs the ParseHub crawler to ignore its project-saved URL and begin parsing the newly mapped domain using the same semantic template.
Is the downloaded data returned in JSON or raw HTML?
The payload fetched by get_run_data is exported entirely as structured, pre-parsed JSON mirroring the exact template node selections defined in your project architecture.
More in this category
You might also like
Connect ParseHub with your favorite client
Step-by-step setup guides for every MCP-compatible client and framework:
Anthropic's native desktop app for Claude with built-in MCP support.
AI-first code editor with integrated LLM-powered coding assistance.
GitHub Copilot in VS Code with Agent mode and MCP support.
Purpose-built IDE for agentic AI coding workflows.
Autonomous AI coding agent that runs inside VS Code.
Anthropic's agentic CLI for terminal-first development.
Python SDK for building production-grade OpenAI agent workflows.
Google's framework for building production AI agents.
Type-safe agent development for Python with first-class MCP support.
TypeScript toolkit for building AI-powered web applications.
TypeScript-native agent framework for modern web stacks.
Python framework for orchestrating collaborative AI agent crews.
Leading Python framework for composable LLM applications.
Data-aware AI agent framework for structured and unstructured sources.
Microsoft's framework for multi-agent collaborative conversations.
Give your AI agents the power of ParseHub MCP Server
Production-grade ParseHub MCP Server. Verified, monitored, and maintained by Vinkius. Ready for your AI agents — connect and start using immediately.






