Grepsr MCP. Manage web scraping and retrieve structured data on demand.

Q: How do I check the status of a crawl using the getreporthistory tool?

The getreporthistory tool provides a full log of every crawl run. You can check the run status, see how many records were processed, and identify the exact date and time of the last successful run.

Q: Can I trigger a crawl without using the runreport tool?

No. The runreport tool is the specific function designed to initiate an immediate, manual crawl run for a defined report ID. It's the only way to manually refresh the data.

Q: How do I set up an automatic alert when data is ready?

Use the createwebhook tool. This sets up a notification URL that alerts your external system (like a Slack channel or database) the moment Grepsr finishes processing new data.

Q: Where do I check if my data can reach S3?

You check this using the listintegrations tool. This lists all active data delivery endpoints and confirms if S3 or SFTP is correctly configured for your accounts.

Q: How do I check my API usage limits using the getusagestats tool?

The getusagestats tool provides real-time metrics on your account's API usage and current request limits. You can check if you're nearing a limit or what your quota is for the month.

Q: What details can I retrieve about a specific report using the getreportdetails tool?

The getreportdetails tool gives you the full metadata and configuration for any report. This includes the original scraping rules, the last time it ran, and other setup parameters.

Q: Can I list all my projects using the listprojects tool?

Yes, the listprojects tool retrieves a comprehensive list of all your scraping projects. This lets you see and manage the containers for your individual reports.

Claude

ChatGPT

Cursor

Gemini

Windsurf

VS Code

JetBrains

Vercel

See Vinkius in Action

Works with every AI agent you already use

…and any MCP-compatible client

Just plug in your AI agents and start using Vinkius.

Grepsr MCP Server manages your web scraping workflow. It lets your AI client trigger on-demand crawls, view execution history, and retrieve structured datasets from any report.

You can also manage webhook setups and list all connected cloud storage integrations (S3, SFTP).

What your AI agents can do

Create webhook

Sets up a new URL webhook to notify systems when a specific report has new data.

Get latest data

Gets the most recent set of structured data records from a specified report.

Get me

Retrieves basic metadata and account details for the connected Grepsr account.

+ 9 more capabilities included

Manage all scraping reports and projects

List all existing reports and projects, and retrieve detailed configuration information for any specific item.

Initiate manual data crawls

Trigger an immediate, on-demand crawl run for a specified report to update your data.

Fetch the latest scraped data

Retrieve the most recent, structured dataset associated with a given report.

Audit data delivery connections

List all active data integrations, such as S3 and SFTP, to confirm data flow setup.

Monitor and query crawl history

Check the execution status, record counts, and full history for any given report.

Set up data change notifications

Create, list, and manage webhooks to notify external systems when new data is available.

Ask AI about this MCP

Ask ChatGPT

Ask Claude

Ask Perplexity

Supported MCP Clients

Claude

ChatGPT

Cursor

Gemini

Windsurf

VS Code

JetBrains

Vercel

+ other MCP clients

Free for Subscribers

Waiting for input…

AI Agent

Grepsr MCP Server: 12 Tools for Web Scraping & Data Management

These tools let your AI client interact directly with your Grepsr account to manage reports, run crawls, retrieve data, and audit your data pipeline.

create019d75ab

create webhook

Sets up a new URL webhook to notify systems when a specific report has new data.

get019d75ab

get latest data

Gets the most recent set of structured data records from a specified report.

get019d75ab

get me

Retrieves basic metadata and account details for the connected Grepsr account.

get019d75ab

get report data

Queries and returns structured records pulled from a specific report ID.

get019d75ab

get report details

Fetches the full configuration and metadata for a given report or crawler.

get019d75ab

get report history

Retrieves a detailed log of all past crawl runs for a specific report.

get019d75ab

get usage stats

Checks the current API usage count and any rate limit restrictions on the account.

list019d75ab

list integrations

Lists all active destinations where scraped data is delivered, like S3 or SFTP.

list019d75ab

list projects

Lists every scraping project configured in the Grepsr account.

list019d75ab

list reports

Lists every report and crawler currently set up in the Grepsr account.

list019d75ab

list webhooks

Lists all webhooks configured for a specific report ID.

run019d75ab

run report

Starts an immediate, manual crawl run for a specified report to update its data.

Choose How to Get Started

Build a custom MCP for your own tools, or connect a ready-made integration from our catalog.

Build Your Own

Turn any API into an MCP. Import a spec, define Agent Skills, or deploy with MCPFusion.

Import from OpenAPI, Swagger, or YAML specs
Create Agent Skills with progressive disclosure
Deploy to edge with MCPFusion framework
Built in DLP, auth, and compliance on every call
Real time usage dashboard and cost metering
Publish to catalog or keep private

Start building

Make Your AI Do More

Start with Grepsr, then connect any of our 4,700+ other servers whenever your AI needs more. One click, no limits.

Use this MCP plus 4,700+ others, all in one place
Add new capabilities to your AI anytime you want
Every connection is secured and compliant automatically
Track usage and costs across all your servers
Works with Claude, ChatGPT, Cursor, and more
New servers added to the catalog every week

What you can do with this MCP connector

You're connecting your AI client to Grepsr so you can manage your whole web scraping operation. You'll get direct access to the tools needed to keep your data fresh and flowing. list_reports and list_projects let you see every report and every project you've got set up. You can then use get_report_details to pull the full setup and metadata for any specific report or crawler.

Need to know what's going on? get_report_history gives you a detailed log of every crawl run, showing you the execution status and record counts. You can kick off a manual update anytime with run_report, which starts an immediate crawl run for a specific report. To grab the freshest data, use get_latest_data for the most recent structured records, or get_report_data to query structured records from a specific report ID.

You can also check the full history by calling get_report_history. For the connections, list_integrations shows you all the destinations—like S3 or SFTP—where your scraped data is going. You'll also manage notifications with create_webhook to set up a new URL webhook, and list_webhooks lets you see all the webhooks configured for a report.

You can check your API usage and rate limit status with get_usage_stats. You can pull basic account info using get_me. These tools let you manage data delivery, check usage limits, and control the entire data flow right through your agent.

How Grepsr MCP Works

1 Subscribe to the Grepsr server and enter your API key (found in your Grepsr account under Profile > Personal Details).
2 Tell your AI client the specific task: 'List all my scraping reports' or 'Run a crawl for ID 104'.
3 Your agent executes the tool call, retrieving the list of reports or initiating the crawl, and presents the result to you.

The bottom line is you can manage and extract data from your web scraping pipeline using natural language commands.

Who Is Grepsr MCP For?

Data Engineers who need to check crawl statuses and verify record counts without opening a dashboard. Market Researchers who must trigger data refreshes for competitor pricing. Operations Teams that need to retrieve the latest scraped datasets and verify delivery integration status quickly.

Data Engineer

Uses get_report_history to check crawl statuses and get_usage_stats to verify record counts without manual dashboard navigation.

Market Researcher

Triggers a data refresh for competitor product lists using run_report directly through the chat interface.

Operations Analyst

Retrieves the latest scraped datasets using get_latest_data and verifies data delivery status using list_integrations.

What Changes When You Connect

Don't manually check dashboards. Use get_report_history to track the status and record counts of crawl runs instantly.
Stop guessing if data arrived. Use list_integrations to see all active data delivery connections (S3, SFTP) and confirm data flow setup.
Need fresh data? Trigger a crawl with run_report and get the dataset refreshed without touching the UI.
Pull the final numbers immediately. get_latest_data fetches the most recent structured records for a report, skipping the history logs.
Keep systems updated. Use create_webhook to set up notifications that alert your internal systems as soon as new data lands.
Understand your limits. get_usage_stats checks your API usage against request limits so you never hit a quota wall.

Real-World Use Cases

Checking Competitor Price Changes

A market researcher needs to know if a competitor changed its price. They ask their agent to run run_report for the product list report. The agent confirms the crawl started, then uses get_report_history to monitor the status until the data is ready. Finally, it calls get_latest_data to present the updated prices.

Auditing Data Delivery Setup

An operations analyst needs to confirm that the nightly scrape data made it to the cloud warehouse. They use list_integrations to verify that S3 and SFTP are active. They then use list_reports to confirm the correct report is feeding that data.

Debugging Data Ingestion Failures

A data engineer notices a report hasn't updated. They first run get_report_details to check the report's configuration. Then, they use get_report_history to find the last failed run and diagnose the exact failure point without manual dashboard navigation.

Automating Data Alerts

An ops team needs to know instantly when a critical dataset is updated. Instead of polling, they use create_webhook to set up a notification. The agent confirms the webhook is active, eliminating the need for constant manual checks.

The Tradeoffs

Asking for all data in one shot

Trying to pull every single record from a report using one massive query that times out or hits payload limits. This wastes tokens and fails silently.

→ First, use get_report_history to determine the report's run date. Then, use get_report_data to query specific date ranges or filtered record sets. Always narrow the scope.

Confusing reports with projects

Thinking that listing all projects (list_projects) gives you the current, structured data. It only gives you the container; you still need to run the data extraction.

→ To see the current data, you must first use list_reports to find the report ID, and then use get_latest_data with that ID.

Ignoring data delivery status

Assuming that because a report ran successfully, the data actually reached the target warehouse. The job might have failed downstream.

→ Always run list_integrations to verify that the destination (S3, SFTP) is active and configured for the relevant report.

When It Fits, When It Doesn't

Use this server if your primary goal is to programmatically manage, monitor, and extract structured data from web scraping reports. You need to trigger crawls (run_report), check data lineage (list_integrations), or pull records directly (get_report_data). Don't use this if you just need to read a static file or run a general database query; those require different database tools. If you only need to know what reports exist, use list_reports. If you need to know if a report can send data anywhere, use list_integrations. If you need to know when to run it, use get_report_history.

Independent Platform Disclaimer: Vinkius is an independent platform and is not affiliated with, endorsed by, sponsored by, verified by, or otherwise authorized by Grepsr. All third-party trademarks, logos, and brand names are the property of their respective owners. Their use on this website is strictly for informational purposes to identify service compatibility and interoperability.

VINKIUS INFRASTRUCTURE

Cloud Hosted

Managed infra

V8 Isolated

Sandboxed per request

Zero-Trust Proxy

No stored credentials

DLP Enforced

Policy on every call

GDPR Compliant

EU data residency

Token Compression

~60% cost reduction

How we secure it →

Works with Claude, ChatGPT, Cursor, and more

The Model Context Protocol standardizes how applications expose capabilities to LLMs. Instead of operating in isolation, your AI gains direct access to external platforms, live data, and real-world actions through secure, standardized connections.

This server provides 12 capabilities that interface natively with Claude, ChatGPT, Cursor, and any MCP client. No middleware. No custom integration required.

Available Capabilities

create_webhook get_latest_data get_me get_report_data get_report_details get_report_history get_usage_stats list_integrations list_projects list_reports list_webhooks run_report

Manual data pipelines require constant dashboard refreshing.

Today, updating data means logging into the Grepsr dashboard. You check the 'History' tab to see if the last crawl succeeded, then you check the 'Integrations' tab to confirm if S3 received the file. If anything is off, you have to manually run the job and wait, often clicking through three different tabs just to get a status update.

With the Grepsr MCP Server, you tell your agent, 'Check the status of the Amazon Product Data report.' The agent handles the multi-step check—it verifies the run status and the delivery integration status—and gives you a single, clear answer in the chat. No dashboard clicking required.

Grepsr MCP Server: Get structured data and manage crawls.

Before, to get the latest data, you had to navigate to the report, find the 'Run' button, click it, wait for the process to finish, and *then* find the 'View Data' button. It was a three-step process with high failure points.

Now, you just tell your agent: 'Run the Amazon Product Data report and show me the 5 newest records.' The agent handles the full sequence—triggering the crawl, waiting for completion, and pulling the records—and gives you the final, clean data set instantly.

Common Questions About Grepsr MCP

How do I check the status of a crawl using the get_report_history tool? +

The get_report_history tool provides a full log of every crawl run. You can check the run status, see how many records were processed, and identify the exact date and time of the last successful run.

Can I trigger a crawl without using the run_report tool? +

No. The run_report tool is the specific function designed to initiate an immediate, manual crawl run for a defined report ID. It's the only way to manually refresh the data.

What does the get_latest_data tool actually retrieve? +

It retrieves the most recent, structured data records for a report. This is a direct data pull, unlike get_report_details, which only fetches configuration metadata.

How do I set up an automatic alert when data is ready? +

Use the create_webhook tool. This sets up a notification URL that alerts your external system (like a Slack channel or database) the moment Grepsr finishes processing new data.

Where do I check if my data can reach S3? +

You check this using the list_integrations tool. This lists all active data delivery endpoints and confirms if S3 or SFTP is correctly configured for your accounts.

How do I check my API usage limits using the get_usage_stats tool? +

The get_usage_stats tool provides real-time metrics on your account's API usage and current request limits. You can check if you're nearing a limit or what your quota is for the month.

What details can I retrieve about a specific report using the get_report_details tool? +

The get_report_details tool gives you the full metadata and configuration for any report. This includes the original scraping rules, the last time it ran, and other setup parameters.

Can I list all my projects using the list_projects tool? +

Yes, the list_projects tool retrieves a comprehensive list of all your scraping projects. This lets you see and manage the containers for your individual reports.

Can my agent trigger a new web crawl in Grepsr? +

Yes. Use the 'run_report' tool. By providing the Report ID, the agent can programmatically trigger an on-demand crawl, starting the data extraction process immediately flawlessly.

How do I retrieve the actual scraped data records via chat? +

You can use the 'get_report_data' or 'get_latest_data' tools. Your agent will fetch the structured records from Grepsr's database and present them in a readable format within your chat interface flawlessly.

Can I check my API usage limits through the agent? +

Absolutely. Use the 'get_usage_stats' tool. Your agent will retrieve your current plan limits and remaining API credits, helping you manage your data extraction budget flawlessly.

Use it with your favorite AI tools

Connect this server to Cursor, Claude, VS Code, and more.

OpenAI Agents SDK sdk-python

Google ADK sdk-python

Pydantic AI sdk-python

Vercel AI SDK sdk-typescript