ScrapingBee MCP for AI. Extract web data and bypass anti-bot systems from chat.

Q: How do I scrape pages that use JavaScript when using the ScrapingBee MCP Server?

You must use either scrapewithjs or the general scrapewebpage tool. These tools activate a headless browser, which executes all the site's JavaScript before pulling the content. This is essential for modern Single Page Applications (SPAs).

Q: What's the difference between extractdata and extractstructureddata?

extractdata uses natural language to guide the AI on what data you want. extractstructureddata is more precise; it requires you to provide specific CSS or XPath selectors, which guarantees the exact element you need.

Q: How do I check if my scraping budget is okay with ScrapingBee MCP Server?

Use either getusage or getapiusage. Both tools connect to your dashboard and provide real-time information on how many credits you've used and how much you have left.

Q: Can I just get a picture of the webpage using ScrapingBee MCP Server?

Yep, that's what takescreenshot does. It captures an image file of the URL, automatically handling any necessary browser rendering to make sure the screenshot is accurate.

Q: If the target site redesigns its layout, should I use extractdata or extractdatawithai?

Use extractdatawithai. The AI reads content contextually. If the underlying CSS selectors change (which they often do), the AI still finds and extracts the correct data based on natural language understanding.

Claude

ChatGPT

Cursor

Gemini

Windsurf

VS Code

JetBrains

Vercel

See Vinkius in Action

Works with every AI agent you already use

…and any MCP-compatible client

How this MCP server connects to your AI agent

ScrapingBee manages web data extraction by handling anti-bot systems, rotating proxies, and JavaScript rendering automatically. You connect it to your AI agent and run complex scraping jobs using natural language prompts.

It bypasses typical site blocks so you can reliably get raw HTML, structured JSON, or screenshots from any website.

What AI agents can do with ScrapingBee Automation

Extract data

Pulls generalized structured data out of a given web page using AI instructions.

Get usage

Retrieves a detailed breakdown of current API consumption and remaining credit limits.

Extract data with ai

Instructs the agent to extract and return data as formatted JSON based on natural language descriptions.

+ 7 more capabilities included

Parse complex web pages into structured JSON

The agent uses natural language to identify and structure data points from a given webpage into clean, usable JSON format.

Extract specific fields using selectors

You target data by providing CSS or XPath selectors, guaranteeing the extraction of precise structured data regardless of surrounding text.

Capture dynamic website content

The system runs a headless browser to render JavaScript, allowing you to scrape data from modern SPAs that load content dynamically.

Bypass detection and IP blocks

It manages proxy rotation and implements stealth mode protocols, making scraping difficult for anti-bot systems to detect or block.

Capture visual proof of a page

The tool takes a screenshot of the target URL, capturing what the user sees in their browser.

Ask an AI about this

Included with Plan

Waiting for input…

AI Agent

What AI agents can do with ScrapingBee: 10 Tools for Web Data Extraction

These tools let your AI client handle every aspect of web scraping—from basic data parsing to sophisticated proxy rotation and full browser rendering.

Make your AI actually useful.

Add this MCP to Claude, Cursor, or Windsurf and your AI stops guessing. It gets real tools to look things up, take action, and handle the stuff you keep doing by hand.

Start using ScrapingBee on Vinkius

Extract Data

Pulls generalized structured data out of a given web page using AI instructions.

Get Usage

Retrieves a detailed breakdown of current API consumption and remaining credit...

Extract Data With Ai

Instructs the agent to extract and return data as formatted JSON based on natural...

Extract Structured Data

Extracts specific, structured JSON data by defining precise CSS or XPath selectors...

Scrape Webpage

Scrapes an entire webpage while automatically handling JavaScript, proxy rotation...

Take Screenshot

Captures a visual screenshot of the requested website URL, automatically handling necessary browser rendering.

Get Api Usage

Checks the current usage status and available credits for your ScrapingBee API key.

Scrape With Js

Scrapes a page specifically by enabling full JavaScript rendering to capture dynamic...

Scrape With Proxy

Scrapes a page using premium proxy rotation, which helps bypass geo-restrictions and...

Scrape With Stealth

Scrapes a page in stealth mode to mimic human behavior and bypass advanced bot...

Security and governance baked right in.

Pick your AI client below to get set up. Just create a Vinkius account, subscribe, and you're instantly up and running. We handle the entire backend infrastructure, delivering out-of-the-box support for HTTPS Streamable, SSE, and OAuth2—zero messy routing required.

Claude AI

Open Claude Settings

Go to claude.ai, click your profile icon, then navigate to Customize → Connectors.

Add Custom Connector

Click the "+" button and select Add custom connector. Paste your Vinkius endpoint URL:

https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp

Replace [YOUR_TOKEN_HERE] with your token from cloud.vinkius.com. For OAuth-protected servers, expand Advanced settings to add credentials.

Start a conversation

Open a new chat. The ScrapingBee integration is available immediately — no restart needed.

Antigravity

Configure Agent Environment

Open your Antigravity agent's workspace configuration or mcp-servers.json file.

Bind the Endpoint

Add the Vinkius endpoint URL to your agent's MCP connections list:

"mcp_servers": {
  "scrapingbee": {
    "serverUrl": "https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp"
  }
}

Provide your secure token in place of [YOUR_TOKEN_HERE] to ensure your agent requests are authenticated.

Execute

Start your Antigravity session. The agent will autonomously discover and utilize the ScrapingBee tools with full Vinkius guardrails applied.

VS Code Copilot

⚡

One-Click Install (Recommended)

In your Vinkius Dashboard, simply click the Add to VS Code button for this server. We'll automatically configure your local workspace.

Or configure manually

Open MCP Settings

Open VS Code, press Ctrl/Cmd + Shift + P, and search for GitHub Copilot: MCP Servers.

Add Server Config

Add the Vinkius endpoint configuration to your mcp-servers.json file:

"scrapingbee": {
  "url": "https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp"
}

Ensure you replace [YOUR_TOKEN_HERE] with your token from cloud.vinkius.com.

LangChain

Install Dependencies

Install the LangChain MCP adapters for your environment:

pip install langchain-mcp-adapters

Connect the Server

Use the SSEClient in LangChain to connect to the Vinkius managed endpoint:

from langchain_mcp_adapters.client import SSEClient

# Connect to Vinkius
client = SSEClient(url="https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp")
tools = client.get_tools()

CrewAI

Define the Tool

Load the Vinkius MCP tools into your CrewAI agents:

from crewai import Agent
from mcp_crewai import MCPTool

# Connect securely to Vinkius
vinkius_tools = MCPTool(url="https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp")

# Assign to Agent
researcher = Agent(
    role='Data Researcher',
    tools=vinkius_tools.get_all()
)

Execute Task

Run your CrewAI process. The agent will autonomously route tasks to the Vinkius managed server.

Choose How to Get Started

Build a custom MCP for your own tools, or connect a ready-made integration from our catalog.

Build Your Own

Turn any API into an MCP. Import a spec, define Agent Skills, or deploy with MCPFusion.

Import from OpenAPI, Swagger, or YAML specs
Create Agent Skills with progressive disclosure
Deploy to edge with MCPFusion framework
Built in DLP, auth, and compliance on every call
Real time usage dashboard and cost metering
Publish to catalog or keep private

Start building

Make Your AI Do More

Start with ScrapingBee, then connect any of our 5,100+ other servers whenever your AI needs more. One click, no limits.

Use this MCP plus 5,100+ others, all in one place
Add new capabilities to your AI anytime you want
Every connection is secured and compliant automatically
Track usage and costs across all your servers
Works with Claude, ChatGPT, Cursor, and more
New servers added to the catalog every week

Independent Platform Disclaimer: Vinkius is an independent platform and is not affiliated with, endorsed by, sponsored by, verified by, or otherwise authorized by ScrapingBee. All third-party trademarks, logos, and brand names are the property of their respective owners. Their use on this website is strictly for informational purposes to identify service compatibility and interoperability.

VINKIUS INFRASTRUCTURE

Cloud Hosted

Managed infra

V8 Isolated

Sandboxed per request

Zero-Trust Proxy

No stored credentials

DLP Enforced

Policy on every call

GDPR Compliant

EU data residency

Token Compression

~60% cost reduction

Your data is protected. See how we built it.

Built on the Model Context Protocol (MCP) for Claude, ChatGPT, Cursor, and more

The Model Context Protocol standardizes how applications expose capabilities to LLMs. Instead of operating in isolation, your AI gains direct access to external platforms, live data, and real-world actions through secure, standardized connections.

This connection provides 10 powerful capabilities that interface natively with Claude, ChatGPT, Cursor, and other compatible AI platforms. No middleware. No custom integration required.

Manual web data extraction is a nightmare of tabs and copy-pasting., Solved with Vinkius AI Gateway

You know the drill. You find a list of competitor products, open 50 different tabs, manually click into each one to grab the price, the model number, and the availability status. Then you spend an hour cleaning up that mess in Excel because some prices are text and others are formatted numbers.

With this MCP server, your agent handles it all. You point your AI client at a URL list and prompt it: 'Get me the model name and current price for these 50 items.' It runs `scrape_webpage` behind the scenes, manages proxies, extracts the data, and gives you clean JSON back. Done.

ScrapingBee MCP Server: Get reliable web data straight from your chat.

You're no longer worried about technical failures—like a website changing its class name or using a dynamic load method. You just ask the question in natural language, and the tool suite handles the underlying complexity of JS rendering and anti-bot circumvention automatically.

It’s reliable data extraction, period. Whether you need to capture a screenshot with `take_screenshot` for proof, or extract structured JSON using `extract_structured_data`, it works every time.

Support 24/7 support@vinkius.com ↗

Security Vinkius Trust Center ↗

SLA Service Level Agreement ↗

Report Listing Send Report ↗

scrapingbee

web-scraping

data-extraction

headless-browser

proxy-rotation

captcha-solving

ai-extraction

stealth-scraping

mcp

What your AI can actually do with this

Yo, check it. This ain't just some API you plug in; this whole setup lets your agent handle web data extraction like a pro, bypassing all the bullshit anti-bot crap that usually bricks your job.

When you use extract_data, your AI client pulls general structured data off any webpage based on natural language instructions. If you need something tighter, you can run extract_data_with_ai, which makes sure the agent returns exactly what you ask for—clean JSON formatted right out of the gate. But if you know precisely where the data lives, forget the AI guesswork; use extract_structured_data to target specific fields by defining CSS or XPath selectors against a page, guaranteeing you get that precise structured JSON regardless of how messy the surrounding text is.

For scraping whole sites, it's all about controlling the render. You can run scrape_webpage, which scrapes everything while automatically handling JavaScript rendering, rotating proxies, and anti-bot measures for full browser simulation. If you only care about the dynamic stuff—the content that only loads after a script runs—you hit up scrape_with_js.

Need to get around geo-blocks or IP bans? Use scrape_with_proxy to run the scrape using premium proxy rotation, which makes it look like traffic is coming from different places. And if you wanna be sneaky, use scrape_with_stealth; this runs the page in stealth mode to mimic how a real human browses, slipping past even advanced bot detection systems.

Sometimes, you just need proof that the page loaded right. Running take_screenshot captures a visual image of the target URL; it handles all the necessary browser rendering so you get exactly what the user sees in their browser window. If you're pulling data from modern Single Page Applications (SPAs) or anything dynamic, your agent runs a headless browser to render JavaScript first, letting you capture that content.

When you need to manage the logistics, you've got tools for that too. To check if you can afford another scrape, use get_api_usage to see your current usage status and available credits against your ScrapingBee API key. For a full breakdown of how much juice you’re burning and what your remaining credit limits are, just run get_usage.

It's that simple.

Basically, you let your AI client do all the heavy lifting. You tell it what data to get—whether it's general text blocks, specific selectors, or just a visual screenshot—and this system handles the technical nightmare of getting past rate limits, proxy bans, and JavaScript rendering issues. You don't worry about complex infrastructure; you just prompt your agent with natural language instructions.

Built · Hosted · Managed by Vinkius ScrapingBee - Web Scraping & Data Extraction MCP Server

Server ID 019dd155-7276-7239-8754-ede30848d4b8

Vinkius Inspector

Compliance Grade A+

Score 98.33/100

Report View Report ↗

What Changes When You Connect

Stop dealing with broken scrapers. By using scrape_webpage, you get full browser rendering, meaning dynamic content (like pricing loaded by JS) actually gets pulled out—no manual workarounds required.

Don't guess how to structure data. If you need clean JSON, use extract_data_with_ai and just describe the fields in plain English; the AI handles the schema mapping for you.

Hitting a security wall? Use scrape_with_proxy. It rotates your IP address across residential proxies, letting you scrape high-security sites without triggering blocks or limits.

Need data that's absolutely specific? Skip the fuzzy extraction and use extract_structured_data with CSS/XPath selectors. This guarantees schema adherence for mission-critical fields.

Want to see what the user sees? The take_screenshot tool lets you capture visual proof of a page, which is great for debugging or reporting on site layouts.

See it in action

01 01

Competitive pricing data from dynamic e-commerce sites

A market analyst needs product specs and prices. Instead of using basic scraping that fails when the site loads JavaScript, they run scrape_with_js. This captures all the necessary JS-rendered content, allowing them to then use extract_data to pull out the structured names and prices.

02 02

Collecting lead contacts from a protected portal

A growth engineer needs multiple email addresses from a platform that blocks basic requests. They set up a loop using scrape_with_proxy, cycling through different IPs to scrape the user list, then use extract_data_with_ai on each page pull to get clean JSON records.

03 03

Debugging a complex web flow

A developer is testing an anti-bot feature. They run scrape_webpage with both proxy rotation and stealth mode enabled (scrape_with_proxy + scrape_with_stealth). If the data comes back, they know their access method works for high-security environments.

04 04

Generating a report on site layout issues

A QA tester needs to prove that an element is missing. They use take_screenshot first to capture the current view. If the data extraction fails, they can share the screenshot alongside the failure log to pinpoint exactly what went wrong.

The honest tradeoffs

Assuming static HTML is enough

Anti-pattern

Trying to scrape a modern news site using basic scraping when all the article content loads after an initial JS call. The data will be incomplete and missing key information.

The Fix

You must use scrape_with_js or scrape_webpage. These tools run a full browser engine, ensuring that JavaScript executes before you attempt to extract data.

Using general scraping for specific fields

Anti-pattern

Running only extract_data when the required data point is nested deep in a table structure. The AI might guess wrong or pull too much surrounding fluff.

The Fix

For guaranteed accuracy, use extract_structured_data. Define your target using precise CSS selectors (e.g., .product-card > h2) to nail down exactly what you need.

Ignoring IP blocking entirely

Anti-pattern

Running a loop of 100 scrapes from the same machine without changing IPs, resulting in an immediate ban and zero useful data.

The Fix

Always wrap bulk scraping with scrape_with_proxy. This rotates your source IP address pool across premium residential proxies, maintaining persistence.

When It Fits, When It Doesn't

Use this server if your goal is to pull structured data from the web and you suspect anti-bot measures might block simple scrapers. Start by determining the complexity: 1) Is the content loaded via JavaScript? If yes, use scrape_with_js or scrape_webpage. 2) Are you getting blocked? Use scrape_with_proxy or scrape_with_stealth. 3) Do you need perfect schema adherence? Use extract_structured_data with CSS selectors. If all else fails, try the general approach of extract_data_with_ai, but always be prepared to refine your method based on the source's complexity.

Questions you might have

How do I scrape pages that use JavaScript when using the ScrapingBee MCP Server? +

You must use either scrape_with_js or the general scrape_webpage tool. These tools activate a headless browser, which executes all the site's JavaScript before pulling the content. This is essential for modern Single Page Applications (SPAs).

Is ScrapingBee MCP Server safe for scraping high-security sites? +

Yes. For high-security or restricted sites, you need to use scrape_with_proxy. This tool manages rotating residential proxies, which helps keep your IP address hidden and prevents rate limiting.

What's the difference between `extract_data` and `extract_structured_data`? +

extract_data uses natural language to guide the AI on what data you want. extract_structured_data is more precise; it requires you to provide specific CSS or XPath selectors, which guarantees the exact element you need.

How do I check if my scraping budget is okay with ScrapingBee MCP Server? +

Use either get_usage or get_api_usage. Both tools connect to your dashboard and provide real-time information on how many credits you've used and how much you have left.

Can I just get a picture of the webpage using ScrapingBee MCP Server? +

Yep, that's what take_screenshot does. It captures an image file of the URL, automatically handling any necessary browser rendering to make sure the screenshot is accurate.

How does `scrape_with_proxy` handle getting blocked or rate-limited by a website? +

It handles blocks automatically. The tool uses premium proxy rotation, so if one IP address gets flagged, it instantly switches to another. This keeps your scraping session running without interruption due to IP bans.

If the target site redesigns its layout, should I use `extract_data` or `extract_data_with_ai`? +

Use extract_data_with_ai. The AI reads content contextually. If the underlying CSS selectors change (which they often do), the AI still finds and extracts the correct data based on natural language understanding.

When I run `scrape_webpage`, can I get the raw HTML payload instead of just structured JSON? +

Yes, you retrieve the complete source. The tool captures the full, rendered HTML content. This lets your agent process the page entirely later—great for deep analysis or archiving.

Can my AI automatically extract structured JSON from a web page using ScrapingBee? +

Yes! Use the extract_data tool. You can provide standard extraction rules or set ai=true to let ScrapingBee's AI models identify and parse the data fields you need automatically.

How do I use premium or residential proxies for high-security sites? +

Simply include premium_proxy: true in your scrape_general parameters. This will route your request through residential IPs, making it much harder for anti-bot systems to detect and block.

How do I find my ScrapingBee API Key? +

Log in to your ScrapingBee dashboard, and your API Key will be clearly visible in the Credentials section on the main page.

How this MCP server connects to your AI agent

What AI agents can do with ScrapingBee Automation

Extract data

Get usage

Extract data with ai

What AI agents can do with ScrapingBee: 10 Tools for Web Data Extraction

Extract Data

Pulls generalized structured data out of a given web page using AI instructions.

Get Usage

Retrieves a detailed breakdown of current API consumption and remaining credit...

Extract Data With Ai

Instructs the agent to extract and return data as formatted JSON based on natural...

Extract Structured Data

Extracts specific, structured JSON data by defining precise CSS or XPath selectors...

Scrape Webpage

Scrapes an entire webpage while automatically handling JavaScript, proxy rotation...

Take Screenshot

Captures a visual screenshot of the requested website URL, automatically handling necessary browser rendering.

Get Api Usage

Checks the current usage status and available credits for your ScrapingBee API key.

Scrape With Js

Scrapes a page specifically by enabling full JavaScript rendering to capture dynamic...

Scrape With Proxy

Scrapes a page using premium proxy rotation, which helps bypass geo-restrictions and...

Scrape With Stealth

Scrapes a page in stealth mode to mimic human behavior and bypass advanced bot...

Security and governance baked right in.

Claude AI

Open Claude Settings

Add Custom Connector

Start a conversation

Claude Code

Open your terminal

Add the MCP Server

Start coding

Cursor

One-Click Install (Recommended)

Open Cursor Settings

Add New Server

Use in Composer

Antigravity

Configure Agent Environment

Bind the Endpoint

Execute

VS Code Copilot

One-Click Install (Recommended)

Open MCP Settings

Add Server Config

Windsurf

One-Click Install (Recommended)

Open Windsurf Settings

Add Server Endpoint

LangChain

Install Dependencies

Connect the Server

CrewAI

Define the Tool

Execute Task

Choose How to Get Started

Build Your Own

Make Your AI Do More

Built on the Model Context Protocol (MCP) for Claude, ChatGPT, Cursor, and more

Manual web data extraction is a nightmare of tabs and copy-pasting., Solved with Vinkius AI Gateway

ScrapingBee MCP Server: Get reliable web data straight from your chat.

What your AI can actually do with this

Here's how it actually works

Who is this actually for?

What Changes When You Connect

Competitive pricing data from dynamic e-commerce sites

Collecting lead contacts from a protected portal

Debugging a complex web flow

Generating a report on site layout issues

The honest tradeoffs

Assuming static HTML is enough

Using general scraping for specific fields

Ignoring IP blocking entirely

When It Fits, When It Doesn't

Questions you might have