ScrapingBee MCP for AI. Get clean, structured data from any website, reliably.

Works with every AI agent you already use

Claude, ChatGPT, Cursor, Gemini, Windsurf, VS Code, JetBrains, Vercel, and any MCP-compatible client

How this MCP server connects to your AI agent

ScrapingBee handles complex web data extraction, bypassing anti-bot measures like Cloudflare. Use it directly with your AI agent to scrape dynamic JavaScript content, pull structured Amazon product details, or search Google and Walmart—all without writing a single line of crawler code.

What AI agents can do with ScrapingBee Automation

Extracting dynamic website content

Use scrape_html to pull full-fidelity web page source code that runs JavaScript and bypasses common anti-bot defenses.

Running structured Google searches

Invoke search_google to retrieve organized JSON data from various search results (web, news, images) without manual parsing.

Pulling Amazon product details

Call get_amazon_product with an ASIN code to get structured pricing, ratings, and core product information directly.

Scraping major retail sites

Access specific search results from Walmart or YouTube using dedicated tools like search_walmart and search_youtube.

What AI agents can do with ScrapingBee: 7 Tools for Web Data Retrieval

These seven tools give your AI client direct access to structured data from web searches, e-commerce sites, and dynamic pages. No coding needed.

Make your AI actually useful.

Add this MCP to Claude, Cursor, or Windsurf and your AI stops guessing. It gets real tools to look things up, take action, and handle the stuff you keep doing by hand.

Start using ScrapingBee on Vinkius

Get Amazon Product

Scrapes and returns structured details for any specific Amazon product using its ASIN code.

Ask Chatgpt

Queries the ChatGPT API through ScrapingBee for general text generation tasks.

Fast Search

Performs a quick search engine result page (SERP) query.

Search Google

Scrapes and returns structured JSON results from Google Search for various result...

Scrape Html

Fetches the content of a web page, supporting JS rendering, proxy management, and...

Search Walmart

Scrapes search results specifically from Walmart's website.

Search Youtube

Scrapes and returns structured data from YouTube search results.

Security and governance baked right in.

Pick your AI client below to get set up. Just create a Vinkius account, subscribe, and you're instantly up and running. We handle the entire backend infrastructure, delivering out-of-the-box support for Streamable HTTP, SSE, and OAuth2, with zero messy routing required.

Claude AI

Open Claude Settings

Go to claude.ai, click your profile icon, then navigate to Customize → Connectors.

Add Custom Connector

Click the "+" button and select Add custom connector. Paste your Vinkius endpoint URL:

https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp

Replace [YOUR_TOKEN_HERE] with your token from cloud.vinkius.com. For OAuth-protected servers, expand Advanced settings to add credentials.

Start a conversation

Open a new chat. The ScrapingBee integration is available immediately — no restart needed.

Antigravity

Configure Agent Environment

Open your Antigravity agent's workspace configuration or mcp-servers.json file.

Bind the Endpoint

Add the Vinkius endpoint URL to your agent's MCP connections list:

"mcp_servers": {
  "scrapingbee-alternative": {
    "serverUrl": "https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp"
  }
}

Provide your secure token in place of [YOUR_TOKEN_HERE] to ensure your agent requests are authenticated.
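If you'd rather generate that entry than hand-edit it, here is a minimal sketch. It assumes the key layout shown above; the helper name build_mcp_config is illustrative and not part of any Vinkius tooling.

```python
import json

ENDPOINT_TEMPLATE = "https://edge.vinkius.com/{token}/mcp"

def build_mcp_config(token: str, label: str = "scrapingbee-alternative") -> dict:
    """Return an mcp_servers entry pointing at the Vinkius endpoint."""
    # Refuse empty tokens and the unreplaced [YOUR_TOKEN_HERE] placeholder.
    if not token or "[" in token or "]" in token:
        raise ValueError("replace the placeholder with a real token")
    url = ENDPOINT_TEMPLATE.format(token=token)
    return {"mcp_servers": {label: {"serverUrl": url}}}

# "example-token" is a dummy value used only to show the output shape.
print(json.dumps(build_mcp_config("example-token"), indent=2))
```

The placeholder check exists because pasting the URL without swapping in a token is the easiest mistake to make here.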

Execute

Start your Antigravity session. The agent will autonomously discover and utilize the ScrapingBee tools with full Vinkius guardrails applied.

VS Code Copilot

One-Click Install (Recommended)

In your Vinkius Dashboard, simply click the Add to VS Code button for this server. We'll automatically configure your local workspace.

Or configure manually

Open MCP Settings

Open VS Code, press Ctrl/Cmd + Shift + P, and search for GitHub Copilot: MCP Servers.

Add Server Config

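Add the Vinkius endpoint configuration to your MCP configuration file (in current VS Code releases this is typically .vscode/mcp.json, with entries under a top-level "servers" key):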

"scrapingbee-alternative": {
  "url": "https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp"
}

Ensure you replace [YOUR_TOKEN_HERE] with your token from cloud.vinkius.com.

LangChain

Install Dependencies

Install the LangChain MCP adapters for your environment:

pip install langchain-mcp-adapters

Connect the Server

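Use MultiServerMCPClient from langchain-mcp-adapters to connect to the Vinkius managed endpoint:

import asyncio
from langchain_mcp_adapters.client import MultiServerMCPClient

# Connect to Vinkius; get_tools() is a coroutine, so run it in an event loop
client = MultiServerMCPClient(
    {"scrapingbee": {"url": "https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp", "transport": "streamable_http"}}
)
tools = asyncio.run(client.get_tools())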

CrewAI

Define the Tool

Load the Vinkius MCP tools into your CrewAI agents:

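from crewai import Agent
from crewai_tools import MCPServerAdapter  # pip install 'crewai-tools[mcp]'

# Connect securely to Vinkius over Streamable HTTP
server_params = {
    "url": "https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp",
    "transport": "streamable-http",
}

# The adapter opens the connection and exposes the server's tools
with MCPServerAdapter(server_params) as vinkius_tools:
    # Assign to Agent
    researcher = Agent(
        role='Data Researcher',
        goal='Collect structured web data',
        backstory='An analyst who gathers data through ScrapingBee tools.',
        tools=vinkius_tools,
    )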

Execute Task

Run your CrewAI process. The agent will autonomously route tasks to the Vinkius managed server.

Choose How to Get Started

Build a custom MCP for your own tools, or connect a ready-made integration from our catalog.

Build Your Own

Turn any API into an MCP. Import a spec, define Agent Skills, or deploy with MCPFusion.

Import from OpenAPI, Swagger, or YAML specs
Create Agent Skills with progressive disclosure
Deploy to edge with MCPFusion framework
Built in DLP, auth, and compliance on every call
Real time usage dashboard and cost metering
Publish to catalog or keep private

Start building

Make Your AI Do More

Start with ScrapingBee, then connect any of our 5,100+ other servers whenever your AI needs more. One click, no limits.

Use this MCP plus 5,100+ others, all in one place
Add new capabilities to your AI anytime you want
Every connection is secured and compliant automatically
Track usage and costs across all your servers
Works with Claude, ChatGPT, Cursor, and more
New servers added to the catalog every week

Independent Platform Disclaimer: Vinkius is an independent platform and is not affiliated with, endorsed by, sponsored by, verified by, or otherwise authorized by ScrapingBee. All third-party trademarks, logos, and brand names are the property of their respective owners. Their use on this website is strictly for informational purposes to identify service compatibility and interoperability.

VINKIUS INFRASTRUCTURE

Cloud Hosted

Managed infra

V8 Isolated

Sandboxed per request

Zero-Trust Proxy

No stored credentials

DLP Enforced

Policy on every call

GDPR Compliant

EU data residency

Token Compression

~60% cost reduction

Your data is protected. See how we built it.

Built on the Model Context Protocol (MCP) for Claude, ChatGPT, Cursor, and more

The Model Context Protocol standardizes how applications expose capabilities to LLMs. Instead of operating in isolation, your AI gains direct access to external platforms, live data, and real-world actions through secure, standardized connections.

This connection provides 7 powerful capabilities that interface natively with Claude, ChatGPT, Cursor, and other compatible AI platforms. No middleware. No custom integration required.

Web data shouldn't require a full-time web developer. Solved with Vinkius AI Gateway

Today, getting solid data means clicking through several different tabs: the product page, the reviews section, the news articles, and sometimes even a map view. You copy links, you manually cross-reference prices across three different vendor sites, and then you spend an hour cleaning up all the messy HTML before your AI agent can actually use it.

With this MCP server, that whole process vanishes. Your agent calls the right tool—maybe `search_google` or `get_amazon_product`—and gets a clean JSON payload back instantly. You stop managing selectors and start analyzing data.

ScrapingBee MCP Server: Structured Data From Any Source

Forget the manual steps of writing custom Python scrapers for every new site layout or dealing with Cloudflare CAPTCHAs. You don't have to worry about proxy rotation, wait times, or if a site suddenly changes its CSS class name.

The server handles that infrastructure overhead. Your agent just asks for 'the content,' and you get the clean Markdown or HTML payload back. It’s reliable data retrieval without writing boilerplate code.

Support 24/7 support@vinkius.com ↗

Security Vinkius Trust Center ↗

SLA Service Level Agreement ↗

Report Listing Send Report ↗

scraping

proxy

headless-browser

data-extraction

ai-scraping

What your AI can actually do with this

You need clean data from the web, period. This server lets your AI agent grab it without you writing a single line of crawler code. ScrapingBee handles all the headache—the headless browsing, proxy rotation, and dealing with sites that run JavaScript. You just point the tool at what you want.

General Web Extraction (scrape_html)

When you need full-fidelity source code from any website, scrape_html is what you use. It fetches content even if the site relies on JavaScript to display it and manages proxy rotations so you don't get blocked. You can set custom wait times or specify ad blocking rules, which means the HTML you pull back is usable, not junk.

Major Retail & Search Sites

For specific e-commerce data, dedicated tools save you time. To grab structured product details from Amazon, just call get_amazon_product and supply an ASIN code; it returns clean pricing, ratings, and core info right away. You can also scrape search results for other big players: use search_walmart to pull search results straight from Walmart's site, or run search_youtube to get structured data from YouTube searches.

Advanced Search Capabilities

When you need general web intelligence, your options are solid. If you want to know what people are talking about, use search_google. This tool gives you organized JSON results for various search types (web pages, news articles, maps, and images) without needing manual parsing on your end. For a quick hit of info, the fast_search tool runs a lightweight SERP query.

AI Content Generation & Specialized Scraping

The server also includes ask_chatgpt, letting your agent query the ChatGPT API for general text generation tasks whenever you need it. For sites that no dedicated tool covers, scrape_html remains the general-purpose fallback.

How Your Agent Uses It

Your AI client just calls a tool, say search_google with specific parameters for news results. The server executes the scrape or query, cleans the data, and sends the result back to your agent's context window. You get it as raw HTML, clean Markdown text, or structured JSON data, ready to use immediately.
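Under the hood, an MCP tool call is a JSON-RPC 2.0 message with the method tools/call. The sketch below shows roughly what a client sends. The argument names (search, search_type) are hypothetical placeholders; check the tool's published schema for the real ones.

```python
import json
from itertools import count

_ids = count(1)  # simple incrementing request ids

def tool_call_request(name: str, arguments: dict) -> str:
    """Serialize an MCP tools/call request as a JSON-RPC 2.0 message."""
    message = {
        "jsonrpc": "2.0",
        "id": next(_ids),
        "method": "tools/call",
        "params": {"name": name, "arguments": arguments},
    }
    return json.dumps(message)

# Hypothetical arguments: the real parameter names may differ.
print(tool_call_request("search_google", {"search": "tech news today", "search_type": "news"}))
```

In practice your AI client builds and sends this message for you; the sketch is only meant to show what "calling a tool" amounts to on the wire.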

Built · Hosted · Managed by Vinkius ScrapingBee MCP Server - Web Scraping & Data Extraction

Server ID 019ea605-5734-73a3-9af7-ba525d7d3aad

Vinkius Inspector

Compliance Grade A+

Score 100/100

Report View Report ↗

Here's how it actually works

The bottom line is: you tell your AI agent what to scrape, and the server does the hard work of getting it cleanly.

Subscribe to this server and input your unique ScrapingBee API Key.

Your AI agent executes a tool call (e.g., scrape_html) providing the URL, parameters, and desired output format.

The server handles all proxy management, rendering, and scraping logic, returning clean data directly into your client's context.

What Changes When You Connect

Bypass anti-bot walls. Forget getting blocked by Cloudflare; the server handles proxy rotation and rendering issues so your scrape always completes.

Structured output means less cleanup work for you. You get Amazon product details or Google results as clean JSON, ready for immediate use in your workflow.

Handle JavaScript sites effortlessly. Tools like scrape_html fully render dynamic content—you don't have to write custom Selenium code just to see the page.

See it in action

01

Monitoring competitor pricing shifts

A market researcher needs to know if a rival lowered their price. They ask their agent to use get_amazon_product and search_google, comparing the current listing's price against historical data points, saving hours of manual comparison.
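As a rough sketch of the comparison step, assume the agent already has a product payload and a list of past prices. The payload shape, the dummy ASIN, and the helper names here are illustrative; a real get_amazon_product response may use different field names.

```python
from dataclasses import dataclass
from typing import Optional

@dataclass
class PriceChange:
    previous: float
    current: float

    @property
    def delta(self) -> float:
        """Absolute change in price, rounded to cents."""
        return round(self.current - self.previous, 2)

    @property
    def percent(self) -> float:
        """Relative change against the previous price, in percent."""
        return round(100 * (self.current - self.previous) / self.previous, 2)

def compare_price(product: dict, history: list) -> Optional[PriceChange]:
    """Compare the current price with the most recent historical point."""
    if not history:
        return None  # nothing to compare against yet
    return PriceChange(previous=history[-1], current=float(product["price"]))

# Hypothetical payload and history, for illustration only.
product = {"asin": "B000000000", "price": 89.99, "rating": 4.5}
change = compare_price(product, history=[99.99, 94.99])
print(change.delta, change.percent)
```

A negative delta means the rival dropped their price since the last recorded data point.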

02

Building an automated news aggregator

A developer wants a feed of top stories. They instruct their agent to run search_google for 'tech news today' and then pipe the resulting links into scrape_html, so the feed contains article snippets rather than bare links.
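The search-then-scrape pipeline can be sketched like this. The call_tool function stands in for whatever your MCP client provides, and the result fields (results, title, url, text) are assumptions made so the example runs offline.

```python
from typing import Callable

ToolCaller = Callable[[str, dict], dict]

def build_feed(call_tool: ToolCaller, query: str, limit: int = 3) -> list:
    """Search first, then fetch each result page and keep a short snippet."""
    results = call_tool("search_google", {"search": query})
    feed = []
    for item in results.get("results", [])[:limit]:
        page = call_tool("scrape_html", {"url": item["url"]})
        text = " ".join(page.get("text", "").split())  # collapse whitespace
        feed.append({"title": item["title"], "url": item["url"], "snippet": text[:200]})
    return feed

# Stand-in for a real MCP client, returning canned data.
def fake_call_tool(name: str, arguments: dict) -> dict:
    if name == "search_google":
        return {"results": [{"title": "Example story", "url": "https://example.com/a"}]}
    return {"text": "  Example   article body.  "}

print(build_feed(fake_call_tool, "tech news today"))
```

Passing the tool caller in as a parameter keeps the pipeline testable without network access.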

03

Analyzing viral video trends

A content strategist wants to track trending topics. They use search_youtube to get a list of popular videos and then cross-reference those titles with general web scraping using scrape_html for context.

04

Gathering diverse search data

An analyst needs to compare information across platforms. They run search_google for a topic, then use search_walmart and search_youtube to get three different structured views of the same subject.

The honest tradeoffs

Relying on simple HTTP requests

Anti-pattern

Trying to scrape a modern news site using basic Python libraries like requests, only to find the content is missing because it relies on client-side JavaScript rendering.

The Fix

You must use scrape_html. It supports JS rendering, so your agent can pull the full page source no matter how dynamic the target website is.
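To see why a plain HTTP fetch comes back empty, here is a small offline illustration. The HTML string is a made-up stand-in for what a basic request would receive from a client-rendered page: the headline lives only inside a script, so a parser that reads the static markup finds no visible text.

```python
from html.parser import HTMLParser

# What a basic HTTP client would receive: an empty container plus a script.
STATIC_HTML = """
<html><body>
  <div id="app"></div>
  <script>document.getElementById("app").textContent = "Top story";</script>
</body></html>
"""

class VisibleText(HTMLParser):
    """Collect text outside <script> and <style>, as a naive scraper would."""

    def __init__(self):
        super().__init__()
        self._skip = 0
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip:
            self._skip -= 1

    def handle_data(self, data):
        if not self._skip and data.strip():
            self.chunks.append(data.strip())

parser = VisibleText()
parser.feed(STATIC_HTML)
print(parser.chunks)  # empty list: the headline only exists after the script runs
```

A rendering step has to execute that script before the text exists in the page, which is the gap scrape_html is described as filling.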

Manually scraping Amazon listings

Anti-pattern

Attempting to scrape ASIN data by clicking through search results and copying prices/ratings into a spreadsheet. It's slow, breaks easily, and misses key details.

The Fix

Use the get_amazon_product tool. Give it the ASIN code, and it returns structured JSON with price, rating, and more—all in one call.

Ignoring search result types

Anti-pattern

Assuming a Google search for 'best cameras' will give you enough info just by scraping the main web link. You miss out on map data or news articles.

The Fix

Use search_google. It provides structured results across multiple categories (web, maps, news) so your agent has a complete view of the search query.
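Once results arrive tagged by category, sorting them is a few lines. This sketch assumes each result carries a type field and a title; that shape is hypothetical, and the real search_google payload may be organized differently.

```python
from collections import defaultdict

def group_by_type(results: list) -> dict:
    """Bucket result titles by category, defaulting to 'web' when untagged."""
    grouped = defaultdict(list)
    for item in results:
        grouped[item.get("type", "web")].append(item["title"])
    return dict(grouped)

# Hypothetical result shape, for illustration only.
sample = [
    {"type": "web", "title": "Best cameras of the year"},
    {"type": "news", "title": "New mirrorless camera announced"},
    {"type": "maps", "title": "Camera store nearby"},
    {"title": "Camera buying guide"},
]
print(group_by_type(sample))
```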

When It Fits, When It Doesn't

You should use this server if your workflow requires gathering data from external websites and you can't rely on a single API endpoint. It’s ideal for comparing structured results (e.g., running get_amazon_product vs. checking a price scraped via scrape_html). Don't use it, however, if all the data you need lives behind a dedicated, stable API with an official JSON feed already available to your client—in that case, direct integration is cleaner and faster. If you only need basic text generation or simple database lookups, don't waste time setting up scraping infrastructure.

Questions you might have

How do I scrape dynamic JavaScript sites with scrape_html? +

You simply pass the URL to scrape_html. The server supports full JS rendering, meaning it executes the page's scripts before scraping. This is how you get content that a plain HTTP fetch would miss because it only appears after the page's JavaScript has run.

Can I use search_google to find structured data from news sites? +

Yes. search_google provides structured JSON results for various result types, including news and web links. It doesn't just give you a list of URLs; it gives context.

What is the difference between scrape_html and get_amazon_product? +

The difference is specificity. scrape_html pulls everything from an arbitrary URL, giving raw web content. get_amazon_product takes an ASIN and returns only structured product data for that Amazon listing (like price and rating) in a reliable format.

Do I need separate tools for Walmart and Google? +

Yes. Using search_walmart gives you results tailored specifically to the Walmart site structure, whereas using search_google gives broader search engine results that might include other retail sites.

What happens if my requests exceed standard rate limits when using scrape_html? +

The server manages rate limiting automatically. It handles proxy rotation and request throttling so your scraping process doesn't get blocked by IP bans or excessive calls.

Can scrape_html output content as something other than raw HTML? +

Yes, you can specify the desired format in your query. You can receive the extracted data as clean Markdown, plain text, or standard raw HTML, depending on what your agent needs.

Does the server handle advanced anti-bot measures like Cloudflare blocks? +

It does. The underlying infrastructure bypasses these challenges automatically. Your AI client simply sends the request, and we handle the complexity of CAPTCHAs and bot detection.

What specific identifiers do I need when calling get_amazon_product? +

You must provide the product's unique ASIN code. Using this identifier ensures the scraper targets a single, precise listing to pull accurate details like current price and star rating.
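ASINs are 10-character alphanumeric codes, so it can be worth checking the value before spending a tool call on it. A minimal sketch, with normalize_asin as an illustrative helper name; the pattern only checks the format, not whether the listing exists.

```python
import re

ASIN_PATTERN = re.compile(r"[A-Z0-9]{10}")

def normalize_asin(value: str) -> str:
    """Trim and uppercase an ASIN, rejecting anything that is not 10 alphanumerics."""
    candidate = value.strip().upper()
    if not ASIN_PATTERN.fullmatch(candidate):
        raise ValueError(f"not a valid ASIN: {value!r}")
    return candidate

print(normalize_asin(" b08n5wrwnw "))
```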

Can I extract specific data from a page using natural language instead of CSS selectors? +

Yes! Use the scrape_html tool and provide your request in the ai_query parameter. The server will use ScrapingBee's AI capabilities to parse the HTML and return exactly what you asked for.

How do I handle websites that require JavaScript to load content? +

The scrape_html tool has render_js enabled by default. You can also use wait or wait_for parameters to ensure the page is fully loaded before the data is captured.
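A sketch of assembling those arguments follows. The parameter names render_js, wait, and wait_for come from the answer above; treating wait as milliseconds and wait_for as a CSS selector are assumptions, and scrape_html_args is an illustrative helper rather than part of the server.

```python
from typing import Optional

def scrape_html_args(url: str, *, render_js: bool = True,
                     wait: Optional[int] = None,
                     wait_for: Optional[str] = None) -> dict:
    """Build a scrape_html argument dict, leaving out options that are unset."""
    args = {"url": url, "render_js": render_js}
    if wait is not None:
        if wait < 0:
            raise ValueError("wait must be non-negative")
        args["wait"] = wait  # assumed to be milliseconds
    if wait_for:
        args["wait_for"] = wait_for  # assumed to be a CSS selector
    return args

print(scrape_html_args("https://example.com/news", wait_for="article.headline"))
```

Waiting on a selector is usually the tighter choice when you know which element signals that the page has finished loading; a fixed wait is the fallback when you don't.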

Can I get structured results from Google Search directly? +

Absolutely. Use the search_google tool with your query. It returns structured JSON containing organic results, ads, and related searches, saving you from parsing search result pages manually.
