Scrapfly MCP for AI. Structured Data & Visual Proof from Any Website

Q: How does Scrapfly MCP Server handle Cloudflare anti-bot bypass?

The server handles this automatically. When you use the webscrape tool, it runs sophisticated proxy and rendering techniques to ensure your request gets through the bot protection layers.

Q: Can I just scrape raw HTML with Scrapfly MCP Server?

Yes, you can. Use webscrape if you need the full source code. But remember, for usable data, follow up by using aidataextraction to structure it.

Q: What tools do I use to check my usage?

You'll use checkcreditusage. This tool lets your agent read your current consumption stats, so you always know how much API credit is left for your job.

Q: What is the best way to verify my API key setup before running a large scrape with Scrapfly?

Run the testscrapflyauth tool. This confirms your connection status and validates that your API key is properly linked to your account, saving you credits on failed jobs.

Claude

ChatGPT

Cursor

Gemini

Windsurf

VS Code

JetBrains

Vercel

See Vinkius in Action

Works with every AI agent you already use

…and any MCP-compatible client

How this MCP server connects to your AI agent

Scrapfly lets your AI agent scrape web data at scale through a managed API connection. It handles proxies, browser rendering, and anti-bot bypassing automatically.

You can run complex extraction jobs—from raw HTML to structured JSON—and capture specific element screenshots directly in conversation. No need to manage headless browsers or worry about IP rotation; just talk to your agent.

What AI agents can do with Scrapfly Automation

Ai data extraction

Extracts structured data from a web page using AI models, converting the content into JSON.

Get api status

Retrieves high-level information about your Scrapfly account status.

List extraction models

Retrieves a list of available AI models used for data structuring and extraction.

+ 9 more capabilities included

Scrape raw web content

The agent pulls the full HTML source code from any specified website.

Extract structured JSON records

The agent uses AI models to read a complex webpage and output data in clean, usable JSON format.

Capture page screenshots

The agent takes images of full web pages or specific elements on the page.

Manage proxy locations

The agent connects to millions of residential proxies across 50+ countries for localized scraping.

Check API usage and status

The agent reads your account metrics, like credit consumption or project details.

Ask an AI about this

Included with Plan

Waiting for input…

AI Agent

What AI agents can do with Scrapfly MCP Server: 12 Tools for Web Data Extraction

Use these tools to scrape web data, extract structured records into JSON, take screenshots, and manage your scraping account metrics via natural language commands.

Make your AI actually useful.

Add this MCP to Claude, Cursor, or Windsurf and your AI stops guessing. It gets real tools to look things up, take action, and handle the stuff you keep doing by hand.

Start using Scrapfly on Vinkius

Ai Data Extraction

Extracts structured data from a web page using AI models, converting the content into JSON.

Get Api Status

Retrieves high-level information about your Scrapfly account status.

List Extraction Models

Retrieves a list of available AI models used for data structuring and extraction.

Get Project Details

Gets metadata and specific configuration details for a defined scraping project.

Get Scraping Capabilities

Checks which advanced scraping features, like proxy support or anti-bot bypass, are...

Get Screenshot Capabilities

Determines the specific screenshot functionality and parameters that can be used.

Check Credit Usage

Checks your current API credit consumption and usage statistics.

List Proxy Regions

Shows the geographical regions and types of proxies available for scraping.

List Api Webhooks

Lists all webhooks that have been configured on your Scrapfly account for...

Web Scrape

Scrapes the raw HTML content of any specified URL, handling complex rendering issues.

Capture Screenshot

Takes an image of a website or specific element on a webpage for visual...

Test Scrapfly Auth

Verifies that your provided API credentials are valid and active with Scrapfly.

Security and governance baked right in.

Pick your AI client below to get set up. Just create a Vinkius account, subscribe, and you're instantly up and running. We handle the entire backend infrastructure, delivering out-of-the-box support for HTTPS Streamable, SSE, and OAuth2—zero messy routing required.

Claude AI

Open Claude Settings

Go to claude.ai, click your profile icon, then navigate to Customize → Connectors.

Add Custom Connector

Click the "+" button and select Add custom connector. Paste your Vinkius endpoint URL:

https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp

Replace [YOUR_TOKEN_HERE] with your token from cloud.vinkius.com. For OAuth-protected servers, expand Advanced settings to add credentials.

Start a conversation

Open a new chat. The Scrapfly integration is available immediately — no restart needed.

Antigravity

Configure Agent Environment

Open your Antigravity agent's workspace configuration or mcp-servers.json file.

Bind the Endpoint

Add the Vinkius endpoint URL to your agent's MCP connections list:

"mcp_servers": {
  "scrapfly": {
    "serverUrl": "https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp"
  }
}

Provide your secure token in place of [YOUR_TOKEN_HERE] to ensure your agent requests are authenticated.

Execute

Start your Antigravity session. The agent will autonomously discover and utilize the Scrapfly tools with full Vinkius guardrails applied.

VS Code Copilot

⚡

One-Click Install (Recommended)

In your Vinkius Dashboard, simply click the Add to VS Code button for this server. We'll automatically configure your local workspace.

Or configure manually

Open MCP Settings

Open VS Code, press Ctrl/Cmd + Shift + P, and search for GitHub Copilot: MCP Servers.

Add Server Config

Add the Vinkius endpoint configuration to your mcp-servers.json file:

"scrapfly": {
  "url": "https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp"
}

Ensure you replace [YOUR_TOKEN_HERE] with your token from cloud.vinkius.com.

LangChain

Install Dependencies

Install the LangChain MCP adapters for your environment:

pip install langchain-mcp-adapters

Connect the Server

Use the SSEClient in LangChain to connect to the Vinkius managed endpoint:

from langchain_mcp_adapters.client import SSEClient

# Connect to Vinkius
client = SSEClient(url="https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp")
tools = client.get_tools()

CrewAI

Define the Tool

Load the Vinkius MCP tools into your CrewAI agents:

from crewai import Agent
from mcp_crewai import MCPTool

# Connect securely to Vinkius
vinkius_tools = MCPTool(url="https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp")

# Assign to Agent
researcher = Agent(
    role='Data Researcher',
    tools=vinkius_tools.get_all()
)

Execute Task

Run your CrewAI process. The agent will autonomously route tasks to the Vinkius managed server.

Choose How to Get Started

Build a custom MCP for your own tools, or connect a ready-made integration from our catalog.

Build Your Own

Turn any API into an MCP. Import a spec, define Agent Skills, or deploy with MCPFusion.

Import from OpenAPI, Swagger, or YAML specs
Create Agent Skills with progressive disclosure
Deploy to edge with MCPFusion framework
Built in DLP, auth, and compliance on every call
Real time usage dashboard and cost metering
Publish to catalog or keep private

Start building

Make Your AI Do More

Start with Scrapfly, then connect any of our 5,100+ other servers whenever your AI needs more. One click, no limits.

Use this MCP plus 5,100+ others, all in one place
Add new capabilities to your AI anytime you want
Every connection is secured and compliant automatically
Track usage and costs across all your servers
Works with Claude, ChatGPT, Cursor, and more
New servers added to the catalog every week

Independent Platform Disclaimer: Vinkius is an independent platform and is not affiliated with, endorsed by, sponsored by, verified by, or otherwise authorized by Scrapfly. All third-party trademarks, logos, and brand names are the property of their respective owners. Their use on this website is strictly for informational purposes to identify service compatibility and interoperability.

VINKIUS INFRASTRUCTURE

Cloud Hosted

Managed infra

V8 Isolated

Sandboxed per request

Zero-Trust Proxy

No stored credentials

DLP Enforced

Policy on every call

GDPR Compliant

EU data residency

Token Compression

~60% cost reduction

Your data is protected. See how we built it.

Built on the Model Context Protocol (MCP) for Claude, ChatGPT, Cursor, and more

The Model Context Protocol standardizes how applications expose capabilities to LLMs. Instead of operating in isolation, your AI gains direct access to external platforms, live data, and real-world actions through secure, standardized connections.

This connection provides 12 powerful capabilities that interface natively with Claude, ChatGPT, Cursor, and other compatible AI platforms. No middleware. No custom integration required.

Web scraping shouldn't require managing proxies or debugging headless browsers., Solved with Vinkius AI Gateway

Today, getting clean data from a competitor's site means logging into their developer tools, checking if they block requests, manually cycling through IPs, and dealing with inconsistent HTML—all before you even start extracting the price points. It’s tedious, error-prone work.

With Scrapfly MCP Server, your AI agent handles all that tech complexity. You just tell it what to scrape; the system uses residential proxies and headless rendering under the hood. The result isn't raw HTML; it's clean JSON output ready for a database.

Scrapfly MCP Server: Get structured data & visual proof.

Previously, if you needed to prove how a competitor presented a feature, you had to run the site, take multiple screenshots on your local machine, and manually stitch them together. Then you'd have to write up what each image meant.

Now, you ask the agent for an audit. It takes the screenshot (`capture_screenshot`), it extracts specific elements (like feature names), and it compiles everything into a single, actionable report in your chat window.

Support 24/7 support@vinkius.com ↗

Security Vinkius Trust Center ↗

SLA Service Level Agreement ↗

Report Listing Send Report ↗

scrapfly

web-scraping

data-extraction

anti-bot-bypass

residential-proxies

ai-extraction

js-rendering

screenshots-api

mcp

What your AI can actually do with this

Scrapfly lets your agent scrape web data at scale through a managed API connection. It handles proxies, browser rendering, and anti-bot bypassing automatically. You can run complex extraction jobs—from raw HTML to structured JSON—and capture specific element screenshots directly in conversation. No need to manage headless browsers or worry about IP rotation; just talk to your agent.

Data Collection & Extraction

The web_scrape tool pulls the full HTML source code from any site you point it at, handling rendering issues so you get clean raw content every time. When you use ai_data_extraction, the system runs AI models over that web page and gives you structured JSON data—it’s ready to drop right into a database.

The capture_screenshot tool takes an image of either the whole website or just one specific element, giving you visual proof right in your chat thread. You can check what kind of screenshots are possible with get_screenshot_capabilities, and if you need help figuring out which models work best for data structuring, run list_extraction_models to see them all.

Advanced Scraping & Geo-Targeting

You don't gotta worry about IP rotation or bot detection; the platform manages that automatically. You can check what advanced features are available by running get_scraping_capabilities, which confirms things like proxy support and anti-bot bypass functionality. The agent connects to millions of residential proxies spread across 50+ countries, so you can manage data collection for specific regions; run list_proxy_regions to see the exact areas and proxy types available.

When you're ready to pull raw HTML or structured data, you’ll know exactly what’s going on with web_scrape and ai_data_extraction. The system uses sophisticated rendering to scrape content even from sites protected by major bot mitigation systems.

Account Monitoring & Configuration

You've got tools to keep tabs on the whole operation. To see if your API credentials are valid, run test_scrapfly_auth immediately; it verifies everything’s active. You can get an overall snapshot of your account health with get_api_status, and for deeper details on a specific job, use get_project_details. If you're tracking costs, check_credit_usage tells you exactly how many API credits you've burned through.

For setup management, the system lets you list all configured webhooks using list_api_webhooks, so your agent can notify you when data is ready. You also get a clear view of what scraping capabilities are available with get_scraping_capabilities.

How it Works in Practice

Your AI client coordinates everything. First, you tell the agent to scrape a URL using web_scrape. If you need structured data from that raw HTML, you trigger ai_data_extraction, which outputs usable JSON. You can also ask the agent to capture an image of the results or the page itself via capture_screenshot.

To ensure your scraping stays localized and realistic, the system uses proxies managed through methods confirmed by list_proxy_regions. When you're done collecting data for a project, you run get_project_details to review all the metadata and configuration used. You never have to manage headless browsers or worry about IP changes; just talk to your agent.

Built · Hosted · Managed by Vinkius Scrapfly MCP Server - Web Scraping & Data Extraction

Server ID 019dd154-c682-73f6-bc90-45c1087b1146

Vinkius Inspector

Compliance Grade A+

Score 100/100

Report View Report ↗

What Changes When You Connect

Bypass anti-bot systems and Cloudflare blocks. The web_scrape tool handles programmatic retrieval of clean HTML, even when sites are protected.

Stop cleaning spreadsheets manually. Use ai_data_extraction to turn complex web pages into structured JSON records, ready for immediate use.

Get visual confirmation every time. You can take element-specific or full-page screenshots using capture_screenshot, which is perfect for audits.

Stay localized and reliable. The system accesses millions of residential proxies via tools like list_proxy_regions across 50+ countries, guaranteeing regional data capture.

Keep operations clean. Use API calls like check_credit_usage to monitor your spending and manage project metadata directly through the agent.

See it in action

01 01

Monitoring Competitor Pricing

The Market Researcher needs competitor pricing from three different regional sites. They ask their agent to run web_scrape across all URLs, ensuring they use proxies listed by list_proxy_regions. The agent pulls the raw HTML, then uses ai_data_extraction to isolate and standardize the price points into a single JSON file.

02 02

Building Visual Audits

The Growth Engineer needs to compare two competitor's checkout flows. They ask the agent to take full-page screenshots (capture_screenshot) of the critical steps, noting any missing elements or dark mode issues, creating an immediate visual audit report.

03 03

Extracting Data from JS-Heavy Portals

The Data Scientist hits a portal that only loads data via JavaScript. Instead of failing, they ask the agent to run web_scrape, which uses headless rendering. The agent successfully gets the dynamic content and then pipes it into ai_data_extraction for clean JSON output.

04 04

Checking API Health Before a Run

Before running a massive data job, the operations team member asks the agent to run get_api_status. This immediately confirms credentials are good and checks available credits using check_credit_usage, preventing costly failures mid-job.

The honest tradeoffs

Treating web scraping like a simple API call

Anti-pattern

Trying to scrape content by just hitting the main URL and expecting perfect JSON. You get raw HTML, which is messy and filled with scripts.

The Fix

Always start with web_scrape for the full source code. Then, pipe that output into ai_data_extraction. This separates getting the data from structuring it.

Ignoring geo-restrictions

Anti-pattern

Running a single scraper job without telling the system where to scrape from. You get blocked by Cloudflare immediately.

The Fix

First, check list_proxy_regions to see your options. Then, use web_scrape with a specific proxy configuration for reliable, localized data.

Failing when the site is dynamic

Anti-pattern

The scraper fails because the content only loads after JavaScript runs. You're stuck with static HTML that doesn't contain the pricing data.

The Fix

Rely on the built-in headless browser rendering capability of web_scrape. This forces the system to wait for all dynamic JS content to load before scraping.

Questions you might have

How does Scrapfly MCP Server handle Cloudflare anti-bot bypass? +

The server handles this automatically. When you use the web_scrape tool, it runs sophisticated proxy and rendering techniques to ensure your request gets through the bot protection layers.

Can I just scrape raw HTML with Scrapfly MCP Server? +

Yes, you can. Use web_scrape if you need the full source code. But remember, for usable data, follow up by using ai_data_extraction to structure it.

What tools do I use to check my usage? +

You'll use check_credit_usage. This tool lets your agent read your current consumption stats, so you always know how much API credit is left for your job.

Does Scrapfly MCP Server support multiple countries for proxies? +

Yep. The system supports millions of residential proxies across 50+ countries. You can check the available locations using list_proxy_regions before starting a geo-specific scrape.

When I use `ai_data_extraction`, can it handle complex web layouts to generate structured JSON? +

Yep, it transforms complicated page content into clean, machine-readable JSON based on your prompts. You define the desired data schema (like a list of objects), and the AI fills in the blanks automatically.

Can I use `capture_screenshot` to focus on specific elements instead of capturing the whole page? +

Yes, you can provide CSS selectors or element IDs when calling capture_screenshot. This lets you pinpoint and capture only small sections of a webpage without wasting space on surrounding junk.

What is the best way to verify my API key setup before running a large scrape with Scrapfly? +

Run the test_scrapfly_auth tool. This confirms your connection status and validates that your API key is properly linked to your account, saving you credits on failed jobs.

How much history does the `get_project_details` tool provide for my scraping work? +

It gives a full overview of your project's run parameters and status. You can check total credits consumed, the last successful run date, and any stored metadata used during the job.

Can my AI automatically extract structured JSON from a web page using Scrapfly? +

Yes! Use the ai_data_extraction tool. Provide the URL and optionally a model or prompt, and your agent will return the parsed data in structured JSON format instantly.

How do I use residential proxies to bypass anti-bot systems? +

Simply ask the agent to run the web_scrape action. Scrapfly handles anti-bot (ASP) and premium proxy rotation automatically based on the site's security level.

How do I find my Scrapfly API Key? +

Log in to your Scrapfly account, navigate to the Dashboard, and you will find your unique secret API key prominently displayed.

How this MCP server connects to your AI agent

What AI agents can do with Scrapfly Automation

Ai data extraction

Get api status

List extraction models

What AI agents can do with Scrapfly MCP Server: 12 Tools for Web Data Extraction

Ai Data Extraction

Extracts structured data from a web page using AI models, converting the content into JSON.

Get Api Status

Retrieves high-level information about your Scrapfly account status.

List Extraction Models

Retrieves a list of available AI models used for data structuring and extraction.

Get Project Details

Gets metadata and specific configuration details for a defined scraping project.

Get Scraping Capabilities

Checks which advanced scraping features, like proxy support or anti-bot bypass, are...

Get Screenshot Capabilities

Determines the specific screenshot functionality and parameters that can be used.

Check Credit Usage

Checks your current API credit consumption and usage statistics.

List Proxy Regions

Shows the geographical regions and types of proxies available for scraping.

List Api Webhooks

Lists all webhooks that have been configured on your Scrapfly account for...

Web Scrape

Scrapes the raw HTML content of any specified URL, handling complex rendering issues.

Capture Screenshot

Takes an image of a website or specific element on a webpage for visual...

Test Scrapfly Auth

Verifies that your provided API credentials are valid and active with Scrapfly.

Security and governance baked right in.

Claude AI

Open Claude Settings

Add Custom Connector

Start a conversation

Claude Code

Open your terminal

Add the MCP Server

Start coding

Cursor

One-Click Install (Recommended)

Open Cursor Settings

Add New Server

Use in Composer

Antigravity

Configure Agent Environment

Bind the Endpoint

Execute

VS Code Copilot

One-Click Install (Recommended)

Open MCP Settings

Add Server Config

Windsurf

One-Click Install (Recommended)

Open Windsurf Settings

Add Server Endpoint

LangChain

Install Dependencies

Connect the Server

CrewAI

Define the Tool

Execute Task

Choose How to Get Started

Build Your Own

Make Your AI Do More

Built on the Model Context Protocol (MCP) for Claude, ChatGPT, Cursor, and more

Web scraping shouldn't require managing proxies or debugging headless browsers., Solved with Vinkius AI Gateway

Scrapfly MCP Server: Get structured data & visual proof.

What your AI can actually do with this

Data Collection & Extraction

Advanced Scraping & Geo-Targeting

Account Monitoring & Configuration

How it Works in Practice

Here's how it actually works

Who is this actually for?

What Changes When You Connect

Monitoring Competitor Pricing

Building Visual Audits

Extracting Data from JS-Heavy Portals

Checking API Health Before a Run