Web Scraper MCP. Give your AI agent live web access and clean data extraction.

Q: Can I compare articles from different websites at once?

Yes, use batchread. This tool fetches multiple URLs in parallel, allowing your agent to process and compare the content of up to ten sources simultaneously. It's ideal for comparative analysis.

Q: How do I find all links on a page?

Use the listlinks tool. It systematically pulls every single outbound hyperlink from the web page without needing to download or parse the full body content, giving you just the list.

Web Scraper is an MCP that gives your AI agent direct read access to live web pages. It lets your agent pull clean, usable text from any URL, stripping away ads and site clutter. You can also extract structured metadata like titles and links, or crawl entire documentation sites up to ten pages deep.

Claude

ChatGPT

Cursor

Gemini

Windsurf

VS Code

JetBrains

Vercel

See Vinkius in Action

Give Claude and any AI agent real-world access

Clean Article Reading

Your agent strips away ads and site navigation from any webpage, returning only the main article content as clean Markdown.

Metadata Collection

The tool extracts structured data like SEO titles, descriptions, canonical links, and all outbound hyperlinks without downloading the page body.

Site Deep Crawling

Your agent automatically navigates a starting URL, crawling up to ten pages deep to map out an entire documentation site or wiki.

Bulk Data Fetching

You can process multiple web sources at once, fetching and comparing content from up to ten different URLs in parallel.

Ask an AI about this

Waiting for input…

AI Agent

What AI agents can do with Web Scraper with 5 Tools

These tools let you pull content, links, structured data, and maps of entire websites using your AI client.

Make your AI actually useful.

Add this MCP to Claude, Cursor, or Windsurf and your AI stops guessing. It gets real tools to look things up, take action, and handle the stuff you keep doing by hand.

Start using Web Scraper MCP

Read

Pulls any public webpage into clean Markdown format, stripping away ads and clutter for readable content.

Extract

Gathers structured metadata from a page, pulling out the title, description, OG...

List Links

Pulls every single outbound hyperlink found across an entire web page's source code.

Batch Read

Fetches and processes content from up to ten different URLs simultaneously for...

Crawl

Automatically crawls a website starting at a given URL, mapping out the content of...

Security and governance baked right in.

Pick your AI client below to get set up. Just create a Vinkius account, subscribe, and you're instantly up and running. We handle the entire backend infrastructure, delivering out-of-the-box support for HTTPS Streamable, SSE, and OAuth2—zero messy routing required.

Web Scraper MCP is compatible with Claude

Claude AI

Open Claude Settings

Go to claude.ai, click your profile icon, then navigate to Customize → Connectors.

Add Custom Connector

Click the "+" button and select Add custom connector. Paste your Vinkius endpoint URL:

https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp

Replace [YOUR_TOKEN_HERE] with your token from cloud.vinkius.com. For OAuth-protected servers, expand Advanced settings to add credentials.

Start a conversation

Open a new chat. The Web Scraper integration is available immediately — no restart needed.

Antigravity

Configure Agent Environment

Open your Antigravity agent's workspace configuration or mcp-servers.json file.

Bind the Endpoint

Add the Vinkius endpoint URL to your agent's MCP connections list:

"mcp_servers": {
  "web-scraper": {
    "serverUrl": "https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp"
  }
}

Provide your secure token in place of [YOUR_TOKEN_HERE] to ensure your agent requests are authenticated.

Execute

Start your Antigravity session. The agent will autonomously discover and utilize the Web Scraper tools with full Vinkius guardrails applied.

Web Scraper MCP is compatible with VS Code

VS Code Copilot

⚡

One-Click Install (Recommended)

In your Vinkius Dashboard, simply click the Add to VS Code button for this server. We'll automatically configure your local workspace.

Or configure manually

Open MCP Settings

Open VS Code, press Ctrl/Cmd + Shift + P, and search for GitHub Copilot: MCP Servers.

Add Server Config

Add the Vinkius endpoint configuration to your mcp-servers.json file:

"web-scraper": {
  "url": "https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp"
}

Ensure you replace [YOUR_TOKEN_HERE] with your token from cloud.vinkius.com.

LangChain

Install Dependencies

Install the LangChain MCP adapters for your environment:

pip install langchain-mcp-adapters

Connect the Server

Use the SSEClient in LangChain to connect to the Vinkius managed endpoint:

from langchain_mcp_adapters.client import SSEClient

# Connect to Vinkius
client = SSEClient(url="https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp")
tools = client.get_tools()

CrewAI

Define the Tool

Load the Vinkius MCP tools into your CrewAI agents:

from crewai import Agent
from mcp_crewai import MCPTool

# Connect securely to Vinkius
vinkius_tools = MCPTool(url="https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp")

# Assign to Agent
researcher = Agent(
    role='Data Researcher',
    tools=vinkius_tools.get_all()
)

Execute Task

Run your CrewAI process. The agent will autonomously route tasks to the Vinkius managed server.

Choose How to Get Started

Build a custom MCP for your own tools, or connect a ready-made integration from our catalog.

Build Your Own

Turn any API into an MCP. Import a spec, define Agent Skills, or deploy with MCPFusion.

Import from OpenAPI, Swagger, or YAML specs
Create Agent Skills with progressive disclosure
Deploy to edge with MCPFusion framework
Built in DLP, auth, and compliance on each call
Real time usage dashboard and cost metering
Publish to catalog or keep private

Start building

Make Your AI Do More

Start with Web Scraper, then connect any of our 5,200+ other servers whenever your AI needs more. One click, no limits.

Use this MCP plus 5,200+ others, all in one place
Add new capabilities to your AI anytime you want
Connections are secured and governed automatically
Track usage and costs across all your servers
Works with Claude, ChatGPT, Cursor, and more
New servers added to the catalog weekly

Independent Platform Disclaimer: Vinkius is an independent platform and is not affiliated with, endorsed by, sponsored by, verified by, or otherwise authorized by Web Scraper. All third-party trademarks, logos, and brand names are the property of their respective owners. Their use on this website is strictly for informational purposes to identify service compatibility and interoperability.

VINKIUS CLOUD

Cloud Hosted

Managed infra

V8 Isolated

Sandboxed per request

Zero-Trust Proxy

No stored credentials

DLP Enforced

Policy on each call

GDPR Compliant

EU data residency

Token Compression

~60% cost reduction

Your data is protected. See how we built it.

Finding facts on the live web used to be a mess of tabs.

Before connecting your agent to a web scraper MCP, gathering information meant opening ten different browser tabs. You'd copy text from one article, paste it into a spreadsheet, and then manually open another site just to check the metadata or see if there were links pointing elsewhere. It was tedious clicking and constant context switching.

Now, you tell your agent which URL to read. The MCP handles all that legwork—it strips away the ads, the sticky footers, and the navigation bars. You get clean Markdown text immediately, letting your agent work on pure, actionable data without the junk.

Web Scraper MCP delivers real-time web data directly to your workflow.

The biggest time sink used to be manually gathering multiple sources for comparison. You'd have to visit URL A, then copy the summary; open URL B, and repeat the process. This made synthesizing complex reports slow and error-prone.

With this MCP, you simply tell your agent to use `batch_read`. It pulls both sites simultaneously. Your agent compares the two sources—the 'React' article versus the 'HTML' article—and delivers a single, synthesized answer. The data is always current.

Support 24/7 support@vinkius.com ↗

Security Vinkius Trust Center ↗

SLA Service Level Agreement ↗

Report Listing Send Report ↗

web-crawling

markdown-conversion

data-extraction

reader-view

content-parsing

url-fetching

What Web Scraper MCP does for your AI

Stop letting your agent guess facts. This MCP connects your AI client directly to the public internet, giving it a real-time source of information. Instead of hallucinating, your agent reads live articles, parses complex technical documentation, and pulls clean text from any link you provide. You can convert cluttered webpages into pristine Markdown using Mozilla Readability logic.

Need to compare sources? Use the batch reading feature to pull up to ten different URLs simultaneously. For developers, this means pointing your agent at a new library's API docs and having it write code based on the absolute latest syntax. The whole catalog of tools is hosted and managed by Vinkius, so you connect once and get access to all web data capabilities.

Built · Hosted · Managed by Vinkius Web Scraper MCP - Extract Live Web Content & Metadata

Server ID 019d7604-5402-7173-a44b-24cf5d07da40

Vinkius Inspector

Compliance Grade A+

Score 98.33/100

Report View Report ↗

Benefits of connecting Web Scraper MCP

Instead of relying on stale training data, you let the agent read real-time articles. This eliminates factual hallucinations entirely.

The read tool converts any messy website into pristine Markdown. You get readable content instantly, perfect for documentation or blog posts.

Need to audit a site? Use the extract tool to pull only the metadata—titles, descriptions, and OG tags—without downloading the whole page body.

Comparing sources is easy with batch_read. You can feed up to ten URLs at once, allowing your agent to compare concepts or summarize multiple articles in one go.

For deep research, use the crawl tool. Give it a single documentation hub link and let your agent map out every related page automatically.

Web Scraper MCP use cases

01 01

Comparing two product architectures

A developer needs to know if 'React-first' or 'HTML-first' is better for their client. They ask their agent to run batch_read on both competing articles, allowing the AI to compare them side-by-side and give a definitive recommendation.

02 02

Auditing competitor websites

An SEO specialist uses extract on five competitor sites. The agent quickly pulls all metadata—titles, descriptions, canonical tags—enabling the specialist to identify weak spots in their own site's optimization.

03 03

Researching a niche topic

A researcher drops 15 links related to quantum computing. They ask the agent to use read on each, and then summarize the entire collection of clean Markdown text into one coherent report.

04 04

Mapping out an old wiki

An internal team uses crawl on their company's legacy documentation hub. The agent maps every related page up to ten deep, giving the team a complete structure map before migrating the content.

Web Scraper MCP tradeoffs

What to watch out for, and the recommended way to handle each one.

Using the AI for general knowledge

Avoid

Asking your agent, 'What is the current best practice for modern web design?' and getting an answer based on data from 2021.

Instead

Always use this MCP. Point your agent to a specific documentation link and ask it to read or extract the content there. This guarantees you're using real-time information.

Trying to get metadata piecemeal

Avoid

Asking for titles, then separately asking for links, resulting in two disconnected steps and multiple API calls.

Instead

Use the extract tool first. It pulls all the structured data—titles, descriptions, and link counts—in one single, efficient request.

Forgetting about site depth

Avoid

Asking the agent to summarize a massive wiki but only getting content from the landing page.

Instead

Use the crawl tool. This tells your agent not just to read the starting URL, but to automatically map and process related pages up to 10 levels deep.

When to use Web Scraper MCP

Use this MCP if you need verifiable information from a live web source. If the core of your task is reading articles, pulling structured metadata (titles/links), or mapping out documentation structures, this tool is what you want. Don't use it if you just need to process data that already exists within a private database—use a dedicated database connector instead. Also, don't rely on it for general knowledge; always provide the target URL first. If you only need to pull links from one page and nothing else, list_links is sufficient. But if you need both the clean text and the links, you need this MCP.

Frequently asked questions about Web Scraper MCP

How does the Web Scraper MCP handle complex documentation sites? +

It uses the crawl tool to map out entire documentation hubs. You give it the starting URL, and your agent automatically navigates up to ten related pages so you don't miss any linked content.

Can I compare articles from different websites at once? +

Yes, use batch_read. This tool fetches multiple URLs in parallel, allowing your agent to process and compare the content of up to ten sources simultaneously. It's ideal for comparative analysis.

Do I need any special keys or authentication to use Web Scraper? +

No. You don't need API keys or any specific credentials. Once you subscribe to this MCP, you just paste the link into your chat and tell your agent what task it needs to perform.

Is the content from the Web Scraper always clean? +

Yes. The primary reading tool converts messy webpages using Mozilla Readability logic, which strips out boilerplate code, ads, and navigation bars so you only get pristine text.

How do I find all links on a page? +

Use the list_links tool. It systematically pulls every single outbound hyperlink from the web page without needing to download or parse the full body content, giving you just the list.

Give Claude and any AI agent real-world access

What AI agents can do with Web Scraper with 5 Tools

Read

Pulls any public webpage into clean Markdown format, stripping away ads and clutter for readable content.

Extract

Gathers structured metadata from a page, pulling out the title, description, OG...

List Links

Pulls every single outbound hyperlink found across an entire web page's source code.

Batch Read

Fetches and processes content from up to ten different URLs simultaneously for...

Crawl

Automatically crawls a website starting at a given URL, mapping out the content of...

Security and governance baked right in.

Claude AI

Open Claude Settings

Add Custom Connector

Start a conversation

Claude Code

Open your terminal

Add the MCP Server

Start coding

Cursor

One-Click Install (Recommended)

Open Cursor Settings

Add New Server

Use in Composer

Antigravity

Configure Agent Environment

Bind the Endpoint

Execute

VS Code Copilot

One-Click Install (Recommended)

Open MCP Settings

Add Server Config

Windsurf

One-Click Install (Recommended)

Open Windsurf Settings

Add Server Endpoint

LangChain

Install Dependencies

Connect the Server

CrewAI

Define the Tool

Execute Task

Choose How to Get Started

Build Your Own

Make Your AI Do More

Finding facts on the live web used to be a mess of tabs.

Web Scraper MCP delivers real-time web data directly to your workflow.

web-crawling

markdown-conversion

data-extraction

reader-view

content-parsing

url-fetching

What Web Scraper MCP does for your AI

How to set up Web Scraper MCP

Who uses Web Scraper MCP

Benefits of connecting Web Scraper MCP

Web Scraper MCP use cases

Comparing two product architectures

Auditing competitor websites

Researching a niche topic

Mapping out an old wiki

Web Scraper MCP tradeoffs

Using the AI for general knowledge

Trying to get metadata piecemeal

Forgetting about site depth

When to use Web Scraper MCP

Frequently asked questions about Web Scraper MCP