Firecrawl MCP. Scrape and crawl websites into clean Markdown data.

Q: How do I use Firecrawl with my AI client to scrape a single page?

You use the scrapeurl tool. Just give your agent the URL you want. It handles the JavaScript rendering and outputs clean Markdown, making the content ready for analysis right away.

Q: Can Firecrawl crawl an entire website recursively?

Yes. You use the startcrawl tool. It initiates a background job and returns a job ID. You then use getcrawlstatus to monitor the progress until the entire site is indexed.

Q: What is the difference between mapwebsitestructure and startcrawl?

mapwebsitestructure only discovers links; it doesn't download content. startcrawl executes the crawl and downloads the actual page content. Use mapping to plan, and crawling to execute.

Q: How do I check if my Firecrawl API usage is over my limit?

Run the getapiusage tool. This tells you your remaining credits and helps you manage your budget before you run out of data extraction capacity.

Q: How do I manage an ongoing crawl using the cancelactivecrawl tool?

You call cancelactivecrawl with the job ID. This immediately stops the crawl job, preventing further processing and saving credits. You can't restart a canceled job; you'll need to initiate a new crawl.

Q: What is the difference between scrapeurl and mapwebsitestructure?

scrapeurl converts a single URL into clean, Markdown-ready content. mapwebsitestructure simply discovers all reachable links on a site without extracting any content. Use mapwebsitestructure first if you only need a site map.

Q: How do I monitor my job progress after running startcrawl?

You use getcrawlstatus with the job ID returned by startcrawl. This tells you if the crawl is running, paused, or complete. It's the only way to track the real-time status of a background job.

Claude

ChatGPT

Cursor

Gemini

Windsurf

VS Code

JetBrains

Vercel

See Vinkius in Action

Works with every AI agent you already use

…and any MCP-compatible client

Just plug in your AI agents and start using Vinkius.

Firecrawl. Scrapes and crawls entire websites into clean, structured markdown using a single API call. It handles JavaScript rendering, automatically excludes boilerplate content like headers and footers, and allows your AI agent to programmatically discover and ingest full knowledge bases from any root URL.

What your AI agents can do

Cancel active crawl

Stops a crawl job that is currently running using a provided job ID.

Get api usage

Checks and reports your current Firecrawl credit usage and remaining limits.

Get crawl status

Retrieves the current status (running, completed, failed) for a specific crawl job ID.

+ 3 more capabilities included

Scrape a single URL to Markdown

The agent sends a URL, and the server returns the content as clean, structured markdown, automatically cleaning up boilerplate elements.

Map a website's internal structure

The agent inputs a root domain, and the server lists every reachable link on that site without downloading the content.

Recursively crawl a site

The agent inputs a root URL and a depth limit, and the server initiates a background job that systematically discovers and scrapes all linked pages.

Check the status of a crawl job

The agent sends a job ID, and the server returns the current status of the crawl, letting you know if it's running, finished, or failed.

Get current API usage

The agent asks for usage details, and the server returns your remaining Firecrawl credits and usage history.

Stop a running crawl

The agent provides an active job ID, and the server sends a signal to terminate the crawling process immediately.

Ask AI about this MCP

Ask ChatGPT

Ask Claude

Ask Perplexity

Supported MCP Clients

OAuth 2.0 Compatible

Claude

ChatGPT

Cursor

Gemini

VS Code

JetBrains

Vercel

Zendesk

+ other MCP clients

Free for Subscribers

Waiting for input…

AI Agent

Firecrawl MCP Server: 6 Tools for Web Data Extraction

These six tools let your AI agent manage the full lifecycle of web data: from mapping site structures to scraping individual URLs and monitoring job status.

Make your AI actually useful.

Add this MCP to Claude, Cursor, or Windsurf and your AI stops guessing. It gets real tools to look things up, take action, and handle the stuff you keep doing by hand.

Start using Firecrawl on Vinkius

cancel019dd0f0

cancel active crawl

Stops a crawl job that is currently running using a provided job ID.

get019dd0f0

get api usage

Checks and reports your current Firecrawl credit usage and remaining limits.

get019dd0f0

get crawl status

Retrieves the current status (running, completed, failed) for a specific crawl job ID.

map019dd0f0

map website structure

Discovers and lists all accessible URLs on a domain without downloading the content of the pages.

scrape019dd0f0

scrape url

Converts the content of a single specified URL into clean, structured markdown format.

start019dd0f0

start crawl

Initiates a recursive crawling job on a root URL, returning a job ID for tracking.

Choose How to Get Started

Build a custom MCP for your own tools, or connect a ready-made integration from our catalog.

Build Your Own

Turn any API into an MCP. Import a spec, define Agent Skills, or deploy with MCPFusion.

Import from OpenAPI, Swagger, or YAML specs
Create Agent Skills with progressive disclosure
Deploy to edge with MCPFusion framework
Built in DLP, auth, and compliance on every call
Real time usage dashboard and cost metering
Publish to catalog or keep private

Start building

Make Your AI Do More

Start with Firecrawl, then connect any of our 4,800+ other servers whenever your AI needs more. One click, no limits.

Use this MCP plus 4,800+ others, all in one place
Add new capabilities to your AI anytime you want
Every connection is secured and compliant automatically
Track usage and costs across all your servers
Works with Claude, ChatGPT, Cursor, and more
New servers added to the catalog every week

Independent Platform Disclaimer: Vinkius is an independent platform and is not affiliated with, endorsed by, sponsored by, verified by, or otherwise authorized by Firecrawl. All third-party trademarks, logos, and brand names are the property of their respective owners. Their use on this website is strictly for informational purposes to identify service compatibility and interoperability.

VINKIUS INFRASTRUCTURE

Cloud Hosted

Managed infra

V8 Isolated

Sandboxed per request

Zero-Trust Proxy

No stored credentials

DLP Enforced

Policy on every call

GDPR Compliant

EU data residency

Token Compression

~60% cost reduction

Your data is protected. See how we built it.

Works with Claude, ChatGPT, Cursor, and more

The Model Context Protocol standardizes how applications expose capabilities to LLMs. Instead of operating in isolation, your AI gains direct access to external platforms, live data, and real-world actions through secure, standardized connections.

This server provides 6 capabilities that interface natively with Claude, ChatGPT, Cursor, and any MCP client. No middleware. No custom integration required.

Manually gathering web data is a nightmare of copy-pasting.

Today, you open a website, copy the text, paste it into Notion, and then manually cut and paste the relevant sections into your research document. You spend minutes filtering out the boilerplate—the headers, the footers, the related articles widgets—just to get to the actual content. It's slow, tedious, and you always miss something.

With Firecrawl, you just point your agent at the URL. It handles the whole process, automatically stripping out the junk content and giving you the core, clean text you need. It's done in a single conversation turn.

Firecrawl MCP Server: Extract clean content and structure.

You no longer have to switch between a browser, a text editor, and a spreadsheet. The agent handles the entire data pipeline: first, it uses `map_website_structure` to see the site map, then `start_crawl` to get the content, and finally, it presents the structured data directly back to your chat window.

The difference is the workflow. You don't manage the crawl, the state, or the API keys. Your AI client manages it all. You just ask the question.

Support 24/7 support@vinkius.com ↗

Security Vinkius Trust Center ↗

SLA Service Level Agreement ↗

Report Listing Send Report ↗

What you can do with this MCP connector

Firecrawl lets your AI agent scrape and crawl entire websites, spitting out clean, structured markdown. You'll use this server to ingest full knowledge bases from any root URL, and it handles JavaScript rendering and stripping out all the boilerplate junk like headers and footers.

To grab a single URL, your agent sends the address and gets the content back as structured markdown, automatically cleaning up the fluff. You can also map a site's internal structure by giving it a root domain; the server lists every reachable link without actually downloading the content. If you wanna crawl a whole site, your agent inputs a root URL and a depth limit, and the server starts a background job that systematically discovers and scrapes all linked pages.

When you need to track a crawl job, your agent sends a job ID, and the server returns the current status: running, finished, or failed. You can also check your API usage by asking for details, and the server spits out your remaining Firecrawl credits and usage history. If a crawl job gets out of hand, your agent can provide an active job ID, and the server sends a signal to terminate the process immediately.

Built · Hosted · Managed by Vinkius Firecrawl - Scrape Websites into Markdown with AI Server ID 019dd0f1-1cfd-731d-bbff-c84b7b8f732b

Vinkius Inspector

Compliance Grade A+

Score 100/100

Report View Report ↗

What Changes When You Connect

Structured Data Output: Forget messy HTML. scrape_url converts complex web pages into clean, structured Markdown, making the data instantly usable by your LLM.
Deep Site Discovery: Need more than just one page? start_crawl handles recursive crawling, systematically finding and scraping every linked subpage to build a full knowledge base.
Planning Before Execution: Don't guess what's on the site. Use map_website_structure first to get a full list of reachable URLs, letting you plan your data gathering before spending credits.
Full Process Visibility: Use get_crawl_status and get_api_usage to monitor jobs in real-time. You always know if the crawl is running, finished, or if you're running low on credits.
Visual and Textual Record: Capture a full-page screenshot alongside the structured text using the agent's visual capture capability. You get both the raw visual context and the clean text data.
Stop and Control: If a crawl goes wrong or you change your mind, cancel_active_crawl lets you shut down the job immediately.

Real-World Use Cases

Building a RAG Knowledge Base

A developer needs to index all documentation from a vendor site. Instead of manually scraping 50 pages, they ask their agent to use map_website_structure first, then start_crawl on the root URL. The resulting job ID feeds into their RAG pipeline, providing a complete knowledge base.

Competitive Analysis

A market researcher needs to track competitor pricing across 10 product pages. They use scrape_url on each specific URL, ensuring that every piece of extracted content is clean Markdown, allowing for easy comparison and structured data analysis.

Archiving an Article

A content creator finds a great article and needs to save it for later. They ask their agent to use scrape_url, which extracts the main text and automatically strips out the distracting sidebars and ads. They also capture a full-page screenshot for context.

Auditing a Website's Links

A site auditor needs to know every possible internal link on a large corporate site. They use map_website_structure to get a complete list of all reachable URLs, without wasting time or credits on content extraction.

The Tradeoffs

Scraping everything at once

Calling start_crawl on a massive, unknown site without checking the scope or depth limit. This risks hitting rate limits, wasting credits, and generating a massive, unmanageable data dump.

→ First, use map_website_structure to scope the site. Then, use start_crawl with a defined depth. If you only need content from one specific page, bypass the crawl and use scrape_url instead.

Relying on raw HTML output

Taking the raw output from a simple scraper and feeding it to the LLM. The LLM has to filter out headers, footers, and navigation menus, which introduces noise and weakens the context.

→ Always use scrape_url. It guarantees the output is clean, structured Markdown, focusing only on the core content needed for the LLM.

Ignoring job status

Initiating a crawl with start_crawl and then forgetting to check the progress. You don't know if the job is still running, stuck, or if it failed due to an internal error.

→ After calling start_crawl, immediately use get_crawl_status with the returned job ID. This confirms the job started and lets you track it until completion.

Common Questions About Firecrawl MCP

How do I use Firecrawl with my AI client to scrape a single page? +

You use the scrape_url tool. Just give your agent the URL you want. It handles the JavaScript rendering and outputs clean Markdown, making the content ready for analysis right away.

Can Firecrawl crawl an entire website recursively? +

Yes. You use the start_crawl tool. It initiates a background job and returns a job ID. You then use get_crawl_status to monitor the progress until the entire site is indexed.

What is the difference between `map_website_structure` and `start_crawl`? +

map_website_structure only discovers links; it doesn't download content. start_crawl executes the crawl and downloads the actual page content. Use mapping to plan, and crawling to execute.

How do I check if my Firecrawl API usage is over my limit? +

Run the get_api_usage tool. This tells you your remaining credits and helps you manage your budget before you run out of data extraction capacity.

Can Firecrawl capture screenshots while crawling? +

Yes, the agent can capture full-page screenshots of any URL. This adds a visual record to your data set, giving you context beyond just the text.

How do I manage an ongoing crawl using the `cancel_active_crawl` tool? +

You call cancel_active_crawl with the job ID. This immediately stops the crawl job, preventing further processing and saving credits. You can't restart a canceled job; you'll need to initiate a new crawl.

What is the difference between `scrape_url` and `map_website_structure`? +

scrape_url converts a single URL into clean, Markdown-ready content. map_website_structure simply discovers all reachable links on a site without extracting any content. Use map_website_structure first if you only need a site map.

How do I monitor my job progress after running `start_crawl`? +

You use get_crawl_status with the job ID returned by start_crawl. This tells you if the crawl is running, paused, or complete. It's the only way to track the real-time status of a background job.

How do I find my Firecrawl API Key? +

Can I scrape content excluding headers and footers? +

Yes! The scrape_url tool includes an onlyMainContent parameter. When set to true, Firecrawl uses AI to extract only the core article or page content.

How long does a recursive crawl take? +

Crawl time depends on the site size and depth. Use the get_crawl_status tool to monitor progress and retrieve results once the job is complete.

Use it with your favorite AI tools

Connect this server to Cursor, Claude, VS Code, and more.

OpenAI Agents SDK sdk-python

Google ADK sdk-python