# Data Extraction MCP Servers
Discover 70 MCP servers tagged with Data Extraction on the Vinkius App Catalog.
Tavily
6 tools · Search the web for AI. Audit search context, answers, and extracted content via AI.
Bing Search
10 tools · Power your AI agent with global web data via Bing Search. Query web pages, images, news, videos, and local businesses in real time.
Firecrawl
4 tools · Turn any website into clean, LLM-ready Markdown with a single API call. Scrape, crawl, search, and map the entire web for your AI agent.
Jina AI (Search Foundation & LLM Grounding) MCP
6 tools · Power your RAG and search via Jina AI. Generate embeddings, rerank documents, read URLs, and perform semantic web search.
Web Scraper MCP Server
5 tools · Equip your AI agent with the ability to read web pages, extract metadata, and crawl documentation sites as clean Markdown.
Apify Alternative MCP Server
10 tools · Manage your cloud automation. Audit actors, tasks, and datasets via AI.
Firecrawl
6 tools · Crawl and scrape entire websites into clean, LLM-ready Markdown with a single API call that handles JavaScript rendering.
Apify MCP
10 tools · Command Apify scrapers from your AI agent. Run actors, extract web data, poll datasets, and automate browser tasks seamlessly.
SerpApi
12 tools · Equip your AI agent with real-time web search capabilities across Google, Bing, Baidu, Yahoo, and DuckDuckGo.
Firecrawl Alternative MCP Integration
6 tools · Scrape and crawl the web. Audit website content and maps via AI.
SEC EDGAR
3 tools · Official US corporate filings database. Access 10-K, 10-Q, and financial data via AI.
Apify MCP
7 tools · Run web scraping actors, collect structured data, and manage storage datasets for large-scale data extraction projects.
Exa AI MCP
12 tools · Search the web with neural embeddings that understand meaning, not just keywords, and return the most relevant results for any query.
PDF.co
12 tools · Parse, generate, merge, and convert PDF documents programmatically with an API that handles complex document processing tasks.
Bright Data
10 tools · Access the world's #1 web data platform. Bypass anti-bot protections, extract structured search engine data, and manage scraping browsers directly from your AI agent.
Diffbot MCP
10 tools · Automate web data extraction via Diffbot. Turn any website into structured JSON data for articles, products, discussions, and more directly from any AI agent.
Import.io (Web Data Extraction) MCP
10 tools · Extract structured data from any website via Import.io. Run extractors, manage bulk crawls, and monitor API usage.
Oxylabs MCP Server
10 tools · Scrape any website via Oxylabs. Extract Google SERPs, Amazon products, Bing and Yandex results, or any arbitrary URL with JavaScript rendering, directly from any AI agent.
ScraperAPI MCP
10 tools · Equip your AI agent with proxy rotation and headless browsers to extract HTML, Google SERPs, and Amazon data at scale.
SEC EDGAR Financials — Revenue, Income, Assets, EPS & Industry Comparison
4 tools · Extract XBRL financial data from SEC filings: revenue, net income, total assets, liabilities, stockholders' equity, EPS, and cash for any U.S. public company. Compare financial metrics industry-wide across all companies using XBRL frames, like a free mini-Bloomberg terminal.
SerpApi Alternative MCP Server
6 tools · Scrape search engine results. Audit Google, Bing, and YouTube via AI.
Diffbot MCP Server
12 tools · Extract structured data from any web page using AI that understands content like a human and builds knowledge graphs automatically.
Scrapfly MCP
12 tools · Scrape web data at scale with a managed API that handles proxies, browser rendering, and anti-bot bypassing automatically.
ScrapingAnt MCP Server
5 tools · Extract web data reliably with rotating proxies, headless Chrome rendering, and CAPTCHA solving built into every request.
ScrapingBee
10 tools · Scrape websites without getting blocked using headless browsers, proxy rotation, and JavaScript rendering handled for you.
HTML DOM Query Engine MCP Integration
1 tool · Extract specific text and attributes from massive HTML payloads instantly using CSS selectors. Fast, memory-efficient DOM parsing.
iCal Calendar Parser MCP Server
1 tool · Parse .ics calendar files exported from Google Calendar, Apple Calendar, or Outlook locally. Let your AI find free slots, count meetings, and manage your schedule.
OFX Bank Statement Parser MCP Server
1 tool · Turn archaic OFX/QFX bank exports into clean JSON transactions safely and locally. Let your AI act as your personal accountant without uploading sensitive financial data.
PDF Invoice Data Extractor MCP
1 tool · Extract raw text directly from digital PDF invoices, entirely locally. Keeps your sensitive accounting data air-gapped while letting the AI classify tax IDs (NIFs), suppliers, and totals.
Affinda MCP Server
5 tools · Intelligent document processing. Parse resumes, invoices, and IDs via AI.
Browse AI
10 tools · Automate web data extraction via Browse AI. Run robots, monitor websites, and retrieve captured data directly from any AI agent.
Dext
10 tools · Equip your AI agent to manage receipts, track invoices, and monitor accounting data via the Dext API.