Oxylabs MCP Server for LlamaIndex 10 tools — connect in under 2 minutes
LlamaIndex specializes in data-aware AI agents that connect LLMs to structured and unstructured sources. Add Oxylabs as an MCP tool provider through Vinkius and your agents can query, analyze, and act on live data alongside your existing indexes.
ASK AI ABOUT THIS MCP SERVER
Vinkius supports streamable HTTP and SSE.
import asyncio
from llama_index.tools.mcp import BasicMCPClient, McpToolSpec
from llama_index.core.agent.workflow import FunctionAgent
from llama_index.llms.openai import OpenAI
async def main():
# Your Vinkius token. get it at cloud.vinkius.com
mcp_client = BasicMCPClient("https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp")
mcp_tool_spec = McpToolSpec(client=mcp_client)
tools = await mcp_tool_spec.to_tool_list_async()
agent = FunctionAgent(
tools=tools,
llm=OpenAI(model="gpt-4o"),
system_prompt=(
"You are an assistant with access to Oxylabs. "
"You have 10 tools available."
),
)
response = await agent.run(
"What tools are available in Oxylabs?"
)
print(response)
asyncio.run(main())
* Every MCP server runs on Vinkius-managed infrastructure inside AWS - a purpose-built runtime with per-request V8 isolates, Ed25519 signed audit chains, and sub-40ms cold starts optimized for native MCP execution. See our infrastructure
About Oxylabs MCP Server
Give your AI agent the ability to extract structured data from any website with Oxylabs Web Scraping API. Scrape Google, Amazon, Bing, and Yandex with specialized parsers, or hit any arbitrary URL with the universal scraper — with optional headless browser rendering.
LlamaIndex agents combine Oxylabs tool responses with indexed documents for comprehensive, grounded answers. Connect 10 tools through Vinkius and query live data alongside vector stores and SQL databases in a single turn. ideal for hybrid search, data enrichment, and analytical workflows.
What you can do
- Universal Scraper — Extract HTML from any website. Enable JS rendering for single-page apps that require client-side execution
- Google Search — Parse Google SERPs with organic rankings, knowledge panels, ads, and featured snippets, localized by country TLD
- Google Shopping — Extract product listings with merchants, prices, reviews, and availability from Google Shopping
- Google Images — Pull image URLs, dimensions, and hosting origins from Google Image Search
- Google News — Extract time-indexed news articles with publishers and publication dates
- Amazon Search — Scrape Amazon product listings with ASINs, prices, Prime markers, and seller data across regional domains
- Amazon Product — Deep-extract individual product pages with variations, stock states, bullet points, and vendor hierarchies
- Bing Search — Parse Bing SERPs with organic web graph snippets
- Yandex Search — Extract rankings from the Yandex search engine for CIS/RU market analysis
- Custom Payload — Send fully customized Oxylabs schema payloads for advanced proxy and location configurations
The Oxylabs MCP Server exposes 10 tools through the Vinkius. Connect it to LlamaIndex in under two minutes — no API keys to rotate, no infrastructure to provision, no vendor lock-in. Your configuration, your data, your control.
How to Connect Oxylabs to LlamaIndex via MCP
Follow these steps to integrate the Oxylabs MCP Server with LlamaIndex.
Install dependencies
Run pip install llama-index-tools-mcp llama-index-llms-openai
Replace the token
Replace [YOUR_TOKEN_HERE] with your Vinkius token
Run the agent
Save to agent.py and run: python agent.py
Explore tools
The agent discovers 10 tools from Oxylabs
Why Use LlamaIndex with the Oxylabs MCP Server
LlamaIndex provides unique advantages when paired with Oxylabs through the Model Context Protocol.
Data-first architecture: LlamaIndex agents combine Oxylabs tool responses with indexed documents for comprehensive, grounded answers
Query pipeline framework lets you chain Oxylabs tool calls with transformations, filters, and re-rankers in a typed pipeline
Multi-source reasoning: agents can query Oxylabs, a vector store, and a SQL database in a single turn and synthesize results
Observability integrations show exactly what Oxylabs tools were called, what data was returned, and how it influenced the final answer
Oxylabs + LlamaIndex Use Cases
Practical scenarios where LlamaIndex combined with the Oxylabs MCP Server delivers measurable value.
Hybrid search: combine Oxylabs real-time data with embedded document indexes for answers that are both current and comprehensive
Data enrichment: query Oxylabs to augment indexed data with live information before generating user-facing responses
Knowledge base agents: build agents that maintain and update knowledge bases by periodically querying Oxylabs for fresh data
Analytical workflows: chain Oxylabs queries with LlamaIndex's data connectors to build multi-source analytical reports
Oxylabs MCP Tools for LlamaIndex (10)
These 10 tools become available when you connect Oxylabs to LlamaIndex via MCP:
scrape_amazon_product
Extract deep E-Commerce retail arrays extracting arbitrary Amazon ASINs
scrape_amazon_search
Perform structured data extraction looping over Amazon query listings
scrape_bing_serp
Calculate and extract structured Bing Search Web mappings
scrape_custom_payload
oxylabs.io/v1/queries` customizing Proxy Types, Locations and deeply nested config params. Dispatch entirely arbitrary strict Oxylabs Payload JSON structures
scrape_google_images
Parse and Scrape specific Google Image Search layout blocks
scrape_google_news
Export time-indexed News articles via Google News blocks
scrape_google_serp
Extract Google Search Engine Results Page (SERP) blocks structurally
scrape_google_shopping
Extract E-Commerce structures hitting Google Shopping engines
scrape_universal
oxylabs.io/v1/queries` endpoint mapping `source=universal`. Dynamically extracts raw remote HTML safely avoiding proxy blocks. Use `js_render=true` explicitly if the target SPA strictly requires client rendering. Scrape any arbitrary HTML website using Oxylabs Universal Scraper API
scrape_yandex_serp
Execute structural analysis indexing Yandex Search instances
Example Prompts for Oxylabs in LlamaIndex
Ready-to-use prompts you can give your LlamaIndex agent to start working with Oxylabs immediately.
"Search Google for 'best noise cancelling headphones 2024' and show me the top 5 organic results."
"Get the full product details for Amazon ASIN B0C8PSXB2M."
"Scrape the homepage of techcrunch.com with JavaScript rendering enabled."
Troubleshooting Oxylabs MCP Server with LlamaIndex
Common issues when connecting Oxylabs to LlamaIndex through the Vinkius, and how to resolve them.
BasicMCPClient not found
pip install llama-index-tools-mcpOxylabs + LlamaIndex FAQ
Common questions about integrating Oxylabs MCP Server with LlamaIndex.
How does LlamaIndex connect to MCP servers?
Can I combine MCP tools with vector stores?
Does LlamaIndex support async MCP calls?
Connect Oxylabs with your favorite client
Step-by-step setup guides for every MCP-compatible client and framework:
Anthropic's native desktop app for Claude with built-in MCP support.
AI-first code editor with integrated LLM-powered coding assistance.
GitHub Copilot in VS Code with Agent mode and MCP support.
Purpose-built IDE for agentic AI coding workflows.
Autonomous AI coding agent that runs inside VS Code.
Anthropic's agentic CLI for terminal-first development.
Python SDK for building production-grade OpenAI agent workflows.
Google's framework for building production AI agents.
Type-safe agent development for Python with first-class MCP support.
TypeScript toolkit for building AI-powered web applications.
TypeScript-native agent framework for modern web stacks.
Python framework for orchestrating collaborative AI agent crews.
Leading Python framework for composable LLM applications.
Data-aware AI agent framework for structured and unstructured sources.
Microsoft's framework for multi-agent collaborative conversations.
Connect Oxylabs to LlamaIndex
Get your token, paste the configuration, and start using 10 tools in under 2 minutes. No API key management needed.
