4,500+ servers built on MCP Fusion

How to Use the arXiv MCP in LlamaIndex

Index arXiv preprints directly into your LlamaIndex vector stores for academic RAG.

Works with every AI agent you already use: Cursor, Claude Desktop, OpenAI Agents SDK, Visual Studio Code, GitHub Copilot, Google Gemini, Lovable, Mistral AI Agents, Amazon Bedrock, and any MCP-compatible client.

Connect arXiv MCP to LlamaIndex

Create your Vinkius account to connect arXiv to LlamaIndex and route execution through our secure gateway. The platform manages server hosting, runtime updates, and security layers. Configuration requires no manual server provisioning.

Index arXiv Search Results to LlamaIndex Vector Stores

The `search_arxiv` tool pulls raw metadata and PDF links from over 2.5 million preprints directly into your LlamaIndex pipeline. Instead of just reading the results, your system indexes these abstracts into a vector store to build a searchable local database of current research. This turns temporary search results into a persistent knowledge base. Your LlamaIndex agents can query this index later, combining fresh academic data with your private documents.
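
As a rough sketch of that indexing step, the helper below turns one search result into text plus metadata and then builds a vector index from a batch of results. The result keys (`id`, `title`, `summary`, `authors`, `pdf_url`) are assumptions made for illustration, since the actual `search_arxiv` response shape is not documented here; adjust them to whatever your endpoint returns. `Document` and `VectorStoreIndex` come from `llama_index.core` and are imported lazily so the normalization helper can be used on its own.

```python
from typing import Any


def result_to_record(result: dict[str, Any]) -> dict[str, Any]:
    """Normalize one search result (assumed shape) into text + metadata."""
    # Collapse line breaks and repeated spaces that often appear in abstracts.
    title = " ".join(str(result.get("title", "")).split())
    abstract = " ".join(str(result.get("summary", "")).split())
    return {
        "text": f"{title}\n\n{abstract}",
        "metadata": {
            "arxiv_id": result.get("id", ""),
            "authors": ", ".join(result.get("authors", [])),
            "pdf_url": result.get("pdf_url", ""),
        },
    }


def build_index(results: list[dict[str, Any]]):
    """Build a vector index from search results.

    Requires llama-index-core and a configured embedding model
    (the default one expects OPENAI_API_KEY in the environment).
    """
    from llama_index.core import Document, VectorStoreIndex

    documents = [
        Document(text=record["text"], metadata=record["metadata"])
        for record in map(result_to_record, results)
    ]
    return VectorStoreIndex.from_documents(documents)
```

Keeping the arXiv ID and PDF link in metadata means a later query against the index can still point back to the original preprint.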

Build Academic RAG with the arXiv MCP Server

The `get_arxiv_paper` tool retrieves full preprint details by ID. LlamaIndex uses this tool to ground its answers in the paper's actual metadata, which reduces the hallucinations that often affect research summaries. The `McpToolSpec` wrapper converts these retrieval functions into standard LlamaIndex tools, so your agent can decide when to pull fresh preprint data to resolve a user's technical question.
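
Because lookups are keyed by ID, it can help to clean up user-supplied identifiers before they reach the tool. The sketch below is one possible approach, not part of the Vinkius or LlamaIndex APIs: `normalize_arxiv_id` is a hypothetical helper that accepts a bare ID, an `arXiv:` prefixed ID, or an arxiv.org URL and returns the identifier in its canonical form, covering both the current `YYMM.NNNNN` scheme and the older `archive/YYMMNNN` scheme.

```python
import re

# Current scheme, e.g. 1706.03762 or 1706.03762v5
_NEW_STYLE = re.compile(r"^\d{4}\.\d{4,5}(v\d+)?$")
# Pre-2007 scheme, e.g. hep-th/9901001 or math.GT/0309136
_OLD_STYLE = re.compile(r"^[a-z-]+(\.[A-Z]{2})?/\d{7}(v\d+)?$")
_URL_PREFIX = re.compile(r"^https?://(www\.)?arxiv\.org/(abs|pdf)/", re.IGNORECASE)


def normalize_arxiv_id(raw: str) -> str:
    """Return a bare arXiv identifier or raise ValueError."""
    candidate = _URL_PREFIX.sub("", raw.strip())
    if candidate.lower().startswith("arxiv:"):
        candidate = candidate[len("arxiv:"):]
    if candidate.lower().endswith(".pdf"):
        candidate = candidate[: -len(".pdf")]
    if _NEW_STYLE.match(candidate) or _OLD_STYLE.match(candidate):
        return candidate
    raise ValueError(f"Not a recognizable arXiv identifier: {raw!r}")
```

Rejecting malformed IDs early avoids spending an agent step on a lookup that cannot succeed.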

Feed Live Preprint Data Into LlamaIndex Agents

The `search_arxiv` tool allows your LlamaIndex agents to query domains like physics, math, and computer science on the fly. Your agent can dynamically adjust its search parameters based on the semantic gaps it finds in your current index. You set this up using `BasicMCPClient` and pass the tool list to a `FunctionAgent`. The agent runs asynchronously, pulling down abstracts and organizing them into structured nodes for immediate retrieval.

Setup guide

Set up arXiv MCP in LlamaIndex

Prerequisites

  • Python 3.10+ installed
  • llama-index-tools-mcp package
  • Active Vinkius subscription with a valid endpoint token

  1. Install dependencies

    Run `pip install llama-index-tools-mcp llama-index-llms-openai`. The MCP tools package provides `BasicMCPClient` and `McpToolSpec`.

  2. Connect with BasicMCPClient

    Point `BasicMCPClient` to your Vinkius endpoint URL. Replace `[YOUR_TOKEN_HERE]` with your token from cloud.vinkius.com. `BasicMCPClient` supports both SSE and Streamable HTTP transports.

  3. Convert to LlamaIndex tools

    Call `mcp_tool_spec.to_tool_list_async()` to convert all arXiv MCP tools into native `FunctionTool` objects that any LlamaIndex agent can use.

  4. Run with any LLM

    Create a `FunctionAgent` with the tools and your preferred LLM. Swap OpenAI for Anthropic, Gemini, or any LlamaIndex-supported provider.

agent.py
import asyncio

from llama_index.core.agent.workflow import FunctionAgent
from llama_index.llms.openai import OpenAI
from llama_index.tools.mcp import BasicMCPClient, McpToolSpec


async def main() -> None:
    # Connect to the arXiv MCP server through your Vinkius endpoint
    mcp_client = BasicMCPClient(
        "https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp"
    )
    mcp_tool_spec = McpToolSpec(client=mcp_client)

    # Convert MCP tools to LlamaIndex FunctionTool objects
    tools = await mcp_tool_spec.to_tool_list_async()

    # Create and run the agent (OpenAI reads OPENAI_API_KEY from the environment)
    agent = FunctionAgent(
        tools=tools,
        llm=OpenAI(model="gpt-4o"),
        system_prompt="You have access to arXiv tools.",
    )
    response = await agent.run("List recent arXiv data")
    print(response)


# `await` is only valid inside a coroutine, so run the script via asyncio.run
if __name__ == "__main__":
    asyncio.run(main())

Independent Platform Disclaimer: Vinkius is an independent platform and is not affiliated with, endorsed by, sponsored by, verified by, or otherwise authorized by arXiv. All third-party trademarks, logos, and brand names are the property of their respective owners. Their use on this website is strictly for informational purposes to identify service compatibility and interoperability.

Why Choose Vinkius

Vinkius connects your tools to AI with real-time monitoring and automatic cost savings — all from one dashboard.

Real-time monitoring

Live visibility into every interaction

Connect your favorite tools to your AI and see exactly what's happening — every request, every response, in real time.

Built-in savings

60% lower AI costs

Vinkius compresses data between your apps and your AI automatically. Lower bills every month — no configuration required.

Single dashboard

One place for every integration

Every tool your AI connects to, managed from a single screen. One account, complete control.

Common questions about arXiv MCP in LlamaIndex

Install `llama-index-tools-mcp` and initialize `BasicMCPClient` pointing to your endpoint. Wrap it in `McpToolSpec` and call `to_tool_list_async()` to get tools for your agent.
Yes. Use `search_arxiv` to retrieve papers, convert the resulting abstracts into LlamaIndex Document objects, and ingest them into your vector index.
By calling `get_arxiv_paper`, the agent retrieves the exact abstract and metadata. LlamaIndex then passes this retrieved text to the response synthesizer as its context, so answers draw on the paper itself rather than on the model's memory.
Yes. Pass the `allowed_tools` filter to `McpToolSpec` to restrict the agent to either search or direct paper retrieval.
All paper IDs and search terms pass through an isolated V8 sandbox on Vinkius. No search history or metadata is stored permanently, keeping your research directions confidential.
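
To illustrate the `allowed_tools` filter mentioned above, here is a small sketch that validates the requested tool names against the two arXiv tools and then builds a restricted `McpToolSpec`. The helper names `pick_allowed_tools` and `build_tool_spec` are hypothetical; `allowed_tools` is a parameter of `McpToolSpec` in `llama-index-tools-mcp`, which is imported lazily here so the validation helper has no dependencies.

```python
# The two tools this page describes for the arXiv MCP server.
ARXIV_TOOLS = ("search_arxiv", "get_arxiv_paper")


def pick_allowed_tools(*names: str) -> list[str]:
    """Validate tool names before handing them to McpToolSpec."""
    unknown = [name for name in names if name not in ARXIV_TOOLS]
    if unknown:
        raise ValueError(f"Unknown arXiv MCP tools: {unknown}")
    return list(names)


def build_tool_spec(endpoint_url: str, allowed: list[str]):
    """Create a tool spec that exposes only the allowed tools."""
    from llama_index.tools.mcp import BasicMCPClient, McpToolSpec

    return McpToolSpec(client=BasicMCPClient(endpoint_url), allowed_tools=allowed)
```

For a search-only agent, you would call `build_tool_spec(url, pick_allowed_tools("search_arxiv"))` and convert the result with `to_tool_list_async()` as in the setup guide.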

Start using the arXiv MCP today

We host it, we monitor it, we maintain it. You just paste one token.

Built & Managed by Vinkius · 30s setup · 2 tools

We've already built the connector for arXiv. Just plug in your AI agents and start using Vinkius.

No hosting. No infrastructure. No complex setup.
Both tools are live and waiting. You're up and running in seconds.

Claude, ChatGPT, Cursor, Gemini, Windsurf, VS Code, JetBrains, Vercel, and other MCP clients

Vinkius gives your AI agents access to the full catalog of app connectors, all fully managed, secure, and enterprise-ready. One subscription, every tool you need.

Zero hosting required · Full MCP catalog included · Enterprise-grade security · Auto-updated by Vinkius

Built, hosted, and secured by Vinkius. You just connect and go.