arXiv MCP. Find cutting-edge research before it's published.

Q: How do I use the searcharxiv tool?

To use searcharxiv, provide keywords and optionally specify categories or boolean filters. For example, 'LLM AND reasoning' in the cs.AI category.

Q: What is the difference between searcharxiv and getarxivpaper?

searcharxiv finds papers based on keywords across 2.5M+ preprints. getarxivpaper retrieves the complete record for one paper when you already have its specific arXiv ID.

Q: Does getarxivpaper give me the PDF link?

Yes, getarxivpaper returns the full metadata package, which includes a direct PDF download link for the paper.

Q: Can I search for papers across different scientific fields?

Yes, searcharxiv supports multiple domains. You can filter results by categories like physics, math, or economics simultaneously.

Q: What if I use searcharxiv and only get a title?

The searcharxiv tool returns title, authors, abstract, categories, and the PDF link in every result, so you get much more than just a title.

Q: How do I handle large result sets using the searcharxiv tool?

The searcharxiv tool handles large result sets by providing a paginated list of results. You receive summaries, including the title, authors, and abstract for each paper, which keeps the data manageable. If you need more than the initial results, the agent should request the next page of results.

Q: What format do the IDs need to be for getarxivpaper?

The getarxivpaper tool accepts two common arXiv ID formats: the numeric format (e.g., 2106.09685) or the older alphanumeric format (e.g., cs/0101001). Use either one, and the tool pulls the complete metadata.

Q: Does the searcharxiv tool support boolean logic in queries?

Yes, the searcharxiv tool supports boolean logic. You can combine keywords using AND, OR, and NOT operators to narrow your search scope, making highly specific queries possible.

Claude

ChatGPT

Cursor

Gemini

Windsurf

VS Code

JetBrains

Vercel

See Vinkius in Action

Works with every AI agent you already use

…and any MCP-compatible client

Just plug in your AI agents and start using Vinkius.

arXiv MCP Server lets you search and retrieve scientific preprints. Access 2.5M+ papers across physics, computer science, math, and biology.

You can find specific papers by ID using `get_arxiv_paper` or explore cutting-edge research using `search_arxiv`. It pulls full abstracts, author lists, and direct PDF links for the latest academic findings.

What your AI agents can do

Get arxiv paper

Retrieves the full metadata (authors, abstract, date, PDF link) for a specific paper using its unique arXiv ID.

Search arxiv

Searches 2.5M+ scientific preprints across multiple fields by keywords, categories, and boolean logic.

Search by Keyword and Category

Find multiple papers across scientific fields by passing keywords, boolean logic, and specific categories to search_arxiv.

Retrieve Full Paper Metadata

Get all structured details—authors, abstract, categories, and the PDF link—for one specific paper ID using get_arxiv_paper.

Filter by Scientific Domain

Narrow search results using specific arXiv categories, such as cs.AI (AI/ML), physics, or math.

Handle Boolean Search Logic

Combine search terms using boolean operators to target highly specific concepts (e.g., 'Transformer AND causality').

Download PDF Links

Each search result and detailed paper retrieval includes a direct, actionable link to the PDF file.

Ask AI about this MCP

Ask ChatGPT

Ask Claude

Ask Perplexity

Supported MCP Clients

Claude

ChatGPT

Cursor

Gemini

Windsurf

VS Code

JetBrains

Vercel

+ other MCP clients

Free for Subscribers

Waiting for input…

AI Agent

get019d7552

get arxiv paper

Retrieves the full metadata (authors, abstract, date, PDF link) for a specific paper using its unique arXiv ID.

search019d7552

search arxiv

Searches 2.5M+ scientific preprints across multiple fields by keywords, categories, and boolean logic.

Choose How to Get Started

Build a custom MCP for your own tools, or connect a ready-made integration from our catalog.

Build Your Own

Turn any API into an MCP. Import a spec, define Agent Skills, or deploy with MCPFusion.

Import from OpenAPI, Swagger, or YAML specs
Create Agent Skills with progressive disclosure
Deploy to edge with MCPFusion framework
Built in DLP, auth, and compliance on every call
Real time usage dashboard and cost metering
Publish to catalog or keep private

Start building

Make Your AI Do More

Start with arXiv, then connect any of our 4,700+ other servers whenever your AI needs more. One click, no limits.

Use this MCP plus 4,700+ others, all in one place
Add new capabilities to your AI anytime you want
Every connection is secured and compliant automatically
Track usage and costs across all your servers
Works with Claude, ChatGPT, Cursor, and more
New servers added to the catalog every week

What you can do with this MCP connector

arXiv Server - Search Scientific Preprints & Abstracts lets your AI client drill into the latest academic research. You're looking at 2.5 million preprints across physics, computer science, math, and biology. You can use search_arxiv to find multiple papers by mixing keywords, specific categories, and boolean logic. Need to narrow it down? You can pass specific categories like cs.AI, physics, or math to limit your search.

You can combine terms using boolean operators, like 'Transformer AND causality', to target super specific concepts. Every search result you pull back includes a direct, working link to the PDF. If you already know the exact paper ID, you can use get_arxiv_paper to grab all the structured details: the title, every author, the full abstract, categories, the date, and that PDF link.

You'll never need an API key either. Just connect your agent, and you're set.

How arXiv MCP Works

1 Start by telling your AI client the scope of your search—for instance, 'Find papers on quantum error correction in the physics category.'
2 The client uses the search_arxiv tool, providing the keywords and necessary filters. It gets back a list of titles, authors, abstracts, and PDF links.
3 If you need deep details on one result, the client then calls get_arxiv_paper with the specific arXiv ID to pull the full, structured record.

The bottom line is: You search broadly with search_arxiv, and then you drill down with get_arxiv_paper when you need specifics.

Who Is arXiv MCP For?

The ML engineer who needs to know what's new in LLMs before it hits a journal. The academic researcher needing to track specific theoretical proofs in real-time. The data scientist who has to synthesize findings from disparate, cutting-edge sources. This is for people who need to operate at the bleeding edge of knowledge.

AI/ML Engineer

Searches for the latest preprints on transformer models, diffusion, or reinforcement learning to keep up with academic breakthroughs.

Theoretical Physicist

Tracks new findings in quantum computing or high-energy physics as they are released, ensuring no critical paper is missed.

Quantitative Researcher

Uses structured searches to find new statistical methodologies or theorems in mathematics and economics.

What Changes When You Connect

Find the latest work immediately. search_arxiv covers physics, CS, math, and biology, letting you see the most recent preprints without waiting for journal publication.
Get structured metadata, not just links. get_arxiv_paper returns all authors, the full abstract, and the DOI (if available) for a paper ID, saving you multiple lookups.
Target complex concepts. You can use boolean logic and category filters in search_arxiv to find papers like 'LLM AND reasoning' across specific domains (e.g., cs.AI).
Access specialized fields. Need to track quantum computing? Filter by quant-ph or physics in search_arxiv. The tool lets you focus on deep, niche topics.
Save time on PDF retrieval. Both tools provide direct, actionable links, letting your agent immediately pull the full text for review.

Real-World Use Cases

Tracking a specific model's evolution

An AI engineer needs to know every paper related to 'Diffusion Models' published in the last month. They run search_arxiv with 'Diffusion Model' and filter by cs.AI. The agent returns 15 recent preprints, giving the engineer a complete, categorized list to review.

Investigating a known theoretical paper

A researcher has an old paper ID, arXiv:1706.03762. They run get_arxiv_paper with this ID. The agent instantly pulls the full details—the abstract, all authors, and the PDF link—without needing to visit the original site.

Broad literature review on a new topic

A data scientist wants a quick overview of 'Self-Consistency' in LLMs. They use search_arxiv with the keyword 'self-consistency' and set the category to cs.AI. The agent compiles the top 10 results, giving a broad, immediate picture of the field's current state.

Comparing methodologies across domains

A quant researcher needs to see how 'Topological Codes' are discussed in both quantum physics and advanced mathematics. They use search_arxiv by combining 'topological code' with a category filter for physics and math respectively, comparing the results side-by-side.

The Tradeoffs

Over-relying on general search engines

Typing 'AI papers on LLMs' into Google Scholar and clicking through 50 links. This forces you to manually check if the paper is a preprint, if it's peer-reviewed, or if the abstract is complete.

→ Use search_arxiv instead. It filters results by preprint status and gives you the full abstract and categories right away. Use get_arxiv_paper if you already have an ID.

Trying to search by vague concepts

Asking the agent to 'find papers about good AI.' This is too broad and yields irrelevant results because it lacks technical constraints.

→ Be specific. Use search_arxiv and combine keywords with categories (e.g., 'LLM AND reasoning' AND cs.AI). Always include the domain filter.

Forgetting the ID format

Passing a title like 'Transformer' to get_arxiv_paper. The tool requires a specific arXiv ID format (e.g., 2106.09685).

→ Only use get_arxiv_paper when you have the exact identifier. For general searches, always start with search_arxiv.

When It Fits, When It Doesn't

Use this server if your primary need is to find the most current, academically rigorous information, especially in fast-moving fields like AI/ML or Quantum Physics. You must use it if you need to differentiate between a general academic search and a live, pre-peer-reviewed preprint archive.

Don't use this if you are looking for citation history within a specific university database (use specialized citation tools) or if you are only interested in the final, published journal version (use journal APIs). This server is for the pre-review stage. Always choose search_arxiv for discovery and get_arxiv_paper for known targets.

Independent Platform Disclaimer: Vinkius is an independent platform and is not affiliated with, endorsed by, sponsored by, verified by, or otherwise authorized by arXiv. All third-party trademarks, logos, and brand names are the property of their respective owners. Their use on this website is strictly for informational purposes to identify service compatibility and interoperability.

VINKIUS INFRASTRUCTURE

Cloud Hosted

Managed infra

V8 Isolated

Sandboxed per request

Zero-Trust Proxy

No stored credentials

DLP Enforced

Policy on every call

GDPR Compliant

EU data residency

Token Compression

~60% cost reduction

How we secure it →

Works with Claude, ChatGPT, Cursor, and more

The Model Context Protocol standardizes how applications expose capabilities to LLMs. Instead of operating in isolation, your AI gains direct access to external platforms, live data, and real-world actions through secure, standardized connections.

This server provides 2 capabilities that interface natively with Claude, ChatGPT, Cursor, and any MCP client. No middleware. No custom integration required.

Available Capabilities

get_arxiv_paper search_arxiv

Finding cutting-edge research shouldn't feel like digging through a dumpster.

Right now, finding the latest scientific breakthroughs means hopping between Google Scholar, specialized university sites, and preprint feeds. You copy keywords, paste them into a search box, then you spend 20 minutes clicking through dozens of result pages, manually checking if the paper is a preprint, if the abstract is complete, or if the link actually works.

With the arXiv MCP Server, your agent handles the noise. You tell it the topic, and it uses `search_arxiv` to return a clean list of results. You get the full abstract, author list, and direct PDF link—all in one place. No clicking through dozens of messy search result pages.

get_arxiv_paper: Get the full record for any specific paper ID

Previously, if you knew an arXiv ID but needed the full context—like a list of all authors or the exact publication date—you'd have to visit the main site, parse the page, and copy-paste the details. It was a multi-step, error-prone process just to get metadata.

Now, calling `get_arxiv_paper` pulls that structured data directly into your workflow. You get the title, authors, and abstract formatted cleanly. It’s an instant, reliable data dump, letting you analyze the context without ever leaving your agent's environment.

Common Questions About arXiv MCP

How do I use the `search_arxiv` tool? +

To use search_arxiv, provide keywords and optionally specify categories or boolean filters. For example, 'LLM AND reasoning' in the cs.AI category.

What is the difference between `search_arxiv` and `get_arxiv_paper`? +

search_arxiv finds papers based on keywords across 2.5M+ preprints. get_arxiv_paper retrieves the complete record for one paper when you already have its specific arXiv ID.

Does `get_arxiv_paper` give me the PDF link? +

Yes, get_arxiv_paper returns the full metadata package, which includes a direct PDF download link for the paper.

Can I search for papers across different scientific fields? +

Yes, search_arxiv supports multiple domains. You can filter results by categories like physics, math, or economics simultaneously.

What if I use `search_arxiv` and only get a title? +

The search_arxiv tool returns title, authors, abstract, categories, and the PDF link in every result, so you get much more than just a title.

How do I handle large result sets using the `search_arxiv` tool? +

The search_arxiv tool handles large result sets by providing a paginated list of results. You receive summaries, including the title, authors, and abstract for each paper, which keeps the data manageable. If you need more than the initial results, the agent should request the next page of results.

What format do the IDs need to be for `get_arxiv_paper`? +

The get_arxiv_paper tool accepts two common arXiv ID formats: the numeric format (e.g., 2106.09685) or the older alphanumeric format (e.g., cs/0101001). Use either one, and the tool pulls the complete metadata.

Does the `search_arxiv` tool support boolean logic in queries? +

Yes, the search_arxiv tool supports boolean logic. You can combine keywords using AND, OR, and NOT operators to narrow your search scope, making highly specific queries possible.

What is a preprint and how does arXiv work? +

A preprint is a scientific paper shared publicly before formal peer review. arXiv allows researchers to share their findings immediately upon completion, accelerating scientific communication. Papers are assigned a permanent arXiv ID (e.g., 2106.09685) and are freely accessible forever. Many landmark papers in AI, physics, and math appeared on arXiv months or years before journal publication.

What scientific categories and disciplines does arXiv cover? +

arXiv covers 8 major domains: Physics (astro-ph, cond-mat, hep, quant-ph), Mathematics (math.*), Computer Science (cs.AI, cs.LG, cs.CL, cs.CV, cs.CR, etc.), Quantitative Biology (q-bio), Quantitative Finance (q-fin), Statistics (stat), Electrical Engineering (eess), and Economics (econ). Each domain has multiple sub-categories for precise filtering.

Is arXiv free and do I need an API key? +

Yes, arXiv is completely free and operated as a non-profit by Cornell University. No API key or registration is required for search queries. The only limitation is a rate limit of approximately 1 request per 3 seconds for the search API. All papers are freely downloadable as PDF and accessible in perpetuity.

Use it with your favorite AI tools

Connect this server to Cursor, Claude, VS Code, and more.

OpenAI Agents SDK sdk-python

Google ADK sdk-python

Pydantic AI sdk-python

Vercel AI SDK sdk-typescript