arXiv MCP. Find cutting-edge research before it's published.
Works with every AI agent you already use
…and any MCP-compatible client
Just plug in your AI agents and start using Vinkius.
arXiv MCP Server lets you search and retrieve scientific preprints. Access 2.5M+ papers across physics, computer science, math, and biology.
You can find specific papers by ID using `get_arxiv_paper` or explore cutting-edge research using `search_arxiv`. It pulls full abstracts, author lists, and direct PDF links for the latest academic findings.
What your AI agents can do
Get arxiv paper
Retrieves the full metadata (authors, abstract, date, PDF link) for a specific paper using its unique arXiv ID.
Search arxiv
Searches 2.5M+ scientific preprints across multiple fields by keywords, categories, and boolean logic.
Find multiple papers across scientific fields by passing keywords, boolean logic, and specific categories to search_arxiv.
Get all structured details—authors, abstract, categories, and the PDF link—for one specific paper ID using get_arxiv_paper.
Narrow search results using specific arXiv categories, such as cs.AI (AI/ML), physics, or math.
Combine search terms using boolean operators to target highly specific concepts (e.g., 'Transformer AND causality').
Each search result and detailed paper retrieval includes a direct, actionable link to the PDF file.
Ask AI about this MCP
Supported MCP Clients
Waiting for input…
019d7552get arxiv paper
Retrieves the full metadata (authors, abstract, date, PDF link) for a specific paper using its unique arXiv ID.
019d7552search arxiv
Searches 2.5M+ scientific preprints across multiple fields by keywords, categories, and boolean logic.
Choose How to Get Started
Build a custom MCP for your own tools, or connect a ready-made integration from our catalog.
Build Your Own
Turn any API into an MCP. Import a spec, define Agent Skills, or deploy with MCPFusion.
- Import from OpenAPI, Swagger, or YAML specs
- Create Agent Skills with progressive disclosure
- Deploy to edge with MCPFusion framework
- Built in DLP, auth, and compliance on every call
- Real time usage dashboard and cost metering
- Publish to catalog or keep private
Make Your AI Do More
Start with arXiv, then connect any of our 4,700+ other servers whenever your AI needs more. One click, no limits.
- Use this MCP plus 4,700+ others, all in one place
- Add new capabilities to your AI anytime you want
- Every connection is secured and compliant automatically
- Track usage and costs across all your servers
- Works with Claude, ChatGPT, Cursor, and more
- New servers added to the catalog every week
What you can do with this MCP connector
arXiv Server - Search Scientific Preprints & Abstracts lets your AI client drill into the latest academic research. You're looking at 2.5 million preprints across physics, computer science, math, and biology. You can use search_arxiv to find multiple papers by mixing keywords, specific categories, and boolean logic. Need to narrow it down? You can pass specific categories like cs.AI, physics, or math to limit your search.
You can combine terms using boolean operators, like 'Transformer AND causality', to target super specific concepts. Every search result you pull back includes a direct, working link to the PDF. If you already know the exact paper ID, you can use get_arxiv_paper to grab all the structured details: the title, every author, the full abstract, categories, the date, and that PDF link.
You'll never need an API key either. Just connect your agent, and you're set.
How arXiv MCP Works
- 1 Start by telling your AI client the scope of your search—for instance, 'Find papers on quantum error correction in the physics category.'
- 2 The client uses the
search_arxivtool, providing the keywords and necessary filters. It gets back a list of titles, authors, abstracts, and PDF links. - 3 If you need deep details on one result, the client then calls
get_arxiv_paperwith the specific arXiv ID to pull the full, structured record.
The bottom line is: You search broadly with search_arxiv, and then you drill down with get_arxiv_paper when you need specifics.
Who Is arXiv MCP For?
The ML engineer who needs to know what's new in LLMs before it hits a journal. The academic researcher needing to track specific theoretical proofs in real-time. The data scientist who has to synthesize findings from disparate, cutting-edge sources. This is for people who need to operate at the bleeding edge of knowledge.
Searches for the latest preprints on transformer models, diffusion, or reinforcement learning to keep up with academic breakthroughs.
Tracks new findings in quantum computing or high-energy physics as they are released, ensuring no critical paper is missed.
Uses structured searches to find new statistical methodologies or theorems in mathematics and economics.
What Changes When You Connect
- Find the latest work immediately.
search_arxivcovers physics, CS, math, and biology, letting you see the most recent preprints without waiting for journal publication. - Get structured metadata, not just links.
get_arxiv_paperreturns all authors, the full abstract, and the DOI (if available) for a paper ID, saving you multiple lookups. - Target complex concepts. You can use boolean logic and category filters in
search_arxivto find papers like 'LLM AND reasoning' across specific domains (e.g.,cs.AI). - Access specialized fields. Need to track quantum computing? Filter by
quant-phorphysicsinsearch_arxiv. The tool lets you focus on deep, niche topics. - Save time on PDF retrieval. Both tools provide direct, actionable links, letting your agent immediately pull the full text for review.
Real-World Use Cases
Tracking a specific model's evolution
An AI engineer needs to know every paper related to 'Diffusion Models' published in the last month. They run search_arxiv with 'Diffusion Model' and filter by cs.AI. The agent returns 15 recent preprints, giving the engineer a complete, categorized list to review.
Investigating a known theoretical paper
A researcher has an old paper ID, arXiv:1706.03762. They run get_arxiv_paper with this ID. The agent instantly pulls the full details—the abstract, all authors, and the PDF link—without needing to visit the original site.
Broad literature review on a new topic
A data scientist wants a quick overview of 'Self-Consistency' in LLMs. They use search_arxiv with the keyword 'self-consistency' and set the category to cs.AI. The agent compiles the top 10 results, giving a broad, immediate picture of the field's current state.
Comparing methodologies across domains
A quant researcher needs to see how 'Topological Codes' are discussed in both quantum physics and advanced mathematics. They use search_arxiv by combining 'topological code' with a category filter for physics and math respectively, comparing the results side-by-side.
The Tradeoffs
Over-relying on general search engines
Typing 'AI papers on LLMs' into Google Scholar and clicking through 50 links. This forces you to manually check if the paper is a preprint, if it's peer-reviewed, or if the abstract is complete.
→
Use search_arxiv instead. It filters results by preprint status and gives you the full abstract and categories right away. Use get_arxiv_paper if you already have an ID.
Trying to search by vague concepts
Asking the agent to 'find papers about good AI.' This is too broad and yields irrelevant results because it lacks technical constraints.
→
Be specific. Use search_arxiv and combine keywords with categories (e.g., 'LLM AND reasoning' AND cs.AI). Always include the domain filter.
Forgetting the ID format
Passing a title like 'Transformer' to get_arxiv_paper. The tool requires a specific arXiv ID format (e.g., 2106.09685).
→
Only use get_arxiv_paper when you have the exact identifier. For general searches, always start with search_arxiv.
When It Fits, When It Doesn't
Use this server if your primary need is to find the most current, academically rigorous information, especially in fast-moving fields like AI/ML or Quantum Physics. You must use it if you need to differentiate between a general academic search and a live, pre-peer-reviewed preprint archive.
Don't use this if you are looking for citation history within a specific university database (use specialized citation tools) or if you are only interested in the final, published journal version (use journal APIs). This server is for the pre-review stage. Always choose search_arxiv for discovery and get_arxiv_paper for known targets.
Independent Platform Disclaimer: Vinkius is an independent platform and is not affiliated with, endorsed by, sponsored by, verified by, or otherwise authorized by arXiv. All third-party trademarks, logos, and brand names are the property of their respective owners. Their use on this website is strictly for informational purposes to identify service compatibility and interoperability.
VINKIUS INFRASTRUCTURE
Cloud Hosted
Managed infra
V8 Isolated
Sandboxed per request
Zero-Trust Proxy
No stored credentials
DLP Enforced
Policy on every call
GDPR Compliant
EU data residency
Token Compression
~60% cost reduction
Works with Claude, ChatGPT, Cursor, and more
The Model Context Protocol standardizes how applications expose capabilities to LLMs. Instead of operating in isolation, your AI gains direct access to external platforms, live data, and real-world actions through secure, standardized connections.
This server provides 2 capabilities that interface natively with Claude, ChatGPT, Cursor, and any MCP client. No middleware. No custom integration required.
Available Capabilities
Finding cutting-edge research shouldn't feel like digging through a dumpster.
Right now, finding the latest scientific breakthroughs means hopping between Google Scholar, specialized university sites, and preprint feeds. You copy keywords, paste them into a search box, then you spend 20 minutes clicking through dozens of result pages, manually checking if the paper is a preprint, if the abstract is complete, or if the link actually works.
With the arXiv MCP Server, your agent handles the noise. You tell it the topic, and it uses `search_arxiv` to return a clean list of results. You get the full abstract, author list, and direct PDF link—all in one place. No clicking through dozens of messy search result pages.
get_arxiv_paper: Get the full record for any specific paper ID
Previously, if you knew an arXiv ID but needed the full context—like a list of all authors or the exact publication date—you'd have to visit the main site, parse the page, and copy-paste the details. It was a multi-step, error-prone process just to get metadata.
Now, calling `get_arxiv_paper` pulls that structured data directly into your workflow. You get the title, authors, and abstract formatted cleanly. It’s an instant, reliable data dump, letting you analyze the context without ever leaving your agent's environment.
Common Questions About arXiv MCP
How do I use the `search_arxiv` tool? +
To use search_arxiv, provide keywords and optionally specify categories or boolean filters. For example, 'LLM AND reasoning' in the cs.AI category.
What is the difference between `search_arxiv` and `get_arxiv_paper`? +
search_arxiv finds papers based on keywords across 2.5M+ preprints. get_arxiv_paper retrieves the complete record for one paper when you already have its specific arXiv ID.
Does `get_arxiv_paper` give me the PDF link? +
Yes, get_arxiv_paper returns the full metadata package, which includes a direct PDF download link for the paper.
Can I search for papers across different scientific fields? +
Yes, search_arxiv supports multiple domains. You can filter results by categories like physics, math, or economics simultaneously.
What if I use `search_arxiv` and only get a title? +
The search_arxiv tool returns title, authors, abstract, categories, and the PDF link in every result, so you get much more than just a title.
How do I handle large result sets using the `search_arxiv` tool? +
The search_arxiv tool handles large result sets by providing a paginated list of results. You receive summaries, including the title, authors, and abstract for each paper, which keeps the data manageable. If you need more than the initial results, the agent should request the next page of results.
What format do the IDs need to be for `get_arxiv_paper`? +
The get_arxiv_paper tool accepts two common arXiv ID formats: the numeric format (e.g., 2106.09685) or the older alphanumeric format (e.g., cs/0101001). Use either one, and the tool pulls the complete metadata.
Does the `search_arxiv` tool support boolean logic in queries? +
Yes, the search_arxiv tool supports boolean logic. You can combine keywords using AND, OR, and NOT operators to narrow your search scope, making highly specific queries possible.
What is a preprint and how does arXiv work? +
A preprint is a scientific paper shared publicly before formal peer review. arXiv allows researchers to share their findings immediately upon completion, accelerating scientific communication. Papers are assigned a permanent arXiv ID (e.g., 2106.09685) and are freely accessible forever. Many landmark papers in AI, physics, and math appeared on arXiv months or years before journal publication.
What scientific categories and disciplines does arXiv cover? +
arXiv covers 8 major domains: Physics (astro-ph, cond-mat, hep, quant-ph), Mathematics (math.*), Computer Science (cs.AI, cs.LG, cs.CL, cs.CV, cs.CR, etc.), Quantitative Biology (q-bio), Quantitative Finance (q-fin), Statistics (stat), Electrical Engineering (eess), and Economics (econ). Each domain has multiple sub-categories for precise filtering.
Is arXiv free and do I need an API key? +
Yes, arXiv is completely free and operated as a non-profit by Cornell University. No API key or registration is required for search queries. The only limitation is a rate limit of approximately 1 request per 3 seconds for the search API. All papers are freely downloadable as PDF and accessible in perpetuity.
Use it with your favorite AI tools
Connect this server to Cursor, Claude, VS Code, and more.
More in this category
Eurostat Full Access — EU Statistical Intelligence
The ultimate EU statistics Mega-Server: 26 tools spanning economy (GDP, inflation, debt), demographics (population, unemployment, migration), trade, environment (emissions, energy, renewables), and 7,000+ dataset discovery — all 27 EU member states.
NASA TechPort (Technology Projects)
Explore NASA's technology project portfolio—search projects, track funding opportunities, and analyze R&D taxonomies directly.
NOAA Observations — US Current Conditions
Real-time weather observations from thousands of official NWS stations: temperature, wind speed and direction, humidity, barometric pressure, visibility, and weather conditions across the United States.
You might also like
UKG Pro Learning
Manage employee training, courses, and learning paths via UKG Pro Learning.
Amplemarket
Supercharge your outbound sales with AI-driven prospecting, multi-channel sequences, and smart lead scoring that closes deals.
Neon (Serverless PostgreSQL)
Manage serverless database infrastructure via Neon — spawn zero-copy branches, audit projects, and monitor compute endpoints.