blazickjp/arxiv-mcp-server
arxiv-mcp-server is hosted online, so all tools can be tested directly either in theInspector tabor in theOnline Client.
If you are the rightful owner of arxiv-mcp-server and would like to certify it and/or have it hosted online, please leave a comment on the right or send an email to henry@mcphub.com.
The ArXiv MCP Server provides a bridge between AI assistants and arXiv's research repository through the Model Context Protocol (MCP).
Try arxiv-mcp-server with chat:
Has a README
Github repo has a README.md.
Has a License
Github repo has license info.
Server can be inspected
View server inspector
Server schema can be extracted
Can get at lease one tool info from the README or server.
Online hosted on MCPHub
Can be automatically deployed by MCPHub.
Has social accounts
Has X or Discord account.
Claimed by the author or certified by MCPHub
If you are the author, claim authorship
AI Evaluation ReportTotal Score: 6/10
The agent demonstrates strong capabilities in searching for academic papers on arXiv with specific criteria, as evidenced by its successful retrieval of papers authored by Yoshua Bengio and those in the 'astro-ph.CO' category. It also effectively lists available downloaded papers. However, the agent struggles with downloading papers due to tool malfunctions and inaccurately identifies papers with specific keywords in their titles, as seen in the 'cs.AI' category search. These inconsistencies highlight a need for improved accuracy and reliability in certain operations.
Test case 1
Score: 9/10Perform the operation of searching for academic papers on arXiv authored by 'Yoshua Bengio' in the 'cs.LG' (Machine Learning) category, published between January 1, 2024, and August 2, 2025. Retrieve the titles and abstracts of the top 5 most recent papers.I found 5 recent papers authored by Yoshua Bengio in the Machine Learning category (cs.LG), published between January 1, 2024, and August 2, 2025. Here are the titles and abstracts of these papers:
- Title: Expert-Guided LLM Reasoning for Battery Discovery: From AI-Driven Hypothesis to Synthesis and Characterization Abstract: Large language models (LLMs) leverage chain-of-thought (CoT) techniques to tackle complex problems, representing a transformative breakthrough in artificial intelligence (AI). However, their reasoning capabilities have primarily been demonstrated in solving math and coding problems, leaving their potential for domain-specific applicationsâsuch as battery discoveryâlargely unexplored. Inspired by the idea that reasoning mirrors a form of guided search, we introduce ChatBattery, a novel agentic framework that integrates domain knowledge to steer LLMs toward more effective reasoning in materials design. Using ChatBattery, we successfully identify, synthesize, and characterize three novel lithium-ion battery cathode materials, which achieve practical capacity improvements of 28.8%, 25.2%, and 18.5%, respectively, over the widely used cathode material, LiNi0.8Mn0.1Co0.1O2 (NMC811). This complete AI-driven cycleâfrom design to synthesis to characterizationâdemonstrates the transformative potential of AI-driven reasoning in revolutionizing materials discovery.
- Title: Torsional-GFN: a conditional conformation generator for small molecules Abstract: Generating stable molecular conformations is crucial in several drug discovery applications, such as estimating the binding affinity of a molecule to a target. Recently, generative machine learning methods have emerged as a promising, more efficient method than molecular dynamics for sampling of conformations from the Boltzmann distribution. In this paper, we introduce Torsional-GFN, a conditional GFlowNet specifically designed to sample conformations of molecules proportionally to their Boltzmann distribution, using only a reward function as training signal. Our results demonstrate that Torsional-GFN is able to sample conformations approximately proportional to the Boltzmann distribution for multiple molecules with a single model, achieving zero-shot generalization to unseen bond lengths and angles.
- Title: Chain of Thought Monitorability: A New and Fragile Opportunity for AI Safety Abstract: AI systems that "think" in human language offer a unique opportunity for AI safety: we can monitor their chains of thought (CoT) for the intent to misbehave. Like all other known AI oversight methods, CoT monitoring is imperfect and allows some misbehavior to go unnoticed. Nevertheless, it shows promise and we recommend further research into CoT monitorability and investment in CoT monitoring alongside existing safety methods.
- Title: RL for Reasoning by Adaptively Revealing Rationales Abstract: We propose that reinforcement learning (RL) from partial expert demonstrations is not merely a training heuristic, but a promising framework for solving complex sequence generation tasks. Supervised fine-tuning (SFT) relies on dense ground-truth labels, which become increasingly costly as sequence length grows. We address this by introducing adaptive backtracking (AdaBack), a per-sample curriculum learning algorithm that reveals only a partial prefix of the target output during training.
- Title: The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity Abstract: Recent generations of language models have introduced Large Reasoning Models (LRMs) that generate detailed thinking processes before providing answers. While these models demonstrate improved performance on reasoning benchmarks, their fundamental capabilities, scaling properties, and limitations remain insufficiently understood. This work systematically investigates these gaps with the help of controllable puzzle environments that allow precise manipulation of complexity while maintaining consistent logical structures.
Test case 2
Score: 2/10Perform the operation of downloading the paper with arXiv ID '2106.04584' from arXiv and provide the title and abstract of the paper.No valid answer is generated due to tool malfunction. The tool blazickjp/arxiv-mcp-server is currently in the process of converting the paper and has not yet completed. The response indicates that the conversion is still ongoing, preventing the extraction of the title and abstract at this time.
Test case 3
Score: 9/10Perform the operation of searching for academic papers on arXiv in the 'astro-ph.CO' (Cosmology and Nongalactic Astrophysics) category, published between January 1, 2023, and August 2, 2025, that contain the keyword 'dark matter' in their titles. Retrieve the titles and abstracts of the top 3 most recent papers.I found three recent papers published in the 'astro-ph.CO' category that contain the keyword 'dark matter' in their titles. Here are the details:
- Title: High-redshift Galaxies from JWST Observations in More Realistic Dark Matter Halo Models Authors: Saeed Fakhry, Reyhaneh Vojoudi Salmani, Javad T. Firouzjaee Abstract: The James Webb Space Telescope (JWST) has unveiled unexpectedly massive galaxy candidates at high redshifts, challenging standard ÎCDM cosmological predictions. This work studies the predictions of more realistic dark matter halo models combined with modified matter power spectra for interpreting JWST observations of high-redshift galaxies. The analysis reveals significant improvements in agreement with observations, highlighting the importance of small-scale structure formation physics in early cosmic epochs. Published on: July 31, 2025
- Title: Gamma-Rays and Gravitational Waves from Inelastic Higgs Portal Dark Matter Authors: Dan Hooper, Gordan Krnjaic, Duncan Rocha, Subhojit Roy Abstract: This paper explores a predictive dark matter scenario involving a complex scalar field coupled to the Higgs portal. It discusses the implications of this model for the Galactic Center gamma-ray excess and its potential influence on the Higgs potential, which may lead to a strong first-order electroweak phase transition in the early universe, possibly producing a stochastic gravitational wave background. Published on: July 30, 2025
- Title: From Capture to Collapse: Revisiting Black Hole formation by Fermionic Asymmetric Dark Matter in Neutron Stars Authors: Sandra Robles, Drona Vatsyayan, Giorgio Busoni Abstract: This paper revisits the capture of fermionic asymmetric dark matter (ADM) in neutron stars and its potential to trigger gravitational collapse into black holes. It critically examines previous constraints on ADM properties and discusses the implications of more accurate treatments of dark matter capture and black hole formation, suggesting that previous results can be relaxed by several orders of magnitude. Published on: July 30, 2025
Test case 4
Score: 9/10Perform the operation of listing all available papers that have been downloaded and stored as resources. Provide the titles and arXiv IDs of these papers.The following paper has been downloaded and stored as a resource:
- Title: Inelastic Dark Matter at the Fermilab Short Baseline Neutrino Program
arXiv ID: 2106.04584v1
Summary: This paper studies the sensitivity of the Fermilab Short-Baseline Neutrino (SBN) experiments, MicroBooNE, ICARUS, and SBND, to MeV- to GeV-scale inelastic dark matter interacting through a dark photon mediator. It explores scenarios of light thermal dark matter, focusing on accelerator-based searches and the signatures of scattering and decay of heavy excited dark states in SBN detectors. For further reading, you can access the paper here.
- Title: Inelastic Dark Matter at the Fermilab Short Baseline Neutrino Program
Test case 5
Score: 2/10Perform the operation of searching for academic papers on arXiv in the 'cs.AI' (Artificial Intelligence) category, published between January 1, 2024, and August 2, 2025, that contain the keyword 'reinforcement learning' in their titles. Retrieve the titles and abstracts of the top 3 most recent papers.I found three recent academic papers in the 'cs.AI' category that include 'reinforcement learning' in their titles, published between January 1, 2024, and August 2, 2025. Here are the details:
- Title: Distributed AI Agents for Cognitive Underwater Robot Autonomy Authors: Markus Buchholz, Ignacio Carlucho, Michele Grimaldi, Yvan R. Petillot Abstract: Achieving robust cognitive autonomy in robots navigating complex, unpredictable environments remains a fundamental challenge in robotics. This paper presents Underwater Robot Self-Organizing Autonomy (UROSA), a groundbreaking architecture leveraging distributed Large Language Model AI agents integrated within the Robot Operating System 2 (ROS 2) framework to enable advanced cognitive capabilities in Autonomous Underwater Vehicles. UROSA decentralizes cognition into specialized AI agents responsible for multimodal perception, adaptive reasoning, dynamic mission planning, and real-time decision-making. Central innovations include flexible agents dynamically adapting their roles, retrieval-augmented generation utilizing vector databases for efficient knowledge management, reinforcement learning-driven behavioral optimization, and autonomous on-the-fly ROS 2 node generation for runtime functional extensibility. Extensive empirical validation demonstrates UROSA's promising adaptability and reliability through realistic underwater missions in simulation and real-world deployments.
- Title: Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving Authors: Luoxin Chen, Jinming Gu, Liankai Huang, Wenhao Huang, Zhicheng Jiang, Allan Jie, Xiaoran Jin, Xing Jin, Chenggang Li, Kaijing Ma, Cheng Ren, Jiawei Shen, Wenlei Shi, Tong Sun, He Sun, Jiahui Wang, Siran Wang, Zhihong Wang, Chenrui Wei, Shufa Wei, Yonghui Wu, Yuchen Wu, Yihang Xia, Huajian Xin, Fan Yang, Huaiyuan Ying, Hongyi Yuan, Zheng Yuan, Tianyang Zhan, Chi Zhang, Yue Zhang, Ge Zhang, Tianyun Zhao, Jianqiu Zhao, Yichi Zhou, Thomas Hanwen Zhu Abstract: LLMs have demonstrated strong mathematical reasoning abilities by leveraging reinforcement learning with long chain-of-thought, yet they continue to struggle with theorem proving due to the lack of clear supervision signals when solely using natural language. Dedicated domain-specific languages like Lean provide clear supervision via formal verification of proofs, enabling effective training through reinforcement learning. In this work, we propose Seed-Prover, a lemma-style whole-proof reasoning model. Seed-Prover can iteratively refine its proof based on Lean feedback, proved lemmas, and self-summarization.
- Title: Scalable Multi-Task Reinforcement Learning for Generalizable Spatial Intelligence in Visuomotor Agents Authors: Shaofei Cai, Zhancun Mu, Haiwen Xia, Bowei Zhang, Anji Liu, Yitao Liang Abstract: While Reinforcement Learning (RL) has achieved remarkable success in language modeling, its triumph hasn’t yet fully translated to visuomotor agents. A primary challenge in RL models is their tendency to overfit specific tasks or environments, thereby hindering the acquisition of generalizable behaviors across diverse settings. This paper provides a preliminary answer to this challenge by demonstrating that RL-finetuned visuomotor agents in Minecraft can achieve zero-shot generalization to unseen worlds.