The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree Search

AI-generated keywords: Artificial Intelligence Scientific Discovery AI Scientist-v2 Autonomous Manuscripts Peer Review

AI-generated Key Points

  • Artificial Intelligence (AI) is revolutionizing scientific research
  • The AI Scientist-v2 autonomously generates peer-reviewed workshop papers without human-authored code templates
  • Progressive agentic tree-search methodology allows for hypothesis formulation, experiment design, data analysis, and manuscript authoring without human intervention
  • Integration of Vision-Language Model (VLM) feedback loop enhances content and aesthetics in figures
  • One AI-generated manuscript surpassed the average human acceptance threshold at a peer-reviewed ICLR workshop
  • Identified strengths include exploration of temporal consistency regularization and choice of synthetic arithmetic tasks; areas for improvement include clarifying descriptions, addressing omitted references, improving experimental evaluation with real-world tasks and longer sequences, and resolving dataset overlap issues
  • Generation process focused on core machine learning topics and real-world applications such as finance, psychology, agriculture, environmental science, and public health
  • The AI Scientist-v2 represents a significant advancement in autonomous scientific discovery technologies with open-sourced code to encourage further development
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Yutaro Yamada, Robert Tjarko Lange, Cong Lu, Shengran Hu, Chris Lu, Jakob Foerster, Jeff Clune, David Ha

License: CC BY 4.0

Abstract: AI is increasingly playing a pivotal role in transforming how scientific discoveries are made. We introduce The AI Scientist-v2, an end-to-end agentic system capable of producing the first entirely AI generated peer-review-accepted workshop paper. This system iteratively formulates scientific hypotheses, designs and executes experiments, analyzes and visualizes data, and autonomously authors scientific manuscripts. Compared to its predecessor (v1, Lu et al., 2024 arXiv:2408.06292), The AI Scientist-v2 eliminates the reliance on human-authored code templates, generalizes effectively across diverse machine learning domains, and leverages a novel progressive agentic tree-search methodology managed by a dedicated experiment manager agent. Additionally, we enhance the AI reviewer component by integrating a Vision-Language Model (VLM) feedback loop for iterative refinement of content and aesthetics of the figures. We evaluated The AI Scientist-v2 by submitting three fully autonomous manuscripts to a peer-reviewed ICLR workshop. Notably, one manuscript achieved high enough scores to exceed the average human acceptance threshold, marking the first instance of a fully AI-generated paper successfully navigating a peer review. This accomplishment highlights the growing capability of AI in conducting all aspects of scientific research. We anticipate that further advancements in autonomous scientific discovery technologies will profoundly impact human knowledge generation, enabling unprecedented scalability in research productivity and significantly accelerating scientific breakthroughs, greatly benefiting society at large. We have open-sourced the code at https://github.com/SakanaAI/AI-Scientist-v2 to foster the future development of this transformative technology. We also discuss the role of AI in science, including AI safety.

Submitted to arXiv on 10 Apr. 2025

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2504.08066v1

In the realm of scientific discovery, Artificial Intelligence (AI) is revolutionizing the way research is conducted. The AI Scientist-v2, an advanced agentic system, has been developed to autonomously generate peer-reviewed workshop papers. This system goes beyond its predecessor by eliminating the need for human-authored code templates and demonstrating effective generalization across various machine learning domains. One key feature of The AI Scientist-v2 is its progressive agentic tree-search methodology managed by a dedicated experiment manager agent. This allows the system to iteratively formulate hypotheses, design experiments, analyze data, and author scientific manuscripts without human intervention. Additionally, the integration of a Vision-Language Model (VLM) feedback loop enhances the AI reviewer component for iterative refinement of content and aesthetics in figures. To evaluate the capabilities of The AI Scientist-v2, three fully autonomous manuscripts were submitted to a peer-reviewed ICLR workshop. Remarkably, one manuscript surpassed the average human acceptance threshold, marking a significant milestone in AI-generated research successfully navigating peer review. This achievement underscores the increasing potential of AI in driving scientific breakthroughs and enhancing research productivity on a large scale. The authors identified strengths and weaknesses in their paper assessment. While they appreciated the exploration of temporal consistency regularization and the choice of synthetic arithmetic tasks for testing hypotheses, areas for improvement were noted. These included clarifying unclear descriptions, addressing omitted references, improving experimental evaluation with real-world tasks and longer sequences, and resolving issues related to dataset overlap. The generation process for the workshop-accepted paper involved idea generation phases focused on core machine learning topics and real-world applications such as finance, psychology, agriculture, environmental science, and public health. Three promising initial ideas were selected based on alignment with the workshop theme and potential interest. The system autonomously executed experimental pipelines using parallelized agentic tree search multiple times to produce high-quality manuscripts for submission. Overall, The AI Scientist-v2 represents a significant advancement in autonomous scientific discovery technologies. By open-sourcing their code to encourage further development in this transformative field, the authors anticipate that AI will continue to play a crucial role in accelerating scientific progress and benefiting society at large.
Created on 29 Aug. 2025

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.