Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm

AI-generated keywords: Artificial Intelligence Chess Reinforcement Learning AlphaZero Tabula Rasa

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • Paper titled "Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm"
  • Authors: David Silver, Thomas Hubert, Julian Schrittwieser, Ioannis Antonoglou, Matthew Lai, Arthur Guez, Marc Lanctot, Laurent Sifre, Dharshan Kumaran,Thore Graepel,Timothy Lillicrap,Karen Simonyan,and Demis Hassabis
  • Discusses the success of artificial intelligence in chess through reinforcement learning algorithms like AlphaZero
  • AlphaZero achieved superhuman performance in chess without prior knowledge through self-play
  • Demonstrated excellence in other complex domains such as shogi and Go within 24 hours
  • Showcases the power of tabula rasa approaches in AI capabilities without domain-specific expertise or pre-existing data
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: David Silver, Thomas Hubert, Julian Schrittwieser, Ioannis Antonoglou, Matthew Lai, Arthur Guez, Marc Lanctot, Laurent Sifre, Dharshan Kumaran, Thore Graepel, Timothy Lillicrap, Karen Simonyan, Demis Hassabis

Abstract: The game of chess is the most widely-studied domain in the history of artificial intelligence. The strongest programs are based on a combination of sophisticated search techniques, domain-specific adaptations, and handcrafted evaluation functions that have been refined by human experts over several decades. In contrast, the AlphaGo Zero program recently achieved superhuman performance in the game of Go, by tabula rasa reinforcement learning from games of self-play. In this paper, we generalise this approach into a single AlphaZero algorithm that can achieve, tabula rasa, superhuman performance in many challenging domains. Starting from random play, and given no domain knowledge except the game rules, AlphaZero achieved within 24 hours a superhuman level of play in the games of chess and shogi (Japanese chess) as well as Go, and convincingly defeated a world-champion program in each case.

Submitted to arXiv on 05 Dec. 2017

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 1712.01815v1

This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

The paper "Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm," authored by David Silver, Thomas Hubert, Julian Schrittwieser, Ioannis Antonoglou, Matthew Lai, Arthur Guez, Marc Lanctot, Laurent Sifre, Dharshan Kumaran,Thore Graepel,Timothy Lillicrap,Karen Simonyan,and Demis Hassabis,discusses the groundbreaking success of artificial intelligence in the domain of chess. Chess has been extensively studied in AI history and the best programs have relied on advanced search techniques and manually crafted evaluation functions. However,the authors highlight the transformative potential of reinforcement learning algorithms like AlphaZero which achieved superhuman performance in chess through self-play without any prior knowledge. This approach was further demonstrated to excel in other complex domains such as shogi (Japanese chess) and Go within 24 hours. The study showcases the power of tabula rasa approaches like AlphaZero in pushing the boundaries of AI capabilities and achieving remarkable proficiency without relying on domain-specific expertise or pre-existing data. , , , , .
Created on 14 Jul. 2024

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.