Are AlphaZero-like Agents Robust to Adversarial Perturbations?

AI-generated keywords: Adversarial Perturbations AlphaZero Go AI Robustness Complex Games

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • AlphaZero has demonstrated that neural-network-based Go AIs can surpass human performance by a large margin
  • Researchers have raised concerns about whether these agents are robust to adversarial perturbations
  • Li-Cheng Lan and colleagues investigate the existence of adversarial states in Go AIs that may lead them to play surprisingly wrong actions
  • Adversarial state is one that leads to an undoubtedly inferior action that is obvious even for Go beginners
  • The authors develop the first adversarial attack on Go AIs which can efficiently search for adversarial states by strategically reducing the search space
  • Both Policy-Value neural network (PV-NN) and Monte Carlo tree search (MCTS) can be misled by adding one or two meaningless stones
  • 90% of examples indeed lead the AI agent to play an obviously inferior action when evaluated with amateur human Go players
  • This study highlights potential vulnerabilities in current AI systems' robustness when faced with adversarial perturbations in complex games like Go.
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Li-Cheng Lan, Huan Zhang, Ti-Rong Wu, Meng-Yu Tsai, I-Chen Wu, Cho-Jui Hsieh

Accepted by Neurips 2022

Abstract: The success of AlphaZero (AZ) has demonstrated that neural-network-based Go AIs can surpass human performance by a large margin. Given that the state space of Go is extremely large and a human player can play the game from any legal state, we ask whether adversarial states exist for Go AIs that may lead them to play surprisingly wrong actions. In this paper, we first extend the concept of adversarial examples to the game of Go: we generate perturbed states that are ``semantically'' equivalent to the original state by adding meaningless moves to the game, and an adversarial state is a perturbed state leading to an undoubtedly inferior action that is obvious even for Go beginners. However, searching the adversarial state is challenging due to the large, discrete, and non-differentiable search space. To tackle this challenge, we develop the first adversarial attack on Go AIs that can efficiently search for adversarial states by strategically reducing the search space. This method can also be extended to other board games such as NoGo. Experimentally, we show that the actions taken by both Policy-Value neural network (PV-NN) and Monte Carlo tree search (MCTS) can be misled by adding one or two meaningless stones; for example, on 58\% of the AlphaGo Zero self-play games, our method can make the widely used KataGo agent with 50 simulations of MCTS plays a losing action by adding two meaningless stones. We additionally evaluated the adversarial examples found by our algorithm with amateur human Go players and 90\% of examples indeed lead the Go agent to play an obviously inferior action. Our code is available at \url{https://PaperCode.cc/GoAttack}.

Submitted to arXiv on 07 Nov. 2022

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2211.03769v1

This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

In recent years, the success of AlphaZero (AZ) has demonstrated that neural-network-based Go AIs can surpass human performance by a large margin. However, researchers have raised concerns about whether these agents are robust to adversarial perturbations. In this paper titled "Are AlphaZero-like Agents Robust to Adversarial Perturbations? ", Li-Cheng Lan and colleagues investigate the existence of adversarial states in Go AIs that may lead them to play surprisingly wrong actions. They extend the concept of adversarial examples to the game of Go by generating perturbed states that are semantically equivalent to the original state by adding meaningless moves to the game. An adversarial state is one that leads to an undoubtedly inferior action that is obvious even for Go beginners. The authors acknowledge that searching for such states is challenging due to the large, discrete, and non-differentiable search space in Go. To tackle this challenge, they develop the first adversarial attack on Go AIs which can efficiently search for adversarial states by strategically reducing the search space. This method can also be extended to other board games such as NoGo. Experimentally, they show that both Policy-Value neural network (PV-NN) and Monte Carlo tree search (MCTS) can be misled by adding one or two meaningless stones. For example, on 58% of AlphaGo Zero self-play games their method can make KataGo agent with 50 simulations of MCTS plays a losing action by adding two meaningless stones. The authors further evaluated their algorithm's found adversarial examples with amateur human Go players and found that 90% of examples indeed lead the AI agent to play an obviously inferior action. Overall, this study highlights potential vulnerabilities in current AI systems' robustness when faced with adversarial perturbations in complex games like Go. The authors provide their code at \url{https://PaperCode.cc/GoAttack} for further exploration and development. The findings of this study have implications for developing more robust AI systems in complex games and other domains.
Created on 27 Apr. 2023

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

The license of this specific paper does not allow us to build upon its content and the summarizing tools will be run using the paper metadata rather than the full article. However, it still does a good job, and you can also try our tools on papers with more open licenses.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.