On the Robustness of Explanations of Deep Neural Network Models: A Survey

AI-generated keywords: Explainability, Deep Neural Network (DNN), Attributional Attack, Robustness, Responsible Use

AI-generated Key Points

The paper's license does not allow building upon its content, so the key points below are generated from the paper's metadata rather than the full article.

  • Responsible and trustworthy use of machine learning models requires explainability
  • Deep Neural Network (DNN) models are increasingly used in risk-sensitive and safety-critical domains
  • Many methods have been proposed to explain the decisions made by DNN models
  • However, explanations can be distorted or attacked by minor input perturbations
  • There has been no effort to assimilate the different methods and metrics proposed to study the robustness of explanations of DNN models
  • The paper titled "On the Robustness of Explanations of Deep Neural Network Models: A Survey" presents a comprehensive survey of methods that study, understand, attack, and defend explanations of DNN models
  • The paper also provides a detailed review of different metrics used to evaluate explanation methods while describing attributional attack and defense methods
  • The authors conclude with lessons and takeaways for the community towards ensuring robust explanations of DNN model predictions

Authors: Amlan Jyoti, Karthik Balaji Ganesh, Manoj Gayala, Nandita Lakshmi Tunuguntla, Sandesh Kamath, Vineeth N Balasubramanian

Under review at ACM Computing Surveys, "Special Issue on Trustworthy AI"
License: CC BY-NC-ND 4.0

Abstract: Explainability has been widely stated as a cornerstone of the responsible and trustworthy use of machine learning models. With the ubiquitous use of Deep Neural Network (DNN) models expanding to risk-sensitive and safety-critical domains, many methods have been proposed to explain the decisions of these models. Recent years have also seen concerted efforts that have shown how such explanations can be distorted (attacked) by minor input perturbations. While there have been many surveys that review explainability methods themselves, there has been no effort hitherto to assimilate the different methods and metrics proposed to study the robustness of explanations of DNN models. In this work, we present a comprehensive survey of methods that study, understand, attack, and defend explanations of DNN models. We also present a detailed review of different metrics used to evaluate explanation methods, as well as describe attributional attack and defense methods. We conclude with lessons and take-aways for the community towards ensuring robust explanations of DNN model predictions.
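
To make the abstract's notion of an explanation being distorted by a minor input perturbation concrete, the following is a minimal sketch and not a method taken from the paper. It assumes a toy one-hidden-layer ReLU network with random weights, uses the absolute input gradient as the attribution map, and scores stability with a top-k overlap between the maps of an input and a slightly perturbed copy. All function names, dimensions, and the perturbation scale are illustrative assumptions.

```python
import numpy as np

def relu_net_forward(x, W1, b1, w2):
    """Scalar output of a toy one-hidden-layer ReLU network (assumed model)."""
    h = np.maximum(0.0, W1 @ x + b1)
    return float(w2 @ h)

def gradient_saliency(x, W1, b1, w2):
    """Absolute gradient of the scalar output with respect to the input."""
    active = (W1 @ x + b1 > 0).astype(float)  # derivative of ReLU per hidden unit
    grad = W1.T @ (w2 * active)               # chain rule through the hidden layer
    return np.abs(grad)

def topk_intersection(a, b, k):
    """Fraction of the k highest-scoring features shared by maps a and b."""
    top_a = set(np.argsort(a)[-k:].tolist())
    top_b = set(np.argsort(b)[-k:].tolist())
    return len(top_a & top_b) / k

# Hypothetical setup: random weights and a random input, fixed by a seed.
rng = np.random.default_rng(0)
d, hidden, k = 20, 64, 5
W1 = rng.normal(size=(hidden, d))
b1 = rng.normal(size=hidden)
w2 = rng.normal(size=hidden)

x = rng.normal(size=d)
delta = 0.05 * rng.normal(size=d)  # small perturbation (scale is an assumption)

sal_clean = gradient_saliency(x, W1, b1, w2)
sal_pert = gradient_saliency(x + delta, W1, b1, w2)
overlap = topk_intersection(sal_clean, sal_pert, k)

print("output change:", abs(relu_net_forward(x + delta, W1, b1, w2)
                            - relu_net_forward(x, W1, b1, w2)))
print("top-k overlap of attribution maps:", overlap)
```

An overlap of 1.0 means the k most important features are unchanged by the perturbation, while lower values mean the explanation has shifted. An attributional attack would choose the perturbation deliberately to push this overlap down, rather than drawing it at random as this sketch does.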

Submitted to arXiv on 09 Nov. 2022


Results of the summarizing process for the arXiv paper: 2211.04780v1

Because the paper's license does not allow building upon its content, this summary is generated from the paper's metadata rather than the full article.

Explainability is widely recognized as a cornerstone of the responsible and trustworthy use of machine learning models. With Deep Neural Network (DNN) models increasingly used in risk-sensitive and safety-critical domains, many methods have been proposed to explain the decisions these models make. However, recent work has shown that such explanations can be distorted (attacked) by minor input perturbations. While many surveys review explainability methods themselves, no prior effort has assimilated the different methods and metrics proposed to study the robustness of explanations of DNN models. In "On the Robustness of Explanations of Deep Neural Network Models: A Survey," authors Amlan Jyoti, Karthik Balaji Ganesh, Manoj Gayala, Nandita Lakshmi Tunuguntla, Sandesh Kamath, and Vineeth N Balasubramanian present a comprehensive survey of methods that study, understand, attack, and defend explanations of DNN models. The paper also provides a detailed review of the metrics used to evaluate explanation methods and describes attributional attack and defense methods. The authors conclude with lessons and takeaways for the community towards ensuring robust explanations of DNN model predictions. Overall, this work highlights the importance of understanding the robustness of explanations of DNN models to ensure their responsible and trustworthy use in various domains.
Created on 11 Apr. 2023
