Efficient Shapley Values Estimation by Amortization for Text Classification

AI-generated keywords: Shapley Values Neural Text Classification Amortized Model KernelSHAP Computation Time

AI-generated Key Points

  • Authors address the challenge of computing Shapley Values for large pretrained models in neural text classification
  • Computation of Shapley Values is time-consuming due to the large number of model evaluations required
  • Authors propose an amortized model that directly predicts each input feature's Shapley Value without additional model evaluations
  • Estimated Shapley Values are sensitive to random seed choices, especially for examples with longer input texts
  • Proposed amortized model provides stable estimations as the inference is deterministic
  • Experimental results show that the amortized model accurately estimates Shapley Values with a substantial speedup compared to traditional methods (60 times faster process)
  • Functionality of the model evaluated by examining quality of explanations in downstream tasks such as feature selection and domain calibration
  • Model compared with computationally expensive KernelSHAP (KS) method and demonstrates superior performance
  • Overall, paper presents an efficient and effective approach for estimating Shapley Values in neural text classification models
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Chenghao Yang, Fan Yin, He He, Kai-Wei Chang, Xiaofei Ma, Bing Xiang

ACL 2023 Camera Ready
License: CC BY 4.0

Abstract: Despite the popularity of Shapley Values in explaining neural text classification models, computing them is prohibitive for large pretrained models due to a large number of model evaluations. In practice, Shapley Values are often estimated with a small number of stochastic model evaluations. However, we show that the estimated Shapley Values are sensitive to random seed choices -- the top-ranked features often have little overlap across different seeds, especially on examples with longer input texts. This can only be mitigated by aggregating thousands of model evaluations, which on the other hand, induces substantial computational overheads. To mitigate the trade-off between stability and efficiency, we develop an amortized model that directly predicts each input feature's Shapley Value without additional model evaluations. It is trained on a set of examples whose Shapley Values are estimated from a large number of model evaluations to ensure stability. Experimental results on two text classification datasets demonstrate that our amortized model estimates Shapley Values accurately with up to 60 times speedup compared to traditional methods. Furthermore, the estimated values are stable as the inference is deterministic. We release our code at https://github.com/yangalan123/Amortized-Interpretability.

Submitted to arXiv on 31 May. 2023

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2305.19998v1

In this paper, the authors address the challenge of computing Shapley Values for large pretrained models in neural text classification. They highlight that while Shapley Values are popular for explaining these models, their computation is time-consuming due to the large number of model evaluations required. To overcome this limitation, the authors propose an amortized model that directly predicts each input feature's Shapley Value without additional model evaluations. The authors demonstrate that estimated Shapley Values are sensitive to random seed choices, especially for examples with longer input texts. This lack of stability can only be mitigated by aggregating thousands of model evaluations, which introduces significant computational overheads. In contrast, their proposed amortized model provides stable estimations as the inference is deterministic. Experimental results on two text classification datasets show that the amortized model accurately estimates Shapley Values with a substantial speedup compared to traditional methods. The computation time per instance is reduced from about 3.47 seconds to less than 50 milliseconds, resulting in a 60 times faster process. Furthermore, the authors evaluate the functionality of their model by examining the quality of explanations in downstream tasks such as feature selection and domain calibration. They compare their method with the computationally expensive KernelSHAP (KS) method and demonstrate superior performance. Overall, this paper presents an efficient and effective approach for estimating Shapley Values in neural text classification models. The proposed amortized model significantly reduces computation time while providing stable estimations, making it a valuable tool for interpreting and understanding these models more effectively and efficiently than before.
Created on 03 Jul. 2023

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.