Action Centered Contextual Bandits

AI-generated keywords: Contextual Bandits, Mobile Health, Linear Model, Baseline Reward, Treatment Effect

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • The paper explores the use of contextual bandits in mobile health applications
  • Contextual bandits provide a middle ground between simple multi-armed bandit approaches and complex reinforcement learning methods
  • They have been successful in web applications; linear models in particular are preferred because they are interpretable, simple to implement and debug, and carry strong performance guarantees when the linear model holds
  • However, the time-invariant linear model assumption is untenable in emerging mobile health applications
  • The authors propose an extension of the linear model for contextual bandits that consists of two parts: baseline reward and treatment effect
  • The theory presented in the paper is supported by experiments conducted on data gathered from a recent mobile health study
  • This paper contributes to advancing contextual bandit algorithms for mobile health applications by accommodating nonlinearity in baseline modeling while preserving strong performance guarantees similar to those offered by linear models.
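
The two-part reward model named in the key points can be written compactly. The notation below is an assumption made for illustration (binary action, linear treatment effect in the context); this page is generated from the paper's metadata only, so the symbols are not taken from the article itself.

```latex
% Assumed notation (hypothetical, for illustration only):
%   s_t      : context observed at time t
%   a_t      : action at time t, with a_t = 0 meaning "no treatment"
%   f_t(s_t) : baseline reward, allowed to be complex, nonlinear, and time-varying
%   \theta   : parameter of the simple (here linear) treatment effect
\[
  r_t \;=\; \underbrace{f_t(s_t)}_{\text{baseline reward}}
  \;+\; \underbrace{a_t \, s_t^{\top}\theta}_{\text{treatment effect}}
  \;+\; \epsilon_t ,
  \qquad a_t \in \{0, 1\}.
\]
```

Under this reading, only the treatment-effect term depends on the action, so choosing a good action requires learning \(\theta\) but not the baseline \(f_t\).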

Authors: Kristjan Greenewald, Ambuj Tewari, Predrag Klasnja, Susan Murphy

to appear at NIPS 2017

Abstract: Contextual bandits have become popular as they offer a middle ground between very simple approaches based on multi-armed bandits and very complex approaches using the full power of reinforcement learning. They have demonstrated success in web applications and have a rich body of associated theoretical guarantees. Linear models are well understood theoretically and preferred by practitioners because they are not only easily interpretable but also simple to implement and debug. Furthermore, if the linear model is true, we get very strong performance guarantees. Unfortunately, in emerging applications in mobile health, the time-invariant linear model assumption is untenable. We provide an extension of the linear model for contextual bandits that has two parts: baseline reward and treatment effect. We allow the former to be complex but keep the latter simple. We argue that this model is plausible for mobile health applications. At the same time, it leads to algorithms with strong performance guarantees as in the linear model setting, while still allowing for complex nonlinear baseline modeling. Our theory is supported by experiments on data gathered in a recently concluded mobile health study.
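
To make the baseline/treatment-effect split concrete, here is a minimal simulation sketch. It is not the authors' algorithm: it assumes a binary action randomized with a known probability, a hypothetical nonlinear baseline, and a linear treatment effect, and it compares a naive least-squares fit with one that centers the action by its randomization probability. All names (`simulate`, `naive_estimate`, `centered_estimate`) and all numeric values are illustrative assumptions.

```python
import numpy as np


def simulate(n=200_000, seed=0):
    """Draw synthetic data: reward = nonlinear baseline + action * linear effect."""
    rng = np.random.default_rng(seed)
    theta = np.array([0.5, -1.0, 0.25])      # hypothetical true treatment effect
    x = rng.normal(size=(n, theta.size))     # contexts
    pi = rng.uniform(0.2, 0.8, size=n)       # known randomization probabilities
    a = (rng.random(n) < pi).astype(float)   # binary action, P(a = 1) = pi
    # Hypothetical baseline: nonlinear in the context, unaffected by the action.
    baseline = 2.0 * np.sin(2.0 * x[:, 0]) + 0.5 * x[:, 1] ** 2
    noise = rng.normal(scale=0.1, size=n)
    r = baseline + a * (x @ theta) + noise
    return x, pi, a, r, theta


def naive_estimate(x, a, r):
    """Least squares of the reward on a * x, ignoring the baseline entirely."""
    z = a[:, None] * x
    return np.linalg.lstsq(z, r, rcond=None)[0]


def centered_estimate(x, pi, a, r):
    """Least squares of the reward on (a - pi) * x.

    Given the context, a - pi has mean zero, so the regressor is
    uncorrelated with any baseline that depends on the context alone.
    """
    z = (a - pi)[:, None] * x
    return np.linalg.lstsq(z, r, rcond=None)[0]


if __name__ == "__main__":
    x, pi, a, r, theta = simulate()
    print("true theta:       ", theta)
    print("naive estimate:   ", naive_estimate(x, a, r))
    print("centered estimate:", centered_estimate(x, pi, a, r))
```

In this toy setting the naive fit absorbs part of the baseline into the treatment-effect estimate, while the centered fit does not need a model of the baseline at all. That is the intuition behind keeping the treatment effect simple while leaving the baseline unrestricted.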

Submitted to arXiv on 09 Nov. 2017

Results of the summarizing process for the arXiv paper: 1711.03596v1

The paper "Action Centered Contextual Bandits" by Kristjan Greenewald, Ambuj Tewari, Predrag Klasnja, and Susan Murphy explores the use of contextual bandits in mobile health applications. Contextual bandits provide a middle ground between simple multi-armed bandit approaches and complex reinforcement learning methods. They have been successful in web applications, and linear models in particular are favored because they are interpretable, simple to implement and debug, and carry strong performance guarantees when the linear model assumption holds. In emerging mobile health applications, however, the time-invariant linear model assumption is untenable. To address this limitation, the authors propose an extension of the linear model for contextual bandits that consists of two parts: a baseline reward and a treatment effect. The baseline reward is allowed to be complex and nonlinear, while the treatment effect is kept simple. The theory is supported by experiments on data gathered in a recently concluded mobile health study. The paper thereby advances contextual bandit algorithms for mobile health by accommodating nonlinear baseline modeling while preserving strong performance guarantees like those of the linear model setting.
Created on 19 Dec. 2023
