Reinforcement Learning with Maskable Stock Representation for Portfolio Management in Customizable Stock Pools
AI-generated Key Points
- Portfolio management (PM) involves reallocating capital into different stocks to maximize long-term profits
- Reinforcement learning (RL) is a promising approach for training profitable agents for PM by interacting with financial markets
- Existing RL methods have focused on fixed stock pools, not customizable stock pools (CSPs)
- EarnMore framework utilizes Maskable Stock Representation to handle PM with CSPs through one-shot training in a global stock pool (GSP)
- EarnMore aims to improve performance and efficiency in PM with CSPs by masking out stocks outside the target pool, learning meaningful stock representations, and implementing re-weighting mechanisms
- EarnMore significantly outperformed 14 state-of-the-art baselines in experiments across two real US financial markets, showing over 40% improvement in profit
- EarnMore effectively met investors' preferences and decisions in the trading process by constructing customizable stock pools based on different investor preferences
- Ablation studies were conducted to assess the usefulness of each component of EarnMore and its efficiency in handling PM with CSPs
- EarnMore demonstrated generality and effectiveness across various scenarios and market conditions, presenting a promising approach for optimizing portfolio management strategies
Authors: Wentao Zhang
Abstract: Portfolio management (PM) is a fundamental financial trading task, which explores the optimal periodical reallocation of capitals into different stocks to pursue long-term profits. Reinforcement learning (RL) has recently shown its potential to train profitable agents for PM through interacting with financial markets. However, existing work mostly focuses on fixed stock pools, which is inconsistent with investors' practical demand. Specifically, the target stock pool of different investors varies dramatically due to their discrepancy on market states and individual investors may temporally adjust stocks they desire to trade (e.g., adding one popular stocks), which lead to customizable stock pools (CSPs). Existing RL methods require to retrain RL agents even with a tiny change of the stock pool, which leads to high computational cost and unstable performance. To tackle this challenge, we propose EarnMore, a rEinforcement leARNing framework with Maskable stOck REpresentation to handle PM with CSPs through one-shot training in a global stock pool (GSP). Specifically, we first introduce a mechanism to mask out the representation of the stocks outside the target pool. Second, we learn meaningful stock representations through a self-supervised masking and reconstruction process. Third, a re-weighting mechanism is designed to make the portfolio concentrate on favorable stocks and neglect the stocks outside the target pool. Through extensive experiments on 8 subset stock pools of the US stock market, we demonstrate that EarnMore significantly outperforms 14 state-of-the-art baselines in terms of 6 popular financial metrics with over 40% improvement on profit.
Ask questions about this paper to our AI assistant
You can also chat with multiple papers at once here.
Assess the quality of the AI-generated content by voting
Score: 0
Why do we need votes?
Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.
Similar papers summarized with our AI tools
Navigate through even more similar papers through a
tree representationLook for similar papers (in beta version)
By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.
Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.