Fine-Tuning Large Language Models for Stock Return Prediction Using Newsflow

AI-generated keywords: Quantitative investing; Large language models; Stock return forecasting; Financial newsflow; Portfolio construction

AI-generated Key Points

  • Quantitative investing study focuses on using large language models (LLMs) for stock return forecasting with financial newsflow
  • Importance of return forecasting in quantitative portfolio construction for tasks like stock picking and portfolio optimization
  • Direct incorporation of LLMs to model relationship between text representations and future stock returns, rather than traditional feature extraction and validation
  • Impact of two LLM design choices on return forecasting investigated: encoder-only vs. decoder-only LLMs, and bottleneck vs. aggregated representations
  • Aggregated representations from LLMs' token-level embeddings generally enhance performance of long-only and long-short portfolios
  • Decoder LLMs tend to lead to stronger portfolios in larger investment universes; Mistral shows more robust performance across different universes compared to DeBERTa and Llama
  • Return predictions from LLMs' text representations outperform conventional sentiment scores for portfolio construction
  • Open questions raised for future research include underperformance of encoder-only DeBERTa in large investment universes and performance variations among decoder-only models like Mistral and Llama across different universes

Authors: Tian Guo, Emmanuel Hauptmann

arXiv: 2407.18103v1 - DOI (q-fin.CP)
License: CC BY 4.0

Abstract: Large language models (LLMs) and their fine-tuning techniques have demonstrated superior performance in various language understanding and generation tasks. This paper explores fine-tuning LLMs for stock return forecasting with financial newsflow. In quantitative investing, return forecasting is fundamental for subsequent tasks like stock picking, portfolio optimization, etc. We formulate the model to include text representation and forecasting modules. We propose to compare the encoder-only and decoder-only LLMs, considering they generate text representations in distinct ways. The impact of these different representations on forecasting performance remains an open question. Meanwhile, we compare two simple methods of integrating LLMs' token-level representations into the forecasting module. The experiments on real news and investment universes reveal that: (1) aggregated representations from LLMs' token-level embeddings generally produce return predictions that enhance the performance of long-only and long-short portfolios; (2) in the relatively large investment universe, the decoder LLMs-based prediction model leads to stronger portfolios, whereas in the small universes, there are no consistent winners. Among the three LLMs studied (DeBERTa, Mistral, Llama), Mistral performs more robustly across different universes; (3) return predictions derived from LLMs' text representations are a strong signal for portfolio construction, outperforming conventional sentiment scores.
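
The abstract contrasts two ways of turning token-level embeddings into a single vector for the forecasting module. The snippet below is a minimal sketch, not the authors' implementation. It assumes that an "aggregated" representation is a mean over token embeddings and that a "bottleneck" representation is the embedding of one designated final token; the paper's exact definitions may differ. The toy embeddings, the function names, and the linear head `predict_return` are all hypothetical.

```python
from typing import List

Vector = List[float]


def bottleneck_representation(token_embeddings: List[Vector]) -> Vector:
    """Use the final token's embedding as the sequence vector (assumed reading)."""
    if not token_embeddings:
        raise ValueError("need at least one token embedding")
    return list(token_embeddings[-1])


def aggregated_representation(token_embeddings: List[Vector]) -> Vector:
    """Average the token embeddings dimension by dimension (assumed reading)."""
    if not token_embeddings:
        raise ValueError("need at least one token embedding")
    n = len(token_embeddings)
    dim = len(token_embeddings[0])
    return [sum(tok[d] for tok in token_embeddings) / n for d in range(dim)]


def predict_return(representation: Vector, weights: Vector, bias: float = 0.0) -> float:
    """Hypothetical linear forecasting head: text vector -> predicted return."""
    return sum(w * x for w, x in zip(weights, representation)) + bias


# Toy 3-token, 2-dimensional embeddings standing in for LLM outputs.
tokens = [[1.0, 0.0], [0.0, 2.0], [2.0, 4.0]]
weights = [0.5, 0.25]  # illustrative values, not fitted parameters

bottleneck = bottleneck_representation(tokens)
aggregated = aggregated_representation(tokens)

print("bottleneck:", bottleneck, "->", predict_return(bottleneck, weights))
print("aggregated:", aggregated, "->", predict_return(aggregated, weights))
```

In a fine-tuning setup the forecasting head and the LLM would typically be trained jointly on realized returns; here the weights are fixed only to show how the two representations feed the same head.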

Submitted to arXiv on 25 Jul. 2024

Results of the summarizing process for the arXiv paper: 2407.18103v1

This paper studies fine-tuning large language models (LLMs) for stock return forecasting with financial newsflow. Return forecasting underpins quantitative portfolio construction, feeding tasks such as stock picking and portfolio optimization. Unlike traditional approaches that rely on feature extraction and validation, this research uses LLMs directly to model the relationship between text representations and future stock returns. The paper investigates how two LLM design choices affect return forecasting: encoder-only versus decoder-only LLMs, and bottleneck versus aggregated representations. Experiments on real financial news data across several investment universes yield three key findings. First, aggregated representations from LLMs' token-level embeddings generally produce return predictions that improve the performance of long-only and long-short portfolios. Second, in the relatively large investment universe, decoder LLMs lead to stronger portfolios than the encoder-only model, whereas in the small universes there are no consistent winners. Among the three LLMs studied (DeBERTa, Mistral, Llama), Mistral performs more robustly across the different universes. Third, return predictions derived from LLMs' text representations are a strong signal for portfolio construction, outperforming conventional sentiment scores. The paper also raises several open questions for future research. For instance, why does the encoder-only DeBERTa underperform in the large investment universe, and why does its performance vary across the small universes?
The study suggests evaluating recently proposed large encoder-only LLMs as a follow-up research direction. Within the decoder-only LLM family, further work is needed to understand why models such as Mistral and Llama perform differently across investment universes. Overall, the paper shows that LLMs' text representations can serve as a useful signal for quantitative portfolio construction and points to further research in this area.
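
To make the link between return predictions and long-short portfolios concrete, here is a minimal sketch under stated assumptions. It ranks stocks by predicted return, goes long the top fraction and short the bottom fraction, and weights each leg equally. The tickers, the numbers, the `fraction` parameter, and the equal weighting are all illustrative; the summary does not specify the paper's actual construction rules.

```python
from typing import Dict, List, Tuple


def long_short_portfolio(
    predictions: Dict[str, float], fraction: float = 0.5
) -> Tuple[List[str], List[str]]:
    """Return (longs, shorts): top and bottom `fraction` of stocks by prediction."""
    if not predictions:
        raise ValueError("predictions must not be empty")
    if not 0.0 < fraction <= 0.5:
        raise ValueError("fraction must be in (0, 0.5]")
    ranked = sorted(predictions, key=predictions.get, reverse=True)
    k = max(1, int(len(ranked) * fraction))
    return ranked[:k], ranked[-k:]


def portfolio_return(
    longs: List[str], shorts: List[str], realized: Dict[str, float]
) -> float:
    """Equal-weighted long leg return minus equal-weighted short leg return."""
    long_ret = sum(realized[s] for s in longs) / len(longs)
    short_ret = sum(realized[s] for s in shorts) / len(shorts)
    return long_ret - short_ret


# Hypothetical model predictions and realized next-period returns.
predicted = {"AAA": 0.03, "BBB": -0.01, "CCC": 0.01, "DDD": -0.02, "EEE": 0.00}
realized = {"AAA": 0.02, "BBB": -0.01, "CCC": 0.00, "DDD": 0.01, "EEE": 0.00}

longs, shorts = long_short_portfolio(predicted, fraction=0.5)
spread = portfolio_return(longs, shorts, realized)
print("long:", longs, "short:", shorts, "spread:", spread)
```

A long-only portfolio in this sketch would simply hold the `longs` leg and ignore `shorts`.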
Created on 23 Dec. 2024
