Exploring the Advantages of Transformers for High-Frequency Trading

AI-generated keywords: HFformer Transformer LSTM Trading Strategies Autoencoders

AI-generated Key Points

  • The paper explores the use of deep learning Transformers architectures for high-frequency Bitcoin-USDT log-return forecasting and compares them to traditional LSTM models.
  • The authors introduce a hybrid Transformer model called HFformer, which incorporates a Transformer encoder, linear decoder, spiking activations, and quantile loss function without using position encoding.
  • Possible high-frequency trading strategies for use with the HFformer model are discussed, including trade sizing, trading signal aggregation, and minimal trading threshold.
  • Future lines of research are suggested to improve the LOB snapshot pre-processing pipeline's noise reduction by using autoencoders with automated feature selection.
  • More extensive performance assessments of the HFformer on large forecast horizons and using altcoin trading pairs such as ETH-USDT are recommended.
  • Implementing the HFformer with other types of Attention modules such as auto-correlation Attention is proposed.
  • A more realistic backtesting environment that accounts for the impact of placed orders and emulates other market participants' activity is suggested to assess the performance of the HFformer.
  • When testing LSTM and HFformer models in log-returns forecasting from 1 to 30 ticks ahead, they achieved higher R2 scores than other deep learning architectures.
  • Both LSTM and HFformer models achieved similar performance for classification tasks.
  • Backtested on different trading strategies involving 1-5 trade signals during multiple signals' trades in BTC-USDT LOB data collected over two days and a month after training and validation data were found that using more than one trade signal decreases the number of trades and increases cumulative PnL of a long-short trading strategy.
  • The HFformer generates long and short trade signals that result in a more balanced trading strategy than LSTM when complemented with trade sizing to improve cumulative PnL. However these methods may yield different results when trading another cryptocurrency pair or financial asset as machine learning methods are data driven and sometimes less generalizable than traditional statistical methods.
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Fazl Barez, Paul Bilokon, Arthur Gervais, Nikita Lisitsyn

arXiv: 2302.13850v1 - DOI (q-fin.ST)
License: CC BY 4.0

Abstract: This paper explores the novel deep learning Transformers architectures for high-frequency Bitcoin-USDT log-return forecasting and compares them to the traditional Long Short-Term Memory models. A hybrid Transformer model, called \textbf{HFformer}, is then introduced for time series forecasting which incorporates a Transformer encoder, linear decoder, spiking activations, and quantile loss function, and does not use position encoding. Furthermore, possible high-frequency trading strategies for use with the HFformer model are discussed, including trade sizing, trading signal aggregation, and minimal trading threshold. Ultimately, the performance of the HFformer and Long Short-Term Memory models are assessed and results indicate that the HFformer achieves a higher cumulative PnL than the LSTM when trading with multiple signals during backtesting.

Submitted to arXiv on 20 Feb. 2023

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2302.13850v1

This paper explores the use of deep learning Transformers architectures for high-frequency Bitcoin-USDT log-return forecasting and compares them to traditional Long Short-Term Memory (LSTM) models. The authors introduce a hybrid Transformer model, called HFformer, which incorporates a Transformer encoder, linear decoder, spiking activations, and quantile loss function without using position encoding. They also discuss possible high-frequency trading strategies for use with the HFformer model, including trade sizing, trading signal aggregation, and minimal trading threshold. The paper suggests several future lines of research that can be undertaken to improve the LOB snapshot pre-processing pipeline's noise reduction by using autoencoders with automated feature selection. Additionally, they suggest performing more extensive performance assessments of the HFformer on large forecast horizons and using altcoin trading pairs such as ETH-USDT. The authors also recommend implementing the HFformer with other types of Attention modules such as auto-correlation Attention. Moreover, they propose implementing a more realistic backtesting environment that accounts for the impact of placed orders and emulates other market participants' activity to assess the performance of the HFformer. When testing LSTM and HFformer models in log-returns forecasting from 1 to 30 ticks ahead, they achieved higher R2 scores than other deep learning architectures. Moreover, both LSTM and HFformer models achieved similar performance for classification tasks. Finally, when backtested on different trading strategies involving 1-5 trade signals during multiple signals' trades in BTC-USDT LOB data collected over two days and a month after training and validation data were found that using more than one trade signal decreases the number of trades and increases the cumulative PnL of a long-short trading strategy. The HFformer generates long and short trade signals that result in a more balanced trading strategy than LSTM when complemented with trade sizing to improve cumulative PnL. However these methods may yield different results when trading another cryptocurrency pair or financial asset as machine learning methods are data driven and sometimes less generalizable than traditional statistical methods.
Created on 08 Apr. 2023

Assess the quality of the AI-generated content by voting

Score: 1

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.