Learning the Spoofability of Limit Order Books With Interpretable Probabilistic Neural Networks

AI-generated keywords: Spoofing Detection

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • The paper focuses on real-time detection of spoofing activity in limit order books, particularly on cryptocurrency centralized exchanges.
  • Innovative order flow variables based on multi-scale Hawkes processes are introduced to consider size and placement distance of new limit orders from current best prices.
  • A neural network model is trained on a Level-3 data set to predict the conditional probability distribution of mid-price movements from these features.
  • The posting distance of limit orders plays a critical role in the price formation process: spoofing detection models that do not account for it are inadequate to describe the data.
  • A spoofing detection framework is proposed based on probabilistic market manipulation gain of a spoofing agent.
  • Running the detection algorithm on all limit orders submitted from 2024-12-04 to 2024-12-07, the authors find that 31% of large orders could spoof the market.
  • The neural network model can operate in real-time, providing a practical tool for monitoring and mitigating spoofing activities in both cryptocurrency exchanges and traditional financial markets.
  • The authors present the work as a contribution to market integrity, providing a tool for monitoring and mitigating spoofing.
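
As an illustration of the order flow variables mentioned in the key points, here is a minimal Python sketch of exponentially decayed order-flow sums tracked at several time scales, in the spirit of a Hawkes intensity with exponential kernels. The class name `MultiScaleFlow`, the time scales, and the exponential damping by distance from the best price are all assumptions made for illustration; the paper's metadata does not specify the authors' actual feature definitions.

```python
import math


class MultiScaleFlow:
    """Hypothetical multi-scale order-flow tracker (not the authors' definition)."""

    def __init__(self, timescales=(1.0, 10.0, 100.0), distance_scale=5.0):
        self.timescales = timescales          # decay times in seconds (assumed)
        self.distance_scale = distance_scale  # damping length in ticks (assumed)
        self.state = [0.0] * len(timescales)  # one decayed sum per time scale
        self.last_time = 0.0

    def update(self, t, size, distance_ticks):
        """Decay every sum to time t, then add the new order's weighted size."""
        dt = t - self.last_time
        # Assumed mark: order size damped by its distance from the best price.
        mark = size * math.exp(-distance_ticks / self.distance_scale)
        self.state = [
            s * math.exp(-dt / tau) + mark
            for s, tau in zip(self.state, self.timescales)
        ]
        self.last_time = t
        return list(self.state)


flow = MultiScaleFlow()
first = flow.update(t=0.0, size=2.0, distance_ticks=0.0)
later = flow.update(t=1.0, size=0.0, distance_ticks=0.0)  # pure decay step
```

Short time scales forget an order quickly while long ones retain it, so the vector of sums summarizes recent order flow at several horizons at once.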

Authors: Timothée Fabre, Damien Challet

arXiv: 2504.15908v1 (q-fin.TR)
22 pages

Abstract: This paper investigates real-time detection of spoofing activity in limit order books, focusing on cryptocurrency centralized exchanges. We first introduce novel order flow variables based on multi-scale Hawkes processes that account both for the size and placement distance from current best prices of new limit orders. Using a Level-3 data set, we train a neural network model to predict the conditional probability distribution of mid price movements based on these features. Our empirical analysis highlights the critical role of the posting distance of limit orders in the price formation process, showing that spoofing detection models that do not take the posting distance into account are inadequate to describe the data. Next, we propose a spoofing detection framework based on the probabilistic market manipulation gain of a spoofing agent and use the previously trained neural network to compute the expected gain. Running this algorithm on all submitted limit orders in the period 2024-12-04 to 2024-12-07, we find that 31% of large orders could spoof the market. Because of its simple neuronal architecture, our model can be run in real time. This work contributes to enhancing market integrity by providing a robust tool for monitoring and mitigating spoofing in both cryptocurrency exchanges and traditional financial markets.
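
The expected-gain idea in the abstract can be sketched as follows: compare the expected mid-price move predicted with and without a candidate spoof order, and treat the difference as the manipulation gain. This is a minimal sketch under stated assumptions, not the authors' framework. The three-class outcome in `MOVES`, the function `toy_move_probs` (a stand-in for the trained network), and the gain definition in `spoofing_gain` are all hypothetical.

```python
import math

MOVES = (-1, 0, 1)  # mid-price change in ticks: down, flat, up (assumed classes)


def toy_move_probs(bid_flow, ask_flow):
    """Stand-in for a trained model: softmax over scores driven by imbalance."""
    imbalance = (bid_flow - ask_flow) / (bid_flow + ask_flow + 1e-9)
    scores = (-2.0 * imbalance, 0.0, 2.0 * imbalance)
    exps = [math.exp(s) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def expected_move(probs):
    """Expected mid-price change in ticks under the predicted distribution."""
    return sum(m * p for m, p in zip(MOVES, probs))


def spoofing_gain(bid_flow, ask_flow, spoof_bid_size):
    """Assumed gain: shift in expected move caused by adding a bid-side order."""
    base = expected_move(toy_move_probs(bid_flow, ask_flow))
    spoofed = expected_move(toy_move_probs(bid_flow + spoof_bid_size, ask_flow))
    return spoofed - base


gain = spoofing_gain(bid_flow=10.0, ask_flow=10.0, spoof_bid_size=20.0)
is_candidate = gain > 0.1  # hypothetical threshold for flagging an order
```

In this toy setting an order is flagged when it shifts the predicted distribution enough to move the expected price in the spoofer's favor; the threshold and the model would have to come from the actual trained network.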

Submitted to arXiv on 22 Apr. 2025

Results of the summarizing process for the arXiv paper: 2504.15908v1

This paper's license does not allow us to build upon its content, so this summary was generated from the paper's metadata rather than the full article.

The paper "Learning the Spoofability of Limit Order Books With Interpretable Probabilistic Neural Networks" by Timothée Fabre and Damien Challet addresses real-time detection of spoofing in limit order books, focusing on cryptocurrency centralized exchanges. The authors introduce order flow variables based on multi-scale Hawkes processes that account for both the size of new limit orders and their placement distance from the current best prices. Using a Level-3 data set, they train a neural network to predict the conditional probability distribution of mid-price movements from these features. Their empirical analysis highlights the role of posting distance in the price formation process: spoofing detection models that ignore it are inadequate to describe the data.

They then propose a spoofing detection framework based on the probabilistic market manipulation gain of a spoofing agent, using the trained network to compute the expected gain. Run on all limit orders submitted from 2024-12-04 to 2024-12-07, the algorithm finds that 31% of large orders could spoof the market. Because the network architecture is simple, the model can run in real time, and the authors present it as a tool for monitoring and mitigating spoofing on both cryptocurrency exchanges and traditional financial markets. The findings underscore the importance of posting distance for spoofing detection models and the value of interpretable probabilistic neural networks for studying market manipulation.
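
To make "predicting a conditional probability distribution" concrete, here is a minimal Python sketch of a small network whose output is a probability vector over price-move classes, scored by negative log-likelihood. The single tanh hidden layer, the three output classes, and the names `forward` and `nll` are assumptions for illustration; the metadata does not describe the authors' architecture beyond calling it simple.

```python
import math


def softmax(scores):
    """Turn raw scores into probabilities (shifted by the max for stability)."""
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def forward(features, w_hidden, w_out):
    """One tanh hidden layer followed by a softmax over price-move classes."""
    hidden = [
        math.tanh(sum(w * x for w, x in zip(row, features)))
        for row in w_hidden
    ]
    scores = [sum(w * h for w, h in zip(row, hidden)) for row in w_out]
    return softmax(scores)


def nll(probs, observed_class):
    """Negative log-likelihood of the observed class under the prediction."""
    return -math.log(probs[observed_class])


# Hypothetical weights: 2 input features, 2 hidden units, 3 output classes.
w_hidden = [[0.0, 0.0], [0.0, 0.0]]
w_out = [[0.0, 0.0], [0.0, 0.0], [0.0, 0.0]]
probs = forward([1.5, -0.5], w_hidden, w_out)
loss = nll(probs, observed_class=2)
```

With all weights at zero the network is uninformed and predicts a uniform distribution; training would adjust the weights to lower the negative log-likelihood on observed price moves.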
Created on 12 May. 2025
