Exploring the Compositional Generalization in Context Dependent Text-to-SQL Parsing

AI-generated keywords: Compositional Generalization

AI-generated Key Points

  • Generated SQL statements in Text-to-SQL task are refined iteratively based on user input utterance
  • Input text from each interaction can be viewed as component modifications to previous SQL statements
  • Modification patterns can be extracted from these interactions and combined with other SQL statements
  • Two challenging benchmarks, CoSQL-CG and SParC-CG, were constructed by recombining modification patterns and existing SQL statements
  • Current models struggle to perform well on these benchmarks
  • Alignment of previous SQL statements with input utterance improves compositional generalization ability of models
  • A method called p-align is proposed to improve compositional generalization by aligning text and SQL statements accurately and incorporating previous SQL statements into model architecture
  • The research highlights the importance of compositional generalization in context-dependent Text-to-SQL settings
  • The proposed method significantly enhances model performance in handling complex queries.
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Aiwei Liu, Wei Liu, Xuming Hu, Shuang Li, Fukun Ma, Yawen Yang, Lijie Wen

Accepted to ACL 2023 (Findings), Long Paper, 11 pages. arXiv admin note: substantial text overlap with arXiv:2205.07686, arXiv:2210.11888 by other authors
License: CC BY 4.0

Abstract: In the context-dependent Text-to-SQL task, the generated SQL statements are refined iteratively based on the user input utterance from each interaction. The input text from each interaction can be viewed as component modifications to the previous SQL statements, which could be further extracted as the modification patterns. Since these modification patterns could also be combined with other SQL statements, the models are supposed to have the compositional generalization to these novel combinations. This work is the first exploration of compositional generalization in context-dependent Text-to-SQL scenarios. To facilitate related studies, we constructed two challenging benchmarks named \textsc{CoSQL-CG} and \textsc{SParC-CG} by recombining the modification patterns and existing SQL statements. The following experiments show that all current models struggle on our proposed benchmarks. Furthermore, we found that better aligning the previous SQL statements with the input utterance could give models better compositional generalization ability. Based on these observations, we propose a method named \texttt{p-align} to improve the compositional generalization of Text-to-SQL models. Further experiments validate the effectiveness of our method. Source code and data are available.

Submitted to arXiv on 29 May. 2023

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2306.04480v1

In the context-dependent Text-to-SQL task, the generated SQL statements are refined iteratively based on the user input utterance from each interaction. The input text from each interaction can be viewed as component modifications to the previous SQL statements, which could be further extracted as modification patterns. These modification patterns can also be combined with other SQL statements, requiring models to have compositional generalization to handle novel combinations. This paper explores compositional generalization in context-dependent Text-to-SQL scenarios and introduces two challenging benchmarks named CoSQL-CG and SParC-CG, constructed by recombining modification patterns and existing SQL statements. The experiments conducted on these benchmarks reveal that current models struggle to perform well. It is observed that better alignment of previous SQL statements with the input utterance can enhance models' compositional generalization ability. Based on this observation, a method called p-align is proposed to improve the compositional generalization of Text-to-SQL models. Further experiments validate the effectiveness of this method. The authors highlight several contributions of their work: they are the first to explore compositional generalization in context-dependent Text-to-SQL; they construct challenging benchmarks for related studies; and they propose a method (p-align) that effectively improves current models' compositional generalization ability by aligning text and SQL statements more accurately and incorporating previous SQL statements into their model architecture. Overall, this research sheds light on the importance of compositional generalization in context dependent Text to SQL settings and presents a method that significantly enhances model performance in handling complex queries.
Created on 09 Aug. 2023

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.