Bridging Textual and Tabular Data for Cross-Domain Text-to-SQL Semantic Parsing

AI-generated keywords: Text-to-SQL Semantic Parsing

AI-generated Key Points

  • Authors Xi Victoria Lin, Richard Socher, and Caiming Xiong introduce BRIDGE, a sequential architecture for cross-domain text-to-SQL semantic parsing.
  • BRIDGE represents the question and DB schema as a tagged sequence enriched with cell values referenced in the question.
  • The hybrid sequence is encoded using BERT with minimal additional layers, leveraging deep attention mechanisms for text-DB contextualization.
  • BRIDGE achieves state-of-the-art performance on benchmarks like Spider (71.1% dev, 67.5% test) and WikiSQL (92.6% dev, 91.9% test).
  • The model effectively captures cross-modal dependencies and demonstrates potential for generalization to other text-DB related tasks.
  • BRIDGE emphasizes interpretability by utilizing anchor texts to enhance alignment between textual mentions and DB schema components.
  • The implementation of BRIDGE is publicly available at \url{https://github.com/salesforce/TabularSemanticParsing}, providing a valuable resource for further exploration in this domain.
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Xi Victoria Lin, Richard Socher, Caiming Xiong

EMNLP Findings 2020 long paper extended; 23 pages
License: CC BY 4.0

Abstract: We present BRIDGE, a powerful sequential architecture for modeling dependencies between natural language questions and relational databases in cross-DB semantic parsing. BRIDGE represents the question and DB schema in a tagged sequence where a subset of the fields are augmented with cell values mentioned in the question. The hybrid sequence is encoded by BERT with minimal subsequent layers and the text-DB contextualization is realized via the fine-tuned deep attention in BERT. Combined with a pointer-generator decoder with schema-consistency driven search space pruning, BRIDGE attained state-of-the-art performance on popular cross-DB text-to-SQL benchmarks, Spider (71.1\% dev, 67.5\% test with ensemble model) and WikiSQL (92.6\% dev, 91.9\% test). Our analysis shows that BRIDGE effectively captures the desired cross-modal dependencies and has the potential to generalize to more text-DB related tasks. Our implementation is available at \url{https://github.com/salesforce/TabularSemanticParsing}.

Submitted to arXiv on 23 Dec. 2020

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2012.12627v2

, , , , In their paper titled "Bridging Textual and Tabular Data for Cross-Domain Text-to-SQL Semantic Parsing," authors Xi Victoria Lin, Richard Socher, and Caiming Xiong introduce BRIDGE, a sequential architecture designed to model dependencies between natural language questions and relational databases in cross-DB semantic parsing. The key innovation of BRIDGE lies in its representation of the question and DB schema as a tagged sequence, where certain fields are enriched with cell values referenced in the question. This hybrid sequence is then encoded using BERT with minimal additional layers, leveraging deep attention mechanisms for text-DB contextualization. Furthermore, BRIDGE incorporates a pointer-generator decoder that prioritizes schema-consistency driven search space pruning. This approach has enabled BRIDGE to achieve state-of-the-art performance on prominent cross-DB text-to-SQL benchmarks like Spider (71.1% dev, 67.5% test with ensemble model) and WikiSQL (92.6% dev, 91.9% test). Through detailed analysis, the authors demonstrate that BRIDGE effectively captures cross-modal dependencies and exhibits potential for generalization to other text-DB related tasks. The authors emphasize the interpretability of BRIDGE by utilizing anchor texts to enhance alignment between textual mentions and DB schema components. By maximizing the utilization of pre-trained language models like BERT, BRIDGE excels at linking text references with database structures. Overall, their findings suggest that BRIDGE is adept at handling natural language variations and structural patterns while achieving outstanding performance on challenging text-to-SQL benchmarks such as WikiSQL. This comprehensive study extends beyond traditional semantic parsing approaches by emphasizing the importance of bridging textual and tabular data effectively for enhanced performance in cross-domain applications. The implementation of BRIDGE is publicly available at \url{https://github.com/salesforce/TabularSemanticParsing}, providing researchers and practitioners with a valuable resource for further exploration in this domain.
Created on 19 Feb. 2024

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.