SpanBERT: Improving Pre-training by Representing and Predicting Spans

AI-generated keywords: SpanBERT pre-training representation prediction text spans

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • SpanBERT is a method developed by Mandar Joshi, Danqi Chen, Yinhan Liu, Daniel S. Weld, Luke Zettlemoyer, and Omer Levy to improve the accuracy and efficiency of text spans.
  • It introduces two key innovations: masking contiguous random spans during pre-training and training span boundary representations to predict masked spans without relying on token-level information.
  • SpanBERT consistently outperforms BERT and other baselines in tasks like question answering and coreference resolution.
  • With equivalent training data and model size as BERT-large, a single SpanBERT model achieves impressive F1 scores of 94.6% on SQuAD 1.1 and 88.7% on SQuAD 2.0.
  • SpanBERT sets new state-of-the-art performance in coreference resolution with an F1 score of 79.6% on the OntoNotes dataset and significant gains in relation extraction with a score of 70.8% on the TACRED benchmark.
  • It demonstrates improvements across various tasks including GLUE benchmarks, showcasing its effectiveness in capturing complex linguistic structures within text data.
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Mandar Joshi, Danqi Chen, Yinhan Liu, Daniel S. Weld, Luke Zettlemoyer, Omer Levy

Abstract: We present SpanBERT, a pre-training method that is designed to better represent and predict spans of text. Our approach extends BERT by (1) masking contiguous random spans, rather than random tokens, and (2) training the span boundary representations to predict the entire content of the masked span, without relying on the individual token representations within it. SpanBERT consistently outperforms BERT and our better-tuned baselines, with substantial gains on span selection tasks such as question answering and coreference resolution. In particular, with the same training data and model size as BERT-large, our single model obtains 94.6% and 88.7% F1 on SQuAD 1.1 and 2.0, respectively. We also achieve a new state of the art on the OntoNotes coreference resolution task (79.6% F1) and the TACRED relation extraction benchmark (70.8% F1), and even show gains on GLUE.

Submitted to arXiv on 24 Jul. 2019

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 1907.10529v1

This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

SpanBERT: Enhancing Performance in Natural Language Processing Tasks is a groundbreaking method developed by Mandar Joshi, Danqi Chen, Yinhan Liu, Daniel S. Weld, Luke Zettlemoyer, and Omer Levy that aims to improve the and of text spans. Unlike its predecessor BERT, SpanBERT introduces two key innovations: masking contiguous random spans instead of individual tokens during pre-training and training span boundary representations to predict the entire content of the masked span without relying on token-level information within it. The results of SpanBERT's implementation are impressive. It consistently outperforms BERT and other baselines in various span selection tasks such as question answering and coreference resolution. Notably, with equivalent training data and model size as BERT-large, a single SpanBERT model achieves remarkable F1 scores of 94.6% on SQuAD 1.1 and 88.7% on SQuAD 2.0. Additionally, SpanBERT sets a new state-of-the-art performance in coreference resolution with an F1 score of 79.6% on the OntoNotes dataset and achieves significant gains in relation extraction with a score of 70.8% on the TACRED benchmark. Moreover, SpanBERT demonstrates improvements across various tasks including GLUE (General Language Understanding Evaluation) benchmarks. This comprehensive evaluation showcases the effectiveness and versatility of SpanBERT in capturing complex linguistic structures and relationships within text data. In summary, in enhancing performance across a range of natural language processing tasks,.
Created on 02 Jul. 2024

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

The license of this specific paper does not allow us to build upon its content and the summarizing tools will be run using the paper metadata rather than the full article. However, it still does a good job, and you can also try our tools on papers with more open licenses.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.