BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension

AI-generated keywords: BART

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • BART is a denoising autoencoder for pretraining sequence-to-sequence models
  • It follows a two-step process: corrupting text and learning to reconstruct it
  • BART utilizes a Transformer-based neural machine translation architecture
  • Randomly shuffling sentences and using mask tokens yield the best performance for noising approaches
  • BART excels in text generation tasks and performs well in comprehension tasks
  • It achieves comparable results to RoBERTa on GLUE and SQuAD datasets, showcasing state-of-the-art performance in various tasks
  • BART outperforms back-translation systems for machine translation by providing a 1.1 BLEU increase with only target language pretraining
  • Ablation experiments within the BART framework identify factors that influence end-task performance
  • Overall, BART proves to be an effective denoising autoencoder for pretraining sequence-to-sequence models, showing versatility across natural language processing tasks.
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Mike Lewis, Yinhan Liu, Naman Goyal, Marjan Ghazvininejad, Abdelrahman Mohamed, Omer Levy, Ves Stoyanov, Luke Zettlemoyer

Abstract: We present BART, a denoising autoencoder for pretraining sequence-to-sequence models. BART is trained by (1) corrupting text with an arbitrary noising function, and (2) learning a model to reconstruct the original text. It uses a standard Tranformer-based neural machine translation architecture which, despite its simplicity, can be seen as generalizing BERT (due to the bidirectional encoder), GPT (with the left-to-right decoder), and many other more recent pretraining schemes. We evaluate a number of noising approaches, finding the best performance by both randomly shuffling the order of the original sentences and using a novel in-filling scheme, where spans of text are replaced with a single mask token. BART is particularly effective when fine tuned for text generation but also works well for comprehension tasks. It matches the performance of RoBERTa with comparable training resources on GLUE and SQuAD, achieves new state-of-the-art results on a range of abstractive dialogue, question answering, and summarization tasks, with gains of up to 6 ROUGE. BART also provides a 1.1 BLEU increase over a back-translation system for machine translation, with only target language pretraining. We also report ablation experiments that replicate other pretraining schemes within the BART framework, to better measure which factors most influence end-task performance.

Submitted to arXiv on 29 Oct. 2019

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 1910.13461v1

This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

BART is a denoising autoencoder designed for pretraining sequence-to-sequence models. It follows a two-step process: corrupting text with a noising function and then learning to reconstruct the original text. BART utilizes a Transformer-based neural machine translation architecture, which is an extension of BERT and GPT. The model explores various noising approaches and finds that randomly shuffling the order of sentences and using an in-filling scheme with mask tokens yield the best performance. When fine-tuned, BART excels in text generation tasks but also performs well in comprehension tasks. It achieves comparable results to RoBERTa on GLUE and SQuAD datasets, demonstrating state-of-the-art performance in abstractive dialogue, question answering, and summarization tasks with up to 6 ROUGE gains. Additionally, BART outperforms back-translation systems for machine translation by providing a 1.1 BLEU increase with only target language pretraining. The authors conduct ablation experiments within the BART framework to identify the factors that most influence end-task performance. Overall, BART proves to be an effective denoising autoencoder for pretraining sequence-to-sequence models, showcasing its versatility across various natural language processing tasks.
Created on 17 Oct. 2023

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

The license of this specific paper does not allow us to build upon its content and the summarizing tools will be run using the paper metadata rather than the full article. However, it still does a good job, and you can also try our tools on papers with more open licenses.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.