Show Your Work: Scratchpads for Intermediate Computation with Language Models

AI-generated keywords: Scratchpads Language Models Multi-Step Computations Transformers Few-Shot Regime

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • The paper explores the limitations of large pre-trained language models when it comes to tasks that require unbounded multi-step computation
  • Large pre-trained language models perform well on tasks that can be done "in one pass", but struggle with more complex computations
  • These same models are able to perform complex multi-step computations, even in the few-shot regime, when asked to perform the operation "step by step", showing the results of intermediate computations
  • The authors train transformers to perform multi-step computations by asking them to emit intermediate computation steps into a "scratchpad"
  • Scratchpads serve as a temporary storage space for intermediate results and allow the model to keep track of its progress throughout the computation
  • On a series of increasingly complex tasks ranging from long addition to the execution of arbitrary programs, scratchpads dramatically improve the ability of language models to perform multi-step computations
  • Scratchpads can help language models solve problems that were previously beyond their capabilities, such as arithmetic expressions with nested parentheses and multiple operations or program synthesis from natural language descriptions.
  • Overall, this paper highlights an important breakthrough in improving the capabilities of large pre-trained language models for more complex computational tasks.
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Maxwell Nye, Anders Johan Andreassen, Guy Gur-Ari, Henryk Michalewski, Jacob Austin, David Bieber, David Dohan, Aitor Lewkowycz, Maarten Bosma, David Luan, Charles Sutton, Augustus Odena

Abstract: Large pre-trained language models perform remarkably well on tasks that can be done "in one pass", such as generating realistic text or synthesizing computer programs. However, they struggle with tasks that require unbounded multi-step computation, such as adding integers or executing programs. Surprisingly, we find that these same models are able to perform complex multi-step computations -- even in the few-shot regime -- when asked to perform the operation "step by step", showing the results of intermediate computations. In particular, we train transformers to perform multi-step computations by asking them to emit intermediate computation steps into a "scratchpad". On a series of increasingly complex tasks ranging from long addition to the execution of arbitrary programs, we show that scratchpads dramatically improve the ability of language models to perform multi-step computations.

Submitted to arXiv on 30 Nov. 2021

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2112.00114v1

This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

The paper "Show Your Work: Scratchpads for Intermediate Computation with Language Models" explores the limitations of large pre-trained language models when it comes to tasks that require unbounded multi-step computation, such as adding integers or executing programs. While these models perform remarkably well on tasks that can be done "in one pass", they struggle with more complex computations. However, the authors find that these same models are able to perform complex multi-step computations, even in the few-shot regime, when asked to perform the operation "step by step", showing the results of intermediate computations. To achieve this, the authors train transformers to perform multi-step computations by asking them to emit intermediate computation steps into a "scratchpad". The scratchpad serves as a temporary storage space for intermediate results and allows the model to keep track of its progress throughout the computation. On a series of increasingly complex tasks ranging from long addition to the execution of arbitrary programs, the authors show that scratchpads dramatically improve the ability of language models to perform multi-step computations. The paper presents several experiments demonstrating how scratchpads can help language models solve problems that were previously beyond their capabilities. For example, on a task involving arithmetic expressions with nested parentheses and multiple operations, scratchpads enabled a transformer model to achieve near-perfect accuracy after just a few training examples. Similarly, on a task involving program synthesis from natural language descriptions, scratchpads helped improve performance by over 20%. Overall, this paper highlights an important breakthrough in improving the capabilities of large pre-trained language models for more complex computational tasks. By introducing scratchpads as an intermediary tool for tracking intermediate results during multi-step computations, these models can now tackle problems that were previously out of reach.
Created on 08 Jun. 2023

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

The license of this specific paper does not allow us to build upon its content and the summarizing tools will be run using the paper metadata rather than the full article. However, it still does a good job, and you can also try our tools on papers with more open licenses.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.