Code Execution with Pre-trained Language Models

AI-generated keywords: Code Execution

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • The authors examine code execution as a fundamental part of programming language semantics
  • Most pre-trained models for code intelligence rely on source code and syntactic structures and ignore the execution trace
  • A mutation-based data augmentation technique is used to build a large-scale, realistic Python dataset and task for code execution
  • The dataset challenges existing models such as Codex
  • CodeExecutor, a Transformer model, uses code execution pre-training and curriculum learning to improve semantic comprehension
  • CodeExecutor is evaluated on code execution, showing promising performance as well as limitations
  • CodeExecutor may also benefit other code intelligence tasks such as zero-shot code-to-code search and text-to-code generation
  • The analysis offers insights into the learning and generalization abilities of pre-trained models for code execution
  • Incorporating the execution trace is intended to improve how pre-trained models understand and handle complex programming tasks
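
To make the idea of mutation-based data augmentation concrete, here is a minimal sketch in Python. It is not the authors' pipeline, which is not described in the metadata available here. The mutation operator `SwapAddSub` and the helper `mutate` are hypothetical names chosen for illustration; the sketch only shows the general principle of deriving a new, still-executable program from an existing one by rewriting its syntax tree.

```python
import ast


class SwapAddSub(ast.NodeTransformer):
    """Hypothetical mutation operator: turn every `+` into `-`."""

    def visit_BinOp(self, node):
        self.generic_visit(node)  # mutate nested expressions first
        if isinstance(node.op, ast.Add):
            node.op = ast.Sub()
        return node


def mutate(source: str) -> str:
    """Return a mutated copy of `source` (ast.unparse needs Python 3.9+)."""
    tree = ast.parse(source)
    mutated = SwapAddSub().visit(tree)
    ast.fix_missing_locations(mutated)
    return ast.unparse(mutated)


original = "def total(a, b):\n    return a + b\n"
mutant = mutate(original)

# The mutant is a different program that still runs, so it can be
# executed to obtain a new (program, behavior) training pair.
namespace = {}
exec(mutant, namespace)
mutant_result = namespace["total"](5, 3)
```

A real augmentation pipeline would presumably use many operators (constants, comparisons, control flow) and discard mutants that crash or fail to terminate; those details are assumptions, not claims about the paper.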

Authors: Chenxiao Liu, Shuai Lu, Weizhu Chen, Daxin Jiang, Alexey Svyatkovskiy, Shengyu Fu, Neel Sundaresan, Nan Duan

Accepted to the Findings of ACL 2023

Abstract: Code execution is a fundamental aspect of programming language semantics that reflects the exact behavior of the code. However, most pre-trained models for code intelligence ignore the execution trace and only rely on source code and syntactic structures. In this paper, we investigate how well pre-trained models can understand and perform code execution. We develop a mutation-based data augmentation technique to create a large-scale and realistic Python dataset and task for code execution, which challenges existing models such as Codex. We then present CodeExecutor, a Transformer model that leverages code execution pre-training and curriculum learning to enhance its semantic comprehension. We evaluate CodeExecutor on code execution and show its promising performance and limitations. We also demonstrate its potential benefits for code intelligence tasks such as zero-shot code-to-code search and text-to-code generation. Our analysis provides insights into the learning and generalization abilities of pre-trained models for code execution.
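
As an illustration of what an execution trace can look like, the following sketch records the local variables before each executed line of a small function, using Python's standard `sys.settrace` hook. The trace format (line offset plus a snapshot of locals) and the helper name `trace_execution` are assumptions made for this example; the paper's own trace representation is not given in the metadata.

```python
import sys


def trace_execution(func, *args):
    """Run func(*args) and record (line offset, locals) before each line executes."""
    trace = []
    code = func.__code__

    def tracer(frame, event, arg):
        if frame.f_code is not code:
            return None  # do not trace other functions
        if event == "line":
            offset = frame.f_lineno - code.co_firstlineno
            trace.append((offset, dict(frame.f_locals)))
        return tracer

    previous = sys.gettrace()
    sys.settrace(tracer)
    try:
        result = func(*args)
    finally:
        sys.settrace(previous)  # always restore the previous trace hook
    return result, trace


def sum_to(n):
    total = 0
    for i in range(n):
        total += i
    return total


result, trace = trace_execution(sum_to, 3)
for offset, local_vars in trace:
    print(offset, local_vars)
```

Each entry shows the program state just before a line runs. Source code and syntax trees alone do not expose this information, which is the gap the abstract points to.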

Submitted to arXiv on 08 May 2023

Results of the summarizing process for the arXiv paper: 2305.05383v1

In the paper titled "Code Execution with Pre-trained Language Models," authors Chenxiao Liu, Shuai Lu, Weizhu Chen, Daxin Jiang, Alexey Svyatkovskiy, Shengyu Fu, Neel Sundaresan, and Nan Duan explore the importance of code execution in programming language semantics. They highlight that most pre-trained models for code intelligence focus solely on source code and syntactic structures while ignoring the execution trace. To address this limitation, the authors develop a mutation-based data augmentation technique to create a realistic Python dataset for code execution. This dataset challenges existing models like Codex. Additionally, they introduce CodeExecutor, a Transformer model that leverages code execution pre-training and curriculum learning to enhance its semantic comprehension. The authors evaluate CodeExecutor's performance on code execution tasks and demonstrate its promising results as well as its limitations. Furthermore, they showcase the potential benefits of CodeExecutor for other code intelligence tasks such as zero-shot code-to-code search and text-to-code generation. Overall, this study provides valuable insights into the learning and generalization abilities of pre-trained models for code execution. By incorporating the execution trace into their approach, the authors aim to improve the understanding and performance of pre-trained models in handling complex programming tasks.
Created on 24 Aug. 2023
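
The summary mentions curriculum learning without giving details. One common reading is to train on easier examples first and widen the pool in stages. The sketch below assumes, purely for illustration, that difficulty is measured by the number of trace steps; the `Example` format and the function `curriculum_stages` are hypothetical and are not taken from the paper.

```python
from typing import Iterator, List, Tuple

# Hypothetical example format: (program name, number of trace steps).
Example = Tuple[str, int]


def curriculum_stages(examples: List[Example], num_stages: int) -> Iterator[List[Example]]:
    """Yield cumulative training pools, from the easiest examples to the full set."""
    # Assumption: fewer trace steps means an easier example.
    ranked = sorted(examples, key=lambda ex: ex[1])
    for stage in range(1, num_stages + 1):
        cutoff = len(ranked) * stage // num_stages
        yield ranked[:cutoff]


examples = [("loop", 12), ("assign", 1), ("branch", 4), ("nested", 30)]
stages = list(curriculum_stages(examples, 2))
for number, pool in enumerate(stages, start=1):
    print(number, [name for name, _ in pool])
```

With two stages, the first pool holds the two shortest traces and the second holds all four examples, so the model would see harder programs only after the easier ones.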
