LongCodeZip: Compress Long Context for Code Language Models

AI-generated keywords: Code Generation Long Contexts Large Language Models Code Compression Efficiency

AI-generated Key Points

Authors address the need for code generation under long contexts in Large Language Models (LLMs)
Introduction of LongCodeZip, a code compression framework tailored for code LLMs
Dual-stage strategy: coarse-grained compression ranks function-level chunks based on conditional perplexity, fine-grained compression segments functions into blocks using perplexity metrics
LongCodeZip achieves up to a 5.6x compression ratio without compromising task performance
Enables LLMs to scale better in real-world large-scale code scenarios and enhances efficiency and capability of code intelligence applications
Provides insights into datasets used for evaluation, results on long-code completion tasks, and emphasizes the importance of refining compression techniques

Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Yuling Shi, Yichun Qian, Hongyu Zhang, Beijun Shen, Xiaodong Gu

arXiv: 2510.00446v1 - DOI (cs.CL)

Accepted to ASE 2025. Code available at https://github.com/YerbaPage/LongCodeZip

License: CC BY 4.0

Abstract: Code generation under long contexts is becoming increasingly critical as Large Language Models (LLMs) are required to reason over extensive information in the codebase. While recent advances enable code LLMs to process long inputs, high API costs and generation latency remain substantial bottlenecks. Existing context pruning techniques, such as LLMLingua, achieve promising results for general text but overlook code-specific structures and dependencies, leading to suboptimal performance in programming tasks. In this paper, we propose LongCodeZip, a novel plug-and-play code compression framework designed specifically for code LLMs. LongCodeZip employs a dual-stage strategy: (1) coarse-grained compression, which identifies and ranks function-level chunks using conditional perplexity with respect to the instruction, retaining only the most relevant functions; and (2) fine-grained compression, which segments retained functions into blocks based on perplexity and selects an optimal subset under an adaptive token budget to maximize relevance. Evaluations across multiple tasks, including code completion, summarization, and question answering, show that LongCodeZip consistently outperforms baseline methods, achieving up to a 5.6x compression ratio without degrading task performance. By effectively reducing context size while preserving essential information, LongCodeZip enables LLMs to better scale to real-world, large-scale code scenarios, advancing the efficiency and capability of code intelligence applications.

Submitted to arXiv on 01 Oct. 2025

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2510.00446v1

Comprehensive Summary
Key points
Layman's Summary
Blog article

In their paper "LongCodeZip: Compress Long Context for Code Language Models," authors Yuling Shi, Yichun Qian, Hongyu Zhang, Beijun Shen, and Xiaodong Gu address the increasing need for code generation under long contexts in Large Language Models (LLMs). Recent advancements have enabled code LLMs to process extensive information in codebases. However, high API costs and generation latency remain significant challenges. To tackle these issues, the authors introduce LongCodeZip - a novel plug-and-play code compression framework tailored for code LLMs. LongCodeZip adopts a dual-stage strategy: coarse-grained compression identifies and ranks function-level chunks based on conditional perplexity with respect to instructions. It retains only the most relevant functions. Fine-grained compression further segments retained functions into blocks using perplexity metrics and selects an optimal subset within an adaptive token budget to maximize relevance. The evaluations conducted across various tasks such as code completion, summarization, and question answering demonstrate that LongCodeZip consistently outperforms baseline methods by achieving up to a 5.6x compression ratio without compromising task performance. By effectively reducing context size while preserving essential information, LongCodeZip enables LLMs to scale better in real-world large-scale code scenarios. This enhances the efficiency and capability of code intelligence applications. Additionally, the authors provide insights into datasets used for evaluating long-context code compression and present results on long-code completion tasks. They also discuss the limitations of existing methods and emphasize the importance of refining compression techniques for stricter compression ratios. Overall, offers a promising solution to address the challenges associated with processing long contexts in . Its implementation can greatly improve in tasks, especially under . With its focus on , LongCodeZip provides a valuable contribution to the field and has the potential to enhance the capabilities of code intelligence applications.

- Authors address the need for code generation under long contexts in Large Language Models (LLMs)
- Introduction of LongCodeZip, a code compression framework tailored for code LLMs
- Dual-stage strategy: coarse-grained compression ranks function-level chunks based on conditional perplexity, fine-grained compression segments functions into blocks using perplexity metrics
- LongCodeZip achieves up to a 5.6x compression ratio without compromising task performance
- Enables LLMs to scale better in real-world large-scale code scenarios and enhances efficiency and capability of code intelligence applications
- Provides insights into datasets used for evaluation, results on long-code completion tasks, and emphasizes the importance of refining compression techniques

Summary- Authors talk about the importance of making code shorter in big language models. - They introduce LongCodeZip, a way to make code smaller in these models. - LongCodeZip uses a two-step process to compress code based on how hard it is to understand. - With LongCodeZip, code can be made 5.6 times smaller without making it harder for the model to do its job. - This helps big language models work better with large amounts of code and improves how well they understand and use it. Definitions- Authors: People who write books, articles, or research papers. - Code generation: Creating new pieces of code automatically. - Large Language Models (LLMs): Big computer programs that can understand and generate human language text. - Compression: Making something smaller by removing unnecessary parts or using fewer bits to represent it. - Perplexity: A measure of how hard it is to predict the next word in a sequence of words.

Introduction

In recent years, Large Language Models (LLMs) have made significant strides in natural language processing tasks such as text completion and translation. However, their application to code generation has been limited due to the complexity of programming languages and the need for long contexts. Long contexts refer to a large amount of information that is required to generate accurate and relevant code. To address this issue, Yuling Shi, Yichun Qian, Hongyu Zhang, Beijun Shen, and Xiaodong Gu have introduced a novel framework called LongCodeZip in their paper "LongCodeZip: Compress Long Context for Code Language Models." This framework aims to compress long context for code LLMs while maintaining high performance on various coding tasks. In this blog article, we will discuss the key points of this research paper and its potential impact on the field of code intelligence.

The Need for Long Context Compression

The authors highlight two major challenges faced by current LLMs when dealing with long contexts in code generation - high API costs and generation latency. As LLMs process extensive information from large codebases, they require access to numerous APIs (Application Programming Interfaces) which can be costly. Additionally, generating relevant code under long contexts can take a considerable amount of time. To overcome these challenges, the authors propose LongCodeZip - a compression framework specifically designed for long-context scenarios in coding tasks.

The Dual-Stage Strategy

LongCodeZip adopts a dual-stage strategy that involves coarse-grained compression followed by fine-grained compression. The first stage identifies function-level chunks based on conditional perplexity with respect to instructions. It then ranks these chunks based on their relevance and retains only the most important ones. In the second stage, fine-grained compression further segments retained functions into blocks using perplexity metrics. It then selects an optimal subset within an adaptive token budget to maximize relevance. This two-stage approach allows LongCodeZip to effectively reduce the size of long contexts while preserving essential information.

Evaluation and Results

The authors evaluated LongCodeZip on various coding tasks such as code completion, summarization, and question answering. They compared its performance with baseline methods and found that it consistently outperformed them by achieving up to a 5.6x compression ratio without compromising task performance. In addition, the authors also provide insights into datasets used for evaluating long-context code compression and present results on long-code completion tasks. These evaluations demonstrate the effectiveness of LongCodeZip in improving the efficiency and capability of LLMs in real-world large-scale code scenarios.

Limitations and Future Work

While LongCodeZip shows promising results, the authors acknowledge some limitations of their framework. One limitation is that it currently only focuses on function-level chunks and does not consider other types of context such as class or file-level information. The authors suggest exploring these areas in future work to further improve compression techniques for stricter ratios.

Conclusion

LongCodeZip offers a valuable solution to address the challenges associated with processing long contexts in coding tasks. By effectively compressing long contexts while maintaining high performance, this framework has the potential to enhance the capabilities of code intelligence applications. The dual-stage strategy adopted by LongCodeZip allows it to achieve impressive compression ratios without sacrificing task performance. Its implementation can greatly improve efficiency in coding tasks, especially under long contexts. With its focus on addressing specific challenges faced by LLMs when dealing with large-scale codebases, LongCodeZip provides a valuable contribution to the field of code intelligence. Overall, this research paper highlights the importance of refining compression techniques for stricter ratios and presents a promising solution that can have a significant impact on improving LLMs' capabilities in handling long-context scenarios in coding tasks.

Created on 15 Jan. 2026

Assess the quality of the AI-generated content by voting

Score: 0

Similar papers summarized with our AI tools

61.3%

Code Llama: Open Foundation Models for Code

cs.CL

60.3%

Retrieval Augmented Generation or Long-Context LLMs? A Comprehensive Study an…

cs.CL

60.3%

Long Context vs. RAG for LLMs: An Evaluation and Revisits

cs.CL

60.3%

UncertaintyRAG: Span-Level Uncertainty Enhanced Long-Context Modeling for Ret…

cs.CL

60.1%

Effective Long-Context Scaling of Foundation Models

cs.CL

60.0%

CCF: A Context Compression Framework for Efficient Long-Sequence Language Mod…

cs.CL

59.7%

Training LLMs over Neurally Compressed Text

cs.CL

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.