Signal Collapse in One-Shot Pruning: When Sparse Models Fail to Distinguish Neural Representations

AI-generated keywords: neural network pruning, signal collapse, weight selection strategies, REFLOW, model efficiency

AI-generated Key Points

  • Neural network pruning is crucial for reducing model complexity for deployment on resource-constrained hardware.
  • Signal collapse (a reduction in activation variance across layers), rather than the removal of critical parameters, is identified as the primary cause of performance decline in pruned networks.
  • Existing one-shot pruning methods focus on weight selection strategies and rely on computationally expensive second-order approximations.
  • Addressing signal collapse is essential for enhancing the accuracy of pruned networks.
  • REFLOW is introduced as a novel approach to mitigate signal collapse without updating trainable weights, unveiling high-quality sparse sub-networks within the original parameter space.
  • REFLOW enables magnitude pruning to achieve state-of-the-art performance, restoring ResNeXt101 accuracy from under 4.1% to 78.9% on ImageNet with only 20% of the weights retained, surpassing existing approaches.
  • These findings challenge conventional beliefs about neural network pruning for large-scale models and emphasize accounting for signal collapse when weights are pruned by magnitude (that is, by their absolute values).
  • This research offers promising insights for improving model efficiency and performance in resource-constrained settings.

Authors: Dhananjay Saikumar, Blesson Varghese

License: CC BY 4.0

Abstract: Neural network pruning is essential for reducing model complexity to enable deployment on resource-constrained hardware. While performance loss of pruned networks is often attributed to the removal of critical parameters, we identify signal collapse, a reduction in activation variance across layers, as the root cause. Existing one-shot pruning methods focus on weight selection strategies and rely on computationally expensive second-order approximations. In contrast, we demonstrate that mitigating signal collapse, rather than optimizing weight selection, is key to improving accuracy of pruned networks. We propose REFLOW, which addresses signal collapse without updating trainable weights, revealing high-quality sparse sub-networks within the original parameter space. REFLOW enables magnitude pruning to achieve state-of-the-art performance, restoring ResNeXt101 accuracy from under 4.1% to 78.9% on ImageNet with only 20% of the weights retained, surpassing state-of-the-art approaches.
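The abstract refers to magnitude pruning with 20% of the weights retained. As a rough illustration of that baseline only (not of REFLOW, whose mechanism is not described on this page), the sketch below zeroes all but the largest-magnitude entries of a weight list. The function name `magnitude_prune`, the `keep_fraction` parameter, and the example weights are all hypothetical choices made for this illustration.

```python
def magnitude_prune(weights, keep_fraction):
    """Zero all but the largest-magnitude entries of a flat list of weights.

    Illustrative helper, not taken from the paper.
    """
    if not 0.0 < keep_fraction <= 1.0:
        raise ValueError("keep_fraction must be in (0, 1]")
    n_keep = max(1, round(len(weights) * keep_fraction))
    # Indices ordered by absolute value, largest first.
    order = sorted(range(len(weights)), key=lambda i: abs(weights[i]), reverse=True)
    keep = set(order[:n_keep])
    # Keep selected weights unchanged; set the rest to zero.
    return [w if i in keep else 0.0 for i, w in enumerate(weights)]


# Made-up example weights; 20% retention keeps 2 of the 10 entries.
weights = [0.9, -0.05, 0.3, -1.2, 0.01, 0.4, -0.02, 0.07, -0.6, 0.15]
pruned = magnitude_prune(weights, keep_fraction=0.2)
print(pruned)
```

The surviving weights keep their original values, which is consistent with the idea of searching for sparse sub-networks inside the original parameter space; real implementations typically apply the same selection per layer or globally over tensors rather than over a flat list.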

Submitted to arXiv on 18 Feb. 2025


Results of the summarizing process for the arXiv paper: 2502.15790v1

Neural network pruning plays a crucial role in reducing model complexity for deployment on resource-constrained hardware. This study identifies signal collapse, a reduction in activation variance across layers, as the primary cause of performance decline in pruned networks, rather than the removal of critical parameters. Existing one-shot pruning methods focus on weight selection strategies and rely on computationally expensive second-order approximations. In contrast, this research argues that mitigating signal collapse, rather than optimizing weight selection, is the key to improving the accuracy of pruned networks. The study introduces REFLOW, which mitigates signal collapse without updating trainable weights and thereby reveals high-quality sparse sub-networks within the original parameter space. REFLOW enables magnitude pruning to achieve state-of-the-art performance, restoring ResNeXt101 accuracy from under 4.1% to 78.9% on ImageNet with only 20% of the weights retained and surpassing existing approaches. These findings challenge conventional beliefs about neural network pruning for large-scale models and highlight the importance of accounting for signal collapse when weights are pruned by magnitude (that is, by their absolute values). The research offers a new perspective on pruning and promising insights for improving model efficiency and performance in resource-constrained settings.
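To make the notion of activation variance across layers more concrete, the following toy sketch compares the per-layer variance of pre-activations in a small random ReLU network before and after layer-wise magnitude pruning. Everything here is an assumption made for illustration: the network is a random fully connected stack with He-style Gaussian initialisation, not a trained ResNeXt101, and the helper names (`make_layer`, `prune_layer`, `layer_variances`) are hypothetical. It does not implement REFLOW; it only measures the quantity that the summary calls signal collapse.

```python
import random
import statistics


def make_layer(n_in, n_out, rng):
    # He-style Gaussian initialisation (an assumption of this toy setup).
    std = (2.0 / n_in) ** 0.5
    return [[rng.gauss(0.0, std) for _ in range(n_in)] for _ in range(n_out)]


def prune_layer(layer, keep_fraction):
    # Layer-wise magnitude pruning: keep the largest |w| within this layer.
    magnitudes = sorted((abs(w) for row in layer for w in row), reverse=True)
    n_keep = max(1, round(len(magnitudes) * keep_fraction))
    threshold = magnitudes[n_keep - 1]
    return [[w if abs(w) >= threshold else 0.0 for w in row] for row in layer]


def layer_variances(layers, inputs):
    """Variance of pre-activations after each layer, pooled over inputs and units."""
    variances = []
    acts = inputs
    for layer in layers:
        pre = [[sum(w * a for w, a in zip(row, x)) for row in layer] for x in acts]
        variances.append(statistics.pvariance([v for sample in pre for v in sample]))
        acts = [[max(0.0, v) for v in sample] for sample in pre]  # ReLU
    return variances


rng = random.Random(0)
width, depth, batch = 64, 8, 32
dense = [make_layer(width, width, rng) for _ in range(depth)]
sparse = [prune_layer(layer, keep_fraction=0.2) for layer in dense]
inputs = [[rng.gauss(0.0, 1.0) for _ in range(width)] for _ in range(batch)]

dense_var = layer_variances(dense, inputs)
sparse_var = layer_variances(sparse, inputs)
for i, (d, s) in enumerate(zip(dense_var, sparse_var), start=1):
    print(f"layer {i}: dense variance {d:.4f}, pruned variance {s:.4f}")
```

In this toy setting the pruned network's variance is expected to shrink with depth relative to the dense one, because each pruned layer passes on only part of the signal energy. How closely this mirrors the behaviour reported for trained networks in the paper is not something this sketch can establish.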
Created on 25 Feb. 2025

