Differentiable Product Quantization for End-to-End Embedding Compression

AI-generated keywords: Differentiable Product Quantization; End-to-End Embedding Compression; Memory and Storage Constraints; Novel Compression Framework; Continuous Embedding Vectors

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • Authors Ting Chen, Lala Li, and Yizhou Sun introduce differentiable product quantization (DPQ) to address memory and storage constraints in embedding layers.
  • DPQ offers significant compression ratios ranging from 14 to 238 times.
  • The framework includes two instantiations with different approximation techniques to ensure differentiability in end-to-end learning.
  • DPQ can serve as a drop-in alternative for any existing embedding layer, at negligible or no performance cost on 10 datasets across three language tasks.
  • The approach reduces the memory and storage footprint of embedding layers while still mapping discrete symbols to continuous embedding vectors that reflect their semantic meanings.

Authors: Ting Chen, Lala Li, Yizhou Sun

ICML'2020. Code at https://github.com/chentingpc/dpq_embedding_compression

Abstract: Embedding layers are commonly used to map discrete symbols into continuous embedding vectors that reflect their semantic meanings. Despite their effectiveness, the number of parameters in an embedding layer increases linearly with the number of symbols and poses a critical challenge on memory and storage constraints. In this work, we propose a generic and end-to-end learnable compression framework termed differentiable product quantization (DPQ). We present two instantiations of DPQ that leverage different approximation techniques to enable differentiability in end-to-end learning. Our method can readily serve as a drop-in alternative for any existing embedding layer. Empirically, DPQ offers significant compression ratios (14-238$\times$) at negligible or no performance cost on 10 datasets across three different language tasks.
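The storage argument in the abstract can be illustrated with a small sketch of generic product quantization, under stated assumptions. This is not the authors' implementation, and it does not model the differentiable training step or either of the two DPQ instantiations, which the abstract does not detail. The layout (each symbol stores one small integer code per group, and each group has a shared codebook of sub-vectors), the function names `pq_compression_ratio` and `pq_lookup`, and all sizes are hypothetical choices made for illustration.

```python
import math
import random


def pq_compression_ratio(vocab_size, dim, num_groups, num_codes, float_bits=32):
    """Ratio of full-table bits to (codes + codebooks) bits for a generic PQ layout."""
    full_bits = vocab_size * dim * float_bits
    # Each symbol stores num_groups integer codes, each needing ceil(log2(num_codes)) bits.
    code_bits = vocab_size * num_groups * math.ceil(math.log2(num_codes))
    # num_groups codebooks, each holding num_codes sub-vectors of length dim / num_groups.
    codebook_bits = num_codes * dim * float_bits
    return full_bits / (code_bits + codebook_bits)


def pq_lookup(codes, codebooks, symbol_id):
    """Rebuild one embedding by concatenating the selected sub-vector from each group."""
    vector = []
    for group, code in enumerate(codes[symbol_id]):
        vector.extend(codebooks[group][code])
    return vector


# Hypothetical sizes, chosen only for illustration (not taken from the paper).
vocab_size, dim, num_groups, num_codes = 10000, 128, 8, 16
sub_dim = dim // num_groups

rng = random.Random(0)
# Random values stand in for learned codebooks and learned code assignments.
codebooks = [
    [[rng.gauss(0.0, 1.0) for _ in range(sub_dim)] for _ in range(num_codes)]
    for _ in range(num_groups)
]
codes = [[rng.randrange(num_codes) for _ in range(num_groups)] for _ in range(vocab_size)]

embedding = pq_lookup(codes, codebooks, symbol_id=42)
ratio = pq_compression_ratio(vocab_size, dim, num_groups, num_codes)
print(len(embedding), round(ratio, 1))
```

In this layout the per-symbol cost is a handful of bits per group rather than `dim` floats, so the integer codes grow with the vocabulary while the codebooks stay fixed in size. The ratio printed here depends entirely on the hypothetical sizes above and says nothing about the figures reported in the paper.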

Submitted to arXiv on 26 Aug. 2019

Results of the summarizing process for the arXiv paper: 1908.09756v3

In "Differentiable Product Quantization for End-to-End Embedding Compression," Ting Chen, Lala Li, and Yizhou Sun address the memory and storage cost of embedding layers, whose parameter count grows linearly with the number of symbols. They propose differentiable product quantization (DPQ), a generic, end-to-end learnable compression framework, and present two instantiations that use different approximation techniques to enable differentiability in end-to-end learning. DPQ can serve as a drop-in alternative for any existing embedding layer. Empirically, it achieves compression ratios of 14 to 238 times at negligible or no performance cost on 10 datasets across three language tasks.
Created on 27 May. 2024
