Neural Video Compression with Diverse Contexts

AI-generated keywords: Neural Video Compression

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • The paper addresses coding efficiency in video codecs
  • Importance of finding relevant contexts for effective encoding
  • Traditional codecs are time-consuming but show significant coding gains with more contexts
  • Neural video codecs (NVC) have limited contexts, resulting in low compression ratio
  • Proposed solution: increase context diversity in temporal and spatial dimensions
  • Introduce hierarchical quality pattern learning approach to capture long-term and high-quality temporal contexts across frames
  • Leverage optical flow-based coding frameworks by introducing group-based offset diversity for enhanced context mining through cross-group interaction
  • Adopt quadtree-based partitioning technique to increase spatial context diversity during parallel encoding of latent representations
  • Experimental results show 23.5% bitrate saving compared to previous state-of-the-art NVC approaches
  • Outperforms next-generation traditional codecs/ECMs in terms of PSNR for RGB and YUV420 colorspaces
  • Implementation codes available at https://github.com/microsoft/DCVC
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Jiahao Li, Bin Li, Yan Lu

Accepted by CVPR 2023. Codes are at https://github.com/microsoft/DCVC

Abstract: For any video codecs, the coding efficiency highly relies on whether the current signal to be encoded can find the relevant contexts from the previous reconstructed signals. Traditional codec has verified more contexts bring substantial coding gain, but in a time-consuming manner. However, for the emerging neural video codec (NVC), its contexts are still limited, leading to low compression ratio. To boost NVC, this paper proposes increasing the context diversity in both temporal and spatial dimensions. First, we guide the model to learn hierarchical quality patterns across frames, which enriches long-term and yet high-quality temporal contexts. Furthermore, to tap the potential of optical flow-based coding framework, we introduce a group-based offset diversity where the cross-group interaction is proposed for better context mining. In addition, this paper also adopts a quadtree-based partition to increase spatial context diversity when encoding the latent representation in parallel. Experiments show that our codec obtains 23.5% bitrate saving over previous SOTA NVC. Better yet, our codec has surpassed the under-developing next generation traditional codec/ECM in both RGB and YUV420 colorspaces, in terms of PSNR. The codes are at https://github.com/microsoft/DCVC.

Submitted to arXiv on 28 Feb. 2023

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2302.14402v1

This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

The paper titled "Neural Video Compression with Diverse Contexts" addresses the issue of coding efficiency in video codecs. It emphasizes the importance of finding relevant contexts from previous reconstructed signals for effective encoding. While traditional codecs have shown that more contexts lead to significant coding gains, they are time-consuming. On the other hand, emerging neural video codecs (NVC) have limited contexts, resulting in a low compression ratio. To address this limitation and enhance NVC, the authors propose increasing context diversity in both temporal and spatial dimensions. They introduce a hierarchical quality pattern learning approach that enables the model to capture long-term and high-quality temporal contexts across frames. Additionally, they leverage optical flow-based coding frameworks by introducing group-based offset diversity, which enhances context mining through cross-group interaction. Furthermore, the paper adopts a quadtree-based partitioning technique to increase spatial context diversity during parallel encoding of latent representations. Experimental results demonstrate that their codec achieves a 23.5% bitrate saving compared to previous state-of-the-art NVC approaches. Moreover, their codec outperforms next-generation traditional codecs/ECMs in terms of PSNR for both RGB and YUV420 colorspaces. The authors provide the implementation codes for their codec at https://github.com/microsoft/DCVC. The paper has been accepted by CVPR 2023.
Created on 03 Oct. 2023

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

The license of this specific paper does not allow us to build upon its content and the summarizing tools will be run using the paper metadata rather than the full article. However, it still does a good job, and you can also try our tools on papers with more open licenses.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.