Learning Continuous Image Representation with Local Implicit Image Function

AI-generated keywords: Computer Vision

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • Representing images in a continuous manner is challenging due to machines processing visual information using 2D arrays of pixels
  • Recent advancements in 3D reconstruction with implicit neural representation have led to the development of Local Implicit Image Function (LIIF)
  • LIIF involves using image coordinates and surrounding deep features to predict RGB values, allowing for arbitrary resolution presentation
  • An encoder trained with LIIF representation through self-supervised tasks like super-resolution enhances continuous image representation
  • LIIF can extrapolate learned representations to x30 higher resolution levels even without explicit training tasks
  • LIIF bridges the gap between discrete and continuous representations in 2D, supporting learning tasks with size-varied image ground-truths
  • Authors Yinbo Chen, Sifei Liu, and Xiaolong Wang's study on LIIF's efficacy has been recognized at CVPR 2021 (oral) and includes a project page for further exploration
  • LIIF outperforms methods involving resizing ground-truths, showcasing its potential to revolutionize image representation and processing in computer vision applications
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Yinbo Chen, Sifei Liu, Xiaolong Wang

CVPR 2021 (oral). Project page with videos and code: https://yinboc.github.io/liif/

Abstract: How to represent an image? While the visual world is presented in a continuous manner, machines store and see the images in a discrete way with 2D arrays of pixels. In this paper, we seek to learn a continuous representation for images. Inspired by the recent progress in 3D reconstruction with implicit neural representation, we propose Local Implicit Image Function (LIIF), which takes an image coordinate and the 2D deep features around the coordinate as inputs, predicts the RGB value at a given coordinate as an output. Since the coordinates are continuous, LIIF can be presented in arbitrary resolution. To generate the continuous representation for images, we train an encoder with LIIF representation via a self-supervised task with super-resolution. The learned continuous representation can be presented in arbitrary resolution even extrapolate to x30 higher resolution, where the training tasks are not provided. We further show that LIIF representation builds a bridge between discrete and continuous representation in 2D, it naturally supports the learning tasks with size-varied image ground-truths and significantly outperforms the method with resizing the ground-truths.

Submitted to arXiv on 16 Dec. 2020

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2012.09161v2

This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

In the realm of computer vision, representing images in a continuous manner poses a challenge due to machines storing and processing visual information in a discrete fashion using 2D arrays of pixels. Seeking to bridge this gap, recent advancements in 3D reconstruction with implicit neural representation have inspired the development of the Local Implicit Image Function (LIIF). This innovative approach involves taking an image coordinate and surrounding 2D deep features as inputs to predict the RGB value at that specific coordinate, allowing for arbitrary resolution presentation. To further enhance the continuous representation of images, an encoder is trained with LIIF representation through a self-supervised task involving super-resolution. The resulting learned continuous representation can be extrapolated to x30 higher resolution levels even when training tasks are not explicitly provided. Notably, LIIF serves as a pivotal link between discrete and continuous representations in 2D, enabling support for learning tasks with size-varied image ground-truths. The study conducted by authors Yinbo Chen, Sifei Liu, and Xiaolong Wang delves into the efficacy of LIIF representation in enhancing image processing tasks. Their work has been recognized at CVPR 2021 (oral) and includes a project page featuring videos and code for further exploration. By demonstrating superior performance compared to methods involving resizing ground-truths, LIIF showcases its potential to revolutionize how images are represented and processed in computer vision applications.
Created on 01 May. 2024

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

The license of this specific paper does not allow us to build upon its content and the summarizing tools will be run using the paper metadata rather than the full article. However, it still does a good job, and you can also try our tools on papers with more open licenses.

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.