ViP-NeRF: Visibility Prior for Sparse Input Neural Radiance Fields

AI-generated keywords: Neural Radiance Fields Visibility Prior View Synthesis Sparse Input Views Plane Sweep Volumes

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • The paper presents a novel approach called ViP-NeRF to address the limitations of Neural Radiance Fields (NeRFs) in synthesizing photo-realistic novel views.
  • NeRFs require a large number of images per scene, making them computationally expensive and impractical for many applications.
  • Training NeRFs on sparse input views often leads to overfitting and incorrect scene depth estimation, resulting in artifacts in rendered novel views.
  • Previous approaches have used dense depth estimated from pre-trained networks as supervision, but these depth priors may be inaccurate due to generalization issues.
  • The authors propose using the visibility of pixels in different input views as more reliable dense supervision by computing a visibility prior using plane sweep volumes.
  • By incorporating this visibility prior into the NeRF training process, they successfully train NeRFs with only a few input views.
  • They reformulate the NeRF model to directly output the visibility of a 3D point from a given viewpoint, reducing training time while enforcing the visibility constraint.
  • The proposed ViP-NeRF model outperforms existing sparse input NeRF models on multiple datasets.
  • ViP-NeRF leverages visibility priors instead of depth priors and reduces computational requirements for view synthesis using sparse input views.
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Nagabhushan Somraj, Rajiv Soundararajan

ACM SIGGRAPH 2023 Conference Proceedings, Article 71, Pages 1-11
SIGGRAPH 2023

Abstract: Neural radiance fields (NeRF) have achieved impressive performances in view synthesis by encoding neural representations of a scene. However, NeRFs require hundreds of images per scene to synthesize photo-realistic novel views. Training them on sparse input views leads to overfitting and incorrect scene depth estimation resulting in artifacts in the rendered novel views. Sparse input NeRFs were recently regularized by providing dense depth estimated from pre-trained networks as supervision, to achieve improved performance over sparse depth constraints. However, we find that such depth priors may be inaccurate due to generalization issues. Instead, we hypothesize that the visibility of pixels in different input views can be more reliably estimated to provide dense supervision. In this regard, we compute a visibility prior through the use of plane sweep volumes, which does not require any pre-training. By regularizing the NeRF training with the visibility prior, we successfully train the NeRF with few input views. We reformulate the NeRF to also directly output the visibility of a 3D point from a given viewpoint to reduce the training time with the visibility constraint. On multiple datasets, our model outperforms the competing sparse input NeRF models including those that use learned priors. The source code for our model can be found on our project page: https://nagabhushansn95.github.io/publications/2023/ViP-NeRF.html.

Submitted to arXiv on 28 Apr. 2023

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2305.0041v1

This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

The paper titled "ViP-NeRF: Visibility Prior for Sparse Input Neural Radiance Fields" presents a novel approach to address the limitations of Neural Radiance Fields (NeRFs) in synthesizing photo-realistic novel views. NeRFs have shown impressive performance in view synthesis by encoding neural representations of a scene. However, they require a large number of images per scene, making them computationally expensive and impractical for many applications. One major challenge with NeRFs is that training them on sparse input views often leads to overfitting and incorrect scene depth estimation, resulting in artifacts in the rendered novel views. To overcome this limitation, previous approaches have used dense depth estimated from pre-trained networks as supervision to regularize the training process. However, these depth priors may be inaccurate due to generalization issues. In this paper, the authors propose an alternative solution by hypothesizing that the visibility of pixels in different input views can provide more reliable dense supervision. They compute a visibility prior using plane sweep volumes which does not require any pre-training. By incorporating this visibility prior into the NeRF training process they successfully train NeRFs with only a few input views. Furthermore, the authors reformulate the NeRF model to directly output the visibility of a 3D point from a given viewpoint which reduces the training time while enforcing the visibility constraint. The proposed model outperforms existing sparse input NeRF models including those that use learned priors on multiple datasets. Overall, this paper introduces ViP-NeRF as an effective method for improving view synthesis using sparse input views by leveraging visibility priors instead of depth priors and reducing computational requirements. The source code for their model is available on their project page for further exploration and implementation.
Created on 05 Nov. 2023

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

The license of this specific paper does not allow us to build upon its content and the summarizing tools will be run using the paper metadata rather than the full article. However, it still does a good job, and you can also try our tools on papers with more open licenses.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.