ICON: Implicit Clothed humans Obtained from Normals

AI-generated keywords: 3D clothed avatars, implicit functions, local features, SMPL(-X) body model, animatable avatar

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • Authors address limitations of current methods for learning 3D clothed avatars
  • Proposed method uses implicit functions to capture intricate details like hair and clothing
  • Introduces ICON framework leveraging local features for diverse human poses
  • Combines multiple reconstructed frames of a subject with SCANimate to create an animatable avatar
  • Evaluation on AGORA and CAPE datasets shows ICON outperforms the state of the art in reconstruction and is more robust to out-of-distribution samples
  • Takes a step towards robust 3D clothed human reconstruction from in-the-wild images

Authors: Yuliang Xiu, Jinlong Yang, Dimitrios Tzionas, Michael J. Black

21 pages, 18 figures, 7 tables. Project page: https://github.com/YuliangXiu/ICON

Abstract: Current methods for learning realistic and animatable 3D clothed avatars need either posed 3D scans or 2D images with carefully controlled user poses. In contrast, our goal is to learn the avatar from only 2D images of people in unconstrained poses. Given a set of images, our method estimates a detailed 3D surface from each image and then combines these into an animatable avatar. Implicit functions are well suited to the first task, as they can capture details like hair or clothes. Current methods, however, are not robust to varied human poses and often produce 3D surfaces with broken or disembodied limbs, missing details, or non-human shapes. The problem is that these methods use global feature encoders that are sensitive to global pose. To address this, we propose ICON ("Implicit Clothed humans Obtained from Normals"), which uses local features, instead. ICON has two main modules, both of which exploit the SMPL(-X) body model. First, ICON infers detailed clothed-human normals (front/back) conditioned on the SMPL(-X) normals. Second, a visibility-aware implicit surface regressor produces an iso-surface of a human occupancy field. Importantly, at inference time, a feedback loop alternates between refining the SMPL(-X) mesh using the inferred clothed normals and then refining the normals. Given multiple reconstructed frames of a subject in varied poses, we use SCANimate to produce an animatable avatar from them. Evaluation on the AGORA and CAPE datasets shows that ICON outperforms the state of the art in reconstruction, even with heavily limited training data. Additionally, it is much more robust to out-of-distribution samples, e.g., in-the-wild poses/images and out-of-frame cropping. ICON takes a step towards robust 3D clothed human reconstruction from in-the-wild images. This enables creating avatars directly from video with personalized and natural pose-dependent cloth deformation.
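The abstract mentions an implicit surface regressor that produces an iso-surface of a human occupancy field. As a rough illustration of that idea only, the sketch below replaces the learned regressor with a hypothetical analytic occupancy function (a soft sphere) and locates the 0.5 level set by linear interpolation along grid edges. The names `occupancy` and `iso_surface_points`, the grid resolution, and the 0.5 threshold are assumptions made for this example and are not taken from the paper or its code.

```python
import numpy as np


def occupancy(points, center=(0.0, 0.0, 0.0), radius=0.5, sharpness=20.0):
    """Toy occupancy field: close to 1 inside a sphere, close to 0 outside.

    This is a stand-in for a learned regressor, not ICON's network.
    """
    dist = np.linalg.norm(points - np.asarray(center), axis=-1)
    return 1.0 / (1.0 + np.exp(sharpness * (dist - radius)))


def iso_surface_points(resolution=32, level=0.5):
    """Return points where the field crosses `level` along x-aligned grid edges."""
    axis = np.linspace(-1.0, 1.0, resolution)
    grid = np.stack(np.meshgrid(axis, axis, axis, indexing="ij"), axis=-1)
    occ = occupancy(grid)

    # Compare each sample with its neighbour along the x axis.
    a, b = occ[:-1], occ[1:]
    crossing = (a - level) * (b - level) < 0

    # Linearly interpolate the crossing position on each such edge.
    t = (level - a[crossing]) / (b[crossing] - a[crossing])
    p0, p1 = grid[:-1][crossing], grid[1:][crossing]
    return p0 + t[:, None] * (p1 - p0)


surface = iso_surface_points()
radii = np.linalg.norm(surface, axis=-1)
print(surface.shape[0], "surface samples; mean radius", radii.mean())
```

A full pipeline would typically run a meshing algorithm such as marching cubes over the whole grid rather than sampling crossings along a single axis; the single-axis version is used here only to keep the example short.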

Submitted to arXiv on 16 Dec. 2021

Results of the summarizing process for the arXiv paper: 2112.09127v1

In the paper "ICON: Implicit Clothed humans Obtained from Normals," authors Yuliang Xiu, Jinlong Yang, Dimitrios Tzionas, and Michael J. Black address the limitations of current methods for learning realistic and animatable 3D clothed avatars, which need either posed 3D scans or 2D images with carefully controlled user poses. The proposed method uses implicit functions to capture details such as hair and clothing when estimating a 3D surface from each image. Because existing approaches rely on global feature encoders that are sensitive to global pose, the authors introduce ICON, a framework that uses local features instead. ICON first infers front and back clothed-human normals conditioned on the SMPL(-X) body model, then regresses a visibility-aware implicit surface; at inference time, a feedback loop alternates between refining the SMPL(-X) mesh and refining the normals. Given multiple reconstructed frames of a subject in varied poses, SCANimate is used to produce an animatable avatar. Evaluation on the AGORA and CAPE datasets shows that ICON outperforms the state of the art in reconstruction, even with heavily limited training data, and is much more robust to out-of-distribution samples such as in-the-wild poses/images and out-of-frame cropping. The approach takes a step towards robust 3D clothed human reconstruction from in-the-wild images.
Created on 12 Jul. 2024

