SCULPT: Shape-Conditioned Unpaired Learning of Pose-dependent Clothed and Textured Human Meshes

AI-generated keywords: 3D generative model clothed and textured human meshes unpaired learning pose-dependent geometry space visual question answering models

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • Authors introduce SCULPT model for creating pose-dependent clothed and textured human meshes
  • Model uses deep neural network to capture geometry and appearance distribution of clothed human bodies
  • Challenges in training due to limited availability of datasets addressed by leveraging medium-sized 3D scan datasets like CAPE and large-scale 2D image datasets
  • Proposed approach involves developing a geometry conditioned texture generator using both 3D scan data and 2D image data
  • Attribute labels such as clothing types for geometry and clothing colors for texture generation used to disentangle pose from clothing type and appearance
  • Conditioning labels automatically generated for 2D images based on visual question answering models BLIP and CLIP
  • Method validated on SCULPT dataset and compared against state-of-the-art 3D generative models for clothed human bodies
  • Codebase to be released for research purposes to facilitate further exploration in the field
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Soubhik Sanyal, Partha Ghosh, Jinlong Yang, Michael J. Black, Justus Thies, Timo Bolkart

Abstract: We present SCULPT, a novel 3D generative model for clothed and textured 3D meshes of humans. Specifically, we devise a deep neural network that learns to represent the geometry and appearance distribution of clothed human bodies. Training such a model is challenging, as datasets of textured 3D meshes for humans are limited in size and accessibility. Our key observation is that there exist medium-sized 3D scan datasets like CAPE, as well as large-scale 2D image datasets of clothed humans and multiple appearances can be mapped to a single geometry. To effectively learn from the two data modalities, we propose an unpaired learning procedure for pose-dependent clothed and textured human meshes. Specifically, we learn a pose-dependent geometry space from 3D scan data. We represent this as per vertex displacements w.r.t. the SMPL model. Next, we train a geometry conditioned texture generator in an unsupervised way using the 2D image data. We use intermediate activations of the learned geometry model to condition our texture generator. To alleviate entanglement between pose and clothing type, and pose and clothing appearance, we condition both the texture and geometry generators with attribute labels such as clothing types for the geometry, and clothing colors for the texture generator. We automatically generated these conditioning labels for the 2D images based on the visual question answering model BLIP and CLIP. We validate our method on the SCULPT dataset, and compare to state-of-the-art 3D generative models for clothed human bodies. We will release the codebase for research purposes.

Submitted to arXiv on 21 Aug. 2023

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2308.10638v1

This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

In their paper titled "SCULPT: Shape-Conditioned Unpaired Learning of Pose-dependent Clothed and Textured Human Meshes," authors Soubhik Sanyal, Partha Ghosh, Jinlong Yang, Michael J. Black, Justus Thies, and Timo Bolkart introduce a groundbreaking for creating . The model utilizes a deep neural network to capture the geometry and appearance distribution of clothed human bodies. One of the main challenges faced in training such a model is the limited availability of datasets containing textured 3D meshes for humans. To address this issue, the authors leverage medium-sized 3D scan datasets like CAPE and large-scale 2D image datasets of clothed humans, demonstrating that multiple appearances can be mapped to a single geometry. To effectively learn from these two data modalities, the authors propose an approach for . They first develop a using 3D scan data by representing it as per vertex displacements with respect to the SMPL model. Subsequently, they train a geometry conditioned texture generator in an unsupervised manner using 2D image data. Intermediate activations from the learned geometry model are used to condition the texture generator. To disentangle pose from clothing type and appearance, attribute labels such as clothing types for geometry and clothing colors for texture generation are employed as conditioning factors. Furthermore, the authors automatically generate these conditioning labels for 2D images based on the visual question answering models BLIP and CLIP. The proposed method is validated on the SCULPT dataset and compared against state-of-the-art 3D generative models for clothed human bodies. The authors plan to release their codebase for research purposes to facilitate further exploration in this field. Overall, this innovative approach presents a significant advancement in generating realistic clothed and textured human meshes through , showcasing potential applications in various fields such as virtual try-on systems, gaming industry, virtual reality environments, and more.
Created on 14 Jun. 2024

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

The license of this specific paper does not allow us to build upon its content and the summarizing tools will be run using the paper metadata rather than the full article. However, it still does a good job, and you can also try our tools on papers with more open licenses.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.