Street TryOn: Learning In-the-Wild Virtual Try-On from Unpaired Person Images

AI-generated keywords: Virtual Try-On Fashion Industry In-the-Wild Try-On StreetTryOn Benchmark Unpaired Data

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • Virtual try-on research has primarily focused on showcasing garments on studio models for the fashion industry in a cost-effective manner.
  • There is a growing recognition of the need to extend virtual try-on technology to enable customers to visualize clothing items on themselves using their own everyday photos (in-the-wild try-on).
  • Existing methods struggle with in-the-wild scenarios due to the reliance on paired data, which is more readily available for studio settings compared to diverse real-world scenes.
  • A team of researchers led by Aiyu Cui, Jay Mahajan, Viraj Shah, Preeti Gomathinayagam, Chang Liu, and Svetlana Lazebnik introduced a groundbreaking approach:
  • Established the StreetTryOn benchmark to support in-the-wild virtual try-on applications.
  • Proposed an innovative method that learns virtual try-on directly from unpaired person images taken in natural settings without paired data.
  • Utilized DensePose warping correction and diffusion-based conditional inpainting techniques to address challenges posed by in-the-wild scenarios.
  • Through extensive experiments, the team demonstrated competitive performance for traditional studio try-on tasks and state-of-the-art results for street try-on and cross-domain try-on tasks.
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Aiyu Cui, Jay Mahajan, Viraj Shah, Preeti Gomathinayagam, Chang Liu, Svetlana Lazebnik

The abstract and intro are updated. Some typos and some pdf rendering errors have been fixed in the version

Abstract: Most virtual try-on research is motivated to serve the fashion business by generating images to demonstrate garments on studio models at a lower cost. However, virtual try-on should be a broader application that also allows customers to visualize garments on themselves using their own casual photos, known as in-the-wild try-on. Unfortunately, the existing methods, which achieve plausible results for studio try-on settings, perform poorly in the in-the-wild context. This is because these methods often require paired images (garment images paired with images of people wearing the same garment) for training. While such paired data is easy to collect from shopping websites for studio settings, it is difficult to obtain for in-the-wild scenes. In this work, we fill the gap by (1) introducing a StreetTryOn benchmark to support in-the-wild virtual try-on applications and (2) proposing a novel method to learn virtual try-on from a set of in-the-wild person images directly without requiring paired data. We tackle the unique challenges, including warping garments to more diverse human poses and rendering more complex backgrounds faithfully, by a novel DensePose warping correction method combined with diffusion-based conditional inpainting. Our experiments show competitive performance for standard studio try-on tasks and SOTA performance for street try-on and cross-domain try-on tasks.

Submitted to arXiv on 27 Nov. 2023

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2311.16094v3

This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

In the realm of virtual try-on research, the primary focus has been on catering to the fashion industry's needs by creating images that showcase garments on studio models in a cost-effective manner. However, there is a growing recognition that virtual try-on technology should extend beyond this narrow scope to enable customers to visualize clothing items on themselves using their own everyday photos, a concept known as in-the-wild try-on. The challenge lies in the fact that existing methods, which excel in generating realistic results for studio settings, struggle when applied to in-the-wild scenarios due to their reliance on paired data (garment images matched with images of individuals wearing those garments) for training. While obtaining such paired data is relatively straightforward for studio environments through shopping websites, it proves to be a significant hurdle in capturing diverse and uncontrolled real-world scenes. To address this gap, a team of researchers led by Aiyu Cui, Jay Mahajan, Viraj Shah, Preeti Gomathinayagam, Chang Liu, and Svetlana Lazebnik introduces a groundbreaking approach. They first establish the StreetTryOn benchmark aimed at supporting in-the-wild virtual try-on applications. Subsequently, they propose an innovative method that learns virtual try-on directly from a collection of unpaired person images taken in natural settings without the need for paired data. This novel technique tackles the unique challenges posed by in-the-wild scenarios by employing a DensePose warping correction method coupled with diffusion-based conditional inpainting. Through extensive experiments, the team demonstrates not only competitive performance for traditional studio try-on tasks but also state-of-the-art results for street try-on and cross-domain try-on tasks. In essence, this research represents a significant step forward in expanding the capabilities of virtual try-on technology beyond controlled studio environments towards more versatile and practical applications that cater to real-world scenarios where customers can seamlessly visualize clothing items on themselves using their own photographs.
Created on 27 Jul. 2024

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

The license of this specific paper does not allow us to build upon its content and the summarizing tools will be run using the paper metadata rather than the full article. However, it still does a good job, and you can also try our tools on papers with more open licenses.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.