Parser-Free Virtual Try-on via Distilling Appearance Flows

AI-generated keywords: Image Virtual Try-On, Knowledge Distillation, Human Parsing, Teacher-Tutor-Student, Appearance Flows

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • Traditional methods in the field of garment try-on rely heavily on human parsing, which can lead to unrealistic results with noticeable artifacts if segmentation is inaccurate.
  • A recent innovative approach reduces dependence on human parsing by using try-on images generated by a parser-based model to train a "student" network without relying on segmentation.
  • The limitation of this approach is that the image quality of the student network is constrained by the performance of the parser-based model.
  • The "teacher-tutor-student" technique has been proposed to overcome this limitation and aims to produce highly realistic images without relying on human parsing, offering several advantages over previous approaches.
  • This new approach treats fake images generated by the parser-based method as "tutor knowledge" and corrects them using real "teacher knowledge" extracted from actual person images in a self-supervised manner.
  • Instead of using real images as direct supervision, the focus is on distilling appearance flows between person and garment images to identify accurate dense correspondences and achieve high-quality results.
  • Extensive evaluations have shown significant superiority of this novel approach compared to existing methods, producing more realistic virtual try-on results and offering a robust and accurate solution without heavy reliance on human parsing techniques.
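
The appearance flows mentioned in the key points can be pictured as a dense field of 2D offsets: for every pixel of the output, the flow says where in the garment image to sample from. The following is a minimal sketch of that warping step using bilinear sampling in NumPy. It illustrates the general technique only; the function name `warp_with_flow`, the `(dx, dy)` pixel-offset convention, and the border clamping are assumptions made for this example and are not taken from the paper.

```python
import numpy as np


def warp_with_flow(source, flow):
    """Warp `source` (H, W, C) with a dense flow field `flow` (H, W, 2).

    Assumed convention (hypothetical, for illustration): flow[y, x] = (dx, dy)
    is the pixel offset from output location (x, y) to the location in
    `source` that should be sampled (backward warping).
    """
    h, w = source.shape[:2]
    ys, xs = np.meshgrid(np.arange(h), np.arange(w), indexing="ij")

    # Sampling coordinates, clamped so that lookups stay inside the image.
    sx = np.clip(xs + flow[..., 0], 0, w - 1)
    sy = np.clip(ys + flow[..., 1], 0, h - 1)

    # Integer corners surrounding each sampling location.
    x0 = np.floor(sx).astype(int)
    y0 = np.floor(sy).astype(int)
    x1 = np.minimum(x0 + 1, w - 1)
    y1 = np.minimum(y0 + 1, h - 1)

    # Bilinear interpolation weights, broadcast over the channel axis.
    wx = (sx - x0)[..., None]
    wy = (sy - y0)[..., None]

    top = (1 - wx) * source[y0, x0] + wx * source[y0, x1]
    bottom = (1 - wx) * source[y1, x0] + wx * source[y1, x1]
    return (1 - wy) * top + wy * bottom


# Toy usage: a zero flow leaves the garment unchanged, while a constant
# flow of dx = 1 makes each output pixel read from its right-hand neighbour.
garment = np.arange(4 * 4 * 3, dtype=float).reshape(4, 4, 3)
identity = warp_with_flow(garment, np.zeros((4, 4, 2)))
shift = np.zeros((4, 4, 2))
shift[..., 0] = 1.0
shifted = warp_with_flow(garment, shift)
```

Because the warp is a smooth function of the flow values, a network that predicts the flow can be trained end to end through the sampling step, which is why dense correspondences of this kind are a natural quantity to distill.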

Authors: Yuying Ge, Yibing Song, Ruimao Zhang, Chongjian Ge, Wei Liu, Ping Luo

Accepted by CVPR2021

Abstract: Image virtual try-on aims to fit a garment image (target clothes) to a person image. Prior methods are heavily based on human parsing. However, slightly-wrong segmentation results would lead to unrealistic try-on images with large artifacts. Inaccurate parsing misleads parser-based methods to produce visually unrealistic results where artifacts usually occur. A recent pioneering work employed knowledge distillation to reduce the dependency of human parsing, where the try-on images produced by a parser-based method are used as supervisions to train a "student" network without relying on segmentation, making the student mimic the try-on ability of the parser-based model. However, the image quality of the student is bounded by the parser-based model. To address this problem, we propose a novel approach, "teacher-tutor-student" knowledge distillation, which is able to produce highly photo-realistic images without human parsing, possessing several appealing advantages compared to prior arts. (1) Unlike existing work, our approach treats the fake images produced by the parser-based method as "tutor knowledge", where the artifacts can be corrected by real "teacher knowledge", which is extracted from the real person images in a self-supervised way. (2) Other than using real images as supervisions, we formulate knowledge distillation in the try-on problem as distilling the appearance flows between the person image and the garment image, enabling us to find accurate dense correspondences between them to produce high-quality results. (3) Extensive evaluations show large superiority of our method (see Fig. 1).
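
To make the "tutor knowledge corrected by teacher knowledge" idea more concrete, here is a small hedged sketch of one way such a correction could be expressed as a loss. The abstract does not give the actual loss, so everything here is an assumption for illustration: the function names `l1` and `gated_flow_distillation`, the squared-error penalty on flows, and the gating rule (distill from the tutor only when the tutor's image is closer to the real photo than the student's).

```python
import numpy as np


def l1(a, b):
    """Mean absolute difference between two arrays of the same shape."""
    return float(np.mean(np.abs(a - b)))


def gated_flow_distillation(student_flow, tutor_flow,
                            student_img, tutor_img, real_img):
    """Hypothetical flow-distillation penalty for a single sample.

    The real person image plays the role of "teacher knowledge": it decides
    whether the parser-based tutor is trustworthy for this sample. The
    student's flow is pulled toward the tutor's flow only when the tutor's
    try-on image is closer to the real image than the student's is.
    """
    tutor_is_better = l1(tutor_img, real_img) < l1(student_img, real_img)
    if not tutor_is_better:
        # Tutor output is judged unreliable here, so nothing is distilled.
        return 0.0
    # Mean squared distance between the two flow fields, summed over (dx, dy).
    return float(np.mean(np.sum((student_flow - tutor_flow) ** 2, axis=-1)))


# Toy usage with made-up arrays.
real = np.zeros((2, 2, 3))
good_tutor_img = real + 0.1
poor_student_img = real + 0.5
student_flow = np.zeros((2, 2, 2))
tutor_flow = np.ones((2, 2, 2))

loss_when_tutor_better = gated_flow_distillation(
    student_flow, tutor_flow, poor_student_img, good_tutor_img, real)
loss_when_tutor_worse = gated_flow_distillation(
    student_flow, tutor_flow, good_tutor_img, poor_student_img, real)
```

The design point of the sketch is that the student is never forced to imitate a tutor output that the real image shows to be worse, which is one way a student could end up better than the parser-based model it learns from.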

Submitted to arXiv on 08 Mar. 2021

Results of the summarizing process for the arXiv paper: 2103.04559v2

In the field of image virtual try-on, the goal is to seamlessly fit a garment image onto a person image. Traditional methods rely heavily on human parsing, which can lead to unrealistic results with noticeable artifacts if the segmentation is even slightly inaccurate. This issue arises because inaccurate parsing can mislead parser-based methods, resulting in visually unrealistic try-on images.

To address this challenge, a recent approach used knowledge distillation to reduce the dependence on human parsing. In this method, try-on images generated by a parser-based model are used as supervision to train a "student" network that does not rely on segmentation, so that the student learns to mimic the try-on ability of the parser-based model. One limitation of this approach is that the image quality of the student network is bounded by the performance of the parser-based model.

To overcome this limitation, a technique called "teacher-tutor-student" knowledge distillation has been proposed. It aims to produce highly realistic images without relying on human parsing and offers several advantages over previous approaches. First, it treats the fake images generated by the parser-based method as "tutor knowledge", whose artifacts can be corrected using real "teacher knowledge" extracted from actual person images in a self-supervised manner. Second, instead of using real images as direct supervision, knowledge distillation in this context focuses on distilling the appearance flows between the person image and the garment image. This makes it possible to find accurate dense correspondences between the two images, leading to high-quality results.

According to the abstract, extensive evaluations show that this approach is clearly superior to existing methods: it produces more realistic virtual try-on results and is more robust because it does not depend on human parsing.
Created on 27 Jul. 2024
