SketchyGAN: Towards Diverse and Realistic Sketch to Image Synthesis

AI-generated keywords: Computer graphics

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • Synthesizing realistic images from human-drawn sketches is a challenging task in computer graphics and vision.
  • Existing approaches require exact edge maps or rely on retrieving existing photographs.
  • "SketchyGAN: Towards Diverse and Realistic Sketch to Image Synthesis" proposes a novel approach using Generative Adversarial Networks (GANs).
  • The proposed GAN model generates plausible images from 50 different categories, including motorcycles, horses, and couches.
  • A fully automatic data augmentation technique for sketches significantly improves the performance of the GAN model.
  • A new network building block enhances information flow by injecting the input image at multiple scales.
  • Compared to state-of-the-art image translation methods, the authors' approach generates more realistic images and achieves significantly higher Inception Scores.
  • The study presents an innovative solution leveraging GANs, data augmentation, and network building blocks for synthesizing realistic images from human-drawn sketches.
  • Chen and Hays demonstrate significant improvements over existing methods in terms of image realism and quality.
  • The research has important implications for various applications in computer graphics and vision fields.
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Wengling Chen, James Hays

Accepted to CVPR 2018

Abstract: Synthesizing realistic images from human drawn sketches is a challenging problem in computer graphics and vision. Existing approaches either need exact edge maps, or rely on retrieval of existing photographs. In this work, we propose a novel Generative Adversarial Network (GAN) approach that synthesizes plausible images from 50 categories including motorcycles, horses and couches. We demonstrate a data augmentation technique for sketches which is fully automatic, and we show that the augmented data is helpful to our task. We introduce a new network building block suitable for both the generator and discriminator which improves the information flow by injecting the input image at multiple scales. Compared to state-of-the-art image translation methods, our approach generates more realistic images and achieves significantly higher Inception Scores.

Submitted to arXiv on 09 Jan. 2018

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 1801.02753v2

This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

, , , , In the field of computer graphics and vision, synthesizing realistic images from human-drawn sketches has always been a challenging task. Existing approaches either require exact edge maps or rely on retrieving existing photographs. However, in this study titled "SketchyGAN: Towards Diverse and Realistic Sketch to Image Synthesis," authors Wengling Chen and James Hays propose a novel approach using Generative Adversarial Networks (GANs) to generate plausible images from 50 different categories, including motorcycles, horses, and couches. One key contribution of this work is the demonstration of a fully automatic data augmentation technique for sketches. The authors show that augmenting the data in this way significantly improves the performance of their proposed GAN model. Additionally, they introduce a new network building block that can be used in both the generator and discriminator components of the GAN. This building block enhances information flow by injecting the input image at multiple scales. Compared to state-of-the-art image translation methods, the authors' approach generates more realistic images and achieves significantly higher Inception Scores. The Inception Score is a widely used metric for evaluating the quality and diversity of generated images. Overall, this study presents an innovative solution to the problem of synthesizing realistic images from human-drawn sketches by leveraging GANs and introducing novel techniques such as data augmentation and network building blocks. Chen and Hays demonstrate significant improvements over existing methods in terms of image realism and quality. This research has important implications for various applications in computer graphics and vision fields.
Created on 23 Jan. 2024

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

The license of this specific paper does not allow us to build upon its content and the summarizing tools will be run using the paper metadata rather than the full article. However, it still does a good job, and you can also try our tools on papers with more open licenses.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.