ImageNet-trained CNNs are biased towards texture; increasing shape bias improves accuracy and robustness

AI-generated keywords: ImageNet-trained CNNs texture bias shape bias object recognition machine learning

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • Study titled "ImageNet-trained CNNs are biased towards texture; increasing shape bias improves accuracy and robustness"
  • Researchers evaluated CNNs and human observers on images with a texture-shape cue conflict
  • ImageNet-trained CNNs exhibit a strong bias towards recognizing textures over shapes
  • Training the network on a stylized version of ImageNet can shift its representation from texture-based to shape-based
  • Shape-based representation leads to improved object detection and enhanced robustness against image distortions
  • Nine experiments totaling 48,560 psychophysical trials across 97 observers in a well-controlled lab setting
  • Advantages of adopting a shape-based representation in CNNs for accurate and robust object recognition tasks
  • Study currently under review at ICLR 2019 with favorable scores (8, 8, 7)
  • Implications for improving machine learning algorithms
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Robert Geirhos, Patricia Rubisch, Claudio Michaelis, Matthias Bethge, Felix A. Wichmann, Wieland Brendel

Under review at ICLR 2019 (review scores 8,8,7)

Abstract: Convolutional Neural Networks (CNNs) are commonly thought to recognise objects by learning increasingly complex representations of object shapes. Some recent studies hint to a more important role of image textures. We here put these conflicting hypotheses to a quantitative test by evaluating CNNs and human observers on images with a texture-shape cue conflict. We show that ImageNet-trained CNNs are strongly biased towards recognising textures rather than shapes, which is in stark contrast to human behavioural evidence and reveals fundamentally different classification strategies. We then demonstrate that the same standard architecture (ResNet-50) that learns a texture-based representation on ImageNet is able to learn a shape-based representation instead when trained on "Stylized-ImageNet", a stylized version of ImageNet. This provides a much better fit for human behavioural performance in our well-controlled psychophysical lab setting (nine experiments totalling 48,560 psychophysical trials across 97 observers) and comes with a number of unexpected emergent benefits such as improved object detection performance and previously unseen robustness towards a wide range of image distortions, highlighting advantages of a shape-based representation.

Submitted to arXiv on 29 Nov. 2018

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 1811.12231v1

This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

In their study titled "ImageNet-trained CNNs are biased towards texture; increasing shape bias improves accuracy and robustness," authors Robert Geirhos, Patricia Rubisch, Claudio Michaelis, Matthias Bethge, Felix A. Wichmann, and Wieland Brendel delve into the classification strategies of Convolutional Neural Networks (CNNs) in recognizing objects. The researchers evaluated CNNs and human observers on images with a texture-shape cue conflict to test conflicting hypotheses about the role of textures and shapes in object recognition. Their findings reveal that ImageNet-trained CNNs exhibit a strong bias towards recognizing textures over shapes, highlighting fundamentally different classification strategies employed by machines compared to humans. However, the authors demonstrate that training the network on a stylized version of ImageNet can shift its representation from texture-based to shape-based. This not only aligns more closely with human performance but also leads to improved object detection and enhanced robustness against image distortions. Through nine experiments totaling 48,560 psychophysical trials across 97 observers in a well-controlled lab setting, the study showcases the advantages of adopting a shape-based representation in CNNs for accurate and robust object recognition tasks. Currently under review at ICLR 2019 with favorable scores (8, 8, 7), this study sheds light on the mechanisms underlying object recognition in neural networks and has implications for improving machine learning algorithms.
Created on 18 Feb. 2025

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.