Self-training with Noisy Student improves ImageNet classification

AI-generated keywords: Self-training Noisy student ImageNet classification Robustness testing EfficientNet

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • Authors: Qizhe Xie, Eduard Hovy, Minh-Thang Luong, Quoc V. Le
  • Self-training method significantly enhances ImageNet classification accuracy
  • Achieves 87.4% top-1 accuracy, surpassing state-of-the-art model by 1.0%
  • Improvements on robustness test sets:
  • ImageNet-A top-1 accuracy increased from 16.6% to 74.2%
  • Mean corruption error on ImageNet-C reduced from 45.7 to 31.2
  • Mean flip rate on ImageNet-P decreased from 27.8 to 16.1
  • Methodology involves training an EfficientNet teacher model initially on labeled images and using it to generate pseudo labels for unlabeled images
  • Larger EfficientNet student model is trained using both labeled and pseudo-labeled data in combination
  • Iterative process continues with student model becoming the new teacher in subsequent rounds of training
  • No noise introduced during pseudo label generation; noise deliberately injected during student learning phase for improved performance based on noisy labels
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Qizhe Xie, Eduard Hovy, Minh-Thang Luong, Quoc V. Le

Abstract: We present a simple self-training method that achieves 87.4% top-1 accuracy on ImageNet, which is 1.0% better than the state-of-the-art model that requires 3.5B weakly labeled Instagram images. On robustness test sets, it improves ImageNet-A top-1 accuracy from 16.6% to 74.2%, reduces ImageNet-C mean corruption error from 45.7 to 31.2, and reduces ImageNet-P mean flip rate from 27.8 to 16.1. To achieve this result, we first train an EfficientNet model on labeled ImageNet images and use it as a teacher to generate pseudo labels on 300M unlabeled images. We then train a larger EfficientNet as a student model on the combination of labeled and pseudo labeled images. We iterate this process by putting back the student as the teacher. During the generation of the pseudo labels, the teacher is not noised so that the pseudo labels are as good as possible. But during the learning of the student, we inject noise such as data augmentation, dropout, stochastic depth to the student so that the noised student is forced to learn harder from the pseudo labels.

Submitted to arXiv on 11 Nov. 2019

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 1911.04252v1

This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

In their paper titled "Self-training with Noisy Student improves ImageNet classification," authors Qizhe Xie, Eduard Hovy, Minh-Thang Luong, and Quoc V. Le introduce a simple self-training method that significantly enhances image classification accuracy on the ImageNet dataset. The proposed approach achieves an impressive 87.4% top-1 accuracy, surpassing the state-of-the-art model by 1.0%, which relies on a massive dataset of 3.5 billion weakly labeled Instagram images. Notably, the self-training method demonstrates remarkable improvements on robustness test sets as well. It elevates the ImageNet-A top-1 accuracy from 16.6% to an impressive 74.2%. Furthermore, it effectively reduces the mean corruption error on ImageNet-C from 45.7 to 31.2 and decreases the mean flip rate on ImageNet-P from 27.8 to 16.1. The methodology involves training an EfficientNet model initially on labeled ImageNet images, using it as a teacher to generate pseudo labels for a vast set of 300 million unlabeled images. Subsequently, a larger EfficientNet model is trained as a student using both labeled and pseudo-labeled data in combination. This iterative process continues by utilizing the student model as the new teacher in subsequent rounds of training. Notably, during the generation of pseudo labels, no noise is introduced to ensure high-quality labels are produced by the teacher model. However, during the learning phase of the student model, various forms of noise such as data augmentation, dropout techniques, and stochastic depth are injected deliberately to challenge and enhance the learning process for improved performance based on noisy labels. Overall, this innovative self-training approach with a noisy student not only achieves superior performance on ImageNet classification but also showcases significant advancements in robustness testing scenarios compared to existing state-of-the-art models in image recognition tasks.
Created on 11 Mar. 2024

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

The license of this specific paper does not allow us to build upon its content and the summarizing tools will be run using the paper metadata rather than the full article. However, it still does a good job, and you can also try our tools on papers with more open licenses.

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.