CP2: Copy-Paste Contrastive Pretraining for Semantic Segmentation
AI-generated Key Points
⚠The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.
- Recent advances in self-supervised contrastive learning for image-level representation learning
- Limitation of overlooking pixel-level detailed information in dense prediction tasks
- Introduction of CP2 (Copy-Paste Contrastive Pretraining) method
- CP2 facilitates image and pixel-level representation learning
- Copying foreground from an image and pasting onto various backgrounds
- Pretraining semantic segmentation model with two objectives: distinguishing foreground from background pixels and identifying composed images with the same foreground
- Strong performance of CP2 on downstream semantic segmentation tasks
- Impressive results achieved with fine-tuned CP2 pretrained models on PASCAL VOC 2012 dataset (ResNet-50: 78.6% mIoU, ViT-S: 79.5% mIoU)
- Addressing limitations of existing self-supervised contrastive learning methods in dense prediction tasks
Authors: Feng Wang, Huiyu Wang, Chen Wei, Alan Yuille, Wei Shen
Abstract: Recent advances in self-supervised contrastive learning yield good image-level representation, which favors classification tasks but usually neglects pixel-level detailed information, leading to unsatisfactory transfer performance to dense prediction tasks such as semantic segmentation. In this work, we propose a pixel-wise contrastive learning method called CP2 (Copy-Paste Contrastive Pretraining), which facilitates both image- and pixel-level representation learning and therefore is more suitable for downstream dense prediction tasks. In detail, we copy-paste a random crop from an image (the foreground) onto different background images and pretrain a semantic segmentation model with the objective of 1) distinguishing the foreground pixels from the background pixels, and 2) identifying the composed images that share the same foreground.Experiments show the strong performance of CP2 in downstream semantic segmentation: By finetuning CP2 pretrained models on PASCAL VOC 2012, we obtain 78.6% mIoU with a ResNet-50 and 79.5% with a ViT-S.
Ask questions about this paper to our AI assistant
You can also chat with multiple papers at once here.
⚠The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.
Assess the quality of the AI-generated content by voting
Score: 0
Why do we need votes?
Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.
The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.
⚠The license of this specific paper does not allow us to build upon its content and the summarizing tools will be run using the paper metadata rather than the full article. However, it still does a good job, and you can also try our tools on papers with more open licenses.
Similar papers summarized with our AI tools
Navigate through even more similar papers through a
tree representationLook for similar papers (in beta version)
By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.
Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.