Transfer learning for galaxy feature detection: Finding Giant Star-forming Clumps in low redshift galaxies using Faster R-CNN
AI-generated Key Points
- Researchers use Deep Learning (DL) techniques to detect Giant Star-forming Clumps (GSFCs) in astrophysical imaging data.
- GSFCs are regions of intense star formation observed in high-redshift galaxies, with their formation and impact on galaxy evolution not well understood.
- Faster R-CNN object detection framework (FRCNN) is applied to detect GSFCs in low redshift galaxies (z<0.3).
- Training FRCNN models on authentic datasets demonstrates the feasibility of using DL-based object detection for GSFC localization.
- CNNs pre-trained for image classification using astrophysical images outperform those trained on terrestrial images.
- The domain-specific CNN named 'Zoobot' shows higher detection performance and requires smaller training datasets compared to generic classification backbones.
- The final model achieves a completeness and purity of >=0.8 in detecting GSFCs while being trained on approximately 5,000 galaxy images.
- Transfer learning with FRCNN models has potential in automatically identifying and localizing specific features like GSFCs in astrophysical imaging data, enhancing the accuracy of studying star formation processes within galaxies.
Authors: Jürgen Popp, Hugh Dickinson, Stephen Serjeant, Mike Walmsley, Dominic Adams, Kameswara Mantha, Vihang Mehta, James Dawson
Abstract: Giant Star-forming Clumps (GSFCs) are areas of intensive star-formation that are commonly observed in high-redshift (z>1) galaxies but their formation and role in galaxy evolution remain unclear. High-resolution observations of low-redshift clumpy galaxy analogues are rare and restricted to a limited set of galaxies but the increasing availability of wide-field galaxy survey data makes the detection of large clumpy galaxy samples increasingly feasible. Deep Learning, and in particular CNNs, have been successfully applied to image classification tasks in astrophysical data analysis. However, one application of DL that remains relatively unexplored is that of automatically identifying and localising specific objects or features in astrophysical imaging data. In this paper we demonstrate the feasibility of using Deep learning-based object detection models to localise GSFCs in astrophysical imaging data. We apply the Faster R-CNN object detection framework (FRCNN) to identify GSFCs in low redshift (z<0.3) galaxies. Unlike other studies, we train different FRCNN models not on simulated images with known labels but on real observational data that was collected by the Sloan Digital Sky Survey Legacy Survey and labelled by volunteers from the citizen science project `Galaxy Zoo: Clump Scout'. The FRCNN model relies on a CNN component as a `backbone' feature extractor. We show that CNNs, that have been pre-trained for image classification using astrophysical images, outperform those that have been pre-trained on terrestrial images. In particular, we compare a domain-specific CNN -`Zoobot' - with a generic classification backbone and find that Zoobot achieves higher detection performance and also requires smaller training data sets to do so. Our final model is capable of producing GSFC detections with a completeness and purity of >=0.8 while only being trained on ~5,000 galaxy images.
Ask questions about this paper to our AI assistant
You can also chat with multiple papers at once here.
Assess the quality of the AI-generated content by voting
Score: 0
Why do we need votes?
Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.
The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.
Look for similar papers (in beta version)
By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.
Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.