You Only Look Twice: Rapid Multi-Scale Object Detection In Satellite Imagery

AI-generated keywords: YOLT, Satellite Imagery, Deep Learning, Object Detection, Resolution

AI-generated Key Points

The license of the paper does not allow us to build upon its content, so the key points are generated from the paper metadata rather than the full article.

  • Detection of small objects in satellite imagery is challenging
  • Deep learning approaches have advanced ground-based object detection
  • Applying these techniques to overhead imagery is not straightforward due to the large number of pixels and geographic extent per image
  • Objects of interest in satellite imagery are often minuscule (only ~10 pixels in extent), which complicates traditional computer vision techniques
  • The author proposes a pipeline called You Only Look Twice (YOLT) for rapidly evaluating satellite images of any size
  • YOLT can detect objects of various scales using relatively little training data across multiple sensors
  • YOLT achieves F1 > 0.8 for vehicle localization on large test images evaluated at native resolution
  • Tests at decreasing resolution indicate that objects only ~5 pixels in size can still be localized with high confidence
  • YOLT offers a way to efficiently detect small objects in satellite imagery by leveraging deep learning techniques
  • Code for YOLT is available on GitHub at https://github.com/CosmiQ/yolt
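The key points say YOLT evaluates images of any size, but the metadata used here does not describe how. One common way to do this is to slice a large image into overlapping fixed-size windows and run a detector on each window. The sketch below illustrates that general idea only; the function names, the 416-pixel window, and the 15% overlap are illustrative assumptions, not values confirmed by this page.

```python
def tile_origins(length, window, overlap):
    """Start offsets of windows of size `window` that cover [0, length)."""
    if length <= window:
        return [0]
    # Stride between consecutive windows; overlap is a fraction of the window.
    stride = max(1, int(window * (1 - overlap)))
    origins = list(range(0, length - window, stride))
    # Add a final window flush with the far edge so no pixels are missed.
    origins.append(length - window)
    return origins


def tile_image(width, height, window=416, overlap=0.15):
    """Return (x, y, w, h) windows covering a width x height image.

    window and overlap are hypothetical defaults chosen for illustration.
    """
    w = min(window, width)
    h = min(window, height)
    return [
        (x, y, w, h)
        for y in tile_origins(height, window, overlap)
        for x in tile_origins(width, window, overlap)
    ]


tiles = tile_image(1000, 500)
print(len(tiles), tiles[0], tiles[-1])
```

Overlapping windows mean an object cut by one window edge appears whole in a neighboring window; detections from all windows would then need to be merged, for example with non-maximum suppression.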

Authors: Adam Van Etten

8 pages, 14 figures, 3 tables

Abstract: Detection of small objects in large swaths of imagery is one of the primary problems in satellite imagery analytics. While object detection in ground-based imagery has benefited from research into new deep learning approaches, transitioning such technology to overhead imagery is nontrivial. Among the challenges is the sheer number of pixels and geographic extent per image: a single DigitalGlobe satellite image encompasses >64 km² and over 250 million pixels. Another challenge is that objects of interest are minuscule (often only ~10 pixels in extent), which complicates traditional computer vision techniques. To address these issues, we propose a pipeline (You Only Look Twice, or YOLT) that evaluates satellite images of arbitrary size at a rate of >0.5 km²/s. The proposed approach can rapidly detect objects of vastly different scales with relatively little training data over multiple sensors. We evaluate large test images at native resolution, and yield scores of F1 > 0.8 for vehicle localization. We further explore resolution and object size requirements by systematically testing the pipeline at decreasing resolution, and conclude that objects only ~5 pixels in size can still be localized with high confidence. Code is available at https://github.com/CosmiQ/yolt.
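The figures in the abstract can be combined into a rough back-of-the-envelope estimate. The sketch below treats the stated lower bounds (64 km², 250 million pixels, 0.5 km²/s) as if they were exact values, which is an assumption made only for illustration; the variable names are hypothetical.

```python
import math

AREA_KM2 = 64.0          # image footprint from the abstract (a lower bound)
PIXELS = 250e6           # pixel count from the abstract (a lower bound)
RATE_KM2_PER_S = 0.5     # stated processing rate (a lower bound)

# Ground area per pixel in square metres, then the side length of one pixel.
area_per_pixel_m2 = AREA_KM2 * 1e6 / PIXELS
pixel_size_m = math.sqrt(area_per_pixel_m2)

# Approximate physical extent of a ~10 pixel and a ~5 pixel object.
extent_10px_m = 10 * pixel_size_m
extent_5px_m = 5 * pixel_size_m

# Upper bound on the time to process one such image at the stated rate.
seconds_per_image = AREA_KM2 / RATE_KM2_PER_S

print(f"pixel size: {pixel_size_m:.2f} m")
print(f"10 px object: {extent_10px_m:.1f} m, 5 px object: {extent_5px_m:.1f} m")
print(f"time per image: {seconds_per_image:.0f} s")
```

Under these assumptions each pixel is roughly half a metre on a side, so a ~10 pixel object is about 5 m across (roughly car-sized), and a full image takes at most about two minutes to evaluate.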

Submitted to arXiv on 24 May 2018

Results of the summarizing process for the arXiv paper: 1805.09512v1

This paper's license does not allow us to build upon its content, so the summary below was generated from the paper's metadata rather than the full article.

Detecting small objects in large swaths of imagery is a central problem in satellite imagery analytics. While ground-based object detection has advanced with deep learning approaches, applying these techniques to overhead imagery is not straightforward. One reason is the sheer number of pixels and geographic extent per image: a single DigitalGlobe satellite image covers more than 64 km² and contains over 250 million pixels. Another is that the objects of interest are often minuscule, spanning only around 10 pixels, which complicates traditional computer vision techniques.

To address these challenges, the author proposes a pipeline called You Only Look Twice (YOLT), which evaluates satellite images of arbitrary size at a rate exceeding 0.5 km²/s. The pipeline can rapidly detect objects of vastly different scales using relatively little training data across multiple sensors. Evaluated on large test images at native resolution, YOLT achieves an F1 score greater than 0.8 for vehicle localization. The author also explores resolution and object size requirements by systematically testing the pipeline at decreasing resolution, and concludes that objects only ~5 pixels in size can still be localized with high confidence.

In short, YOLT offers a way to efficiently detect small objects in satellite imagery using deep learning. Its ability to handle large swaths of imagery and to detect objects at very different scales makes it relevant to satellite imagery analytics applications. The code is available at https://github.com/CosmiQ/yolt.
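For readers unfamiliar with the metric, the F1 score quoted above is the harmonic mean of precision and recall. The sketch below shows the standard formula; the detection counts in the example are hypothetical and are not results from the paper.

```python
def f1_score(true_pos, false_pos, false_neg):
    """Harmonic mean of precision and recall from detection counts."""
    detected = true_pos + false_pos      # everything the detector reported
    actual = true_pos + false_neg        # everything that was really there
    precision = true_pos / detected if detected else 0.0
    recall = true_pos / actual if actual else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Hypothetical counts: 85 correct detections, 15 false alarms, 20 misses.
example = f1_score(85, 15, 20)
print(f"F1 = {example:.3f}")
```

Because F1 penalizes both false alarms and missed objects, a score above 0.8 requires precision and recall to both be reasonably high; neither alone is enough.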
Created on 23 Dec. 2023
