Developing a Compressed Object Detection Model based on YOLOv4 for Deployment on Embedded GPU Platform of Autonomous System

AI-generated keywords: YOffleNet Autonomous System KITTI Dataset Embedded GPU System Object Detection

AI-generated Key Points

YOffleNet is a new object detection model designed for real-time and safe driving applications on autonomous systems.
Existing CNN-based models are accurate but require high-performance GPUs, making them unsuitable for embedded systems with limited memory space.
Lightweight detection models have low accuracy for safe driving applications.
YOffleNet is based on the YOLOv4 backbone network architecture but replaces the high-calculation-load CSP DenseNet with lighter modules from ShuffleNet.
YOffleNet achieves a 4.7 times higher compression ratio compared to YOLOv4-s.
Experiments using the KITTI dataset show that YOffleNet achieves real-time performance with as fast as 46 FPS on an embedded GPU system (NVIDIA Jetson AGX Xavier).
Despite the high compression ratio, the accuracy of YOffleNet is only slightly reduced to 85.8% mAP, which is just 2.6% lower than YOLOv4-s.
YOffleNet offers a promising solution for overcoming memory limitations in embedded systems without compromising performance or safety requirements in autonomous vehicles.

Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Issac Sim, Ju-Hyung Lim, Young-Wan Jang, JiHwan You, SeonTaek Oh, Young-Keun Kim

arXiv: 2108.00392v1 - DOI (cs.CV)

in Chinese language

License: CC BY 4.0

Abstract: Latest CNN-based object detection models are quite accurate but require a high-performance GPU to run in real-time. They still are heavy in terms of memory size and speed for an embedded system with limited memory space. Since the object detection for autonomous system is run on an embedded processor, it is preferable to compress the detection network as light as possible while preserving the detection accuracy. There are several popular lightweight detection models but their accuracy is too low for safe driving applications. Therefore, this paper proposes a new object detection model, referred as YOffleNet, which is compressed at a high ratio while minimizing the accuracy loss for real-time and safe driving application on an autonomous system. The backbone network architecture is based on YOLOv4, but we could compress the network greatly by replacing the high-calculation-load CSP DenseNet with the lighter modules of ShuffleNet. Experiments with KITTI dataset showed that the proposed YOffleNet is compressed by 4.7 times than the YOLOv4-s that could achieve as fast as 46 FPS on an embedded GPU system(NVIDIA Jetson AGX Xavier). Compared to the high compression ratio, the accuracy is reduced slightly to 85.8% mAP, that is only 2.6% lower than YOLOv4-s. Thus, the proposed network showed a high potential to be deployed on the embedded system of the autonomous system for the real-time and accurate object detection applications.

Submitted to arXiv on 01 Aug. 2021

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2108.00392v1

Comprehensive Summary
Key points
Layman's Summary
Blog article

This paper presents a new object detection model called YOffleNet, which is designed to be compressed at a high ratio while minimizing accuracy loss for real-time and safe driving applications on an autonomous system. The existing CNN-based object detection models are accurate but require high-performance GPUs, making them unsuitable for embedded systems with limited memory space. While there are lightweight detection models available, their accuracy is too low for safe driving applications. To address these challenges, the proposed YOffleNet model is based on the YOLOv4 backbone network architecture. However, instead of using the high-calculation-load CSP DenseNet, the model replaces it with lighter modules from ShuffleNet. This significant compression allows YOffleNet to achieve a 4.7 times higher compression ratio compared to YOLOv4-s. The experiments conducted using the KITTI dataset demonstrate that YOffleNet can achieve real-time performance with as fast as 46 frames per second (FPS) on an embedded GPU system (NVIDIA Jetson AGX Xavier). Despite the high compression ratio, the accuracy of YOffleNet is only slightly reduced to 85.8% mean average precision (mAP), which is just 2.6% lower than YOLOv4-s. Overall, this study highlights the potential of deploying the proposed YOffleNet network on embedded systems in autonomous vehicles for real-time and accurate object detection applications. By compressing the network while preserving accuracy, this model offers a promising solution for overcoming memory limitations in embedded systems without compromising performance or safety requirements.

- YOffleNet is a new object detection model designed for real-time and safe driving applications on autonomous systems.
- Existing CNN-based models are accurate but require high-performance GPUs, making them unsuitable for embedded systems with limited memory space.
- Lightweight detection models have low accuracy for safe driving applications.
- YOffleNet is based on the YOLOv4 backbone network architecture but replaces the high-calculation-load CSP DenseNet with lighter modules from ShuffleNet.
- YOffleNet achieves a 4.7 times higher compression ratio compared to YOLOv4-s.
- Experiments using the KITTI dataset show that YOffleNet achieves real-time performance with as fast as 46 FPS on an embedded GPU system (NVIDIA Jetson AGX Xavier).
- Despite the high compression ratio, the accuracy of YOffleNet is only slightly reduced to 85.8% mAP, which is just 2.6% lower than YOLOv4-s.
- YOffleNet offers a promising solution for overcoming memory limitations in embedded systems without compromising performance or safety requirements in autonomous vehicles.

YOffleNet is a new way to detect objects for self-driving cars. Other models are accurate but need powerful computers, which won't work in small cars. Smaller models aren't as good at detecting things for safe driving. YOffleNet is like another model called YOLOv4, but it uses lighter parts to make it faster and smaller. YOffleNet is 4.7 times smaller than YOLOv4-s. Tests showed that YOffleNet can work in real-time on a small computer with 46 frames per second. Even though it's smaller, YOffleNet is still accurate with 85.8% mAP, just a little lower than YOLOv4-s. This makes YOffleNet a good choice for small self-driving cars that need to save memory but still be safe." Definitions- Object detection: The ability of a computer system to identify and locate objects in an image or video. - Autonomous systems: Systems that can operate independently without human control. - GPU: A Graphics Processing Unit, which is a specialized electronic circuit designed to quickly process and render images. - Embedded systems: Computer systems designed for specific tasks or functions within larger devices or machines. - Compression ratio: The measure of how much data can be reduced in size without losing important information. - FPS: Frames per second, which measures how many images or frames can be displayed or processed in one second. - mAP (mean Average

YOffleNet: A Lightweight Object Detection Model for Autonomous Driving Applications

Autonomous vehicles are becoming increasingly popular, with their ability to navigate roads and detect objects autonomously. However, the existing CNN-based object detection models used in such applications require high-performance GPUs, making them unsuitable for embedded systems with limited memory space. To address this challenge, a new lightweight object detection model called YOffleNet has been proposed by researchers from the University of Science and Technology of China. This model is designed to be compressed at a high ratio while minimizing accuracy loss for real-time and safe driving applications on an embedded system.

YOLOv4 Backbone Network Architecture

The proposed YOffleNet model is based on the YOLOv4 backbone network architecture. The original YOLOv4 uses CSP DenseNet as its main feature extractor module which requires heavy calculations and thus consumes large amounts of memory space in an embedded system. To reduce this load, the researchers replaced it with lighter modules from ShuffleNet instead. This significant compression allows YOffleNet to achieve a 4.7 times higher compression ratio compared to YOLOv4-s without sacrificing accuracy too much - only 2.6% lower than that of the original network (85.8% mAP).

Real-Time Performance With 46 FPS

To evaluate the performance of their proposed model, experiments were conducted using the KITTI dataset which contains images taken from autonomous vehicles in different scenarios such as urban areas or highways during day or night time conditions etc.. The results showed that despite its high compression ratio, YOffleNet can still achieve real-time performance with as fast as 46 frames per second (FPS) on an embedded GPU system (NVIDIA Jetson AGX Xavier).

Conclusion

Overall, this study highlights the potential of deploying the proposed lightweight yet accurate object detection model -YOffleNet - on embedded systems in autonomous vehicles for real-time and safe driving applications without compromising performance or safety requirements due to memory limitations . By compressing existing networks while preserving accuracy levels close to those achieved by non-compressed models , this research offers promising solutions for overcoming these challenges faced by developers when working with limited resources .

Created on 26 Dec. 2023

Assess the quality of the AI-generated content by voting

Score: 0

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

Similar papers summarized with our AI tools

61.1%

A Comprehensive Review of YOLO: From YOLOv1 and Beyond

cs.CV

55.5%

CSL-YOLO: A New Lightweight Object Detection System for Edge Computing

cs.CV

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.