LPYOLO: Low Precision YOLO for Face Detection on FPGA

AI-generated keywords: Edge Computing

AI-generated Key Points

Significant advancements in edge computing devices and artificial intelligence applications
Edge computing involves moving decision-making processes and computations to edge devices
Field-Programmable Gate Arrays (FPGAs) are suitable for running Convolutional Neural Networks (CNNs)
Face detection on surveillance systems is a highly anticipated application in the security market
Researchers focused on redesigning and deploying the TinyYolov3 architecture for face detection
PYNQ-Z2 was selected as the target board, featuring a low-end Xilinx Zynq 7020 SoC
Redesigned TinyYolov3 model defined using various bit width precisions with the Brevitas library
Model trained using the WiderFace dataset in a quantized structure
On-chip memory of FPGA used to store all network parameters, last activation function modified
High degree of parallelism applied to logical resources of FPGA
Model converted into an HLS-based application using FINN framework and FINN-HLS library
CPU of SoC utilized with multithreading mechanism for preprocessing, postprocessing, and TCP/IP streaming operations
System achieved 2.4 Watts power consumption, 18 FPS throughput rate, and 0.757 mAP accuracy rate with a 4-bit precision model

Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Bestami Günay, Sefa Burak Okcu, Hasan Şakir Bilge

Proceedings of the 8th World Congress on Electrical Engineering and Computer Systems and Sciences (2022)

arXiv: 2207.10482v1 - DOI (cs.CV)

Accepted to MVML2022

License: CC BY 4.0

Abstract: In recent years, number of edge computing devices and artificial intelligence applications on them have advanced excessively. In edge computing, decision making processes and computations are moved from servers to edge devices. Hence, cheap and low power devices are required. FPGAs are very low power, inclined to do parallel operations and deeply suitable devices for running Convolutional Neural Networks (CNN) which are the fundamental unit of an artificial intelligence application. Face detection on surveillance systems is the most expected application on the security market. In this work, TinyYolov3 architecture is redesigned and deployed for face detection. It is a CNN based object detection method and developed for embedded systems. PYNQ-Z2 is selected as a target board which has low-end Xilinx Zynq 7020 System-on-Chip (SoC) on it. Redesigned TinyYolov3 model is defined in numerous bit width precisions with Brevitas library which brings fundamental CNN layers and activations in integer quantized form. Then, the model is trained in a quantized structure with WiderFace dataset. In order to decrease latency and power consumption, onchip memory of the FPGA is configured as a storage of whole network parameters and the last activation function is modified as rescaled HardTanh instead of Sigmoid. Also, high degree of parallelism is applied to logical resources of the FPGA. The model is converted to an HLS based application with using FINN framework and FINN-HLS library which includes the layer definitions in C++. Later, the model is synthesized and deployed. CPU of the SoC is employed with multithreading mechanism and responsible for preprocessing, postprocessing and TCP/IP streaming operations. Consequently, 2.4 Watt total board power consumption, 18 Frames-Per-Second (FPS) throughput and 0.757 mAP accuracy rate on Easy category of the WiderFace are achieved with 4 bits precision model.

Submitted to arXiv on 21 Jul. 2022

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2207.10482v1

Comprehensive Summary
Key points
Layman's Summary
Blog article

In recent years, there has been a significant advancement in edge computing devices and artificial intelligence applications. Edge computing involves moving decision-making processes and computations from servers to edge devices, requiring cheap and low-power devices. Field-Programmable Gate Arrays (FPGAs) are particularly suitable for running Convolutional Neural Networks (CNNs), which are the fundamental unit of an AI application, as they offer low power consumption and are capable of parallel operations. One highly anticipated application in the security market is face detection on surveillance systems. In this study, the researchers focused on redesigning and deploying the TinyYolov3 architecture for face detection. TinyYolov3 is a CNN-based object detection method specifically developed for embedded systems. The researchers selected the PYNQ-Z2 as their target board, which features a low-end Xilinx Zynq 7020 System-on-Chip (SoC). They defined the redesigned TinyYolov3 model in various bit width precisions using the Brevitas library, which allows for integer quantization of fundamental CNN layers and activations. The model was then trained using the WiderFace dataset in a quantized structure. To reduce latency and power consumption, the researchers configured the on-chip memory of the FPGA to store all network parameters. They also modified the last activation function to use rescaled HardTanh instead of Sigmoid. Additionally, they applied a high degree of parallelism to logical resources of the FPGA. The model was converted into an HLS-based application using the FINN framework and FINN-HLS library, which includes layer definitions in C++. After synthesis and deployment, the CPU of the SoC was utilized with a multithreading mechanism to handle preprocessing, postprocessing, and TCP/IP streaming operations. The results showed that with a 4-bit precision model, the system achieved a total board power consumption of 2.4 Watts, an 18 Frames-Per-Second (FPS) throughput rate, and a 0.757 mAP accuracy rate on Easy category images from WiderFace dataset. In conclusion, this study successfully redesigned and deployed TinyYolov3 architecture for face detection on PYNQ-Z2 FPGA board with efficient edge computing capabilities resulting in high performance with reduced latency and power consumption.

- Significant advancements in edge computing devices and artificial intelligence applications
- Edge computing involves moving decision-making processes and computations to edge devices
- Field-Programmable Gate Arrays (FPGAs) are suitable for running Convolutional Neural Networks (CNNs)
- Face detection on surveillance systems is a highly anticipated application in the security market
- Researchers focused on redesigning and deploying the TinyYolov3 architecture for face detection
- PYNQ-Z2 was selected as the target board, featuring a low-end Xilinx Zynq 7020 SoC
- Redesigned TinyYolov3 model defined using various bit width precisions with the Brevitas library
- Model trained using the WiderFace dataset in a quantized structure
- On-chip memory of FPGA used to store all network parameters, last activation function modified
- High degree of parallelism applied to logical resources of FPGA
- Model converted into an HLS-based application using FINN framework and FINN-HLS library
- CPU of SoC utilized with multithreading mechanism for preprocessing, postprocessing, and TCP/IP streaming operations
- System achieved 2.4 Watts power consumption, 18 FPS throughput rate, and 0.757 mAP accuracy rate with a 4-bit precision model

Significant advancements in edge computing devices and artificial intelligence applications: There have been important improvements in devices that can do calculations and make decisions on their own, as well as in computer programs that can think like humans. Edge computing involves moving decision-making processes and computations to edge devices: Instead of sending information to a central computer for processing, the calculations are done on the device itself. Field-Programmable Gate Arrays (FPGAs) are suitable for running Convolutional Neural Networks (CNNs): FPGAs are special chips that can be programmed to perform specific tasks. They work well for running computer programs that can recognize things like faces. Face detection on surveillance systems is a highly anticipated application in the security market: People are excited about using computers to automatically find and identify faces in video footage, especially for security purposes. Researchers focused on redesigning and deploying the TinyYolov3 architecture for face detection: Scientists worked on improving a specific type of computer program called TinyYolov3 so it can better detect faces.

The Advancement of Edge Computing and Artificial Intelligence Applications

In recent years, the advancement of edge computing devices and artificial intelligence (AI) applications has been significant. Edge computing involves moving decision-making processes and computations from servers to edge devices, requiring cheap and low-power devices. Field-Programmable Gate Arrays (FPGAs) are particularly suitable for running Convolutional Neural Networks (CNNs), which are the fundamental unit of an AI application, as they offer low power consumption and are capable of parallel operations. One highly anticipated application in the security market is face detection on surveillance systems.

Redesigning TinyYolov3 Architecture for Face Detection

In this study, researchers focused on redesigning and deploying the TinyYolov3 architecture for face detection. TinyYolov3 is a CNN-based object detection method specifically developed for embedded systems. The researchers selected the PYNQ-Z2 as their target board, which features a low-end Xilinx Zynq 7020 System-on-Chip (SoC). They defined the redesigned TinyYolov3 model in various bit width precisions using the Brevitas library, which allows for integer quantization of fundamental CNN layers and activations. The model was then trained using the WiderFace dataset in a quantized structure.

Improving Efficiency with On-Chip Memory Configuration

To reduce latency and power consumption, the researchers configured the on-chip memory of FPGA to store all network parameters. They also modified last activation function to use rescaled HardTanh instead of Sigmoid. Additionally, they applied high degree of parallelism to logical resources of FPGA by converting model into HLS based application using FINN framework & FINN HLS library including layer definitions in C++ language after synthesis & deployment CPU was utilized with multithreading mechanism to handle preprocessing post processing & TCP/IP streaming operations .

Results Show High Performance with Reduced Latency & Power Consumption

The results showed that with 4 bit precision model system achieved total board power consumption 2:4 Watts 18 Frames Per Second throughput rate 0:757 mAP accuracy rate on Easy category images from WiderFace dataset . In conclusion this study successfully redesigned & deployed TinyYolov3 architecture for face detection on PYNQ Z2 FPGA board with efficient edge computing capabilities resulting high performance with reduced latency & power consumption .

Created on 26 Dec. 2023

Assess the quality of the AI-generated content by voting

Score: 0

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.