JLPEA, Vol. 13, Pages 40: Efficient GEMM Implementation for Vision-Based Object Detection in Autonomous Driving Applications

1 year ago 68

JLPEA, Vol. 13, Pages 40: Efficient GEMM Implementation for Vision-Based Object Detection in Autonomous Driving Applications

Journal of Low Power Electronics and Applications doi: 10.3390/jlpea13020040

Authors: Fatima Zahra Guerrouj Sergio Rodríguez Flórez Mohamed Abouzahir Abdelhafid El Ouardi Mustapha Ramzi

Convolutional Neural Networks (CNNs) have been incredibly effective for object detection tasks. YOLOv4 is a state-of-the-art object detection algorithm designed for embedded systems. It is based on YOLOv3 and has improved accuracy, speed, and robustness. However, deploying CNNs on embedded systems such as Field Programmable Gate Arrays (FPGAs) is difficult due to their limited resources. To address this issue, FPGA-based CNN architectures have been developed to improve the resource utilization of CNNs, resulting in improved accuracy and speed. This paper examines the use of General Matrix Multiplication Operations (GEMM) to accelerate the execution of YOLOv4 on embedded systems. It reviews the most recent GEMM implementations and evaluates their accuracy and robustness. It also discusses the challenges of deploying YOLOv4 on autonomous vehicle datasets. Finally, the paper presents a case study demonstrating the successful implementation of YOLOv4 on an Intel Arria 10 embedded system using GEMM.

Read Entire Article

JLPEA, Vol. 13, Pages 40: Efficient GEMM Implementation for Vision-Based Object Detection in Autonomous Driving Applications

Related

New Tardigrade Discovery Reveals Secrets of Radiation Resist...

Once thought a fantasy, effort to sequence DNA of millions o...

Ancient Mesopotamian clay seals offer clues to the origin of...