TY - GEN
T1 - Improved YOLOv5 Algorithm for Intensive Pedestrian Detection
AU - Zhao, Siqi
AU - Tian, Ye
AU - Hao, Ning
AU - Zhou, Jianbo
AU - Zhang, Xian
PY - 2024
Y1 - 2024
N2 - Pedestrian detection is an important basic research topic in the field of target detection, which can provide effective information support for public places with large flow density such as shopping malls and scenic spots as well as intelligent security fields. To solve the problem of tracking target loss and low detection and recognition rate caused by pedestrian occlusion or portrait over-lap in pedestrian target detection in a crowded area or scene, an improved YOLOv5 algorithm integrating the attention mechanism was proposed. The relationship between feature channels and spatial information of the feature map was deeply mined by introducing an attention mechanism to further enhance feature extraction of pedestrian target visual area. To improve the feature fusion ability, Bidirectional Feature Pyramid Network (BiFPN) feature pyramid was used to enrich the cross-scale connection mode, preserve the shallow layer characteristics, and improve the detection accuracy. To improve the convergence ability of the model, EIoU was used to replace the original loss function of YOLOv5 to optimize the regression prediction of the anchor, which reduces the training difficulty of the network and improves the detection rate under occlusion. Compared with the general YOLOv5 algorithm, the improved algorithm proposed in this paper has higher accuracy and a lower missing rate in pedestrian detection in crowded areas or scenes, while the real-time performance of the original algorithm is still maintained.
AB - Pedestrian detection is an important basic research topic in the field of target detection, which can provide effective information support for public places with large flow density such as shopping malls and scenic spots as well as intelligent security fields. To solve the problem of tracking target loss and low detection and recognition rate caused by pedestrian occlusion or portrait over-lap in pedestrian target detection in a crowded area or scene, an improved YOLOv5 algorithm integrating the attention mechanism was proposed. The relationship between feature channels and spatial information of the feature map was deeply mined by introducing an attention mechanism to further enhance feature extraction of pedestrian target visual area. To improve the feature fusion ability, Bidirectional Feature Pyramid Network (BiFPN) feature pyramid was used to enrich the cross-scale connection mode, preserve the shallow layer characteristics, and improve the detection accuracy. To improve the convergence ability of the model, EIoU was used to replace the original loss function of YOLOv5 to optimize the regression prediction of the anchor, which reduces the training difficulty of the network and improves the detection rate under occlusion. Compared with the general YOLOv5 algorithm, the improved algorithm proposed in this paper has higher accuracy and a lower missing rate in pedestrian detection in crowded areas or scenes, while the real-time performance of the original algorithm is still maintained.
KW - Attention mechanism
KW - Crowded pedestrian scene
KW - Pedestrian detection
KW - YOLOv5
UR - http://www.scopus.com/inward/record.url?scp=85184126643&partnerID=8YFLogxK
U2 - 10.1007/978-3-031-44947-5_47
DO - 10.1007/978-3-031-44947-5_47
M3 - Conference paper
AN - SCOPUS:85184126643
SN - 9783031449468
VL - 146
T3 - Mechanisms and Machine Science
SP - 587
EP - 594
BT - Computational and Experimental Simulations in Engineering - Proceedings of ICCES 2023—Volume 3
A2 - Li, Shaofan
PB - Springer Science and Business Media B.V.
T2 - 29th International Conference on Computational and Experimental Engineering and Sciences, ICCES 2023
Y2 - 26 May 2023 through 29 May 2023
ER -