ZHU FU-ZHEN1, HAN HAO1, JIA HENG-FEI1, ZHU BING21.College of Electronic Engineering, Heilongjiang University, Harbin , 150080
CSTR:
Affiliation:

Heilongjiang University

Fund Project:

This work is supported in part by the National Natural Science Foundation China (61601174),in part by the Postdoctoral Research Foundation of Heilongjiang Province (LBH-Q17150),in part by the Science and Technology Innovative Research Team in Higher Educational Institutions of Heilongjiang Province (No. 2012TD007)

  • Article
  • | |
  • Metrics
  • |
  • Reference [20]
  • |
  • Related
  • |
  • Cited by
  • | |
  • Comments
    Abstract:

    The current infrared image pedestrian detectors have problems with high rates of false positives and false negatives. To solve these problems, we proposed an improved anchor-free FCOS object detection algorithm. Firstly, we introduced the channel attention module SE-Block in the FCOS backbone network, which was used to learn how to model the relative importance between different feature channels, and to achieve the weight recalibration of the features extracted from the convolution neural network, and improve the weight values that are more important for pedestrian target detection. Secondly, soft non-maximum suppression (soft-NMS) replaced the conventional non-maximum suppression NMS within the algorithm's post-processing phase, which was used to reduce the probability of missed detection for occluded pedestrians. The experimental results show that our improved FCOS algorithm improves the Average Precision(AP) value by 6.71% on the original dataset and 7.97% on the augmented KAIST pedestrian dataset compared with the original FCOS algorithm. Our improvements effectively meet the real-time requirements and there is no significant decrease in speed compared with the original FCOS algorithm, and decreased the false positives and false negatives for infrared image pedestrian detection.

    Reference
    [1] Park S, Choi D H, Kim J U, et al. Robust thermal infrared pedestrian detection by associating visible pedestrian knowledge[C]. ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2022: 4468-4472.
    [2] Guan D, Cao Y, Yang J, et al. Fusion of multispectral data through illumination-aware deep neural networks for pedestrian detection[J]. Information Fusion, 2019, 50: 148-157.
    [3] Manssor S A F, Sun S, Abdalmajed M, et al. Real-time human detection in thermal infrared imaging at night using enhanced Tiny-yolov3 network[J]. Journal of Real-Time Image Processing, 2022, 19(2): 261-274.
    [4] Lee W Y, Jovanov L, Philips W. Multi-view target transformation for pedestrian detection[C]. Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision. 2023: 90-99.
    [5] Ren S, He K, Girshick R, et al. Faster R-CNN: Towards real-time object detection with region proposal networks[J]. IEEE transactions on pattern analysis and machine intelligence, 2016, 39(6): 1137-1149.
    [6] Liu W, Anguelov D, Erhan D, et al. Ssd: Single shot multibox detector[C]. Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, October 11–14, 2016, Proceedings, Part I 14. Springer International Publishing, 2016: 21-37.
    [7] Redmon J, Farhadi A. YOLO9000: better, faster, stronger[C]. Proceedings of the IEEE conference on computer vision and pattern recognition. 2017: 7263-7271.
    [8] Zhao L, Li S. Object detection algorithm based on improved YOLOv3[J]. Electronics, 2020, 9(3): 537.
    [9] Terven J, Córdova-Esparza D M, Romero-González J A. A comprehensive review of yolo architectures in computer vision: From yolov1 to yolov8 and yolo-nas[J]. Machine Learning and Knowledge Extraction, 2023, 5(4): 1680-1716.
    [10] Tian, Z., Shen, C., Chen, H., He, T.. FCOS: Fully Convolutional One-Stage Object Detection[C]. 2019 IEEE/CVF International Conference on Computer Vision (ICCV), 2019: 9626-9635.
    [11] Zhang Y, Zhai B, Wang G, et al. Pedestrian Detection Method Based on Two-Stage Fusion of Visible Light Image and Thermal Infrared Image[J]. Electronics, 2023, 12(14): 3171.
    [12] Liu Z, Dai C, Li X. Pedestrian detection method in infrared image based on improved YOLOv7[C]. 2023 IEEE 3rd International Conference on Information Technology, Big Data and Artificial Intelligence (ICIBA). IEEE, 2023, 3: 946-954.
    [13] Zeng J, Xiong J, Fu X, et al. ReFPN-FCOS: One-stage object detection for feature learning and accurate localization[J]. IEEE Access, 2020, 8: 225052-225063.
    [14] Sun Z, Dai M, Leng X, et al. An anchor-free detection method for ship targets in high-resolution SAR images[J]. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2021, 14: 7799-7816.
    [15] Yao S, Zhu Q, Zhang T, et al. Infrared image small-target detection based on improved FCOS and spatio-temporal features[J]. Electronics, 2022, 11(6): 933.
    [16] Xuanhao Q, Min Z. A Review of Attention Mechanisms in Computer Vision[C]. 2023 8th International Conference on Image, Vision and Computing (ICIVC). IEEE, 2023: 577-583.
    [17] Guo M H, Xu T X, Liu J J, et al. Attention mechanisms in computer vision: A survey[J]. Computational visual media, 2022, 8(3): 331-368.
    [18] Hu J, Shen L, Sun G. Squeeze-and-excitation networks[C]. Proceedings of the IEEE conference on computer vision and pattern recognition. 2018: 7132-7141.
    [19] Chen F, Zhang L, Kang S, et al. Soft-NMS-enabled YOLOv5 with SIOU for small water surface floater detection in UAV-captured images[J]. Sustainability, 2023, 15(14): 10751.
    [20] Yang S, An W, Li S, et al. An improved FCOS method for ship detection in SAR images[J]. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2022, 15: 8910-8927.
    Related
    Cited by
    Comments
    Comments
    分享到微博
    Submit
Get Citation
Share
Article Metrics
  • Abstract:21
  • PDF: 0
  • HTML: 0
  • Cited by: 0
History
  • Received:July 25,2024
  • Revised:September 20,2024
  • Adopted:October 23,2024
Article QR Code