Small object detection on highways via balance feature fusion and task-specific encoding network
CSTR:
Author:
Affiliation:

1. College of Computer Science and Technology, Zhejiang University of Technology, Hangzhou 310023, China;2. Hangzhou Xsuan Technology Co., Ltd., Hangzhou 310051, China;3. Quzhou University, Quzhou 324000, China;4. Huzhou Institute of Zhejiang University, Huzhou 313002, China[* This work has been partially supported by the National Natural Science Foundation of China (Nos.61906168 and 62202337), the Zhejiang Provincial Natural Science Foundation of China (Nos.LY23F020023 and LZ23F020001), the Construction of Hubei Provincial Key Laboratory for Intelligent Visual Monitoring of Hydropower Projects (No.2022SDSJ01), and the Hangzhou AI Major Scientific and Technological Innovation Project (No.2022AIZD0061).

  • Article
  • | |
  • Metrics
  • |
  • Reference [11]
  • |
  • Related [20]
  • | | |
  • Comments
    Abstract:

    Detecting small objects on highways is a novel research topic. Due to the small pixel of objects on highways, traditional detectors have difficulty in capturing discriminative features. Additionally, the imbalance of feature fusion methods and the inconsistency between classification and regression tasks lead to poor detection performance on highways. In this paper, we propose a balance feature fusion and task-specific encoding network to address these issues. Specifically, we design a balance feature pyramid network (FPN) to integrate the importance of each layer of feature maps and construct long-range dependencies among them, thereby making the features more discriminative. In addition, we present task-specific decoupled head, which utilizes task-specific encoding to moderate the imbalance between the classification and regression tasks. As demonstrated by extensive experiments and visualizations, our method obtains outstanding detection performance on small object detection on highways (HSOD) dataset and AI-TOD dataset.

    Reference
    [1] WANG C Y, BOCHKOVSKIY A, LIAO H M. Yolov7:trainable bag-of-freebies sets new state-of-the-art for real-time object detectors[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, June 17-24, 2023, Vancouver, BC, Canada. New York:IEEE, 2023:7464-7475.
    [2] CARION N, MASSA F, SYNNAEVE G, et al. End-to-end object detection with transformers[C]//Proceedings of the European Conference on Computer Vision (ECCV), August 23-28, 2020, Glasgow, UK. Berlin, Heidelberg:Springer, 2020:213-229.
    [3] CHAN S X, LIU P, ZHANG Z. Webox:locating small objects from weak edges[J]. Optoelectronics letters, 2021, 17(6):349-353.
    [4] KIM S, KOOK H, SUN J, et al. Parallel feature pyramid network for object detection[C]//Proceedings of the European Conference on Computer Vision (ECCV), September 8-14, 2018, Munich, Germany. Berlin, Heidelberg:Springer, 2018:239-256.
    [5] LIU S, QI L, QIN H, et al. Path aggregation network for instance segmentation[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, June 18-22, 2018, Salt Lake City, UT, USA. New York:IEEE, 2018:8759-8768.
    [6] WU Y, CHEN Y P, YUAN L, et al. Rethinking classification and localization for object detection[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, June 13-19, 2020, Seattle, WA, USA. New York:IEEE, 2020:10183-10192.
    [7] GE Z, LIU S T, WANG F, et al. YOLOX:exceeding YOLO series in 2021[EB/OL]. (2021-07-18) [2023-06-24]. https://arxiv.org/abs/2107.08430.
    [8] BOCHKOVSKIY A, WANG C, LIAO H M. Yolov4:optimal speed and accuracy of object detection[J]. (2020-04-23) [2023-06-24]. https://arxiv.org/abs/2004.
    10934.
    [9] ZHANG Y F, REN W Q, ZHANG Z, et al. Focal and efficient IOU loss for accurate bounding box regression[J]. Neurocomputing, 2022, 506:146-157.
    [10] WANG J W, YANG W, GUO H, et al. Tiny object detection in aerial images[C]//2020 25th International Conference on Pattern Recognition (ICPR), January 10-15, 2021, Milan, Italy. New York:IEEE, 2021:3791-3798.
    Cited by
    Comments
    Comments
    分享到微博
    Submit
Get Citation

YU Minming, CHAN Sixian, ZHOU Xiaolong, LAI Zhounian. Small object detection on highways via balance feature fusion and task-specific encoding network[J]. Optoelectronics Letters,2024,20(7):424-429

Copy
Share
Article Metrics
  • Abstract:174
  • PDF: 11
  • HTML: 0
  • Cited by: 0
History
  • Received:August 29,2023
  • Revised:November 30,2023
  • Online: May 28,2024
Article QR Code