Abstract: Remote sensing images are captured from high altitude looking down; they contain complex spatial scenes and many types of targets. Object detection on large-scale remote sensing images therefore suffers from small target sizes and dense target distributions. This paper proposes an improved remote sensing image detection model based on You Only Look Once version 7 (YOLOv7). First, a small-scale detection layer is added so that detection boxes are regenerated at a finer scale, improving the network's ability to recognize small targets. Next, Bottleneck Transformers are fused into the backbone to exploit a combined convolutional neural network (CNN) + Transformer architecture and strengthen feature extraction. The convolutional block attention module (CBAM) is then added to the head to further improve detection of small-scale targets. Finally, the non-maximum suppression (NMS) step of YOLOv7 is replaced with distance intersection over union non-maximum suppression (DIoU-NMS) to improve the detection of overlapping targets. Tested on the NWPU-VHR10 and DOTA1.0 datasets, the improved model raises the detection rate of small targets in remote sensing images, effectively handles highly overlapping targets, and improves accuracy by 6.3% and 4.2%, respectively, over the standard YOLOv7.
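To make the last modification concrete, below is a minimal sketch of DIoU-NMS (not the authors' implementation; the box layout, function name, and threshold value are assumptions for illustration). A candidate box is suppressed only when its IoU with a higher-scoring box, minus a normalized center-distance penalty, exceeds the threshold, so overlapping boxes whose centers are far apart can both be kept.

```python
# Hypothetical sketch of DIoU-NMS, assuming boxes as (x1, y1, x2, y2) rows.
import numpy as np

def diou_nms(boxes: np.ndarray, scores: np.ndarray, thresh: float = 0.5) -> list:
    order = scores.argsort()[::-1]          # process boxes from highest score down
    keep = []
    while order.size > 0:
        i = order[0]
        keep.append(int(i))
        rest = order[1:]
        # intersection-over-union with the remaining boxes
        x1 = np.maximum(boxes[i, 0], boxes[rest, 0])
        y1 = np.maximum(boxes[i, 1], boxes[rest, 1])
        x2 = np.minimum(boxes[i, 2], boxes[rest, 2])
        y2 = np.minimum(boxes[i, 3], boxes[rest, 3])
        inter = np.clip(x2 - x1, 0, None) * np.clip(y2 - y1, 0, None)
        area_i = (boxes[i, 2] - boxes[i, 0]) * (boxes[i, 3] - boxes[i, 1])
        area_r = (boxes[rest, 2] - boxes[rest, 0]) * (boxes[rest, 3] - boxes[rest, 1])
        iou = inter / (area_i + area_r - inter + 1e-9)
        # squared distance between box centers
        cx_i, cy_i = (boxes[i, 0] + boxes[i, 2]) / 2, (boxes[i, 1] + boxes[i, 3]) / 2
        cx_r, cy_r = (boxes[rest, 0] + boxes[rest, 2]) / 2, (boxes[rest, 1] + boxes[rest, 3]) / 2
        center_dist = (cx_i - cx_r) ** 2 + (cy_i - cy_r) ** 2
        # squared diagonal of the smallest box enclosing both boxes
        ex1 = np.minimum(boxes[i, 0], boxes[rest, 0])
        ey1 = np.minimum(boxes[i, 1], boxes[rest, 1])
        ex2 = np.maximum(boxes[i, 2], boxes[rest, 2])
        ey2 = np.maximum(boxes[i, 3], boxes[rest, 3])
        diag = (ex2 - ex1) ** 2 + (ey2 - ey1) ** 2 + 1e-9
        diou = iou - center_dist / diag     # DIoU = IoU minus center-distance penalty
        order = rest[diou <= thresh]        # suppress only boxes whose DIoU exceeds the threshold
    return keep
```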