| Citation: | GUO Lin, QIN Shiyin. Deep learning and optimization algorithm for high efficient searching and detection of aircraft targets in remote sensing images[J]. Journal of Beijing University of Aeronautics and Astronautics, 2019, 45(1): 159-173. doi: 10.13700/j.bh.1001-5965.2018.0239(in Chinese) |
In order to achieve high-performance detection and accurate positioning of aircraft targets in large-scale remote sensing images, in this paper, a kind of efficient aircraft target detection algorithm based on synthetic integration of searching and detection is presented. First, through the end-to-end deep neural networks (DNN), the efficient and accurate pixel-level segmentation of apron and runway area is achieved so that the searching range of aircraft targets is greatly narrowed and the probability of false alarm is also reduced effectively and the goal of high speed searching of aircraft targets candidate detection areas is achieved accordingly. In view of the segmented areas of apron and runway, the strategy of transfer reinforcement learning is employed to pre-trained YOLO networks with supervised signals of positive datasets by manual labelling. In this way, pre-trained networks can make up the deficiency of capacity of manual data sets, and the advantage of real-time property of YOLO networks can also be utilized to deal with the classification and locations of aircraft targets so as to achieve a satisfied solution of regression problems and promote the efficiency of detection significantly. It is obvious that the apron and runway segmentation with DNN networks can play important role in getting performance superiority of high precision and efficiency. Meanwhile, YOLO networks based on transfer reinforcement learning not only possess the characteristics of high efficiency, but also maintain the precision of detection at a high level. A series of comprehensive experiments and comparative analyses verify the effectiveness and good performance of the proposed searching and detection algorithm of aircraft targets with DNN cascade combination and synthetic integration.
| [1] |
HAN Z X, ZHANG H, ZHANG J F, et al.Fast aircraft detection based on region locating network in large-scale remote sensing images[C]//24th IEEE International Conference on Image Processing.Piscataway, NJ: IEEE Press, 2017: 2294-2298.
|
| [2] |
GIRSHICK R, DONAHUE J, DARRELL T, et al.Rich feature hierarchies for accurate object detection and semantic segmentation[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.Piscataway, NJ: IEEE Press, 2014: 580-587.
|
| [3] |
REN S, HE K, GIRSHICK R, et al.Faster R-CNN:Towards real-time object detection with region proposal networks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(6):1137-1149. doi: 10.1109/TPAMI.2016.2577031
|
| [4] |
赵雪梅, 李玉, 赵泉华.基于隐马尔可夫高斯随机场模型的模糊聚类高分辨率遥感影像分割算法[J].电子学报, 2016, 44(3):679-686. doi: 10.3969/j.issn.0372-2112.2016.03.028
ZHAO X M, LI Y, ZHAO Q H.Hidden Markov Gaussian random field based fuzzy clustering algorithm for high-resolution remote sensing image segmentation[J]. Acta Electronica Sinica, 2016, 44(3):679-686(in Chinese). doi: 10.3969/j.issn.0372-2112.2016.03.028
|
| [5] |
陈荣元, 徐雪松, 李广琼, 等.自适应特征加权的Gibbs随机场影像分割方法[J].电子学报, 2016, 44(10):2351-2356. doi: 10.3969/j.issn.0372-2112.2016.10.010
CHEN R Y, XU X S, LI G Q, et al.Image segmentation by combining adaptively weighted features with Gibbs random field[J]. Acta Electronica Sinica, 2016, 44(10):2351-2356(in Chinese). doi: 10.3969/j.issn.0372-2112.2016.10.010
|
| [6] |
BUDAK U, HALICI U, SENGUR A, et al.Efficient airport detection using line segment detector and Fisher vector representation[J]. IEEE Geoscience & Remote Sensing Letters, 2016, 13(8):1079-1083.
|
| [7] |
WANG Y, PAN L.Automatic airport recognition based on saliency detection and semantic information[J]. ISPRS International Journal of Geo-Information, 2016, 5(7):115-118. doi: 10.3390/ijgi5070115
|
| [8] |
KRIZHEVSKY A, SUTSKEVER I, HINTON G E.ImageNet classification with deep convolutional neural networks[C]//International Conference on Neural Information Processing Systems, 2012: 1097-1105.
|
| [9] |
LONG J, SHELHAMER E, DARRELL T.Fully convolutional networks for semantic segmentation[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(4):640-651. doi: 10.1109/TPAMI.2016.2572683
|
| [10] |
BADRINARAYANAN V, KENDALL A, CIPOLLA R.SegNet:A deep convolutional encoder-decoder architecture for image segmentation[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(12):2481-2495. doi: 10.1109/TPAMI.2016.2644615
|
| [11] |
RONNEBERGER O, FISCHER P, BROX T.U-Net: Convolutional networks for biomedical image segmentation[C]//International Conference on Medical Image Computing and Computer-assisted Intervention.Berlin: Springer, 2015: 234-241.
|
| [12] |
DAN C C, GIUSTI A, GAMBARDELLA L M, et al.Deep neural networks segment neuronal membranes in electron microscopy images[J]. Advances in Neural Information Processing Systems, 2012, 25:2852-2860.
|
| [13] |
CHEN L, PAPANDREOU G, KOKKINOS I, et al.Semantic image segmentation with deep convolutional nets and fully connected CRFs[J]. Computer Science, 2014(4):357-361.
|
| [14] |
ZHAO H, SHI J, QI X, et al.Pyramid scene parsing network[C]//IEEE Conference on Computer Vision and Pattern Recognition.Piscataway, NJ: IEEE Press, 2017: 6230-6239.
|
| [15] |
CHEN L C, PAPANDREOU G, KOKKINOS I, et al.DeepLab:Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFS[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2018, 40(4):834-848. doi: 10.1109/TPAMI.2017.2699184
|
| [16] |
HE K, GKIOXARI G, DOLLAR P, et al.Mask R-CNN[C]//IEEE International Conference on Computer Vision.Piscataway, NJ: IEEE Press, 2017: 2980-2988.
|
| [17] |
GIRSHICK R.Fast R-CNN[C]//IEEE International Conference on Computer Vision.Piscataway, NJ: IEEE Press, 2015: 1440-1448.
|
| [18] |
HE K, ZHANG X, REN S, et al.Spatial pyramid pooling in deep convolutional networks for visual recognition[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2015, 37(9):1904-1916. doi: 10.1109/TPAMI.2015.2389824
|
| [19] |
REDMON J, DIVVALA S, GIRSHICK R, et al.You only look once: Unified, real-time object detection[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.Piscataway, NJ: IEEE Press, 2016: 779-788.
|
| [20] |
TORRALBA A, RUSSELL B C, YUEN J.LabelMe:Online image annotation and applications[J]. Proceedings of the IEEE, 2010, 98(8):1467-1484. doi: 10.1109/JPROC.2010.2050290
|
| [21] |
HE K, ZHANG X, REN S, et al.Deep residual learning for image recognition[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.Piscataway, NJ: IEEE Press, 2016: 770-778.
|
| [22] |
HUANG G, LIU Z, WEINBERGER K Q, et al.Densely connected convolutional networks[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.Piscataway, NJ: IEEE Press, 2017: 3-8.
|
| [23] |
SU W, WANG Z.Widening residual skipped network for semantic segmentation[J]. IET Image Processing, 2017, 11(10):880-887. doi: 10.1049/iet-ipr.2017.0070
|
| [24] |
KINGMA D P, BA J.Adam: A method for stochastic optimization[EB/OL]. (2014-12-22).
|