Volume 51 Issue 3
Mar.  2025
Turn off MathJax
Article Contents
LIU C J,QIAO Z,YAN H W,et al. Semantic segmentation network of remote sensing images based on dual path supervision[J]. Journal of Beijing University of Aeronautics and Astronautics,2025,51(3):732-741 (in Chinese) doi: 10.13700/j.bh.1001-5965.2023.0155
Citation: LIU C J,QIAO Z,YAN H W,et al. Semantic segmentation network of remote sensing images based on dual path supervision[J]. Journal of Beijing University of Aeronautics and Astronautics,2025,51(3):732-741 (in Chinese) doi: 10.13700/j.bh.1001-5965.2023.0155

Semantic segmentation network of remote sensing images based on dual path supervision

doi: 10.13700/j.bh.1001-5965.2023.0155
Funds:

National Key Research and Development Program of China (2022YFB3903604); Natural Science Foundation of Gansu Province (21JR7RA289); Key Research and Development Projects of Gansu Province (20YF8GA035) 

More Information
  • Corresponding author: E-mail:43452740@qq.com
  • Received Date: 31 Mar 2023
  • Accepted Date: 26 May 2023
  • Available Online: 10 Jul 2023
  • Publish Date: 30 Jun 2023
  • A dual path supervision and attention filtering network was proposed to solve the problem of blurry boundary classification of target objects in semantic segmentation tasks of remote sensing images. A supervised boundary extraction module was introduced to increase the channel of boundary information, improve the weight of boundary information in semantic segmentation, and enhance attention to the boundary pixels of the target object. The attention filtering module was introduced to filter out spatial details in shallow networks and abstract semantic information in deep networks through attention maps, discarding redundant information in the network to prevent overfitting. The mean intersection over union of the dual path supervision and attention filtering network on the Potsdam dataset and the Jiage dataset was 85.44% and 86.07% respectively, which increased by 1.24%, 1.28% and 1.54%, 1.27% compared with the suboptimal network MagNet and SAPNet. Experimental results show that the proposed network can more accurately segment the boundaries of target objects.

     

  • loading
  • [1]
    FAN M, LAI S, HUAN G J, et al. Rethinking BiseNet for real-time semantic segmentation[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE Press, 2021: 9716-9725.
    [2]
    LATHA P, RAJ T. Advanced deep learning method for aerial image segmentation of landscape changes in pre-and post-disaster scenarios[J]. Global NEST Journal, 2022, 24(3): 544-552.
    [3]
    GUO Z L, SHENGOKU H, WU G M, et al. Semantic segmentation for urban planning maps based on U-Net[C]//Proceedings of the IEEE International Geoscience and Remote Sensing Symposium. Piscataway: IEEE Press, 2018: 6187-6190.
    [4]
    FAN R Y, LI F P, HAN W, et al. Fine-scale urban informal settlements mapping by fusing remote sensing images and building data via a Transformer-based multimodal fusion network[J]. IEEE Transactions on Geoscience and Remote Sensing, 2022, 60: 5630316.
    [5]
    ZHU Q Q, ZHANG Y N, WANG L Z, et al. A global context-aware and batch-independent network for road extraction from VHR satellite imagery[J]. ISPRS Journal of Photogrammetry and Remote Sensing, 2021, 175: 353-365. doi: 10.1016/j.isprsjprs.2021.03.016
    [6]
    ZHOU M T, SUI H G, CHEN S X, et al. BT-RoadNet: a boundary and topologically-aware neural network for road extraction from high-resolution remote sensing imagery[J]. ISPRS Journal of Photogrammetry and Remote Sensing, 2020, 168: 288-306. doi: 10.1016/j.isprsjprs.2020.08.019
    [7]
    LONG J, SHELHAMER E, DARRELL T. Fully convolutional networks for semantic segmentation[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE Press, 2015: 3431-3440.
    [8]
    RONNEBERGER O, FISCHER P, BROX T. U-Net: convolutional networks for biomedical image segmentation[C]//Proceedings of the International Conference on Medical Image Computing and Computer-assisted Intervention. Berlin: Springer, 2015: 234-241.
    [9]
    WANG X, LI Z S, HUANG Y P, et al. Multimodal medical image segmentation using multi-scale context-aware network[J]. Neurocomputing, 2022, 486: 135-146. doi: 10.1016/j.neucom.2021.11.017
    [10]
    LUO J, ZHAO L, ZHU L, et al. Multi-scale receptive field fusion network for lightweight image super-resolution[J]. Neurocomputing, 2022, 493: 314-326. doi: 10.1016/j.neucom.2022.04.038
    [11]
    LIN D, SHEN D G, SHEN S T, et al. ZigZagNet: fusing top-down and bottom-up context for object segmentation[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE Press, 2019: 7482-7491.
    [12]
    吴泽康, 赵姗, 李宏伟, 等. 遥感图像语义分割空间全局上下文信息网络[J]. 浙江大学学报(工学版), 2022, 56(4): 795-802.

    WU Z K, ZHAO S, LI H W, et al. Spatial global context information network for semantic segmentation of remote sensing image[J]. Journal of Zhejiang University (Engineering Science), 2022, 56(4): 795-802(in Chinese).
    [13]
    CHEN L C, PAPANDREOU G, KOKKINOS I, et al. DeepLab: semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFs[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2018, 40(4): 834-848. doi: 10.1109/TPAMI.2017.2699184
    [14]
    CHEN L C, PAPANDREOU G, SCHROFF F, et al. Rethinkingatrous convolution for semantic image segmentation[EB/OL]. (2017-06-17)[2023-03-01]. http://arxiv.org/abs/1706.05587.
    [15]
    ZHAO H S, SHI J P, QI X J, et al. Pyramid scene parsing network[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE Press, 2017: 6230-6239.
    [16]
    HU J, SHEN L, SUN G. Squeeze-and-excitation networks[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE Press, 2018: 7132-7141.
    [17]
    WOO S, PARK J, LEE J Y, et al. CBAM: convolutional block attention module[C]//Proceedings of the European Conference on Computer Vision. Berlin: Springer, 2018: 3-19.
    [18]
    ZHOU Z, ZHOU Y, WANG D L, et al. Self-attention feature fusion network for semantic segmentation[J]. Neurocomputing, 2021, 453: 50-59. doi: 10.1016/j.neucom.2021.04.106
    [19]
    谭大宁, 刘瑜, 姚力波, 等. 基于视觉注意力机制的多源遥感图像语义分割[J]. 信号处理, 2022, 38(6): 1180-1191.

    TAN D N, LIU Y, YAO L B, et al. Semantic segmentation of multi-source remote sensing images based on visual attention mechanism[J]. Journal of Signal Processing, 2022, 38(6): 1180-1191(in Chinese).
    [20]
    FU J, LIU J, TIAN H J, et al. Dual attention network for scene segmentation[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE Press, 2019: 3141-3149.
    [21]
    HUANG Z L, WANG X G, HUANG L C, et al. CCNet: criss-cross attention for semantic segmentation[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision. Piscataway: IEEE Press, 2019: 603-612.
    [22]
    LIN T Y, DOLLÁR P, GIRSHICK R, et al. Feature pyramid networks for object detection[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE Press, 2017: 936-944.
    [23]
    SIMONYAN K, ZISSERMAN A. Very deep convolutional networks for large-scale image recognition[EB/OL]. (2014-09-04)[2023-03-01]. http://arxiv.org/abs/1409.1556.
    [24]
    BADRINARAYANAN V, KENDALL A, CIPOLLA R. SegNet: a deep convolutional encoder-decoder architecture for image segmentation[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(12): 2481-2495. doi: 10.1109/TPAMI.2016.2644615
    [25]
    PENG C, ZHANG X Y, YU G, et al. Large kernel matters—improve semantic segmentation by global convolutional network[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE Press, 2017: 1743-1751.
    [26]
    LI X, ZHONG Z S, WU J L, et al. Expectation-maximization attention networks for semantic segmentation[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision. Piscataway: IEEE Press, 2019: 9166-9175.
    [27]
    YANG Y, DONG J W, WANG Y H, et al. DMAU-Net: an attention-based multiscale max-pooling dense network for the semantic segmentation in VHR remote-sensing images[J]. Remote Sensing, 2023, 15(5): 1328. doi: 10.3390/rs15051328
    [28]
    LI X, XU F, LIU F, et al. A synergistical attention model for semantic segmentation of remote sensing images[J]. IEEE Transactions on Geoscience and Remote Sensing, 2023, 61: 5400916.
    [29]
    HUYNH C, TRAN A T, LUU K, et al. Progressive semantic segmentation[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE Press, 2021: 16750-16759.
  • 加载中

Catalog

    通讯作者: 陈斌, bchen63@163.com
    • 1. 

      沈阳化工大学材料科学与工程学院 沈阳 110142

    1. 本站搜索
    2. 百度学术搜索
    3. 万方数据库搜索
    4. CNKI搜索

    Figures(6)  / Tables(4)

    Article Metrics

    Article views(469) PDF downloads(23) Cited by()
    Proportional views
    Related

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return