面向多源SAR图像的多级特征注意力水体提取网络

Multi-level Feature Attention Fusion Network for Water Extraction from Multi-source SAR Images

  • 摘要: 利用合成孔径雷达(synthetic aperture radar,SAR)图像进行水体提取在城市水体监测、海岸线监测、洪涝灾害监测等方面具有重要的应用价值。目前对单频段SAR图像数据水体提取已经取得了较大成功。但由于水体及地物目标在不同频段SAR图像中呈现的特性有较大差别,针对单频段SAR图像设计的深度学习网络,在应用到不同频段时提取精度较差,因此,如何实现多源SAR图像水体自动提取依然是个不小的挑战。对此,提出了一种新的网络框架,即多级特征注意力融合网络(multi-level feature attention fusion network,MFAFNet)。该网络由编码器和解码器组成,编码器先使用ResNet-101提取具有不同分辨率的4级特征,再由所提出的中间级特征融合模块和有效通道空洞空间卷积池化金字塔模块并行处理,对中高级特征进行深度融合;在解码器中引入注意调制模块对低级特征进行权重分配,进一步与来自编码器的高级特征融合并进一步处理,获得水体提取结果。为了验证所提网络框架的有效性,对不同频段和分辨率的SAR图像(哨兵1号、TerraSAR和高分三号)进行了实验,并与3个典型网络进行了对比。结果表明,MFAFNet对多源SAR图像的水体提取效果显著优于其他网络,平均用户精度可达87%,平均交并比为0.80,实现了多源SAR图像水体的高精度自动提取。

     

    Abstract:
    Objective At present, water extraction from single frequency-band synthetic aperture radar (SAR) images has achieved great success. However, due to the large differences in the characteristics of water bodies and other surface targets in SAR images with different frequency-bands, deep learning network designed for single frequency-band has poor extraction accuracy when applied to SAR images with different frequency bands. So, how to achieve excellent extraction performance for water automatically from multi-source SAR images is still a challenge.
    Methods We propose a new network framework, namely multi-level feature attention fusion network (MFAFNet). The network consists of encoder and decoder. The encoder utilizes ResNet-101 to generate four-level features with different resolutions, which are then processed in parallel by the intermediate level feature fusion module and efficient channel atrous spatial pyramid pooling module, and the intermediate and high-level features are output after deep fusion. The attention modulation module attention mechanism is introduced into decoder to distribute the weight of low-level features, then which are further fused with the high-level features from the encoder to obtain the water extraction results.
    Results Experiments are carried out on SAR images with different frequency bands and resolutions (Sentinel-1, TerraSAR and Gaofen-3), and three good networks have been compared. The results show that the MFAFNet extraction effect is obviously better than other networks, with an average pixel accuracy of 87% and an average intersection over union of 0.80.
    Conclusions The water body extraction from multi-band and multi-resolution SAR images is achieved, which promotes the practical application value of deep learning network in water body detection.

     

/

返回文章
返回