
Smart Agriculture


Research on Defogging Remote Sensing Images Based on a Hybrid Attention-Based Generative Adversarial Network

MA Liu1, MAO Kebiao2, GUO Zhonghua1

  1. School of Electronic and Electrical Engineering, Ningxia University, Yinchuan 750021, China
  2. Institute of Agricultural Resources and Regional Planning, Chinese Academy of Agricultural Sciences, Beijing 100081, China
  • Received: 2024-10-14  Online: 2025-01-24
  • Foundation item: Key Project of the Natural Science Foundation of the Ningxia Science and Technology Department (2025)
  • About the author: MA Liu, whose research focuses on multi-parameter retrieval from thermal infrared remote sensing based on adversarial neural networks. E-mail:
  • Corresponding author: MAO Kebiao, Ph.D., research fellow, whose research focuses on applications of artificial intelligence in geoscience and agronomy. E-mail:


Abstract:

[Objective] With the rapid development of remote sensing technology, remote sensing images have become a crucial data source for fields such as surface observation, environmental monitoring, and natural disaster prediction. However, the acquisition of remote sensing images is often affected by atmospheric conditions, particularly weather phenomena like haze and cloud cover, which degrade image quality and pose challenges to subsequent analysis and processing tasks. The presence of haze significantly reduces the contrast, color, and clarity of remote sensing images, thereby impairing the extraction and identification of ground features. Consequently, effectively removing haze from remote sensing images has become a focal point of interest for academia and industry. Haze removal is especially critical in agriculture, environmental protection, and urban planning, where high-quality remote sensing data are essential for monitoring crop growth, assessing soil quality, and predicting natural disasters. In recent years, the rise of deep learning has brought new possibilities for haze removal in remote sensing images. The introduction of attention mechanisms, in particular, has allowed models to better capture and utilize important features within images, significantly improving dehazing performance. However, despite these advancements, traditional channel attention mechanisms typically rely on global average pooling to aggregate feature information. While this approach simplifies computational complexity, it is less effective when dealing with images that exhibit significant local variations and is sensitive to outliers. Additionally, remote sensing images often cover vast areas with diverse terrains, complex landforms, and dramatic spectral variations, making haze patterns more complex and uneven.
Developing more efficient and adaptive dehazing methods that can fully account for local and global features in remote sensing images is a key direction for the future development of dehazing technology. [Method] Therefore, to address this issue, this paper proposes a Hybrid Attention-Based Generative Adversarial Network (HAB-GAN), which integrates an Efficient Channel Attention (ECA) module and a Spatial Attention Block (SAB). By merging feature extraction from both channel and spatial dimensions, the model effectively enhances its ability to identify and recover hazy areas in remote sensing images. In HAB-GAN, the ECA module captures local cross-channel interactions, addressing the insufficient sensitivity of traditional global average pooling to local detail information. The ECA module uses a global average pooling strategy without dimensionality reduction, automatically adapting to the characteristics of each channel without introducing extra parameters, thereby enhancing inter-channel dependencies. ECA employs a one-dimensional convolution whose kernel size is determined adaptively from the number of channels, setting the range of cross-channel interactions. This design effectively avoids the over-smoothing of global features common in traditional pooling layers, allowing the model to extract local details more precisely while maintaining low computational complexity. The SAB module introduces a weighting mechanism on the spatial dimension by constructing a spatial attention map to enhance the model's ability to identify hazy areas in the image. This module extracts feature maps through convolution operations and applies attention weighting in both horizontal and vertical directions, highlighting regions with severe haze and allowing the model to better capture spatial information in the image, thereby enhancing dehazing performance.
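The ECA mechanism described above can be sketched in a few lines. The following is a simplified NumPy illustration, not the paper's implementation: the 1-D convolution weights, which are learned in the real module, are replaced here by a fixed uniform kernel, and the kernel size follows the commonly used adaptive rule k = |log2(C)/γ + b/γ| rounded to the nearest odd number (γ = 2, b = 1); the function names are illustrative.

```python
import numpy as np

def eca_kernel_size(channels: int, gamma: int = 2, b: int = 1) -> int:
    """Adaptive 1-D kernel size from the channel count, rounded to odd."""
    t = int(abs(np.log2(channels) / gamma + b / gamma))
    return t if t % 2 == 1 else t + 1

def eca_attention(x: np.ndarray) -> np.ndarray:
    """Reweight a (C, H, W) feature map by local cross-channel interaction.

    Global average pooling (no dimensionality reduction) -> 1-D convolution
    across neighbouring channels -> sigmoid gate -> channel-wise rescaling.
    """
    c = x.shape[0]
    k = eca_kernel_size(c)
    gap = x.mean(axis=(1, 2))                 # (C,) pooled channel descriptor
    padded = np.pad(gap, k // 2, mode="edge")
    # A uniform kernel stands in for the learned 1-D convolution weights.
    conv = np.array([padded[i:i + k].mean() for i in range(c)])
    gate = 1.0 / (1.0 + np.exp(-conv))        # per-channel weight in (0, 1)
    return x * gate[:, None, None]
```

Because the gate is a sigmoid, each channel is scaled by a factor strictly between 0 and 1, so the module never amplifies a channel, only re-balances them.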
The generator of HAB-GAN combines residual network structures with hybrid attention modules. It first extracts initial features from input images through convolutional layers and then passes these features through several residual blocks. The residual blocks effectively mitigate the vanishing gradient problem in deep neural networks and maintain feature consistency and continuity by passing input features directly to deeper network layers through skip connections. Each residual block incorporates ECA and SAB modules, enabling precise feature learning through weighted processing in both channel and spatial dimensions. After extracting effective features, the generator reconstructs the dehazed image through final convolution operations. The discriminator adopts a standard convolutional neural network architecture, focusing on extracting local detail features from the images produced by the generator. It consists of multiple convolutional layers, batch normalization layers, and Leaky ReLU activation functions. By extracting local features layer by layer and down-sampling, the discriminator progressively reduces the spatial resolution of the images, evaluating their realism at both global and local levels. The generator and discriminator are jointly optimized through adversarial training, where the generator aims to produce increasingly realistic dehazed images, and the discriminator continually improves its ability to distinguish between real and generated images, thereby enhancing the learning effectiveness and image quality of the generator. [Results and Discussions] To validate the effectiveness of HAB-GAN, extensive experiments were conducted on the RESISC45 dataset. The experimental results demonstrate that compared to existing dehazing models, HAB-GAN excels in key evaluation metrics such as Peak Signal-to-Noise Ratio (PSNR) and Structural Similarity Index (SSIM).
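A generator residual block of the kind described above might be sketched as follows. This is a schematic NumPy version under stated simplifications, not the paper's architecture: the learned convolutions of the real SAB are approximated by channel-wise average and max pooling fused with a plain mean, the ECA gate is reduced to a sigmoid over pooled channel statistics, and `channel_gate`, `spatial_attention`, and `hybrid_residual_block` are illustrative names.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def channel_gate(x: np.ndarray) -> np.ndarray:
    """Stand-in for the channel attention gate: sigmoid over pooled statistics."""
    gap = x.mean(axis=(1, 2))                   # (C,) global average pooling
    return x * sigmoid(gap)[:, None, None]

def spatial_attention(x: np.ndarray) -> np.ndarray:
    """Weight each pixel of a (C, H, W) map by a 2-D sigmoid mask.

    Channel-wise average and max pooling give two (H, W) descriptors; the
    learned fusion convolution of the real SAB is approximated by their mean.
    """
    avg_map = x.mean(axis=0)
    max_map = x.max(axis=0)
    mask = sigmoid((avg_map + max_map) / 2.0)   # values strictly in (0, 1)
    return x * mask[None, :, :]

def hybrid_residual_block(x: np.ndarray) -> np.ndarray:
    """Skip connection around channel attention followed by spatial attention."""
    return x + spatial_attention(channel_gate(x))
```

The skip connection (`x + ...`) is what lets input features reach deeper layers unchanged, which is the vanishing-gradient mitigation mentioned above.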
Specifically, compared to SpA GAN, HAB-GAN improves PSNR by 2.64 dB and SSIM by 0.012 2; compared to HyA-GAN, PSNR improves by 1.14 dB and SSIM by 0.001 9. Additionally, to assess the generalization capability of HAB-GAN, further experiments were conducted on the RICE2 dataset to verify its performance in cloud removal tasks. The results show that HAB-GAN also performs exceptionally well in cloud removal tasks, with PSNR improving by 3.59 dB and SSIM improving by 0.040 2 over SpA GAN. Compared to HyA-GAN, PSNR and SSIM increased by 1.85 dB and 0.012 4, respectively. To further explore the impact of different modules on the model's performance, ablation experiments were designed, gradually removing the ECA module, the SAB module, and the entire hybrid attention module. The experimental results show that removing the ECA module reduces PSNR by 2.64 dB and SSIM by 0.012 2; removing the SAB module reduces PSNR by 2.96 dB and SSIM by 0.008 7; and removing the entire hybrid attention module reduces PSNR and SSIM by 3.87 dB and 0.033 4, respectively. [Conclusions] This demonstrates that the proposed HAB-GAN model not only performs excellently in dehazing and declouding tasks but also significantly enhances the clarity and detail recovery of dehazed images through the synergistic effect of the Efficient Channel Attention (ECA) module and the Spatial Attention Block (SAB). Additionally, its strong performance across different remote sensing datasets further validates its effectiveness and generalization ability, showcasing broad application potential. Particularly in fields such as agriculture, environmental monitoring, and disaster prediction, where high-quality remote sensing data is crucial, HAB-GAN is poised to become a valuable tool for improving data reliability and supporting more accurate decision-making and analysis.
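The two metrics quoted throughout can be computed as follows; this is a straightforward NumPy sketch that uses the single-window (global) form of SSIM with the standard constants C1 = (0.01·L)² and C2 = (0.03·L)², rather than the sliding-window implementation typically used when reporting published results.

```python
import numpy as np

def psnr(ref: np.ndarray, img: np.ndarray, max_val: float = 1.0) -> float:
    """Peak Signal-to-Noise Ratio in dB: 10 * log10(MAX^2 / MSE)."""
    mse = np.mean((ref - img) ** 2)
    return 10.0 * np.log10(max_val ** 2 / mse)

def ssim_global(ref: np.ndarray, img: np.ndarray, max_val: float = 1.0) -> float:
    """Single-window SSIM over the whole image (no sliding window)."""
    c1 = (0.01 * max_val) ** 2
    c2 = (0.03 * max_val) ** 2
    mu_x, mu_y = ref.mean(), img.mean()
    var_x, var_y = ref.var(), img.var()
    cov = ((ref - mu_x) * (img - mu_y)).mean()
    return ((2 * mu_x * mu_y + c1) * (2 * cov + c2)) / (
        (mu_x ** 2 + mu_y ** 2 + c1) * (var_x + var_y + c2)
    )
```

For instance, a uniform pixel error of 0.1 on a unit-range image gives MSE = 0.01 and hence a PSNR of 20 dB, which is why gains of 1–4 dB, as reported above, correspond to substantial reductions in reconstruction error.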

Key words: Remote sensing image, deep learning, generative adversarial network, efficient channel attention module, spatial attention module, haze
