
Smart Agriculture, 2025, Vol. 7, Issue (2): 172-182. DOI: 10.12133/j.smartag.SA202410011

• Information Processing and Decision Making •

A Defogging Method for Remote Sensing Images Based on a Hybrid Attention-Based Generative Adversarial Network

MA Liu1, MAO Kebiao2, GUO Zhonghua1

1. School of Electronic and Electrical Engineering, Ningxia University, Ningxia 750021, China
    2. Institute of Agricultural Resources and Agricultural Zoning, Chinese Academy of Agricultural Sciences, Beijing 100081, China
• Received: 2024-10-14   Online: 2025-03-30
  • Foundation items:
    Key Project of Natural Science Foundation of Ningxia Science and Technology Department(2024AC02032); Central Public-Interest Scientific Institution Basal Research Fund(Y2025YC86)
  • About author:
    MA Liu, E-mail:
• Corresponding author:
    MAO Kebiao, E-mail:

Abstract:

[Objective] Remote sensing images have become an important data source in fields such as surface observation, environmental monitoring, and natural disaster prediction. However, the acquisition of remote sensing images is often affected by weather phenomena such as fog and clouds, which degrade image quality and pose challenges for subsequent analysis and processing tasks. In recent years, the introduction of attention mechanisms has enabled models to better capture and exploit important image features, significantly improving defogging performance. However, traditional channel attention mechanisms usually rely on global average pooling to summarize feature information. Although this simplifies computation, it performs poorly on images with significant local variation and is sensitive to outliers. In addition, remote sensing images usually cover wide areas, and diverse terrain makes fog patterns more complex. To address these issues, a hybrid attention-based generative adversarial network (HAB-GAN) was proposed in this research, integrating an efficient channel attention (ECA) module and a spatial attention block (SAB).

[Method] By merging feature extraction across both the channel and spatial dimensions, the model effectively enhanced its ability to identify and recover hazy areas in remote sensing images. In HAB-GAN, the ECA module captured local cross-channel interactions, addressing the insensitivity of traditional global average pooling to local detail. The ECA module used a global average pooling strategy without dimensionality reduction, automatically adapting to the characteristics of each channel without introducing extra parameters, thereby strengthening inter-channel dependencies. ECA employed a one-dimensional convolution whose kernel size was adaptively determined to set the range of channel interactions. This design avoided the over-smoothing of global features common in traditional pooling layers, allowing the model to extract local details more precisely while maintaining low computational complexity. The SAB module introduced a weighting mechanism on the spatial dimension by constructing a spatial attention map, enhancing the model's ability to identify hazy areas in the image. The module extracted feature maps through convolution operations and applied attention weighting in both the horizontal and vertical directions, highlighting regions with severe haze so that the model could better capture spatial information, thereby improving dehazing performance. The generator of HAB-GAN combined residual network structures with the hybrid attention modules. It first extracted initial features from the input image through convolutional layers and then passed these features through several residual blocks. The residual blocks effectively mitigated the vanishing-gradient problem in deep networks and maintained feature consistency and continuity by passing input features directly to deeper layers through skip connections. Each residual block incorporated the ECA and SAB modules, enabling precise feature learning through weighted processing in both the channel and spatial dimensions. After extracting effective features, the generator produced the dehazed image through convolution operations.
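As an illustration of the modules just described, the following is a minimal PyTorch sketch of an ECA module, an SAB module, and a hybrid-attention residual block. The module wiring, the kernel-size rule, and the directional form of the spatial attention are assumptions inferred from the abstract, not the authors' released code.

```python
# Illustrative sketch only: names, kernel sizes, and wiring are assumptions.
import math
import torch
import torch.nn as nn


class ECA(nn.Module):
    """Efficient channel attention: global average pooling without
    dimensionality reduction, a 1-D convolution whose kernel size adapts
    to the channel count, then sigmoid gating of each channel."""
    def __init__(self, channels: int, gamma: int = 2, b: int = 1):
        super().__init__()
        k = int(abs((math.log2(channels) + b) / gamma))
        k = k if k % 2 else k + 1  # the kernel size must be odd
        self.conv = nn.Conv1d(1, 1, kernel_size=k, padding=k // 2, bias=False)
        self.sigmoid = nn.Sigmoid()

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        y = x.mean(dim=(2, 3))                      # (N, C) channel descriptor
        y = self.conv(y.unsqueeze(1)).squeeze(1)    # local cross-channel interaction
        w = self.sigmoid(y).unsqueeze(-1).unsqueeze(-1)
        return x * w                                # reweight channels


class SAB(nn.Module):
    """Spatial attention block: weights features along the horizontal and
    vertical directions to highlight heavily hazed regions (a guess at the
    paper's directional weighting; details are not given in the abstract)."""
    def __init__(self, channels: int):
        super().__init__()
        self.conv_h = nn.Conv2d(channels, channels, kernel_size=1)
        self.conv_w = nn.Conv2d(channels, channels, kernel_size=1)
        self.sigmoid = nn.Sigmoid()

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        h = self.sigmoid(self.conv_h(x.mean(dim=3, keepdim=True)))  # (N, C, H, 1)
        w = self.sigmoid(self.conv_w(x.mean(dim=2, keepdim=True)))  # (N, C, 1, W)
        return x * h * w  # broadcasts to a full spatial attention map


class HybridResBlock(nn.Module):
    """Residual block with ECA and SAB applied to the residual branch;
    the skip connection keeps feature consistency in deep stacks."""
    def __init__(self, channels: int = 64):
        super().__init__()
        self.body = nn.Sequential(
            nn.Conv2d(channels, channels, 3, padding=1),
            nn.ReLU(inplace=True),
            nn.Conv2d(channels, channels, 3, padding=1),
        )
        self.eca = ECA(channels)
        self.sab = SAB(channels)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        out = self.sab(self.eca(self.body(x)))
        return x + out  # skip connection mitigates vanishing gradients
```

In a design of this kind, ECA adds almost no parameters because its 1-D convolution operates on a single pooled channel descriptor rather than on the full feature map.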
The discriminator adopted a standard convolutional neural network architecture, focusing on extracting local detail features from the images produced by the generator. It consisted of multiple convolutional layers, batch normalization layers, and Leaky ReLU activation functions. By extracting local features layer by layer and downsampling, the discriminator progressively reduced the spatial resolution of the images, evaluating their realism at both the global and local levels. The generator and discriminator were jointly optimized through adversarial training: the generator aimed to produce increasingly realistic dehazed images, while the discriminator continually improved its ability to distinguish real from generated images, thereby enhancing the learning effectiveness and output quality of the generator.

[Results and Discussions] To validate the effectiveness of HAB-GAN, experiments were conducted on the remote sensing image scene classification 45 (RESISC45) dataset. The results demonstrated that, compared with existing dehazing models, HAB-GAN excelled in key evaluation metrics such as peak signal-to-noise ratio (PSNR) and structural similarity index (SSIM). Specifically, compared with SpA GAN, HAB-GAN improved PSNR by 2.642 5 dB and SSIM by 0.012 2; compared with HyA-GAN, PSNR improved by 1.138 dB and SSIM by 0.001 9. Additionally, to assess the generalization capability of HAB-GAN, further experiments were conducted on the RICE2 dataset to verify its performance in cloud removal tasks. The results showed that HAB-GAN also performed exceptionally well in cloud removal, with PSNR improving by 3.593 2 dB and SSIM by 0.040 2; compared with HyA-GAN, PSNR and SSIM increased by 1.854 dB and 0.012 4, respectively. To further explore the impact of the individual modules on performance, ablation experiments were designed that progressively removed the ECA module, the SAB module, and the entire hybrid attention module. The results showed that removing the ECA module reduced PSNR by 2.642 5 dB and SSIM by 0.012 2; removing the SAB module reduced PSNR by 2.955 dB and SSIM by 0.008 7; and removing the entire hybrid attention module reduced PSNR and SSIM by 3.866 1 dB and 0.033 4, respectively.

[Conclusions] The proposed HAB-GAN model not only performs excellently in dehazing and cloud removal tasks but also significantly enhances the clarity and detail recovery of dehazed images through the synergy of the ECA and SAB modules. Its strong performance across different remote sensing datasets further validates its effectiveness and generalization ability, showing broad application potential, particularly in fields such as agriculture, environmental monitoring, and disaster prediction, where high-quality remote sensing data are crucial. HAB-GAN is poised to become a valuable tool for improving data reliability and supporting more accurate decision-making and analysis.
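To make the adversarial setup described in the Method section concrete, below is a minimal PyTorch sketch of a discriminator of the stated kind (strided convolutions with batch normalization and LeakyReLU, progressively downsampling) together with one joint training step. Layer counts, channel widths, and the use of a binary cross-entropy adversarial loss plus an L1 reconstruction term are illustrative assumptions, not the paper's exact configuration.

```python
# Illustrative sketch only: architecture details and losses are assumptions.
import torch
import torch.nn as nn


class Discriminator(nn.Module):
    """Stack of strided convolutions with batch norm and LeakyReLU that
    downsamples the input and scores realism at the patch level."""
    def __init__(self, in_ch: int = 3, base: int = 64):
        super().__init__()
        layers = [nn.Conv2d(in_ch, base, 4, stride=2, padding=1),
                  nn.LeakyReLU(0.2, inplace=True)]
        ch = base
        for _ in range(3):  # progressively halve the spatial resolution
            layers += [nn.Conv2d(ch, ch * 2, 4, stride=2, padding=1),
                       nn.BatchNorm2d(ch * 2),
                       nn.LeakyReLU(0.2, inplace=True)]
            ch *= 2
        layers.append(nn.Conv2d(ch, 1, 4, padding=1))  # patch-level realism logits
        self.net = nn.Sequential(*layers)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.net(x)


def train_step(G, D, opt_g, opt_d, hazy, clear):
    """One round of joint adversarial optimization for a dehazing GAN."""
    bce = nn.BCEWithLogitsLoss()

    # Discriminator update: tell real clear images apart from generated ones.
    fake = G(hazy).detach()  # detach so the generator is not updated here
    d_real, d_fake = D(clear), D(fake)
    loss_d = (bce(d_real, torch.ones_like(d_real)) +
              bce(d_fake, torch.zeros_like(d_fake)))
    opt_d.zero_grad()
    loss_d.backward()
    opt_d.step()

    # Generator update: fool the discriminator while staying close to the
    # ground truth (the L1 term is a common image-to-image choice, assumed here).
    fake = G(hazy)
    d_fake = D(fake)
    loss_g = bce(d_fake, torch.ones_like(d_fake)) + nn.functional.l1_loss(fake, clear)
    opt_g.zero_grad()
    loss_g.backward()
    opt_g.step()
    return loss_d.item(), loss_g.item()
```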

Key words: remote sensing image, deep learning, generative adversarial network, efficient channel attention module, spatial attention module, defogging

CLC Number: