
Smart Agriculture ›› 2024, Vol. 6 ›› Issue (6): 96-108. doi: 10.12133/j.smartag.SA202407019

• Topic: Intelligent Agricultural Knowledge Services and Smart Unmanned Farms (Part 1) •


Rice Leaf Disease Image Enhancement Based on Improved CycleGAN

YAN Congkuan1, ZHU Dequan1, MENG Fankai1, YANG Yuqing1, TANG Qixing1, ZHANG Aifang2, LIAO Juan1

  1. School of Engineering, Anhui Agricultural University, Hefei 230036, China
    2. Institute of Plant Protection and Agricultural Product Quality and Safety, Anhui Academy of Agricultural Sciences, Hefei 230031, China
  • Received: 2024-07-18  Online: 2024-11-30
  • Foundation items: Sub-project of the National Key Research and Development Program (2022YFD2001801-3); National Natural Science Foundation of China (32201665)
  • About author:
    YAN Congkuan, research interests: machine vision and crop disease recognition. E-mail:
  • Corresponding author:
    LIAO Juan, Ph.D., Associate Professor, research interests: machine vision and intelligent agricultural information processing. E-mail:


Abstract:

Objective Rice diseases significantly impact both the yield and quality of rice production. Automatic recognition of rice diseases using computer vision is crucial for ensuring high yield, quality, and efficiency. However, rice disease image recognition faces challenges such as the limited availability of datasets, insufficient sample sizes, and imbalanced sample distributions across disease categories. To address these challenges, a data augmentation method for rice leaf disease images based on an improved CycleGAN model was proposed in this research, which aimed to expand disease image datasets by generating disease features, thereby alleviating the burden of collecting real disease data and providing more comprehensive and diverse data to support automatic rice disease recognition.

Methods The proposed approach built upon the CycleGAN framework, with a key modification being the integration of a convolutional block attention module (CBAM) into the generator's residual modules. This enhancement strengthened the network's ability to extract both local key features and global contextual information from rice disease-affected areas, and increased the model's sensitivity to small-scale disease targets and subtle variations between the healthy and diseased domains. This design effectively mitigated the loss of critical feature information during image generation, ensuring higher fidelity in the resulting images. Additionally, skip connections were introduced between the residual modules and the CBAM; these connections improved information flow between different layers of the network and addressed common issues such as vanishing gradients during the training of deep networks. Furthermore, a perceptual similarity loss function, designed to align with the human visual system, was incorporated into the overall loss function.
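A loss of this kind can be sketched as a layer-wise distance between feature activations of a fixed, pretrained network. The sketch below is an LPIPS-style illustration under assumed inputs; the feature extractor and layer weighting are hypothetical stand-ins, not the paper's exact configuration:

```python
def perceptual_similarity_loss(feats_generated, feats_real):
    """LPIPS-style perceptual loss: average squared distance between
    per-layer feature vectors produced by a fixed, pretrained network
    (the extractor itself is a hypothetical stand-in here)."""
    assert len(feats_generated) == len(feats_real)
    total = 0.0
    for f, r in zip(feats_generated, feats_real):
        # mean squared difference within one layer's activations
        total += sum((a - b) ** 2 for a, b in zip(f, r)) / len(f)
    return total / len(feats_generated)
```

Because the distance is measured in feature space rather than pixel space, two images can differ pixel-wise yet still score as perceptually close, which is what discourages over-smoothed generator outputs.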
This addition enabled the model to more accurately measure perceptual differences between generated and real images, thereby guiding the network towards producing higher-quality samples. It also helped reduce visual artifacts and excessive smoothing, while improving the stability of the model during training. To comprehensively evaluate the quality of the rice disease images generated by the proposed model and its impact on disease recognition performance, both subjective and objective evaluation metrics were utilized: user perception evaluation (UPE), the structural similarity index (SSIM), peak signal-to-noise ratio (PSNR), and disease recognition performance within object detection frameworks. Comparative experiments were conducted across multiple GAN models, enabling a thorough assessment of the proposed model's performance in generating rice disease images. Additionally, different attention mechanisms, including efficient channel attention (ECA), coordinate attention (CA), and CBAM, were individually embedded into the generator's residual modules, allowing a detailed comparison of their effects on network performance and on the visual quality of the generated images. Ablation studies were further performed to validate the effectiveness of the CBAM residual module and the perceptual similarity loss function in the network's overall architecture. Based on the generated rice disease samples, transfer learning experiments were conducted using various object detection models; by comparing the performance of these models before and after transfer learning, the effectiveness of the generated disease image data in enhancing detection performance was empirically verified.
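As a concrete reference for the objective metrics, PSNR is derived from the mean squared error between a generated image and its reference. A minimal sketch over flattened 8-bit pixel sequences (the flattened-list representation is a simplification for illustration):

```python
import math

def psnr(pixels_a, pixels_b, max_val=255.0):
    """Peak signal-to-noise ratio in decibels between two equal-length
    flattened pixel sequences; higher values mean the generated image
    is closer to the reference."""
    mse = sum((a - b) ** 2 for a, b in zip(pixels_a, pixels_b)) / len(pixels_a)
    if mse == 0:
        return float("inf")  # identical images
    return 10.0 * math.log10(max_val ** 2 / mse)
```

SSIM, by contrast, compares local luminance, contrast, and structure statistics over sliding windows rather than raw pixel error, which is why the two metrics are commonly reported together.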
Results and Discussions The rice disease images generated by the improved CycleGAN model surpassed those produced by other GAN variants in terms of image detail clarity and the prominence of disease-specific features. In terms of objective quality metrics, the proposed model exhibited a 3.15% improvement in SSIM and an 8.19% enhancement in PSNR compared to the original CycleGAN model, underscoring its advantage in structural similarity and signal-to-noise ratio. The comparative experiments involving different attention mechanisms and the ablation studies revealed that embedding the CBAM into the generator effectively increased the network's focus on critical disease-related features, resulting in more realistic and clearly defined disease-affected regions in the generated images. Furthermore, the perceptual similarity loss function substantially enhanced the network's ability to perceive and represent disease-related information, improving the visual fidelity and realism of the generated images. Additionally, transfer learning applied to object detection models such as YOLOv5s, YOLOv7-tiny, and YOLOv8s led to significant improvements in disease detection performance on the augmented dataset. Notably, the detection accuracy of the YOLOv5s model increased from 79.7% to 93.8%, a considerable enhancement in both generalization ability and robustness. This improvement also reduced the rates of false positives and false negatives, resulting in more stable and reliable performance in rice disease detection tasks.

Conclusions The rice leaf disease image generation method based on the improved CycleGAN model, as proposed in this study, effectively transforms images of healthy leaves into those depicting disease symptoms. By addressing the challenge of insufficient disease samples, this method significantly improves the disease recognition capabilities of object detection models.
Therefore, it holds considerable application potential in the domain of leaf disease image augmentation and offers a promising new direction for expanding datasets of disease images for other crops.
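The CBAM-augmented residual block described in the Methods can be illustrated with a pure-Python sketch of its channel-attention stage. The collapsed "MLP" and the identity stand-in for the block's convolutions are simplifying assumptions for readability; real implementations use learned convolution layers and also apply a spatial-attention stage:

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def channel_attention(fmap):
    """CBAM-style channel attention: weight each channel by a gate
    computed from its global average- and max-pooled responses.
    fmap is a list of channels, each a 2-D list (H x W)."""
    weights = []
    for ch in fmap:
        flat = [v for row in ch for v in row]
        avg = sum(flat) / len(flat)   # global average pooling
        mx = max(flat)                # global max pooling
        # the shared MLP is collapsed to identity in this sketch
        weights.append(sigmoid(avg + mx))
    return [[[v * w for v in row] for row in ch]
            for ch, w in zip(fmap, weights)]

def cbam_residual_block(x, conv):
    """Residual block with channel attention on the conv branch;
    `conv` is a hypothetical stand-in for the block's conv layers."""
    y = channel_attention(conv(x))
    # skip connection: add the block input back element-wise
    return [[[a + b for a, b in zip(r1, r2)]
             for r1, r2 in zip(c1, c2)]
            for c1, c2 in zip(x, y)]
```

The gate pushes weights of weakly activated channels toward zero while strongly activated (disease-relevant) channels pass through nearly unchanged, and the skip connection preserves the original features so attention can only refine, not erase, them.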

Key words: rice leaf disease, data augmentation, CycleGAN, CBAM, perceptual similarity loss, transfer learning

CLC Number: