Lightweight Apple Leaf Disease Detection Algorithm Based on Improved YOLOv8

doi:10.12133/j.smartag.SA202406012

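Abstract

Abstract (translated from Chinese):

[Objective/Significance] Apples are an important agricultural product in China. Developing apple leaf disease detection technology is important for keeping apple trees healthy and reducing disease incidence. To address the need for rapid disease detection during apple growth, this study proposes an apple leaf disease detection algorithm based on an improved YOLOv8. [Methods] The YOLOv8n model was selected to identify several diseases that occur during apple growth (brown rot, sooty blotch, apple scab, and rust). SPD-Conv was introduced to replace conventional convolutional layers, reducing the model's parameter count and computational cost while improving detection accuracy. A multi-scale dilated attention (MSDA) mechanism was added to the Neck layer so that the model adaptively focuses on key image regions through a dynamic receptive field, strengthening disease feature extraction. In addition, following the reparameterized convolutional neural network (RepVGG) architecture, the original detection head was optimized to separate the detection and inference architectures, which sped up inference and improved feature learning. Finally, an apple leaf dataset covering the above diseases was constructed, and experiments were conducted on it. [Results and Discussions] With the computational cost reduced by 0.1 G, the improved model reached an mAP50 of 88.2% and an mAP50:95 of 37.0%, which were 2.7 and 1.3 percentage points higher than the original model, with a model size of only 7.8 MB. Precision and recall were 83.1% and 80.2%, which were 0.9 and 1.1 percentage points higher than the original model. In comparison experiments against YOLOv7-tiny, YOLOv9-c, RetinaNet, Faster-RCNN, and other models, the proposed YOLOv8n-SMR model performed well while keeping computational complexity and parameter count under control. The optimized network keeps model size, floating-point operations, and parameter count low, making it suitable for efficient deployment on hardware-constrained devices such as unmanned aerial vehicle systems. [Conclusions] The improved model detects apple leaf diseases accurately. The method improves detection accuracy and, through its lightweight design, reduces computational cost. It provides reliable data support for subsequent apple growth and fruit harvesting and a useful reference for further research on apple leaf diseases.

Keywords: deep learning, YOLOv8, apple leaf disease detection, MSDA, SPD-Conv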

Abstract:

[Objective] As one of China's most important agricultural products, apples hold a significant position in both cultivation area and yield. However, during growth, apples are prone to various diseases that not only affect fruit quality but also significantly reduce yield, impacting farmers' economic benefits and the stability of market supply. To reduce the incidence of apple diseases and increase fruit yield, developing efficient and fast apple leaf disease detection technology is of great significance. An improved YOLOv8 algorithm was proposed to identify the leaf diseases that occur during apple growth. [Methods] The YOLOv8n model was selected to detect the leaf diseases that apples might encounter during growth, namely brown rot, rust, apple scab, and sooty blotch. SPD-Conv was introduced to replace the original convolutional layers so as to retain fine-grained information and reduce model parameters and computational cost, thereby improving the accuracy of disease detection. A multi-scale dilated attention (MSDA) mechanism was added at appropriate positions in the Neck layer to enhance the model's feature representation capability. It allowed the model to learn the receptive field dynamically and to focus adaptively on the most representative regions and features in the image, thereby enhancing the extraction of disease-related features. In addition, inspired by the RepVGG architecture, the original detection head was optimized to separate the detection and inference architectures, which not only accelerated inference but also enhanced feature learning capability. Finally, a dataset of apple leaf diseases containing the aforementioned diseases was constructed, and experiments were conducted on it. [Results and Discussions] Compared to the original model, the improved model showed significant improvements in various performance metrics. 
The mAP50 and mAP50:95 reached 88.2% and 37.0%, respectively, which were 2.7 and 1.3 percentage points higher than those of the original model. Precision and recall rose to 83.1% and 80.2%, improvements of 0.9 and 1.1 percentage points over the original model. Additionally, the improved model was only 7.8 MB in size, and its computational cost was reduced by 0.1 GFLOPs. To analyze how MSDA placement affects model performance, MSDA was added at different positions in the Neck layer and the resulting models were compared experimentally. The results showed that adding MSDA at the small-target layer of the Neck achieved the best effect, improving model performance while keeping computational cost and model size low, which provides a reference for optimizing the MSDA mechanism. To further verify the effectiveness of the improved model, it was compared with mainstream models including YOLOv7-tiny, YOLOv9-c, RetinaNet, and Faster-RCNN. The improved model outperformed these models by 1.4, 1.3, 7.8, and 11.6 percentage points in mAP50 and by 2.8, 0.2, 3.4, and 5.6 percentage points in mAP50:95, respectively. Moreover, the improved model showed clear advantages in floating-point operations, model size, and parameter count, with only 3.7 M parameters, making it more suitable for deployment on hardware-constrained devices such as drones. In addition, to assess the model's generalization ability, stratified sampling was used to select 20% of the images in the dataset as the test set. The results showed that the improved model maintained high detection accuracy in complex and variable scenes, with mAP50 and mAP50:95 increasing by 1.7 and 1.2 percentage points, respectively, compared to the original model. Considering the differences in the number of samples for each disease in the dataset, a class balance experiment was also designed. 
Synthetic samples were generated using oversampling techniques to increase the number of minority-class samples. The experimental results showed that the class-balanced dataset improved the model's detection performance, with precision increasing from 83.1% to 85.8%, recall from 80.2% to 83.6%, mAP50 from 88.2% to 88.9%, and mAP50:95 from 37.0% to 39.4%. Balancing the classes enhanced the detection of minority-class diseases and thereby improved the model's overall performance. [Conclusions] The improved model demonstrated clear advantages in apple leaf disease detection. By introducing SPD-Conv and the MSDA mechanism, the model improved both precision and recall while reducing computational cost, making detection more efficient. The improved model could provide continuous health monitoring throughout the apple growth process and offer robust data support for farmers' scientific decision-making before fruit harvesting.

Key words: deep learning, YOLOv8, apple leaf disease detection, MSDA, SPD-Conv
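
The space-to-depth step behind SPD-Conv can be illustrated with a minimal sketch. It assumes a scale factor of 2 and plain nested Python lists instead of tensors; the function name `space_to_depth` is hypothetical, and this is not the authors' implementation. It shows why downsampling by rearrangement retains fine-grained information: every pixel value is moved into a channel rather than skipped or merged. In SPD-Conv this rearrangement is followed by a non-strided convolution, which is omitted here.

```python
def space_to_depth(channels, scale=2):
    """Rearrange each H x W channel into scale*scale channels of size (H/scale) x (W/scale).

    `channels` is a list of 2D lists, one per input channel. No value is
    dropped: every pixel ends up in exactly one output channel.
    """
    height, width = len(channels[0]), len(channels[0][0])
    if height % scale or width % scale:
        raise ValueError("height and width must be divisible by scale")
    out = []
    # The order of the offsets is an illustrative choice; any fixed order works
    # as long as the convolution that follows is trained with the same order.
    for dx in range(scale):
        for dy in range(scale):
            for ch in channels:
                out.append([row[dx::scale] for row in ch[dy::scale]])
    return out


# One 4 x 4 channel becomes four 2 x 2 channels.
feature_map = [[
    [1, 2, 3, 4],
    [5, 6, 7, 8],
    [9, 10, 11, 12],
    [13, 14, 15, 16],
]]
sub_maps = space_to_depth(feature_map)
for sub in sub_maps:
    print(sub)
```

The spatial resolution is halved while the channel count grows fourfold, so the following convolution still sees all 16 input values.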

CLC number: TP391.4

LUO Youlu, PAN Yonghao, XIA Shunxing, TAO Youzhi. Lightweight Apple Leaf Disease Detection Algorithm Based on Improved YOLOv8[J]. Smart Agriculture, 2024, 6(5): 128-138.

Apple leaf disease class	Training set/images	Validation set/images	Test set/images	Total/images
Brown rot	256	36	77	369
Sooty blotch	372	54	106	532
Apple scab	277	40	79	396
Rust	288	42	82	412
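
The split proportions implied by the table above, and the size of the class imbalance, can be checked with a short sketch. The counts are copied from the table. The English class names follow the English abstract, and their mapping to the table rows is an assumption. The balancing target (matching the largest training class) is also an assumption made for illustration, since the abstract does not state which oversampling technique or target was used.

```python
# Per-class image counts copied from the table above: (train, validation, test).
# Class names follow the English abstract; the row mapping is an assumption.
counts = {
    "brown rot": (256, 36, 77),
    "sooty blotch": (372, 54, 106),
    "apple scab": (277, 40, 79),
    "rust": (288, 42, 82),
}


def split_shares(train, val, test):
    """Return the train/validation/test shares of one class, in percent."""
    total = train + val + test
    return tuple(round(100 * n / total, 1) for n in (train, val, test))


def oversampling_targets(class_counts):
    """Extra training images each class would need to match the largest class.

    Matching the largest class is a hypothetical target, not the paper's.
    """
    largest = max(train for train, _, _ in class_counts.values())
    return {name: largest - train for name, (train, _, _) in class_counts.items()}


shares = {name: split_shares(*c) for name, c in counts.items()}
extra = oversampling_targets(counts)
for name in counts:
    print(name, shares[name], "extra training images:", extra[name])
```

Each class keeps roughly a 70/10/20 split, which is what a stratified 20% test set would produce.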

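Disease class	Characteristics and symptoms
Brown rot	Lesions form on the leaves, usually as irregular brown or black spots; over time the lesions enlarge and merge into larger lesions
Sooty blotch	Lesions form on the leaves, usually starting as small, light-brown spots that enlarge as the disease develops, gradually darken to dark brown or black-brown, and are accompanied by a gray mold layer
Apple scab	Lesions appear as small, round, black-brown spots that gradually turn black or dark brown; the margins are usually sharp and ring-shaped or irregular
Rust	Lesions form on the leaves as small yellow or orange spots, round or semicircular, that become raised on the leaf surface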

Experiment	MSDA position	mAP50/%	mAP50:95/%	FLOPs/G	Model size/MB
0	None	86.7	36.6	7.7	7.6
1	Position 1	88.0	37.0	8.1	8.2
2	Position 2	88.2	37.0	8.0	7.8
3	Position 3	87.1	36.8	8.0	7.7
4	Position 4	86.6	36.4	8.0	7.8
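
How dilation gives MSDA its multi-scale receptive field can be sketched without any attention arithmetic. The snippet below only enumerates which neighbours a query pixel would attend to for a given window size and dilation rate. The 3 x 3 window and the rates 1, 2, and 3 are illustrative assumptions, and the helper names are hypothetical; none of this is taken from the authors' configuration.

```python
def dilated_offsets(kernel_size, dilation):
    """Offsets (dy, dx) of the keys a query attends to in one dilated window.

    Assumes an odd kernel_size so the window is centred on the query.
    """
    half = kernel_size // 2
    steps = [dilation * s for s in range(-half, half + 1)]
    return [(dy, dx) for dy in steps for dx in steps]


def window_span(kernel_size, dilation):
    """Side length, in pixels, of the area covered by the dilated window."""
    return (kernel_size - 1) * dilation + 1


# Hypothetical setting: 3 x 3 windows, one dilation rate per group of heads.
for rate in (1, 2, 3):
    offsets = dilated_offsets(3, rate)
    print("rate", rate, "span", window_span(3, rate), "keys", len(offsets))
```

The number of keys per query stays at nine while the covered area grows from 3 x 3 to 7 x 7, so heads with different rates see different scales at the same cost.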

Experiment	SPD-Conv	MSDA	RepHead	Precision/%	Recall/%	mAP50/%	mAP50:95/%	FLOPs/G	Model size/MB
1	√	-	-	82.9	78.5	85.9	36.3	7.4	5.6
2	-	√	-	83.6	81.7	87.3	36.6	8.4	8.3
3	-	-	√	84.2	78.7	86.5	35.9	8.4	8.2
4	√	√	-	82.9	83.0	87.7	37.7	7.9	5.9
5	√	-	√	81.9	80.4	86.7	36.6	7.7	7.6
6	-	√	√	84.0	80.1	87.5	37.0	8.8	8.6
7	√	√	√	83.1	80.2	88.2	37.0	8.0	7.8
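
The RepHead column refers to a detection head inspired by RepVGG. The core idea of RepVGG-style reparameterization is that parallel 3 x 3, 1 x 1, and identity branches used during training can be merged into one equivalent 3 x 3 convolution for inference. The sketch below demonstrates that equivalence under strong simplifying assumptions: a single channel, no bias, and no batch normalization (which the real method also folds into the kernel). The function names and example values are hypothetical, and this is not the authors' RepHead.

```python
def conv2d_same(image, kernel):
    """Single-channel 2D cross-correlation with zero padding (same output size)."""
    h, w = len(image), len(image[0])
    k = len(kernel)
    pad = k // 2
    out = [[0.0] * w for _ in range(h)]
    for i in range(h):
        for j in range(w):
            acc = 0.0
            for di in range(k):
                for dj in range(k):
                    y, x = i + di - pad, j + dj - pad
                    if 0 <= y < h and 0 <= x < w:
                        acc += kernel[di][dj] * image[y][x]
            out[i][j] = acc
    return out


def fuse_branches(kernel3x3, weight1x1, use_identity=True):
    """Merge 3x3, 1x1, and identity branches into one equivalent 3x3 kernel."""
    fused = [row[:] for row in kernel3x3]
    # A 1x1 kernel equals a 3x3 kernel that is zero except at the centre;
    # the identity branch equals a 1x1 kernel with weight 1.
    fused[1][1] += weight1x1 + (1.0 if use_identity else 0.0)
    return fused


image = [[1.0, 2.0, 0.0], [0.0, 3.0, 1.0], [2.0, 1.0, 4.0]]
k3 = [[0.5, -1.0, 0.0], [0.25, 2.0, -0.5], [1.0, 0.0, 0.75]]
w1 = 0.5

# Training-time form: three parallel branches whose outputs are summed.
branch3 = conv2d_same(image, k3)
multi_branch = [
    [branch3[i][j] + w1 * image[i][j] + image[i][j] for j in range(3)]
    for i in range(3)
]

# Inference-time form: one 3x3 convolution with the fused kernel.
single_branch = conv2d_same(image, fuse_branches(k3, w1))
print(multi_branch == single_branch)
```

Because convolution is linear, the fused kernel reproduces the multi-branch output while requiring only one convolution at inference time.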

Model	Precision/%	Recall/%	mAP50/%	mAP50:95/%	FLOPs/G	Model size/MB	Parameters/M
YOLOv8n-SMR	83.1	80.2	88.2	37.0	8.0	7.8	3.7
YOLOv9-c	83.8	81.0	86.9	36.8	102.3	51.6	25.5
YOLOv7-tiny	82.8	81.8	86.8	34.2	13.2	12.3	6.0
RetinaNet	78.3	78.2	80.4	33.0	191.4	139.0	36.3
Faster-RCNN	73.5	74.3	76.6	31.4	370.2	108.0	136.7
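
The accuracy gaps and cost ratios in the comparison table above can be recomputed directly from its rows. The sketch below copies the table values and reports, for each baseline, the mAP gaps to YOLOv8n-SMR in percentage points and how many times more floating-point operations the baseline needs. The dictionary layout and the helper name `compare` are illustrative choices, not part of the original work.

```python
# (mAP50 %, mAP50:95 %, GFLOPs, model size MB, parameters M), copied from the table above.
models = {
    "YOLOv8n-SMR": (88.2, 37.0, 8.0, 7.8, 3.7),
    "YOLOv9-c": (86.9, 36.8, 102.3, 51.6, 25.5),
    "YOLOv7-tiny": (86.8, 34.2, 13.2, 12.3, 6.0),
    "RetinaNet": (80.4, 33.0, 191.4, 139.0, 36.3),
    "Faster-RCNN": (76.6, 31.4, 370.2, 108.0, 136.7),
}


def compare(reference, other):
    """Return mAP gaps (percentage points) and the FLOPs ratio of `other` to `reference`."""
    ref, oth = models[reference], models[other]
    return {
        "map50_gap": round(ref[0] - oth[0], 1),
        "map50_95_gap": round(ref[1] - oth[1], 1),
        "flops_ratio": oth[2] / ref[2],
    }


report = {
    name: compare("YOLOv8n-SMR", name)
    for name in models
    if name != "YOLOv8n-SMR"
}
for name, row in report.items():
    print(name, row)
```

Reporting the gaps as differences rather than relative changes matches how the table values are expressed, since every metric is already a percentage.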