欢迎您访问《智慧农业(中英文)》官方网站! English

Smart Agriculture ›› 2026, Vol. 8 ›› Issue (1): 62-71.doi: 10.12133/j.smartag.SA202509014

• 专题--农业病虫害智能识别与诊断 • 上一篇    下一篇

基于改进YOLOv11n的多尺度茶叶病害检测方法

肖瑞宏1(), 谭立新1,2(), 王日凤3(), 宋敏1,4, 胡程喜5   

  1. 1. 湖南农业大学 信息与智能科学技术学院,湖南 长沙 410125,中国
    2. 湖南信息职业技术学院 电子工程学院,湖南 长沙 410200,中国
    3. 广西科技师范学院 人工智能学院,广西 来宾 546199,中国
    4. 长沙幼儿师范高等专科学校,湖南 长沙 410000,中国
    5. 湖南软件职业技术大学,湖南 湘潭 411100,中国
  • 收稿日期:2025-09-05 出版日期:2026-01-30
  • 作者简介:

    肖瑞宏,硕士研究生,研究方向为计算机视觉及应用。E-mail:

  • 通信作者:
    谭立新,硕士,教授,研究方向为机器人与智能系统、模式识别与智能信息处理。E-mail:
    王日凤,博士,教授,研究方向为图像处理、计算机视觉。E-mail:

Multi-Scale Tea Leaf Disease Detection Method Based on Improved YOLOv11n

XIAO Ruihong1(), TAN Lixin1,2(), WANG Rifeng3(), SONG Min1,4, HU Chengxi5   

  1. 1. College of Information and Intelligence, Hunan Agricultural University, Changsha 410125, China
    2. School of Electrical and Electronic Engineering, Hunan College of Information, Changsha 410200, China
    3. School of Artificial Intelligence, Guangxi Science & Technology Normal University, Laibin 546199, China
    4. Changsha Preschool Education College, Changsha 410000, China
    5. Hunan Software Vocational and Technical University, Xiangtan 411100, China
  • Received:2025-09-05 Online:2026-01-30
  • Foundation items:Guangxi?Science and Technology Program(AD23026282)
  • About author:

    XIAO Ruihong, E-mail:

  • Corresponding author:
    TAN Lixin, E-mail:
    WANG Rifeng, E-mail:

摘要:

[目的/意义] 传统模型使用传统标准化数据集训练后,对实际识别中较远或较近的多尺度茶叶病害目标存在错检或漏检,以及性能不足的情况。针对茶田病害巡检时存在的茶叶病害检测环境中病害形态多样、识别距离不固定且容易受到背景影响而被误判漏判等问题,本研究提出一种集成多尺度特征分解、可切换空洞卷积与自适应空间融合的改进模型YOLO-SADMFA(YOLO Switchable Atrous Dynamic Multi-Scale Frequency-Aware Adaptive)。 [方法] 增加模型卷积、特征提取、上采样与检测头轮次加强多尺度能力,提出一种多尺度特征分析解算与动态频率调整融合的动态多尺度频率感知上采样模块进行上采样。首先,该模块可以在有效融合多尺度特征的情况下控制上下采样的信息丢失;其次,引入可切换空洞卷积模块代替原有跨阶段部分核心模块,通过结合不同的空洞率结果进一步加强捕捉目标多尺度信息,同时采用权重锁定机制提升了模型性能;最后,在head结构中引入自适应空间特征融合(Adaptively Spatial Feature Fusion, ASFF),其技术特性形成ASFF检测头,自适应地学习空间融合权重,有效地过滤相冲突的信息。同时建立了1个含有2 880张图像9种茶叶病害类别的茶叶病害数据集。 [结果和讨论] 该方法在茶叶病害检测的任务中精确度、召回率和平均精度值分别为89.7%、82.6%和86.3%。YOLO-SADMFA较原版YOLOv11n模型精确度、召回率、平均精度值分别提升4.4、8.4、3.7个百分点,尤其在处理病斑面积占比10%~65%的多尺度目标时表现突出。在低光照、复杂背景等田间实际场景下,模型仍保持较高的检测稳定性,能够有效区分形态相似的病害类型,并在边缘计算设备上实现约161帧/s的实时检测速度。 [结论] 本研究所提出的YOLO-SADMFA有效解决了茶园复杂环境下多尺度病害检测难题,显著提升了检测准确性和鲁棒性,为自动化茶叶病害巡检系统提供了可靠的技术支持,对促进茶产业智能化发展具有重要应用价值。

关键词: YOLO, 茶叶病害, 目标检测, DMF-Upsample, ASFF

Abstract:

[Objective] Preventing and containing leaf diseases is a critical component of tea production, and accurate identification and localization of symptoms are essential for modern, automated plantation management. Field inspection in tea gardens poses distinctive challenges for vision-based detection algorithms: targets appeared at widely varying scales and morphologies under complex backgrounds and unfixed acquisition distances, which easily misled detectors. Models trained on standardized datasets with uniform distance and background often underperform, leading to false alarms and missed detections. To support method development under realistic constraints, YOLO-SADMFA (You Only Look Once-Switchable Atrous Dynamic Multi-scale Frequency-aware Adaptive), a detector based on the YOLOv11n backbone was proposed. The architecture aims to preserve fine details during repeated re-sampling (down- and up-sampling), strengthen modeling of lesions at varying scales, and refine multi-scale feature fusion. [Methods] The proposed architecture incorporated additional convolutional, feature extraction, upsampling, and detection head stages to better handle multi-scale representations, and a DMF-Upsample (Dynamic Multi-scale Frequency-aware Upsample) module that performed upsampling through multi-scale feature analysis and dynamic frequency adjustment fusion was introduced. This module enabled efficient multi-scale feature integration while effectively mitigating information loss during up- and down-sampling. Concretely, the DMF-Upsample analyzed multi-frequency responses from adjacent pyramid levels and fused them with dynamically learned frequency-selective weights, which preserved high-frequency lesion boundaries and textures while retaining low-frequency contextual structure such as leaf contours and global shading. A lightweight gating mechanism estimates per-location and per-channel coefficients to regulate the contribution of different bands, and a residual bypass preserved identity information to further reduce aliasing and oversmoothing introduced by repeated resampling. Furthermore, the baseline C3k2 block was replaced with a switchable atrous convolution (SAConv) module, which enhanced multi-scale feature capture by combining outputs from different dilation rates and incorporates a weight locking mechanism to improve model stability and performance. In practice, the SAConv aggregated parallel atrous branched at multiple dilation factors through learned coefficients under weight locking, which expanded the effective receptive field without sacrificing spatial resolution and suppressed gridding artifacts, while incurring modest parameter overhead. Lastly, an adaptive spatial feature fusion (ASFF) mechanism was integrated into the detection head, forming an ASFF-Head that learned spatially varying fusion weights across different feature scales, effectively filters conflicting information, and strengthens the model's robustness and overall detection accuracy. Together, these components formed a deeper yet efficient multi-scale pathway suited to complex field scenes. [Results and Discussions] Compared with the original YOLOv11n model, YOLO-SADMFA improved precision, recall, and mAP by 4.4, 8.4, and 3.7 percentage points, respectively, indicating more reliable identification and localization across diverse field scenes. The detector was particularly effective for multi-scale targets where the lesion area occupied approximately 10%-65% of the image, reflecting the variability introduced by unfixed acquisition distance during tea garden patrols. Under low illumination and in complex backgrounds with occlusions and clutter, it maintained stable performance, reduced both missed detections and false alarms, and effectively distinguished disease categories with similar morphology and color. On edge computing devices, it sustained about 161 f/s, which met real-time requirements for mobile inspection robots and portable systems. These outcomes demonstrated strengthened robustness to background interference and improved sensitivity at extreme scales, which was consistent with practical demands where the acquisition distance was not fixed. From an ablation perspective, DMF-Upsample preserved high-frequency lesion boundaries while retaining low-frequency structural context after resampling, SAConv expanded receptive fields through multi-dilation aggregation under a weight-locking mechanism, and the ASFF-Head mitigated conflicts among feature pyramids. Their combination yielded cumulative gains in stability and accuracy. Qualitative analyses further supported the quantitative results: Boundary localization improved for small, speckled lesions, large blotches were captured with fewer spurious edges, and distractors such as veins, shadows, and soil textures were less frequently misclassified, confirming the benefits of dynamic multi-scale frequency-aware fusion and adaptive spatial weighting in real field conditions. [Conclusions] The proposed YOLO-SADMFA effectively addressed the multi-scale disease detection challenge in complex tea garden environments, where acquisition distance was not fixed, lesion morphology and color were diverse, and cluttered backgrounds easily caused misjudgments and omissions. It significantly improved detection accuracy and robustness relative to the original YOLOv11n model across a wide range of target scales, and it maintained stable performance under low illumination and complex backgrounds typical of field inspections. It provided reliable technical support for automated tea leaf disease inspection systems by enabling accurate localization and identification of lesions in real operating conditions and by sustaining real-time inference on edge devices suitable for patrol-style deployment.

Key words: YOLO, tea leaf diseases, object detection, DMF-Upsample, ASFF

中图分类号: