欢迎您访问《智慧农业(中英文)》官方网站! English

Smart Agriculture ›› 2025, Vol. 7 ›› Issue (5): 169-181.doi: 10.12133/j.smartag.SA202507024

• 专刊--光智农业创新技术与应用 • 上一篇    

联合空间深度转换卷积与多尺度注意力机制的灯诱稻飞虱害虫检测方法

李汶政1,2,3,4, 杨信廷2,3,4, 孙传恒2,3,4, 崔腾鹏2,3,4,5, 王慧2,3,4,6, 李珊珊2,3,4,7, 李文勇2,3,4()   

  1. 1. 上海海洋大学 信息学院,上海 201306,中国
    2. 北京市农林科学院信息技术研究中心,北京 100097,中国
    3. 国家农业信息化工程技术研究中心,北京 100097,中国
    4. 农产品质量安全追溯技术及应用国家工程研究中心,北京 100097,中国
    5. 仲恺农业工程学院 人工智能学院,广东 广州 510225,中国
    6. 山东农业大学 信息科学与工程学院,山东 泰安 271018,中国
    7. 吉林农业大学 智慧农业研究院,吉林 长春 130118,中国
  • 收稿日期:2025-07-16 出版日期:2025-09-30
  • 基金项目:
    国家重点研发计划课题(2022YFD2001801); 北京市自然科学基金资助项目(4252037)
  • 作者简介:

    李汶政,硕士研究生,研究方向为田间小体积害虫智能检测识别技术。E-mail:

  • 通信作者:
    李文勇,博士,副研究员,研究方向为病虫害智能检测识别技术。E-mail:

Light-Trapping Rice Planthopper Detection Method by Combining Spatial Depth Transform Convolution and Multi-scale Attention Mechanism

LI Wenzheng1,2,3,4, YANG Xinting2,3,4, SUN Chuanheng2,3,4, CUI Tengpeng2,3,4,5, WANG Hui2,3,4,6, LI Shanshan2,3,4,7, LI Wenyong2,3,4()   

  1. 1. College of Information Technology, Shanghai Ocean University, Shanghai 201306, China
    2. Research Center of Information Technology, Beijing Academy of Agriculture and Forestry Sciences, Beijing 100097, China
    3. National Engineering Research Center for Information Technology in Agriculture, Beijing 100097, China
    4. National Engineering Laboratory for Agri-product Quality Traceability, Beijing 100097, China
    5. College of Artificial Intelligence, Zhongkai University of Agriculture and Engineering, Guangzhou 510225, China
    6. College of Information Science and Engineering, Shandong Agricultural University, Tai'an, China 271018
    7. Institute of Smart Agriculture, Jilin Agricultural University, Jilin 130118, China
  • Received:2025-07-16 Online:2025-09-30
  • Foundation items:National Key Research and Development Program Project(2022YFD2001801); Beijing Natural Science Foundation(4252037)
  • About author:

    LI Wenzheng, E-mail:

  • Corresponding author:
    LI Wenyong, E-mail:

摘要:

[目的/意义] 为解决智能化灯诱设备检测密集、遮挡的低分辨率小体积飞虱类害虫时易出现精度低、误检、漏检的问题,基于YOLOv11x提出了一种联合空间深度转换卷积与多尺度注意力机制的水稻飞虱类小体积害虫图像检测识别方法。 [方法] 首先,通过使用EMA(Efficient Multi-Scale Attention)机制改进YOLOv11x原网络中的C3k2模块,加强模型在密集、遮挡情况下对小体积害虫特征的感知与融合能力。其次,使用SPD-Conv(Space-to-Depth-Convolution)卷积替换原始模型中Conv普通卷积模块,进一步提升模型对低分辨率小体积害虫特征的提取精度并降低了模型参数量。另外,在原始的网络基础上添加P2检测层并去除P5检测层,从而有针对性地增强模型对小目标的检测性能。最后,通过引入动态非单调聚焦机制损失函数(Wise-Intersection over Union Version 3, WIoUv3),提升模型的定位能力,进而降低误检率和漏检率。 [结果和讨论] 改进后的模型在自建飞虱类害虫数据集dataset_Planthopper上的准确率P、召回率R、平均检测精度mAP50和mAP50-95分别达到了77.5%、73.5%、80.8%和44.9%,与基准模型YOLOv11x模型相比,分别提高了4.8、3.5、5.5和4.7个百分点,参数量从56 M减小到40 M,减少了29%。与现在主流的目标检测模型YOLOv5x、YOLOv8x、YOLOv10x、YOLOv11x、YOLOv12x、Salience DETR-R50、Relation DETR-R50、RT-DETR-x相比,改进后的模型综合性能最佳。 [结论] 改进后的YOLOv11x模型,有效提升了在密集、遮挡虫情下检测低分辨率、小体积飞虱类害虫的性能,降低了漏检和误检的概率。

关键词: 稻飞虱, 小体积目标, 密集遮挡, 检测识别, 深度学习, YOLOv11x

Abstract:

[Objective] Planthoppers suck the sap from the phloem of rice plants, causing malnutrition and slow growth of the plants, resulting in large-scale yield reduction. Therefore, timely and effective monitoring of planthopper pests and analysis of their occurrence degree are of vital importance for the prevention of rice diseases. The traditional detection of planthopper pests mainly relies on manual methods for diagnosis and identification. However, due to the tiny size of planthopper pests, on-site manual investigation is not only time-consuming and labor-intensive but also greatly influenced by human subjectivity, making it easy to misjudge. In response to the above issues, the intelligent light traps can be used to assist in the work. When using intelligent light traps to detect dense and occluded low-resolution and small-sized planthopper pests, problems such as low accuracy, false detection, and missed detection are prone to occur. For this purpose, based on YOLOv11x, a light-trapping rice planthopper detection method by combining spatial depth transform convolution and multi-scale attention mechanism was proposed in this research. [Methods] The image data in this research were collected by multiple light-induced pest monitoring devices installed in the experimental rice fields. The images included two types of planthopper pests, the brown planthopper and the white-backed planthopper. The image sizes were both 5 472 pixels ×3 648 pixels, totaling 998 images. The original dataset was divided into a training set and a validation set in a 4:1 ratio. To enhance the learning efficiency of the model during training, two data augmentation operations, horizontal flipping and vertical flipping, were performed on the images in the training set. A total of 2 388 images in the training set were obtained for model training, and 200 images in the validation set were used for model inference validation. To improve the model performance, first of all, the C3k2 module in the original YOLOv11x network was improved by using the efficient multi-scale attention (EMA) mechanism to enhance the perception of the model and the fusion ability of small-volume pest features in dense and occlusions. Secondly, the space-to-depth-convolution (SPD-Conv) was used to replace the Conv common convolution module in the original model, further improving the extraction accuracy of the model for low-resolution and small-volume pest features and reducing the number of parameters. In addition, a P2 detection layer was added to the original network and the P5 detection layer was removed, thereby enhancing the model's detection performance for small targets in a targeted manner. Finally, by introducing the dynamic non-monotonic focusing mechanism loss function wise-intersection over union (WIoU)v3, the positioning ability of the model was enhanced, thereby reducing the false detection rate and missed detection rate. [Results and Discussions] The test results showed that the precision (P), recall (R), mean average precision at IoU equals 0.50 (mAP50) and the mean average precision at IoU thresholded from 0.50 to 0.95 with a step size of 0.05 (mAP50-95) of the improved model on the self-built rice planthopper dataset (dataset_Planthopper) reached 77.5%, 73.5%, 80.8%, and 44.9% respectively. Compared with the baseline model YOLOv11x, it has increased by 4.8, 3.5, 5.5 and 4.7 percent points, respectively. The number of parameters has been reduced from 56 M to 40 M, a reduction of 29%. Compared with the current mainstream object detection models YOLOv5x, YOLOv8x, YOLOv10x, YOLOv11x, YOLOv12x, Salience DETR-R50, Relation DETR-R50, RT-DETR-x, the mAP50 of the improved model was 6.8, 7.8, 8.6, 5.5, 5.6, 8.7, 6.9 and 6.9 percentage points higher, respectively, and it had the best comprehensive performance. [Conclusions] The improved YOLOv11x model effectively enhances the performance of detecting low-resolution and small-sized planthopper pests under dense and occluded insect conditions, and reduces the probability of missed detection and false detection. In practical applications, it could assist in achieving precise monitoring of farmland pests and scientific prevention and control decisions, thereby reducing the use of chemical pesticides and promoting the intelligent development of agriculture. Although this method has achieved significant improvements in multiple indicators, it still had certain limitations. Firstly, the species of planthoppers were numerous and their forms were diverse. The current models mainly targeted some typical species, and their generalization ability needed to be further verified. Secondly, due to the limitations of the data collection environment, there was still room for improvement in the performance of the model under extreme lighting changes and extremely occluded scenarios. Finally, although the number of parameters had decreased, the real-time detection speed still needed to be optimized to meet the requirements of some low-power edge devices. Future research can focus on expanding the generalization, robustness and lightweighting of more types of rice planthopper models in more complex situations.

Key words: rice planthopper, small-sized target, dense occlusion, detection and recognition, deep learning, YOLOv11x

中图分类号: