欢迎您访问《智慧农业(中英文)》官方网站! English

Smart Agriculture

• •    

YOLO系列模型在水果缺陷检测的研究进展及发展趋势

修贤超1, 费士祺1,2,3, 张伟1(), 黄文倩2,3, 苗中华1   

  1. 1. 上海大学机电工程与自动化学院,上海 200444,中国
    2. 北京农林科学院,北京 100097,中国
    3. 北京市农林科学院信息技术研究中心,北京 100097,中国
  • 收稿日期:2026-03-13 出版日期:2026-05-22
  • 基金项目:
    国家自然科学基金(32401712); 上海市自然科学基金(24ZR1424800); 博士后科学基金(2024M751937)
  • 作者简介:

    修贤超,副教授,硕士生导师,研究方向为大模型与具身智能。E-mail:

  • 通信作者:
    张 伟,博士,助理研究员,研究方向为农业机器人具身智能。E-mail:

​​Research Progress and Development Trends of YOLO Family of Algorithms in Fruit Defect Detection

XIU Xianchao1, FEI Shiqi1,2,3, ZHANG Wei1(), HUANG Wenqian2,3, MIAO Zhonghua1   

  1. 1. School of Mechatronic Engineering and Automation, Shanghai University, Shanghai 200444, China
    2. Intelligent Equipment Research Center, Beijing Academy of Agriculture and Forestry Sciences, Beijing 100097, China
    3. Information Technology Research Center, Beijing Academy of Agriculture and Forestry Sciences, Beijing 100097, China
  • Received:2026-03-13 Online:2026-05-22
  • Foundation items:National Natural Science Foundation of China(32401712); Natural Science Foundation of Shanghai(24ZR1424800); China Postdoctoral Science Foundation(2024M751937)
  • About author:

    XIU Xianchao, E-mail:

  • Corresponding author:
    ZHANG Wei, E-mail:

摘要:

【目的/意义】 作为水果产后的关键环节,高效精准的质量检测对保障食品安全与提升产业效率至关重要。本文综述了YOLO模型在水果缺陷检测领域的研究进展,旨在为农业水果缺陷检测技术的进一步发展提供理论支撑与技术参考。[进展] 近年来,基于深度学习的目标检测算法,特别是YOLO(You Only Look Once)系列模型在实时性、高精度和低计算量的优势,在水果缺陷检测研究获得了出色的效果。本文聚焦水果缺陷检测,首先回顾了YOLOv1至YOLO26的发展历程,并分析每代YOLO的创新改进。在此基础上,综述了YOLO系列模型在多种水果缺陷检测任务中的应用现状,列举了这些算法在应对水果缺陷检测挑战方面的贡献,具体包括内部与隐性缺陷的识别、复杂表面形态下的缺陷检测,以及模型的轻量化与边缘部署。同时,本文也探讨了这些技术如何满足高速生产线的实时处理,以及在资源受限的硬件设备上实现高效部署的工业需求。 【结论/展望】 最后,总结了算法局限性,数据集特异性,硬件部署约束和复杂检测环境影响四方面的挑战,并从公共数据集的构建,模型可解释性,人机协作策略机制和大模型与YOLO的结合的四个方面展望了未来研究方向。

关键词: 深度学习, 目标检测, YOLO, 自动化, 水果缺陷检测

Abstract:

[Significance] Efficient and accurate quality detection is a key link in the post-harvest fruit industry. It is critical for ensuring food safety and improving industrial efficiency. With the growth of global population and the increasing demand for food safety, traditional manual inspection has become incompetent. Manual inspection relies heavily on the experience of operators. It is time-consuming, labor-intensive and subjective. The results are often unstable and inaccurate. It also may cause secondary damage to fruits. In addition, the rising cost of human resources makes the transformation of the fruit industry towards automation and intelligence an inevitable trend. Computer vision and machine learning technologies provide a solution. They can improve detection accuracy and efficiency, while reducing labor costs. Among various object detection algorithms, the YOLO (You Only Look Once) series stands out for its balance of real-time performance, high accuracy and low computational cost. It has achieved excellent results in fruit defect detection. However, there is still a lack of comprehensive reviews on the application of YOLO series algorithms in this field. The purpose of this paper is to fill the blank of YOLO series algorithms in the field of fruit defect detection and provide strong support for the further development of agricultural fruit defect detection technology. [Progress] Focusing on fruit defect detection, this paper reviews the development history of YOLO models from YOLOv1 to YOLO26. It also analyzes the innovative improvements of each generation. The development of YOLO models can be divided into three stages. The first stage (YOLOv1-YOLOv3) established the single-stage detection framework. YOLOv1 proposed the idea of single-shot detection, laying a preliminary foundation for real-time fruit defect detection. YOLOv2 introduced the anchor box mechanism and a more efficient backbone network, improving the positioning robustness for fruits and their defects of different sizes and shapes. YOLOv3 proposed a multi-scale feature pyramid, enhancing the detection ability for small defects such as early lesions and minor bruises. The second stage (YOLOv4-YOLOv8) focused on modular design, engineering construction and efficiency optimization. YOLOv4 clarified the modular design of backbone, neck and head, providing a clear framework for algorithm improvement. YOLOv5, YOLOv6 and YOLOv8 made continuous breakthroughs in lightweight design, anchor-free detection and decoupled head structure. YOLOv7 introduced the E-ELAN (Extended Efficient Layer Aggregation Network) architecture and auxiliary head, balancing detection accuracy and speed. The third stage (YOLOv9-YOLO26) focused on information extraction and transmission, post-processing simplification and high-level semantic modeling. YOLOv9 alleviated information attenuation in deep networks. YOLOv10 simplified post-processing to meet the high throughput needs of online sorting systems. YOLOv13 and YOLO26 further improved the model's performance in complex scenarios. This review also summarizes the application of YOLO series models in various fruit defect detection tasks, including apples, citrus, tomatoes and pears. It focuses on three key challenges: detection of internal and latent defects, defect detection under complex surface morphologies, and model lightweighting and edge deployment. For internal and latent defects, researchers combine YOLO models with multi-modal imaging technologies such as near-infrared and hyperspectral imaging. For complex surface morphologies, they improve the model's feature extraction ability by introducing attention mechanisms and multi-scale feature fusion. For lightweight and edge deployment, they adopt strategies such as lightweight architecture design, model compression and training optimization. [Conclusions and Prospects] The YOLO series models have become one of the most widely used technical frameworks in fruit defect detection. They have achieved remarkable phased results. However, they still face several challenges. These include algorithm limitations, such as the coupling optimization conflict between positioning and classification, and the limitation of local feature extraction. Dataset specificity leads to poor model generalization. Hardware constraints restrict the deployment of models on edge devices. Complex detection environments reduce the model's robustness. Future research directions include four aspects. First, build standardized and large-scale public fruit defect datasets, covering different fruit varieties and defect types. Second, improve the interpretability of YOLO models, building intuitive explanation interfaces for non-professional users. Third, develop human-machine collaboration strategies, combining the advantages of deep learning models and farmers' practical experience. Fourth, integrate YOLO with large models, optimizing data generation strategies and model lightweighting to achieve efficient deployment. This review aims to provide theoretical support and technical reference for the further development of agricultural fruit defect detection technology.

Key words: deep learning, object detection, YOLO, automation, fruit defect detection

中图分类号: