欢迎您访问《智慧农业(中英文)》官方网站! English

Smart Agriculture ›› 2024, Vol. 6 ›› Issue (6): 155-167.doi: 10.12133/j.smartag.SA202408014

• 专题--农业知识智能服务和智慧无人农场(上) • 上一篇    下一篇

改进YOLOv11的水面膨化饲料颗粒图像实时检测算法

周秀珊1(), 文露婷2, 介百飞3, 郑海锋1, 吴其琦1, 李克讷1, 梁军能2, 黎一键2, 文家燕1(), 江林源2()   

  1. 1. 广西科技大学 自动化学院,广西 柳州 545006,中国
    2. 广西壮族自治区水产科学研究院,广西 南宁 530021,中国
    3. 广西壮族自治区水产技术推广站,广西 南宁 530022,中国
  • 收稿日期:2024-08-23 出版日期:2024-11-30
  • 基金项目:
    广西重点研发计划项目(桂科AB21220019); 国家现代农业产业技术体系广西虾类贝类产业创新团队首席专家项目(nycytxgxcxtd-2023-14-01); 水产产业科技先锋队项目桂农科盟(202410)
  • 作者简介:
    周秀珊,研究方向为机器视觉与模式识别。E-mail:
  • 通信作者:
    文家燕,博士,教授,研究方向为复杂系统动力学与控制、智慧渔业等。E-mail:
    江林源,硕士,研究员,研究方向为水产养殖。E-mail:

Real-time Detection Algorithm of Expanded Feed Image on the Water Surface Based on Improved YOLOv11

ZHOU Xiushan1(), WEN Luting2, JIE Baifei3, ZHENG Haifeng1, WU Qiqi1, LI Kene1, LIANG Junneng2, LI Yijian2, WEN Jiayan1(), JIANG Linyuan2()   

  1. 1. Automation College, Guangxi University of Science and Technology, Liuzhou 545006, China
    2. Guangxi Academy of Fishery Sciences, Nanning 530021, China
    3. Aquatic Technology Promotion Station of Guangxi Zhuang Autonomous Region, Nanning 530022, China
  • Received:2024-08-23 Online:2024-11-30
  • Foundation items:Guangxi Key Research and Development Program Project(桂科AB21220019); Chief Expert of Guangxi Shrimp and Mollusk Industry Innovation Team under the National Modern Agricultural Industry Technology System(nycytxgxcxtd-2023-14-01); Aquaculture Industry Science and Technology Pioneer Team Guangxi Agricultural Science Alliance(202410)
  • About author:
    ZHOU Xiushan, E-mail:
  • Corresponding author:
    WEN Jiayan, E-mail:
    JIANG Lingyuan, E-mail:

摘要:

[目的/意义] 针对水面膨化饲料的图像在水产养殖水体中存在水体浑浊导致饲料与背景对比不明显、光照不均匀、鱼群抢食引起的水花导致饲料重叠粘连以及增氧设备产生的气泡遮挡饲料成像等问题,提出一种高效的水面膨化饲料图像检测YOLOv11-AP2S模型,为水产集约化养殖模式下的智能投喂决策提供准确依据。 [方法] 在YOLOv11的骨干网络的第10层C2PSA后增加细粒度分类的注意力机制(Attention for Fine-Grained Categorization, AFGC),将C3k2模块替换为VoV-GSCSP模块,以及在YOLOv11的基础上增加P2层。为了保持模型的实时性,在P2层使用轻量级的VoV-GSCSP模块进行特征融合。在不降低检测速度和不损失模型轻量化程度的情况下提高检测精度,提出YOLOv11-AP2S水面膨化饲料实时检测模型。 [结果与讨论]实验结果显示,YOLOv11-AP2S模型在识别精确度、召回率上均达到了78.70%,IoU阈值为0.5时的平均精度值(mAP50)高达80.00%, F1分数也达到了79.00%。与原YOLOv11网络相比,这些指标分别提高了提高6.7个百分点、9.0个百分点、9.4个百分点和8.0个百分点。与其他YOLO模型相比,YOLOv11-AP2S模型在自制数据集上的检测结果也具有明显优势,且在同等迭代次数下具有更高的平均精度均值和更低的损失。 [结论] YOLOv11-AP2S模型能够通过摄像头对水面膨化饲料颗粒的剩余情况进行实时检测,进而实现对鱼群摄食行为的准确观测与分析,为智慧渔业精准投喂的研究和应用提供有力支持。

关键词: 鱼群摄食行为, 膨化饲料, 水产养殖, YOLOv11, 实时检测

Abstract:

[Objective] During the feeding process of fish populations in aquaculture, the video image characteristics of floating extruded feed on the water surface undergo continuous variations due to a myriad of environmental factors and fish behaviors. These variations pose significant challenges to the accurate detection of feed particles, which is crucial for effective feeding management. To address these challenges and enhance the detection of floating extruded feed particles on the water surface, ,thereby providing precise decision support for intelligent feeding in intensive aquaculture modes, the YOLOv11-AP2S model, an advanced detection model was proposed. [Methods] The YOLOv11-AP2S model enhanced the YOLOv11 algorithm by incorporating a series of improvements to its backbone network, neck, and head components. Specifically, an attention for fine-grained categorization (AFGC) mechanism was introduced after the 10th layer C2PSA of the backbone network. This mechanism aimed to boost the model's capability to capture fine-grained features, which were essential for accurately identifying feed particles in complex environments with low contrast and overlapping objects. Furthermore, the C3k2 module was replaced with the VoV-GSCSP module, which incorporated more sophisticated feature extraction and fusion mechanisms. This replacement further enhanced the network's ability to extract relevant features and improve detection accuracy. To improve the model's detection of small targets, a P2 layer was introduced. However, adding a P2 layer may increase computational complexity and resource consumption, so the overall performance and resource consumption of the model must be carefully balanced. To maintain the model's real-time performance while improving detection accuracy, a lightweight VoV-GSCSP module was utilized for feature fusion at the P2 layer. This approach enabled the YOLOv11-AP2S model to achieve high detection accuracy without sacrificing detection speed or model lightweights, making it suitable for real-time applications in aquaculture. [Results and Discussions] The ablation experimental results demonstrated the superiority of the YOLOv11-AP2S model over the original YOLOv11 network. Specifically, the YOLOv11-AP2S model achieved a precision ( P) and recall ( R) of 78.70%. The mean average precision (mAP50) at an intersection over union (IoU) threshold of 0.5 was as high as 80.00%, and the F1-Score had also reached 79.00%. These metrics represented significant improvements of 6.7%, 9.0%, 9.4% (for precision, as previously mentioned), and 8.0%, respectively, over the original YOLOv11 network. These improvements showed the effectiveness of the YOLOv11-AP2S model in detecting floating extruded feed particles in complex environments. When compared to other YOLO models, the YOLOv11-AP2S model exhibits clear advantages in detecting floating extruded feed images on a self-made dataset. Notably, under the same number of iterations, the YOLOv11-AP2S model achieved higher mAP50 values and lower losses, demonstrating its superiority in detection performance. This indicated that the YOLOv11-AP2S model strikes a good balance between learning speed and network performance, enabling it to efficiently and accurately detect images of floating extruded feed on the water surface. Furthermore, the YOLOv11-AP2S model's ability to handle complex detection scenarios, such as overlapping and adhesion of feed particles and occlusion by bubbles, was noteworthy. These capabilities were crucial for accurate detection in practical aquaculture environments, where such challenges were common and can significantly impair the performance of traditional detection systems. The improvements in detection accuracy and efficiency made the YOLOv11-AP2S model a valuable tool for intelligent feeding systems in aquaculture, as it could provide more reliable and timely information on fish feeding behavior. Additionally, the introduction of the P2 layer and the use of the lightweight VoV-GSCSP module for feature fusion at this layer contributed to the model's overall performance. These enhancements enabled the model to maintain high detection accuracy while keeping computational costs and resource consumption within manageable limits. This was particularly important for real-time applications in aquaculture, where both accuracy and efficiency were critical for effective feeding management. [Conclusions] The successful application of the YOLOv11-AP2S model in detecting floating extruded feed particles demonstrates its potential to intelligent feeding systems in aquaculture. By providing accurate and timely information on fish feeding behavior, the model can help optimize feeding strategies, reduce feed waste, and improve the overall efficiency and profitability of aquaculture operations. Furthermore, the model's ability to handle complex detection scenarios and maintain high detection accuracy while keeping computational costs within manageable limits makes it a practical and valuable tool for real-time applications in aquaculture. Therefore, the YOLOv11-AP2S model holds promise for wide application in intelligent aquaculture management, contributing to the sustainability and growth of the aquaculture industry.

Key words: fish feeding behavior, expanded feed, aquaculture, YOLOv11, real-time detection

中图分类号: