欢迎您访问《智慧农业(中英文)》官方网站! English

Smart Agriculture ›› 2023, Vol. 5 ›› Issue (2): 1-12.doi: 10.12133/j.smartag.SA202305004

• 专题--机器视觉与农业智能感知 •    下一篇

用于边缘计算设备的果树挂果量轻量化估测模型

夏雪1(), 柴秀娟1, 张凝1(), 周硕1, 孙琦鑫1, 孙坦2()   

  1. 1.中国农业科学院农业信息研究所/农业农村部农业大数据重点实验室,北京 100081
    2.中国农业科学院,北京 100081
  • 收稿日期:2023-05-11 出版日期:2023-06-30
  • 基金资助:
    国家重点研发计划项目(2022YFD2002205);中央级公益性科研院所基本科研业务费专项(Y2022QC17);中国农业科学院科技创新工程(CAAS-ASTIP-2021-AII-08)
  • 作者简介:夏 雪,助理研究员,研究方向为果树表型研究与应用。E-mail:xiaxue@caas.cn
  • 通信作者: 1. 张 凝,博士,副研究员,研究方向为作物表型、定量遥感。E-mail:zhangning@caas.cn;2
    孙 坦,博士,研究馆员,研究方向为数字信息描述与组织、智慧农业。E-mail:suntan@caas.cn

A Lightweight Fruit Load Estimation Model for Edge Computing Equipment

XIA Xue1(), CHAI Xiujuan1, ZHANG Ning1(), ZHOU Shuo1, SUN Qixin1, SUN Tan2()   

  1. 1.Agricultural Information Institute, Chinese Academy of Agricultural Sciences/Key Laboratory of Agricultural Big Data, Ministry of Agriculture and Rural Affairs, Beijing 100081, China
    2.Chinese Academy of Agricultural Sciences, Beijing 100081, China
  • Received:2023-05-11 Online:2023-06-30

摘要:

[目的/意义] 挂果量是果树栽培管理的重要指标。传统人力抽样估测果树挂果量的方法不仅耗时费力,而且容易产生较大误差。本研究提出一种用于边缘计算设备的轻量化模型,实现视频中树上柑橘挂果量的自动估测。 [方法] 该模型采用CSPDarkNet53+PAFPN结构作为特征提取网络,实现更快的推理速度和更低的模型复杂度,在果实跟踪过程中引入Byte算法改进FairMOT的数据关联策略,对视频中的柑橘进行预测跟踪,以提升挂果量估测准确性。[结果和讨论]在边缘计算设备NVIDIA Jetson AGX上进行模型性能测试结果表明,本研究所建模型对柑橘挂果量的平均估测精度(Average Estimating Precision,AEP)和处理速度(Frames Per Second,FPS)分别达到91.61%和14.76,模型估测值与人工测得真实值的决定系数R2为0.9858,均方根误差(Root Mean Square Error,RMSE)为4.1713,模型参数量、计算量(Floating Point Operations,FLOPs)和模型大小分别为5.01 M、36.44 G和70.20 MB,展现出较对比模型更优的挂果量估测性能和更低的模型复杂度。 [结论] 试验结果证明了本研究所建模型在边缘计算设备上对柑橘挂果量估测的有效性,基于算法模型研发的果园挂果量远程监测系统可满足用于果园移动平台行进状态下的果树挂果量估测需求。本研究可为果园生产力自动监测分析提供技术支持。

关键词: 精准园艺, 挂果量估测, 边缘计算, 深度学习, 多目标跟踪, 轻量化模型

Abstract:

[Objective] The fruit load estimation of fruit tree is essential for horticulture management. Traditional estimation method by manual sampling is not only labor-intensive and time-consuming but also prone to errors. Most existing models can not apply to edge computing equipment with limited computing resources because of their high model complexity. This study aims to develop a lightweight model for edge computing equipment to estimate fruit load automatically in the orchard. [Methods] The experimental data were captured using the smartphone in the citrus orchard in Jiangnan district, Nanning city, Guangxi province. In the dataset, 30 videos were randomly selected for model training and other 10 for testing. The general idea of the proposed algorithm was divided into two parts: Detecting fruits and extracting ReID features of fruits in each image from the video, then tracking fruit and estimating the fruit load. Specifically, the CSPDarknet53 network was used as the backbone of the model to achieve feature extraction as it consumes less hardware computing resources, which was suitable for edge computing equipment. The path aggregation feature pyramid network PAFPN was introduced as the neck part for the feature fusion via the jump connection between the low-level and high-level features. The fused features from the PAFPN were fed into two parallel branches. One was the fruit detection branch and another was the identity embedding branch. The fruit detection branch consisted of three prediction heads, each of which performed 3×3 convolution and 1×1 convolution on the feature map output by the PAFPN to predict the fruit's keypoint heat map, local offset and bounding box size, respectively. The identity embedding branch distinguished between different fruit identity features. In the fruit tracking stage, the byte mechanism from the ByteTrack algorithm was introduced to improve the data association of the FairMOT method, enhancing the performance of fruit load estimation in the video. The Byte algorithm considered both high-score and low-score detection boxes to associate the fruit motion trajectory, then matches the identity features' similarity of fruits between frames. The number of fruit IDs whose tracking duration longer than five frames was counted as the amount of citrus fruit in the video. [Results and Discussions] All experiments were conducted on edge computing equipment. The fruit detection experiment was conducted under the same test dataset containing 211 citrus tree images. The experimental results showed that applying CSPDarkNet53+PAFPN structure in the proposed model achieved a precision of 83.6%, recall of 89.2% and F1 score of 86.3%, respectively, which were superior to the same indexes of FairMOT (ResNet34) model, FairMOT (HRNet18) model and Faster RCNN model. The CSPDarkNet53+PAFPN structure adopted in the proposed model could better detect the fruits in the images, laying a foundation for estimating the amount of citrus fruit on trees. The model complexity experimental results showed that the number of parameters, FLOPs (Floating Point Operations) and size of the proposed model were 5.01 M, 36.44 G and 70.2 MB, respectively. The number of parameters for the proposed model was 20.19% of FairMOT (ResNet34) model's and 41.51% of FairMOT (HRNet18) model's. The FLOPs for the proposed model was 78.31% less than FairMOT (ResNet34) model's and 87.63% less than FairMOT (HRNet18) model's. The model size for the proposed model was 23.96% of FairMOT (ResNet34) model's and 45.00% of FairMOT (HRNet18) model's. Compared with the Faster RCNN, the model built in this study showed advantages in the number of parameters, FLOPs and model size. The low complexity proved that the proposed model was more friendly to edge computing equipment. Compared with the lightweight backbone network EfficientNet-Lite, the CSPDarkNet53 applied in the proposed model's backbone performed better fruit detection and model complexity. For fruit load estimation, the improved tracking strategy that integrated the Byte algorithm into the FairMOT positively boosted the estimation accuracy of fruit load. The experimental results on the test videos showed that the AEP (Average Estimating Precision) and FPS (Frames Per Second) of the proposed model reached 91.61% and 14.76 f/s, which indicated that the proposed model could maintain high estimation accuracy while the FPS was 2.4 times and 4.7 times of the comparison models, respectively. The RMSE (Root Mean Square Error) of the proposed model was 4.1713, which was 47.61% less than FairMOT (ResNet34) model's and 22.94% less than FairMOT (HRNet18) model's. The R2 of the determination coefficient between the algorithm-measured value and the manual counted value was 0.9858, which was superior to other comparison models. The proposed model revealed better performance in estimating fruit load and lower model complexity than other comparatives. [Conclusions] The experimental results proved the validity of the proposed model for fruit load estimation on edge computing equipment. This research could provide technical references for the automatic monitoring and analysis of orchard productivity. Future research will continue to enrich the data resources, further improve the model's performance, and explore more efficient methods to serve more fruit tree varieties.

Key words: smart orchard, fruit load estimation, edge computing, deep learning, multiple object tracking, lightweight model

中图分类号: