欢迎您访问《智慧农业(中英文)》官方网站! English

Smart Agriculture ›› 2024, Vol. 6 ›› Issue (5): 119-127.doi: 10.12133/j.smartag.SA202403016

• 技术方法 • 上一篇    下一篇

基于改进DeepLabV3+的轻量化茶叶嫩芽采摘点识别模型

胡程喜1, 谭立新1,2(), 王文胤1, 宋敏1   

  1. 1. 湖南农业大学 信息与智能科学技术学院,湖南 长沙 410125,中国
    2. 湖南信息职业技术学院 电子工程学院,湖南 长沙 410200,中国
  • 收稿日期:2024-03-13 出版日期:2024-09-30
  • 基金项目:
    中国高校产学研创新基金——新一代信息技术创新项目资助课题(2022IT82); 湖南省教育科学规划课题(XJK24BZY037)
  • 作者简介:
    胡程喜,研究方向为计算机视觉及应用。E-mail:
  • 通信作者:
    谭立新,硕士,教授,研究方向为机器人与智能系统、模式识别与智能信息处理。E-mail:

Lightweight Tea Shoot Picking Point Recognition Model Based on Improved DeepLabV3+

HU Chengxi1, TAN Lixin1,2(), WANG Wenyin1, SONG Min1   

  1. 1. College of Information and Intelligence, Hunan Agricultural University, Changsha 410125, China
    2. School of Electrical and Electronic Engineering, Hunan College of Information, Changsha 410200, China
  • Received:2024-03-13 Online:2024-09-30
  • Foundation items:Innovation Fund for University-Industry Cooperation in China - Supported Project for New Generation Information Technology Innovation(2022IT82); Hunan Provincial Educational Science Planning Project(XJK24BZY037)
  • About author:
    HU Chengxi, E-mail:
  • Corresponding author:
    TAN Lixin, E-mail:

摘要:

【目的/意义】 名优茶的采摘是茶产业中至关重要的环节,识别和定位名优茶嫩芽采摘点是现代化采茶过程中的重要组成部分。传统神经网络方法存在着模型体量大、训练时间长,以及应对场景复杂等问题。本研究以湖南省溪清茶园为实际场景,提出一种新型深度学习算法解决名优茶采摘点的精确分割难题。 【方法】 对传统的DeepLabV3+算法进行轻量化改进。首先,针对其模型体量大、训练时间长的问题,使用MobilenetV2网络提取图像的初始特征,并按照网络结构划分深浅层特征;其次,将高效通道注意力网络(Efficient Channel Attention Network, ECANet)与空洞空间卷积池化金字塔(Atrous Spatial Pyramid Pooling, ASPP)模块结合,得到ECA_ASPP模块,并将深层特征输入到ECA_ASPP模块中进行多尺度特征融合以减少无效信息,将经过处理后的深浅层特征相加,随后通过卷积和上采样的方式对特征信息进行还原,得到分割结果;最后,通过对识别结果进行处理以获得茶叶嫩芽采摘点。 【结果和讨论】 改进后的DeepLabV3+在茶叶嫩芽数据集上的平均交并比达到93.71%,平均像素准确率达到97.25%,模型参数量由原来以Xception为底层网络的54.714 M下降至5.818 M。 【结论】 本研究在茶叶嫩芽结构分割上相对于原版DeepLabV3+的检测速度更快、参数量更小,同时保证了较高的准确率,为智能采茶机器人的采摘提供了新的定位方法。

关键词: 轻量化模型, DeepLabV3+, 注意力机制, 茶叶嫩芽, ECANet, 名优茶, 空洞空间卷积池化金字塔

Abstract:

[Objective] The picking of famous and high-quality tea is a crucial link in the tea industry. Identifying and locating the tender buds of famous and high-quality tea for picking is an important component of the modern tea picking robot. Traditional neural network methods suffer from issues such as large model size, long training times, and difficulties in dealing with complex scenes. In this study, based on the actual scenario of the Xiqing Tea Garden in Hunan Province, proposes a novel deep learning algorithm was proposed to solve the precise segmentation challenge of famous and high-quality tea picking points. [Methods] The primary technical innovation resided in the amalgamation of a lightweight network architecture, MobilenetV2, with an attention mechanism known as efficient channel attention network (ECANet), alongside optimization modules including atrous spatial pyramid pooling (ASPP). Initially, MobilenetV2 was employed as the feature extractor, substituting traditional convolution operations with depth wise separable convolutions. This led to a notable reduction in the model's parameter count and expedited the model training process. Subsequently, the innovative fusion of ECANet and ASPP modules constituted the ECA_ASPP module, with the intention of bolstering the model's capacity for fusing multi-scale features, especially pertinent to the intricate recognition of tea shoots. This fusion strategy facilitated the model's capability to capture more nuanced features of delicate shoots, thereby augmenting segmentation accuracy. The specific implementation steps entailed the feeding of image inputs through the improved network, whereupon MobilenetV2 was utilized to extract both shallow and deep features. Deep features were then fused via the ECA_ASPP module for the purpose of multi-scale feature integration, reinforcing the model's resilience to intricate backgrounds and variations in tea shoot morphology. Conversely, shallow features proceeded directly to the decoding stage, undergoing channel reduction processing before being integrated with upsampled deep features. This divide-and-conquer strategy effectively harnessed the benefits of features at differing levels of abstraction and, furthermore, heightened the model's recognition performance through meticulous feature fusion. Ultimately, through a sequence of convolutional operations and upsampling procedures, a prediction map congruent in resolution with the original image was generated, enabling the precise demarcation of tea shoot harvesting points. [Results and Discussions] The experimental outcomes indicated that the enhanced DeepLabV3+ model had achieved an average Intersection over Union (IoU) of 93.71% and an average pixel accuracy of 97.25% on the dataset of tea shoots. Compared to the original model based on Xception, there was a substantial decrease in the parameter count from 54.714 million to a mere 5.818 million, effectively accomplishing a significant lightweight redesign of the model. Further comparisons with other prevalent semantic segmentation networks revealed that the improved model exhibited remarkable advantages concerning pivotal metrics such as the number of parameters, training duration, and average IoU, highlighting its efficacy and precision in the domain of tea shoot recognition. This considerable decreased in parameter numbers not only facilitated a more resource-economical deployment but also led to abbreviated training periods, rendering the model highly suitable for real-time implementations amidst tea garden ecosystems. The elevated mean IoU and pixel accuracy attested to the model's capacity for precise demarcation and identification of tea shoots, even amidst intricate and varied datasets, demonstrating resilience and adaptability in pragmatic contexts. [Conclusions] This study effectively implements an efficient and accurate tea shoot recognition method through targeted model improvements and optimizations, furnishing crucial technical support for the practical application of intelligent tea picking robots. The introduction of lightweight DeepLabV3+ not only substantially enhances recognition speed and segmentation accuracy, but also mitigates hardware requirements, thereby promoting the practical application of intelligent picking technology in the tea industry.

Key words: lightweight model, DeepLabV3+, attention mechanism, tender tea buds, ECANet, famous quality tea, ASPP

中图分类号: