欢迎您访问《智慧农业(中英文)》官方网站! English

Smart Agriculture ›› 2022, Vol. 4 ›› Issue (2): 77-85.doi: 10.12133/j.smartag.SA202201001

• 专题——智慧畜牧关键技术与装备 • 上一篇    下一篇

基于迁移学习的多尺度特征融合牦牛脸部识别算法

陈占琦1(), 张玉安1(), 王文志1, 李丹1, 何杰1, 宋仁德2   

  1. 1.青海大学 计算机技术与应用系,青海 西宁 810016
    2.青海省玉树州动物疫病预防控制中心,青海 玉树 815000
  • 收稿日期:2022-01-02 出版日期:2022-06-30
  • 基金资助:
    青海省科技计划项目(2020-QY-218);国家现代农业产业技术体系资助(CARS-37)
  • 作者简介:陈占琦(1996-),男,硕士研究生,研究方向为智慧畜牧。E-mail:1046788801@qq.com
  • 通信作者: 张玉安(1981-),男,博士,教授,研究方向为进化计算、智慧畜牧。E-mail:2011990029@qhu.edu.cn

Multiscale Feature Fusion Yak Face Recognition Algorithm Based on Transfer Learning

CHEN Zhanqi1(), ZHANG Yu'an1(), WANG Wenzhi1, LI Dan1, HE Jie1, SONG Rende2   

  1. 1.Department of Computer Technology and Application, Qinghai University, Xining 810016, China
    2.Animal Disease Prevention and Control Center of Yushu Tibetan Autonomous Prefecture, Yushu 815000, China
  • Received:2022-01-02 Online:2022-06-30

摘要:

牦牛个体身份标识是实现个体建档、行为监测、精准饲喂、疫病防控及食品溯源的前提。针对智慧畜牧智能化、信息化等养殖平台中动物个体识别技术应用需求,本研究提出一种基于迁移学习的多尺度特征融合牦牛脸部识别算法(Transfer Learning-Multiscale Feature Fusion-VGG, T-M-VGG)。以预训练的视觉几何组网络(Visual Geometry Group Network,VGG)为骨干网络构建基于迁移学习的卷积神经网络模型,获取其Block3、Block4、Block5输出的特征图,分别用F3、F4、F5表示,将F3和F5经过三个不同膨胀系数的空洞卷积组成的并行空洞卷积模块增大感受野后,送入改进的特征金字塔进行多尺度特征融合;最后利用全局平均池化代替全连接层分类输出。试验结果表明,本研究提出的T-M-VGG算法在194头牦牛的38,800张数据集中识别准确率达到96.01%,模型大小为70.75 MB。随机选取12张不同类别牦牛图像进行面部遮挡测试,识别准确率为83.33%。本算法可以为牦牛脸部识别研究提供参考。

关键词: 牦牛, 脸部识别, 迁移学习, 特征金字塔, T-M-VGG

Abstract:

Identifying of yak is indispensable for individual documentation, behavior monitoring, precise feeding, disease prevention and control, food traceability, and individualized breeding. Aiming at the application requirements of animal individual identification technology in intelligent informatization animal breeding platforms, a yak face recognition algorithm based on transfer learning and multiscale feature fusion, i.e., transfer learning-multiscale feature fusion-VGG(T-M-VGG) was proposed. The sample data set of yak facial images was produced by a camera named GoPro HERO8 BLACK. Then, a part of dataset was increased by the data enhancement ways that involved rotating, adjusting the brightness and adding noise to improve the robustness and accuracy of model. T-M-VGG, a kind of convolutional neural network based on pre-trained visual geometry group network and transfer learning was input with normalized dataset samples. The feature map of Block3, Block4 and Block5 were considered as F3, F4 and F5, respectively. What's more, F3 and F5 were taken by the structure that composed of three parallel dilated convolutions, the dilation rate were one, two and three, respectively, to dilate the receptive filed which was the map size of feature map. Further, the multiscale feature maps were fused by the improved feature pyramid which was in the shape of stacked hourglass structure. Finally, the fully connected layer was replaced by the global average pooling to classify and reduce a large number of parameters. To verify the effectiveness of the proposed model, a comparative experiment was conducted. The experimental results showed that recognition accuracy rate in 38,800 data sets of 194 yaks reached 96.01%, but the storage size was 70.75 MB. Twelve images representing different yak categories from dataset were chosen randomly for occlusion test. The origin images were masked with different shape of occlusions. The accuracy of identifying yak individuals was 83.33% in the occlusion test, which showed that the model had mainly learned facial features. The proposed algorithm could provide a reference for research of yak face recognition and would be the foundation for the establishment of smart management platform.

Key words: yak, face recognition, transfer learning, feature pyramid structure, T-M-VGG

中图分类号: