
Smart Agriculture ›› 2023, Vol. 5 ›› Issue (4): 117-126.doi: 10.12133/j.smartag.SA202309024

• Topic: Artificial Intelligence and Robotics for Smart Agriculture •

An Automatic Measurement Method for Phenotype Parameters of Pleurotus geesteranus Based on an Improved Mask R-CNN Model

ZHOU Huamao1(), WANG Jing1, YIN Hua2, CHEN Qi2()

  1. College of Engineering, Jiangxi Agricultural University, Nanchang 330045, Jiangxi, China
    2. College of Computer and Information Engineering, Jiangxi Agricultural University, Nanchang 330045, Jiangxi, China
  • Received: 2023-09-22 Online: 2023-12-30
  • About the author:
    ZHOU Huamao, research interests: testing and sensing technology. E-mail:

  • Corresponding author:
    CHEN Qi, M.S., Associate Professor, research interests: plant phenotyping. E-mail:

Phenotype Analysis of Pleurotus geesteranus Based on Improved Mask R-CNN

ZHOU Huamao1(), WANG Jing1, YIN Hua2, CHEN Qi2()   

  1. College of Engineering, Jiangxi Agricultural University, Nanchang 330045, China
    2. College of Computer and Information Engineering, Jiangxi Agricultural University, Nanchang 330045, China
  • Received: 2023-09-22 Online: 2023-12-30
  • Corresponding author:
    CHEN Qi, E-mail:
  • Supported by:
    National Natural Science Foundation of China(62362039)

Abstract:

[Objective/Significance] The phenotype of Pleurotus geesteranus reflects both its quality and its adaptability to the cultivation environment, but manual measurement of phenotype parameters is time-consuming, labor-intensive, and subjective, so automated analysis methods are urgently needed. [Methods] A measurement model for Pleurotus geesteranus based on an improved Mask R-CNN, named PG-Mask R-CNN (Pleurotus geesteranus-Mask Region-Based Convolutional Neural Network), was proposed, together with a crack evaluation method that quantifies cracking through a damage-rate index. The PG-Mask R-CNN model takes Mask R-CNN as its backbone: the SimAM attention mechanism was added to the ResNet101 feature extraction network, improving performance without increasing the number of parameters; an improved feature pyramid was adopted for multi-scale fusion, so that predictions combine information from multiple levels; and the GIoU (Generalized Intersection over Union) bounding-box regression loss replaced the original IoU (Intersection over Union) loss, refining the computation of box overlap and further improving model performance. [Results and Discussions] The PG-Mask R-CNN model achieved a mAP of 84.8% and a mAR of 87.7% in object detection, both higher than those of the mainstream YOLACT (You Only Look At CoefficienTs), InstaBoost, QueryInst, and Mask R-CNN models; the MRE (Mean Relative Error) of its instance segmentation results was 0.90%, lower than that of the other instance segmentation models; and its parameter count was 51.75 M, slightly larger than the original Mask R-CNN but smaller than the other instance segmentation models. Measurements on the segmented pileus and cracks gave MREs of 1.30% and 7.54%, respectively, and the MAE (Mean Absolute Error) of the damage rate was 0.14%. [Conclusions] The proposed PG-Mask R-CNN model identifies and segments the stipe, pileus, and cracks of Pleurotus geesteranus with high accuracy, and on this basis enables automated measurement of phenotype parameters, laying a technical foundation for subsequent intelligent breeding, smart cultivation, and grading of Pleurotus geesteranus.
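The SimAM module mentioned above is a parameter-free attention mechanism, which is why it adds no parameters to ResNet101. The paper's implementation is not given here, so the following is only an illustrative sketch of the standard SimAM weighting (an inverse-energy score passed through a sigmoid), with the regularizer `lam` chosen arbitrarily:

```python
import numpy as np

def simam(x, lam=1e-4):
    """Parameter-free SimAM attention over one feature map x of
    shape (C, H, W). Each activation is re-weighted by a sigmoid
    of its inverse 'energy', which grows with how much the
    activation deviates from its channel mean."""
    n = x.shape[1] * x.shape[2] - 1
    mu = x.mean(axis=(1, 2), keepdims=True)          # channel means
    d = (x - mu) ** 2                                # squared deviations
    var = d.sum(axis=(1, 2), keepdims=True) / n      # channel variances
    e_inv = d / (4 * (var + lam)) + 0.5              # inverse energy
    return x / (1 + np.exp(-e_inv))                  # x * sigmoid(e_inv)
```

Because the sigmoid output lies strictly between 0 and 1, the module only attenuates activations (distinctive ones least), preserving signs and shapes, which makes it easy to drop into any backbone stage.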

Keywords: Pleurotus geesteranus, Mask R-CNN, SimAM module, ResNet101, phenotype analysis, improved feature pyramid

Abstract:

[Objective] Pleurotus geesteranus is a rare edible mushroom, popular among consumers for its fresh taste and rich nutritional content. Its phenotype is an important determinant of its overall quality, being a specific expression of its intrinsic characteristics and of its adaptation to various cultivation environments. It is therefore crucial to select varieties with excellent shape, integrity, and resistance to cracking during breeding. However, automated methods for measuring these phenotype parameters are still lacking. Manual measurement is not only time-consuming and labor-intensive but also subjective, which leads to inconsistent and inaccurate results. Thus, the traditional approach cannot meet the demands of the rapidly developing Pleurotus geesteranus industry. [Methods] To solve the problems mentioned above, this study first used an industrial-grade camera (Daheng MER-500-14GM) and a commonly available smartphone (Redmi K40) to capture high-resolution images at the DongSheng mushroom farm (Jiujiang, Jiangxi province). After discarding blurred and repetitive images, a total of 344 images were collected, covering two common varieties, Taixiu 57 and Gaoyou 818. A series of data augmentation operations, including rotation, flipping, mirroring, and blurring, was then employed to construct a comprehensive Pleurotus geesteranus image dataset. The final dataset consisted of 3 440 images and provided a robust foundation for the proposed phenotype recognition model. All images were divided into training and testing sets at a ratio of 8:2, ensuring a balanced distribution for effective model training.
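The augmentation and 8:2 split described above can be sketched as follows. This is not the authors' pipeline (which is not published with the abstract); the function names and the naive box blur are illustrative assumptions standing in for the rotation, flip, mirror, and blur operations:

```python
import numpy as np

def augment(image):
    """Yield simple augmented variants of one image (H, W):
    rotation, vertical flip, horizontal mirror, and a crude
    3x3 box blur as a stand-in for the blurring step."""
    yield np.rot90(image)               # 90-degree rotation
    yield np.flipud(image)              # vertical flip
    yield np.fliplr(image)              # horizontal mirror
    blurred = image.astype(float).copy()
    blurred[1:-1, 1:-1] = sum(          # average of each 3x3 neighborhood
        image[1 + dy:image.shape[0] - 1 + dy,
              1 + dx:image.shape[1] - 1 + dx].astype(float)
        for dy in (-1, 0, 1) for dx in (-1, 0, 1)) / 9.0
    yield blurred

def split_dataset(items, train_ratio=0.8, seed=0):
    """Shuffle and split a dataset at the paper's 8:2 ratio."""
    rng = np.random.default_rng(seed)
    idx = rng.permutation(len(items))
    cut = int(round(train_ratio * len(items)))
    return [items[i] for i in idx[:cut]], [items[i] for i in idx[cut:]]
```

On the 3 440-image dataset, an 8:2 split yields 2 752 training and 688 testing images.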
In the second part, building on the foundational structure of the classical Mask R-CNN, an enhanced version specifically tailored for Pleurotus geesteranus phenotype recognition, named PG-Mask R-CNN (Pleurotus geesteranus-Mask Region-based Convolutional Neural Network), was designed. The network was refined in three ways: 1) After careful analysis and comparison, the SimAM attention mechanism was integrated into the third layer of the ResNet101 feature extraction network, which enhanced the network's performance without increasing the number of parameters. 2) The feature pyramid path of Mask R-CNN is long enough to separate low-level and high-level features, which can impair the semantic information of the high-level features and lose the positioning information of the low-level features; an improved feature pyramid network was therefore used for multi-scale fusion, amalgamating information from multiple levels for prediction. 3) To address the limitation of the IoU (Intersection over Union) bounding-box metric, which considers only the overlapping area between the prediction box and the target box while ignoring the non-overlapping area, a more advanced loss function based on GIoU (Generalized Intersection over Union) was introduced. This replacement improved the calculation of box overlap and enhanced the performance of the model. Furthermore, to evaluate the crack state of Pleurotus geesteranus more scientifically, reasonably, and accurately, the damage rate was introduced as a new quantitative crack evaluation metric; it was calculated as the proportion of crack area within the complete pileus, and the MRE (Mean Relative Error) was used to assess the measured damage rate. Thirdly, the PG-Mask R-CNN network was trained and tested on the Pleurotus geesteranus image dataset.
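The GIoU replacement in point 3) can be made concrete with a minimal sketch. GIoU extends IoU by subtracting the fraction of the smallest enclosing box C not covered by the union, so it remains informative (and differentiable as a loss) even when the boxes do not overlap; the exact loss formulation used in the paper may differ:

```python
def giou(box_a, box_b):
    """Generalized IoU for axis-aligned boxes (x1, y1, x2, y2):
    GIoU = IoU - |C \\ (A U B)| / |C|, where C is the smallest
    box enclosing both A and B. Ranges over (-1, 1]."""
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b
    # intersection area (zero if the boxes do not overlap)
    inter = (max(0.0, min(ax2, bx2) - max(ax1, bx1))
             * max(0.0, min(ay2, by2) - max(ay1, by1)))
    union = ((ax2 - ax1) * (ay2 - ay1)
             + (bx2 - bx1) * (by2 - by1) - inter)
    iou = inter / union
    # smallest enclosing box C
    c_area = ((max(ax2, bx2) - min(ax1, bx1))
              * (max(ay2, by2) - min(ay1, by1)))
    return iou - (c_area - union) / c_area

def giou_loss(box_a, box_b):
    """Bounding-box regression loss: 1 - GIoU, in [0, 2)."""
    return 1.0 - giou(box_a, box_b)
```

Unlike plain IoU, which is 0 for every pair of disjoint boxes, GIoU becomes more negative as disjoint boxes move apart, giving the regressor a useful gradient.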
Based on the detection and segmentation results, measurements were taken and their accuracy was verified. Finally, considering that it was difficult to determine the ground truth for the irregular shapes of Pleurotus geesteranus, the same method was applied to four standard blocks of different specifications, which verified the rationality of the proposed method. [Results and Discussions] In the comparative analysis, the PG-Mask R-CNN model was superior to the Grabcut algorithm and four other instance segmentation models: YOLACT (You Only Look At CoefficienTs), InstaBoost, QueryInst, and Mask R-CNN. In object detection, the PG-Mask R-CNN model achieved a mAP of 84.8% and a mAR (mean Average Recall) of 87.7%, higher than those of the five methods mentioned above. Furthermore, the MRE of its instance segmentation results was 0.90%, consistently lower than that of the other instance segmentation models. In addition, from a model-size perspective, the PG-Mask R-CNN model had a parameter count of 51.75 M, slightly larger than that of the unimproved Mask R-CNN model but smaller than the other instance segmentation models. For the measurements derived from the segmented pileus and cracks, the MREs were 1.30% and 7.54%, respectively, while the MAE of the measured damage rate was 0.14%. [Conclusions] The proposed PG-Mask R-CNN model demonstrates high accuracy in identifying and segmenting the stipe, pileus, and cracks of Pleurotus geesteranus. It thus enables automated measurement of Pleurotus geesteranus phenotype parameters, laying a technical foundation for subsequent intelligent breeding, smart cultivation, and grading of Pleurotus geesteranus.
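The evaluation quantities used above (damage rate as the crack-to-pileus proportion, MRE, and MAE) can be written out directly. These are standard definitions; the paper's exact area units (pixels vs. physical area after calibration) are not specified in the abstract, so inputs here are generic areas:

```python
def damage_rate(crack_area, pileus_area):
    """Damage rate: proportion of crack area within the complete
    pileus (e.g. segmented pixel counts or calibrated areas)."""
    return crack_area / pileus_area

def mean_relative_error(measured, truth):
    """MRE in percent between measured and ground-truth values."""
    return 100.0 * sum(abs(m - t) / t
                       for m, t in zip(measured, truth)) / len(truth)

def mean_absolute_error(measured, truth):
    """MAE between measured and ground-truth values."""
    return sum(abs(m - t)
               for m, t in zip(measured, truth)) / len(truth)
```

For example, a pileus of 100 area units with 5 units of cracks has a damage rate of 0.05; reporting MRE in percent matches the 0.90%, 1.30%, and 7.54% figures quoted in the results.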

Key words: Pleurotus geesteranus, Mask R-CNN, SimAM attention mechanism, ResNet101, phenotype analysis, improved feature pyramid network

CLC number: