Root Image Segmentation Method Based on Improved UNet and Transfer Learning

doi:10.12133/j.smartag.SA202308003

Abstract

Abstract:

[Objective] The root system is an important component of plant composition, and its growth and development are crucial for plants. Root image segmentation is an important method for obtaining root phenotype information and analyzing root growth patterns. Research on root image segmentation still faces difficulties, because of the noise and image quality limitations, the intricate and diverse soil environment, and the ineffectiveness of conventional techniques. This paper proposed a multi-scale feature extraction root segmentation algorithm that combined data augmentation and transfer learning to enhance the generalization and universality of the root image segmentation models in order to increase the speed, accuracy, and resilience of root image segmentation. [Methods] Firstly, the experimental datasets were divided into a single dataset and a mixed dataset. The single dataset acquisition was obtained from the experimental station of Hebei Agricultural University in Baoding city. Additionally, a self-made RhizoPot device was used to collect images with a resolution pixels of 10,200×14,039, resulting in a total of 600 images. In this experiment, 100 sheets were randomly selected to be manually labeled using Adobe Photoshop CC2020 and segmented into resolution pixels of 768×768, and divided into training, validation, and test sets according to 7:2:1. To increase the number of experimental samples, an open source multi-crop mixed dataset was obtained in the network as a supplement, and it was reclassified into training, validation, and testing sets. The model was trained using the data augmentation strategy, which involved performing data augmentation operations at a set probability of 0.3 during the image reading phase, and each method did not affect the other. When the probability was less than 0.3, changes would be made to the image. Specific data augmentation methods included changing image attributes, randomly cropping, rotating, and flipping those images. The UNet structure was improved by designing eight different multi-scale image feature extraction modules. The module structure mainly included two aspects: Image convolution and feature fusion. The convolution improvement included convolutional block attention module (CBAM), depthwise separable convolution (DP Conv), and convolution (Conv). In terms of feature fusion methods, improvements could be divided into concatenation and addition. Subsequently, ablation tests were conducted based on a single dataset, data augmentation, and random loading of model weights, and the optimal multi-scale feature extraction module was selected and compared with the original UNet. Similarly, a single dataset, data augmentation, and random loading of model weights were used to compare and validate the advantages of the improved model with the PSPNet, SegNet, and DeeplabV3Plus algorithms. The improved model used pre-trained weights from a single dataset to load and train the model based on mixed datasets and data augmentation, further improving the model's generalization ability and root segmentation ability. [Results and Discussions] The results of the ablation tests indicated that Conv_ 2+Add was the best improved algorithm. Compared to the original UNet, the mIoU, mRecall, and root F₁ values of the model increased by 0.37%, 0.99%, and 0.56%, respectively. And, comparative experiments indicate Unet+Conv_2+Add model was superior to the PSPNet, SegNet, and DeeplabV3Plus models, with the best evaluation results. And the values of mIoU, mRecall, and the harmonic average of root F₁ were 81.62%, 86.90%, and 77.97%, respectively. The actual segmented images obtained by the improved model were more finely processed at the root boundary compared to other models. However, for roots with deep color and low contrast with soil particles, the improved model could only achieve root recognition and the recognition was sparse, sacrificing a certain amount of information extraction ability. This study used the root phenotype evaluation software Rhizovision to analyze the root images of the Unet+Conv_2+Add improved model, PSPNet, SegNet, and DeeplabV3Plu, respectively, to obtain the values of the four root phenotypes (total root length, average diameter, surface area, and capacity), and the results showed that the average diameter and surface area indicator values of the improved model, Unet+Conv_2+Add had the smallest differences from the manually labeled indicator values and the SegNet indicator values for the two indicators. Total root length and volume were the closest to those of the manual labeling. The results of transfer learning experiments proved that compared with ordinary training, the transfer training of the improved model UNet+Conv_2+Add increased the IoU value of the root system by 1.25%. The Recall value of the root system was increased by 1.79%, and the harmonic average value of F₁ was increased by 0.92%. Moreover, the overall convergence speed of the model was fast. Compared with regular training, the transfer training of the original UNet improved the root IoU by 0.29%, the root Recall by 0.83%, and the root F₁ value by 0.21%, which indirectly confirmed the effectiveness of transfer learning. [Conclusions] The multi-scale feature extraction strategy proposed in this study can accurately and efficiently segment roots, and further improve the model's generalization ability using transfer learning methods, providing an important research foundation for crop root phenotype research.

Key words: deep learning, root image segmentation, UNet, multi-scale characteristics, transfer learning

TANG Hui, WANG Ming, YU Qiushi, ZHANG Jiaxi, LIU Liantao, WANG Nan. Root Image Segmentation Method Based on Improved UNet and Transfer Learning[J]. Smart Agriculture, 2023, 5(3): 96-109.

Figures/Tables 16

Fig. 1

Fig. 2

Table 1

Fig. 3

Fig. 4

Fig. 5

Table 2

Table 3

Fig. 6

Table 4

Fig. 7

Fig. 8

Table 5

Fig. 9

Table 6

Fig. 10

References 32

1	王宁, 李继光, 娄翼来, 等. 作物根系形态对施肥措施的响应[J]. 中国农学通报, 2020, 36(3): 53-58.
	WANG N, LI J G, LOU Y L, et al. Response of crop root morphology to fertilization measures[J]. Chinese agricultural science bulletin, 2020, 36(3): 53-58.
2	DONG H Z, NIU Y H, LI W J, et al. Effects of cotton rootstock on endogenous cytokinins and abscisic acid in xylem sap and leaves in relation to leaf senescence[J]. Journal of experimental botany, 2008, 59(6): 1295-1304.
3	吴茜, 张伟欣, 张玲玲, 等. 植物根系表型信息获取技术研究进展[J]. 江苏农业科学, 2021, 49(5): 31-37.
	WU Q, ZHANG W X, ZHANG L L, et al. Research progress on acquisition of plant root phenotype information[J]. Jiangsu agricultural sciences, 2021, 49(5): 31-37.
4	ZHANG B W. Plant root research methods and trends[J]. Agricultural science & technology, 2017, 18(12): 2295-2298, 2302.
5	肖爽, 刘连涛, 张永江, 等. 植物微根系原位观测方法研究进展[J]. 植物营养与肥料学报, 2020, 26(2): 370-385.
	XIAO S, LIU L T, ZHANG Y J, et al. Review on new methods of in situ observation of plant micro-roots and interpretation of root images[J]. Journal of plant nutrition and fertilizers, 2020, 26(2): 370-385.
6	赵先丽, 蔡福, 李荣平, 等. 春玉米根系图像语义分割最佳分辨率和概率阈值研究[J]. 核农学报, 2023, 37(8): 1690-1699.
	ZHAO X L, CAI F, LI R P, et al. Optimal resolution and probability threshold for the semantic segmentation of spring maize root image[J]. Journal of nuclear agricultural sciences, 2023, 37(8): 1690-1699.
7	何勇, 李禧尧, 杨国峰, 等. 室内高通量种质资源表型平台研究进展与展望[J]. 农业工程学报, 2022, 38(17): 127-141.
	HE Y, LI X Y, YANG G F, et al. Research progress and prospect of indoor high-throughput germplasm phenotyping platforms[J]. Transactions of the Chinese society of agricultural engineering, 2022, 38(17): 127-141.
8	PERELMAN A, LAZAROVITCH N, VANDERBORGHT J, et al. Quantitative imaging of sodium concentrations in soil-root systems using magnetic resonance imaging (MRI)[J]. Plant and soil, 2020, 454(1/2): 171-185.
9	SCOTSON C, DUNCAN S, WILLIAMS K, et al. X‐ray computed tomography imaging of solute movement through ridged and flat plant systems[J]. European journal of soil science, 2021, 72 (1): 198-214
10	HAMMAC W A, PAN W L, BOLTON R P, et al. High resolution imaging to assess oilseed species' root hair responses to soil water stress[J]. Plant and soil, 2011, 339(1/2): 125-135.
11	MOHAMED A, MONNIER Y, MAO Z, et al. An evaluation of inexpensive methods for root image acquisition when using rhizotrons[J]. Plant methods, 2017, 13(1): 1-13.
12	ZHAO H J, WANG N, SUN H C, et al. RhizoPot platform: A high-throughput in situ root phenotyping platform with integrated hardware and software[J]. Frontiers in plant science, 2022, 13: ID 1004904.
13	DAS A, SCHNEIDER H, BURRIDGE J, et al. Digital imaging of root traits (DIRT): A high-throughput computing and collaboration platform for field-based root phenomics[J]. Plant methods, 2015, 11: ID 51.
14	GALKOVSKYI T, MILEYKO Y, BUCKSCH A, et al. GiA Roots: Software for the high throughput analysis of plant root system architecture[J]. BMC plant biology, 2012, 12: ID 116.
15	PIERRET A, GONKHAMDEE S, JOURDAN C, et al. IJ_Rhizo: An open-source software to measure scanned images of root samples[J]. Plant and soil, 2013, 373(1/2): 531-539.
16	ARMENGAUD P, ZAMBAUX K, HILLS A, et al. EZ-Rhizo: Integrated software for the fast and accurate measurement of root system architecture[J]. The plant journal, 2009, 57(5): 945-956.
17	LONG J, SHELHAMER E, DARRELL T. Fully convolutional networks for semantic segmentation[EB/OL]. arXiv: 1411.4038, 2014.
18	KAMAL S, SHENDE V G, SWAROOPA K, et al. FCN network-based weed and crop segmentation for IoT-aided agriculture applications[J]. Wireless communications and mobile computing, 2022, 2022: 1-10.
19	BADRINARAYANAN V, KENDALL A, CIPOLLA R. SegNet: A deep convolutional encoder-decoder architecture for image segmentation[EB/OL]. arXiv: 1511.00561, 2015.
20	WANG T, ROSTAMZA M, SONG Z H, et al. SegRoot: A high throughput segmentation method for root image analysis[J]. Computers and electronics in agriculture, 2019, 162: 845-854.
21	ZHAO H S, SHI J P, QI X J, et al. Pyramid scene parsing network[EB/OL]. arXiv: 1612.01105, 2016.
22	ZHANG R, CHEN J, FENG L, et al. A Refined Pyramid Scene Parsing Network for Polarimetric SAR Image Semantic Segmentation in Agricultural Areas[J]. IEEE Geoscience and Remote Sensing Letters, 2022, 19: 1-5.
23	CHEN L C, ZHU Y K, PAPANDREOU G, et al. Encoder-decoder with atrous separable convolution for semantic image segmentation[EB/OL]. arXiv: 1802.02611. 2018.
24	CHEN L C, PAPANDREOU G, SCHROFF F, et al. Rethinking atrous convolution for semantic image segmentation[EB/OL]. arXiv: 1706.05587, 2017.
25	KANG J, LIU L T, ZHANG F C, et al. Semantic segmentation model of cotton roots in situ image based on attention mechanism[J]. Computers and electronics in agriculture, 2021, 189: ID 106370.
26	SHEN C, LIU L T, ZHU L X, et al. High-throughput in situ root image segmentation based on the improved DeepLabv3+ method[J]. Frontiers in plant science, 2020, 11: ID 576791.
27	RONNEBERGER O, FISCHER P, BROX T. U-net: Convolutional networks for biomedical image segmentation[EB/OL]. arXiv: 1505.04597, 2015.
28	林娜, 何静, 王斌, 等. 结合植被光谱特征与Sep-UNet的城市植被信息智能提取方法[J]. 地球信息科学学报, 2023, 25(8): 1717-1729.
	LIN N, HE J, WANG B, et al. Intelligent extraction of urban vegetation information based on vegetation spectral signature and sep-UNet[J]. Journal of geo-information science, 2023, 25(8): 1717-1729.
29	申传庆, 王凯, 王文杰. 基于ResNet-UNet的地表覆盖自动分类技术研究[J]. 地理空间信息, 2023, 21(6): 21-23, 27.
	SHEN C Q, WANG K, WANG W J. Research on automatic classification technology of land coverage based on ResNet-UNet[J]. Geospatial information, 2023, 21(6): 21-23, 27.
30	陈桂芬, 赵姗, 曹丽英, 等. 基于迁移学习与卷积神经网络的玉米植株病害识别[J]. 智慧农业, 2019, 1(2): 34-44.
	CHEN G F, ZHAO S, CAO L Y, et al. Corn plant disease recognition based on migration learning and convolutional neural network[J]. Smart agriculture, 2019, 1(2): 34-44.
31	XU W H, YU G H, CUI Y M, et al. PRMI: A dataset of minirhizotron images for diverse plant root study[EB/OL]. arXiv: 2201.08002, 2022.
32	SEETHEPALLI A, DHAKAL K, GRIFFITHS M, et al. RhizoVision Explorer: Open-source software for root image analysis and measurement standardization[J]. AoB PLANTS, 2021, 13(6): ID plab056.

数据集类型	棉花	木瓜	花生	花生	芝麻	芝麻	向日葵
分辨率/px	736×552	736×552	640×480	736×552	640×480	736×552	640×480
训练集/张	1271	282	10,087	11,485	1438	8637	2211
验证集/张	564	131	3413	3347	318	2625	722
测试集/张	577	133	3542	4793	404	3048	967

模型改进策略	模型解释
Conv_1+Concat	进行一次完整卷积计算，并将其上采样和每层编码器中特征图拼接
Conv_2+Concat	进行两次完整卷积计算，并将其上采样和每层编码器中特征图拼接
DP Conv+Concat	进行两次深度可分离卷积计算，并将其上采样和每层编码器中特征图拼接
CBAM+Concat	进行完整卷积后再进行注意力机制计算，并将其上采样和每层编码器中特征图拼接
Conv_1+Add	进行一次完整卷积计算，并将其与跳跃连接的特征图相加并和上采样进行拼接
Conv_2+Add （本研究）	进行两次完整卷积计算，并将其与跳跃连接的特征图相加并和上采样进行拼接
DP Conv+Add	进行两次深度可分离卷积计算，并将其与跳跃连接的特征图相加并和上采样进行拼接
CBAM+Add	进行完整卷积后，进行注意力机制计算，并将其与跳跃连接的特征图相加并和上采样进行拼接

评估指标	UNet	Conv_1+Concat	Conv_2+Concat	DP Conv+Concat	CBAM+Concat	Conv_1+Add	Conv_2+Add	DP Conv +Add	CBAM+Add
R IoU/%	63.71	55.61	63.83	62.81	63.71	63.39	64.44	63.00	63.90
B IoU/%	98.79	98.56	98.79	98.76	98.79	98.77	98.79	98.75	98.79
mIoU/%	81.25	77.08	81.31	80.79	81.25	81.08	81.62	80.88	81.35
R Recall/%	72.39	61.68	72.68	71.18	72.22	72.57	74.25	72.36	72.89
B Recall/%	99.60	99.68	99.59	99.61	99.60	99.57	99.55	99.56	99.56
mRecall/%	85.99	80.68	86.13	85.39	85.91	86.07	86.90	85.96	86.24
R Precision/%	84.16	84.95	83.98	84.23	84.39	83.36	83.00	82.96	83.82
B Precision/%	99.18	98.87	99.19	99.15	99.18	99.19	99.24	99.18	99.20
mPrecision/%	91.67	91.91	91.59	91.69	91.78	91.27	91.12	91.07	91.51
R F ₁/%	77.83	71.47	77.92	77.16	77.83	77.59	78.38	77.30	77.97
B F ₁/%	99.39	99.27	99.39	99.38	99.39	99.38	99.39	99.37	99.38

估计指标	DeeplabV3Plus	PSPNet	SegNet	改进模型（UNet+Conv_2+Add）
Root IoU/%	64.00	54.33	63.08	64.44
Background IoU/%	98.79	98.53	98.79	98.79
mIoU/%	81.39	76.43	89.93	81.62
Root Recall/%	73.53	59.51	73.86	74.25
Background Recall/%	99.47	99.72	99.55	99.55
mRecall/%	86.50	79.61	86.71	86.90
Root Precision/%	81.18	86.17	82.87	83.00
Background Precision/%	99.31	98.81	99.23	99.24
mPrecision/%	90.24	92.49	91.05	91.12
Root F ₁/%	77.17	70.40	78.11	78.38
Background F ₁/%	99.39	99.26	99.39	99.39

方法	总根长/px	平均直径/px	容量/px³	表面积/px²
手工标注	281,884.9367	16.4984	86,505,316.2980	13,216,115.0220
改进模型UNet+Conv_2+Add	236,648.6779	16.2529	90,592,259.8600	13,275,772.0500
PSPNet	186,125.1123	14.1353	61,695,138.3499	9,377,353.0364
SegNet	240,006.0245	15.7012	85,858,025.2235	12,975,598.3651
DeeplabV3Plus	225,178.9484	15.6688	78,377,863.8983	12,039,045.5469