Image Segmentation Method of Chinese Yam Leaves in Complex Background Based on Improved ENet

doi:10.12133/j.smartag.SA202407007

Abstract

Abstract:

[Objective] Crop leaf area is an important indicator reflecting light absorption efficiency and growth conditions. This paper established a diverse Chinese yam image dataset and proposesd a deep learning-based method for Chinese yam leaf image segmentation. This method can be used for real-time measurement of Chinese yam leaf area, addressing the inefficiency of traditional measurement techniques. This will provide more reliable data support for genetic breeding, growth and development research of Chinese yam, and promote the development and progress of the Chinese yam industry. [Methods] A lightweight segmentation network based on improved ENet was proposed. Firstly, based on ENet, the third stage was pruned to reduce redundant calculations in the model. This improved the computational efficiency and running speed, and provided a good basis for real-time applications. Secondly, PConv was used instead of the conventional convolution in the downsampling bottleneck structure and conventional bottleneck structure, the improved bottleneck structure was named P-Bottleneck. PConv applied conventional convolution to only a portion of the input channels and left the rest of the channels unchanged, which reduced memory accesses and redundant computations for more efficient spatial feature extraction. PConv was used to reduce the amount of model computation while increase the number of floating-point operations per second on the hardware device, resulting in lower latency. Additionally, the transposed convolution in the upsampling module was improved to bilinear interpolation to enhance model accuracy and reduce the number of parameters. Bilinear interpolation could process images smoother, making the processed images more realistic and clear. Finally, coordinate attention (CA) module was added to the encoder to introduce the attention mechanism, and the model was named CBPA-ENet. The CA mechanism not only focused on the channel information, but also keenly captured the orientation and position-sensitive information. The position information was embedded into the channel attention to globally encode the spatial information, capturing the channel information along one spatial direction while retaining the position information along the other spatial direction. The network could effectively enhance the attention to important regions in the image, and thus improve the quality and interpretability of segmentation results. [Results and Discussions] Trimming the third part resulted in a 28% decrease in FLOPs, a 41% decrease in parameters, and a 9 f/s increase in FPS. Improving the upsampling method to bilinear interpolation not only reduces the floating-point operation and parameters, but also slightly improves the segmentation accuracy of the model, increasing FPS by 4 f/s. Using P-Bottleneck instead of downsampling bottleneck structure and conventional bottleneck structure can reduce mIoU by only 0.04%, reduce FLOPs by 22%, reduce parameters by 16%, and increase FPS by 8 f/s. Adding CA mechanism to the encoder could only increase a small amount of FLOPs and parameters, improving the accuracy of the segmentation network. To verify the effectiveness of the improved segmentation algorithm, classic semantic segmentation networks of UNet, DeepLabV3+, PSPNet, and real-time semantic segmentation network LinkNet, DABNet were selected to train and validate. These six algorithms got quite high segmentation accuracy, among which UNet had the best mIoU and the mPA, but the model size was too large. The improved algorithm only accounts for 1% of the FLOPs and 0.41% of the parameters of UNet, and the mIoU and mPA were basically the same. Other classic semantic segmentation algorithms, such as DeepLabV3+, had similar accuracy to improved algorithms, but their large model size and slow inference speed were not conducive to embedded development. Although the real-time semantic segmentation algorithm LinkNet had a slightly higher mIoU, its FLOPs and parameters count were still far greater than the improved algorithm. Although the PSPNet model was relatively small, it was also much higher than the improved algorithm, and the mIoU and mPA were lower than the algorithm. The experimental results showed that the improved model achieved a mIoU of 98.61%. Compared with the original model, the number of parameters and FLOPs significantly decreased. Among them, the number of model parameters decreased by 51%, the FLOPs decreased by 49%, and the network operation speed increased by 38%. [Conclusions] The improved algorithm can accurately and quickly segment Chinese yam leaves, providing not only a more accurate means for determining Chinese yam phenotype data, but also a new method and approach for embedded research of Chinese yam. Using the model, the morphological feature data of Chinese yam leaves can be obtained more efficiently, providing a reliable foundation for further research and analysis.

Key words: Chinese yam, image segmentation, deep learning, ENet, partial convolution, CA mechanism

CLC Number:

S513

LU Bibo, LIANG Di, YANG Jie, SONG Aiqing, HUANGFU Shangwei. Image Segmentation Method of Chinese Yam Leaves in Complex Background Based on Improved ENet[J]. Smart Agriculture, 2024, 6(6): 109-120.

Figures/Tables 18

Table 1

Fig. 1

Fig. 2

Table 2

Fig. 3

Fig. 4

Fig. 5

Fig. 6

Fig. 7

Fig. 8

Fig. 9

Table 3

Ablation experiments on segmentation of yam leaves images

Test No.	Model	mIoU/%	mPA/%	Accuracy/%	FPS/（f/s）	Inference time/ms	FLOPs/G	Params/ $M$
0	ENet	98.58	99.24	99.57	50	20.00	2.178	0.3492
1	C-ENet	98.48	99.19	99.54	59	18.87	1.563	0.2046
2	CB-ENet	98.53	99.23	99.55	63	15.87	1.428	0.1995
3	CP-ENet	98.45	99.18	99.53	68	14.71	1.295	0.1758
4	CPB-ENet	98.49	99.25	99.53	71	14.08	1.112	0.1681
5	CPBA-ENet	98.61	99.32	99.62	69	14.49	1.114	0.1714

Table 3

Fig. 10

Fig. 11

Fig. 12

Table 4

Fig. 13

Fig. 14

References 32

1	RATSIMBAZAFY M K, SHARP P A, RAZANAMPARANY L, et al. Wild edible yams from Madagascar: New insights into nutritional composition support their use for food security and conservation[J]. Food science & nutrition, 2024, 12(1): 280-291.
2	ZHOU S Y, HUANG G L, CHEN G Y. Extraction, structural analysis, derivatization and antioxidant activity of polysaccharide from Chinese yam[J]. Food chemistry, 2021, 361: ID 130089.
3	HWANG J H, PARK Y S, KIM H S, et al. Yam-derived exosome-like nanovesicles stimulate osteoblast formation and prevent osteoporosis in mice[J]. Journal of controlled release, 2023, 355: 184-198.
4	CHANG H Y, TONG X Y, YANG H Q, et al. Chinese yam (dioscorea opposita) and its bioactive compounds: The beneficial effects on gut microbiota and gut health[J]. Current opinion in food science, 2024, 55: ID 101121.
5	ZENG X X, LIU D H, HUANG L Q. Metabolome profiling of eight Chinese yam (Dioscorea polystachya Turcz.) varieties reveals metabolite diversity and variety specific uses[J]. Life, 2021, 11(7): ID 687.
6	WU Z G, JIANG W, NITIN M, et al. Characterizing diversity based on nutritional and bioactive compositions of yam germplasm (Dioscorea spp.) commonly cultivated in China[J]. Journal of food and drug analysis, 2016, 24(2): 367-375.
7	温建荣. 山药传统生产与现代生产的区别与比较[J]. 江西农业, 2018(18): 14.
	WEN J R. Difference and comparison between traditional production and modern production of yam[J]. Jiangxi agriculture, 2018(18): 14.
8	王永乐. 让"科研之花"结出山药"产业之果"[N]. 河南日报, 2024-03-17(13).
9	郝雅洁, 张吴平, 史维杰, 等. 基于计算机视觉的小麦叶面积测量[J]. 湖北农业科学, 2019, 58(16): 129-132.
	HAO Y J, ZHANG W P, SHI W J, et al. Measurement of wheat leaf area based on computer vision[J]. Hubei agricultural sciences, 2019, 58(16): 129-132.
10	GONG A P, WU X, QIU Z J, et al. A handheld device for leaf area measurement[J]. Computers and electronics in agriculture, 2013, 98: 74-80.
11	LI Z B, GUO R H, LI M, et al. A review of computer vision technologies for plant phenotyping[J]. Computers and electronics in agriculture, 2020, 176: ID 105672.
12	WENG Y, ZENG R, WU C M, et al. A survey on deep-learning-based plant phenotype research in agriculture[J]. Scientia sinica vitae, 2019, 49(6): 698-716.
13	ZHANG H C, WANG L, JIN X L, et al. High-throughput phenotyping of plant leaf morphological, physiological, and biochemical traits on multiple scales using optical sensing[J]. The crop journal, 2023, 11(5): 1303-1318.
14	李方一, 黄璜, 官春云. 作物叶面积测量的研究进展[J]. 湖南农业大学学报(自然科学版), 2021, 47(3): 274-282.
	LI F Y, HUANG H, GUAN C Y. Review on measurement of crop leaf area[J]. Journal of Hunan agricultural university (natural sciences), 2021, 47(3): 274-282.
15	崔世钢, 秦建华. 图像处理法测定油菜叶面积的研究[J]. 湖北农业科学, 2017, 56(14): 2756-2757, 2767.
	CUI S G, QIN J H. Study on the determination of leaf area of rape by image processing[J]. Hubei agricultural sciences, 2017, 56(14): 2756-2757, 2767.
16	于东玉, 冯天祥, 李奕昕, 等. 基于植物图像的活体叶片面积测量方法研究与实现[J]. 智能计算机与应用, 2019, 9(4): 173-176.
	YU D Y, FENG T X, LI Y X, et al. Research and implementation of living leaf area measurement based on plant image[J]. Intelligent computer and applications, 2019, 9(4): 173-176.
17	李秋洁, 杨远明, 袁鹏成, 等. 基于饱和度分割的叶面积图像测量方法[J]. 林业工程学报, 2021, 6(4): 147-152.
	LI Q J, YANG Y M, YUAN P C, et al. Image measurement method of leaf area based on saturation segmentation[J]. Journal of forestry engineering, 2021, 6(4): 147-152.
18	ViVEKANANTHAN V, VIGNESH R, VASANTHASEELAN S, et al. Concrete bridge crack detection by image processing technique by using the improved OTSU method[J]. Materials today: Proceedings, 2023, 74: 1002-1007.
19	YUAN H B, ZHU J J, WANG Q F, et al. An improved DeepLab v3+ deep learning network applied to the segmentation of grape leaf black rot spots[J]. Frontiers in plant science, 2022, 13: ID 795410.
20	BHAGAT S, KOKARE M, HASWANI V, et al. Eff-UNet++: A novel architecture for plant leaf segmentation and counting[J]. Ecological informatics, 2022, 68: ID 101583.
21	LU J W, LU B B, MA W L, et al. EAIS-Former: An efficient and accurate image segmentation method for fruit leaf diseases[J]. Computers and electronics in agriculture, 2024, 218: ID 108739.
22	陈从平, 钮嘉炜, 丁坤, 等. 基于深度学习的马铃薯病害智能识别[J]. 计算机仿真, 2023, 40(2): 214-217, 222.
	CHEN C P, NIU J W, DING K, et al. Intelligent identification of potato diseases based on deep learning[J]. Computer simulation, 2023, 40(2): 214-217, 222.
23	杜鹏飞, 黄媛, 高欣娜, 等. 基于语义分割的复杂背景下黄瓜叶部病害严重程度分级研究[J]. 中国农机化学报, 2023, 44(11): 138-147.
	DU P F, HUANG Y, GAO X N, et al. Research on cucumber leaf disease severity classification in complex background based on semantic segmentation[J]. China agricultural machinery chemistry, 2023, 44(11): 138-147.
24	RONNEBERGER O, FISCHER P, BROX T. U-net: Convolutional networks for biomedical image segmentation[M]// NAVAB N, HORNEGGER J, WELLS W M, et al, eds. Lecture Notes in Computer Science. Cham: Springer International Publishing, 2015: 234-241.
25	BADRINARAYANAN V, KENDALL A, CIPOLLA R. SegNet: A deep convolutional encoder-decoder architecture for image segmentation[J]. IEEE transactions on pattern analysis and machine intelligence, 2017, 39(12): 2481-2495.
26	管博伦, 张立平, 朱静波, 等. 农业病虫害图像数据集构建关键问题及评价方法综述[J]. 智慧农业(中英文), 2023, 5(3): 17-34.
	GUAN B L, ZHANG L P, ZHU J B, et al. The key issues and evaluation methods for constructing agricultural pest and disease image datasets: A review[J]. Smart agriculture, 2023, 5(3): 17-34.
27	PASZKE A, CHAURASIA A, KIM S, et al. ENet: A deep neural network architecture for real-time semantic segmentation[EB/OL]. arXiv: 1606.02147, 2016.
28	CHEN J R, KAO S H, HE H, et al. Run, don't walk: Chasing higher FLOPS for faster neural networks[C]// 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway, New Jersey, USA: IEEE, 2023: 12021-12031.
29	KIM K H, SHIM P S, SHIN S. An alternative bilinear interpolation method between spherical grids[J]. Atmosphere, 2019, 10(3): ID 123.
30	GUO M H, XU T X, LIU J J, et al. Attention mechanisms in computer vision: A survey[J]. Computational visual media, 2022, 8(3): 331-368.
31	NIU Z Y, ZHONG G Q, YU H. A review on the attention mechanism of deep learning[J]. Neurocomputing, 2021, 452: 48-62.
32	HOU Q B, ZHOU D Q, FENG J S. Coordinate attention for efficient mobile network design[C]// 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway, New Jersey, USA: IEEE, 2021: 13713-13722.

类别	初始数量/张	数据增强	最终数量/张
室内	1 077	是	1 500
室外	129	是	1 032
总计	1 206	—	2 532

[1]	MA Liu, MAO Kebiao, GUO Zhonghua. Defogging Remote Sensing Images Method Based on a Hybrid Attention-Based Generative Adversarial Network [J]. Smart Agriculture, 2025, 7(2): 172-182.
[2]	MA Weiwei, CHEN Yue, WANG Yongmei. Recognition of Sugarcane Leaf Diseases in Complex Backgrounds Based on Deep Network Ensembles [J]. Smart Agriculture, 2025, 7(1): 136-145.
[3]	XU Shiwei, LI Qianchuan, LUAN Rupeng, ZHUANG Jiayu, LIU Jiajia, XIONG Lu. Agricultural Market Monitoring and Early Warning: An Integrated Forecasting Approach Based on Deep Learning [J]. Smart Agriculture, 2025, 7(1): 57-69.
[4]	YANG Xinting, HU Huan, CHEN Xiao, LI Wenzheng, ZHOU Zijie, LI Wenyong. Lightweight Detection and Recognition Model for Small Target Pests on Sticky Traps in Multi-Source Scenarios [J]. Smart Agriculture, 2025, 7(1): 111-123.
[5]	GONG Yu, WANG Ling, ZHAO Rongqiang, YOU Haibo, ZHOU Mo, LIU Jie. Tomato Growth Height Prediction Method by Phenotypic Feature Extraction Using Multi-modal Data [J]. Smart Agriculture, 2025, 7(1): 97-110.
[6]	QI Zijun, NIU Dangdang, WU Huarui, ZHANG Lilin, WANG Lunfeng, ZHANG Hongming. Chinese Kiwifruit Text Named Entity Recognition Method Based on Dual-Dimensional Information and Pruning [J]. Smart Agriculture, 2025, 7(1): 44-56.
[7]	ZHANG Hui, HU Jun, SHI Hang, LIU Changxi, WU Miao. Precision Target Spraying System Integrated with Remote Deep Learning Recognition Model for Cabbage Plant Centers [J]. Smart Agriculture, 2024, 6(6): 85-95.
[8]	LUO Youlu, PAN Yonghao, XIA Shunxing, TAO Youzhi. Lightweight Apple Leaf Disease Detection Algorithm Based on Improved YOLOv8 [J]. Smart Agriculture, 2024, 6(5): 128-138.
[9]	LIU Yi, ZHANG Yanjun. ReluformerN: Lightweight High-Low Frequency Enhanced for Hyperspectral Agricultural Lancover Classification [J]. Smart Agriculture, 2024, 6(5): 74-87.
[10]	NIAN Yue, ZHAO Kaixuan, JI Jiangtao. Cow Hoof Slippage Detecting Method Based on Enhanced DeepLabCut Model [J]. Smart Agriculture, 2024, 6(5): 153-163.
[11]	ZHANG Yanqi, ZHOU Shuo, ZHANG Ning, CHAI Xiujuan, SUN Tan. A Regional Farming Pig Counting System Based on Improved Instance Segmentation Algorithm [J]. Smart Agriculture, 2024, 6(4): 53-63.
[12]	WENG Zhi, FAN Qi, ZHENG Zhiqiang. Automatic Measurement Method of Beef Cattle Body Size Based on Multimodal Image Information and Improved Instance Segmentation Network [J]. Smart Agriculture, 2024, 6(4): 64-75.
[13]	FAN Mingshuo, ZHOU Ping, LI Miao, LI Hualong, LIU Xianwang, MA Zhirun. Automatic Navigation and Spraying Robot in Sheep Farm [J]. Smart Agriculture, 2024, 6(4): 103-115.
[14]	HOU Yiting, RAO Yuan, SONG He, NIE Zhenjun, WANG Tan, HE Haoxu. A Rapid Detection Method for Wheat Seedling Leaf Number in Complex Field Scenarios Based on Improved YOLOv8 [J]. Smart Agriculture, 2024, 6(4): 128-137.
[15]	ZHANG Yu, LI Xiangting, SUN Yalin, XUE Aidi, ZHANG Yi, JIANG Hailong, SHEN Weizheng. Real-Time Monitoring Method for Cow Rumination Behavior Based on Edge Computing and Improved MobileNet v3 [J]. Smart Agriculture, 2024, 6(4): 29-41.