基于改进ENet的复杂背景下山药叶片图像分割方法

doi:10.12133/j.smartag.SA202407007

Smart Agriculture ›› 2024, Vol. 6 ›› Issue (6): 109-120.doi: 10.12133/j.smartag.SA202407007

• 专题--农业知识智能服务和智慧无人农场（上） • 上一篇下一篇

基于改进ENet的复杂背景下山药叶片图像分割方法

芦碧波¹(), 梁迪¹, 杨洁²(), 宋爱青², 皇甫尚卫²

^1. 河南理工大学计算机科学与技术学院，河南焦作 454003，中国
^2. 焦作市农林科学研究院特色农业研究所，河南焦作 454150，中国

收稿日期:2024-07-05 出版日期:2024-11-30
基金项目:
国家自然科学基金面上项目(42272178); 2024年度河南省高等学校重点科研项目(24B520013); 2022年度河南省重点研发与推广专项（科技攻关）项目(222102210131); 河南理工大学基本科研业务费专项项目（自然科学类）(NSFRF240508)
作者简介:
芦碧波，研究方向为图像处理，人工智能。E-mail：lubibo@hpu.edu.cn
通信作者:
杨洁，硕士研究生，农艺师，研究方向为山药、地黄等特色作物的育种和栽培技术的研究与推广。E-mail：jznlytsyjs@163.com

Image Segmentation Method of Chinese Yam Leaves in Complex Background Based on Improved ENet

LU Bibo¹(), LIANG Di¹, YANG Jie²(), SONG Aiqing², HUANGFU Shangwei²

^1. School of Computer Science and Technology, Henan University of Technology, Jiaozuo 454003, China
^2. Institute of Characteristic Agriculture, Jiaozuo Academy of Agriculture and Forestry Sciences, Jiaozuo 454150, China

Received:2024-07-05 Online:2024-11-30
Foundation items:National Natural Science Foundation of China(42272178); 2024 Key Scientific Research Project of Colleges and Universities in Henan Province(24B520013); 2022 Henan Provincial Key R&D and Promotion Special Project(222102210131); Henan Polytechnic University Fundamental Research Funds Special Project (Natural Sciences)(NSFRF240508)
About author:
LU Bibo, E-mail: lubibo@hpu.edu.cn
Corresponding author:
YANG Jie, E-mail: jznlytsyjs@163.com

摘要/Abstract

摘要：

［目的/意义］ 作物叶面积是反映光合作用效率和生长状况的重要指标，建立一个品种丰富的山药图像数据集并提出一种基于深度学习的山药叶片图像分割方法，可以用于实时测定山药叶片面积，解决传统测量效率低的问题。 ［方法］ 基于改进ENet的轻量化分割网络，在ENet的基础上，裁剪掉第3阶段，减少模型中的冗余计算；将瓶颈结构里面的常规卷积用PConv替换，构成P-Bottleneck，减少模型参数量，加快推理速度；改进上采样模块中的转置卷积为双线性插值，提升模型分割精度，减少参数量；最后在模型编码阶段加入CA注意力机制模块，强化对叶片边缘语义特征的提取能力。训练时使用Adam优化器，根据历史梯度信息自适应地调节学习率，加速收敛过程，提高模型的泛化能力。 ［结果和讨论］ 改进的模型在包含40个品种的山药室内图像数据集和室外数据集上进行实验，平均交并比和均像素精度分别达到98.61%和99.32%，模型参数量下降51%，浮点运算量下降49%，并且网络运算速度提高38%。与原始模型相比，在保证分割精度的同时显著降低网络的参数量和浮点运算量，提升运行速度，减少资源占用，使其更加适合应用到农业监测设备。 ［结论］ 改进算法能够精准快速地分割山药叶片，为复杂背景下山药叶片面积的研究提供了参考依据。

关键词: 山药, 图像分割, 深度学习, ENet, 部分卷积, CA注意力机制

Abstract:

[Objective] Crop leaf area is an important indicator reflecting light absorption efficiency and growth conditions. This paper established a diverse Chinese yam image dataset and proposesd a deep learning-based method for Chinese yam leaf image segmentation. This method can be used for real-time measurement of Chinese yam leaf area, addressing the inefficiency of traditional measurement techniques. This will provide more reliable data support for genetic breeding, growth and development research of Chinese yam, and promote the development and progress of the Chinese yam industry. [Methods] A lightweight segmentation network based on improved ENet was proposed. Firstly, based on ENet, the third stage was pruned to reduce redundant calculations in the model. This improved the computational efficiency and running speed, and provided a good basis for real-time applications. Secondly, PConv was used instead of the conventional convolution in the downsampling bottleneck structure and conventional bottleneck structure, the improved bottleneck structure was named P-Bottleneck. PConv applied conventional convolution to only a portion of the input channels and left the rest of the channels unchanged, which reduced memory accesses and redundant computations for more efficient spatial feature extraction. PConv was used to reduce the amount of model computation while increase the number of floating-point operations per second on the hardware device, resulting in lower latency. Additionally, the transposed convolution in the upsampling module was improved to bilinear interpolation to enhance model accuracy and reduce the number of parameters. Bilinear interpolation could process images smoother, making the processed images more realistic and clear. Finally, coordinate attention (CA) module was added to the encoder to introduce the attention mechanism, and the model was named CBPA-ENet. The CA mechanism not only focused on the channel information, but also keenly captured the orientation and position-sensitive information. The position information was embedded into the channel attention to globally encode the spatial information, capturing the channel information along one spatial direction while retaining the position information along the other spatial direction. The network could effectively enhance the attention to important regions in the image, and thus improve the quality and interpretability of segmentation results. [Results and Discussions] Trimming the third part resulted in a 28% decrease in FLOPs, a 41% decrease in parameters, and a 9 f/s increase in FPS. Improving the upsampling method to bilinear interpolation not only reduces the floating-point operation and parameters, but also slightly improves the segmentation accuracy of the model, increasing FPS by 4 f/s. Using P-Bottleneck instead of downsampling bottleneck structure and conventional bottleneck structure can reduce mIoU by only 0.04%, reduce FLOPs by 22%, reduce parameters by 16%, and increase FPS by 8 f/s. Adding CA mechanism to the encoder could only increase a small amount of FLOPs and parameters, improving the accuracy of the segmentation network. To verify the effectiveness of the improved segmentation algorithm, classic semantic segmentation networks of UNet, DeepLabV3+, PSPNet, and real-time semantic segmentation network LinkNet, DABNet were selected to train and validate. These six algorithms got quite high segmentation accuracy, among which UNet had the best mIoU and the mPA, but the model size was too large. The improved algorithm only accounts for 1% of the FLOPs and 0.41% of the parameters of UNet, and the mIoU and mPA were basically the same. Other classic semantic segmentation algorithms, such as DeepLabV3+, had similar accuracy to improved algorithms, but their large model size and slow inference speed were not conducive to embedded development. Although the real-time semantic segmentation algorithm LinkNet had a slightly higher mIoU, its FLOPs and parameters count were still far greater than the improved algorithm. Although the PSPNet model was relatively small, it was also much higher than the improved algorithm, and the mIoU and mPA were lower than the algorithm. The experimental results showed that the improved model achieved a mIoU of 98.61%. Compared with the original model, the number of parameters and FLOPs significantly decreased. Among them, the number of model parameters decreased by 51%, the FLOPs decreased by 49%, and the network operation speed increased by 38%. [Conclusions] The improved algorithm can accurately and quickly segment Chinese yam leaves, providing not only a more accurate means for determining Chinese yam phenotype data, but also a new method and approach for embedded research of Chinese yam. Using the model, the morphological feature data of Chinese yam leaves can be obtained more efficiently, providing a reliable foundation for further research and analysis.

Key words: Chinese yam, image segmentation, deep learning, ENet, partial convolution, CA mechanism

中图分类号:

S513

芦碧波, 梁迪, 杨洁, 宋爱青, 皇甫尚卫. 基于改进ENet的复杂背景下山药叶片图像分割方法[J]. 智慧农业(中英文), 2024, 6(6): 109-120.

LU Bibo, LIANG Di, YANG Jie, SONG Aiqing, HUANGFU Shangwei. Image Segmentation Method of Chinese Yam Leaves in Complex Background Based on Improved ENet[J]. Smart Agriculture, 2024, 6(6): 109-120.

图/表 18

表1

图1

图2

表2

图3

图4

图5

图6

图7

图8

图9

表3

山药叶片图像分割消融实验

Test No.	Model	mIoU/%	mPA/%	Accuracy/%	FPS/（f/s）	Inference time/ms	FLOPs/G	Params/ $M$
0	ENet	98.58	99.24	99.57	50	20.00	2.178	0.3492
1	C-ENet	98.48	99.19	99.54	59	18.87	1.563	0.2046
2	CB-ENet	98.53	99.23	99.55	63	15.87	1.428	0.1995
3	CP-ENet	98.45	99.18	99.53	68	14.71	1.295	0.1758
4	CPB-ENet	98.49	99.25	99.53	71	14.08	1.112	0.1681
5	CPBA-ENet	98.61	99.32	99.62	69	14.49	1.114	0.1714

表3

图10

图11

图12

表4

图13

图14

参考文献 32

1	RATSIMBAZAFY M K, SHARP P A, RAZANAMPARANY L, et al. Wild edible yams from Madagascar: New insights into nutritional composition support their use for food security and conservation[J]. Food science & nutrition, 2024, 12(1): 280-291.
2	ZHOU S Y, HUANG G L, CHEN G Y. Extraction, structural analysis, derivatization and antioxidant activity of polysaccharide from Chinese yam[J]. Food chemistry, 2021, 361: ID 130089.
3	HWANG J H, PARK Y S, KIM H S, et al. Yam-derived exosome-like nanovesicles stimulate osteoblast formation and prevent osteoporosis in mice[J]. Journal of controlled release, 2023, 355: 184-198.
4	CHANG H Y, TONG X Y, YANG H Q, et al. Chinese yam (dioscorea opposita) and its bioactive compounds: The beneficial effects on gut microbiota and gut health[J]. Current opinion in food science, 2024, 55: ID 101121.
5	ZENG X X, LIU D H, HUANG L Q. Metabolome profiling of eight Chinese yam (Dioscorea polystachya Turcz.) varieties reveals metabolite diversity and variety specific uses[J]. Life, 2021, 11(7): ID 687.
6	WU Z G, JIANG W, NITIN M, et al. Characterizing diversity based on nutritional and bioactive compositions of yam germplasm (Dioscorea spp.) commonly cultivated in China[J]. Journal of food and drug analysis, 2016, 24(2): 367-375.
7	温建荣. 山药传统生产与现代生产的区别与比较[J]. 江西农业, 2018(18): 14.
	WEN J R. Difference and comparison between traditional production and modern production of yam[J]. Jiangxi agriculture, 2018(18): 14.
8	王永乐. 让"科研之花"结出山药"产业之果"[N]. 河南日报, 2024-03-17(13).
9	郝雅洁, 张吴平, 史维杰, 等. 基于计算机视觉的小麦叶面积测量[J]. 湖北农业科学, 2019, 58(16): 129-132.
	HAO Y J, ZHANG W P, SHI W J, et al. Measurement of wheat leaf area based on computer vision[J]. Hubei agricultural sciences, 2019, 58(16): 129-132.
10	GONG A P, WU X, QIU Z J, et al. A handheld device for leaf area measurement[J]. Computers and electronics in agriculture, 2013, 98: 74-80.
11	LI Z B, GUO R H, LI M, et al. A review of computer vision technologies for plant phenotyping[J]. Computers and electronics in agriculture, 2020, 176: ID 105672.
12	WENG Y, ZENG R, WU C M, et al. A survey on deep-learning-based plant phenotype research in agriculture[J]. Scientia sinica vitae, 2019, 49(6): 698-716.
13	ZHANG H C, WANG L, JIN X L, et al. High-throughput phenotyping of plant leaf morphological, physiological, and biochemical traits on multiple scales using optical sensing[J]. The crop journal, 2023, 11(5): 1303-1318.
14	李方一, 黄璜, 官春云. 作物叶面积测量的研究进展[J]. 湖南农业大学学报(自然科学版), 2021, 47(3): 274-282.
	LI F Y, HUANG H, GUAN C Y. Review on measurement of crop leaf area[J]. Journal of Hunan agricultural university (natural sciences), 2021, 47(3): 274-282.
15	崔世钢, 秦建华. 图像处理法测定油菜叶面积的研究[J]. 湖北农业科学, 2017, 56(14): 2756-2757, 2767.
	CUI S G, QIN J H. Study on the determination of leaf area of rape by image processing[J]. Hubei agricultural sciences, 2017, 56(14): 2756-2757, 2767.
16	于东玉, 冯天祥, 李奕昕, 等. 基于植物图像的活体叶片面积测量方法研究与实现[J]. 智能计算机与应用, 2019, 9(4): 173-176.
	YU D Y, FENG T X, LI Y X, et al. Research and implementation of living leaf area measurement based on plant image[J]. Intelligent computer and applications, 2019, 9(4): 173-176.
17	李秋洁, 杨远明, 袁鹏成, 等. 基于饱和度分割的叶面积图像测量方法[J]. 林业工程学报, 2021, 6(4): 147-152.
	LI Q J, YANG Y M, YUAN P C, et al. Image measurement method of leaf area based on saturation segmentation[J]. Journal of forestry engineering, 2021, 6(4): 147-152.
18	ViVEKANANTHAN V, VIGNESH R, VASANTHASEELAN S, et al. Concrete bridge crack detection by image processing technique by using the improved OTSU method[J]. Materials today: Proceedings, 2023, 74: 1002-1007.
19	YUAN H B, ZHU J J, WANG Q F, et al. An improved DeepLab v3+ deep learning network applied to the segmentation of grape leaf black rot spots[J]. Frontiers in plant science, 2022, 13: ID 795410.
20	BHAGAT S, KOKARE M, HASWANI V, et al. Eff-UNet++: A novel architecture for plant leaf segmentation and counting[J]. Ecological informatics, 2022, 68: ID 101583.
21	LU J W, LU B B, MA W L, et al. EAIS-Former: An efficient and accurate image segmentation method for fruit leaf diseases[J]. Computers and electronics in agriculture, 2024, 218: ID 108739.
22	陈从平, 钮嘉炜, 丁坤, 等. 基于深度学习的马铃薯病害智能识别[J]. 计算机仿真, 2023, 40(2): 214-217, 222.
	CHEN C P, NIU J W, DING K, et al. Intelligent identification of potato diseases based on deep learning[J]. Computer simulation, 2023, 40(2): 214-217, 222.
23	杜鹏飞, 黄媛, 高欣娜, 等. 基于语义分割的复杂背景下黄瓜叶部病害严重程度分级研究[J]. 中国农机化学报, 2023, 44(11): 138-147.
	DU P F, HUANG Y, GAO X N, et al. Research on cucumber leaf disease severity classification in complex background based on semantic segmentation[J]. China agricultural machinery chemistry, 2023, 44(11): 138-147.
24	RONNEBERGER O, FISCHER P, BROX T. U-net: Convolutional networks for biomedical image segmentation[M]// NAVAB N, HORNEGGER J, WELLS W M, et al, eds. Lecture Notes in Computer Science. Cham: Springer International Publishing, 2015: 234-241.
25	BADRINARAYANAN V, KENDALL A, CIPOLLA R. SegNet: A deep convolutional encoder-decoder architecture for image segmentation[J]. IEEE transactions on pattern analysis and machine intelligence, 2017, 39(12): 2481-2495.
26	管博伦, 张立平, 朱静波, 等. 农业病虫害图像数据集构建关键问题及评价方法综述[J]. 智慧农业(中英文), 2023, 5(3): 17-34.
	GUAN B L, ZHANG L P, ZHU J B, et al. The key issues and evaluation methods for constructing agricultural pest and disease image datasets: A review[J]. Smart agriculture, 2023, 5(3): 17-34.
27	PASZKE A, CHAURASIA A, KIM S, et al. ENet: A deep neural network architecture for real-time semantic segmentation[EB/OL]. arXiv: 1606.02147, 2016.
28	CHEN J R, KAO S H, HE H, et al. Run, don't walk: Chasing higher FLOPS for faster neural networks[C]// 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway, New Jersey, USA: IEEE, 2023: 12021-12031.
29	KIM K H, SHIM P S, SHIN S. An alternative bilinear interpolation method between spherical grids[J]. Atmosphere, 2019, 10(3): ID 123.
30	GUO M H, XU T X, LIU J J, et al. Attention mechanisms in computer vision: A survey[J]. Computational visual media, 2022, 8(3): 331-368.
31	NIU Z Y, ZHONG G Q, YU H. A review on the attention mechanism of deep learning[J]. Neurocomputing, 2021, 452: 48-62.
32	HOU Q B, ZHOU D Q, FENG J S. Coordinate attention for efficient mobile network design[C]// 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway, New Jersey, USA: IEEE, 2021: 13713-13722.

类别	初始数量/张	数据增强	最终数量/张
室内	1 077	是	1 500
室外	129	是	1 032
总计	1 206	—	2 532

[1]	马六, 毛克彪, 郭中华. 基于混合注意力生成对抗网络的遥感图像去雾方法[J]. 智慧农业(中英文), 2025, 7(2): 172-182.
[2]	马巍巍, 陈悦, 王咏梅. 基于深度网络集成的复杂背景甘蔗叶片病害识别[J]. 智慧农业(中英文), 2025, 7(1): 136-145.
[3]	许世卫, 李乾川, 栾汝朋, 庄家煜, 刘佳佳, 熊露. 农产品市场监测预警深度学习智能预测方法[J]. 智慧农业(中英文), 2025, 7(1): 57-69.
[4]	杨信廷, 胡焕, 陈晓, 李汶政, 周子洁, 李文勇. 多源场景下粘虫板小目标害虫轻量化检测识别模型[J]. 智慧农业(中英文), 2025, 7(1): 111-123.
[5]	宫宇, 王玲, 赵荣强, 尤海波, 周沫, 刘劼. 基于多模态数据表型特征提取的番茄生长高度预测方法[J]. 智慧农业(中英文), 2025, 7(1): 97-110.
[6]	齐梓均, 牛当当, 吴华瑞, 张礼麟, 王仑峰, 张宏鸣. 基于双维信息与剪枝的中文猕猴桃文本命名实体识别方法[J]. 智慧农业(中英文), 2025, 7(1): 44-56.
[7]	张辉, 胡军, 石航, 刘昶希, 吴淼. 融合远端深度学习识别模型的白菜株心精准对靶喷雾系统[J]. 智慧农业(中英文), 2024, 6(6): 85-95.
[8]	傅卓军, 胡政, 邓阳君, 龙陈锋, 朱幸辉. 基于Deep-Semi-NMF的苹果斑点落叶病检测方法[J]. 智慧农业(中英文), 2024, 6(6): 144-154.
[9]	罗友璐, 潘勇浩, 夏顺兴, 陶友志. 基于改进YOLOv8的苹果叶病害轻量化检测算法[J]. 智慧农业(中英文), 2024, 6(5): 128-138.
[10]	刘伊, 张彦军. ReluformerN：轻量化高低频增强高光谱农业地物分类方法[J]. 智慧农业(中英文), 2024, 6(5): 74-87.
[11]	年悦, 赵凯旋, 姬江涛. 基于改进DeepLabCut模型的奶牛滑蹄检测方法[J]. 智慧农业(中英文), 2024, 6(5): 153-163.
[12]	张岩琪, 周硕, 张凝, 柴秀娟, 孙坦. 基于改进实例分割算法的区域养殖生猪计数系统[J]. 智慧农业(中英文), 2024, 6(4): 53-63.
[13]	翁智, 范琦, 郑志强. 基于多模态图像信息及改进实例分割网络的肉牛体尺自动测量方法[J]. 智慧农业(中英文), 2024, 6(4): 64-75.
[14]	范铭铄, 周平, 李淼, 李华龙, 刘先旺, 麻之润. 羊场自动导航喷药机器人设计与实验[J]. 智慧农业(中英文), 2024, 6(4): 103-115.
[15]	侯依廷, 饶元, 宋贺, 聂振君, 王坦, 何豪旭. 复杂大田场景下基于改进YOLOv8的小麦幼苗期叶片数快速检测方法[J]. 智慧农业(中英文), 2024, 6(4): 128-137.

基于改进ENet的复杂背景下山药叶片图像分割方法

Image Segmentation Method of Chinese Yam Leaves in Complex Background Based on Improved ENet

在线阅读

知网下载

本地下载

可视化

摘要/Abstract

引用本文

使用本文

图/表 18

参考文献 32

相关文章 15

编辑推荐

Metrics

本文评价

序号	品种名称	数量/张	序号	品种名称	数量/张	序号	品种名称	数量/张	序号	品种名称	数量/张
1	砀山山药	45	11	嵩野2号	49	21	太和长芋	42	31	南京采药	34
2	梅岱山药	39	12	靳家岭山药	37	22	安顺山药	48	32	山东牛腿米	36
3	僵野1号	47	13	惠楼山药	48	23	山王庄铁棍	30	33	泌阳野山药	40
4	苏北淮山药	45	14	辉县太行山药	50	24	宿生野山药	46	34	太古8号	30
5	日本山药	34	15	临泉笨山药	37	25	平遥山药	30	35	新城细毛	33
6	太原8号	34	16	双胞山药	33	26	山西榆次山药	43	36	怀山药1号	34
7	温科3号	37	17	2018 -1号山药	46	27	四川雅山药	41	37	铁棍雌株	33
8	安顺2号	46	18	桑县10号	34	28	砀山山药2号	41	38	神农山山药	36
9	小白嘴山药	36	19	铁棍山药1号	31	29	白玉山药	39	39	日本白山药	35
10	安顺5号	32	20	丰县铁棍山药	41	30	白皮山药	36	40	陇山药1号	30

Model	mIoU/%	mPA/%	Accuracy/%	FPS/（f/s）	FLOPs/G	Params/M
UNet	99.09	99.57	99.72	17	92.0	43.9
DeepLabV3+	98.58	99.35	99.57	31	83.4	54.7
PSPNet	97.45	98.67	99.22	23	61.6	49.1
LinkNet	98.65	99.41	99.63	58	12.1	11.5
DABNet	98.23	99.26	99.05	62	5.3	0.8
CBPA-ENet	98.61	99.32	99.62	69	1.1	0.2