Lodging Region Detection Method in Flax Based on Lightweight Improved YOLOv11n-seg Model

doi:10.12133/j.smartag.SA202508013

Abstract

[Objective] Lodging is a major agronomic constraint that adversely affects both yield and quality in field crops, and flax (Linum usitatissimum L.) is especially vulnerable because of its slender stems and susceptibility to wind and rainfall. Precise delineation of lodged areas from field imagery remains difficult owing to the complex and heterogeneous morphology of lodging patterns, irregular and blurred boundaries, and substantial background interference from upright plants, weeds, and soil textures. These factors call for a segmentation framework that combines high precision and strong boundary adherence with computational efficiency, so that it can be deployed on resource-constrained agricultural monitoring platforms. In response to this need, a lightweight, accurate lodging segmentation approach based on an improved YOLOv11n-seg architecture was proposed to enhance fine-grained feature sensitivity, multi-scale representation capability, and boundary precision, while markedly reducing parameter count, giga floating-point operations (GFLOPs), and model size.

[Methods] The proposed architecture integrated targeted modifications across the backbone, neck, and output stages. In the backbone, standard C3k2 modules were replaced with C3k2_SDW blocks, which combined a StarBlock structure with depthwise separable convolutions to reduce redundancy and computation without sacrificing spatial and contextual representational capacity. To counteract the potential loss of channel discrimination caused by lightweighting, a multi-scale efficient channel attention (MS-ECA) mechanism was embedded within selected backbone layers, yielding C3k2_SDW_MS-ECA modules. These modules incorporated parallel convolution branches with varying kernel sizes to capture channel-wise dependencies across multiple receptive fields, thereby adaptively recalibrating lodging-related features with minimal computational overhead. In the neck, a bidirectional feature pyramid network (BiFPN) was introduced to facilitate efficient bidirectional information exchange between scales. By assigning normalized, trainable fusion weights, the BiFPN adaptively balanced contributions from low- and high-level feature maps, while a multi-stage semantic fusion strategy further enriched the integration of spatial details and contextual semantics, thereby improving the detection of small and fragmented lodged patches. At the output stage, a boundary refinement procedure was applied to the predicted masks, improving contour sharpness, enhancing boundary compactness, and mitigating false detections in complex visual environments.

The experimental dataset comprised unmanned aerial vehicle (UAV) RGB imagery at a resolution of 4 032×2 268 pixels, acquired from flax fields in Dingxi, Gansu Province. Lodged regions were manually annotated with polygonal masks. To increase robustness against variability in illumination, background complexity, and lodging morphology, data augmentation techniques, including random rotation, brightness and contrast adjustment, and blurring, were employed, expanding the dataset from 107 to 3 852 images. The dataset was divided into training, validation, and testing subsets in a 75%/15%/10% split. Model training was conducted with 640×640 pixel inputs for 300 epochs using stochastic gradient descent (initial learning rate 0.01, momentum 0.937, weight decay 0.000 5) in PyTorch 2.0.0. Evaluation involved comparison with YOLACT, YOLOv7-seg, YOLOv8n-seg, and the original YOLOv11n-seg using precision (P), recall (R), mAP@0.5, mAP@0.5:0.95, parameter count, GFLOPs, and model size.

[Results and Discussions] Ablation experiments demonstrated the incremental contributions of each architectural component. Substituting C3k2 with C3k2_SDW reduced parameters from 2.83 M to 2.14 M and computation from 10.2 to 8.1 GFLOPs, while slightly improving precision and recall. Incorporating BiFPN further lowered complexity to 1.68 M parameters and 7.7 GFLOPs, accompanied by notable gains in detection metrics. The addition of MS-ECA attention achieved the highest performance, delivering P of 92.6%, R of 92.0%, and mAP@0.5 of 95.2%, corresponding to improvements of 3.7 percentage points in precision and 2.1 percentage points in mAP@0.5 over the YOLOv11n-seg baseline, without increasing model size. Qualitative Grad-CAM visualizations revealed more precise focus on lodging regions and reduced false activations in upright stems and non-lodged soil areas. Generalization capability was further validated on the public WE3DS agricultural segmentation dataset, where the proposed model achieved average improvements of 4.3, 1.9, and 2.6 percentage points in precision, recall, and mAP@0.5, respectively, compared to the baseline.

[Conclusions] The improved YOLOv11n-seg architecture achieves a superior balance between accuracy and efficiency for flax lodging segmentation by combining the C3k2_SDW_MS-ECA backbone, BiFPN with multi-stage semantic fusion in the neck, and output boundary refinement. This combination of high accuracy, lightweight design, and robust boundary delineation makes the model well suited to real-time, in-field deployment for intelligent lodging monitoring and precision agriculture. The results further suggest that the approach is transferable to broader agricultural segmentation tasks, providing a practical and scalable solution for modern smart farming applications.
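
The normalized, trainable fusion weights mentioned for the BiFPN neck can be illustrated with a small sketch. It assumes the "fast normalized fusion" rule commonly associated with BiFPN (each weight clipped at zero, then divided by the sum of all weights plus a small constant); the abstract does not state which normalization the authors used, and the function name, the `eps` value, and the toy feature vectors are all hypothetical.

```python
from typing import List, Sequence


def fast_normalized_fusion(features: Sequence[Sequence[float]],
                           raw_weights: Sequence[float],
                           eps: float = 1e-4) -> List[float]:
    """Fuse same-length feature vectors with non-negative, normalized weights.

    Assumption: this follows the fast normalized fusion rule; it is an
    illustration, not the authors' implementation.
    """
    if len(features) != len(raw_weights):
        raise ValueError("one weight per input feature is required")
    # Clipping at zero (ReLU) keeps every trainable weight non-negative.
    weights = [max(0.0, w) for w in raw_weights]
    # eps keeps the division stable when all weights are zero.
    total = sum(weights) + eps
    normalized = [w / total for w in weights]
    # Weighted sum of the inputs, element by element.
    return [sum(w * f[i] for w, f in zip(normalized, features))
            for i in range(len(features[0]))]


# Toy example: a low-level and a high-level feature vector (made-up values).
low_level = [1.0, 2.0, 3.0]
high_level = [3.0, 2.0, 1.0]
fused = fast_normalized_fusion([low_level, high_level], raw_weights=[1.0, 3.0])
print([round(v, 3) for v in fused])  # [2.5, 2.0, 1.5]
```

With raw weights of 1 and 3, the high-level input contributes roughly three quarters of the fused value, which is the kind of adaptive balancing between scales that the abstract describes. In a real network the raw weights would be learned parameters and the inputs would be feature maps rather than short lists.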

Key words: flax, image segmentation, lightweight model, lodging detection, YOLOv11n-seg, attention mechanism

CLC Number: TP391.41

SU Yujie, LI Yue, WEI Linjing, WU Bing, GUO Linhai, YAN Bin, ZHOU Hui, GAO Yuhong, KANG Lianghe, LIU Huan, SU Shunchang. Lodging Region Detection Method in Flax Based on Lightweight Improved YOLOv11n-seg Model[J]. Smart Agriculture, 2026, 8(2): 35-47.

Original dataset	Augmented training set	Augmented validation set	Augmented test set	Augmented total
107	2 889	577	386	3 852
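
The subset sizes in the table above are consistent with a 75%/15%/10% partition of 3 852 images. A minimal sketch of such a split follows, assuming a seeded random shuffle with floor rounding and the remainder assigned to the test set; the source does not describe how the partition was actually drawn, and the function name, seed, and placeholder image IDs are hypothetical.

```python
import random


def split_dataset(items, train_frac=0.75, val_frac=0.15, seed=0):
    """Shuffle items reproducibly and cut them into train/val/test lists."""
    shuffled = list(items)
    random.Random(seed).shuffle(shuffled)
    n_train = int(len(shuffled) * train_frac)  # floor of the training share
    n_val = int(len(shuffled) * val_frac)      # floor of the validation share
    train = shuffled[:n_train]
    val = shuffled[n_train:n_train + n_val]
    test = shuffled[n_train + n_val:]          # remainder goes to the test set
    return train, val, test


# Placeholder IDs standing in for the 3 852 augmented images.
image_ids = [f"img_{i:04d}" for i in range(3852)]
train, val, test = split_dataset(image_ids)
print(len(train), len(val), len(test))  # 2889 577 386
```

Under these assumptions the subset sizes reproduce the counts reported in the table.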

Model	P/%	R/%	mAP@0.5/%	Boundary IoU/%	Parameters/M	Computation/GFLOPs
Baseline	88.9	86.2	93.1	80.3	2.83	10.2
C3k2_SDW	89.8	87.6	93.0	79.8	2.14	8.1
C3k2_SDW+BiFPN	90.1	91.5	93.8	81.0	1.68	7.7
Improved model	92.6	92.0	95.2	82.3	1.73	8.0
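
The gains and savings implied by the ablation figures above can be checked with a few lines of arithmetic. The sketch below copies the baseline and improved-model rows from the table; the dictionary keys and helper names are illustrative only.

```python
# Baseline and improved-model rows copied from the ablation table.
baseline = {"P": 88.9, "R": 86.2, "mAP50": 93.1, "params_M": 2.83, "gflops": 10.2}
improved = {"P": 92.6, "R": 92.0, "mAP50": 95.2, "params_M": 1.73, "gflops": 8.0}


def point_gain(metric):
    """Absolute gain in percentage points for an accuracy metric."""
    return round(improved[metric] - baseline[metric], 1)


def relative_reduction(metric):
    """Relative saving, in percent of the baseline, for a cost metric."""
    return round(100 * (baseline[metric] - improved[metric]) / baseline[metric], 1)


gains = {m: point_gain(m) for m in ("P", "R", "mAP50")}
savings = {m: relative_reduction(m) for m in ("params_M", "gflops")}
print(gains)    # {'P': 3.7, 'R': 5.8, 'mAP50': 2.1}
print(savings)  # {'params_M': 38.9, 'gflops': 21.6}
```

The precision and mAP@0.5 gains match the 3.7 and 2.1 percentage points quoted in the abstract, and the table rows correspond to roughly 39% fewer parameters and 22% fewer GFLOPs than the baseline.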

Model	P/%	R/%	mAP@0.5/%	Boundary IoU/%	Parameters/M	Computation/GFLOPs	Model size/MB	Inference speed/(frames/s)
YOLACT	56.8	59.6	60.3	65.5	9.38	35.0	37.5	38.10
YOLOv7-seg	95.1	93.0	95.0	83.9	37.84	141.9	73.0	87.31
YOLOv8n-seg	92.2	87.2	93.4	78.5	3.26	12.0	6.8	111.18
YOLOv11n-seg	88.9	86.2	93.1	80.3	2.83	10.2	6.0	98.72
Improved model	92.6	92.0	95.2	82.3	1.73	8.0	3.8	119.20

Model	Boundary IoU/%	Parameters/M	Computation/GFLOPs	Inference speed/(frames/s)
U-Net	77.2	24.90	44.1	68.93
DeepLabv3+	83.9	54.40	53.0	50.00
Improved model	82.3	1.73	8.0	119.20

Lodging grade	Lodging area ratio
Grade 0 (no lodging)	No lodging
Grade 1 (mild lodging)	0% to <15%
Grade 2 (moderate lodging)	15% to <45%
Grade 3 (severe lodging)	45% and above
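
The grading thresholds above can be applied to a predicted mask roughly as follows. This is a sketch under two assumptions that the table does not spell out: the lodging area ratio is taken as lodged pixels divided by all pixels in the mask, and a ratio of exactly 0% is treated as Grade 0 even though the Grade 1 range nominally starts at 0%. The function names and the toy mask are hypothetical.

```python
def lodging_area_ratio(mask):
    """Percentage of pixels flagged as lodged in a binary mask (rows of 0/1)."""
    total = sum(len(row) for row in mask)
    if total == 0:
        raise ValueError("mask must contain at least one pixel")
    lodged = sum(sum(row) for row in mask)
    return 100.0 * lodged / total


def lodging_grade(ratio_percent):
    """Map a lodging area ratio (in percent) to a grade from 0 to 3."""
    if not 0 <= ratio_percent <= 100:
        raise ValueError("ratio must lie between 0 and 100")
    if ratio_percent == 0:   # assumption: no lodged pixels means Grade 0
        return 0
    if ratio_percent < 15:   # mild lodging
        return 1
    if ratio_percent < 45:   # moderate lodging
        return 2
    return 3                 # severe lodging, 45% and above


# Toy 4x4 mask with 6 lodged pixels out of 16.
mask = [
    [0, 0, 1, 1],
    [0, 1, 1, 1],
    [0, 0, 0, 1],
    [0, 0, 0, 0],
]
ratio = lodging_area_ratio(mask)
print(ratio, lodging_grade(ratio))  # 37.5 2
```

In practice the mask would come from the segmentation model's output for a plot or image tile, and the denominator might be restricted to the flax-growing area rather than the whole frame.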