CD-YOLO: A Method for Detecting Carrot Seedlings in Field Based on Improved YOLOv11s

doi:10.12133/j.smartag.SA202511008

Abstract

Abstract:

[Objective] In field environments under natural conditions, leaf occlusion and mutual plant shading pose significant challenges to the accurate identification of carrot seedlings. Furthermore, practical agricultural applications often rely on edge devices with limited computational power, necessitating a detection model that combines lightweight design, high accuracy, and robust anti-occlusion capability. The purpose of this research is to develop a robust recognition method for carrot seedlings suitable for complex field conditions, thereby enhancing the accuracy and efficiency of seedling emergence statistics in automated seedling raising processes and providing reliable technical support for precise farm management. [Methods] The CD-YOLO (Carrot Detection-You Only Look Once), a lightweight detection model was proposed based on an improved YOLOv11s. First, to reduce model complexity, several standard convolutions in the backbone network were replaced with depthwise separable convolutions (DWConv), thereby decreasing floating-point operations (FLOPs) and the number of parameters, establishing a lightweight foundation for edge deployment. Secondly, the efficient multi scale attention (EMA) mechanism was embedded into the critical feature extraction module C3k2, constructing a C3k2_EMA module. This module enhanced dynamic perception of local key features and reconstructed cross-scale contextual dependencies broken by occlusion through its parallel multi-branch structure, effectively suppressing background and occlusion noise. Finally, the DynamicHead detection head was introduced. Leveraging its scale-aware and spatial-aware mechanisms, it achieved a dynamic fusion of multi-level features and adaptive weight adjustment, further improving the model's decision-making robustness in complex scenes. To comprehensively evaluate model performance, a carrot seedling dataset covering various field scenarios was independently constructed. Through offline data augmentation, the original 1 274 images were expanded to 4 796, which were then split into training, validation, and test sets in an 8:1:1 ratio. Meanwhile, to systematically quantify the model's anti-occlusion performance, an occlusion severity assessment criterion based on the overlapping area of bounding boxes was proposed. Targets were categorized into three occlusion levels: mild, moderate, and severe. Based on this, a dedicated "Occlusion Test Subset" was separated from the main test set, providing an objective and reproducible benchmark for evaluating the model's anti-occlusion capability. [Results and Discussions] Experimental results on the custom dataset demonstrated that CD-YOLO comprehensively improved detection performance while maintaining its lightweight characteristics. Compared to the baseline model YOLOv11s, CD-YOLO reduced computational load by 6.2 GFLOPs (a 28.8% decrease), decreased model size by 4.8 MB (a 25.0% reduction), improved single-image inference speed by 4.7 ms, reaching 9.6 ms. Concurrently, precision, recall, and mean average precision (mAP_0.5) increased by 3.0, 1.5, and 2.4 percentage points, respectively, ultimately reaching 81.2%, 76.4%, and 84.0%. In comparisons with other lightweight backbone networks like MobileNetv3 and ShuffleNetv2, CD-YOLO consistently outperformed them on the accuracy-speed comprehensive metric, validating the effectiveness of its improvement strategies. In occlusion performance tests, the missed detection rate of CD-YOLO on the occlusion test subset was 13.4%, a 5.7 percentage points decrease compared to YOLOv11s. Its mAP_0.5 on the occlusion subset reached 80.6%, a 5.1 percentage points improvement over the baseline, whereas the improvement on the regular subset was 1.8 percentage points, proving the model's enhanced efficacy in occlusion scenarios. After deploying the model on an NVIDIA Jetson Orin NX edge device and accelerating it with TensorRT, the inference frame rate increased to 32.5 f/s. On random test images, CD-YOLO achieved missed detection and false detection rates of 5.1% and 2.7%, respectively, representing decreases of 7.7% and 2.6% compared to YOLOv11s, demonstrating promising practical application potential. Ablation studies and feature map visualizations further indicated that DWConv, C3k2_EMA, and DynamicHead formed a synergistic optimization loop: DWConv achieved computational compression, freeing up computational budget for subsequent modules; C3k2_EMA enhanced local perception and contextual reconstruction of occluded targets during the feature extraction stage; and DynamicHead performed dynamic fusion of multi-scale features at the decision-making end. Together, they ensured high-precision detection of incomplete targets under limited computational resources. [Conclusions] Through the synergistic design of "lightweighting, feature enhancement, and dynamic fusion", the CD-YOLO model achieved an excellent balance between computational efficiency, detection accuracy, and anti-occlusion capability. The model not only significantly reduced reliance on the computational power of edge devices but also effectively improved robustness and adaptability in complex field environments through structured attention and dynamic fusion mechanisms.

Key words: carrot seedlings, occlusion, object detection, YOLOv11s, lightweight

CLC Number:

TP391.4
S24

LIU Haoran, WANG Yu, ZHAO Xueguan, WU Huarui, FU Hao, PANG Shujie, ZHAI Changyuan. CD-YOLO: A Method for Detecting Carrot Seedlings in Field Based on Improved YOLOv11s[J]. Smart Agriculture, 2026, 8(2): 158-174.

Figures/Tables 20

Fig. 1

Fig. 2

Table 1

Fig. 3

Fig. 4

Fig. 5

Table 2

Table 3

Fig. 6

Fig.7

Fig. 8

Table 4

Table 5

Table6

Table 7

Fig. 9

Table 8

Table 9

Fig. 10

Fig. 11

References 32

[1]	刘星, 黄建新, 欧承刚, 等.胡萝卜根色及其色素组分的遗传和育种研究进展[J]. 植物遗传资源学报, 2022,23(5): 1241-1248.
	LIU X, HUANG J X, OU C G, et al. Current advances on inheritance and breeding of carrot root color and its pigment components[J]. Journal of Plant Genetic Resources, 2022,23(5): 1241-1248.
[2]	赵童, 米月花, 籍镭钒, 等. 胡萝卜收割机的结构优化设计[J]. 工程机械, 2025, 56(6): 154-156, I0008.
	ZHAO T, MI Y H, JI L F, et al. Structural optimization design of carrot harvester[J]. Construction Machinery and Equipment, 2025, 56(6): 154-156, I0008.
[3]	张清蓉, 王国栋, 赵正伟, 等. 基于自动控制技术的胡萝卜种植收割一体机设计[J]. 南方农机, 2024, 55(21): 46-50.
	ZHANG Q R, WANG G D, ZHAO Z W, et al. Design of carrot planting and harvesting integrated machine based on automatic control technology[J]. South Agricultural Machinery, 2024, 55(21): 46-50.
[4]	倪建功, 李娟, 邓立苗, 等. 基于知识蒸馏的胡萝卜外观品质等级智能检测[J]. 农业工程学报, 2020, 36(18): 181-187.
	NI J G, LI J, DENG L M, et al. Intelligent detection of carrot appearance quality grade based on knowledge distillation[J]. Transactions of the Chinese Society of Agricultural Engineering, 2020, 36(18): 181-187.
[5]	XIE W J, WEI S, ZHENG Z H, et al. Recognition of defective carrots based on deep learning and transfer learning[J]. Food and Bioprocess Technology, 2021, 14(7): 1361-1374.
[6]	王春桃, 梁炜健, 郭庆文, 等. 农业害虫智能视觉检测研究综述[J]. 中国农机化学报, 2023, 44(7): 207-213.
	WANG C T, LIANG W J, GUO Q W, et al. Summary of research on intelligent vision detection of agricultural pests[J]. Journal of Chinese Agricultural Mechanization, 2023, 44(7): 207-213.
[7]	黄友锐, 王小桥, 韩涛, 等. 基于改进YOLOv8n的甜菜杂草检测算法研究[J]. 江苏农业科学, 2024, 52(24): 196-204.
	HUANG Y R, WANG X J, HAN T, et al. A detection method for sugar beets and weeds based on improved YOLOv8n algorithm[J]. Jiangsu Agricultural Sciences, 2024, 52(24): 196-204.
[8]	曲福恒, 李金状, 杨勇, 等. 基于改进DeepLabv3+的轻量化作物杂草识别方法[J]. 石河子大学学报(自然科学版), 2024, 42(1): 117-125.
	QU F H, LI J Z, YANG Y, et al. Lightweight crop and weed recognition method based on imporved DeepLabv3+[J]. Journal of Shihezi University (Natural Science), 2024, 42(1): 117-125.
[9]	NIU L T, SU W H, ZHANG H Y, et al. Development of intelligent equipment for weed identification and variable spraying in lettuce fields based on instance segmentation framework[J]. Engineering Applications of Artificial Intelligence, 2025, 159: 111634.
[10]	孟庆宽, 张漫, 杨晓霞, 等. 基于轻量卷积结合特征信息融合的玉米幼苗与杂草识别[J]. 农业机械学报, 2020, 51(12): 238-245, 303.
	MENG Q K, ZHANG M, YANG X X, et al. Recognition of maize seedling and weed based on light weight convolution and feature fusion[J]. Transactions of the Chinese Society for Agricultural Machinery, 2020, 51(12): 238-245, 303.
[11]	张志远, 罗铭毅, 郭树欣, 等. 基于改进YOLOv5的自然环境下樱桃果实识别方法[J]. 农业机械学报, 2022, 53(S1): 232-240.
	ZHANG Z Y, LUO M Y, GUO S X, et al. Cherry fruit detection method in natural scene based on improved YOLOv5[J]. Transactions of the Chinese Society for Agricultural Machinery, 2022, 53(S1): 232-240.
[12]	ZHANG B Y, ZHANG F K, AN S, et al. SCORE-DETR: An efficient Transformer-based network for small and occluded citrus detection[J]. Computers and Electronics in Agriculture, 2025, 238: 110843.
[13]	汤晨, 刘振青, 邵阳, 等. 基于改进YOLOv11n的密集遮挡环境百香果识别方法[J]. 农业机械学报, 2026, 57 (5): 167-176.
	TANG C, LIU Z Q, SHAO Y, et al. Passion fruit recognition method in densely occluded environments based on improved YOLOv11n[J]. Transactions of the Chinese Society for Agricultural Machinery, 2026, 57 (5): 167-176.
[14]	李文峰, 胡世康, 杨琳琳, 等. 基于轻量化YOLOv4对不同遮挡程度成熟番茄果实的识别[J]. 云南农业大学学报(自然科学版), 2024(4): 184-189.
	LI W F, HU S K, YANG L L, et al. Recognition of mature tomato fruits with different occlusion degrees based on lightweight YOLOv4[J]. Journal of Yunnan Agricultural University (Natural Science), 2024(4): 184-189.
[15]	王元昊, 娄欢欢, 罗红品, 等. 基于改进YOLOv8算法对被遮挡柑橘的识别与定位优化[J]. 西南大学学报(自然科学版), 2025, 47(2): 171-183.
	WANG Y H, LOU H H, LUO H P, et al. Recognition and location optimization of shaded Citrus based on improved YOLOv8 algorithm [J]. Journal of Southwest University (Natural Science), 2025, 47(2): 171-183.
[16]	李会, 郭家文, 黄世醒, 等. 基于改进YOLOv7的甘蔗幼苗检测方法试验研究[J]. 农机化研究, 2025, 47(9): 146-154.
	LI H, GUO J W, HUANG S X, et al. Experiment on sugarcane seedling detection method based on improved YOLOv7[J]. Journal of Agricultural Mechanization Research, 2025, 47(9): 146-154.
[17]	郑健林, 黄世醒, 郑丁科, 等. 基于改进YOLOv5的机收蔗含杂率检测方法试验研究[J]. 农机化研究, 2026, 48(2): 217-224.
	ZHENG J L, HUANG S X, ZHENG D K, et al. Experimental study on impurity content detection method of machine－harvested sugarcane based on improved YOLOv5[J]. Journal of Agricultural Mechanization Research, 2026, 48(2): 217-224.
[18]	牛子昂, 裘正军. 基于改进YOLOv11-Pose的玉米植株骨架及表型参数提取方法[J]. 智慧农业(中英文), 2025(2): 95-105.
	NIU Z A, QIU Z J. Extraction method of maize plant skeleton and phenotypic parameters based on improved YOLOv11-Pose[J]. Smart Agriculture, 2025(2): 95-105.
[19]	谭泗桥, 陈涵, 朱磊, 等. 基于改进YOLOv8m的稻田害虫识别方法[J]. 农业工程学报, 2025, 41(2): 185-195.
	TAN S Q, CHEN H, ZHU L, et al. Identification method of rice pests based on improved YOLOv8m[J]. Transactions of the Chinese Society of Agricultural Engineering, 2025, 41(2): 185-195.
[20]	HOWARD A G, ZHU M L, CHEN B, et al. MobileNets: Efficient convolutional neural networks for mobile vision applications[EB/OL]. arXiv: 1704.04861, 2017.
[21]	李亚, 蒋晨, 王海瑞, 等. 基于EDW-YOLOv8的棉花叶片病害检测[J]. 华中农业大学学报, 2025, 44(5): 189-197.
	LI Y, JIANG C, WANG H R, et al. Cotton leaf disease detection based on EDW-YOLOv8[J]. Journal of Huazhong Agricultural University, 2025, 44(5): 189-197.
[22]	DENG L, MIAO Z H, ZHAO X G, et al. HAD-YOLO: An accurate and effective weed detection model based on improved YOLOV5 network [J]. Agronomy, 2025, 15(1): 57.
[23]	DENG J L, LIANG Q, HE J J, et al. Flavor grading of zanthoxylum based on computer vision-multi-chromatography fusion [J]. Journal of Food Composition and Analysis, 2025, 148: 108323.
[24]	刘坤, 吉宏亚, 黄程菲, 等. 基于改进YOLOv5s的番茄成熟度识别技术研究[J]. 中国农机化学报, 2025, 46(5): 79-85.
	LIU K, JI H Y, HUANG C F, et al. Research on tomato maturity recognition technology based on improved YOLOv5s[J]. Journal of Chinese Agricultural Mechanization, 2025, 46(5): 79-85.
[25]	曹玉莹, 刘银川, 高新悦, 等. LightTassel-YOLO:一种基于无人机遥感的玉米雄穗实时检测方法(英文)[J]. 智慧农业(中英文), 2025, 7(6): 96-110.
	CAO Y Y, LIU Y C, GAO X Y, et al. LightTassel-YOLO: A real-time detection method for maize tassels based on UAV remote sensing[J]. Smart Agriculture, 2025, 7(6): 96-110.
[26]	李大华, 孔舒, 李栋, 等. 基于改进SSD模型的柑橘叶片病害轻量化检测模型[J]. 浙江农业学报, 2024, 36(3): 662-670.
	LI D H, KONG S, LI D, et al. Lightweight detection model of citrus leaf diseases based on improved SSD model[J]. Acta Agriculturae Zhejiangensis, 2024, 36(3): 662-670.
[27]	REN S Q, HE K M, GIRSHICK R, et al. Faster R-CNN: Towards real-time object detection with region proposal networks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017,39(6): 1137-1149.
[28]	WEN F, WU H, ZHANG X X, et al. Accurate recognition and segmentation of northern corn leaf blight in drone RGB Images: A CycleGAN-augmented YOLOv5-Mobile-Seg lightweight network approach[J]. Computers and Electronics in Agriculture, 2025, 236: 110433.
[29]	JIA X F, HUA Z L, SHI H T, et al. A soybean pod accuracy detection and counting model based on improved YOLOv8[J]. Agriculture, 2025, 15(6): 617.
[30]	LIU H R, WANG Y, ZHAI C Y, et al. DWG-YOLOv8: A lightweight recognition method for broccoli in multi-scene field environments based on improved YOLOv8s[J]. Agronomy, 2025, 15(10): 2361.
[31]	李茂, 肖洋轶, 宗望远, 等. 基于改进YOLOv8模型的轻量化板栗果实识别方法[J]. 农业工程学报, 2024, 40(1): 201-209.
	LI M, XIAO Y Y, ZONG W Y, et al. Detecting chestnuts using improved lightweight YOLOv8[J]. Transactions of the Chinese Society of Agricultural Engineering, 2024, 40(1): 201-209.
[32]	DHASARATHAN C, GNANASEKARAN S, PATTANAYAK A, et al. Tensor RT optimized driver drowsiness detection system using edge device[J]. Ain Shams Engineering Journal, 2025, 16(10): 103620.

类别	图像数量/张	标签数量/张
训练集	3 837	43 880
验证集	480	6 156
测试集	479	6 867

训练参数	值
训练周期	100
初始学习率	0.001
优化器	Adam
每批次图像数量	24
动量	0.937
优化器权重衰减系数	0.000 5
输入尺寸	640×640

试验编号	改进方式			P/%	R/%	mAP_0.5/%	计算量/GFLOPs	模型大小/MB	单张图片处理时间/ms
试验编号	DWConv	C3k2_EMA	DynamicHead	P/%	R/%	mAP_0.5/%	计算量/GFLOPs	模型大小/MB	单张图片处理时间/ms
1	×	×	×	78.2	74.9	81.6	21.5	19.2	14.3
2	√	×	×	79.8	73.6	81.4	14.8	13.8	11.5
3	×	√	×	78.7	77.1	82.9	21.3	19.2	11.7
4	×	×	√	79.9	76.7	83.7	21.5	19.8	12.3
5	√	√	×	79.7	74.5	81.7	14.7	13.9	10.6
6	√	×	√	79.0	74.9	81.8	15.0	14.4	9.8
7	×	√	√	80.2	75.1	83.2	21.6	19.8	11.6
8	√	√	√	81.2	76.4	84.0	15.3	14.4	9.6

主干网络	P/%	R/%	mAP_0.5/%	计算量/GFLOPs	参数量	模型大小/MB	单张图片处理时间/ms
MobileNetv3	76.2	73.0	79.9	13.2	7.21×10⁶	16.9	14.6
ShuffleNetv2	73.0	72.3	77.8	10.4	5.34×10⁶	11.0	12.2
EfficientVit	76.0	74.5	79.7	14.6	7.39×10⁶	15.7	11.4
CD-YOLO	81.2	76.4	84.0	15.3	7.06×10⁶	14.4	9.6

模型	P/%	R/%	mAP_0.5/%	计算量/GFLOPs	模型大小/MB	单张图片处理时间/ms
SSD	63.5	81.4	72.2	63.5	90.5	36.6
Faster R-CNN	69.1	84.5	77.7	142.6	108.1	113.3
YOLOv5s	80.2	73.9	80.5	16.2	15.1	14.2
YOLOv8s	79.1	74.8	81.6	49.1	22.5	9.8
YOLOv11s	78.2	74.9	81.6	21.5	19.2	14.3
YOLOv8s-P2	79.3	74.4	81.5	55.3	24.8	10.7
DWG-YOLOv8	78.2	73.2	82.0	17.4	14.7	10.5
HAD-YOLO	77.9	74.3	81.1	35.4	12.5	19.3
CD-YOLO	81.2	76.4	84.0	15.3	14.4	9.6