
Smart Agriculture


YOLOv8n-SSND: An Improved Lightweight Model for Aerial Quinoa Spike Target Detection

WU Tingting1, GUO Junrui1, TAO Qiujie1, CHEN Shihua2, GUO Shanli2()   

  1. College of Mechanical and Electronic Engineering, Northwest A&F University, Yangling 712100, Shaanxi, China
    2. College of Life Sciences, Yantai University, Yantai 264006, Shandong, China
  • Received: 2025-08-21 Online: 2025-12-05
  • Foundation items:
    Agricultural Superior Varieties Project of the Shandong Province Key Research and Development Program (2023LZGC011)
  • About author:

    WU Tingting, Ph.D., associate professor; research interests: crop phenotyping detection technology and equipment. E-mail:

  • Corresponding author:
    GUO Shanli, Ph.D., professor; research interests: plant genetics and breeding, plant gene cloning and functional studies. E-mail:

YOLOv8n-SSND: An Improved Lightweight Model for Aerial Chenopodium quinoa Willd Spike Target Detection

WU Tingting1, GUO Junrui1, TAO Qiujie1, CHEN Shihua2, GUO Shanli2()   

  1. College of Mechanical and Electronic Engineering, Northwest A&F University, Yangling 712100, China
    2. College of Life Sciences, Yantai University, Yantai 264006, China
  • Received:2025-08-21 Online:2025-12-05
  • Foundation items:The Key Research Project of Shandong Province for Agricultural Superior Varieties(2023LZGC011)
  • About author:

    WU Tingting, E-mail:

  • Corresponding author:
    GUO Shanli, E-mail:

Abstract:

[Objective/Significance] The quinoa spike is one of the key indicators for quinoa yield estimation. To improve the accuracy and efficiency of quinoa spike target detection, a lightweight target recognition model suitable for UAV deployment, YOLOv8n-SSND (YOLOv8n with Switchable Atrous Convolution, Slim Neck, and Deformable Attention), is proposed. [Methods] With YOLOv8n and YOLOv11n as baseline models, and considering the varied size and complex structure of quinoa spikes, a switchable atrous convolution (SAC) module was added to the Backbone to strengthen the detection of complex features; a Slim-Neck feature fusion layer was introduced to lighten the network; and a deformable attention (DA) mechanism was added so that the model can dynamically recognize the complex features of quinoa spikes while maintaining high inference efficiency. [Results and Discussions] The YOLOv8n-SSND model achieved a mean average precision at 50% intersection over union (mAP50) of 94.3%, improvements of 0.7, 0.9, 2.1, 1.4, 2.0, 23.1, 19.6, and 1.8 percentage points over YOLOv11n-SSND, YOLOv11n, YOLOv12n, YOLOv7, YOLOv5s, SSD (Single Shot MultiBox Detector), Fast R-CNN (Fast Region-Based Convolutional Neural Network), and YOLOv8n, respectively. Its inference speed reached 166.7 FPS, 26.7% higher than the baseline model, and its computational cost was 6.8 GFLOPs, 16.0% lower than the baseline model. [Conclusions] The YOLOv8n-SSND model delivers higher accuracy, faster inference, and fewer floating-point operations for quinoa spike recognition, providing a feasible method for UAV-based quinoa spike detection and an efficient technical solution for quinoa yield estimation and intelligent agricultural management.

Keywords: quinoa spike, UAV, YOLOv8n, object detection, deformable attention

Abstract:

[Objective] The quinoa panicle serves as a critical phenotypic indicator for estimating crop yield and evaluating the growth condition of quinoa plants. Accurate and efficient recognition of quinoa panicles in complex field environments is therefore of great significance for intelligent agriculture, yield prediction, and automatic crop management. However, unmanned aerial vehicle (UAV)-acquired field imagery often exhibits complex characteristics such as diverse panicle morphology, uneven illumination, overlapping occlusion, and background interference, which pose substantial challenges for conventional target detection algorithms. To address these issues, a novel lightweight target detection model, named YOLOv8n-SSND (YOLOv8n with Switchable Atrous Convolution, Slim Neck, and Deformable Attention), is proposed and specifically optimized for UAV-based quinoa panicle identification. The main goal of this study is to improve the detection accuracy and inference efficiency for quinoa panicles while maintaining low computational cost and real-time performance suitable for embedded UAV deployment. [Methods] The proposed model was constructed on the YOLOv8n and YOLOv11n frameworks and incorporated several improvements tailored for small-object agricultural detection tasks. To enhance the ability to capture multi-scale and high-dimensional semantic features, a switchable atrous convolution (SAC) module was embedded into the Backbone network. This module dynamically adjusted its receptive field according to spatial context, enabling more precise extraction of local and global texture details of quinoa panicles. To reduce redundant parameters while maintaining high computational efficiency, a Slim-Neck lightweight feature fusion layer was designed, which effectively strengthened the integration of shallow spatial information and deep semantic features, allowing the network to maintain high accuracy without increasing model complexity.
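The core idea of switchable atrous convolution — one set of convolution weights applied at two dilation rates, blended per position by a learned switch — can be sketched in 1-D with NumPy. This is an illustrative toy, not the paper's implementation: the function names, the 1-D setting, and the fixed scalar switch are assumptions for clarity.

```python
import numpy as np

def dilated_conv1d(x, w, dilation):
    # 'same'-padded 1-D convolution with a dilated, centered kernel
    k = len(w)
    pad = dilation * (k // 2)
    xp = np.pad(x, pad)
    return np.array([sum(w[j] * xp[i + j * dilation] for j in range(k))
                     for i in range(len(x))])

def switchable_atrous_conv(x, w, switch):
    # SAC idea: the SAME weights are evaluated at two dilation rates,
    # and a switch in [0, 1] (learned in the real model, a constant here)
    # blends the small- and large-receptive-field responses per input.
    y_small = dilated_conv1d(x, w, dilation=1)
    y_large = dilated_conv1d(x, w, dilation=3)
    return switch * y_small + (1 - switch) * y_large
```

In the actual module the switch is itself produced by a convolution over the input, so the effective receptive field adapts to spatial context; here a constant stands in for that gate.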
Additionally, a deformable attention (DA) mechanism was introduced to enable adaptive focus on regions with rich panicle-related features while suppressing irrelevant background noise. This attention mechanism assigned dynamic weights across both spatial and channel dimensions, improving the model's robustness against occlusions, illumination variations, and complex field textures commonly encountered in UAV images. [Results and Discussions] Comprehensive field experiments were conducted using UAV images of quinoa plots collected under different environmental conditions and growth stages. The results demonstrated that the proposed YOLOv8n-SSND model achieved a mean average precision (mAP50) of 94.3%, a remarkable improvement over multiple baseline and comparative models. Specifically, compared with YOLOv11n-SSND, YOLOv11n, YOLOv12n, YOLOv7, YOLOv5s, SSD (Single Shot MultiBox Detector), Fast R-CNN (Fast Region-Based Convolutional Neural Network), and YOLOv8n, the proposed model achieved improvements of 0.7, 0.9, 2.1, 1.4, 2.0, 23.1, 19.6, and 1.8 percentage points, respectively. In terms of computational efficiency, the inference speed reached 166.7 frames per second (FPS), a 26.7% increase over the YOLOv8n baseline, ensuring real-time detection capability for UAV-mounted onboard processors. Moreover, the total floating-point operation count was reduced to 6.8 GFLOPs, a 16.0% reduction compared with the baseline model, demonstrating the improved efficiency of the proposed architecture. The experimental comparison also indicated that the integration of SAC enhanced the model's sensitivity to complex spatial patterns, while the DA module effectively improved feature selectivity and prevented overfitting to background textures. The Slim-Neck design contributed significantly to reducing parameter redundancy and facilitated smooth feature propagation across layers.
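The relative figures quoted above imply baseline values that the abstract does not state explicitly. A quick back-calculation (reported values from the abstract; the derived baselines are inferences, not reported numbers) puts the YOLOv8n baseline at roughly 131.6 FPS and 8.1 GFLOPs:

```python
# Back out the unstated baseline figures from the reported relative changes.
fps_ssnd = 166.7            # reported inference speed of YOLOv8n-SSND
fps_gain = 0.267            # "26.7% increase over the YOLOv8n baseline"
baseline_fps = fps_ssnd / (1 + fps_gain)

gflops_ssnd = 6.8           # reported computation cost of YOLOv8n-SSND
gflops_cut = 0.160          # "16.0% reduction compared with the baseline"
baseline_gflops = gflops_ssnd / (1 - gflops_cut)

print(round(baseline_fps, 1), round(baseline_gflops, 1))  # → 131.6 8.1
```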
[Conclusions] The YOLOv8n-SSND model effectively balances detection accuracy, inference speed, and computational cost, making it well-suited for real-time UAV-based agricultural monitoring. The experimental outcomes confirm that the model not only provides high-precision detection of quinoa panicles but also offers superior inference efficiency with minimal computational resources. These characteristics make it a promising solution for UAV-deployed intelligent agricultural systems, where power and processing capacity are limited. Furthermore, the proposed method provides a technical foundation for large-scale, automated monitoring of quinoa growth, enabling accurate yield estimation, phenotypic analysis, and precision crop management. The results of this study demonstrate that lightweight deep learning architectures with adaptive attention mechanisms can achieve strong performance in complex agricultural detection tasks. This work contributes to the advancement of UAV-based intelligent sensing and establishes a reference framework for developing future lightweight object detection models applicable to various crop types and field conditions.

Key words: quinoa panicle, UAV, YOLOv8n, object detection, deformable attention

CLC number: