Lightweight Detection Method for Grading Fresh Cut Dianthus caryophyllus L. Based on Flor-YOLO

doi:10.12133/j.smartag.SA202512007

Abstract

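Abstract (translated from the Chinese):

[Objective] Manual grading of carnation cut-flower openness is subjective and inefficient, and general-purpose object detection models struggle to combine fine-grained representation of petal texture with a lightweight design. To address these problems, a lightweight detection model for grading carnation cut-flower openness, Flor-YOLO (Flower openness recognition You Only Look Once), is proposed. [Methods] The model takes YOLO11n as its baseline and makes targeted improvements to the backbone network, the downsampling scheme, and the detection head. First, a lightweight chimera feature backbone is built: a re-parameterized convolutional stem and a C3k2 module based on partial convolution reduce the parameter count while strengthening the representation of high-frequency petal texture, and a re-parameterized aggregation network module fused with context anchor attention strengthens the capture of long-range dependencies in high-level semantics. Second, to counter the texture aliasing and detail loss caused by conventional spatial downsampling, a wavelet pooling downsampling module is introduced; it uses the 2D discrete wavelet transform to explicitly retain, in the frequency domain, high-frequency discriminative features such as the folds of petal edges. Finally, a shared-detail lightweight detection head is designed; through cross-scale weight sharing and detail-enhanced convolution, it reduces the parameter count while addressing the misalignment between classification confidence and localization quality. [Results and Discussion] On the self-built carnation dataset, Flor-YOLO reached a mean average precision of 96.10%, 3.25 percentage points higher than the baseline model; its parameter count and floating-point operations were 1.26 M and 1.1 GFLOPs, reductions of 51.2% and 82.5%, respectively; and its inference speed on an RTX 4060 reached 616.09 f/s. [Conclusions] The algorithm significantly improves grading accuracy while remaining lightweight, is theoretically feasible to deploy on low-compute mobile terminals, and can provide technical support for the development of automated grading equipment for carnation cut flowers.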

Keywords: carnation grading, object detection, Flor-YOLO, lightweight, wavelet pooling, frequency domain analysis
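The re-parameterized stem mentioned in the abstract relies on the general idea of structural re-parameterization: several parallel branches used during training are folded into one convolution for inference. The sketch below illustrates that idea only. The branch layout (a 3x3 branch, a 1x1 branch, and an identity branch), the omission of batch normalization, and the function names `conv2d_same` and `merge_branches` are assumptions made for illustration and are not taken from the paper.

```python
import numpy as np


def conv2d_same(x, k):
    """Naive stride-1 'same' cross-correlation.

    x: (C_in, H, W); k: (C_out, C_in, kh, kw) with odd kh and kw.
    """
    cout, cin, kh, kw = k.shape
    ph, pw = kh // 2, kw // 2
    xp = np.pad(x, ((0, 0), (ph, ph), (pw, pw)))
    _, h, w = x.shape
    out = np.zeros((cout, h, w))
    for i in range(kh):
        for j in range(kw):
            # Contribution of kernel tap (i, j) to every output position
            out += np.einsum("oc,chw->ohw", k[:, :, i, j], xp[:, i:i + h, j:j + w])
    return out


def merge_branches(k3, k1):
    """Fold a 1x1 branch and an identity branch into one 3x3 kernel.

    Hypothetical helper: requires C_in == C_out so the identity branch exists.
    """
    cout, cin = k3.shape[:2]
    if cin != cout:
        raise ValueError("identity branch needs C_in == C_out")
    merged = k3.copy()
    merged[:, :, 1, 1] += k1[:, :, 0, 0]  # a 1x1 kernel sits at the 3x3 center
    merged[:, :, 1, 1] += np.eye(cout)    # identity = 1x1 conv with identity weights
    return merged


rng = np.random.default_rng(0)
x = rng.standard_normal((4, 6, 6))
k3 = rng.standard_normal((4, 4, 3, 3))
k1 = rng.standard_normal((4, 4, 1, 1))

multi_branch = conv2d_same(x, k3) + conv2d_same(x, k1) + x  # training-time form
single_path = conv2d_same(x, merge_branches(k3, k1))        # inference-time form
print(np.allclose(multi_branch, single_path))  # True
```

Because convolution is linear, the merged kernel reproduces the multi-branch output exactly, so the extra branches cost nothing at inference time.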

Abstract:

[Objective] Carnation (Dianthus caryophyllus L.) is one of the most economically valuable cut-flower crops worldwide. Postharvest openness is a key quality indicator that influences pricing, logistics tolerance, and shelf life. However, manual grading is inefficient and subjective because of dense petal overlap and complex edge structures. With the shift toward large-scale production and rising labor costs, accurate automated grading has become essential. Existing object detection models face a trade-off between computational efficiency and feature fidelity: high-precision architectures are too computationally expensive for edge deployment, while lightweight models often lack sufficient feature representation. In addition, conventional spatial downsampling introduces spectral aliasing, which discards high-frequency petal texture information and limits the separability of adjacent openness grades. A lightweight yet detail-preserving detection framework is therefore required. To meet this need, Flor-YOLO (Flower openness recognition You Only Look Once) is proposed, integrating frequency-domain perception with structural re-parameterization for efficient and accurate carnation openness grading.

[Methods] Flor-YOLO builds on the YOLO11n baseline with targeted improvements to the backbone, the downsampling mechanism, and the detection head. Backbone reconstruction: A lightweight LiteChimeraNet was constructed to enhance feature expression under limited computing power. A RepStem re-parameterization module was introduced at the input stage to establish an anti-aliasing mechanism via multi-branch training and single-path inference. The C3k2_PConv module, which uses partial convolution, was integrated to reduce memory access cost (MAC) and focus computation on petal foregrounds. In addition, a RepNCSPELAN4_CAA module with embedded context anchor attention was incorporated in the deep layers to capture long-range dependencies of the global flower topology. Frequency-domain downsampling: To mitigate the texture aliasing and detail loss caused by spatial downsampling, a WaveletPool module was introduced. Using the 2D discrete wavelet transform (2D-DWT), this module orthogonally decomposed feature maps into low- and high-frequency sub-bands, explicitly preserving high-frequency information in the horizontal, vertical, and diagonal directions to alleviate spectral aliasing. Detection head optimization: A lightweight shared detail-enhanced detection head (SDL-Head) was designed. It reduced parameter redundancy through cross-scale weight sharing and incorporated detail-enhanced convolution (DEConv), which fuses central and angular difference operators, to boost sensitivity to the geometric morphology of petal edges. Furthermore, a scale-adaptive layer combined with Intersection over Union (IoU)-aware soft labels was applied to improve multi-scale feature alignment. A dataset of 1 748 original images of "Red kang" carnations was collected and expanded to 6 580 samples via hybrid data augmentation. The model was trained on an NVIDIA RTX 4060 GPU for 250 epochs using SGD optimization, and comparative evaluations were conducted against the YOLO series, NanoDet-m, and Hyper-YOLO-t.

[Results and Discussion] Ablation studies and comparative experiments on the self-constructed dataset revealed significant performance gains. Ablation analysis: Reconstructing the backbone as LiteChimeraNet reduced FLOPs from 6.3 G (baseline) to 1.5 G, a decrease of 76.2%, while maintaining stable mean Average Precision (mAP@50), verifying its efficiency in removing background redundancy. Introducing WaveletPool improved mAP@50 by 1.79 percentage points, confirming the critical role of explicitly preserving high-frequency components for serrated texture representation. Integrating SDL-Head further optimized feature alignment, increasing recall to 94.47%. Overall performance: Flor-YOLO achieved a precision of 93.04%, a recall of 94.47%, and an mAP@50 of 96.10%. Compared with the YOLO11n baseline, these metrics improved by 3.52, 1.34, and 3.25 percentage points, respectively. Meanwhile, the parameter count fell by 51.2% to 1.26 M, and FLOPs fell by 82.54% to 1.1 G. Flor-YOLO exhibited distinct advantages over YOLOv5n, YOLOv8n, YOLOv9t, YOLOv10n, and YOLOv12n in accuracy, mAP, and inference speed. Mechanism analysis: Spectral energy statistics showed that high-frequency energy intensified with increasing openness grade, in line with the visual characteristics of petal expansion and wrinkle formation, which supports the discriminative value of high-frequency information. Grad-CAM++ visualizations further showed that the improved model focused stably on petal edges and flower centers and was more robust than the baseline in complex backgrounds.

[Conclusions] By constructing the LiteChimeraNet backbone, incorporating frequency-domain downsampling, and designing a detail-enhanced head, the proposed model effectively enhances the representation of critical details such as petal edges and flower centers while keeping computational cost very low. Overall, Flor-YOLO achieves a favorable balance among accuracy, model size, and real-time performance, demonstrating strong potential for deployment on low-power mobile terminals and embedded sorting equipment. Furthermore, the proposed frequency-aware lightweight design paradigm provides a useful reference for other agricultural vision tasks that rely on subtle textural differences.

Key words: Dianthus caryophyllus L. grading, object detection, Flor-YOLO, lightweight, wavelet pooling, frequency domain analysis
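The wavelet pooling step described in the abstract can be illustrated with a one-level 2D Haar transform used as a stride-2 downsampler. This is a minimal sketch under stated assumptions, not the authors' WaveletPool module: the abstract does not name the wavelet basis, so Haar is assumed here, and stacking the four sub-bands along the channel axis as well as the function name `haar_wavelet_pool` are illustrative choices.

```python
import numpy as np


def haar_wavelet_pool(x: np.ndarray) -> np.ndarray:
    """One-level 2D Haar DWT used as a stride-2 downsampler (assumed basis).

    x: feature map of shape (C, H, W) with even H and W.
    Returns shape (4*C, H/2, W/2): the low-frequency sub-band first,
    followed by three high-frequency detail sub-bands.
    """
    c, h, w = x.shape
    if h % 2 or w % 2:
        raise ValueError("H and W must be even")
    # The four samples of every non-overlapping 2x2 block
    a = x[:, 0::2, 0::2]  # top-left
    b = x[:, 0::2, 1::2]  # top-right
    d = x[:, 1::2, 0::2]  # bottom-left
    e = x[:, 1::2, 1::2]  # bottom-right
    ll = (a + b + d + e) / 2.0  # low-pass in both directions
    lh = (a - b + d - e) / 2.0  # difference between the two columns
    hl = (a + b - d - e) / 2.0  # difference between the two rows
    hh = (a - b - d + e) / 2.0  # diagonal difference
    return np.concatenate([ll, lh, hl, hh], axis=0)


rng = np.random.default_rng(0)
x = rng.standard_normal((3, 8, 8))
y = haar_wavelet_pool(x)
print(y.shape)                                   # (12, 4, 4)
print(np.allclose((x ** 2).sum(), (y ** 2).sum()))  # True
```

The energy check shows why this form of downsampling keeps detail: the transform is orthogonal, so nothing is discarded when the resolution is halved. Plain 2x2 average pooling keeps only a scaled copy of the low-frequency sub-band and drops the three detail sub-bands.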

LI Chuanmeng, YANG Jie, ZHANG Xiaoyu. Lightweight Detection Method for Grading Fresh Cut Dianthus caryophyllus L. Based on Flor-YOLO[J]. Smart Agriculture, doi: 10.12133/j.smartag.SA202512007.

Figures and tables (16): Figures 1-10; Tables 1-6.

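Flowering index/grade	Description
1	Petals protrude about 0.5 cm from the calyx, and the top of the flower is star-shaped. This stage is unsuitable for harvest: the flowering index is too low, and unless strong bud-opening treatments are applied, the cut flower will open poorly or not at all
2	Petals protrude about 1 cm from the calyx and stand upright. Suitable for long-distance transport and sale in summer and autumn
3	Petals begin to spread, but the center remains tight. Suitable for long-distance transport and sale in winter and spring
4	Petals are looser, and the outer petals have not yet reached the horizontal. Suitable for short-distance transport and sale in winter and spring
5	Petals are fully loosened, and the outer petals are spread horizontally. The flower is over-mature at this stage and should not be harvested; if harvested, it should be sold as soon as possible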

Openness	Grade 1/images	Grade 2/images	Grade 3/images	Grade 4/images	Grade 5/images	Total/images
Before augmentation	52	220	589	549	338	1 748
After augmentation	832	1 320	1 767	1 647	1 014	6 580
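The per-grade augmentation factors implied by the table above can be recovered with a few lines of arithmetic. The sketch below uses only the counts in the table; the reading that the rarer grades were expanded more heavily to reduce class imbalance is an inference from these numbers, not something the table states.

```python
# Image counts per openness grade, copied from the table above
before = {1: 52, 2: 220, 3: 589, 4: 549, 5: 338}
after = {1: 832, 2: 1320, 3: 1767, 4: 1647, 5: 1014}

# Totals should match the "Total" column (1 748 and 6 580)
total_before = sum(before.values())
total_after = sum(after.values())

# Expansion factor applied to each grade
factors = {g: after[g] / before[g] for g in before}


def imbalance(counts):
    """Ratio of the largest class to the smallest class."""
    return max(counts.values()) / min(counts.values())


print(total_before, total_after)
print(factors)
print(round(imbalance(before), 2), round(imbalance(after), 2))
```

The factors come out as 16 for grade 1, 6 for grade 2, and 3 for grades 3 to 5, so the smallest class receives the largest expansion.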

Model	R/%	F₁/%	P/%	mAP@50/%	mAP@50~95/%	Parameters/M	FLOPs/G	Inference speed/(f/s)	Model size/MB
YOLO11n	93.13	91.22	89.52	92.85	74.74	2.58	6.3	275.00	5.2
YOLO11n+R	92.00	92.33	92.22	92.87	75.67	2.58	1.7	269.42	5.3
YOLO11n+P	94.98	91.86	89.39	93.94	74.87	2.40	5.9	330.13	4.9
YOLO11n+RC	95.82	92.33	89.23	95.01	76.38	2.18	6.0	298.09	4.5
YOLO11n+P+RC	92.19	92.82	93.71	94.37	76.51	2.00	5.6	270.06	4.1
YOLO11n+R+RC	90.63	91.90	88.69	94.95	75.95	2.18	1.6	507.18	4.5
LiteChimeraNet	90.76	92.18	93.92	95.04	76.13	2.00	1.5	521.60	4.1

Model	R/%	F₁/%	P/%	mAP@50/%	mAP@50~95/%	Parameters/M	FLOPs/G	Inference speed/(f/s)	Model size/MB
YOLO11n	93.13	91.22	89.52	92.85	74.74	2.58	6.3	275.00	5.2
YOLO11n+Revcol	89.94	91.50	93.20	93.70	74.28	2.09	4.9	267.74	4.5
YOLO11n+StarNet	90.28	87.97	85.97	91.22	72.06	1.94	5.0	226.76	4.0
YOLO11n+HGNetV2	92.05	91.01	90.10	92.30	73.70	2.14	5.7	273.08	4.5
YOLO11n+LiteChimeraNet	90.76	92.18	93.92	95.04	76.13	2.00	1.5	521.60	4.1

Model	R/%	F₁/%	P/%	mAP@50/%	mAP@50~95/%	Parameters/M	FLOPs/G	Inference speed/(f/s)	Model size/MB
YOLO11n	93.13	91.22	89.52	92.85	74.74	2.58	6.3	275.00	5.2
YOLO+L	90.76	92.18	93.92	95.04	76.13	2.00	1.5	521.60	4.1
YOLO+W	91.89	92.03	92.18	94.64	76.11	2.17	5.4	282.99	4.4
YOLO+S	94.38	89.85	94.25	92.80	74.70	2.26	6.0	304.81	5.0
YOLO+W+S	93.88	92.72	91.88	93.66	74.71	2.26	6.2	265.55	5.0
YOLO+L+W	93.77	93.28	92.85	95.74	76.26	1.58	1.2	431.60	3.3
YOLO+L+S	92.75	91.95	93.39	95.14	76.10	1.32	1.2	519.75	3.2
YOLO+L+W+S (Flor-YOLO)	94.47	93.69	93.04	96.10	76.24	1.26	1.1	616.09	3.0
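The headline figures quoted in the abstract can be recomputed from the first and last rows of the table above. The short check below uses only those table values; the helper name `pct_reduction` and the dictionary keys are illustrative.

```python
# First (YOLO11n) and last (Flor-YOLO) rows of the table above
baseline = {"P": 89.52, "R": 93.13, "mAP50": 92.85, "params_M": 2.58, "flops_G": 6.3, "fps": 275.00}
flor = {"P": 93.04, "R": 94.47, "mAP50": 96.10, "params_M": 1.26, "flops_G": 1.1, "fps": 616.09}


def pct_reduction(base: float, new: float) -> float:
    """Relative reduction of `new` with respect to `base`, in percent."""
    return (base - new) / base * 100.0


# Accuracy metrics are compared in percentage points (simple differences)
gains_pp = {k: round(flor[k] - baseline[k], 2) for k in ("P", "R", "mAP50")}

params_drop = pct_reduction(baseline["params_M"], flor["params_M"])
flops_drop = pct_reduction(baseline["flops_G"], flor["flops_G"])
speedup = flor["fps"] / baseline["fps"]

print(gains_pp)
print(f"parameters: -{params_drop:.1f}%  FLOPs: -{flops_drop:.2f}%  speed: x{speedup:.2f}")
```

The results agree with the abstract: gains of 3.52, 1.34, and 3.25 percentage points in precision, recall, and mAP@50, a 51.2% cut in parameters, and an 82.54% cut in FLOPs.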