0 引 言
1 数据与预处理
2 MHSF-DETR模型架构设计
2.1 网络整体架构
2.2 HCSP-Net主干网络
2.2.1 M2-SCA模块
2.2.2 CSF模块
2.3 边界感知重构机制
2.3.1 ER-Conv卷积
2.3.2 ER-ConvC3特征聚合模块
2.4 LWC-Fusion模块
3 结果与分析
3.1 实验设置与评估指标
表1 MHSF-DETR模型实验参数设置Table 1 Experimental parameter settings for MHSF-DETR model |
| 参数名称 | 参数数值 |
|---|---|
| 输入图像尺寸 | 640×640 |
| 初始学习率 | 0.000 1 |
| 权重衰减 | 0.000 5 |
| 批次大小 | 4 |
| 总训练轮次 | 200 |
| 优化器 | AdamW |
3.2 消融实验
表2 MHSF-DETR模型核心模块消融实验结果Table 2 Ablation study results of core modules in the MHSF-DETR model |
| 实验 | LWC-Fusion | EARM | HCSP-Net | 准确率/% | 召回率/% | mAP50/% | 参数量/M | GFLOPs |
|---|---|---|---|---|---|---|---|---|
| 1 | × | × | × | 88.6 | 76.1 | 79.3 | 19.89 | 57.2 |
| 2 | √ | × | × | 86.4 | 75.5 | 79.8 | 20.44 | 57.6 |
| 3 | × | √ | × | 85.8 | 78.2 | 80.2 | 19.13 | 52.5 |
| 4 | × | × | √ | 84.8 | 77.8 | 81.1 | 16.00 | 55.0 |
| 5 | √ | √ | × | 87.1 | 78.9 | 80.9 | 19.66 | 53.2 |
| 6 | √ | × | √ | 88.2 | 78.5 | 81.8 | 16.48 | 55.4 |
| 7 | × | √ | √ | 88.4 | 79.2 | 82.1 | 15.01 | 49.4 |
| 8 | √ | √ | √ | 90.2 | 79.6 | 82.5 | 15.43 | 49.6 |
|
3.3 与主流模型对比实验
表3 MHSF-DETR与不同检测模型对比实验结果Table 3 Comparative experimental results between MHSF-DETR and different detection models |
| 算法 | 准确率/% | 召回率/% | mAP50/% | 参数量/M | GFLOPs |
|---|---|---|---|---|---|
| YOLOv5m | 87.7 | 77.4 | 81.9 | 21.32 | 49.2 |
| YOLOv8m | 89.1 | 78.2 | 83.1 | 25.84 | 78.7 |
| YOLOv10m | 85.3 | 74.5 | 79.6 | 15.41 | 59.3 |
| RT-DETR-R18 | 88.6 | 76.1 | 79.3 | 19.89 | 57.2 |
| RT-DETR-R50 | 90.3 | 80.5 | 83.8 | 42.65 | 110.5 |
| SSD | 78.5 | 68.2 | 74.3 | 26.28 | 62.4 |
| Faster R-CNN | 81.3 | 76.9 | 80.1 | 136.02 | 358.5 |
| EdgeNeXt-B | 82.4 | 71.5 | 76.9 | 18.51 | 3.84 |
| MobileViT-S | 76.5 | 64.2 | 70.8 | 5.63 | 2.03 |
| MHSF-DETR | 90.2 | 79.6 | 82.5 | 15.43 | 49.6 |
3.4 泛化实验
表4 MHSF-DETR在Plant Village数据集上的泛化实验结果Table 4 Generalization experiment results of MHSF-DETR on the Plant Village dataset |
| 算法 | 准确率/% | 召回率/% | mAP50/% | 参数量/M | GFLOPs |
|---|---|---|---|---|---|
| YOLOv5m | 84.2 | 74.1 | 78.5 | 21.32 | 49.2 |
| YOLOv8m | 86.5 | 75.3 | 80.1 | 25.84 | 78.7 |
| YOLOv10m | 82.1 | 71.8 | 78.4 | 15.41 | 59.3 |
| RT-DETR-R18 | 85.4 | 76.8 | 76.8 | 19.89 | 57.2 |
| RT-DETR-R50 | 87.2 | 80.5 | 80.7 | 42.65 | 110.5 |
| SSD | 75.2 | 70.1 | 70.2 | 26.28 | 62.4 |
| Faster R-CNN | 78.6 | 76.5 | 76.5 | 136.02 | 358.5 |
| EdgeNeXt-B | 80.2 | 69.6 | 74.1 | 18.51 | 3.84 |
| MobileViT-S | 74.5 | 62.8 | 68.6 | 5.63 | 2.03 |
| MHSF-DETR | 87.8 | 76.9 | 80.3 | 15.43 | 49.6 |
表5 MHSF-DETR在Roboflow Universe数据集上的泛化实验结果Table 5 Generalization experiment results of MHSF-DETR on the Roboflow Universe dataset |
| 算法 | 准确率/% | 召回率/% | mAP50/% | 参数量/M | GFLOPs |
|---|---|---|---|---|---|
| YOLOv5m | 89.5 | 78.4 | 84.2 | 21.32 | 49.2 |
| YOLOv8m | 92.2 | 82.5 | 89.6 | 25.84 | 78.7 |
| YOLOv10m | 92.8 | 81.9 | 88.1 | 15.41 | 59.3 |
| RT-DETR-R18 | 88.2 | 77.5 | 82.5 | 19.89 | 57.2 |
| RT-DETR-R50 | 93.5 | 83.1 | 88.6 | 42.65 | 110.5 |
| SSD | 82.4 | 72.1 | 76.8 | 26.28 | 62.4 |
| Faster R-CNN | 87.5 | 80.2 | 81.9 | 136.02 | 358.5 |
| EdgeNeXt-B | 83.2 | 73.3 | 78.5 | 18.51 | 3.84 |
| MobileViT-S | 80.5 | 69.4 | 73.1 | 5.63 | 2.03 |
| MHSF-DETR | 92.5 | 81.1 | 86.9 | 15.43 | 49.6 |
3.5 边缘计算下的模型部署性能评估
表6 MHSF-DETR与不同算法在边缘设备上的推理效率对比结果Table 6 Comparison of inference efficiency between MHSF-DETR and different algorithms on edge devices |
| 算法 | 参数量/M | mAP50/% | 推理耗时/ms | 帧率/(帧/s) |
|---|---|---|---|---|
| YOLOv8m | 25.84 | 83.1 | 2 054.56 | 0.49 |
| RT-DETR-R50 | 42.65 | 83.8 | 2 833.34 | 0.35 |
| RT-DETR-R18 | 19.89 | 79.3 | 1 685.23 | 0.61 |
| MHSF-DETR | 15.43 | 82.5 | 1 454.04 | 0.69 |





