Dense Nursery Stock Detecting and Counting Based on UAV Aerial Images and Improved LSC-CNN

Smart Agriculture, 2024, Vol. 6, Issue 5: 88-97. doi: 10.12133/j.smartag.SA202404011

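PENG Xiaodan¹,³, CHEN Fengjun¹,²,³,⁴, ZHU Xueyan¹,⁴, CAI Jiawei¹,³, GU Mengmeng⁵

1. School of Technology, Beijing Forestry University, Beijing 100083, China
2. National Key Laboratory of Efficient Production of Forest Resources, Beijing 100083, China
3. Beijing Laboratory of Urban and Rural Ecological Environment, Beijing 100083, China
4. Key Laboratory of State Forestry Administration for Forestry Equipment and Automation, Beijing 100083, China
5. Department of Horticulture and Landscape Architecture, Colorado State University, Fort Collins 80523, USA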

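Received: 2024-04-18 Published: 2024-09-30
Funding:
National Key Research and Development Program of China (2019YFD1002401); Beijing Forestry University Science and Technology Innovation Program (2021ZY74); Beijing Municipal Co-construction Special Project
About the author:
PENG Xiaodan, research interest: forestry information detection. Email: pengxiaodan@bjfu.edu.cn
Corresponding author:
CHEN Fengjun, Ph.D., professor, research interests: forestry information detection and intelligent processing. Email: chenfj227@bjfu.edu.cn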

Dense Nursery Stock Detecting and Counting Based on UAV Aerial Images and Improved LSC-CNN

PENG Xiaodan¹,³, CHEN Fengjun¹,²,³,⁴, ZHU Xueyan¹,⁴, CAI Jiawei¹,³, GU Mengmeng⁵

1. School of Technology, Beijing Forestry University, Beijing 100083, China
2. National Key Laboratory of Efficient Production of Forest Resources, Beijing 100083, China
3. Beijing Laboratory of Urban and Rural Ecological Environment, Beijing Municipal Education Commission, Beijing 100083, China
4. Key Laboratory of State Forestry Administration for Forestry Equipment and Automation, Beijing 100083, China
5. Department of Horticulture and Landscape Architecture, Colorado State University, Fort Collins, CO 80523, USA

Received: 2024-04-18 Online: 2024-09-30
Foundation items: National Key Research and Development Program (2019YFD1002401); Beijing Forestry University Science and Technology Innovation Program Project (2021ZY74); Beijing Common Construction Project
About author:
PENG Xiaodan, E-mail: pengxiaodan@bjfu.edu.cn
Corresponding author:
CHEN Fengjun, E-mail: chenfj227@bjfu.edu.cn

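Abstract

Summary:

[Objective/Significance] Counting densely planted nursery stock quickly and accurately is important for nursery management. To address tree adhesion and large scale differences in UAV aerial images of densely planted nursery stock, an improved dense detection and counting model (Locate, Size and Count, LSC-CNN) supervised by point-label data is proposed, which detects and counts nursery stock at the same time. [Methods] The improved LSC-CNN model replaces the last convolutional layer of the LSC-CNN feature extraction network with a dilated convolution (Dilated Convolutions, DConv). This enlarges the receptive field while preserving the detailed features of the trees, and helps the model use contextual information to separate adhering trees. In addition, a Convolutional Block Attention Module (CBAM) is introduced before the multiple scale branches so that the model focuses on the key features that help detection and counting and adapts better to trees of different scales. To address class imbalance and improve generalization, the loss function is replaced with the label-smoothing cross-entropy loss. [Results and Discussions] On a test set of 456 nursery stock images, the improved LSC-CNN model achieved a mean absolute error (MAE), root mean square error (RMSE), and mean counting accuracy (MCA) of 14.24 trees, 22.22 trees, and 91.23%, respectively. All three metrics were better than those of the IntegrateNet, PSGCNet, CANet, CSRNet, CLTR and LSC-CNN models. [Conclusions] The improved LSC-CNN model can accurately detect and count densely planted nursery stock and is applicable to detecting and counting a variety of trees.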
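The label-smoothing cross-entropy loss mentioned above can be illustrated with a minimal pure-Python sketch. It uses one common formulation, in which the one-hot target is mixed with a uniform distribution over the K classes. The function name, the smoothing strength `epsilon=0.1`, and the example logits are illustrative assumptions; the abstract does not report the authors' exact formulation or settings.

```python
import math

def label_smoothing_cross_entropy(logits, target, epsilon=0.1):
    """Cross-entropy between softmax(logits) and a smoothed one-hot target.

    epsilon is the smoothing strength; 0.1 is an illustrative value,
    not a setting reported in the source.
    """
    num_classes = len(logits)
    # Numerically stable log-softmax.
    m = max(logits)
    log_sum = m + math.log(sum(math.exp(z - m) for z in logits))
    log_probs = [z - log_sum for z in logits]
    loss = 0.0
    for k, log_p in enumerate(log_probs):
        # Smoothed target: epsilon / K on every class,
        # plus (1 - epsilon) on the true class.
        q = epsilon / num_classes + ((1.0 - epsilon) if k == target else 0.0)
        loss -= q * log_p
    return loss

# A confident, correct prediction is penalised slightly more once smoothing is on.
hard = label_smoothing_cross_entropy([5.0, 0.0, 0.0], target=0, epsilon=0.0)
soft = label_smoothing_cross_entropy([5.0, 0.0, 0.0], target=0, epsilon=0.1)
```

With `epsilon=0.0` the function reduces to the ordinary cross-entropy, which makes the effect of smoothing easy to compare.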

Keywords: UAV, dense planting, counting, multi-scale, LSC-CNN

Abstract:

[Objective] The number, location, and crown spread of nursery stock are important foundational data for scientific nursery management. The traditional approach of conducting nursery stock inventories through on-site surveys of individual plants is labor-intensive and time-consuming. Low-cost, convenient unmanned aerial vehicles (UAVs) are beginning to be used for on-site collection of nursery stock data, and nursery stock information can then be analyzed statistically with techniques such as image processing. During data collection, as the flight altitude of the UAV increases, the number of trees in a single image also increases. Although anchor-box annotations carry more information about the trees, annotating them is very costly when there are many images of densely planted trees. To tackle tree adhesion and scale variance in UAV images of nursery stock, and to reduce annotation costs, an improved dense detection and counting model that uses point-labeled data as supervisory signals was proposed to accurately obtain the location, size, and quantity of the targets. [Method] To enhance the diversity of nursery stock samples, a spruce dataset and the publicly available Yosemite and KCL-London tree datasets were selected to construct a dense nursery stock dataset. A total of 1 520 nursery stock images were acquired and divided into training and testing sets at a ratio of 7:3. To improve the model's adaptability to trees of different scales and to variations in lighting, data augmentation methods such as contrast adjustment and image resizing were applied to the training set. After augmentation, the training set consisted of 3 192 images and the testing set contained 456 images. Because each image contains a large number of trees, each tree was labeled with a single point at its center to reduce the annotation cost.
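The dataset sizes quoted above can be checked with a few lines of arithmetic. The sketch below assumes the 7:3 split is applied to the 1 520 images by integer division; the helper name `split_counts` is hypothetical. The resulting factor of 3 is consistent with each training image appearing in three versions after augmentation, although the abstract does not state that breakdown.

```python
def split_counts(total, train_parts=7, test_parts=3):
    """Split `total` images by an integer ratio; the training set takes the remainder."""
    test = total * test_parts // (train_parts + test_parts)
    return total - test, test

train, test = split_counts(1520)

# The augmented training set is reported as 3 192 images; dividing by the
# pre-augmentation size gives the implied number of versions per image.
versions_per_image = 3192 / train
```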
The LSC-CNN model was selected as the base model. Trained with point supervision, it can detect the quantity, location, and size of trees, thereby providing more information about them. The LSC-CNN model was improved to address the missed detections and false positives that occurred during testing. Firstly, to address missed detections caused by severe adhesion of densely packed trees, the last convolutional layer of the feature extraction network was replaced with a dilated convolution. This change enlarges the receptive field of the convolutional kernel on the input while preserving the detailed features of the trees, so the model can capture a broader range of contextual information and better understand the overall scene. Secondly, the convolutional block attention module (CBAM) was introduced at the beginning of each scale branch. This allowed the model to focus on the key features of trees at different scales and spatial locations, thereby improving its sensitivity to multi-scale information. Finally, the model was trained using the label-smoothing cross-entropy loss function and the grid winner-takes-all strategy, which emphasizes the regions with the highest losses to strengthen tree feature recognition. [Results and Discussions] The mean counting accuracy (MCA), mean absolute error (MAE), and root mean square error (RMSE) were adopted as evaluation metrics. Ablation studies and comparative experiments were designed to demonstrate the performance of the improved LSC-CNN model. The ablation experiments showed that the improved LSC-CNN model could effectively mitigate the missed detections and false positives of the LSC-CNN model, which were caused by the density and large scale variations present in the nursery stock dataset. The IntegrateNet, PSGCNet, CANet, CSRNet, CLTR and LSC-CNN models were chosen as comparative models.
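To make the receptive-field argument for dilated convolution concrete, the following sketch shows a 1-D version in pure Python. It is a simplification: the model itself operates on 2-D feature maps, and the dilation rate of 2 used here is an illustrative assumption rather than a value given in the abstract. The function names are hypothetical.

```python
def effective_kernel_size(kernel_size, dilation):
    """Input span covered by one kernel: k + (k - 1) * (d - 1)."""
    return kernel_size + (kernel_size - 1) * (dilation - 1)

def dilated_conv1d(signal, kernel, dilation=1):
    """Valid (no padding) 1-D cross-correlation with `dilation` steps between taps."""
    span = effective_kernel_size(len(kernel), dilation)
    out = []
    for start in range(len(signal) - span + 1):
        # Each kernel weight reads the input `dilation` positions apart.
        out.append(sum(w * signal[start + i * dilation]
                       for i, w in enumerate(kernel)))
    return out

signal = [1, 2, 3, 4, 5]
standard = dilated_conv1d(signal, [1, 1, 1], dilation=1)  # each output sees 3 inputs
dilated = dilated_conv1d(signal, [1, 1, 1], dilation=2)   # each output spans 5 inputs
```

The dilated kernel keeps the same three weights but covers a span of five inputs, which is how the receptive field grows without adding parameters or downsampling.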
The improved LSC-CNN model achieved an MCA, MAE, and RMSE of 91.23%, 14.24, and 22.22, respectively. Compared with the IntegrateNet, PSGCNet, CANet, CSRNet, CLTR and LSC-CNN models, its MCA was higher by 6.67, 2.33, 6.81, 5.31, 2.09 and 2.34 percentage points, its MAE was lower by 21.19, 11.54, 18.92, 13.28, 11.30 and 10.26, and its RMSE was lower by 28.22, 28.63, 26.63, 14.18, 24.38 and 12.15, respectively. These results indicate that the improved LSC-CNN model achieves high counting accuracy and exhibits strong generalization ability. [Conclusions] The improved LSC-CNN model combines the point-supervised learning of density estimation methods with the bounding-box generation of detection methods. These improvements enhance the accuracy, precision, and reliability of the model in detecting and counting trees. This study may provide a practical reference for counting other types of nursery stock.
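The differences quoted above follow directly from the per-model scores reported in the article's comparison table. A short sketch that recomputes them is given here; the dictionary layout and the helper name `gains` are illustrative choices, while the numbers are those reported in the article.

```python
# (MAE, RMSE, MCA in %) as reported for each comparison model.
baselines = {
    "IntegrateNet": (35.43, 50.44, 84.56),
    "PSGCNet": (25.78, 50.85, 88.90),
    "CANet": (33.16, 48.85, 84.42),
    "CSRNet": (27.52, 36.40, 85.92),
    "CLTR": (25.54, 46.60, 89.14),
    "LSC-CNN": (24.50, 34.37, 88.89),
}
improved = (14.24, 22.22, 91.23)

def gains(baseline, improved):
    """Return (MAE reduction, RMSE reduction, MCA gain in percentage points)."""
    b_mae, b_rmse, b_mca = baseline
    i_mae, i_rmse, i_mca = improved
    return (round(b_mae - i_mae, 2),
            round(b_rmse - i_rmse, 2),
            round(i_mca - b_mca, 2))

for name, scores in baselines.items():
    print(name, gains(scores, improved))
```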

Key words: UAV, intensive planting, counting, multi-scale, LSC-CNN

CLC number: TP391.4

PENG Xiaodan, CHEN Fengjun, ZHU Xueyan, CAI Jiawei, GU Mengmeng. Dense Nursery Stock Detecting and Counting Based on UAV Aerial Images and Improved LSC-CNN[J]. Smart Agriculture, 2024, 6(5): 88-97.

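Table 3 Detection result statistics of the improved LSC-CNN model on spruce nursery stock

Model	Mean detections/trees (Spruce)	Mean detections/trees (Yosemite)	Mean detections/trees (KCL-London)	Mean missed detections/trees (Spruce)	Mean missed detections/trees (Yosemite)	Mean missed detections/trees (KCL-London)	Miss rate/%	Precision P/%	Recall R/%	F1 score/%
LSC-CNN	862	199	220	44	30	38	11.66	93.08	88.34	90.61
Improved LSC-CNN	868	206	237	37	16	19	6.56	94.88	93.44	94.15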
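The relationship between the last four columns can be sketched with the standard definitions: the F1 score is the harmonic mean of precision and recall, and the reported miss rates equal 100% minus recall. The function names are hypothetical, and because the inputs are the rounded table values, recomputed F1 scores may differ from the reported ones in the last digits.

```python
def f1_score(precision, recall):
    """Harmonic mean of precision and recall (both in percent)."""
    return 2 * precision * recall / (precision + recall)

def miss_rate(recall):
    """Share of ground-truth trees that were not detected, in percent."""
    return 100.0 - recall

# Improved LSC-CNN row: P = 94.88 %, R = 93.44 %.
f1_improved = round(f1_score(94.88, 93.44), 2)
miss_improved = round(miss_rate(93.44), 2)
```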


Experiment No.	DConv	LSCE Loss	CBAM	MAE	RMSE	MCA/%
1	×	×	×	24.50	34.37	88.89
2	√	×	×	20.93	28.53	90.34
3	×	√	×	21.94	30.13	89.15
4	×	×	√	18.19	25.07	90.70
5	√	√	√	14.24	22.22	91.23

Model	MAE/trees	RMSE/trees	MCA/%
IntegrateNet	35.43	50.44	84.56
PSGCNet	25.78	50.85	88.90
CANet	33.16	48.85	84.42
CSRNet	27.52	36.40	85.92
CLTR	25.54	46.60	89.14
LSC-CNN	24.50	34.37	88.89
Improved LSC-CNN	14.24	22.22	91.23
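The three metrics in this table can be computed from per-image predicted and ground-truth counts as sketched below. MAE and RMSE follow their standard definitions. The MCA formula, the mean over images of one minus the relative counting error, is an assumption about how the article defines it, and the function name and sample counts are hypothetical.

```python
import math

def counting_metrics(predicted, actual):
    """Return (MAE, RMSE, MCA in %) for per-image predicted and true counts.

    MCA is assumed to be the mean of 1 - |pred - true| / true over images;
    every true count must be greater than zero.
    """
    n = len(actual)
    errors = [p - a for p, a in zip(predicted, actual)]
    mae = sum(abs(e) for e in errors) / n
    rmse = math.sqrt(sum(e * e for e in errors) / n)
    mca = 100.0 * sum(1.0 - abs(e) / a for e, a in zip(errors, actual)) / n
    return mae, rmse, mca

# Two hypothetical images with 100 trees each, counted as 90 and 110.
mae, rmse, mca = counting_metrics([90, 110], [100, 100])
```

RMSE weights large per-image errors more heavily than MAE, which is why the two can rank models differently, as with PSGCNet and CSRNet in the table.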
