Wheat Lodging Types Detection Based on UAV Image Using Improved EfficientNetV2

Smart Agriculture, 2023, Vol. 5, Issue 3: 62-74. doi: 10.12133/j.smartag.SA202308010

Special Issue: Crop Information Monitoring Technology

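LONG Jianing¹,², ZHANG Zhao¹,², LIU Xiaohang¹,², LI Yunxia¹,², RUI Zhaoyu¹,², YU Jiangfan¹,², ZHANG Man¹,², FLORES Paulo³, HAN Zhexiong⁴,⁵, HU Can⁶, WANG Xufeng⁶

1. College of Information and Electrical Engineering, China Agricultural University, Beijing 100080, China
2. Key Laboratory of Agricultural Information Acquisition Technology, Ministry of Agriculture and Rural Affairs, China Agricultural University, Beijing 100083, China
3. Department of Agricultural and Bioengineering, North Dakota State University, Fargo, North Dakota 58102, USA
4. Department of Biosystems Engineering, Kangwon University, Chuncheon, Gangwon-do 24341, Korea
5. Interdisciplinary Program in Smart Agriculture, Kangwon University, Chuncheon, Gangwon-do 24341, Korea
6. College of Mechanical and Electrical Engineering, Tarim University, Alar, Xinjiang 843300, China

Received: 2023-08-04 Published: 2023-09-30
Funding: National Key Research and Development Program of China (2022YFD2001500)
About the author: LONG Jianing, research interest: agricultural robots. E-mail: 614020890@qq.com
Corresponding author: ZHANG Zhao, Ph.D., professor, research interest: intelligent agricultural equipment. E-mail: zhaozhangcau@cau.edu.cn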

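Abstract

Summary:

[Objective/Significance] Different types of wheat lodging (root lodging and stem lodging) affect yield and quality differently. This study aimed to classify wheat lodging types from UAV images and to examine how UAV flight altitude affects classification performance. [Methods] Images of a wheat test field were acquired at three UAV flight altitudes (15, 45, and 91 m), and an automatic segmentation algorithm was used to generate a dataset for each altitude. An improved model, EfficientNetV2-C, was proposed to classify the images. The model introduces the coordinate attention (CA) mechanism to strengthen the network's feature extraction and adopts class-balanced focal loss (CB-Focal Loss) to counter the effect of data imbalance on classification accuracy. [Results and Discussions] The improved EfficientNetV2-C performed best, with an average accuracy of 93.58%. Among the unimproved models, four machine learning classifiers (support vector machine (SVM), K nearest neighbor (KNN), decision tree (DT), and naive Bayes (NB)) and two deep learning classifiers (ResNet101 and EfficientNetV2) were compared; EfficientNetV2 performed best at every altitude, with an average accuracy of 82.67%. Flight altitude had no significant effect on the performance of the four machine learning classifiers, whereas the classification performance of the deep learning models declined as altitude increased because image feature information was lost. [Conclusions] The improved EfficientNetV2-C achieved high accuracy in detecting wheat lodging types and offers a new solution for wheat lodging early warning and crop management.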


Abstract:

[Objective] Wheat, one of the major global food crops, plays a key role in food production and supply. Different factors lead to different types of wheat lodging: root lodging may result from improper fertilizer use, whereas stem lodging is mostly caused by harsh environmental conditions. The lodging types in turn affect yield and quality differently. This study aimed to classify wheat lodging types from unmanned aerial vehicle (UAV) images and to investigate the effect of UAV flight altitude on classification performance. [Methods] Images of wheat test fields were acquired at three UAV flight altitudes (15, 45, and 91 m). The method comprised three parts: an automatic segmentation algorithm, selection of a classification model, and an improved classification model, EfficientNetV2-C. In the first part, the automatic segmentation algorithm segmented the test-field images acquired at the three altitudes to build the training datasets needed by the classification models. The original images were first preprocessed by scaling, skew correction, and other operations to save computation time and improve segmentation accuracy. The green part of each preprocessed image was then extracted with the excess green (ExG) algorithm, binarized, and combined with an edge contour extraction algorithm to remove the redundant parts of the image and extract the region of interest, which gave the first segmentation.
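The green-extraction and binarization step described above (the super green, or excess green, index) can be illustrated with a minimal sketch. It assumes the common definition ExG = 2g − r − b on chromaticity-normalized channels and a fixed threshold; the abstract specifies neither, so the function name `excess_green_mask` and the value `threshold=0.1` are illustrative assumptions rather than the authors' implementation. Contour extraction is not shown.

```python
def excess_green_mask(image, threshold=0.1):
    """Binarize an RGB image with the excess green index.

    image: list of rows, each row a list of (R, G, B) tuples.
    threshold: hypothetical cut-off on ExG; the paper's value is not given.
    Returns a list of rows of 0/1 values (1 = vegetation).
    """
    mask = []
    for row in image:
        mask_row = []
        for red, green, blue in row:
            total = red + green + blue
            if total == 0:
                exg = 0.0  # a black pixel carries no colour information
            else:
                # Chromaticity coordinates r, g, b sum to 1, so ExG = 2g - r - b.
                exg = (2 * green - red - blue) / total
            mask_row.append(1 if exg > threshold else 0)
        mask.append(mask_row)
    return mask


# Tiny 2 x 2 example: one canopy-like pixel, one soil-like pixel,
# one black pixel, and one grey pixel.
sample = [
    [(40, 200, 30), (120, 100, 80)],
    [(0, 0, 0), (128, 128, 128)],
]
mask = excess_green_mask(sample)
```

In practice the threshold is often chosen per image (for example with Otsu's method) rather than fixed; a fixed value is used here only to keep the sketch short.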
Finally, pixel values were accumulated and abrupt changes in the accumulated values were used to locate the segmentation coordinates of the two sizes of wheat test plot in the image. The region of interest was then segmented a second time into long rectangular and short rectangular plots, which yielded the structural parameters of the different plot sizes and the datasets for the different altitudes. In the second part, four machine learning classification models, support vector machine (SVM), K nearest neighbor (KNN), decision tree (DT), and naive Bayes (NB), and two deep learning classification models (ResNet101 and EfficientNetV2) were selected. Without modification, the six models were used to classify the images collected at the three UAV flight altitudes, and the best-performing model was selected for improvement. In the third part, an improved model, EfficientNetV2-C, based on EfficientNetV2, was proposed to classify and recognize the lodging type of wheat in test-field images. The improvements concerned the attention mechanism and the loss function. The squeeze-and-excitation (SE) module of the original model was replaced with coordinate attention (CA), which embeds position information into channel attention: during feature extraction it aggregates features along the width and height directions separately, capturing long-range dependencies along one spatial direction while preserving accurate position information along the other, which enhances the spatial feature extraction capability of the network.
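To make the coordinate attention idea above concrete, the following is a minimal pure-Python sketch of the data flow: average pooling along each spatial direction, a shared 1x1 convolution, and two direction-specific gates. It is a sketch under stated assumptions, not the authors' network: batch normalization is omitted, ReLU stands in for the nonlinearity, and the names `conv1x1`, `coordinate_attention`, and all weight values are hypothetical.

```python
import math


def conv1x1(features, weight):
    """1x1 convolution: features is C_in x L, weight is C_out x C_in."""
    length = len(features[0])
    return [
        [sum(w * features[c][i] for c, w in enumerate(kernel)) for i in range(length)]
        for kernel in weight
    ]


def sigmoid(v):
    return 1.0 / (1.0 + math.exp(-v))


def coordinate_attention(x, w_reduce, w_height, w_width):
    """x is a C x H x W nested list; returns a reweighted tensor of the same shape."""
    channels, height, width = len(x), len(x[0]), len(x[0][0])
    # Average over the width: one descriptor per (channel, row).
    pooled_h = [[sum(x[c][i]) / width for i in range(height)] for c in range(channels)]
    # Average over the height: one descriptor per (channel, column).
    pooled_w = [[sum(x[c][i][j] for i in range(height)) / height for j in range(width)]
                for c in range(channels)]
    # Concatenate both descriptors and mix channels with a shared 1x1 convolution.
    joint = [pooled_h[c] + pooled_w[c] for c in range(channels)]
    mixed = [[max(0.0, v) for v in row] for row in conv1x1(joint, w_reduce)]  # ReLU stand-in
    # Split back into the two directions and map each to gates in (0, 1).
    gate_h = [[sigmoid(v) for v in row]
              for row in conv1x1([r[:height] for r in mixed], w_height)]
    gate_w = [[sigmoid(v) for v in row]
              for row in conv1x1([r[height:] for r in mixed], w_width)]
    # Each position is scaled by the gate of its row and the gate of its column.
    return [[[x[c][i][j] * gate_h[c][i] * gate_w[c][j] for j in range(width)]
             for i in range(height)] for c in range(channels)]


# Toy input with 2 channels, 2 rows, 3 columns, and made-up weights.
x = [[[1.0, 2.0, 3.0], [4.0, 5.0, 6.0]],
     [[0.5, 0.5, 0.5], [1.0, 1.0, 1.0]]]
w_reduce = [[0.5, -0.25]]        # 2 channels -> 1 intermediate channel
w_height = [[1.0], [-1.0]]       # 1 intermediate channel -> 2 channels
w_width = [[0.3], [0.7]]
y = coordinate_attention(x, w_reduce, w_height, w_width)
```

Because the row gate and the column gate are computed separately, the output weight at a position depends on where it lies in the image, which is the positional information that a purely channel-wise module such as SE discards.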
The loss function was replaced by class-balanced focal loss (CB-Focal Loss), which assigns loss weights according to the effective number of samples in each class and thereby mitigates the impact of data imbalance on classification accuracy. [Results and Discussions] The average classification accuracies of the four machine learning models were 81.95% for SVM, 79.56% for DT, 59.32% for KNN, and 59.48% for NB. The average classification accuracies of the two deep learning models were 79.61% for ResNet101 and 82.67% for EfficientNetV2. Comparing the six classification models, EfficientNetV2 performed best at all altitudes. The improved EfficientNetV2-C had an average accuracy of 93.58%, which was 10.91 percentage points higher than the average accuracy of EfficientNetV2. The SVM classification accuracies at the three flight altitudes of 15, 45, and 91 m were 81.33%, 83.51%, and 81.00%, respectively, with the highest accuracy at 45 m. The similar values across altitudes indicated that the class imbalance of the input data did not affect the model's classification performance, and that the SVM model handled the high dimensionality of the data efficiently and performed well on small and medium-sized datasets. For the deep learning models, however, classification performance decreased as the flight altitude increased from 15 to 91 m because image feature information was lost.
Specifically, the classification accuracy of ResNet101 decreased from 81.57% to 78.04%, that of EfficientNetV2 from 84.40% to 81.61%, and that of EfficientNetV2-C from 97.65% to 90.59%. For EfficientNetV2-C at each of the three altitudes, the differences among the precision, recall, and F₁-Score values were small, which indicated that the improved model effectively addressed the unbalanced classification results and poor classification performance caused by data imbalance. [Conclusions] The improved EfficientNetV2-C achieved high accuracy in wheat lodging type detection, which provides a new solution for wheat lodging early warning and crop management and is of great significance for improving wheat production efficiency and sustainable agricultural development.
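The class-balanced focal loss named in the Methods can be sketched as follows. This is an illustrative softmax variant, not the authors' training code: the class weights follow the effective-number formulation, (1 − β) / (1 − β^n) for a class with n samples, and the focal term (1 − p_t)^γ down-weights easy examples. The sample counts, `beta=0.999`, and `gamma=2.0` are assumed values, since the abstract reports none of them.

```python
import math


def class_balanced_weights(samples_per_class, beta=0.999):
    """Weights inversely proportional to the effective number of samples per class."""
    effective = [(1.0 - beta ** n) / (1.0 - beta) for n in samples_per_class]
    raw = [1.0 / e for e in effective]
    scale = len(raw) / sum(raw)  # normalize so the weights sum to the class count
    return [w * scale for w in raw]


def softmax(logits):
    peak = max(logits)  # subtract the maximum for numerical stability
    exps = [math.exp(z - peak) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cb_focal_loss(logits, label, weights, gamma=2.0):
    """Class-balanced focal loss for one sample with an integer class label."""
    p_t = softmax(logits)[label]
    return -weights[label] * (1.0 - p_t) ** gamma * math.log(p_t)


# Hypothetical class counts for non-lodging, root lodging, and stem lodging.
weights = class_balanced_weights([800, 100, 100])
easy = cb_focal_loss([5.0, 0.0, 0.0], label=0, weights=weights)
hard = cb_focal_loss([0.0, 0.0, 0.0], label=0, weights=weights)
```

With `gamma=0` the focal term vanishes and the expression reduces to a class-weighted cross-entropy, which is a convenient sanity check when tuning γ.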

Key words: wheat lodging types, image processing, deep learning, unbalanced data, machine learning, UAV

Cite this article: LONG Jianing, ZHANG Zhao, LIU Xiaohang, LI Yunxia, RUI Zhaoyu, YU Jiangfan, ZHANG Man, FLORES Paulo, HAN Zhexiong, HU Can, WANG Xufeng. Wheat Lodging Types Detection Based on UAV Image Using Improved EfficientNetV2[J]. Smart Agriculture, 2023, 5(3): 62-74.

Altitude/m	Metric	Non-lodging/%	Root lodging/%	Stem lodging/%
15	Precision	82.13	81.56	79.43
15	Recall	83.45	100.00	72.81
15	F₁-Score	84.23	84.23	77.79
15	Accuracy (overall)	81.33
45	Precision	83.56	95.13	85.50
45	Recall	85.11	100.00	79.35
45	F₁-Score	84.11	98.73	82.44
45	Accuracy (overall)	83.51
91	Precision	84.47	85.47	78.02
91	Recall	73.97	100.00	81.28
91	F₁-Score	82.30	82.45	80.60
91	Accuracy (overall)	81.00

Altitude/m	Lodging type	ResNet101 Precision/%	ResNet101 Recall/%	ResNet101 F₁-Score/%	EfficientNetV2 Precision/%	EfficientNetV2 Recall/%	EfficientNetV2 F₁-Score/%	EfficientNetV2-C Precision/%	EfficientNetV2-C Recall/%	EfficientNetV2-C F₁-Score/%
15	Non-lodging	77.42	90.00	83.24	80.08	92.50	88.09	97.53	98.75	98.14
15	Root lodging	84.71	84.71	84.71	88.59	85.53	87.03	96.59	100.00	98.27
15	Stem lodging	83.12	71.11	76.65	82.22	73.78	79.05	98.84	94.44	96.59
15	Accuracy (overall)/%	81.57			84.40			97.65
45	Non-lodging	77.08	92.50	84.09	83.72	90.00	86.75	84.62	96.25	90.06
45	Root lodging	84.21	75.29	79.50	79.55	82.35	80.92	92.13	96.47	94.25
45	Stem lodging	77.11	71.11	73.99	76.54	68.89	72.51	93.59	81.11	86.90
45	Accuracy (overall)/%	79.22			82.00			92.50
91	Non-lodging	79.79	93.75	86.21	81.11	91.25	85.88	87.95	91.25	89.57
91	Root lodging	78.31	76.47	77.38	85.33	75.29	80.00	87.21	93.75	90.36
91	Stem lodging	75.64	65.56	70.24	73.33	73.33	73.33	92.41	81.11	86.39
91	Accuracy (overall)/%	78.04			81.61			90.59
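The accuracy rows of the table above are enough to recompute the average accuracies quoted in the summary (93.58% for EfficientNetV2-C and 82.67% for EfficientNetV2), and the F₁-Score entries follow from precision and recall as F₁ = 2PR / (P + R). The short sketch below redoes this arithmetic; the helper names `mean_accuracy` and `f1_score` are illustrative only.

```python
def mean_accuracy(values):
    """Arithmetic mean of per-altitude accuracies, rounded to two decimals."""
    return round(sum(values) / len(values), 2)


def f1_score(precision, recall):
    """Harmonic mean of precision and recall (both given in percent)."""
    return 2 * precision * recall / (precision + recall)


# Overall accuracy (%) at 15, 45, and 91 m, copied from the table.
accuracy = {
    "ResNet101": [81.57, 79.22, 78.04],
    "EfficientNetV2": [84.40, 82.00, 81.61],
    "EfficientNetV2-C": [97.65, 92.50, 90.59],
}

averages = {model: mean_accuracy(values) for model, values in accuracy.items()}
# Gain of the improved model over its baseline, in percentage points.
gain = round(averages["EfficientNetV2-C"] - averages["EfficientNetV2"], 2)

# F1 for the non-lodging class at 15 m, from the tabulated precision and recall.
f1_resnet = round(f1_score(77.42, 90.00), 2)
f1_improved = round(f1_score(97.53, 98.75), 2)
```

Averaging over the three altitudes gives 79.61% for ResNet101, 82.67% for EfficientNetV2, and 93.58% for EfficientNetV2-C, a gain of 10.91 percentage points for the improved model, and the two recomputed F₁ values (83.24 and 98.14) match the table.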
