欢迎您访问《智慧农业(中英文)》官方网站! English

Smart Agriculture ›› 2023, Vol. 5 ›› Issue (3): 62-74.doi: 10.12133/j.smartag.SA202308010

• 专刊--作物信息监测技术 • 上一篇    下一篇

利用改进EfficientNetV2和无人机图像检测小麦倒伏类型

龙佳宁1,2(), 张昭1,2(), 刘晓航1,2, 李云霞1,2, 芮照钰1,2, 余江帆1,2, 张漫1,2, FLORES Paulo3, 韩哲雄4,5, 胡灿6, 王旭峰6   

  1. 1. 中国农业大学 信息与电气工程学院,北京 100080,中国
    2. 中国农业大学农业农村部农业信息获取技术重点实验室,北京 100083,中国
    3. 北达科他州州立大学 农业与生物工程系,北达科他州法戈 58102,美国
    4. 韩国江原大学 生物系统工程系,江原道 春川 24341,韩国
    5. 韩国江原大学 智慧农业交叉学科,江原道 春川 24341,韩国
    6. 塔里木大学 机械电气化工程学院,新疆 阿拉尔 843300,中国
  • 收稿日期:2023-08-04 出版日期:2023-09-30
  • 基金项目:
    国家重点研发计划项目(2022YFD2001500)
  • 作者简介:
    龙佳宁,研究方向为农业机器人。E-mail:
  • 通信作者:
    张 昭,博士,教授,研究方向为农业智能装备。E-mail:

Wheat Lodging Types Detection Based on UAV Image Using Improved EfficientNetV2

LONG Jianing1,2(), ZHANG Zhao1,2(), LIU Xiaohang1,2, LI Yunxia1,2, RUI Zhaoyu1,2, YU Jiangfan1,2, ZHANG Man1,2, FLORES Paulo3, HAN Zhexiong4,5, HU Can6, WANG Xufeng6   

  1. 1. College of Information and Telecommunications, China Agricultural University, Beijing 100080, China
    2. Key Laboratory of Agricultural Information Acquisition Technology, Ministry of Agriculture and Rural Affairs, China Agricultural University, Beijing 100083, China
    3. Department of Agricultural and Bioengineering, North Dakota State University, Fargo 58102, USA
    4. Department of Biosystems Engineering, Kangwon University, Chuncheon 24341, Korea
    5. Department of Smart Agriculture Interdisciplinary, Gangwon University, Chuncheon 24341, Korea
    6. College of Mechanical and Electronic Engineering, Tarim University, Alar, Xinjiang 843300, China
  • Received:2023-08-04 Online:2023-09-30
  • Foundation items:National Key Research and Development Program of China(2022YFD2001500)
  • About author:LONG Jianing, E-mail:614020890@qq.com
  • Corresponding author:ZHANG  Zhao, E-mail:

摘要:

[目的/意义] 不同类型的小麦倒伏(根部倒伏、茎部倒伏)对产量和质量会产生不同影响。本研究旨在通过无人机图像对小麦倒伏类型进行分类,并探究无人机飞行高度对分类性能的影响。 [方法] 研究设置3个无人机飞行高度(15、45、91 m)来获取小麦试验田的图像,并利用自动分割算法生成不同高度的数据集,提出一种EfficientNetV2-C改进模型对其进行分类识别。模型通过引入CA(Coordinate Attention)注意力机制来提升网络特征提取能力,并结合CB-Focal Loss(Class–Balanced Focal Loss)来解决数据不均衡对模型分类准确度的影响。 [结果和讨论] 改进的EfficientNetV2-C表现最佳,平均准确率达到93.58%。对比未改进的4种机器学习分类模型(支持向量机(Support Vector Machine,SVM)、K最近邻(K Nearest Neighbor,KNN)、决策树(Decision Tree,DT)和朴素贝叶斯(Naive Bayes,NB))与两种深度学习分类模型(ResNet101和EfficientNetV2),其中EfficientNetV2在各个高度下表现最优,平均准确率达到82.67%。无人机飞行高度对4种机器学习分类器性能无显著影响,但随飞行高度上升,由于图像特征信息损失,深度学习模型的分类性能下降。 [结论] 改进的EfficientNetV2-C在小麦倒伏类型检测方面取得了较高的准确率,为小麦倒伏预警和农作物管理提供了新的解决方案。

关键词: 小麦倒伏类型, 图像处理, 深度学习, 不平衡数据, 机器学习, 无人机

Abstract:

[Objective] Wheat, as one of the major global food crops, plays a key role in food production and food supply. Different influencing factors can lead to different types of wheat lodging, e.g., root lodging may be due to improper use of fertilizers. While stem lodging is mostly due to harsh environments, different types of wheat lodging can have different impacts on yield and quality. The aim of this study was to categorize the types of wheat lodging by unmanned aerial vehicle (UAV) image detection and to investigate the effect of UAV flight altitude on the classification performance. [Methods] Three UAV flight altitudes (15, 45, and 91 m) were set to acquire images of wheat test fields. The main research methods contained three parts: an automatic segmentation algorithm, wheat classification model selection, and an improved classification model based on EfficientNetV2-C. In the first part, the automatic segmentation algorithm was used to segment the UAV to acquire the wheat test field at three different heights and made it into the training dataset needed for the classification model. The main steps were first to preprocess the original wheat test field images acquired by the UAV through scaling, skew correction, and other methods to save computation time and improve segmentation accuracy. Subsequently, the pre-processed image information was analyzed, and the green part of the image was extracted using the super green algorithm, which was binarized and combined with the edge contour extraction algorithm to remove the redundant part of the image to extract the region of interest, so that the image was segmented for the first time. Finally, the idea of accumulating pixels to find sudden value added was used to find the segmentation coordinates of two different sizes of wheat test field in the image, and the region of interest of the wheat test field was segmented into a long rectangle and a short rectangle test field twice, so as to obtain the structural parameters of different sizes of wheat test field and then to generate the dataset of different heights. In the second part, four machine learning classification models of support vector machine (SVM), K nearest neighbor (KNN), decision tree (DT), and naive bayes (NB), and two deep learning classification models (ResNet101 and EfficientNetV2) were selected. Under the unimproved condition, six classification models were utilized to classify the images collected from three UAVs at different flight altitudes, respectively, and the optimal classification model was selected for improvement. In the third part, an improved model, EfficientNetV2-C, with EfficientNetV2 as the base model, was proposed to classify and recognized the lodging type of wheat in test field images. The main improvement points were attention mechanism improvement and loss function improvement. The attention mechanism was to replace the original model squeeze and excitation (SE) with coordinate attention (CA), which was able to embed the position information into the channel attention, aggregate the features along the width and height directions, respectively, during feature extraction, and capture the long-distance correlation in the width direction while retaining the long-distance correlation in the length direction, accurate location information, enhancing the feature extraction capability of the network in space. The loss function was replaced by class-balanced focal loss (CB-Focal Loss), which could assign different loss weights according to the number of valid samples in each class when targeting unbalanced datasets, effectively solving the impact of data imbalance on the classification accuracy of the model. [Results and Discussions] Four machine learning classification results: SVM average classification accuracy was 81.95%, DT average classification accuracy was 79.56%, KNN average classification accuracy was 59.32%, and NB average classification accuracy was 59.48%. The average classification accuracy of the two deep learning models, ResNet101 and EfficientNetV2, was 78.04%, and the average classification accuracy of ResNet101 was 81.61%. Comparing the above six classification models, the EfficientNetV2 classification model performed optimally at all heights. And the improved EfficientNetV2-C had an average accuracy of 90.59%, which was 8.98% higher compared to the average accuracy of EfficientNetV2. The SVM classification accuracies of UAVs at three flight altitudes of 15, 45, and 91 m were 81.33%, 83.57%, and 81.00%, respectively, in which the accuracy was the highest when the altitude was 45 m, and the classification results of the SVM model values were similar to each other, which indicated that the imbalance of the input data categories would not affect the model's classification effect, and the SVM classification model was able to solve the problem of high dimensionality of the data efficiently and had a good performance for small and medium-sized data sets. The SVM classification model could effectively solve the problem of the high dimensionality of data and had a better classification effect on small and medium-sized datasets. For the deep learning classification model, however, as the flight altitude increases from 15 to 91 m, the classification performance of the deep learning model decreased due to the loss of image feature information. Among them, the classification accuracy of ResNet101 decreased from 81.57% to 78.04%, the classification accuracy of EfficientNetV2 decreased from 84.40% to 81.61%, and the classification accuracy of EfficientNetV2-C decreased from 97.65% to 90.59%. The classification accuracy of EfficientNetV2-C at each of the three altitudes. The difference between the values of precision, recall, and F1-Score results of classification was small, which indicated that the improved model in this study could effectively solve the problems of unbalanced model classification results and poor classification effect caused by data imbalance. [Conclusions] The improved EfficientNetV2-C achieved high accuracy in wheat lodging type detection, which provides a new solution for wheat lodging early warning and crop management and is of great significance for improving wheat production efficiency and sustainable agricultural development.

Key words: wheat lodging types, image processing, deep learning, unbalanced data, machine learning, UAV