欢迎您访问《智慧农业(中英文)》官方网站! English

Smart Agriculture ›› 2024, Vol. 6 ›› Issue (3): 34-45.doi: 10.12133/j.smartag.SA202308002

• 专题--丘陵山区智慧农业技术与机械 • 上一篇    下一篇

基于深度学习的四川盆地丘陵区县域耕地遥感识别研究

李豪1(), 杜雨秋2,3, 肖星竹1, 陈彦羲1   

  1. 1. 四川农业大学 资源学院, 四川 成都 611130, 中国
    2. 中国科学院地理科学与资源研究所, 北京 100101, 中国
    3. 中国科学院大学, 北京 101408, 中国
  • 收稿日期:2023-07-28 出版日期:2024-05-30
  • 基金资助:
    国家自然科学基金项目(41501291); 国家级大学生创新训练计划项目(202110626010); 四川天府新区乡村振兴研究院“揭榜挂帅”项目(XZY1-14)
  • 通信作者:
    李 豪,博士,研究方向为地理大数据智能分析。E-mail:

Remote Sensing Identification Method of Cultivated Land at Hill County of Sichuan Basin Based on Deep Learning

LI Hao1(), DU Yuqiu2,3, XIAO Xingzhu1, CHEN Yanxi1   

  1. 1. College of Resources, Sichuan Agricultural University, Chengdu 611130, China
    2. Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing 100101, China
    3. University of Chinese Academy of Sciences, Beijing 101408, China
  • Received:2023-07-28 Online:2024-05-30
  • corresponding author:
    LI Hao, E-mail:
  • Supported by:
    The National Natural Science Foundation of China(41501291); The Innovation Training Project for Undergraduates of China(202110626010); The Sichuan Tianfu New Area Rural Vitalization Research Institute's 'Revealing the Leaderboard' Project(XZY1-14)

摘要:

[目的/意义] 耕地识别是农业遥感的重要应用领域之一,但现阶段深度学习等人工智能方法在中国西南丘陵区遥感识别的研究应用深度仍然不够,识别精度有待提升。 [方法] 为了快速、精确地获取耕地面积、分布等信息,基于高分辨率的高分六号(GF-6)遥感影像,运用UNet++、DeeplabV3+、UNet与PSPNet等新型深度学习模型对四川省绵阳市三台县耕地信息进行识别,并对各深度学习模型、传统机器学习方法——随机森林法及新型土地覆盖产品SinoLC-1的识别精度进行对比分析,以期深入探索深度学习方法在地物遥感识别领域的应用前景。 [结果和讨论] 深度学习模型在F1分数、整体精度(Overall Accuracy, OA)、Kappa系数等精度评价指标的表现上,相比于传统机器学习方法和新型土地覆盖产品均有显著提升,精度提升幅度分别可达20%和50%;其中添加了密集跳跃连接技术的UNet++模型的识别效果最好,其F1分数、交并比(Intersection over Union, IoU)、平均交并比(Mean Intersection over Union, MIoU)、OA 值和 Kappa系数值分别为0.92、85.93%、81.93%、90.60%和0.80。应用UNet++模型对2种由仅光谱特征以及光谱+地形特征两种不同特征构建的影像进行耕地提取,光谱+地形特征模型的IoU、OA和Kappa 3个指标比仅光谱特征模型分别提高了0.98%、1.10%和0.01。 [结论] 深度学习技术在应用于高分辨率遥感影像中的耕地识别方面展现出显著的实用价值,融合光谱和地形特征可以实现信息互补,能进一步改善耕地的识别效果。本研究可为相关部门更好地管理和利用耕地资源、推动农业可持续发展提供技术支撑。

关键词: 深度学习, 遥感影像, 耕地识别, 精度评价, 丘陵地区

Abstract:

[Objective] To fully utilize and protect farmland and lay a solid foundation for the sustainable use of land, it is particularly important to obtain real-time and precise information regarding farmland area, distribution, and other factors. Leveraging remote sensing technology to obtain farmland data can meet the requirements of large-scale coverage and timeliness. However, the current research and application of deep learning methods in remote sensing for cultivated land identification still requires further improvement in terms of depth and accuracy. The objective of this study is to investigate the potential application of deep learning methods in remote sensing for identifying cultivated land in the hilly areas of Southwest China, to provide insights for enhancing agricultural land utilization and regulation, and for harmonizing the relationship between cultivated land and the economy and ecology. [Methods] Santai county, Mianyang city, Sichuan province, China (30°42'34"~31°26'35"N, 104°43'04"~105°18'13"E) was selected as the study area. High-resolution imagery from two scenes captured by the Gaofen-6 (GF-6) satellite served as the primary image data source. Additionally, 30-meter resolution DEM data from the United States National Aeronautics and Space Administration (NASA) in 2020 was utilized. A land cover data product, SinoLC-1, was also incorporated for comparative evaluation of the accuracy of various extraction methods' results. Four deep learning models, namely Unet, PSPNet, DeeplabV3+, and Unet++, were utilized for remote sensing land identification research in cultivated areas. The study also involved analyzing the identification accuracy of cultivated land in high-resolution satellite images by combining the results of the random forest (RF) algorithm along with the deep learning models. A validation dataset was constructed by randomly generating 1 000 vector validation points within the research area. Concurrently, Google Earth satellite images with a resolution of 0.3 m were used for manual visual interpretation to determine the land cover type of the pixels where the validation points are located. The identification results of each model were compared using a confusion matrix to compute five accuracy evaluation metrics: Overall accuracy (OA), intersection over union (IoU), mean intersection over union (MIoU), F1-Score, and Kappa Coefficient to assess the cultivated land identification accuracy of different models and data products. [Results and Discussions] The deep learning models displayed significant advances in accuracy evaluation metrics, surpassing the performance of traditional machine learning approaches like RF and the latest land cover product, SinoLC-1 Landcover. Among the models assessed, the UNet++ model performed the best, its F1-Score, IoU, MIoU, OA, and Kappa coefficient values were 0.92, 85.93%, 81.93%, 90.60%, and 0.80, respectively. DeeplabV3+, UNet, and PSPNet methods followed suit. These performance metrics underscored the superior accuracy of the UNet++ model in precisely identifying and segmenting cultivated land, with a remarkable increase in accuracy of nearly 20% than machine learning methods and 50% for land cover products. Four typical areas of town, water body, forest land and contiguous cultivated land were selected to visually compare the results of cultivated land identification results. It could be observed that the deep learning models generally exhibited consistent distribution patterns with the satellite imageries, accurately delineating the boundaries of cultivated land and demonstrating overall satisfactory performance. However, due to the complex features in remote sensing images, the deep learning models still encountered certain challenges of omission and misclassification in extracting cultivated land. Among them, the UNet++ model showed the closest overall extraction results to the ground truth and exhibited advantages in terms of completeness of cultivated land extraction, discrimination between cultivated land and other land classes, and boundary extraction compared to other models. Using the UNet++ model with the highest recognition accuracy, two types of images constructed with different features—solely spectral features and spectral combined with terrain features—were utilized for cultivated land extraction. Based on the three metrics of IoU, OA, and Kappa, the model incorporating both spectral and terrain features showed improvements of 0.98%, 1.10%, and 0.01% compared to the model using only spectral features. This indicated that fusing spectral and terrain features can achieve information complementarity, further enhancing the identification effectiveness of cultivated land. [Conclusions] This study focuses on the practicality and reliability of automatic cultivated land extraction using four different deep learning models, based on high-resolution satellite imagery from the GF-6 in Santai county in China. Based on the cultivated land extraction results in Santai county and the differences in network structures among the four deep learning models, it was found that the UNet++ model, based on UNet, can effectively improve the accuracy of cultivated land extraction by introducing the mechanism of skip connections. Overall, this study demonstrates the effectiveness and practical value of deep learning methods in obtaining accurate farmland information from high-resolution remote sensing imagery.

Key words: deep learning, remote sensing images, cultivated land identification, accuracy evaluation, hilly region