欢迎您访问《智慧农业(中英文)》官方网站! English

Smart Agriculture ›› 2025, Vol. 7 ›› Issue (6): 185-195.doi: 10.12133/j.smartag.SA202507032

• 专刊--遥感+AI 赋能农业农村现代化 • 上一篇    

基于双分支与多尺度注意力机制的稻虾田遥感提取方法

张运1,2, 张露敏1,2, 许广涛1,2, 郝佳慧1,2   

  1. 1. 安徽师范大学 地理与旅游学院,安徽 芜湖 241002,中国
    2. 资源环境与地理信息工程安徽省工程技术研究中心,安徽 芜湖 241002,中国
  • 收稿日期:2025-07-23 出版日期:2025-11-30
  • 基金项目:
    高分辨率对地观测系统国家科技重大专项(76-Y50G14-0038-22/23)
  • 通信作者:
    张 运,博士,副教授,研究方向为资源与环境遥感。E-mail:

Remote Sensing Extraction Method of Rice-Crayfish Fields Based on Dual-Branch and Multi-Scale Attention

ZHANG Yun1,2, ZHANG Lumin1,2, XU Guangtao1,2, HAO Jiahui1,2   

  1. 1. School of Geography and Tourism, Anhui Normal University, Wuhu 241002, China
    2. Engineering Technology Research Center of Resources Environment and GIS of Anhui Province, Wuhu 241002, China
  • Received:2025-07-23 Online:2025-11-30
  • Foundation items:National Major Science and Technology Project of China High-resolution Earth Observation System(76-Y50G14-0038-22/23)
  • Corresponding author:
    ZHANG Yun, E-mail:

摘要:

[目的/意义] 稻虾共作是一种高效生态农业种养模式,具有显著的经济效益与生态效益,精准获取稻虾田信息对促进农业资源优化、生态保护及可持续发展具有重要意义。针对现有方法易受光谱混淆与边界模糊影响,提取精度不足的问题,提出一种融合多时相与多尺度特征的深度学习模型,以提升稻虾田遥感识别精度。 [方法] 以多时相高分二号遥感影像为数据源,构建一种基于DBAP-NetDual-branch Attention Pyramid Network)深度网络模型的稻虾田遥感提取方法。该模型以U-Net为基础,采用双分支编码器结构提取多时相时序特征,集成空洞空间金字塔池化(Atrous Spatial Pyramid Pooling, ASPP)模块增强多尺度空间信息表达能力,并嵌入注意力机制提高对稻虾田空间结构的关注能力。 [结果与讨论] DBAP-Net模型表现出优异的稻虾田提取能力,F1分数、交并比(Intersection over Union, IoU)及马修斯相关系数(Matthews Correlation Coefficient, MCC)分别为91.79%、84.82%、87.60%,与U-Net、PSPNet、DeepLabV3+、SegFormer及TransUNet等经典模型相比,IoU分别提高4.25、4.96、4.57、5.12、2.81个百分点。将该模型应用于研究区全域,总体精度(Overall Accuracy, OA)为96.00%,Kappa系数为0.920,在提取精度与空间完整性等方面均显著优于传统的水体季相差异法、随机森林法和时序指数阈值法。 [结论] DBAP-Net深度网络模型能够实现稻虾田高精度提取,为稻虾田精细化提取提供一种新的方法和思路。

关键词: 遥感, 语义分割, 深度学习, 稻虾田, 高分二号

Abstract:

[Objective] Rice-crayfish co-culture represents a highly efficient ecological agricultural system that simultaneously provides substantial economic returns and ecological benefits. Accurately obtaining spatial distribution information on rice-crayfish fields is of great importance for promoting the optimal allocation of agricultural resources, supporting ecological protection, and facilitating sustainable agricultural development. In regions characterized by complex terrain, fragmented plots, and diverse planting structures, traditional extraction approaches are often influenced by spectral confusion and indistinct field boundaries, making it difficult to achieve high-precision identification of rice-crayfish fields. The aim of this research is to develop a deep learning model that integrates multi-temporal and multi-scale feature information to improve the accuracy of remote sensing identification of rice-crayfish fields. [Methods] Multi-temporal GF-2 satellite imagery was employed as the primary data source, and a deep learning-based extraction method named the dual-branch attention pyramid network (DBAP-Net) was developed. The proposed model was established upon the U-Net framework and designed with a dual-branch encoder architecture to extract temporal features from different phenological stages, thereby fully capturing the spectral variations of rice-crayfish fields between the paddy flooding and rice-growing periods. The convolutional block attention module (CBAM) was embedded in each encoder layer and in the skip connections to adaptively adjust feature weights along both the channel and spatial dimensions, enhancing the network's capacity to emphasize critical spatial structures of rice-crayfish fields while effectively suppressing background noise and redundant information. During the feature fusion stage, an atrous spatial pyramid pooling (ASPP) module was incorporated to aggregate contextual information from multiple receptive fields through multi-scale atrous convolutions, improving the model's capability for multi-scale spatial information representation. [Results and Discussions] A quantitative performance evaluation of each DBAP-Net component was conducted through ablation experiments. The results demonstrated that the introduction of a dual-branch structure improved overall accuracy (OA), F1-Score, intersection over union (IoU), and Matthews correlation coefficient (MCC) by 0.48, 0.57, 0.94, and 0.99 percentage points, respectively. Incorporating the CBAM module further enhanced these metrics by 0.52, 0.85, 1.40, and 1.20 percentage points, while the addition of the ASPP module yielded further increases of 0.59, 1.09, 1.81, and 1.49 percentage points, respectively. The DBAP-Net model achieved the highest comprehensive performance, with an OA of 94.45%, F1-Score of 91.79%, IoU of 84.82%, and MCC of 87.60%. These values represented respective improvements of 1.83, 2.55, 4.25, and 3.98 percentage points compared with the baseline U-Net model. These findings indicated that each enhancement module made a substantial contribution to improving both feature representation and spatial boundary delineation. DBAP-Net was further compared with five representative semantic segmentation networks, U-Net, PSPNet, DeepLabV3+, SegFormer, and TransUNet, to comprehensively evaluate its generalization and segmentation performance. The results demonstrated that DBAP-Net consistently achieved higher overall accuracy, precision, F1-Score, and IoU than all other comparison models. Specifically, compared with U-Net, PSPNet, and DeepLabV3+, the F1-Scores increased by 2.55, 2.98, and 2.75 percentage points, while the IoU values improved by 4.25, 4.96, and 4.57 percentage points, respectively. In comparison with the more recent models, SegFormer and TransUNet, DBAP-Net's F1-Score was higher by 3.09 and 1.67 percentage points, and its IoU was enhanced by 5.12 and 2.81 percentage points. Visualization of the segmentation results further revealed that DBAP-Net produced clear segmentation boundaries with complete fields, significantly reducing misclassification and omission rates. In contrast, other models exhibited varying degrees of boundary blurring and fragmentation. When applied across the entire study area, DBAP-Net demonstrated strong robustness and stability. The model achieved an overall accuracy of 96.00%, with a Kappa coefficient of 0.920. The producer's accuracy and user's accuracy were 95.13% and 97.38%, respectively. Compared with traditional approaches such as the seasonal water-difference method, random forest classification, and temporal index thresholding, DBAP-Net significantly improved both extraction precision and spatial completeness, particularly under conditions of complex terrain and fragmented agricultural landscapes. [Conclusions] DBAP-Net, by integrating multi-temporal spectral and multi-scale spatial information, significantly improves the accuracy and completeness of remote sensing extraction of rice-crayfish fields from high-resolution remote sensing imagery. The model provides a reliable and adaptable technical framework for fine-scale monitoring, precise mapping, and sustainable management of rice-crayfish co-culture systems, offering valuable methodological support for agricultural resource assessment and ecological protection.

Key words: remote sensing, semantic segmentation, deep learning, rice-crayfish, GF-2

中图分类号: