Parcel-Scale Crop Distribution Mapping Based on Stacking Ensemble Learning

doi:10.12133/j.smartag.SA202509003

Abstract

[Objective] With the widespread availability of high-resolution, multi-source remote sensing data, remote sensing-based crop classification plays an increasingly vital role in agricultural monitoring, yield estimation, and land use management. However, traditional pixel-level classification methods often struggle to achieve stable, high-precision results under intra-parcel heterogeneity, spectral confusion, and noise interference. This study therefore aimed to improve parcel-level crop classification accuracy and spatial consistency by constructing a multi-source feature fusion and ensemble learning framework that exploits complementary spectral, spatial, temporal, and productivity characteristics to enhance robustness and generalization in multi-crop classification tasks. [Methods] A crop classification method integrating parcel-scale feature extraction, feature selection, and Stacking ensemble learning was proposed and validated. The approach leverages the complementarity of spectral, spatial, and temporal information through feature engineering and model fusion. The study area was Feicheng city, Shandong province. The data included multi-temporal Sentinel-2 optical imagery, Sentinel-1 SAR data, Gaofen remote sensing imagery, and parcel vector samples with crop_type attributes. During preprocessing, all imagery underwent radiometric and atmospheric correction, projection registration, and cropping to ensure spatial consistency and temporal correspondence across sensors. The dataset was constructed at the parcel level and comprised 3 200 parcels for model training and independent validation.
This study systematically constructed four types of meta-features at the parcel scale: spectral meta-features, comprising raw bands and vegetation indices; spatial meta-features, including texture, morphology, and structural indicators calculated from high-resolution imagery to reflect internal spatial heterogeneity; temporal meta-features, comprising vegetation indices, backscatter, and harmonic and temporal statistics extracted from multi-temporal optical and SAR imagery to characterize crop growth cycles; and crop primary productivity features, which highlight differences in carbon fixation and biomass accumulation among crops. For the resulting high-dimensional, multi-source feature set, LightGBM was combined with recursive feature elimination (RFE) to assess feature importance and select features. This retained the subset of features most critical to classification, improving model generalization and computational efficiency. Within the classification framework, a Stacking-based ensemble learning model was constructed. Base learners included random forest (RF), eXtreme gradient boosting (XGB), support vector machine (SVM), gradient boosting (GB), categorical boosting (CatBoost), adaptive boosting (ADA), back propagation (BP) neural network, K-nearest neighbors (KNN), and light gradient boosting machine (LightGBM). These base models learn and represent parcel features from distinct perspectives, capturing nonlinear relationships among spectral, spatial, and temporal characteristics. To compare the impact of different fusion strategies on classification performance, XGBClassifier, LightGBMClassifier, MLPClassifier, and LogisticRegression were each tested as the meta-learner. By contrasting the classification outcomes of the different meta-learners given the same base model outputs, their contributions to fusion accuracy and their differences in stability were analyzed.
During model training, stratified cross-validation was employed to mitigate bias caused by class imbalance. Overall accuracy (OA), the Kappa coefficient, and the F₁-Score served as the primary evaluation metrics, and the recall and precision of each crop category were also analyzed. [Results and Discussions] Feature selection significantly affected classification performance. By combining LightGBM with RFE, a subset of 102 optimal features was identified, comprising gross primary production (GPP), spectral features, vegetation indices, textural features, temporal features, and harmonic features. This selection effectively mitigated feature redundancy and multicollinearity, enhancing model stability and generalization capability. Among the selected features, GPP-related features and vegetation indices from key growth stages showed high discriminative power in distinguishing crop categories, reflecting the close coupling between remote sensing features and crop phenological information. The Stacking ensemble strategy demonstrated strong classification performance. Among the meta-learners compared, the Stacking model with XGBClassifier as the final learner achieved the highest classification accuracy (OA = 95.66%, Kappa = 0.900 6). It performed particularly well in identifying major crops such as maize while maintaining good adaptability to less common crops. The method's advantage extended beyond accuracy gains: the base learners provided complementary handling of spectral, temporal, and spatial features, and the meta-learner adaptively synthesized their outputs, enhancing classification stability and spatial consistency.
Compared to traditional pixel-level classification followed by parcel reclassification, direct feature extraction and classification based on vector parcels effectively avoided edge blending and noise interference inherent in pixel-level methods, significantly improving parcel recognition stability and accuracy. Experimental results demonstrated that parcel-level classification outperformed pixel-level strategies in overall accuracy and Kappa coefficient, with superior spatial consistency and robustness in classification outcomes. [Conclusions] The "optimal feature subset + Stacking ensemble learning + parcel-level classification" method developed in this research demonstrates outstanding accuracy and stability in multi-source remote sensing crop identification, providing an efficient and feasible technical pathway for parcel-level classification in complex agricultural landscapes. Future work will integrate high-resolution time-series data with deep learning models to further enhance the method's cross-regional adaptability and crop monitoring capabilities.
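
The feature-selection step described in the abstract (importance ranking by a gradient-boosted tree model combined with recursive feature elimination) can be illustrated roughly as follows. This is a minimal sketch under stated assumptions, not the authors' pipeline: the data are synthetic, scikit-learn's `RFE` is used, a `RandomForestClassifier` stands in for LightGBM so that the example depends only on scikit-learn, and the feature counts and parameter values are illustrative.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.feature_selection import RFE

# Synthetic stand-in for a parcel-level feature table (rows = parcels).
X, y = make_classification(
    n_samples=300, n_features=40, n_informative=8, n_redundant=10,
    n_classes=3, random_state=42,
)

# Tree ensemble exposing feature_importances_; the study uses LightGBM here,
# RandomForest is an assumption made to keep the sketch dependency-light.
ranker = RandomForestClassifier(n_estimators=50, random_state=42)

# Repeatedly fit the ranker and drop the 5 least important features
# until 10 remain (both numbers are illustrative, not from the study).
selector = RFE(estimator=ranker, n_features_to_select=10, step=5)
selector.fit(X, y)

X_selected = selector.transform(X)
selected_idx = [i for i, keep in enumerate(selector.support_) if keep]
print(X_selected.shape, selected_idx)
```

In practice the number of retained features would be chosen by comparing cross-validated accuracy across candidate subset sizes rather than fixed in advance.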

Key words: crop classification, field-scale mapping, spatiotemporal coordination, intelligent interpretation, feature fusion

CLC Number: S274; S127

XIE Wenhao, ZHANG Xin, DONG Wen, ZHENG Yizhen, CHENG Bo, TU Wenli, SUN Fengqing. Parcel-Scale Crop Distribution Mapping Based on Stacking Ensemble Learning[J]. Smart Agriculture, 2025, 7(6): 196-209.

Table 3

Precision evaluation indicators and calculation methods

Indicator	Formula
Overall accuracy (OA)	$OA = \frac{\sum_i P_{i,i}}{\sum_i P_{i+}}$ (1)
Precision	$Precision = \frac{P_{i,i}}{P_{+i}}$ (2)
Recall	$Recall = \frac{P_{i,i}}{P_{i+}}$ (3)
Kappa coefficient	$Kappa = \frac{N \sum_i P_{i,i} - \sum_i (P_{i+} P_{+i})}{N^2 - \sum_i (P_{i+} P_{+i})}$ (4)
F₁-Score	$F_1 = \frac{2 \times Precision \times Recall}{Precision + Recall}$ (5)
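
As a worked illustration of the indicators in Table 3, the sketch below computes them from a confusion matrix in plain Python. It assumes that rows are reference classes and columns are predicted classes, so that row totals play the role of $P_{i+}$ and column totals the role of $P_{+i}$; the function name `classification_metrics` and the example matrix are hypothetical and not taken from the study.

```python
def classification_metrics(cm):
    """Compute OA, Kappa, and per-class precision/recall/F1 from a square
    confusion matrix. Assumption: rows are reference classes and columns
    are predicted classes."""
    k = len(cm)
    n = sum(sum(row) for row in cm)                                 # N
    diag = [cm[i][i] for i in range(k)]                             # P_{i,i}
    row_tot = [sum(cm[i]) for i in range(k)]                        # P_{i+}
    col_tot = [sum(cm[r][i] for r in range(k)) for i in range(k)]   # P_{+i}

    oa = sum(diag) / n
    chance = sum(r * c for r, c in zip(row_tot, col_tot))
    kappa = (n * sum(diag) - chance) / (n * n - chance)

    per_class = []
    for i in range(k):
        precision = diag[i] / col_tot[i] if col_tot[i] else 0.0
        recall = diag[i] / row_tot[i] if row_tot[i] else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        per_class.append((precision, recall, f1))
    return oa, kappa, per_class


# Made-up 3-class confusion matrix, for illustration only.
cm = [[50, 2, 3],
      [4, 30, 1],
      [2, 3, 25]]
oa, kappa, per_class = classification_metrics(cm)
macro_f1 = sum(f1 for _, _, f1 in per_class) / len(per_class)
print(round(oa, 4), round(kappa, 4), round(macro_f1, 4))
```

The macro-averaged F₁ at the end is the unweighted mean of the per-class F₁ values, which gives minority crops the same weight as the dominant class.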


Sample type	Number of parcel samples
Maize	2 198
Peanut	100
Soybean	299
Peach	113
Apple	150
Sweet potato	77
Cauliflower	135

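Category	Calculation method
Spectral features	Per-parcel statistics (mean, standard deviation, etc.) of each band; vegetation indices computed from band values (NDVI, SAVI, GNDVI, NDWI, GBNDVI, VARI, EVI, TVI, ARVI, VDVI, RDVI)
Radar features	Mean and standard deviation of VV and VH polarization within the parcel
Texture features	GLCM contrast, homogeneity, correlation, entropy, and second moment computed within the parcel, with the mean taken as the overall representation
Productivity features	Monthly mean of GPP values within the parcel, reflecting the level of crop biomass accumulation
Temporal features	Harmonic decomposition of the parcel time-series curve to extract harmonic features such as fundamental frequency, amplitude, and phase; the parcel's multi-temporal curve is also fed into an LSTM, which outputs learned deep temporal features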
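
The harmonic part of the temporal features can be sketched as follows. This is a minimal illustration, not the study's implementation: it fits only the first harmonic, assumes an evenly spaced series that spans exactly one period (so the fundamental frequency is fixed rather than estimated), and uses a hypothetical monthly NDVI curve; the function name `first_harmonic` is likewise an assumption.

```python
import math


def first_harmonic(series):
    """Fit y(t) = a0 + a1*cos(w*t) + b1*sin(w*t) to an evenly spaced series
    spanning one full period (w = 2*pi/n). For such sampling (n >= 3) the
    least-squares coefficients reduce to discrete Fourier projections."""
    n = len(series)
    w = 2 * math.pi / n
    a0 = sum(series) / n
    a1 = 2 / n * sum(y * math.cos(w * t) for t, y in enumerate(series))
    b1 = 2 / n * sum(y * math.sin(w * t) for t, y in enumerate(series))
    amplitude = math.hypot(a1, b1)
    # Phase in radians; the fitted peak lies at t = phase / w, modulo n.
    phase = math.atan2(b1, a1)
    return {"mean": a0, "amplitude": amplitude, "phase": phase}


# Hypothetical parcel-mean NDVI, one value per month, peaking at index 7.
ndvi = [0.30 + 0.25 * math.cos(2 * math.pi * (m - 7) / 12) for m in range(12)]
features = first_harmonic(ndvi)
peak_index = (features["phase"] / (2 * math.pi / 12)) % 12
print(features, peak_index)
```

Irregularly spaced acquisitions, which are common for cloud-affected optical imagery, would instead require an explicit least-squares fit on the actual observation dates.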

Model	Optimal parameter settings
RF	n_estimators=325, max_depth=15, min_samples_split=3, min_samples_leaf=1, random_state=42
XGB	n_estimators=254, max_depth=9, learning_rate=0.186, gamma=0.013, min_child_weight=3, subsample=0.95, colsample_bytree=0.645, random_state=10
AdaBoost	estimator=DecisionTreeClassifier(max_depth=9), n_estimators=417, learning_rate=0.0565, algorithm='SAMME', random_state=50
LGBM	n_estimators=491, learning_rate=0.0604, num_leaves=68, max_bin=111, random_state=42
GB	n_estimators=252, learning_rate=0.193, max_depth=9, min_samples_split=5, min_samples_leaf=2, subsample=0.8, random_state=42
CatBoost	iterations=963, depth=7, learning_rate=0.226, l2_leaf_reg=1.56, random_seed=42, verbose=False
SVM	kernel='rbf', C=8.5, gamma=0.064, probability=True, random_state=42
BP	hidden_layer_sizes=(294, 251), activation='relu', solver='adam', alpha=0.00042, max_iter=1000, learning_rate='adaptive', learning_rate_init=0.036, random_state=42
KNN	n_neighbors=6, weights='distance'

Model	Mean F₁-macro	Standard deviation
RF	0.890 5	0.018 7
XGB	0.896 6	0.017 3
SVM	0.141 5	0.033 4
GB	0.824 3	0.046 1
KNN	0.556 5	0.036 0
LightGBM	0.906 9	0.009 3
CatBoost	0.918 3	0.006 1
BP	0.103 9	0.000 1
ADA	0.866 2	0.029 3

Meta-model	Optimal parameter settings
LGBMClassifier	n_estimators=50, learning_rate=0.05, random_state=42
XGBClassifier	n_estimators=50, learning_rate=0.05, num_leaves=68, max_bin=111, random_state=42
MLPClassifier	hidden_layer_sizes=(100,), activation='relu', solver='adam', learning_rate='adaptive', learning_rate_init=0.036, max_iter=500, random_state=42
LogisticRegression	solver='lbfgs', max_iter=1000, C=1.0, random_state=42
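
The Stacking setup, with base learners feeding a meta-model and stratified cross-validation used for evaluation, might be sketched as follows with scikit-learn's `StackingClassifier`. This is an illustrative reduction under stated assumptions rather than the study's configuration: the data are synthetic, only three scikit-learn base learners are used instead of the nine in the study, and `LogisticRegression` (one of the four compared meta-models) serves as the final estimator so that no additional libraries are needed. Fold counts and tree counts are arbitrary.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import (GradientBoostingClassifier,
                              RandomForestClassifier, StackingClassifier)
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import StratifiedKFold, cross_val_score
from sklearn.neighbors import KNeighborsClassifier

# Synthetic, imbalanced stand-in for the selected parcel features.
X, y = make_classification(
    n_samples=400, n_features=20, n_informative=8, n_classes=3,
    weights=[0.7, 0.2, 0.1], random_state=42,
)

# A reduced set of base learners (the study uses nine).
base_learners = [
    ("rf", RandomForestClassifier(n_estimators=50, random_state=42)),
    ("gb", GradientBoostingClassifier(n_estimators=50, random_state=42)),
    ("knn", KNeighborsClassifier(n_neighbors=6, weights="distance")),
]

# The meta-model is trained on out-of-fold class probabilities
# produced by the base learners.
stack = StackingClassifier(
    estimators=base_learners,
    final_estimator=LogisticRegression(solver="lbfgs", max_iter=1000, C=1.0),
    stack_method="predict_proba",
    cv=StratifiedKFold(n_splits=5, shuffle=True, random_state=42),
)

# Stratified folds keep class proportions similar in every split.
outer_cv = StratifiedKFold(n_splits=3, shuffle=True, random_state=0)
scores = cross_val_score(stack, X, y, cv=outer_cv, scoring="f1_macro")
print(scores.mean(), scores.std())
```

Swapping the `final_estimator` while keeping `base_learners` fixed reproduces the kind of meta-model comparison the study describes, since every candidate then sees the same base model outputs.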