Welcome to Smart Agriculture 中文

Smart Agriculture

   

Forecasting Method for China's Soybean Demand Based on Improved Temporal Fusion Transformers

LIU Jiajia1,3, QIN Xiaojing2(), LI Qianchuan2(), XU Shiwei1,4, ZHAO Jichun2, WANG Yigang2, XIONG Lu1,4, LIANG Xiaohe1,3   

  1. 1. Institute of Agricultural Information, Chinese Academy of Agricultural Sciences, Beijing 100081, China
    2. Institute of Data Science and Agricultural Economics, Beijing Academy of Agricultural and Forestry Sciences, Beijing 100097, China
    3. Key Laboratory of Agricultural Blockchain Application, Ministry of Agriculture and Rural Affairs, Beijing 100081, China
    4. Key Laboratory of Agricultural Monitoring and Early Warning Technology, Ministry of Agriculture and Rural Affairs, Beijing 100081, China
  • Received:2025-05-21 Online:2025-07-28
  • Foundation items:Youth Fund of Beijing Academy of Agricultural and Forestry Sciences(QNJJ202328); National Key Research and Development Program Project(2022YFD1600603); Fundamental Research Funds for Central Public Welfare Research Institutes(Y2025ZZ08); Beijing Social Science Fund(23JYB012)
  • About author:

    LIU Jiajia, E-mail:

  • corresponding author:
    QIN Xiaojing, E-mail:
    LI Qianchuan, E-mail:

Abstract:

[Objective] Accurate prediction of soybean demand is of profound practical significance for safeguarding national food security, optimizing industrial decision-making, and responding to fluctuations in international trade. Traditional soybean demand forecasting methods are plagued by inadequacies such as limited capacity to excavate data dimensionality and multivariate interactive features, insufficient ability to capture nonlinear relationships under the coupling of multi-dimensional dynamic factors, and challenges in model interpretability and domain adaptability. These limitations render them incapable of effectively supporting accurate prediction and interpretable analysis of China's soybean demand. When the temporal fusion transformers (TFT) model is applied to forecast China's soybean demand, it exhibits certain constraints in aspects like feature interaction layers and attention weight allocation. Consequently, there is an urgent need to explore a forecasting method based on the improved TFT model to enhance the accuracy and interpretability of soybean demand prediction. [Methods] Drawing on relevant domestic and international studies, this research applied the deep learning-based TFT model to China's soybean demand forecasting and proposed the MA-TFT (improved TFT model based on MDFI and AAWO) model, which was enhanced through multi-layer dynamic feature interaction (MDFI) and adaptive attention weight optimization (AAWO). Firstly, the study collected and collated a dataset for analyzing China's soybean demand, covering seven dimensions: consumption, production, trade, inventory, market, economy and policy, and international factors. This dataset, encompassing 4 652 relevant indicators spanning from 1980 to 2024, was subjected to data cleaning, transformation, augmentation, and feature engineering. The training, validation, and test sets for the model were constructed using the rolling window method. Secondly, based on the architecture of the TFT model for China's soybean demand forecasting, a multi-layer dynamic feature interaction module and an adaptive attention weight optimization strategy were designed. Additionally, the model's loss function, training strategy, and Bayesian hyperparameter tuning method were formulated, and the model performance evaluation metrics were determined. Subsequently, experiments were designed to compare the prediction performance of the MA-TFT model with that of the autoregressive integrated moving average model (ARIMA), long short-term memory (LSTM) model, and the original TFT model. Ablation experiments on the MDFI and AAWO modules were conducted separately. The SHapley Additive exPlanations (SHAP) tool was employed for interpretability analysis to identify key feature variables influencing China's soybean demand and their interaction relationships. Error analysis was performed between the predicted and actual values of China's historical soybean demand, and a comparative analysis of the predicted soybean demand in China from 2025 to 2034 was carried out. [Results and Discussions] The mean squared error (MSE) and mean absolute percentage error (MAPE) of the MA-TFT model were 0.036 and 5.89%, respectively, with a coefficient of determination R² of 0.91, all of which outperformed those of the comparative models, namely ARIMA (1,1,1), LSTM, and TFT. Compared with the benchmark TFT model, the root mean square error (RMSE) and MAPE of the MA-TFT model decreased cumulatively by 21.84% and 3.44%, respectively. These results indicated that the MA-TFT model, as an improved version of TFT, could capture complex relationships between features and enhance prediction performance and accuracy. Interpretability analysis using the SHAP tool revealed that the MA-TFT model exhibited high stability in explaining key feature variables affecting China's soybean demand. It was projected that China's soybean demand would reach 117.99 million tons, 110.33 million tons, and 113.78 million tons in 2025, 2030, and 2034, respectively. [Conclusions] The MA-TFT model, developed by improving the TFT model, provides an innovative solution to address the practical issues of insufficient accuracy and poor interpretability in existing soybean demand forecasting methods. It also offers valuable references for method optimization and application in time series forecasting of other bulk agricultural products.

Key words: Temporal Fusion Transformers (TFT), soybean demand forecast, multi-layer dynamic feature interaction, adaptive attention weight optimization, interpretability analysis

CLC Number: