Welcome to Smart Agriculture 中文

Smart Agriculture ›› 2025, Vol. 7 ›› Issue (1): 97-110.doi: 10.12133/j.smartag.SA202410032

• Topic--Intelligent Agricultural Knowledge Services and Smart Unmanned Farms (Part 2) • Previous Articles     Next Articles

Tomato Growth Height Prediction Method by Phenotypic Feature Extraction Using Multi-modal Data

GONG Yu1,2, WANG Ling1,2, ZHAO Rongqiang1,2,4(), YOU Haibo3, ZHOU Mo3, LIU Jie1,2,4   

  1. 1. Department of Computer Science and Technology, Harbin Institute of Technology, Harbin 150006, China
    2. National Key Laboratory of Smart Farming Technology and Systems, Harbin 150080, China
    3. Horticultural Branch, Heilongjiang Academy of Agricultural Sciences, Harbin 150040, China
    4. Harbin Institute of Technology Research Institute for Artificial Intelligence Inc. , Harbin 150000, China
  • Received:2024-10-20 Online:2025-01-30
  • Foundation items:
    The Fundamental Research Funds for the Central Universities in China(2023FRFK06013); The Key Research and Development Program of Heilongjiang Province(2023ZX01A24); Harbin Institute of Technology Horizontal Project(MH20240081)
  • About author:

    GONG Yu, E-mail:

  • corresponding author:
    ZHAO Rongqiang, E-mail:

Abstract:

[Objective] Accurate prediction of tomato growth height is crucial for optimizing production environments in smart farming. However, current prediction methods predominantly rely on empirical, mechanistic, or learning-based models that utilize either images data or environmental data. These methods fail to fully leverage multi-modal data to capture the diverse aspects of plant growth comprehensively. [Methods] To address this limitation, a two-stage phenotypic feature extraction (PFE) model based on deep learning algorithm of recurrent neural network (RNN) and long short-term memory (LSTM) was developed. The model integrated environment and plant information to provide a holistic understanding of the growth process, emploied phenotypic and temporal feature extractors to comprehensively capture both types of features, enabled a deeper understanding of the interaction between tomato plants and their environment, ultimately leading to highly accurate predictions of growth height. [Results and Discussions] The experimental results showed the model's effectiveness: When predicting the next two days based on the past five days, the PFE-based RNN and LSTM models achieved mean absolute percentage error (MAPE) of 0.81% and 0.40%, respectively, which were significantly lower than the 8.00% MAPE of the large language model (LLM) and 6.72% MAPE of the Transformer-based model. In longer-term predictions, the 10-day prediction for 4 days ahead and the 30-day prediction for 12 days ahead, the PFE-RNN model continued to outperform the other two baseline models, with MAPE of 2.66% and 14.05%, respectively. [Conclusions] The proposed method, which leverages phenotypic-temporal collaboration, shows great potential for intelligent, data-driven management of tomato cultivation, making it a promising approach for enhancing the efficiency and precision of smart tomato planting management.

Key words: tomato growth prediction, deep learning, phenotypic feature extraction, multi-modal data, recurrent neural network, long short-term memory, large language model

CLC Number: