欢迎您访问《智慧农业(中英文)》官方网站! English

Smart Agriculture ›› 2026, Vol. 8 ›› Issue (1): 203-212.doi: 10.12133/j.smartag.SA202505037

• 智能装备与系统 • 上一篇    

基于时序多源信息融合的果蔬新鲜度在线检测系统

黄先果(), 朱启兵(), 黄敏   

  1. 江南大学 物联网工程学院,江苏 无锡 214122,中国
  • 收稿日期:2025-05-30 出版日期:2026-01-30
  • 基金项目:
    国家重点研发计划项目(2022YFD2100601)
  • 作者简介:

    黄先果,硕士研究生,研究方向为传感器与检测技术、智能检测。E-mail:

  • 通信作者:
    朱启兵,博士,教授,研究方向为信息感知与智能处理、物联网系统集成。E-mail:

Online Detection System for Freshness of Fruits and Vegetables Based on Temporal Multi-source Information Fusion

HUANG Xianguo(), ZHU Qibing(), HUANG Min   

  1. School of Internet of Things Engineering, Jiangnan University, Wuxi 214122, China
  • Received:2025-05-30 Online:2026-01-30
  • Foundation items:National Key R&D Program of China(2022YFD2100601)
  • About author:

    HUANG Xianguo, E-mail:

  • Corresponding author:
    ZHU Qibing, E-mail:

摘要:

[目的/意义] 为实现果蔬在储运过程中的实时、精准品质监控,提出并实现了一种基于时序多源信息融合的在线检测系统。该系统旨在克服传统检测方法存在的离线性、单一模态及无法捕捉动态演变等缺陷,为冷链供应链的精细化管理与早期腐败预警提供新的技术途径。 [方法] 该系统通过自主设计的便携式采集节点,同步采集时序的环境传感数据与视觉指示标签图像。为深度挖掘两种异构时序数据间的复杂关联,本研究提出了一种新颖的协同注意力卷积循环网络(Co-Attention-Based Convolutional Recurrent Network, Co-ACRN)深度学习模型。该模型独创性地采用“协同注意力+自注意力”双重机制,前端通过协同注意力对时序多模态数据进行智能对齐,后端利用自注意力对时序信息进行全局上下文复盘。最终将模型高效部署于Qt端,实现了对果蔬新鲜度的边缘侧在线检测。 [结果和讨论] Co-ACRN模型在芒果新鲜度检测任务中准确率达到了96.93%,显著优于多种先进的时序多模态融合模型;消融实验进一步证实了研究提出的“时序信息+多模态信息”输入的必要性,以及“双重注意力”架构的优越性,其中模型的各类别召回率最高可达99.16%。部署于客户端后,系统单次诊断耗时小于2 s,验证了该方案的高精度与实时性。 [结论] 开发的系统为分布式果蔬品质的在线、智能监控提供了一套兼具创新性与实用性的完整解决方案。

关键词: 果蔬, 冷链物流, 时序多模态融合, 双重注意力机制, 在线无损检测

Abstract:

[Objective] Real-time and accurate quality monitoring of fruits and vegetables during cold chain logistics is of great importance for ensuring supply chain quality and reducing economic losses. However, traditional detection methods generally suffer from several core deficiencies, such as being offline, relying on unimodal information, and being unable to capture dynamic evolution. To overcome these challenges, an online freshness detection system is proposed and implemented for fruits and vegetables based on temporal multi-source information fusion. The system was designed to achieve precise online detection of fruit and vegetable freshness, providing an effective technical solution for the refined management and early spoilage warning within the cold chain supply chain, thereby significantly reducing economic losses. [Methods] A complete system was constructed, consisting of a lower-computer data acquisition node, an IoT cloud platform, and an upper-computer Qt client. The lower-computer synchronously collected environmental temporal sensing data (temperature, humidity, CO2, ethylene) and visual temporal images of indicator tags via a self-designed portable acquisition node. A novel co-attention-based convolutional recurrent network (Co-ACRN) deep learning model was proposed for deeply mining the complex correlations between the two heterogeneous time-series data streams. This model innovatively employed a "co-attention + self-attention" dual mechanism. Firstly, in the early fusion stage, a co-attention module intelligently aligned and deeply integrated visual and sensor feature sequences by constructing a cross-modal affinity matrix. Subsequently, the fused sequence was fed into a long short-term memory (LSTM) network to encode temporal cumulative effects. Finally, a self-attention module performed a global contextual review on the LSTM output to capture long-range temporal dependencies. In the specific implementation, visual features were extracted by a lightweight convolutional neural network (CNN) with two convolutional-pooling layers; the co-attention calculated weights by generating context-aware intermediate features; and the self-attention adopted the standard scaled dot-product attention mechanism. For application deployment, the model was efficiently deployed to the Qt client in the open neural network exchange (ONNX) format, achieving real-time, edge-side inference. [Results and Discussions] Experimental results showed that the proposed Co-ACRN model achieved an overall accuracy of 96.93% on the test set in the three-class mango freshness detection task, with its performance significantly surpassing that of various mainstream baselines and advanced temporal multimodal fusion models, such as modality-invariant and specific-representations for multimodal sentiment analysis (MISA), recurrent attended variation embedding network (RAVEN), multimodal transformer (MulT), and heterogeneous hierarchical message passing network (HHMPN). To verify the rationale of the model design, two sets of ablation experiments were conducted. The input-based ablation study decisively proved that the combination of "time-series information + multimodal information" is a necessary prerequisite for accurate detection, as any model relying on unimodal or static information exhibited significant performance bottlenecks. The architecture-based ablation study further confirmed the superiority of the proposed "dual-attention" system; compared to a backbone network without any attention mechanism, its accuracy was improved by more than five percentage points, and the recall rate for the critical "spoiled" category was as high as 99.16%. An in-depth analysis of the confusion matrix revealed that the vast majority of the model's errors occurred between adjacent categories with the most similar physical states, with no serious cross-category misclassifications, demonstrating its strong robustness. After being deployed on the client side, the system's single diagnosis time was less than 2 s, verifying the solution's combination of high accuracy and real-time performance. [Conclusions] The developed online detection system and Co-ACRN model successfully enabled the real-time, accurate, and non-destructive intelligent detection of fruit and vegetable freshness. The research findings indicate that by combining advanced co-attention and self-attention mechanisms, the fusion challenges of complex multimodal temporal data can be effectively solved. In summary, this study provides a complete solution that combines theoretical innovation with engineering practicality for the online and intelligent detection of distributed fruit and vegetable freshness, and paves new paths for the development of this field in both theory and practice.

Key words: fruits and vegetables, cold chain logistics, temporal multimodal fusion, dual attention mechanism, online non-destructive testing

中图分类号: