欢迎您访问《智慧农业(中英文)》官方网站! English

Smart Agriculture

• •    

知识图谱驱动下粮食生产大数据应用现状与展望

杨晨雪(), 李娴, 周清波()   

  1. 中国农业科学院农业信息研究所,北京 100081,中国
  • 收稿日期:2025-01-01 出版日期:2025-04-25
  • 基金项目:
    The National Key Research and Development Program of China(2023YFD2000102)
  • 作者简介:

    杨晨雪,博士,助理研究员,研究方向为基于视觉/多媒体分析的农业智能计算理论及应用研究。E-mail:

  • 通信作者:
    周清波,博士,研究员,研究方向为农业遥感和农业信息技术。E-mail:

Knowledge Graph Driven Grain Big Data Applications:Overview and Perspective

YANG Chenxue(), LI Xian, ZHOU Qingbo()   

  1. Agricultural Information Institute, China Academy of Agricultural Sciences, Beijing 100081, China

摘要:

【目的/意义】 中国粮食生产全过程全要素大数据分散无序且结构复杂,服务粮食生产决策核心算法缺乏整合利用,导致数据的潜力未能得到充分发挥。知识图谱技术可整合多源异构粮食生产数据,提升数据关联性与语义挖掘效率,实现知识结构化表达与智能推理,并为粮食生产的可持续发展提供智能分析与信息支持。【进展】本文综合分析了粮食生产大数据复杂“结构-关系-语义”的知识表示与关联解析方法,梳理总结了一套基于数据驱动与知识引导的知识图谱构建与知识推理框架,综合分析了粮食生产本体构建、多模态命名实体识别、多模态实体链接、时序推理等关键技术,构造产前调度规划、产中精准决策和产后定量评估等全过程多场景的智能化应用。【结论/展望】面向粮食生产大数据应用的知识图谱技术可以在全国、省域、县域和规模化农场等多个应用尺度范围内,为粮食生产各个阶段提供可视化和智能化的决策支持,对实现“藏粮于地、藏粮于技”战略及保障国家粮食安全具有重大科学和应用价值。

关键词: 粮食生产大数据, 知识表示, 多模态知识图谱, 命名实体识别, 实体链接, 时序推理

Abstract:

[Significance] Grain production in China spans multiple stages and involves numerous heterogeneous factors, including agronomic inputs, natural resources, environmental conditions, and socio-economic variables. However, the associated data generated throughout the entire production process—ranging from cultivation planning to harvest evaluation—remains highly fragmented, unstructured, and semantically diverse. This complexity data, combined with the lack of integrated core algorithms to support decision-making, has severely limited the potential of big data to drive innovation in grain production. Knowledge graph (KG) technology, by offering structured, semantically-rich representations of complex data, provides a promising approach to address these challenges. KGs enable the integration of multi-source and heterogeneous data, enhance semantic mining and reasoning capabilities, and offer intelligent, knowledge-driven support for sustainable grain production. [Progress] This paper systematically reviewed the current state of research and application of knowledge graphs in the domain of grain production big data. A comprehensive KG-driven framework was proposed based on a hybrid paradigm combining data-driven modeling and domain knowledge guidance. The framework was designed to support the entire grain production lifecycle and addressed three primary dimensions of data complexity: structural diversity, relational heterogeneity, and semantic ambiguity. The key techniques of constructing multimodal knowledge map and temporal reasoning for grain production were described. First, an agricultural ontology system for grain production was designed, incorporating domain-specific concepts, hierarchical relationships, and attribute constraints. This ontology provided the semantic foundation for knowledge modeling and alignment. Second, multimodal named entity recognition (NER) techniques were employed to extract entities such as crops, varieties, weather conditions, operations, and equipment from structured and unstructured data sources, including satellite imagery, agronomic reports, IoT sensor data, and historical statistics. Advanced deep learning models, such as BERT and vision-language transformers, were used to enhance recognition accuracy across text and image modalities. Third, the system implemented multimodal entity linking and disambiguation, which connected identical or semantically similar entities across different data sources by leveraging graph embeddings, semantic similarity measures, and rule-based matching. Finally, temporal reasoning modules were constructed using temporal KGs and logical rules to support dynamic inference over time-sensitive knowledge, such as crop growth stages, climate variations, and policy interventions. The proposed KG-driven system enabled the development of intelligent applications across multiple stages of grain production. In the pre-production stage, knowledge graphs supported decision-making in resource allocation, crop variety selection, and planting schedule optimization based on past data patterns and predictive inference. During the in-production stage, the system facilitated precision operations—such as real-time fertilization and irrigation—by reasoning over current field status, real-time sensor inputs, and historical trends. In the post-production stage, it enabled yield assessment and economic evaluation through integration of production outcomes, environmental factors, and policy constraints. Conclusions and Prospects Knowledge graph technologies offer a scalable and semantically-enhanced approach for unlocking the full potential of grain production big data. By integrating heterogeneous data sources, representing domain knowledge explicitly, and supporting intelligent reasoning, KGs can provide visualization, explainability, and decision support across various spatial scales, including national, provincial, county-level, and large-scale farm contexts. These technologies are of great scientific and practical significance in supporting China's national food security strategy and advancing the goals of storing grain in the land and storing grain in technology. Future directions include the construction of cross-domain agricultural knowledge fusion systems, dynamic ontology evolution mechanisms, and federated KG platforms for multi-region data collaboration under data privacy constraints.

Key words: grain production big data, knowledge representation, multimodal knowledge graph, named entity recognition, entity linking, temporal reasoning

中图分类号: