Knowledge Graph Driven Grain Big Data Applications: Overview and Perspective
YANG Chenxue, E-mail: yangchenxue@caas.cn |
Received date: 2025-01-01
Online published: 2025-04-25
Supported by
The National Key Research and Development Program of China(2023YFD2000102)
Copyright
[Significance] Grain production spans multiple stages and involves numerous heterogeneous factors, including agronomic inputs, natural resources, environmental conditions, and socio-economic variables. However, the associated data generated throughout the entire production process, ranging from cultivation planning to harvest evaluation, remains highly fragmented, unstructured, and semantically diverse. This complexity data, combined with the lack of integrated core algorithms to support decision-making, has severely limited the potential of big data to drive innovation in grain production. Knowledge graph technology, by offering structured and semantically-rich representations of complex data, enables the integration of multi-source and heterogeneous data, enhances semantic mining and reasoning capabilities, and provides intelligent, knowledge-driven support for sustainable grain production, thereby addressing these challenges effectively. [Progress] This paper systematically reviewed the current research and application progress of knowledge graphs in the grain production big data. A comprehensive knowledge graph driven framework was proposed based on a hybrid paradigm combining data-driven modeling and domain knowledge guidance to support the entire grain production lifecycle and addressed three primary dimensions of data complexity: Structural diversity, relational heterogeneity, and semantic ambiguity. The key techniques of constructing multimodal knowledge map and temporal reasoning for grain production were described. First, an agricultural ontology system for grain production was designed, incorporating domain-specific concepts, hierarchical relationships, and attribute constraints. This ontology provided the semantic foundation for knowledge modeling and alignment. Second, multimodal named entity recognition (NER) techniques were employed to extract entities such as crops, varieties, weather conditions, operations, and equipment from structured and unstructured data sources, including satellite imagery, agronomic reports, Internet of Things sensor data, and historical statistics. Advanced deep learning models, such as bidirectional encoder representations from transformers (BERT) and vision-language transformers, were used to enhance recognition accuracy across text and image modalities. Third, the system implemented multimodal entity linking and disambiguation, which connected identical or semantically similar entities across different data sources by leveraging graph embeddings, semantic similarity measures, and rule-based matching. Finally, temporal reasoning modules were constructed using temporal knowledge graphs and logical rules to support dynamic inference over time-sensitive knowledge, such as crop growth stages, climate variations, and policy interventions. The proposed knowledge graph driven system enabled the development of intelligent applications across multiple stages of grain production. In the pre-production stage, knowledge graphs supported decision-making in resource allocation, crop variety selection, and planting schedule optimization based on past data patterns and predictive inference. During the in-production stage, the system facilitated precision operations, such as real-time fertilization and irrigation by reasoning over current field status, real-time sensor inputs, and historical trends. In the post-production stage, it enabled yield assessment and economic evaluation through integration of production outcomes, environmental factors, and policy constraints. [Conclusions and Prospects] Knowledge graph technologies offer a scalable and semantically-enhanced approach for unlocking the full potential of grain production big data. By integrating heterogeneous data sources, representing domain knowledge explicitly, and supporting intelligent reasoning, knowledge graphs can provide visualization, explainability, and decision support across various spatial scales, including national, provincial, county-level, and large-scale farm contexts. These technologies are of great scientific and practical significance in supporting China's national food security strategy and advancing the goals of storing grain in the land and storing grain in technology. Future directions include the construction of cross-domain agricultural knowledge fusion systems, dynamic ontology evolution mechanisms, and federated knowledge graph platforms for multi-region data collaboration under data privacy constraints.
YANG Chenxue , LI Xian , ZHOU Qingbo . Knowledge Graph Driven Grain Big Data Applications: Overview and Perspective[J]. Smart Agriculture, 2025 , 7(2) : 26 -40 . DOI: 10.12133/j.smartag.SA202501004
本研究不存在研究者以及与公开研究成果有关的利益冲突。
1 |
周国民. 我国农业大数据应用进展综述[J]. 农业大数据学报, 2019, 1(1): 16-23.
|
2 |
叶思菁, 宋长青, 程昌秀, 等. 中国耕地资源利用的“五化”态势与治理对策[J]. 中国科学院院刊, 2023, 38(12): 1962-1976.
|
3 |
|
4 |
|
5 |
赵瑞雪, 杨晨雪, 郑建华, 等. 农业智能知识服务研究现状及展望[J]. 智慧农业(中英文), 2022, 4(4): 105-125.
|
6 |
姜侯, 杨雅萍, 孙九林. 农业大数据研究与应用[J]. 农业大数据学报, 2019, 1(1): 5-15.
|
7 |
张玉成, 张晓博, 高树琴, 等. “伏羲农场”: 智慧农业技术集成创新的实践探索与思考[J]. 中国科学院院刊, 2025, 40(2): 301-309.
|
8 |
周济,
|
9 |
陈学庚, 温浩军, 张伟荣, 等. 农业机械与信息技术融合发展现状与方向[J]. 智慧农业(中英文), 2020, 2(4): 1-16.
|
10 |
李振洪, 朱武, 余琛, 等. 影像大地测量学发展现状与趋势[J]. 测绘学报, 2023, 52(11): 1805-1834.
|
11 |
唐闻涛, 胡泽林. 农业知识图谱研究综述[J]. 计算机工程与应用, 2024, 60(2): 63-76.
|
12 |
|
13 |
|
14 |
|
15 |
|
16 |
|
17 |
|
18 |
|
19 |
|
20 |
|
21 |
贺纯佩, 李思经. 农业叙词表在中国的发展和农业本体论展望[J]. 农业图书情报学刊, 2003, 15(4): 16-19.
|
22 |
鲜国建. 农业科学叙词表向农业本体转化系统的研究与实现[D]. 北京: 中国农业科学院, 2008.
|
23 |
|
24 |
|
25 |
郑颖, 金松林, 张自阳, 等. 基于本体的小麦病虫害问答系统构建与实现[J]. 河南农业科学, 2016, 45(6): 143-146.
|
26 |
李悦, 孙坦, 鲜国建, 等. 面向多源数据深度融合的农作物病虫害本体构建研究[J]. 数字图书馆论坛, 2021(2): 2-10.
|
27 |
王川, 刘尚旺, 杨彧昕, 等. 小麦草害本体知识库构建研究[J]. 河南师范大学学报(自然科学版), 2014, 42(6): 138-142.
|
28 |
曹丽英, 姚玉霞, 于合龙, 等. 基于模糊本体的玉米病害诊断模型的构建[J]. 华南农业大学学报, 2014, 35(2): 101-104.
|
29 |
卜伟琼, 方逵, 张晓玲, 等. 基于本体的柑橘病虫害知识模型构建[J]. 江苏农业科学, 2013, 41(10): 363-366.
|
30 |
姜大庆, 蔡银杰. 基于本体的蔬菜病虫害知识库构建[J]. 江苏农业科学, 2012, 40(7): 368-370.
|
31 |
|
32 |
|
33 |
|
34 |
|
35 |
|
36 |
|
37 |
杨飘, 董文永. 基于BERT嵌入的中文命名实体识别方法[J]. 计算机工程, 2020, 46(4): 40-45, 52.
|
38 |
|
39 |
|
40 |
琚生根, 李天宁, 孙界平. 基于关联记忆网络的中文细粒度命名实体识别[J]. 软件学报, 2021, 32(8): 2545-2556.
|
41 |
|
42 |
李林, 周晗, 郭旭超, 等. 基于多源信息融合的中文农作物病虫害命名实体识别[J]. 农业机械学报, 2021, 52(12): 253-263.
|
43 |
赵鹏飞, 赵春江, 吴华瑞, 等. 基于注意力机制的农业文本命名实体识别[J]. 农业机械学报, 2021, 52(1): 185-192.
|
44 |
|
45 |
|
46 |
沈利言, 姜海燕, 胡滨, 等. 水稻病虫草害与药剂实体关系联合抽取算法[J]. 南京农业大学学报, 2020, 43(6):1151-1161.
|
47 |
|
48 |
|
49 |
|
50 |
|
51 |
|
52 |
计洁, 金洲, 王儒敬, 等. 基于递进式卷积网络的农业命名实体识别方法[J]. 智慧农业(中英文), 2023, 5(1): 122-131.
|
53 |
|
54 |
|
55 |
|
56 |
|
57 |
|
58 |
|
59 |
|
60 |
|
61 |
|
62 |
|
63 |
|
64 |
|
65 |
|
66 |
|
67 |
|
68 |
|
69 |
|
70 |
|
71 |
|
72 |
|
73 |
|
74 |
|
75 |
|
76 |
|
77 |
|
78 |
|
79 |
|
80 |
|
81 |
|
82 |
|
83 |
|
84 |
|
85 |
|
86 |
|
87 |
|
88 |
|
89 |
|
90 |
|
91 |
|
92 |
|
93 |
|
94 |
|
95 |
|
96 |
|
97 |
|
98 |
|
99 |
|
100 |
|
101 |
|
102 |
|
103 |
|
104 |
|
105 |
|
106 |
|
107 |
|
108 |
|
/
〈 |
|
〉 |