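Intelligent Question-Answering System for Fruit and Vegetable Agricultural Technology Knowledge Based on Large AI Model Technology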

doi:10.12133/j.smartag.SA202311005

Smart Agriculture, 2023, Vol. 5, Issue (4): 105-116. doi: 10.12133/j.smartag.SA202311005

Special Topic: Artificial Intelligence and Robot Technology for Smart Agriculture

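WANG Ting¹,², WANG Na³, CUI Yunpeng¹,², LIU Juan¹,²

1. Agricultural Information Institute, Chinese Academy of Agricultural Sciences, Beijing 100081, China
2. Key Laboratory of Big Agri-data, Ministry of Agriculture and Rural Affairs, Beijing 100081, China
3. Unit 96962, Beijing 102206, China

Received: 2023-11-01 Published: 2023-12-30
Funding:
Beijing Digital Agriculture Innovation Consortium Project (BAIC10-2023); Fundamental Research Funds of Chinese Academy of Agricultural Sciences (JBYW-AII-2023-31); National Key Research and Development Program of China (2022YFF0711902)
About the author:
WANG Ting, research interests: theory and application of deep learning methods, bioinformatics analysis. E-mail: wangting01@caas.cn
Corresponding author:
CUI Yunpeng, Ph.D., research fellow, research interests: agricultural big data mining and analysis, natural language processing, bioinformatics analysis. E-mail: cuiyunpeng@caas.cn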

Agricultural Technology Knowledge Intelligent Question-Answering System Based on Large Language Model

WANG Ting¹,², WANG Na³, CUI Yunpeng¹,², LIU Juan¹,²

1. Agricultural Information Institute, Chinese Academy of Agricultural Sciences, Beijing 100081, China
2. Key Laboratory of Big Agri-data, Ministry of Agriculture and Rural Affairs, Beijing 100081, China
3. Unit 96962, Beijing 102206, China

Received: 2023-11-01 Online: 2023-12-30
Foundation items: Beijing Digital Agriculture Innovation Consortium Project (BAIC10-2023); Fundamental Research Funds of Chinese Academy of Agricultural Sciences (JBYW-AII-2023-31); National Key Research and Development Program of China (2022YFF0711902)
About author:
WANG Ting, E-mail: wangting01@caas.cn
Corresponding author:
CUI Yunpeng, E-mail: cuiyunpeng@caas.cn

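Abstract

Abstract (Chinese version):

[Objective/Significance] The rural revitalization strategy places new demands on agricultural technology extension, and the way extension knowledge is supplied needs further innovation. Driven by the demand for fruit and vegetable agricultural technology knowledge services and building on state-of-the-art large language model technology, this study constructs an intelligent question-answering system for fruit and vegetable agricultural technology knowledge, targeting extension services such as new-style agricultural knowledge guidance and knowledge question answering. [Methods] Based on an analysis of strawberry growers' needs, agricultural technology knowledge on strawberry cultivation was divided into different topics, yielding two downstream tasks for the large model: knowledge object recognition and knowledge question answering. A small-sample, high-quality training corpus was built by combining automatic machine annotation with manual annotation. The performance of four existing large language models (Baichuan2-13B-Chat, ChatGLM2-6B, Llama-2-13B-Chat, and ChatGPT) was compared, and the best-performing model was chosen as the base model. Following the research approach of "high-quality corpus + pre-trained large model + fine-tuning", a deep neural network with semantic analysis, context association, and generation capabilities that can adapt to multiple downstream tasks was trained to build a large model for agricultural knowledge question answering. Several strategies, including data optimization and retrieval-augmented generation, were adopted to mitigate large-model hallucination. An intelligent question-answering system for fruit and vegetable agricultural technology knowledge was then developed; it generates high-precision, unambiguous agricultural knowledge answers and supports multi-turn question answering. [Results and Discussion] With precision and recall as the performance metrics for the named entity recognition task, the mainstream domestic models evaluated all achieved an average precision above 85% across knowledge topics after fine-tuning, while average recall varied; factors such as the number of knowledge entity types and the size of the annotated corpus affect model performance. With hallucination rate and semantic similarity as the metrics for the knowledge question-answering task, strategies such as data optimization and retrieval-augmented generation reduced the hallucination rate by 10% to 40% and effectively improved semantic similarity. [Conclusion] In named entity recognition and knowledge question-answering tasks in the agricultural domain, the pre-trained large model ChatGLM performed best. Fine-tuning for downstream tasks and model optimization based on retrieval-augmented generation (RAG) can mitigate hallucination and markedly improve performance. Large-model technology has the potential to innovate agricultural technology knowledge service models and optimize agricultural knowledge extension; it can effectively reduce the time growers spend obtaining high-quality, useful knowledge and guide more growers toward agricultural technology innovation and transformation. However, owing to problems such as unstable performance, optimization methods for large models and their application in specific scenarios require further in-depth study.

Keywords: large model, generative pre-trained transformer, agricultural technology knowledge, intelligent question answering, named entity recognition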
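
The abstract evaluates the named entity recognition task with precision and recall. As a minimal sketch of how these two metrics can be computed, the Python below compares a set of predicted entities against a gold-standard set. The `(text, type)` tuple representation, the function name `precision_recall`, and the example entities are illustrative assumptions and are not taken from the paper's evaluation code.

```python
from typing import Iterable, Set, Tuple

# Hypothetical representation: an entity is a (surface text, entity type) pair.
Entity = Tuple[str, str]


def precision_recall(predicted: Iterable[Entity], gold: Iterable[Entity]) -> Tuple[float, float]:
    """Return (precision, recall) using exact matching of (text, type) pairs."""
    pred_set: Set[Entity] = set(predicted)
    gold_set: Set[Entity] = set(gold)
    true_positives = len(pred_set & gold_set)
    # Precision: share of predicted entities that are correct.
    precision = true_positives / len(pred_set) if pred_set else 0.0
    # Recall: share of gold entities that were found.
    recall = true_positives / len(gold_set) if gold_set else 0.0
    return precision, recall


# Made-up example: three annotated entities, two predictions, one of them correct.
gold = [("gray mold", "disease"), ("spider mite", "pest"), ("Hongyan", "variety")]
pred = [("gray mold", "disease"), ("spider mite", "disease")]
p, r = precision_recall(pred, gold)
print(f"precision={p:.2f}, recall={r:.2f}")
```

Exact matching on both text and type is the strictest common convention; a looser scheme that credits partial span overlap would give different numbers.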

Abstract:

[Objective] The rural revitalization strategy places new requirements on agricultural technology extension, yet conventional extension methods suffer from a mismatch between knowledge supply and demand, so the way agricultural knowledge is supplied needs further innovation. Recent advances in artificial intelligence, such as deep learning and large-scale neural networks, and in particular the advent of large language models (LLMs), make human-like, intelligent agricultural technology extension feasible. Taking fruit and vegetable agricultural technology knowledge services as the demand orientation, this study built an LLM-based intelligent question-answering system that provides extension services, including guidance on new agricultural knowledge and knowledge question answering, so that farmers can access high-quality agricultural knowledge at their convenience.

[Methods] Through an analysis of the demands of strawberry farmers, agricultural technology knowledge on strawberry cultivation was categorized into six themes: basic production knowledge, variety screening, interplanting knowledge, pest diagnosis and control, disease diagnosis and control, and pesticide damage diagnosis and control. Considering the current situation of agricultural technology, two primary tasks were formulated: named entity recognition and question answering on agricultural knowledge. A small but high-quality training corpus of entity type annotations and question-answer pairs was constructed by combining automatic machine annotation with manual annotation. Four existing LLMs (Baichuan2-13B-Chat, ChatGLM2-6B, Llama-2-13B-Chat, and ChatGPT) were compared, and the best-performing model was chosen as the base LLM for the system. Following the approach of a high-quality corpus, a pre-trained LLM, and fine-tuning, a deep neural network with semantic analysis, context association, and content generation capabilities was trained; it serves as an LLM for named entity recognition and question answering on agricultural knowledge and is adaptable to various downstream tasks. For named entity recognition, LoRA fine-tuning was employed, which fine-tunes only essential parameters to speed up training and improve performance. For question answering, the Prompt-tuning method was used to fine-tune the LLM, with adjustments made iteratively based on the model's generated content. Performance was optimized from two perspectives: data and model design. On the data side, redundant or unclear data were manually removed from the labeled corpus. On the model side, a strategy based on retrieval-augmented generation (RAG) was employed to deepen the LLM's understanding of agricultural knowledge and keep its knowledge synchronized in real time, alleviating LLM hallucination. Based on the constructed LLM, an intelligent question-answering system for agricultural technology knowledge was developed. The system generates high-precision, unambiguous answers and supports multi-round question answering and retrieval of information sources.

[Results and Discussions] Precision and recall were used to evaluate the LLMs on the named entity recognition task. The results indicated that LLM performance was closely related to model structure, the scale of the labeled corpus, and the number of entity types. After fine-tuning, ChatGLM achieved the highest precision and recall. With the same number of entity types, a larger annotated corpus resulted in higher precision. Fine-tuning affected the models differently; overall, it improved the average precision of all models across knowledge topics, with ChatGLM, Llama, and Baichuan all surpassing 85%. Average recall increased only slightly and in some cases fell below its value before fine-tuning. For the question-answering task, evaluated by hallucination rate and semantic similarity, data optimization and RAG reduced the hallucination rate by 10% to 40% and improved semantic similarity by more than 15%. These optimizations significantly improved the correctness, logic, and comprehensiveness of the generated content.

[Conclusion] The pre-trained LLM ChatGLM exhibited the best performance in named entity recognition and question-answering tasks in the agricultural field. Fine-tuning pre-trained LLMs for downstream tasks and optimizing them with RAG mitigated LLM hallucination and markedly improved model performance. LLM technology has the potential to innovate agricultural technology knowledge service modes and optimize agricultural knowledge extension. It can effectively reduce the time cost for farmers to obtain high-quality, useful knowledge and guide more farmers toward agricultural technology innovation and transformation. However, owing to challenges such as unstable performance, further research is needed on optimization methods for LLMs and their application in specific scenarios.

Key words: LLM, generative pre-trained transformer, agricultural technology knowledge, intelligent question answering, named entity recognition
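
The abstract names retrieval-augmented generation as one of the strategies used against hallucination, without giving implementation details. The sketch below illustrates the general idea only: rank knowledge-base passages by bag-of-words cosine similarity to the question, then place the top passages in the prompt so that the model answers from cited sources. The tokenizer, the scoring function, the prompt wording, and the three sample passages are all assumptions made for illustration; the paper's actual retriever and prompts are not described in this section.

```python
import math
import re
from collections import Counter
from typing import List


def tokenize(text: str) -> List[str]:
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two bag-of-words vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def retrieve(question: str, passages: List[str], k: int = 2) -> List[str]:
    """Return the k passages most similar to the question."""
    q = Counter(tokenize(question))
    ranked = sorted(passages, key=lambda p: cosine(q, Counter(tokenize(p))), reverse=True)
    return ranked[:k]


def build_prompt(question: str, passages: List[str]) -> str:
    """Assemble a prompt that restricts the answer to the retrieved sources."""
    context = "\n".join(f"[{i + 1}] {p}" for i, p in enumerate(passages))
    return (
        "Answer using only the numbered sources below and cite them.\n"
        f"Sources:\n{context}\n"
        f"Question: {question}\nAnswer:"
    )


# Placeholder passages standing in for an agricultural knowledge base.
knowledge_base = [
    "Semi-forcing cultivation combines open-field and tunnel-protected strawberry growing.",
    "Interplanting strawberries with other crops can improve land use.",
    "Gray mold on strawberry is favored by high humidity.",
]
question = "What is semi-forcing strawberry cultivation?"
top = retrieve(question, knowledge_base, k=2)
prompt = build_prompt(question, top)
print(prompt)
```

A production system would more plausibly use dense embeddings and a vector index instead of word counts; the word-count version is used here only because it runs without external dependencies. The same cosine function also gives a crude stand-in for the semantic similarity metric mentioned in the abstract.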

Cite this article:

WANG Ting, WANG Na, CUI Yunpeng, LIU Juan. Agricultural Technology Knowledge Intelligent Question-Answering System Based on Large Language Model[J]. Smart Agriculture, 2023, 5(4): 105-116.


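Table 6

Comparison of large-model responses before and after "fine-tuning + optimization"

Question:

What are the characteristics of large-tunnel semi-forcing strawberry cultivation?

Before fine-tuning:

Large-tunnel semi-forcing strawberry cultivation is a strawberry growing technique with the following characteristics:

1) Large tunnel: the cultivation structure is relatively large, usually a high tunnel of 60~70 cm with ventilation and drainage openings at the top, which keeps the soil moist and well ventilated

2) Semi-forcing: in this cultivation structure the strawberry growth period is short, usually 20~30 d, while the fruit ripening period is longer, usually 50~60 d. The growth period and the fruit ripening period are therefore fairly close, which can meet consumer demand for fast growth and abundant fruit

3) Variety selection: semi-forcing strawberry cultivation suits varieties with a short growth period, moderate fruit size, and good flavor, such as Hongyan, Chixing, and Jufeng

4) Soil selection: semi-forcing strawberry cultivation usually requires fertile, well-drained land, together with rational fertilization and irrigation to maintain soil moisture and nutrient supply

5) Pest and disease control: semi-forcing strawberry cultivation requires strengthened pest and disease control; pesticides can be used for prevention and treatment to ensure a safe and abundant harvest

Large-tunnel semi-forcing strawberry cultivation is an efficient, economical, and convenient growing technique suitable for promotion and application in different regions.

After fine-tuning:

Large-tunnel semi-forcing cultivation is a hybrid form that combines open-field forcing cultivation with forcing cultivation under large-tunnel protection. Compared with open-field forcing cultivation, semi-forcing cultivation has the following characteristics:

1) It makes full use of high daytime temperatures, accelerates strawberry growth, shortens the time to market, and gives high economic returns

2) It saves labor, reduces the number of manual interventions, and lowers labor costs

3) During semi-forcing cultivation, temperature is more flexible and can be adjusted as conditions require, making conditions more suitable for plant growth

4) Strawberries under open-field forcing cultivation readily become "dormant" at low temperatures, whereas semi-forcing cultivation provides an "intermediate state" between open-field and protected forcing cultivation, enabling a smooth transition between "dormancy" and "wakefulness"

5) Compared with open-field forcing cultivation, large-tunnel semi-forcing cultivation is more flexible: fruit can be harvested earlier, the empty-bag rate is reduced, and fruit yield is improved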


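Table 1
Sample growers	Description	Percentage/%
Gender	Male	95.5
Gender	Female	4.5
Age	40 or younger	29.1
Age	40~50	35.3
Age	50 or older	35.6
Education	Junior high school or below	81.3
Education	Senior high school or above	18.7
Proportion of household members engaged in farming	30% or below	14.2
Proportion of household members engaged in farming	30%~90%	72.1
Proportion of household members engaged in farming	90% or above	13.7
Annual household income	50,000 or below	24.4
Annual household income	50,000~80,000	63.9
Annual household income	80,000 or above	11.7
Strawberry yield per unit area	20 t/hm² or above	49.3
Strawberry yield per unit area	5~20 t/hm²	47.2
Strawberry yield per unit area	5 t/hm² or below	3.5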

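Table 2
Technology type	Demand proportion/%	Growers' evaluation of the current technology level
		Satisfied/%	Basically satisfied/%	Needs improvement/%	Urgently needs improvement/%
Seedling cultivation technology	75.5	13.85	23.08	40.00	23.08
Water, fertilizer, and management technology	81.2	7.69	32.31	38.46	21.54
Pest and disease control technology	91.3	7.58	16.67	51.52	24.24
Storage, transport, and processing technology	85.6	13.85	29.23	47.69	9.23
High-quality production technology	72.1	10.77	21.54	41.54	26.15
Yield-increasing improved-variety technology	93.3	21.54	33.85	23.08	21.54
Cost-saving, high-efficiency cultivation technology	90.7	7.69	26.15	53.85	12.31
Labor-saving machinery technology	77.7	10.77	38.46	33.85	16.92
New variety application	69.3	25.76	37.88	16.67	19.70
New pesticide and fertilizer application technology	67.9	32.31	26.15	30.77	10.77
New farm tool application technology	73.7	29.23	30.77	26.15	13.85
Brand management technology	49.2	33.85	15.38	30.77	20.00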
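
One way to read the evaluation columns of this table is to add the "needs improvement" and "urgently needs improvement" shares for each technology type and rank the results. The sketch below does that arithmetic on the values copied from the table. Combining the two columns into a single "unmet need" share is an illustrative choice made here, not an analysis reported by the authors, and the names `EVALUATIONS` and `unmet_share` are hypothetical.

```python
# Values copied from the table: (demand %, satisfied %, basically satisfied %,
# needs improvement %, urgently needs improvement %).
EVALUATIONS = {
    "Seedling cultivation technology": (75.5, 13.85, 23.08, 40.00, 23.08),
    "Water, fertilizer, and management technology": (81.2, 7.69, 32.31, 38.46, 21.54),
    "Pest and disease control technology": (91.3, 7.58, 16.67, 51.52, 24.24),
    "Storage, transport, and processing technology": (85.6, 13.85, 29.23, 47.69, 9.23),
    "High-quality production technology": (72.1, 10.77, 21.54, 41.54, 26.15),
    "Yield-increasing improved-variety technology": (93.3, 21.54, 33.85, 23.08, 21.54),
    "Cost-saving, high-efficiency cultivation technology": (90.7, 7.69, 26.15, 53.85, 12.31),
    "Labor-saving machinery technology": (77.7, 10.77, 38.46, 33.85, 16.92),
    "New variety application": (69.3, 25.76, 37.88, 16.67, 19.70),
    "New pesticide and fertilizer application technology": (67.9, 32.31, 26.15, 30.77, 10.77),
    "New farm tool application technology": (73.7, 29.23, 30.77, 26.15, 13.85),
    "Brand management technology": (49.2, 33.85, 15.38, 30.77, 20.00),
}


def unmet_share(row):
    """Sum the 'needs improvement' and 'urgently needs improvement' shares."""
    return round(row[3] + row[4], 2)


# Rank technology types from the largest to the smallest combined share.
ranked = sorted(
    ((name, unmet_share(row)) for name, row in EVALUATIONS.items()),
    key=lambda item: item[1],
    reverse=True,
)
for name, share in ranked[:3]:
    print(f"{name}: {share:.2f}%")
```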

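Table 3
Knowledge topic	Basic production knowledge	Variety screening	Interplanting knowledge	Pest diagnosis and control	Disease diagnosis and control	Pesticide damage diagnosis and control	Total
Annotated corpus items	151	107	61	212	232	106	869
Knowledge objects	5	1	3	4	4	4	21
Knowledge entity types	9	2	4	6	6	5	32
Question-answer pairs	63	47	22	44	48	36	260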

Table 4
Knowledge topic	Baichuan		Llama		ChatGPT		ChatGLM
	B/%	A/%	B/%	A/%	B/%	A/%	B/%	A/%
Basic production knowledge	62.7	82.9	66.7	82.9	62.7	81.9	79.7	86.5
Variety screening	76.3	87.4	80.3	87.4	76.3	82.4	80.3	89.4
Interplanting knowledge	71.8	85.9	77.8	85.9	73.8	79.9	77.8	87.9
Pest diagnosis and control	72.3	85.9	78.3	88.5	74.7	80.9	78.3	91.6
Disease diagnosis and control	75.0	86.5	80.3	89.2	74.1	81.7	78.3	92.5
Pesticide damage diagnosis and control	72.2	85.2	76.2	85.2	72.2	80.2	76.2	87.2
Average	71.7	85.6	76.6	86.5	72.3	81.2	78.4	89.2
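
The "Average" row of this table can be reproduced from the per-topic rows. The sketch below recomputes each column mean and the change from the B column to the A column per model. It assumes that B and A denote values before and after fine-tuning, which matches the abstract's wording but is not stated in the table itself; the names `ROWS` and `column_means` are hypothetical.

```python
from statistics import mean

MODELS = ["Baichuan", "Llama", "ChatGPT", "ChatGLM"]

# Rows copied from the table: for each topic, one (B, A) pair per model in MODELS order.
# Assumption: B = before fine-tuning, A = after fine-tuning.
ROWS = {
    "Basic production knowledge": [(62.7, 82.9), (66.7, 82.9), (62.7, 81.9), (79.7, 86.5)],
    "Variety screening": [(76.3, 87.4), (80.3, 87.4), (76.3, 82.4), (80.3, 89.4)],
    "Interplanting knowledge": [(71.8, 85.9), (77.8, 85.9), (73.8, 79.9), (77.8, 87.9)],
    "Pest diagnosis and control": [(72.3, 85.9), (78.3, 88.5), (74.7, 80.9), (78.3, 91.6)],
    "Disease diagnosis and control": [(75.0, 86.5), (80.3, 89.2), (74.1, 81.7), (78.3, 92.5)],
    "Pesticide damage diagnosis and control": [(72.2, 85.2), (76.2, 85.2), (72.2, 80.2), (76.2, 87.2)],
}


def column_means(rows):
    """Return {model: (mean B, mean A, mean A - mean B)}, each rounded to one decimal."""
    result = {}
    for i, model in enumerate(MODELS):
        before = mean(pairs[i][0] for pairs in rows.values())
        after = mean(pairs[i][1] for pairs in rows.values())
        result[model] = (round(before, 1), round(after, 1), round(after - before, 1))
    return result


summary = column_means(ROWS)
for model, (before, after, gain) in summary.items():
    print(f"{model}: B={before}, A={after}, change={gain:+}")
```

The recomputed means agree with the table's "Average" row to one decimal place, which is a useful sanity check when transcribing tables from a scanned or scraped source.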

Table 5
Knowledge topic	Baichuan		Llama		ChatGPT		ChatGLM
	B/%	A/%	B/%	A/%	B/%	A/%	B/%	A/%
Basic production knowledge	56.1	47.5	57.3	59.5	51.8	46.7	69.7	70.4
Variety screening	70.7	69.6	70.1	72.9	70.4	62.4	70.2	72.1
Interplanting knowledge	66.6	67.5	69.2	71.5	68.9	60.1	68.0	70.6
Pest diagnosis and control	68.5	63.2	68.2	76.7	69.4	63.5	67.5	75.9
Disease diagnosis and control	67.2	65.4	68.1	77.9	68.2	65.9	69.3	76.3
Pesticide damage diagnosis and control	65.3	67.1	67.5	77.2	67.3	61.4	65.9	70.1
Average	65.7	63.4	66.7	72.6	66.0	60.0	68.4	72.6
