Agricultural Technology Knowledge Intelligent Question-Answering System Based on Large Language Model

doi:10.12133/j.smartag.SA202311005

Abstract

[Objective] The rural revitalization strategy places new demands on agricultural technology extension. Conventional extension methods, however, suffer from a mismatch between supply and demand, so the way agricultural knowledge is supplied needs further innovation. Recent advances in artificial intelligence, such as deep learning and large-scale neural networks, and in particular the advent of large language models (LLMs), make human-like, intelligent agricultural technology extension feasible. Taking agricultural technology knowledge services for fruit and vegetable production as the target demand, this research built an LLM-based intelligent question-answering system for agricultural technology. The system provides extension services, including guidance on new agricultural knowledge and question-and-answer sessions, so that farmers can access high-quality agricultural knowledge at their convenience.

[Methods] Based on an analysis of the demands of strawberry farmers, agricultural technology knowledge related to strawberry cultivation was categorized into six themes: basic production knowledge, variety screening, interplanting knowledge, pest diagnosis and control, disease diagnosis and control, and pesticide damage diagnosis and control. Considering the current state of agricultural technology, two primary tasks were formulated: named entity recognition and question answering on agricultural knowledge. A training corpus of entity type annotations and question-answer pairs was constructed by combining automatic machine annotation with manual annotation, yielding a small but high-quality sample. Four existing LLMs (Baichuan2-13B-Chat, ChatGLM2-6B, Llama 2-13B-Chat, and ChatGPT) were compared, and the best-performing model was chosen as the base LLM for the intelligent question-answering system for agricultural technology knowledge. Using the high-quality corpus together with LLM pre-training and fine-tuning, a deep neural network with semantic analysis, context association, and content generation capabilities was trained. This model served as an LLM for named entity recognition and question answering on agricultural knowledge, adaptable to various downstream tasks. For the named entity recognition task, the LoRA fine-tuning method was employed, which fine-tunes only a small number of essential parameters to speed up training and improve performance. For the question-answering task, the prompt-tuning method was used to fine-tune the LLM, with adjustments made according to the content the model generated, achieving iterative optimization. Model performance was optimized from two perspectives: data and model design. On the data side, redundant or unclear data was manually removed from the labeled corpus. On the model side, a strategy based on retrieval-augmented generation (RAG) was employed to deepen the LLM's understanding of agricultural knowledge and keep its knowledge synchronized in real time, alleviating the problem of LLM hallucination. Building on the constructed LLM, an intelligent question-answering system for agricultural technology knowledge was developed. The system can generate high-precision, unambiguous answers and also supports multi-round question answering and retrieval of information sources.
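
The retrieval-augmented strategy described in the methods can be illustrated with a minimal sketch. It is not the authors' implementation: the passage store, the file names, the bag-of-words cosine scoring, and the prompt wording are all assumptions made for illustration, and a production system would more likely use dense embeddings and a real knowledge base.

```python
import math
import re
from collections import Counter

def tokenize(text):
    # Lowercase word tokens; a deliberately simple stand-in for a real tokenizer.
    return re.findall(r"[a-z]+", text.lower())

def cosine(a, b):
    # Cosine similarity between two term-frequency Counters.
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0

def retrieve(query, passages, k=2):
    # Rank passages by similarity to the query and keep the top k.
    q = Counter(tokenize(query))
    scored = [(source, text, cosine(q, Counter(tokenize(text))))
              for source, text in passages.items()]
    scored.sort(key=lambda item: item[2], reverse=True)
    return scored[:k]

def build_prompt(query, hits):
    # Put retrieved passages, tagged with their sources, in front of the question.
    context = "\n".join(f"[{source}] {text}" for source, text, _ in hits)
    return ("Answer using only the context below and cite the source in brackets.\n"
            f"Context:\n{context}\n\nQuestion: {query}\nAnswer:")

# Hypothetical placeholder passages; real entries would hold curated agricultural knowledge.
passages = {
    "varieties.txt": "Placeholder notes on strawberry variety screening.",
    "pests.txt": "Placeholder notes on strawberry pest diagnosis and control.",
    "diseases.txt": "Placeholder notes on strawberry disease diagnosis and control.",
}
query = "How do I handle pest control for strawberry?"
prompt = build_prompt(query, retrieve(query, passages, k=2))
print(prompt)
```

The resulting prompt would then be passed to the fine-tuned LLM. Keeping the source tag next to each passage is one simple way to support the retrieval of information sources mentioned above.
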
[Results and Discussions] Accuracy rate and recall rate were used as indicators to evaluate the LLMs on the named entity recognition task. The results indicated that LLM performance was closely related to model structure, the scale of the labeled corpus, and the number of entity types. After fine-tuning, the ChatGLM model showed the highest accuracy and recall rates. For the same number of entity types, a larger annotated corpus gave a higher accuracy rate. Fine-tuning affected the models differently; overall, it improved the average accuracy of all models across the knowledge topics, with ChatGLM, Llama, and Baichuan all surpassing 85%. The average recall rate increased only slightly and in some cases was even lower than before fine-tuning. For the question-answering task, hallucination rate and semantic similarity were used as indicators. Data optimization and retrieval-augmented generation reduced the hallucination rate by 10% to 40% and improved semantic similarity by more than 15%. These optimizations significantly improved the correctness, logic, and comprehensiveness of the content the models generated.

[Conclusion] The pre-trained ChatGLM model exhibited superior performance on named entity recognition and question-answering tasks in the agricultural field. Fine-tuning pre-trained LLMs for downstream tasks and optimizing them with retrieval-augmented generation mitigated the problem of LLM hallucination and markedly improved model performance. LLM technology has the potential to innovate agricultural technology knowledge service modes and optimize agricultural knowledge extension. It can effectively reduce the time farmers spend obtaining high-quality, effective knowledge and guide more farmers towards agricultural technology innovation and transformation. However, because of challenges such as unstable performance, further research is needed on optimization methods for LLMs and on their application in specific scenarios.
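
The entity-level evaluation mentioned in the results can be sketched as follows. The abstract does not define "accuracy rate"; this sketch assumes it corresponds to precision over predicted entities, which is a common convention for named entity recognition but is not confirmed by the source. The function name and the toy entities are hypothetical.

```python
def entity_scores(gold, predicted):
    """Return (precision, recall) for two sets of (text, type) entity pairs."""
    correct = len(gold & predicted)  # entities with both text and type matching
    precision = correct / len(predicted) if predicted else 0.0
    recall = correct / len(gold) if gold else 0.0
    return precision, recall

# Toy annotations for illustration only, not taken from the paper's corpus.
gold = {
    ("aphid", "pest"),
    ("gray mold", "disease"),
    ("Hongyan", "variety"),
}
predicted = {
    ("aphid", "pest"),
    ("gray mold", "pest"),       # right span, wrong type: counts as an error
    ("Hongyan", "variety"),
    ("strawberry", "variety"),   # spurious entity
}

precision, recall = entity_scores(gold, predicted)
print(f"precision={precision:.2f} recall={recall:.2f}")
```

Under this definition a model can gain precision by predicting fewer, safer entities while its recall stays flat or drops, which is one possible reading of the pattern reported above.
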

Key words: LLM, generative pre-trained transformer, agricultural technology knowledge, intelligent question answering, named entity recognition

WANG Ting, WANG Na, CUI Yunpeng, LIU Juan. Agricultural Technology Knowledge Intelligent Question-Answering System Based on Large Language Model[J]. Smart Agriculture, 2023, 5(4): 105-116.

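Table 6 Comparison of LLM replies before and after fine-tuning and optimization

Question: What are the characteristics of semi-forcing strawberry cultivation in large arched sheds?

Reply before fine-tuning:

Semi-forcing strawberry cultivation in large arched sheds is a strawberry growing technique with the following characteristics:

1) Large arched shed: the cultivation structure is relatively large, usually a high arched shed of 60~70 cm, with ventilation and drainage openings at the top that keep the soil moist and well ventilated

2) Semi-forcing: in this cultivation structure, the strawberry growth period is short, usually 20~30 d, while the fruit ripening period is longer, usually 50~60 d. The growth period and the fruit ripening period are therefore relatively close, which can satisfy consumers' demand for rapid growth and large quantities of fruit

3) Variety selection: semi-forcing strawberry cultivation suits varieties with a short growth period, moderate fruit size, and good flavor, such as Hongyan, Chixing, and Jufeng

4) Soil selection: semi-forcing strawberry cultivation usually requires planting in areas with fertile, well-drained soil, together with appropriate fertilization and watering to maintain soil moisture and nutrient supply

5) Pest and disease control: semi-forcing strawberry cultivation requires stronger pest and disease control; pesticides can be used for prevention and treatment to ensure a safe and abundant strawberry harvest

Semi-forcing strawberry cultivation in large arched sheds is an efficient, economical, and convenient strawberry growing technique suitable for extension and application in different regions.

Reply after fine-tuning:

Semi-forcing cultivation in large arched sheds is a mixed form of cultivation that combines open-field forcing cultivation with forcing cultivation under the protection of a large arched shed. Compared with open-field forcing cultivation, semi-forcing cultivation has the following characteristics:

1) It makes full use of high daytime temperatures, speeds up strawberry growth, shortens the time to market, and gives high economic returns

2) It saves labor, reduces the number of manual interventions, and lowers labor costs

3) During semi-forcing cultivation, temperature is more flexible and can be adjusted in good time according to actual conditions, making conditions more suitable for plant growth

4) Strawberries under open-field forcing cultivation easily go "dormant" at low temperatures, whereas semi-forcing cultivation can reach an "intermediate state" between open-field forcing cultivation and protected forcing cultivation, allowing a smooth transition between "dormancy" and "wakefulness"

5) Compared with open-field forcing cultivation, semi-forcing cultivation in large arched sheds is more flexible, allows earlier harvesting, reduces the empty-bag rate, and increases fruit yield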



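Sample grower characteristic	Category	Percentage/%
Gender	Male	95.5
Gender	Female	4.5
Age	40 and under	29.1
Age	40~50	35.3
Age	50 and over	35.6
Education	Junior high school or below	81.3
Education	Senior high school or above	18.7
Share of household members engaged in farming	30% and under	14.2
Share of household members engaged in farming	30%~90%	72.1
Share of household members engaged in farming	90% and over	13.7
Annual household income	50,000 and under	24.4
Annual household income	50,000~80,000	63.9
Annual household income	80,000 and over	11.7
Strawberry yield per unit area	20 t/hm² and above	49.3
Strawberry yield per unit area	5~20 t/hm²	47.2
Strawberry yield per unit area	5 t/hm² and below	3.5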

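Technology type	Demand share/%	Growers rating current technology as satisfactory/%	Basically satisfactory/%	Needs improvement/%	Urgently needs improvement/%
Seedling cultivation technology	75.5	13.85	23.08	40.00	23.08
Water, fertilizer, and management technology	81.2	7.69	32.31	38.46	21.54
Pest and disease control technology	91.3	7.58	16.67	51.52	24.24
Storage, transport, and processing technology	85.6	13.85	29.23	47.69	9.23
High-quality production technology	72.1	10.77	21.54	41.54	26.15
Yield-increasing improved-variety technology	93.3	21.54	33.85	23.08	21.54
Cost-saving, high-efficiency cultivation technology	90.7	7.69	26.15	53.85	12.31
Labor-saving machinery technology	77.7	10.77	38.46	33.85	16.92
Application of new varieties	69.3	25.76	37.88	16.67	19.70
Application of new pesticides and fertilizers	67.9	32.31	26.15	30.77	10.77
Application of new farm implements	73.7	29.23	30.77	26.15	13.85
Brand management technology	49.2	33.85	15.38	30.77	20.00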

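Knowledge topic	Basic production knowledge	Variety screening	Interplanting knowledge	Pest diagnosis and control	Disease diagnosis and control	Pesticide damage diagnosis and control	Total
Number of annotated corpus items	151	107	61	212	232	106	869
Number of knowledge objects	5	1	3	4	4	4	21
Number of knowledge entity types	9	2	4	6	6	5	32
Number of question-answer pairs	63	47	22	44	48	36	260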

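Knowledge topic	Baichuan before/%	Baichuan after/%	Llama before/%	Llama after/%	ChatGPT before/%	ChatGPT after/%	ChatGLM before/%	ChatGLM after/%
Basic production knowledge	62.7	82.9	66.7	82.9	62.7	81.9	79.7	86.5
Variety screening	76.3	87.4	80.3	87.4	76.3	82.4	80.3	89.4
Interplanting knowledge	71.8	85.9	77.8	85.9	73.8	79.9	77.8	87.9
Pest diagnosis and control	72.3	85.9	78.3	88.5	74.7	80.9	78.3	91.6
Disease diagnosis and control	75.0	86.5	80.3	89.2	74.1	81.7	78.3	92.5
Pesticide damage diagnosis and control	72.2	85.2	76.2	85.2	72.2	80.2	76.2	87.2
Average	71.7	85.6	76.6	86.5	72.3	81.2	78.4	89.2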

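Knowledge topic	Baichuan before/%	Baichuan after/%	Llama before/%	Llama after/%	ChatGPT before/%	ChatGPT after/%	ChatGLM before/%	ChatGLM after/%
Basic production knowledge	56.1	47.5	57.3	59.5	51.8	46.7	69.7	70.4
Variety screening	70.7	69.6	70.1	72.9	70.4	62.4	70.2	72.1
Interplanting knowledge	66.6	67.5	69.2	71.5	68.9	60.1	68.0	70.6
Pest diagnosis and control	68.5	63.2	68.2	76.7	69.4	63.5	67.5	75.9
Disease diagnosis and control	67.2	65.4	68.1	77.9	68.2	65.9	69.3	76.3
Pesticide damage diagnosis and control	65.3	67.1	67.5	77.2	67.3	61.4	65.9	70.1
Average	65.7	63.4	66.7	72.6	66.0	60.0	68.4	72.6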
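
As a quick arithmetic check on the table above, the sketch below computes the change in each model's average between the two paired columns. It assumes that the paired columns hold values before and after fine-tuning and that this table reports the recall rate, neither of which the table itself states. The averages are copied from the table's last row.

```python
# (before, after) averages copied from the last row of the table above.
averages = {
    "Baichuan": (65.7, 63.4),
    "Llama": (66.7, 72.6),
    "ChatGPT": (66.0, 60.0),
    "ChatGLM": (68.4, 72.6),
}

# Change in percentage points, rounded to the table's one-decimal precision.
change = {model: round(after - before, 1) for model, (before, after) in averages.items()}
declined = [model for model, delta in change.items() if delta < 0]

for model, delta in change.items():
    print(f"{model}: {delta:+.1f} percentage points")
print("Lower after fine-tuning:", ", ".join(declined))
```

Under these assumptions the averages for Baichuan and ChatGPT fall while those for Llama and ChatGLM rise, consistent with the abstract's remark that recall was in some cases lower after fine-tuning.
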