欢迎您访问《智慧农业(中英文)》官方网站! English

Smart Agriculture ›› 2024, Vol. 6 ›› Issue (2): 1-13.doi: 10.12133/j.smartag.SA202403015

• 专刊--农业信息感知与模型 •    下一篇

农业大模型:关键技术、应用分析与发展方向

郭旺1,2,3, 杨雨森1,4, 吴华瑞1,2,3(), 朱华吉1,2,3, 缪祎晟1,2,3, 顾静秋1,2,3   

  1. 1. 国家农业信息化工程技术研究中心, 北京 100097, 中国
    2. 北京市农林科学院信息技术研究中心, 北京 100097, 中国
    3. 农业农村部数字乡村技术重点实验室, 北京 100097, 中国
    4. 新加坡国立大学 设计与工程学院, 新加坡 117583, 新加坡
  • 收稿日期:2024-03-13 出版日期:2024-03-30
  • 基金资助:
    科技创新2030“新一代人工智能”重大项目(2021ZD0113604); 财政部和农业农村部:国家现代农业产业技术体系资助(CARS-23-D07); 北京市农林科学院创新能力建设项目(KJCX20230219)
  • 作者简介:

    郭 旺,研究方向为农业人工智能与智能系统。E-mail:

  • 通信作者:
    吴华瑞,博士,研究员,研究方向为农业人工智能与大模型。E-mail:

Big Models in Agriculture: Key Technologies, Application and Future Directions

GUO Wang1,2,3, YANG Yusen1,4, WU Huarui1,2,3(), ZHU Huaji1,2,3, MIAO Yisheng1,2,3, GU Jingqiu1,2,3   

  1. 1. National Engineering Research Center for Information Technology in Agriculture, Beijing 100097, China
    2. Information Technology Research Center, Beijing Academy of Agriculture and Forestry Sciences, Beijing 100097, China
    3. Key Laboratory of Digital Village Technology, Ministry of Agriculture and Rural Affairs, Beijing 100097, China
    4. Collage of Design and Engineering, National University of Singapore, Singapore 117583, Singapore
  • Received:2024-03-13 Online:2024-03-30
  • corresponding author:
    WU Huarui, E-mail:
  • About author:

    GUO Wang, E-mail:

  • Supported by:
    Innovation 2030 Major S&T Projects of China(2021ZD0113604); China Agriculture Research System of MOF and MARA Grant(CARS-23-D07); Beijing Academy of Agricultural and Forestry Sciences: Innovation Capacity Building Project(KJCX20230219)

摘要:

[目的/意义] 近年来,人工智能在农业领域的应用取得了显著进展,但仍面临诸如模型数据收集标记困难、模型泛化能力弱等挑战。大模型技术作为近期人工智能领域新的热点技术,已在多个行业的垂直领域中展现出了良好性能,尤其在复杂关联表示、模型泛化、多模态信息处理等方面较传统机器学习方法有着较大优势。[进展] 本文首先阐述了大模型的基本概念和核心技术方法,展示了在参数规模扩大与自监督训练下,模型通用能力与下游适应能力的显著提升。随后,分析了大模型在农业领域应用的主要场景;按照语言大模型、视觉大模型和多模态大模型三大类,在阐述模型发展的同时重点介绍在农业领域的应用现状,展示了大模型在农业上取得的研究进展。[结论/展望] 对农业大模型数据集少而分散、模型部署难度大、农业应用场景复杂等困难提出见解,展望了农业大模型未来的发展重点方向。预计大模型将在未来提供全面综合的农业决策系统,并为公众提供专业优质的农业服务。

关键词: 生成式人工智能, 大模型, 农业知识服务, 机器学习, 自主决策, 多模态, 深度学习

Abstract:

[Significance] Big Models, or Foundation Models, have offered a new paradigm in smart agriculture. These models, built on the Transformer architecture, incorporate numerous parameters and have undergone extensive training, often showing excellent performance and adaptability, making them effective in addressing agricultural issues where data is limited. Integrating big models in agriculture promises to pave the way for a more comprehensive form of agricultural intelligence, capable of processing diverse inputs, making informed decisions, and potentially overseeing entire farming systems autonomously. [Progress] The fundamental concepts and core technologies of big models are initially elaborated from five aspects: the generation and core principles of the Transformer architecture, scaling laws of extending big models, large-scale self-supervised learning, the general capabilities and adaptions of big models, and the emerging capabilities of big models. Subsequently, the possible application scenarios of the big model in the agricultural field are analyzed in detail, the development status of big models is described based on three types of the models: Large language models (LLMs), large vision models (LVMs), and large multi-modal models (LMMs). The progress of applying big models in agriculture is discussed, and the achievements are presented. [Conclusions and Prospects] The challenges and key tasks of applying big models technology in agriculture are analyzed. Firstly, the current datasets used for agricultural big models are somewhat limited, and the process of constructing these datasets can be both expensive and potentially problematic in terms of copyright issues. There is a call for creating more extensive, more openly accessible datasets to facilitate future advancements. Secondly, the complexity of big models, due to their extensive parameter counts, poses significant challenges in terms of training and deployment. However, there is optimism that future methodological improvements will streamline these processes by optimizing memory and computational efficiency, thereby enhancing the performance of big models in agriculture. Thirdly, these advanced models demonstrate strong proficiency in analyzing image and text data, suggesting potential future applications in integrating real-time data from IoT devices and the Internet to make informed decisions, manage multi-modal data, and potentially operate machinery within autonomous agricultural systems. Finally, the dissemination and implementation of these big models in the public agricultural sphere are deemed crucial. The public availability of these models is expected to refine their capabilities through user feedback and alleviate the workload on humans by providing sophisticated and accurate agricultural advice, which could revolutionize agricultural practices.

Key words: artificial intelligence generated content (AIGC), big models, agricultural knowledge services, machine learning, autonomous decision-making, multi-modality, deep learning