欢迎您访问《智慧农业(中英文)》官方网站! English

Smart Agriculture ›› 2025, Vol. 7 ›› Issue (4): 141-158.doi: 10.12133/j.smartag.SA202505008

• 综合研究 • 上一篇    

具身智能农业机器人:关键技术、应用分析、挑战与展望

卫培刚1,2, 曹姗姗1,2, 刘继芳1,2, 刘振虎4, 孙伟1,2(), 孔繁涛2,3()   

  1. 1. 中国农业科学院农业信息研究所,北京 100081,中国
    2. 中国农业科学院国家南繁研究院,海南 三亚 572024,中国
    3. 中国农业科学院农业经济与发展研究所,北京 100081,中国
    4. 中国农业科学院西部农业研究中心,新疆 昌吉 831100,中国
  • 收稿日期:2025-05-09 出版日期:2025-07-30
  • 基金项目:
    国家重点研发计划项目(2024YFD2000305); 中国农业科学院科技创新工程(10-IAED-RC-09-2025)
  • 作者简介:

    卫培刚,博士研究生,研究方向为具身智能机器人、多智能体系统。E-mail:

  • 通信作者:
    孙 伟,博士,研究员,研究方向为多模态感知、智能机器人系统。E-mail:
    孔繁涛,博士,研究员,研究方向为智能机器人系统。E-mail:

Embodied Intelligent Agricultural Robots: Key Technologies, Application Analysis, Challenges and Prospects

WEI Peigang1,2, CAO Shanshan1,2, LIU Jifang1,2, LIU Zhenhu4, SUN Wei1,2(), KONG Fantao2,3()   

  1. 1. Agricultural Information Institute, Chinese Academy of Agricultural Sciences, Beijing 100081, China
    2. National Nanfan Research Institute (Sanya), Chinese Academy of Agricultural Sciences, Sanya 572024, China
    3. Institute of Agricultural Economics and Development, Chinese Academy of Agricultural Sciences, Beijing 100081, China
    4. Institute of Western Agriculture, Chinese Academic of Agricultural Sciences, Changji 831100, China
  • Received:2025-05-09 Online:2025-07-30
  • Foundation items:National Key R&D Programme Project(2024YFD2000305); Science and Technology Innovation Project of the Chinese Academy of Agricultural Sciences(10-IAED-RC-09-2025)
  • About author:

    WEI Peigang, E-mail:

  • Corresponding author:
    SUN Wei, E-mail: ;
    KONG Fantao, E-mail:

摘要:

【目的/意义】 随着农业智能化转型加速推进,具身智能作为融合环境感知、信息认知、自主决策与采取行动于一体的智能系统,正为农业机器人赋予更强的自主感知与复杂环境适应能力,成为推动农业智能机器人发展的重要方向。本文系统地梳理了具身智能在农业机器人领域的技术体系与应用实践,揭示其在提高环境适应性、决策自主性和作业灵活性方面的重要价值,为推动农业机器人向更高水平发展提供理论与实践参考。 【进展】 本文首先归纳了具身智能农业机器人的关键支撑技术,包括多模态融合感知、智能自主决策、自主行动控制与反馈自主学习。随后分析了具身智能在农业中的典型应用场景,构建了以“具身感知-具身认知-具身执行-具身进化”组成的核心框架,并结合农业场景对其在各个方面的实现路径进行了分类探讨。最后深入剖析了当前面临的技术瓶颈与落地挑战,包括系统集成约束多、数据虚实差距大、场景泛化能力弱等问题。 【结论/展望】 从高质量数据集与仿真平台构建、领域大模型融合应用、分层协同架构设计等方面对具身智能农业机器人的未来发展趋势进行了展望,为具身智能农业机器人的研究者与工程实践者提供参考。

关键词: 具身智能, 农业机器人, 具身感知, 具身认知, 具身执行, 具身进化

Abstract:

[Significance] Most current agricultural robots lack the ability to adapt to complex agricultural environments and still have limitations when facing variable, uncertain and unstructured agricultural scenarios. With the acceleration of agricultural intelligent transformation, embodied intelligence, as an intelligent system integrating environment perception, information cognition, autonomous decision-making and action, is giving agricultural robots stronger autonomous perception and complex environment adaptation ability, and becoming an important direction to promote the development of agricultural intelligent robots. In this paper, the technical system and application practice of embodied intelligence are sorted out systematically in the field of agricultural robots, its important value is revealed in improving environmental adaptability, decision-making autonomy and operational flexibility, and theoretical and practical references are provided to promote the development of agricultural robots to a higher level. [Progress] Firstly, the key supporting technologies of embodied intelligent agricultural robots are systematically sorted out, focusing on four aspects, namely, multimodal fusion perception, intelligent autonomous decision-making, autonomous action control and feedback autonomous learning. In terms of multimodal fusion perception, the modular artificial intelligence (AI) algorithm architecture and multimodal large model architecture are summarised. In terms of intelligent autonomous decision-making, two types of approaches based on artificial programming and dedicated task algorithms, and on large-scale pre-trained models are outlined. In terms of autonomous action control, three types of approaches based on the fusion of reinforcement learning and mainstream transformer, large model-assisted reinforcement learning, end-to-end mapping of semantics to action and action end-to-end mapping are summarised. In the area of feedback autonomous learning, the focus is on the related technological advances in the evolution of large model-driven feedback modules. Secondly, it analysed the typical application scenarios of embodied intelligence in agriculture, constructed a technical framework with "embodied perception - embodied cognition - embodied execution - embodied evolution" as the core, and discussed the implementation paths of each module according to the agricultural scenarios. The paths of each module are classified and discussed. Finally, the key technical bottlenecks and application challenges are analysed in depth, mainly including the high complexity of system integration, the significant gap between real and virtual data, and the limited ability of cross-scene generalisation. [Conclusions and Prospects] The future development trend of embodied intelligent agricultural robots is summarised and prospected from the construction of high-quality datasets and simulation platforms, the application of domain large model fusion, and the design of layered collaborative architectures, etc. It mainly focuses on the following aspects. Firstly, the construction of high-quality agricultural scenarios of embodied intelligence datasets is a key prerequisite to realise the embodied intelligence landing in agriculture. The development of embodied intelligent agricultural robots needs to rely on rich and accurate agricultural scene task datasets and highly realistic simulators to support physical interaction and behavioural learning. Secondly, the fusion of basic big model and agricultural domain model is the accelerator of intelligent perception and decision-making of agricultural robots. The in-depth fusion of general basic models in agricultural scenarios will bring stronger perception, understanding and reasoning capabilities to the embodied-intelligent agricultural robots. Thirdly, the "big model high-level planning + small model bottom-level control" architecture is an effective solution to balance intelligence and efficiency. Although large models have advantages in semantic understanding and global strategy planning, their reasoning latency and arithmetic demand can hardly meet the real-time and low-power requirements of agricultural robots. The use of large models for high-level task decomposition, scene semantic parsing and decision making, coupled with lightweight small models or traditional control algorithms to complete the underlying sensory response and motion control, can achieve the complementary advantages of the two.

Key words: embodied intelligence, agricultural robotics, embodied perception, embodied cognition, embodied execution, embodied evolution

中图分类号: