欢迎您访问《智慧农业(中英文)》官方网站! English

Smart Agriculture ›› 2023, Vol. 5 ›› Issue (4): 79-91.doi: 10.12133/j.smartag.SA202309012

• 专题--面向智慧农业的人工智能和机器人技术 • 上一篇    下一篇

基于联邦学习的粮食供应链隐私数据要素协同计算研究

许继平1,2(), 李卉1,2, 王浩宇1,2, 周燕1,2, 王昭洋1,2(), 于重重1,2   

  1. 1. 北京工商大学 计算机与人工智能学院,北京 100048,中国
    2. 中国轻工业工业互联网与大数据重点实验室,北京 100048,中国
  • 收稿日期:2022-09-11 出版日期:2023-12-30
  • 作者简介:
    许继平,研究方向为区块链、工业互联网和智能信息处理。E-mail:

    XU Jiping, E-mail:

  • 通信作者:
    王昭洋,博士,副教授,研究方向为可信定位与工业互联。E-mail:

Collaborative Computing of Food Supply Chain Privacy Data Elements Based on Federated Learning

XU Jiping1,2(), LI Hui1,2, WANG Haoyu1,2, ZHOU Yan1,2, WANG Zhaoyang1,2(), YU Chongchong1,2   

  1. 1. School of Computer and Artificial Intelligence, Beijing Technology and Business University, Beijing 100048, China
    2. Key Laboratory of Industrial Internet and Big Data of China Light Industry, Beijing 100048, China
  • Received:2022-09-11 Online:2023-12-30
  • corresponding author:
    WANG Zhaoyang, E-mail:
  • Supported by:
    Project of Industrial Internet Innovation and Development Project of MIIT(TC200A00L-3); Project of Industrial Internet Innovation and Development Project of MIIT(TC210A02N); Project Grant of Science and Technology/Social Science Program of Beijing Municipal Commission of Education(KM202210011013)

摘要:

[目的/意义] 隐私数据要素的流转是保证粮食供应链安全高效运行的重要基础。实现粮食供应链中隐私数据要素的协同计算对保障粮食质量安全具有重大意义。 [方法] 针对供应链中不同主体间因数据的隐私性而无法共享并参与计算的难题,提出基于工业互联网标识解析技术与联邦学习的粮食供应链数据流转与协同计算架构,设计了支撑联邦学习数据互通的数据标识编码和任务标识编码及对应的参数、信息和评价数据模型;搭建了不同主体数据特征同构的单环节横向联邦学习模型和数据特征异构的跨环节纵向联邦学习模型,基于逻辑回归算法对模型参数进行快速调整计算,以粮食供应链安全风险评估场景为对象,依托开源FATE(Federated AI Technology Enabler)联邦学习平台进行测试验证。 [结果和讨论] 相比传统的单一主体评估计算,横向联邦学习评估准确率提升6.7%,纵向联邦学习评估准确率提升8.3%。 [结论] 采用联邦学习的方式提高了评估的准确性。本研究可为粮食供应链安全高效稳定运行提供技术支撑。

关键词: 联邦学习, 标识解析, 协同计算, 隐私数据交互, 粮食供应链, 数据要素

Abstract:

[Objective] The flow of private data elements plays a crucial role in the food supply chain, and the safe and efficient operation of the food supply chain can be ensured through the effective management and flow of private data elements. Through collaborative computing among the whole chain of the food supply chain, the production, transportation and storage processes of food can be better monitored and managed, so that possible quality and safety problems can be detected and solved in a timely manner, and the health and rights of consumers can be safeguarded. It can also be applied to the security risk assessment and early warning of the food supply chain. By analyzing big data, potential risk factors and abnormalities can be identified, and timely measures can be taken for early warning and intervention to reduce the possibility of quality and safety risks. This study combined the industrial Internet identification and resolution system with the federated learning algorithm, which can realize collaborative learning among multiple enterprises, and each enterprise can carry out collaborative training of the model without sharing the original data, which protects the privacy and security of the data while realizing the flow of the data, and it can also make use of the data resources distributed in different segments, which can realize more comprehensive and accurate collaborative calculations, and improve the safety and credibility of the industrial Internet system's security and credibility. [Methods] To address the problem of not being able to share and participate in collaborative computation among different subjects in the grain supply chain due to the privacy of data elements, this study first analyzed and summarized the characteristics of data elements in the whole link of grain supply chain, and proposed a grain supply chain data flow and collaborative computation architecture based on the combination of the industrial Internet mark resolution technology and the idea of federated learning, which was constructed in a layered and graded model to provide a good infrastructure for the decentralized between the participants. The data identification code for the flow of food supplied chain data elements and the task identification code for collaborative calculation of food supply chain, as well as the corresponding parameter data model, information data model and evaluation data model, were designed to support the interoperability of federated learning data. A single-link horizontal federation learning model with isomorphic data characteristics of different subjects and a cross-link vertical federation learning model with heterogeneous data characteristics were constructed, and the model parameters were quickly adjusted and calculated based on logistic regression algorithm, neural network algorithm and other algorithms, and the food supply chain security risk assessment scenario was taken as the object of the research, and the research was based on the open source FATE (Federated AI Technology) federation learning model. Enabler (Federated AI Technology) federated learning platform for testing and validation, and visualization of the results to provide effective support for the security management of the grain supply chain. [Results and Discussion] Compared with the traditional single-subject assessment calculation method, the accuracy of single-session isomorphic horizontal federation learning model assessment across subjects was improved by 6.7%, and the accuracy of heterogeneous vertical federation learning model assessment across sessions and subjects was improved by 8.3%. This result showed that the single-session isomorphic horizontal federated learning model assessment across subjects could make full use of the data information of each subject by merging and training the data of different subjects in the same session, thus improving the accuracy of security risk assessment. The heterogeneous vertical federated learning model assessment of cross-session and cross-subject further promotes the application scope of collaborative computing by jointly training data from different sessions and subjects, which made the results of safety risk assessment more comprehensive and accurate. The advantage of combining federated learning and logo resolution technology was that it could conduct model training without sharing the original data, which protected data privacy and security. At the same time, it could also realize the effective use of data resources and collaborative computation, improving the efficiency and accuracy of the assessment process. [Conclusions] The feasibility and effectiveness of this study in practical applications in the grain industry were confirmed by the test validation of the open-source FATE federated learning platform. This provides reliable technical support for the digital transformation of the grain industry and the security management of the grain supply chain, and helps to improve the intelligence level and competitiveness of the whole grain industry. Therefore, this study can provide a strong technical guarantee for realizing the safe, efficient and sustainable development of the grain supply chain.

Key words: federated learning, identification resolution, collaborative computation, privacy data exchange, food supply chain, data elements

中图分类号: