A Survey on Federated Unlearning (联邦忘却学习研究综述)

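Abstract (translated from Chinese): Data has become an important factor of production alongside land, labor, capital, and technology. Using data analysis to mine the latent value of data helps drive industrial innovation, technological upgrading, and regional economic development. However, risks such as privacy leakage during data use restrict the circulation and sharing of data. How to protect data privacy while data is circulated and shared has therefore become a research hotspot. Federated unlearning revokes the training updates that a user's data contributed to a federated learning model, which further protects the data security of federated learning users. This paper surveys research on federated unlearning. First, it briefly describes the federated learning architecture and introduces the concepts and definitions of unlearning and federated unlearning. Second, it divides federated unlearning algorithms into two classes according to the object being modified, global-model-oriented and local-model-oriented, and analyzes the implementation details, strengths, and weaknesses of each class. Third, it details the evaluation metrics commonly used in federated unlearning, groups them into model performance metrics, forgetting effect metrics, and privacy protection metrics, and analyzes the strengths and weaknesses of each type. Finally, it discusses future research directions for federated unlearning.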

English title: A Survey on Federated Unlearning

English abstract: Data has become an important factor of production alongside land, labor, capital, and technology. By leveraging data analysis to mine its potential value, we can uncover profound insights into consumer behavior, market trends, and production efficiency, thereby promoting industrial innovation, technology upgrades, and regional economic development. However, using and sharing data may cause privacy leakage. Overlooking this risk has also led to more serious issues, such as the leakage of sensitive data and illegal cross-border data transfers. For instance, some financial companies, lacking comprehensive privacy protection mechanisms for collecting, circulating, and utilizing user data, have experienced incidents in which data was used and traded without user consent. As a result, data circulation and sharing are severely hindered. To further protect user data privacy, federated unlearning can roll back the training updates that a user's data contributed to the machine learning model, which strengthens the privacy and security of user data. In this paper, we review research on federated unlearning. Firstly, we conduct an in-depth analysis of the federated learning training architecture, highlighting the specific types of privacy leakage threats. To reduce the risk of privacy leaks, we introduce the concept and definition of unlearning and list different unlearning scenarios, which leads to the concept of federated unlearning. On this basis, we outline the processes involved in federated unlearning and introduce unlearning granularity and challenges. Secondly, we divide federated unlearning algorithms into two categories according to the modified object: global model-oriented and local model-oriented algorithms. We further subdivide these two categories into several subcategories and analyze the implementation details of each algorithm in depth. To compare their strengths and weaknesses, we conduct detailed comparative analyses across the categories of algorithms, focusing on aspects such as algorithm performance, types of requesters, and forgetting requests. We also conduct an experiment to show the performance of the different categories of federated unlearning algorithms in terms of model accuracy. Thirdly, we divide the commonly used performance metrics into three categories: model performance metrics, forgetting effect metrics, and privacy protection metrics. We compare and analyze these metrics in detail in terms of the unlearning stage, as well as their advantages and drawbacks. Fourthly, we summarize the research and applications of federated unlearning in privacy protection and attack resistance, including the protection of commercial information privacy, federated recommendation systems, and federated clustering. Finally, this paper looks ahead to future research directions for unlearning algorithms and applications from a personalized perspective, including promoting the market circulation of data elements, deletion of low-quality data, forgetting applications in cross-domain machine learning, customized services, and federated unlearning in special scenarios.

English keywords:

federated learning; federated unlearning; digital economy; privacy preserving; edge intelligence

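Authors:

Wang Pengfei (王鹏飞), Wei Zongzheng (魏宗正), Zhou Dongsheng (周东生), Song Wei (宋威), Xiao Yunming (肖蕴明), Sun Geng (孙庚), Yu Shuo (于硕), Zhang Qiang (张强)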

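Author affiliations:

School of Computer Science and Technology, Dalian University of Technology, Dalian, Liaoning 116024

Key Laboratory of Social Computing and Cognitive Intelligence (Ministry of Education), Dalian University of Technology, Dalian, Liaoning 116024

Key Laboratory of Advanced Design and Intelligent Computing (Ministry of Education), Dalian University, Dalian, Liaoning 116622

Department of Computer Science, Northwestern University, Evanston 60208, USA

College of Computer Science and Technology, Jilin University, Changchun 130012

Key Laboratory of Symbolic Computation and Knowledge Engineering (Ministry of Education), Jilin University, Changchun 130012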

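Keywords (translated from Chinese):

federated learning; federated unlearning; digital economy; privacy protection; edge intelligence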

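Funding:

National Key R&D Program of China; Joint Funds of the National Natural Science Foundation of China; Young Scientists Fund of the National Natural Science Foundation of China; General Program of the China Postdoctoral Science Foundation; Fundamental Research Funds for the Central Universities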

Grant numbers:

2021ZD0112400; U1908214; 62202080; 2023M733354; DUT23YG122

Publication year:

2024

DOI:

10.11897/SP.J.1016.2024.00396

Chinese Journal of Computers (计算机学报)

China Computer Federation; Institute of Computing Technology, Chinese Academy of Sciences

Indexed in: CSTPCD; PKU Core Journals (北大核心)

Impact factor: 3.18

ISSN: 0254-4164

Year, volume (issue): 2024, 47(2)

Reference count: 2