计算机工程与设计2024,Vol.45Issue(4) :967-973.DOI:10.16208/j.issn1000-7024.2024.04.002

基于分布式多关联属性的高维数据差分隐私保护方法

Differential privacy protection method of multi-associated attribute based on distributed high dimensional data

褚治广 李俊燕 陈昊 张兴
计算机工程与设计2024,Vol.45Issue(4) :967-973.DOI:10.16208/j.issn1000-7024.2024.04.002

基于分布式多关联属性的高维数据差分隐私保护方法

Differential privacy protection method of multi-associated attribute based on distributed high dimensional data

褚治广 1李俊燕 2陈昊 2张兴2
扫码查看

作者信息

  • 1. 北京工业大学信息学部,北京 100124;辽宁工业大学辽宁省工业互联网网络与数据安全重点实验室,辽宁锦州 121001
  • 2. 辽宁工业大学辽宁省工业互联网网络与数据安全重点实验室,辽宁锦州 121001
  • 折叠

摘要

针对高维数据发布的过程中存在由多关联属性引发的隐私信息泄露风险问题,在分布式环境下提出一种满足差分隐私保护的多关联属性高维数据发布方法(HDMPDP).根据数据维度,提出一种基于分布式划分的粗糙集高效降维方法,完成对高维复杂数据特征属性的划分,降低数据维度的同时提高处理效率;设计属性分类准则,利用属性信息熵改进关联分析方法;对得到的属性分别进行加噪,优化噪声添加的方式,减轻关联属性带来的隐私问题.在Spark分布式框架下实现隐私保护数据发布,通过高维数据实验验证了该方法的有效性和隐私保护的安全性.

Abstract

To solve the problem of privacy information leakage caused by multi associated attributes in the publishing process of high-dimensional data sets,a multi associated attribute high-dimensional data privacy protection method(HDMPDP)was pro-posed in distributed environment.According to the data dimension,an efficient dimensionality reduction method of rough set based on distributed partition was proposed,to complete the division of high-dimensional complex data feature attributes,reduce the data dimension and improve the processing efficiency.The attribute classification criterion was designed,and the attribute information entropy was used.The associated analysis method was improved.The noise was added to the obtained attributes respectively,the way of adding noise was optimized,and the privacy problem caused by associated attributes was alleviated.The privacy-preserving data release was realized under the Spark distributed framework,and the effectiveness of the method and the security of privacy-preserving were verified through high-dimensional data experiments.

关键词

高维数据/多关联属性/差分隐私/分布式/关联分析/粗糙集/隐私保护

Key words

high dimensional data/multi-associated attribute/differential privacy/distributed/association analysis/rough set/privacy protection

引用本文复制引用

基金项目

国家自然科学基金项目(61802161)

辽宁省教育厅科学研究基金项目(JZL202015404)

辽宁省教育厅科学研究基金项目(LJKZ0625)

出版年

2024
计算机工程与设计
中国航天科工集团二院706所

计算机工程与设计

CSTPCD北大核心
影响因子:0.617
ISSN:1000-7024
参考文献量16
段落导航相关论文