A Multi-party Vertically Partitioned Data Synthesis Mechanism with Personalized Differential Privacy
With the rapid development of big data technology and the continuous growth in data volume, large amounts of data are constantly collected by different companies and institutions. Aggregating and publishing the data they hold helps to provide better services and support decision-making. However, each party's data may contain private information of differing sensitivity, so personalized privacy protection requirements must be met when data from all parties are aggregated and published. To address multi-party data publication while satisfying the different privacy protection needs of all parties, a Multi-party Vertically partitioned Data Synthesis mechanism with Personalized Differential Privacy (PDP-MVDS) is proposed. Low-dimensional marginal distributions are first generated to reduce the dimensionality of the high-dimensional data; a randomly initialized dataset is then updated using these marginal distributions; finally, a synthesized dataset whose distribution is similar to that of the real dataset aggregated from all parties is published. Personalized differential privacy protection is achieved by dividing the privacy budget. A secure scalar product protocol and the threshold Paillier encryption algorithm are used to protect each party's data during aggregation. A distributed Laplace perturbation mechanism is used to protect the privacy of the marginal distributions aggregated from the parties. Rigorous theoretical analysis proves that PDP-MVDS ensures the security of each participant's data and of the final published dataset. Furthermore, experimental results on public datasets show that PDP-MVDS can obtain a multi-party synthesized dataset with high utility at low overhead.
Privacy protection; Multi-party data publication; Secure Multi-Party Computation (SMPC); Personalized Differential Privacy (PDP); Vertically partitioned data
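
The distributed Laplace perturbation mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not specify the exact construction, so the following assumes the standard one based on the infinite divisibility of the Laplace distribution: if each of n parties contributes the difference of two independent Gamma(1/n, b) draws, the sum of all shares follows Laplace(0, b). The function names, the `marginal` array (a stand-in for the securely aggregated counts), and the single `epsilon` parameter are illustrative assumptions, not part of PDP-MVDS as published.

```python
import numpy as np

def noise_share(n_parties, scale, size, rng):
    """One party's noise share: difference of two Gamma(1/n, scale) draws.

    Summing the shares of all n parties gives Laplace(0, scale) noise,
    so no single party knows the total noise that was added.
    """
    shape = 1.0 / n_parties
    return rng.gamma(shape, scale, size) - rng.gamma(shape, scale, size)

def perturb_marginal(marginal, n_parties, epsilon, sensitivity, rng):
    """Add jointly generated Laplace noise to an aggregated marginal.

    Assumption: `marginal` stands in for counts that would really be
    aggregated under encryption; here it is a plain array for illustration.
    """
    scale = sensitivity / epsilon  # Laplace scale b = sensitivity / epsilon
    noisy = np.asarray(marginal, dtype=float).copy()
    for _ in range(n_parties):
        # In a real deployment each party would add its share locally.
        noisy += noise_share(n_parties, scale, noisy.shape, rng)
    return noisy

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    counts = np.array([120.0, 45.0, 80.0, 15.0])  # hypothetical 2-way marginal cells
    print(perturb_marginal(counts, n_parties=3, epsilon=0.5, sensitivity=1.0, rng=rng))
```

In this sketch the noise scale is fixed by one `epsilon`; how the personalized budgets of the parties map to the budget spent on each marginal is left open, since the abstract only states that the privacy budget is divided.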