Simulation of Automatic Anonymous Method for Sensitive Data in Untrusted Environment
An automatic anonymization method for sensitive data in untrusted environments, based on frequent itemset discovery, is proposed. Semantic analysis is performed on the practical meaning implied by the attributes of the sensitive data, and the semantic similarity and semantic diversity among attribute values are calculated. The attributes in the quasi-identifier are then divided into two types, numeric and categorical, and a corresponding generalization strategy is provided for each type. The information loss caused by generalizing the sensitive data set is computed, and a weight-aware utility metric function for the generalized anonymous table is defined using frequent itemset discovery. In addition, the average weighted information amount of the equivalence groups of sensitive data is calculated, which completes the automatic anonymization. Experimental results show that the proposed method effectively reduces the risk of sensitive data leakage and also reduces the information loss caused by generalization during automatic anonymization.
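The abstract does not give the formulas for generalization loss, so the following is only a minimal illustrative sketch of how a weighted information loss over numeric and categorical attributes could be computed. It assumes a range-based (normalized certainty penalty style) loss, which is a common choice in the anonymization literature and not necessarily the metric used in the paper. The taxonomy, the attribute names, the weights, and all function names are hypothetical.

```python
from typing import Dict, List

# Hypothetical generalization taxonomy for a categorical attribute
# (child -> parent); "*" is the fully suppressed root.
TAXONOMY: Dict[str, str] = {
    "engineer": "technical",
    "technician": "technical",
    "teacher": "education",
    "professor": "education",
    "technical": "*",
    "education": "*",
}
LEAVES: List[str] = ["engineer", "technician", "teacher", "professor"]


def ancestors(value: str) -> List[str]:
    """Return the value itself followed by all of its taxonomy ancestors."""
    chain = [value]
    while value in TAXONOMY:
        value = TAXONOMY[value]
        chain.append(value)
    return chain


def leaves_under(node: str) -> List[str]:
    """Return the leaf values that a taxonomy node covers."""
    return [leaf for leaf in LEAVES if node in ancestors(leaf)]


def numeric_loss(low: float, high: float,
                 domain_low: float, domain_high: float) -> float:
    """Loss of generalizing a number to [low, high]: interval width
    divided by the width of the attribute's whole domain."""
    return (high - low) / (domain_high - domain_low)


def categorical_loss(node: str) -> float:
    """Loss of generalizing a category to a taxonomy node: fraction of
    leaves covered, or 0 when the node is still a single leaf."""
    covered = len(leaves_under(node))
    return 0.0 if covered <= 1 else covered / len(LEAVES)


def weighted_record_loss(losses: Dict[str, float],
                         weights: Dict[str, float]) -> float:
    """Weighted average of per-attribute losses for one generalized record."""
    total_weight = sum(weights[attr] for attr in losses)
    return sum(weights[attr] * losses[attr] for attr in losses) / total_weight


# Example: age 34 generalized to [30, 40] in a domain of [20, 70],
# and job "engineer" generalized to "technical".
example_losses = {
    "age": numeric_loss(30, 40, 20, 70),
    "job": categorical_loss("technical"),
}
example_weights = {"age": 0.7, "job": 0.3}  # assumed attribute weights
example_total = weighted_record_loss(example_losses, example_weights)
```

In this sketch a larger weight makes generalizing that attribute more costly, so a weight-aware anonymizer would prefer to generalize low-weight attributes first. Averaging `weighted_record_loss` over the records of an equivalence group would give a per-group figure loosely analogous to the average weighted information amount mentioned in the abstract.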