Research on Privacy Data Identification and Measurement Based on Medical Information Text
The results of data classification in medical industry standards are fuzzy, with few accompanying measurement results. To address these problems, this study adopted medical information text mining to objectively measure medical data privacy. The measurement results can provide a reference for verifying and improving current medical data classification results. In this study, the sources of medically sensitive data included industry standards, legal regulations, academic papers, and breach cases. A medically sensitive data unit is composed of sensitive nouns (also known as sensitive data items), sensitive verbs, and sensitive degree words, which are used in the privacy recognition model. The privacy measurement model considers the sensitivity, semantic strength, and text strength of sensitive data. In the ranking of privacy values, medical application data ranked highest, followed by health status, medical payment, and personal attribute data.
medical information text; personal privacy; privacy data identification; privacy measurement
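
As an illustration of how a sensitive data unit (noun, verb, degree word) might feed a privacy value, the following is a minimal sketch only. The abstract does not give the study's lexicons, weights, or combination rule, so everything here is an assumption: the tiny dictionaries, the default weights, the names `find_units` and `privacy_value`, and the choice to multiply sensitivity, semantic strength, and text strength together.

```python
from dataclasses import dataclass
from typing import List, Optional

# Hypothetical lexicons with made-up weights (not taken from the study).
SENSITIVE_NOUNS = {"diagnosis": 0.9, "prescription": 0.8, "insurance": 0.6, "address": 0.4}
SENSITIVE_VERBS = {"leaked": 1.0, "disclosed": 0.8, "shared": 0.5}
DEGREE_WORDS = {"highly": 1.5, "strictly": 1.3, "slightly": 0.7}


@dataclass
class SensitiveUnit:
    noun: str                # sensitive data item
    verb: Optional[str]      # sensitive verb found in the same text, if any
    degree: Optional[str]    # sensitive degree word found in the same text, if any


def find_units(text: str) -> List[SensitiveUnit]:
    """Recognize sensitive units by simple dictionary lookup (assumed approach)."""
    tokens = text.lower().replace(",", " ").replace(".", " ").split()
    verb = next((t for t in tokens if t in SENSITIVE_VERBS), None)
    degree = next((t for t in tokens if t in DEGREE_WORDS), None)
    return [SensitiveUnit(t, verb, degree) for t in tokens if t in SENSITIVE_NOUNS]


def privacy_value(unit: SensitiveUnit, text_strength: float) -> float:
    """Combine the three factors named in the abstract as a product (assumption)."""
    sensitivity = SENSITIVE_NOUNS[unit.noun]
    # Assumed semantic strength: verb weight scaled by degree-word weight,
    # with neutral defaults when no verb or degree word is present.
    semantic = SENSITIVE_VERBS.get(unit.verb, 0.5) * DEGREE_WORDS.get(unit.degree, 1.0)
    return sensitivity * semantic * text_strength


units = find_units("The diagnosis was highly leaked.")
scores = [privacy_value(u, text_strength=1.0) for u in units]
```

Here `text_strength` is passed in by the caller; one plausible choice, again an assumption, is the relative frequency of the data item across the source texts.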