Research on Methods for Sensitive Information Detection in Chinese Text
To prevent the proliferation of inappropriate and sensitive information on the internet and to create a clean and civilized online environment,this article investigates the issue of sensitive information detection in Chinese text.Based on survey analysis,a detection framework composed of three stages-the construction of a sensitive word library,the discovery of suspicious text,and sensitive information detection-is proposed,along with strategies and methods for each stage.Experiments were conducted on a method of expanding the sensitive word library based on Word2vec,and the results showed that this method had significant effects.
Word2vecsensitive information detectionChinese text