This paper surveys the Chinese data cleaning problem. It clarifies the relationship between total data quality management and data cleaning, and gives the definition and objects of data cleaning. It introduces the background of the data cleaning problem, the current state of research, and active research areas; briefly presents the basic principles and several models of data cleaning; and analyzes existing algorithms. In light of national conditions and project requirements, the paper emphasizes methods for cleaning Chinese data. It identifies the weaknesses of current Chinese data cleaning and discusses future research topics and applications related to the problem.
Chinese data cleaning; data quality management; data integration