Research on Multi-source Heterogeneous Data Governance Technology Based on Cloud Data Center
Currently,conventional multi-source heterogeneous data governance methods are mainly used to judge data attributes to achieve sub-regional data cleaning,which leads to poor governance performance due to the lack of non-linear data analysis.For this reason,a multi-source heterogeneous data governance technique based on cloud data center is proposed.The ETL function of relation-al database is adopted to clean the data,which defines the data transformation mode and data cleaning rules.The mutual information coefficient is introduced to determine the degree of data relevance,and analyze the data relevance analysis.The cloud data center is used as a carrier to construct the multi-source heterogeneous data governance system.In the experiments,the governance perform-ance of the proposed data governance technique is examined.The final test results show that the proposed data governance technique has a high checking accuracy rate and more ideal data governance effect.
cloud data centermulti-source heterogeneous datadata governancedata cleansing