Missing Value Interpolation Algorithm of Unstructured Big Data Based on Transfer Learning

Digital information produces massive, multi-faceted unstructured big data, and factors such as external interference and damaged data structures cause parts of that information to be lost. To address this problem, a missing value imputation algorithm for unstructured big data based on transfer learning is proposed. A transfer learning algorithm predicts where values are missing in the unstructured big data. A naive Bayes algorithm then classifies the data features, measures the weights between attributes, determines the feature difference vectors of the data categories, and identifies the degree of feature difference. A kernel regression model applies a nonlinear mapping to the missing parts of the data, and polynomial transformation encoding describes the cross-space complementary conditions of the data, completing the imputation of the missing values. Experimental results show that the proposed algorithm effectively imputes missing values in unstructured big data, achieves good imputation results, and improves imputation accuracy.
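The abstract names kernel regression as the step that fills in the missing parts but gives no formulas or parameters. As a rough illustration of that one idea only, the sketch below imputes missing entries with a Nadaraya-Watson kernel regression estimate. The Gaussian kernel, the `bandwidth` parameter, the column-mean fallback, and the function names `gaussian_kernel` and `kernel_regression_impute` are all assumptions made for this example, not details taken from the paper. The transfer learning, naive Bayes weighting, and polynomial encoding steps are not modeled here.

```python
import numpy as np


def gaussian_kernel(dist2, bandwidth):
    """Gaussian weight for squared distances (kernel choice is an assumption)."""
    return np.exp(-dist2 / (2.0 * bandwidth ** 2))


def kernel_regression_impute(X, bandwidth=1.0):
    """Fill NaN entries with Nadaraya-Watson estimates (hypothetical helper).

    For a missing entry (i, j), donor rows are those that observe column j
    and every feature that row i observes. The estimate is the
    kernel-weighted average of the donors' values in column j.
    """
    X = np.asarray(X, dtype=float)
    filled = X.copy()
    missing = np.isnan(X)
    for i, j in zip(*np.nonzero(missing)):
        obs_cols = ~missing[i]  # features observed in row i
        donors = ~missing[:, j] & ~missing[:, obs_cols].any(axis=1)
        if obs_cols.any() and donors.any():
            diff = X[donors][:, obs_cols] - X[i, obs_cols]
            weights = gaussian_kernel((diff ** 2).sum(axis=1), bandwidth)
            if weights.sum() > 0:
                filled[i, j] = np.dot(weights, X[donors, j]) / weights.sum()
                continue
        # Fallback when no usable donors exist: mean of the observed column.
        filled[i, j] = np.nanmean(X[:, j])
    return filled


if __name__ == "__main__":
    # Toy data where the second column is twice the first.
    data = np.array([[1.0, 2.0], [2.0, 4.0], [3.0, 6.0], [2.0, np.nan]])
    print(kernel_regression_impute(data, bandwidth=0.5))
```

In the toy data the donors sit symmetrically around the incomplete row, so the imputed entry lands near 4.0. A smaller `bandwidth` makes the estimate depend more heavily on the closest donors, while a larger one moves it toward the plain column mean.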

transfer learning; unstructured big data; imputation of missing values; missing value prediction; kernel regression function

YAN Yuanhai (颜远海), YANG Liyun (杨莉云)

School of Data Science, Guangzhou Huashang College, Zengcheng, Guangdong 511300

Innovation and Strong School Project Fund (grant no. 2017KQNCX266)

2024

Journal of Jilin University (Information Science Edition)
Jilin University

CSTPCD
Impact factor: 0.607
ISSN: 1671-5896
Year, volume (issue): 2024, 42(2)