由于现有的多数概念演化检测方法本质上是基于监督学习,且通常用于解决一个时间段内仅出现一个新类,不能处理数据流中的类消失和类循环任务.为此,提出一种基于弱监督集成的概念演化自适应检测方法(AD_WE:Adaptive Detection Method for Concept Evolution Based on Weakly Supervised Ensemble).该方法利用弱监督集成策略构建集成学习器,对数据块中的训练样本进行局部预测,在此基础上,基于局部密度和相对距离识别特征空间中具有较强内聚性的相似数据并对其聚类,对聚类结果进行相似度比较,实现新类实例的检测及不同新类的区分;同时根据数据随时间变化特征建立动态衰减模型,及时消除消失类,并通过相似度比较检测循环类.实验表明,所提方法能对概念演化做出及时响应,可有效识别消失类和循环类,提高学习器的泛化性能.
Adaptive Detection Method for Concept Evolution Based on Weakly Supervised Ensemble
Most of the existing detection methods for concept evolution are essentially based on supervised learning and are often used to solve the problem that only one novel class appears in a period of time.However,they can not handle the task of a class disappearing and recurring in streaming data.To address the above problems,an adaptive detection method for concept evolution based on weakly supervised ensemble(AD_WE)is proposed.The weakly supervised ensemble strategy is used to construct an ensemble learner to make local predictions on the training samples in the data block.Similar data with strong cohesion in the feature space are detected and clustered using local density and relative distance.The similarity of the clustering results is then compared to detect novel class instances and distinguish between different novel classes.And a dynamic decay model is established according to the characteristics of data change over time.The vanished class is eliminated in time,and the recurring class is detected through similarity comparison.Experiments show that the proposed method can respond to concept evolution in a timely manner,effectively identify vanished classes and recurring classes,and improve the generalization performance of the learner.
concept evolutionweakly supervised ensembleadaptive modeldynamic decay modelvanished classrecurring class