Neighborhood-tolerance mutual information selection ensemble classification algorithm for incomplete data sets
In order to solve the classification problem of incomplete mixed information systems,the concept of neighborhood-tolerance mutual information is defined by combining neighborhood-tolerance and mutual information theory in granular computing,and a selective ensemble classification algorithm based on neighborhood-tolerance mutual information is proposed by using ensemble learning.In this algorithm,information particles are obtained according to the missing attributes,and the space is constructed by dividing the particles into different layers.A new base classifier is constructed by integrating the BP neural network as the base classifier on different layers.Then,the neighborhood-tolerance mutual information about class attributes is calculated according to the missing attributes of each information particle to measure the importance of each information particle,and the weight of the base classifier is redefined according to the prediction accuracy of the base classifier and the neighborhood-tolerance mutual information.Finally,based on the predicted samples,the weighted ensemble prediction results of base classifier are analyzed and compared with the traditional ensemble classification algorithm.For partial incomplete mixed data sets,the proposed ensemble classification algorithm can effectively improve the classification accuracy.
incomplete hybrid information systemneighborhood-tolerance mutual informationensemble learningclassification