Adaptive random tree ensemble for evolving data stream classification

扫码查看

原文链接

NETL
NSTL
Elsevier

外文摘要：Data stream mining with concept drift is a significant challenge in machine learning because this scenario requires the ability to handle unlimited and ever-changing data and real-time processing. An often employed strategy in data stream mining involves utilizing ensembles due to their capability to tackle concept drift and attain remarkably accurate predictions. However, developing a precise and efficient ensemble for data stream mining poses a significant challenge, as state-of-the-art algorithms are often highly inefficient, consuming excessive memory and processing time. In this study, we propose a novel ensemble-based classification algorithm for data streams named Adaptive Random Tree Ensemble (ARTE). The algorithm explores approaches that promote high prediction accuracy using a random-sized feature subspace for each element of the ensemble, online bagging, random choice of the cut-point for splitting the trees, and a method of classifier selection for final ensemble voting. This study also presents analyses on the contribution of the choice of subspace size and the random cut-point for splitting the tree's nodes to the ensemble's diversity. Following an extensive experimental investigation, ARTE exhibited high predictive performance and outperformed state-of-the-art ensembles on data streams for real and synthetic datasets while requiring fewer computational resources.

外文关键词：

Data stream miningEnsemble learningConcept driftRandom subspacesCLASSIFIERSSELECTION

作者：

Paim, Aldo M.、Enembreck, Fabricio

展开 >

作者单位：

Pontificia Univ Catolica Parana PUCPR

出版年：

2025

DOI：

10.1016/j.knosys.2024.112830

Knowledge-based systems

SCI

ISSN：0950-7051

年,卷(期)：2025.309(Jan.30)

参考文献量44