首页|A Unified Framework for Bandit Online Multiclass Prediction

A Unified Framework for Bandit Online Multiclass Prediction

扫码查看
Bandit online multiclass prediction plays an important role in many real-world applications. In this paper, we propose a unified Bandit Online Multiclass Prediction (BOMP) framework. This framework is based on our proposed margin-based gradient descent approach. Its update step provides an unbiased estimate of the surrogate loss gradient and has a lower variance than existing methods. It also enables our algorithms to update even for incorrect predictions by penalizing the wrong classes. The link function of the framework can evolve over time, gradually incorporating online data information including second-order information into the potential functions. Based on the proposed framework, we investigate first-order and second-order bandit online multiclass prediction algorithms. Theoretical analysis demonstrates the superiority of our proposed update rule and bandit online multiclass prediction framework. Finally, we compare our proposed first-order and second-order bandit online multiclass prediction algorithms with several state-of-the-art methods on two synthetic and four real-world datasets. The encouraging results show that our proposed algorithms significantly outperform state-of-the-art techniques.

Prediction algorithmsFastenersHeuristic algorithmsClassification algorithmsMachine learning algorithmsLogisticsVectors

Wanjin Feng、Xingyu Gao、Peilin Zhao、Steven C.H. Hoi

展开 >

Institute of Microelectronics, Chinese Academy of Sciences, Beijing, China|University of Chinese Academy of Sciences, Beijing, China

Tencent AI Lab, Shenzhen, China

School of Computing and Information Systems, Singapore Management University, Singapore

2025

IEEE transactions on knowledge and data engineering

IEEE transactions on knowledge and data engineering

ISSN:
年,卷(期):2025.37(5)
  • 63