Multi-instance partial-label learning:towards exploiting dual inexact supervision

Wei TANG ¹Weijia ZHANG ²Min-Ling ZHANG¹

扫码查看

作者信息

1. School of Computer Science and Engineering,Southeast University,Nanjing 210096,China;Key Laboratory of Computer Network and Information Integration(Southeast University),Ministry of Education,Nanjing 211189,China
2. School of Information and Physical Sciences,The University of Newcastle,Callaghan NSW 2308,Australia
折叠

Abstract

Weakly supervised machine learning algorithms are able to learn from ambiguous samples or labels,e.g.,multi-instance learning or partial-label learning.However,in some real-world tasks,each training sample is associated with not only multiple instances but also a candidate label set that contains one ground-truth label and some false positive labels.Specifically,at least one instance pertains to the ground-truth label while no instance belongs to the false positive labels.In this paper,we formalize such problems as multi-instance partial-label learning(MIPL).Existing multi-instance learning algorithms and partial-label learning algorithms are suboptimal for solving MIPL problems since the former fails to disambiguate a candidate label set,and the latter cannot handle a multi-instance bag.To address these issues,a tailored algorithm named MIPLGP,i.e.,multi-instance partial-label learning with Gaussian processes,is proposed.MIPLGP first assigns each instance with a candidate label set in an augmented label space,then transforms the candidate label set into a logarithmic space to yield the disambiguated and continuous labels via an exclusive disambiguation strategy,and last induces a model based on the Gaussian processes.Experimental results on various datasets validate that MIPLGP is superior to well-established multi-instance learning and partial-label learning algorithms for solving MIPL problems.

Key words

machine learning/multi-instance partial-label learning/multi-instance learning/partial-label learning/Gaussian processes

引用本文复制引用

基金项目

国家自然科学基金(62225602)

国家自然科学基金(62206047)

出版年

2024

中国科学:信息科学(英文版)

中国科学院

中国科学:信息科学(英文版)

CSTPCDEI

影响因子：0.715

ISSN：1674-733X

参考文献量46

段落导航