中国科学:信息科学(英文版)2024,Vol.67Issue(3) :44-57.DOI:10.1007/s11432-023-3771-6

Multi-instance partial-label learning:towards exploiting dual inexact supervision

Wei TANG Weijia ZHANG Min-Ling ZHANG
中国科学:信息科学(英文版)2024,Vol.67Issue(3) :44-57.DOI:10.1007/s11432-023-3771-6

Multi-instance partial-label learning:towards exploiting dual inexact supervision

Wei TANG 1Weijia ZHANG 2Min-Ling ZHANG1
扫码查看

作者信息

  • 1. School of Computer Science and Engineering,Southeast University,Nanjing 210096,China;Key Laboratory of Computer Network and Information Integration(Southeast University),Ministry of Education,Nanjing 211189,China
  • 2. School of Information and Physical Sciences,The University of Newcastle,Callaghan NSW 2308,Australia
  • 折叠

Abstract

Weakly supervised machine learning algorithms are able to learn from ambiguous samples or labels,e.g.,multi-instance learning or partial-label learning.However,in some real-world tasks,each training sample is associated with not only multiple instances but also a candidate label set that contains one ground-truth label and some false positive labels.Specifically,at least one instance pertains to the ground-truth label while no instance belongs to the false positive labels.In this paper,we formalize such problems as multi-instance partial-label learning(MIPL).Existing multi-instance learning algorithms and partial-label learning algorithms are suboptimal for solving MIPL problems since the former fails to disambiguate a candidate label set,and the latter cannot handle a multi-instance bag.To address these issues,a tailored algorithm named MIPLGP,i.e.,multi-instance partial-label learning with Gaussian processes,is proposed.MIPLGP first assigns each instance with a candidate label set in an augmented label space,then transforms the candidate label set into a logarithmic space to yield the disambiguated and continuous labels via an exclusive disambiguation strategy,and last induces a model based on the Gaussian processes.Experimental results on various datasets validate that MIPLGP is superior to well-established multi-instance learning and partial-label learning algorithms for solving MIPL problems.

Key words

machine learning/multi-instance partial-label learning/multi-instance learning/partial-label learning/Gaussian processes

引用本文复制引用

基金项目

国家自然科学基金(62225602)

国家自然科学基金(62206047)

出版年

2024
中国科学:信息科学(英文版)
中国科学院

中国科学:信息科学(英文版)

CSTPCDEI
影响因子:0.715
ISSN:1674-733X
参考文献量46
段落导航相关论文