山东大学学报(理学版)2024,Vol.59Issue(12) :130-140.DOI:10.6040/j.issn.1671-9352.7.2023.5296

一种新颖的无监督特征选择方法

A novel unsupervised feature selection method

汪廷华 胡振威 占宏祥
山东大学学报(理学版)2024,Vol.59Issue(12) :130-140.DOI:10.6040/j.issn.1671-9352.7.2023.5296

一种新颖的无监督特征选择方法

A novel unsupervised feature selection method

汪廷华 1胡振威 1占宏祥1
扫码查看

作者信息

  • 1. 赣南师范大学数学与计算机科学学院,江西赣州 341000
  • 折叠

摘要

大多数基于HSIC的特征选择方法都受到以下限制.首先,这些方法通常只适用于有标记的数据,这是不够的,因为现实世界应用中的大多数数据都是未标记的.其次,现有的基于HSIC的无监督特征选择方法只解决了所选特征与表达底层聚类结构的输出值之间的一般相关性,而忽略了不同特征之间的冗余.为了解决这些问题,提出了一种新的基于HSIC的无监督特征选择方法(UFSHSIC),该方法使用HSIC作为相关性准则来探索特征与总体样本结构之间的相关性及特征与特征之间的冗余度.与其它经典特征选择学习方法在多个真实数据集上的实验对比表明,该方法可以有效从无标签样本中进行特征选择,且选择的特征子集相比有监督特征选择方法而言能产生类似或更好的性能.

Abstract

Most HSIC-based feature selection methods are subject to the following limitations.First,these methods are typically only applied to labeled data,which is not feasible since most of the data in real-world applications is unlabeled.Second,existing HSIC-based unsupervised feature selection methods only address the general correlation between the selected features and the output values representing the underlying clustering structure,while ignoring the redundancy between different features.To address these issues,a new unsupervised feature selection method based on HSIC(UFSHSIC)is proposed,which utilizes HSIC as a correlation criterion to explore the correlation between features and the overall sample structure,as well as the redundancy between features.Experimental comparison with other classical feature selection methods on multiple real datasets shows that the proposed method can effectively perform feature selection from unlabeled samples,and the selected feature subset produces equivalent or better performance than supervised feature selection methods.

关键词

无监督特征选择/希尔伯特-施密特独立性准则/核方法/机器学习/特征冗余

Key words

unsupervised feature selection/Hilbert-Schmidt independence criterion(HSIC)/kernel method/machine learning/feature redundancy

引用本文复制引用

出版年

2024
山东大学学报(理学版)
山东大学

山东大学学报(理学版)

CSTPCDCSCD北大核心
影响因子:0.437
ISSN:1671-9352
段落导航相关论文