南京大学学报(自然科学版)2024,Vol.60Issue(3) :511-522.DOI:10.13232/j.cnki.jnju.2024.03.014

基于模糊邻域熵的离群点检测方法

Fuzzy neighborhood entropy-based outlier detection

刘佳莉 陈锦坤
南京大学学报(自然科学版)2024,Vol.60Issue(3) :511-522.DOI:10.13232/j.cnki.jnju.2024.03.014

基于模糊邻域熵的离群点检测方法

Fuzzy neighborhood entropy-based outlier detection

刘佳莉 1陈锦坤2
扫码查看

作者信息

  • 1. 闽南师范大学数学与统计学院,漳州,363000
  • 2. 闽南师范大学数学与统计学院,漳州,363000;闽南师范大学福建省粒计算及其应用重点实验室,漳州,363000
  • 折叠

摘要

离群点检测(又称异常点检测)是数据挖掘领域中一个重要的研究方向,其目的是找出显著区别于其他数据的数据点.针对基于传统粗糙集理论的离群点检测方法存在忽略样本的模糊性和邻域关系等问题,利用模糊邻域粗糙集弥补经典粗糙集的不足,并结合熵的不确定性,提出一种新的基于模糊邻域熵的离群点检测方法.首先,采用模糊邻域半径和混合模糊相似度构造模糊邻域近似空间;然后,定义一种特定的模糊邻域组合熵和相对模糊邻域组合熵来构建模糊邻域离群度,进而定义基于模糊邻域熵的离群因子实现离群点检测,并设计了基于模糊邻域熵的离群点检测算法(FNEOD).最后,将FNEOD算法与主要的离群点检测算法进行比较.实验结果表明,该方法具有较好的有效性和适应性.

Abstract

Outlier detection(also known as anomaly detection)is an important research direction in the field of data mining,with the aim of identifying data points that are significantly different.In response to the problem of neglecting the fuzziness and neighborhood relationships of samples in outlier detection methods based on traditional rough set theory,this paper uses fuzzy neighborhood rough sets to compensate for the shortcomings of classical rough sets,and combines the uncertainty of entropy to propose a novel outlier detection method based on fuzzy neighborhood entropy.Firstly,a fuzzy neighborhood approximation space is constructed using fuzzy neighborhood radius and mixed fuzzy similarity.Then,a specific fuzzy neighborhood combination entropy and a relative fuzzy neighborhood combination entropy are defined to construct fuzzy neighborhood outliers.Furthermore,an outlier detection algorithm based on fuzzy neighborhood entropy(FNEOD)was designed by combining the outlier factor based on fuzzy neighborhood combination entropy.Finally,the FNEOD algorithm is compared with the main outlier detection algorithms.The experimental results show that this method has good effectiveness and adaptability.

关键词

数据挖掘/离群点检测/模糊邻域组合熵/相对模糊邻域组合熵

Key words

data mining/outlier detection/fuzzy neighborhood combination entropy/relative fuzzy neighborhood combination entropy

引用本文复制引用

基金项目

国家自然科学基金(62076116)

国家自然科学基金(62076088)

福建省自然科学基金(2020J01792)

福建省自然科学基金(2021J02049)

出版年

2024
南京大学学报(自然科学版)
南京大学

南京大学学报(自然科学版)

CSTPCDCSCD北大核心
影响因子:0.756
ISSN:0469-5097
参考文献量32
段落导航相关论文