首页|可减轻腰椎间盘样本集类重叠的采样算法

可减轻腰椎间盘样本集类重叠的采样算法

扫码查看
医学数据的类重叠问题会严重影响疾病的智能诊断效果.为了减轻腰椎间盘样本的类重叠对分类器产生的不良影响,提出了一种可减轻类重叠的混合采样算法——CO_HS算法.该算法将训练样本划分为核心样本、边界样本和噪声样本,对重叠区域的样本进行采样,以减轻样本集的类重叠程度.采用CO_HS算法产生的新训练样本集训练RF等分类模型,并建立了6种新的腰椎间盘退变分类器.实验结果显示,建立的新分类器在多项性能指标上均实现了显著提升,其中准确度提升了7.8百分点~12.7百分点,kappa系数提升了11.6百分点~20.2百分点,敏感性提升了7.9百分点~16.8百分点,特异性提升了9.0百分点~18.2百分点,F指标提升了9.4百分点~18.4百分点.因此,CO_HS算法被证明是一种能有效解决样本类重叠问题、改善分类性能的高效方法.
Sampling Algorithm for Reducing Class Overlap in Lumbar Disc Samples
The class overlap problem in medical data can severely affect the performance of intelligent disease diagnosis.To mitigate the negative impact of class overlap in lumbar disc samples on classifiers,this paper proposes a CO_HS algorithm,a hybrid sampling algorithm to reduce class overlap.This algorithm divides the training samples into core samples,boundary samples,and noise samples,sampling from the overlapping region to reduce the degree of class overlap in the dataset.New training samples generated by the CO_HS algorithm are used to train classification models such as Random Forest(RF),resulting in the establishment of six new classifiers for lumbar disc degeneration.Experimental results indicate that the newly established classifiers show significant improvement across multiple performance metrics.Specifically,the accuracy has increased by 7.8 percentage points to 12.7 percentage points,the kappa coefficient has increased by 11.6 percentage points to 20.2 percentage points,sensitivity has been improved by 7.9 percentage points to 16.8 percentage points,specificity has been elevated by 9.0 percentage points to 18.2 percentage points,and the F-measure has been boosted by 9.4 percentage points to 18.4 percentage points.Therefore,the CO_HS algorithm is proven to be an effective method for addressing the class overlap issue and improving classification performance.

intelligent medicineclass overlaphybrid samplinglumbar disc degeneration

赵鑫鑫、吴晓锋

展开 >

闽南师范大学数学与统计学院,福建 漳州 363000

泉州师范学院数学与计算机科学学院,福建 泉州 362000

智能医学 类重叠 混合采样 腰椎间盘退变

2025

软件工程
东北大学 大连东软信息学院

软件工程

影响因子:0.527
ISSN:2096-1472
年,卷(期):2025.28(1)