计算机工程与科学2024,Vol.46Issue(2) :282-291.DOI:10.3969/j.issn.1007-130X.2024.02.011

融合特征权重与改进粒子群优化的特征选择算法

Feature selection algorithm based on feature weights and improved particle swarm optimization

刘振超 苑迎春 王克俭 何晨
计算机工程与科学2024,Vol.46Issue(2) :282-291.DOI:10.3969/j.issn.1007-130X.2024.02.011

融合特征权重与改进粒子群优化的特征选择算法

Feature selection algorithm based on feature weights and improved particle swarm optimization

刘振超 1苑迎春 2王克俭 2何晨1
扫码查看

作者信息

  • 1. 河北农业大学信息科学与技术学院,河北 保定 071000
  • 2. 河北农业大学信息科学与技术学院,河北 保定 071000;河北农业大学河北省农业大数据重点实验室,河北 保定 071000
  • 折叠

摘要

随着教育信息化的发展,教育数据呈现特征数量高、冗余度高等特点,这使目前的分类算法在教育数据上分类准确率不理想.提出一种将特征权重算法与改进粒子群优化算法融合的混合式特征选择算法(RF-ATPSO).该算法首先使用RELIEF-F算法计算各个特征的权重,筛除冗余特征,然后在筛选后的特征集合中利用改进粒子群算法搜索最优特征子集.实验结果表明,在6个UCI公共数据集上,经RF-ATPSO算法进行特征选择后,平均准确率提升了10.04%,且平均特征子集规模最小、收敛速度最快;在学生学业成绩画像特征数据集上,该算法以较小的特征子集规模达到较高的分类准确率,平均准确率为94.77%,明显优于其它特征选择算法,实验充分证明了该算法具有实际应用意义.

Abstract

With the development of educational informatization,educational data presents character-istics such as high feature counts and high redundancy,resulting in the classification accuracy of current classification algorithms not being ideal on educational data.Therefore,this paper proposes a hybrid feature selection algorithm(RF-ATPSO)that integrates feature weighting algorithm with improved par-ticle swarm optimization algorithm.The algorithm first uses the RELIEF-F algorithm to calculate the weights of each feature,removes redundant features,and then uses the improved particle swarm optimi-zation algorithm to search for the optimal feature subset in the filtered feature set.Experimental results show that on 6 UCI public datasets,after feature selection using the RF-ATPSO algorithm,the average accuracy is improved by 10.04%,and the average feature subset size is the smallest and the convergence speed is the fastest.In the student academic performance portrait feature dataset,the algorithm achieves high classification accuracy with a smaller feature subset size,with an average accuracy of 94.77%,which is significantly better than other feature selection algorithms.The experiment fully demonstrates the practical application significance of this algorithm.

关键词

特征选择/特征权重/改进粒子群优化/T-分布

Key words

feature selection/feature weight/improved PSO/T-distribution

引用本文复制引用

基金项目

河北省高等教育教学改革研究与实践项目(2020GJJG076)

出版年

2024
计算机工程与科学
国防科学技术大学计算机学院

计算机工程与科学

CSTPCD北大核心
影响因子:0.787
ISSN:1007-130X
参考文献量10
段落导航相关论文