The classification of phage virion proteins is an active research topic in bioinformatics. To address the feature-independence assumption of naive Bayes classification and the problem of extracting features from viral proteins, this paper proposes a hybrid feature extraction method that combines pseudo amino acid composition (PAAC) with the composition of k-spaced amino acid pairs (CKSAAP), and applies a principal component analysis naive Bayes classification model (PNBC) to phage virion protein classification. Empirical analysis shows that, compared with the naive Bayes and support vector machine models, the PNBC model achieves the highest classification accuracy, 80%.
Key words
principal component analysis/naive Bayes/phage/protein classification
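
The CKSAAP descriptor named in the abstract can be illustrated with a minimal sketch. The function name `cksaap`, the default `k_max=2`, and the choice to normalize by the number of valid residue pairs are assumptions made for illustration; the abstract does not state the paper's actual settings. The PAAC half of the hybrid feature vector is not shown.

```python
from itertools import product

# The 20 standard amino acids (one-letter codes) and all 400 ordered pairs.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
PAIRS = ["".join(p) for p in product(AMINO_ACIDS, repeat=2)]


def cksaap(sequence, k_max=2):
    """Return the CKSAAP feature vector of a protein sequence.

    For each gap k = 0..k_max, count every residue pair separated by
    k positions and divide by the number of valid pairs at that gap.
    The result has 400 * (k_max + 1) entries.
    k_max=2 is an illustrative default, not a value taken from the paper.
    """
    seq = sequence.upper()
    features = []
    for k in range(k_max + 1):
        counts = dict.fromkeys(PAIRS, 0)
        total = 0
        for i in range(len(seq) - k - 1):
            pair = seq[i] + seq[i + k + 1]
            if pair in counts:  # skip pairs with non-standard residues
                counts[pair] += 1
                total += 1
        # Assumption: frequencies are normalized per gap; an empty gap yields zeros.
        features.extend(counts[p] / total if total else 0.0 for p in PAIRS)
    return features
```

Under the pipeline described in the abstract, a vector like this would be concatenated with the PAAC features, reduced with principal component analysis to weaken correlations between features, and then passed to a naive Bayes classifier.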