An Accelerator for SVR Algorithms Based on Spatial Projection and Clustering Partitioning
Data not only generates value,but also provides the impetus for the scientific development of statistics.With the rapid development of science and technology,massive data has emerged,but the large-scale data makes it difficult for many traditional processing methods to meet the needs of data analysis in various fields.Facing the inefficiency of learning algorithms in the era of massive data,partitioning is usually considered as the most direct and widely used strategy to solve this problem.SVR is a powerful regression algorithm with wide applications in the fields of pattern recognition and data mining.However,SVR is inefficient in training when dealing with large-scale data.For this reason,we propose a SVR acceleration algorithm based on spatial projection and clustering division(PKM-SVR)by utilizing the idea of partitioning.The projection vector is used to project the data into a two-dimensional space;the clustering method is used to divide the data space into k disjoint regions;the SVR model is trained on each region;and the SVR model in each region is used to predict the to-be-recognized samples that fall into the same region.Comparison experiments are conducted with the traditional data partitioning method on standard datasets,and the experimental results show that the proposed algorithm is faster to train and exhibits better prediction performance.
large-scale datadivide and rule methodsupport vector regressionprincipal components analysisclustering