A Small File Merging Scheme Based on Improved SVD++ Algorithm and K-means++ Algorithm
This paper proposes a small file merging scheme based on an improved SVD++ algorithm and the K-means++ algorithm. By introducing an adaptive learning rate function and parallel grouping based on the SVD++ algorithm, the file merging process is optimized to enhance the efficiency of storing small files in Hadoop. Additionally, the K-means++ algorithm is employed to cluster the merged files and optimize the data storage method, reducing wasted storage space. Experiments conducted on the Hadoop platform demonstrate that the proposed scheme significantly improves the performance of storing and processing small files while maintaining data processing accuracy and stability.
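As a point of reference for the clustering step, the following is a minimal sketch of the standard K-means++ seeding procedure, not the scheme's actual implementation. It assumes each merged file is described by a numeric feature vector; the function names `squared_distance` and `kmeans_pp_seeds` and the example features are hypothetical and chosen only for illustration.

```python
import random


def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans_pp_seeds(points, k, rng=None):
    """Pick k initial centers using standard K-means++ seeding.

    points: list of numeric tuples (hypothetical per-file feature vectors).
    rng:    optional random.Random instance, for reproducibility.
    """
    rng = rng or random.Random(0)
    if not 1 <= k <= len(points):
        raise ValueError("k must be between 1 and the number of points")

    # The first center is drawn uniformly at random.
    centers = [rng.choice(points)]

    while len(centers) < k:
        # Weight of each point: squared distance to its nearest chosen center.
        weights = [min(squared_distance(p, c) for c in centers) for p in points]
        total = sum(weights)
        if total == 0:
            # Every point coincides with an existing center; any choice works.
            centers.append(rng.choice(points))
            continue

        # Sample the next center with probability proportional to its weight.
        threshold = rng.uniform(0, total)
        cumulative = 0.0
        for point, weight in zip(points, weights):
            cumulative += weight
            if weight > 0 and cumulative >= threshold:
                centers.append(point)
                break
        else:
            # Guard against floating-point round-off: take the farthest point.
            centers.append(points[weights.index(max(weights))])

    return centers


# Hypothetical feature vectors for four merged files.
files = [(1.0, 1.0), (1.2, 0.9), (100.0, 100.0), (101.0, 99.0)]
seeds = kmeans_pp_seeds(files, 2, random.Random(42))
```

Because each new center is sampled in proportion to its squared distance from the centers already chosen, the seeds tend to be spread across the data, which is the property that distinguishes K-means++ from uniformly random initialization.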