Parallel optimization of FMM for cosmic N-body simulations on Sunway TaihuLight platform
Cosmological simulation is a typical N-body problem and one of the representative and challenging problems in high-performance computing. This paper ports the fast multipole method (FMM), the core component of the astronomical N-body simulation software PhotoNs-2, to the Sunway TaihuLight platform and optimizes its performance. To increase computational efficiency and reduce communication overhead, the FMM is optimized by reorganizing data, restructuring the computation of transcendental functions, and, for message passing interface (MPI) communication, designing double buffering and merging the sending tree, all based on the architectural characteristics of the SW26010 processor of Sunway TaihuLight. Compared with the version before optimization, the optimized PhotoNs-2 achieves a speedup of about 24 times in three different test cases. The proposed optimization scheme can serve as a reference for porting and optimizing other high-performance applications on the Sunway TaihuLight platform.
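As a rough illustration of what data reorganization can mean for an N-body kernel, the following minimal sketch converts particles from an array-of-structures layout to a structure-of-arrays layout and evaluates the direct particle-particle sum that FMM uses for near-field interactions. It is not taken from PhotoNs-2: the function names `to_soa` and `direct_accel`, the softening parameter `eps`, and the choice of units with G = 1 are all assumptions made for this example, and the abstract does not state which layout the authors actually adopted.

```python
import math


def to_soa(particles):
    """Convert a list of (x, y, z, m) tuples (array of structures)
    into four separate lists (structure of arrays).

    Hypothetical helper for illustration only.
    """
    xs = [p[0] for p in particles]
    ys = [p[1] for p in particles]
    zs = [p[2] for p in particles]
    ms = [p[3] for p in particles]
    return xs, ys, zs, ms


def direct_accel(xs, ys, zs, ms, eps=0.0):
    """Direct-summation gravitational acceleration with G = 1.

    eps is an assumed Plummer-style softening length; with eps = 0
    no two particles may share the same position.
    """
    n = len(xs)
    ax = [0.0] * n
    ay = [0.0] * n
    az = [0.0] * n
    for i in range(n):
        for j in range(n):
            if i == j:
                continue  # skip self-interaction
            dx = xs[j] - xs[i]
            dy = ys[j] - ys[i]
            dz = zs[j] - zs[i]
            r2 = dx * dx + dy * dy + dz * dz + eps * eps
            inv_r3 = 1.0 / (r2 * math.sqrt(r2))
            # Acceleration on i points toward j and scales with m_j / r^2.
            ax[i] += ms[j] * dx * inv_r3
            ay[i] += ms[j] * dy * inv_r3
            az[i] += ms[j] * dz * inv_r3
    return ax, ay, az


# Two unit masses one unit apart along the x axis.
particles = [(0.0, 0.0, 0.0, 1.0), (1.0, 0.0, 0.0, 1.0)]
xs, ys, zs, ms = to_soa(particles)
ax, ay, az = direct_accel(xs, ys, zs, ms)
```

With each coordinate stored contiguously, the inner loop reads every array sequentially, which is the access pattern that vector units and block transfers into small local memories generally favor.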