Non-Uniform Memory Access Optimization on Multi-GPU Systems:Research Progress and Prospect
Due to the slowdown of transistor scaling,it has become increasingly difficult to enhance the performance of a single GPU(Graphics Processing Units).Therefore,multi-GPU systems have become the main means to improve the performance of GPU systems.However,due to the constraints of off-chip physical design,the bandwidth imbalance be-tween processors in multi-GPU systems leads to non-uniform memory access(NUMA)problems,which seriously affects the performance of multi-GPU systems.In order to reduce the performance loss caused by non-uniform memory access,this paper first analyzes the causes of non-uniform memory access and compares existing solutions for non-uniform memory ac-cess.For non-uniform memory access with different dimensions,this paper summarizes optimization solutions for non-uni-form memory access from two directions:reducing remote access traffic and improving remote access performance.Finally,combining the advantages and disadvantages of these solutions,this paper proposes the future development direction of non-uniform memory access optimization for multi-GPU systems.