首页|Efficient Dynamic Resource Management for Spatial Multitasking GPUs

Efficient Dynamic Resource Management for Spatial Multitasking GPUs

扫码查看
The advent of microservice architecture enables complex cloud applications to be realized via a set of individually isolated components, increasing their flexibility and performance. As these applications require massive computing resources, graphics processing units (GPUs) are being widely used as high-speed parallel computing devices to meet the stringent demands. Although current GPUs allow application components to be executed concurrently via spatial multitasking, they face several challenges. The first challenge is allocating the computing resources to components dynamically to maximize efficiency. The second challenge is avoiding performance degradation caused by the data transfer overhead between the components. To address these challenges, we propose an efficient GPU resource management technique that dynamically allocates GPU resources to application components. The proposed method allocates resources based on component workloads and uses online performance monitoring to guarantee the application's performance. We also propose a GPU memory manager to reduce the data transfer overhead between components via shared memory. Our evaluation results indicate that the proposed dynamic resource allocation method improves application throughput by up to 134.12% compared to the state-of-the-art spatial multitasking techniques. We also show that using a shared memory results in 6x throughput improvement compared to the baseline User Datagram Protocol (UDP)-based technique.

Graphics processing unitsMicroservice architecturesResource managementCloud computingMultitaskingData transferKernelDynamic schedulingThroughputMemory management

Hoda Sedighi、Daniel Gehberger、Amin Ebrahimzadeh、Fetahi Wuhib、Roch H. Glitho

展开 >

Concordia Institute of Information System Engineering, Concordia University, Montreal, QC, Canada

Ericsson Research, Ericsson, Montreal, QC, Canada

Concordia Institute of Information System Engineering, Concordia University, Montreal, QC, Canada|Computer Science Program, University of the Western Cape, Cape Town, South Africa

2025

IEEE transactions on cloud computing

IEEE transactions on cloud computing

ISSN:
年,卷(期):2025.13(1)
  • 63