首页|Hybrid PDES Simulation of HPC Networks Using Zombie Packets

Hybrid PDES Simulation of HPC Networks Using Zombie Packets

扫码查看
Although high-fidelity network simulations have proven to be reliable and cost-effective tools to peer into architectural questions for high-performance computing (HPC) networks, they incur a high resource cost. The time spent in simulating a single millisecond of network traffic in the highest detail can take hours, even for static, well-behaved traffic patterns such as uniform random. Surrogate models offer a significant reduction in runtime, yet they cannot serve as complete replacements and should only be used when appropriate. Thus, there is a need for hybrid modeling, where high-fidelity simulation and surrogates run side-by-side. We present a surrogate model for HPC networks in which: packets bypass the network, while the network state is left untouched, i.e., suspended. To bypass the network, we use historical data to estimate the arrival time at which every packet should be scheduled at; to suspend the network, all in-flight packets are scheduled to arrive at their destinations, and are kept in the system to awaken as zombies when switching back to high-fidelity. Speedup for a hybrid model is relative to the proportion of surrogate to high-fidelity. This light-weight surrogate obtained up to 76× speedup. Keeping the zombies in the network showed an increase in the accuracy of the high-fidelity simulation on restart when compared to restarting the network from an empty state.

Parallel discrete-event simulationHPC networkssurrogate simulation

ELKIN CRUZ-CAMACHO、KEVIN BROWN、XIN WANG、XIONGXIAO XU、KAI SHU、ZHILING LAN、ROBERT B. ROSS、CHRISTOPHER CAROTHERS

展开 >

Computer Science,Rensselaer Polytechnic Institute,Troy,United States

Argonne National Laboratory,Lemont,United States

Computer Science,University of Illinois Chicago,Chicago,United States

Illinois Institute of Technology,Chicago,United States

University of Illinois Chicago,Chicago,United States

展开 >

2025

ACM Transactions on Modeling and Computer Simulation

ACM Transactions on Modeling and Computer Simulation

ISSN:1049-3301
年,卷(期):2025.35(2)
  • 24