9–13 Jul 2018
Sofia, Bulgaria
Europe/Sofia timezone

Modeling Allocation Utilization Strategies on Supercomputers

10 Jul 2018, 16:00
1h
Sofia, Bulgaria

Sofia, Bulgaria

National Culture Palace, Boulevard "Bulgaria", 1463 NDK, Sofia, Bulgaria
Poster Track 3 – Distributed computing Posters

Speakers

Alexey Poyda (National Research Centre Kurchatov Institute (RU)) Mikhail Titov (National Research Centre Kurchatov Institute (RU))

Description

Most supercomputers provide computing resources that are shared between users and projects, with utilization determined by predefined policies, load and quotas. The efficiency of the utilization of resources in terms of user/project depends on factors such as particular supercomputer policy and dynamic workload of supercomputer based on users' activities. The load on a resource is characterized by the number and parameters of jobs: the number of required nodes, required execution time (walltime), and jobs generation rate.
In this work we identify execution strategies geared towards the goal the maximizing the probability of utilization of allocated resources on a supercomputer. The execution strategies consist of find the optimal set of essential job parameters: number, size, length, rate. A simplified model for utilization of allocation time and a simulator based on queueing theory (with corresponding supercomputer Titan’s requirements) were designed, the model was tested on both synthetic and real log data over many months of Titan’s real work, identified strategies were compared with other possible strategies.
Experiments conducted using the simulator, showed that in most cases identified strategies increase the probability of utilizing allocation faster than a random choice of job processing parameters. We also find that the accuracy of the model will be higher if the amount of resources for utilization is larger, analyzed time intervals are longer and supercomputer’s state is steadier over these intervals.

Primary authors

Alexey Poyda (National Research Centre Kurchatov Institute (RU)) Mikhail Titov (National Research Centre Kurchatov Institute (RU)) Alexei Klimentov (Brookhaven National Laboratory (US)) Dr Jack C. Wells (Oak Ridge National Laboratory (US)) Sarp Oral (Oak Ridge National Laboratory (US)) Kaushik De (University of Texas at Arlington (US)) Danila Oleynik (University of Texas at Arlington (US)) Shantenu Jha (Rutgers University (US))

Presentation materials