Help us make Indico better by taking this survey! Aidez-nous à améliorer Indico en répondant à ce sondage !

9–13 Jul 2018
Sofia, Bulgaria
Europe/Sofia timezone

BigPanDA Workflow Management on Titan for HENP and extreme scale applications

9 Jul 2018, 11:45
15m
Hall 7 (National Palace of Culture)

Hall 7

National Palace of Culture

presentation Track 3 – Distributed computing T3 - Distributed computing

Speaker

Alexei Klimentov (Brookhaven National Laboratory (US))

Description

The Titan supercomputer at Oak Ridge National Laboratory prioritizes the scheduling of large leadership class jobs, but even when the supercomputer is fully loaded and large jobs are standing in the queue to run, 10 percent of the machine remains available for a mix of smaller jobs, essentially ‘filling in the cracks’ between the very large jobs. Such utilisation of the computer resources is called “backfill”.
Smaller scientific groups and data science experiments are able to conduct their computations on Titan in backfill mode. For example, simulations in high-energy physics don’t require large MPI scale scale jobs on a supercomputer. As a universal jobs scheduler to optimize Titan utilization it was successfully used to schedule and to submit ATLAS MC simulation jobs earlier. The current R&D project is aimed to enabling the convergence of HPC and HTC paradigms for a range of application types and communities in particle physics and beyond.
In March 2017, we implemented a new PanDA server instance within ORNL operating under Red Hat OpenShift Origin - a powerful container cluster management and orchestration system in order to serve various experiments at Titan supercomputer. We implemented a set of demonstrations serving diverse scientific workflows including LQCD and IceCube, biology studies of the genes and human brain, and molecular dynamics studies.

Primary authors

Dr Ruslan Mashinistov (University of Texas at Arlington (US)) Kaushik De (University of Texas at Arlington (US)) Jason C. Kincl (Oak Ridge National Laboratory (US)) Alexei Klimentov (Brookhaven National Laboratory (US)) Pavlo Svirin (Brookhaven National Laboratory (US)) Dr Jack C. Wells (Oak Ridge National Laboratory (US))

Presentation materials