9–13 Jul 2018
Sofia, Bulgaria
Europe/Sofia timezone

Operation of the ATLAS distributed computing

10 Jul 2018, 16:00
1h

National Culture Palace, Boulevard "Bulgaria", 1463 NDK, Sofia, Bulgaria
Poster Track 3 – Distributed computing Posters

Speaker

Ivan Glushkov (University of Texas at Arlington (US))

Description

We describe the central operation of the ATLAS distributed computing system. The majority of compute-intensive activities within ATLAS are carried out on some 350,000 CPU cores on the Grid, augmented by opportunistic use of significant HPC and volunteer resources. The increasing scale and challenging new payloads demand fine-tuning of operational procedures together with timely development of the production system. We describe several such developments, motivated directly by operational experience. Automatic detection of payload properties makes it possible to optimize inefficient task requests from both official production and users. User education, job shaping, or preventative throttling can then increase the overall throughput of the available resources.

Primary authors

Ivan Glushkov (University of Texas at Arlington (US))
Rodney Walker (Ludwig Maximilians Universitat (DE))
Federica Legger (Ludwig Maximilians Universitat (DE))
David Cameron (University of Oslo (NO))
Alessandro Di Girolamo (CERN)
Tadashi Maeno (Brookhaven National Laboratory (US))
Andrej Filipcic (Jozef Stefan Institute (SI))
Fernando Harald Barreiro Megino (University of Texas at Arlington)
