Help us make Indico better by taking this survey! Aidez-nous à améliorer Indico en répondant à ce sondage !

19–25 Oct 2024
Europe/Zurich timezone

ATLAS usage of the Czech national HPC center: HyperQueue, cvmfsexec, and other news

TUE 09
22 Oct 2024, 15:18
57m
Exhibition Hall

Exhibition Hall

Poster Track 7 - Computing Infrastructure Poster session

Speaker

Michal Svatos (Czech Academy of Sciences (CZ))

Description

The distributed computing of the ATLAS experiment at the Large Hadron Collider (LHC) utilizes computing resources provided by the Czech national High Performance Computing (HPC) center, IT4Innovations. This is done through ARC-CEs deployed at the Czech Tier2 site, praguelcg2. Over the years, this system has undergone continuous evolution, marked by recent enhancements aimed at improving resource utilization efficiency.
One key enhancement involves the implementation of the HyperQueue meta-scheduler. It enables a division of whole-node jobs into several smaller, albeit longer, jobs, thereby enhancing CPU efficiency. Additionally, the integration of cvmfsexec enables access to the distributed CVMFS filesystem on compute nodes without requiring any special configurations, thereby substantially simplifying software distribution and broadening the range of tasks eligible for execution on the HPC. Another notable change was the migration of the batch system from PBSpro to Slurm.

Primary authors

Jiri Chudoba (Czech Academy of Sciences (CZ)) Michal Svatos (Czech Academy of Sciences (CZ)) Petr Vokac (Czech Technical University in Prague (CZ))

Presentation materials