Speaker
Description
AbstractMonitoring and improving the sustainability of large-scale computing infrastructures has become an increasingly important challenge in High Energy Physics. This work presents the design and implementation of a sustainability-oriented monitoring dashboard for an ATLAS Tier 2 computing centre. The dashboard integrates global site-level metrics and proposes a set of job-level metrics aimed at evaluating both computational efficiency and environmental impact. The dashboard is developed and deployed at the IFIC Tier 2 centre, where detailed operational and energy-related data are analysed to explore how sustainability indicators can be derived and linked to computing workloads. We present our approach to monitor per-job energy consumption, which is not trivial since computing nodes usually run multiple workloads and energy measurements are generally available only at the system level. We also explore ways to make the proposed model usable at other ATLAS Tier 2 sites, studying possible methods to access the necessary data for its adoption. The results provide a foundation for sustainability-aware monitoring within the ATLAS distributed computing infrastructure.