4–8 Nov 2019
Adelaide Convention Centre
Australia/Adelaide timezone

Implementation of ATLAS Distributed Computing monitoring dashboards using InfluxDB and Grafana

5 Nov 2019, 14:30
15m
Riverbank R3 (Adelaide Convention Centre)

Riverbank R3

Adelaide Convention Centre

Oral Track 3 – Middleware and Distributed Computing Track 3 – Middleware and Distributed Computing

Speaker

Thomas Beermann (University of Innsbruck (AT))

Description

For the last 10 years, the ATLAS Distributed Computing project has based its monitoring infrastructure on a set of custom designed dashboards provided by CERN-IT. This system functioned very well for LHC Runs 1 and 2, but its maintenance has progressively become more difficult and the conditions for Run 3, starting in 2021, will be even more demanding; hence a more standard code base and more automatic operations are needed. A new infrastructure has been provided by the CERN-IT Monit group, based on InfluxDB as the data store and Grafana as the display environment. ATLAS has adapted and further developed its monitoring tools to use this infrastructure for data and workflow management monitoring and accounting dashboards, expanding the range of previous possibilities with the aim of achieving a single, simpler, environment for all monitoring applications. This presentation will describe the tools used, the data flows for monitoring and accounting, the problems encountered and the solutions found.

Consider for promotion No

Primary authors

Dario Barberis (Università e INFN Genova (IT)) Aleksandr Alekseev (National Research Tomsk Polytechnic University (RU)) Thomas Beermann (University of Innsbruck (AT)) Sabine Crepe-Renaudin (LPSC-Grenoble, CNRS/UGA (FR)) Johannes Elmsheuser (Brookhaven National Laboratory (US)) Ivan Glushkov (University of Texas at Arlington (US)) Michal Svatos (Acad. of Sciences of the Czech Rep. (CZ)) Armen Vartapetian (University of Texas at Arlington (US)) Petr Vokac (Czech Technical University) Helmut Wolters (LIP Coimbra, Portugal)

Presentation materials