CHEP 2018 Conference, Sofia, Bulgaria

Name: CHEP 2018 Conference, Sofia, Bulgaria
Start: 2018-07-09T08:00:00+03:00
End: 2018-07-13T13:00:00+03:00
Location: Sofia, Bulgaria

9–13 Jul 2018

Sofia, Bulgaria

Europe/Sofia timezone

Contact us

Integrated automation for configuration management and operations in the ATLAS online computing farm

10 Jul 2018, 16:00

Sofia, Bulgaria

National Culture Palace, Boulevard "Bulgaria", 1463 NDK, Sofia, Bulgaria

Poster Track 8 – Networks and facilities Posters

Arturo Sanchez Pineda (Abdus Salam Int. Cent. Theor. Phys. (IT))

The online farm of the ATLAS experiment at the LHC, consisting of
nearly 4000 PCs with various characteristics, provides configuration
and control of the detector and performs the collection, processing,
selection, and conveyance of event data from the front-end electronics
to mass storage.

Different aspects of the farm management are already accessible via
several tools. The status and health of each host are monitored by a
system based on Icinga2 and Ganglia. PuppetDB gathers centrally all
the status information from Puppet, the configuration management tool
used to ensure configuration consistency of every host. The in-house
Configuration Database controls DHCP and PXE, integrating also
external information sources.

In this paper we present our roadmap for integrating these and other
data sources and systems, and building a higher level of abstraction
on top of this foundation. An automation and orchestration tool will
be able to use these systems and replace lengthy manual procedures,
some of which also require interactions with other systems and teams,
e.g. for the repair of a faulty host. Finally, an inventory and
tracking system will complement the available data sources, keep track
of host history, and improve the evaluation of long-term lifecycle
management and purchase strategies.

Arturo Sanchez Pineda (Abdus Salam Int. Cent. Theor. Phys. (IT)) Artem Amirkhanov (Budker Institute of Nuclear Physics (RU)) Sergio Ballestrero Franco Brasolin (Universita e INFN, Bologna (IT)) Chris Lee (University of Cape Town (ZA)) Mr Haydn du Plessis (University of Johannesburg (ZA)) Konstantinos Mitrogeorgos (Aristotle University of Thessaloniki (GR)) Marco Pernigotti (CERN) Diana Scannicchio (University of California Irvine (US)) Matthew Shaun Twomey (University of Washington (US))

ATL-COM-DAQ-2018-098.pdf

CHEP 2018 Conference, Sofia, Bulgaria

Contact us

Integrated automation for configuration management and operations in the ATLAS online computing farm

Sofia, Bulgaria

Speaker

Description

Authors

Presentation materials