14-18 October 2013
Amsterdam, Beurs van Berlage
Europe/Amsterdam timezone

ATLAS Distributed Computing Monitoring tools during the LHC Run I

14 Oct 2013, 15:00
45m
Grote zaal (Amsterdam, Beurs van Berlage)

Grote zaal

Amsterdam, Beurs van Berlage

Poster presentation Distributed Processing and Data Handling A: Infrastructure, Sites, and Virtualization Poster presentations

Speaker

Jaroslava Schovancova (Brookhaven National Laboratory (US))

Description

The ATLAS Distributed Computing (ADC) Monitoring targets three groups of customers: ADC Operations, ATLAS Management, and ATLAS sites and ATLAS funding agencies. The main need of ADC Operations is to identify malfunctions early and then escalate issues to an activity or a service expert. The ATLAS Management use visualisation of long-term trends and accounting information about the ATLAS Distributed Computing resources. The ATLAS sites and the ATLAS funding agencies utilize both real-time monitoring and long-term measurement of the performance of the provided computing resources. During the LHC Run I a significant development effort has been invested in standardization of the monitoring and accounting applications in order to provide an extensive monitoring and accounting suite. ADC Monitoring applications separate the data layer and the visualisation layer. The data layer exposes data in a predefined format. The visualisation layer is designed bearing in mind visual identity of the provided graphical elements, and reusability of the visualisation elements across the different tools. A rich family of filtering and searching options enhancing available user interfaces comes naturally with the data and visualisation layer separation. With a variety of reliable monitoring data accessible through standardized interfaces, the possibility of automating actions under well defined conditions, correlating multiple data sources, has become feasible. In this contribution we also discuss the automated exclusion of degraded resources and their automated recovery in different activities.

Primary author

Jaroslava Schovancova (Brookhaven National Laboratory (US))

Co-authors

Alessandro Di Girolamo (CERN) I Ueda (University of Tokyo (JP)) Simone Campana (CERN) Stephane Jezequel (Centre National de la Recherche Scientifique (FR)) Dr Torre Wenaus (Brookhaven National Laboratory (US))

Presentation Materials