Mar 21 – 27, 2009
The Service Level Status monitoring of the LHC Experiments Distributed Computing

Mar 26, 2009, 8:00 AM


Dr Alessandro Di Girolamo (CERN)


This contribution describes how part of the monitoring of the services used in the computing systems of the LHC experiments has been integrated with the Service Level Status (SLS) framework. The LHC experiments use an increasing number of complex and heterogeneous services. SLS makes it possible to group these services and to report their status and availability through a web-based display, which dynamically shows availability, basic information and statistics about the services, as well as their dependencies. The SLS framework was developed by the CERN-IT/FIO group and currently handles more than 350 services, including administrative applications, physics and infrastructure services, and Grid-related and experiment-specific services. SLS can produce different views for different end users. The parameter set of each service is highly customizable through a user-friendly XML format and can include subservices and various thresholds that generate alarms of increasing severity. Historical data is made available via the web. All information can also be retrieved through a programmatic interface and imported into other visualization tools, such as Gridmap. SLS is now used effectively to monitor the status of the LHC experiments' services.
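
As an illustration of the idea of an XML service description with subservices and severity thresholds, the following minimal sketch parses a small document and maps availability values to alarm levels. The element names, attribute names, level names and numeric limits are all hypothetical and chosen for this example only; they are not the actual SLS schema or its alarm logic.

```python
import xml.etree.ElementTree as ET

# Hypothetical service description. Element and attribute names are
# illustrative assumptions, NOT the real SLS XML format.
SERVICE_XML = """
<service id="example-experiment-service">
  <availability>72</availability>
  <thresholds>
    <threshold level="degraded" below="90"/>
    <threshold level="affected" below="75"/>
    <threshold level="unavailable" below="40"/>
  </thresholds>
  <subservices>
    <subservice id="example-db" availability="100"/>
    <subservice id="example-frontend" availability="35"/>
  </subservices>
</service>
"""


def classify(availability, thresholds):
    """Return the most severe level whose limit lies above the availability."""
    # Check the lowest limit first so the most severe matching level wins.
    for level, limit in sorted(thresholds, key=lambda t: t[1]):
        if availability < limit:
            return level
    return "available"


def load_service(xml_text):
    """Parse the description and classify the service and its subservices."""
    root = ET.fromstring(xml_text)
    thresholds = [
        (t.get("level"), float(t.get("below"))) for t in root.iter("threshold")
    ]
    # In this sketch the same thresholds are applied to every subservice.
    report = {
        root.get("id"): classify(float(root.findtext("availability")), thresholds)
    }
    for sub in root.iter("subservice"):
        report[sub.get("id")] = classify(float(sub.get("availability")), thresholds)
    return report


report = load_service(SERVICE_XML)
```

With the sample values above, the top-level service falls under the hypothetical "affected" limit, while the two subservices are classified independently. A real deployment would follow whatever schema and aggregation rules the monitoring framework defines.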
