14-18 October 2013
Amsterdam, Beurs van Berlage
Europe/Amsterdam timezone

System performance monitoring of the ALICE Data Acquisition System with Zabbix

15 Oct 2013, 14:10
20m
Veilingzaal (Amsterdam, Beurs van Berlage)

Veilingzaal

Amsterdam, Beurs van Berlage

Oral presentation to parallel session Facilities, Production Infrastructures, Networking and Collaborative Tools Facilities, Infrastructures, Networking and Collaborative Tools

Speaker

Adriana Telesca (CERN)

Description

ALICE (A Large Ion Collider Experiment) is a heavy-ion detector studying the physics of strongly interacting matter and the quark-gluon plasma at the CERN LHC (Large Hadron Collider). The ALICE Data-AcQuisition (DAQ) system handles the data flow from the sub-detector electronics to the permanent data storage in the CERN computing center. The DAQ farm consists of about 1000 devices of many different types ranging from direct accessible machines to storage arrays and custom optical links. The system performance monitoring tool used during the LHC run 1 will be replaced by a new tool for run 2. This presentation shows the results of an evaluation that has been conducted on six existing and publicly available monitoring tools. The evaluation has been carried out by taking into account selection criteria such as scalability, flexibility, reliability as well as data collection methods and display. All the tools have been prototyped and evaluated according to those criteria. We will describe the considerations that have brought to the selection of the Zabbix monitoring tool for the DAQ farm. The results of the tests conducted in the ALICE DAQ laboratory will be presented. In addition, the deployment of the software on the DAQ machines in terms of metrics collected and data collection methods will be described. We will illustrate how remote nodes are monitored with Zabbix by using SNMP-based agents and how DAQ specific metrics are retrieved and displayed. We will also show how the monitoring information is accessed and made available via the graphical user interface and how Zabbix communicates with the other DAQ online systems for notification and reporting.

Primary author

Co-authors

Alexandru Grigore (Polytechnic University of Bucharest (RO)) Mr Barthelemy Von Haller (CERN) Charles Delort (Ministere des affaires etrangeres et europeennes (FR)) Costin Ionita (CERN) Csaba Soos (CERN) Ervin Denes (Hungarian Academy of Sciences (HU)) Filippo Costa (CERN) Mr Franco Carena (CERN) Giuseppe Simonetti (Universita e INFN (IT)) Mr Pierre Vande Vyvre (CERN) Roberto Divia (CERN) Sylvain Chapeland (CERN) Ulrich Fuchs (CERN) Vasco Chibante Barroso (CERN) Wisla Carena (CERN)

Presentation Materials