Speaker
Mr
Wojciech Lapka
(Unknown)
Description
Since 2005 Worldwide LHC Computing Grid (WLCG) services have been monitored by the Service Availability Monitoring (SAM) system which has been the main source of information for the monthly WLCG availability and reliability calculations.
During this time SAM framework gained popularity amongst site and service managers and was very useful in building robust grid infrastructure.
Experience with this monitoring tool as well as preparation to the evolution of the European grid infrastructure from EGEE to national grid initiatives (NGI) led to design of the enhanced and distributed model for monitoring grid services. Nagios has been adopted as a monitoring framework and messaging technology (ActiveMq) has been chosen as a transport mechanism.
This talk covers the architecture of the new system.
Author
Mr
Wojciech Lapka
(Unknown)