22–26 Sept 2008
Harbiye Askeri Museum
Europe/Zurich timezone

Session

Nagios for Site Monitoring Tutorial

24 Sept 2008, 11:00
Harbiye Askeri Museum

Harbiye Askeri Museum

Istanbul

Description

A monitoring system enables grid site administrators to track usage of site resources and receive alarms in case of failure of services. As such it is essential for achieving better availability and reliability of large scale grid infrastructure.

A site-level grid services monitoring prototype based on the Nagios fabric monitoring system was developed within the EGEE-II project. Development of the system is continued within the Operations Automation Team in the EGEE-III project. The prototype enables sites to receive instant notification in case of host and service failures, and provides them with results from global monitoring systems such as SAM and the ENOC DownCollector.

Main aim of this session is to give overview of the Nagios based site-level monitoring prototype and demonstrate installation on a live grid site. The first part consists of presentations describing general Nagios monitoring framework and specific components of developed site-level monitoring prototype. In the second part practical installation of site-level monitoring prototype will be demonstrated.

Presentation materials

There are no materials yet.
Building timetable...