27 September 2004 to 1 October 2004
Interlaken, Switzerland
Europe/Zurich timezone

SAMGrid Monitoring Service and its Integration with MonALisa

29 Sep 2004, 10:00
1h
Coffee (Interlaken, Switzerland)

Board: 38
poster Track 4 - Distributed Computing Services Poster Session 2

Speaker

A. Lyon (FERMI NATIONAL ACCELERATOR LABORATORY)

Description

The SAMGrid team is implementing a monitoring and information service that fulfills several important roles in the operation of the SAMGrid system and will replace the first generation of monitoring tools in current deployments. The first-generation tools are generally based on text log files and are neither scalable nor maintainable. The roles of the monitoring and information service are: 1) providing diagnostics for troubleshooting the operation of SAMGrid services; 2) providing support for monitoring at the level of user jobs; 3) providing runtime support for local configuration and other information that currently must be stored centrally (thus moving the system toward greater autonomy for the SAM station services, which include cache management and job management services); 4) providing intelligent collection of statistics to enable performance monitoring and tuning. The architecture of this service is quite flexible, permitting input from any instrumented SAM application or service. It will allow multiple backend storage options for archiving (possibly filtered) monitoring events, as well as real-time information displays and an active notification service for alarm conditions. This service will be able to export, in a configurable manner, information to higher-level Grid monitoring services, such as MonALisa. We describe our experience to date with using a prototype version together with MonALisa.
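
As a rough illustration of the architecture described above (instrumented services publishing events, several storage backends with optional filters, and notification on alarm conditions), the following minimal Python sketch may help. It is not the SAMGrid implementation: every name in it (MonitoringEvent, MonitoringService, add_backend, add_alarm, the event kinds and the station name) is a hypothetical stand-in chosen for this example, and the backends are plain in-memory lists. An exporter to a higher-level service such as MonALisa could presumably be attached as one more filtered sink, but no MonALisa API is modeled here.

```python
from collections import Counter
from dataclasses import dataclass, field
from typing import Callable, List, Optional, Tuple
import time


@dataclass
class MonitoringEvent:
    """One event emitted by an instrumented service (hypothetical schema)."""
    source: str    # assumed: name of the emitting service, e.g. a station
    kind: str      # assumed: event type, e.g. "job_state" or "cache_error"
    payload: dict  # free-form details
    timestamp: float = field(default_factory=time.time)


EventFilter = Callable[[MonitoringEvent], bool]
EventSink = Callable[[MonitoringEvent], None]


class MonitoringService:
    """Fans events out to filtered backends and checks alarm conditions."""

    def __init__(self) -> None:
        self._backends: List[Tuple[EventSink, Optional[EventFilter]]] = []
        self._alarms: List[Tuple[EventFilter, EventSink]] = []
        self.stats: Counter = Counter()  # event counts per (source, kind)

    def add_backend(self, sink: EventSink,
                    event_filter: Optional[EventFilter] = None) -> None:
        # A backend with no filter receives every event.
        self._backends.append((sink, event_filter))

    def add_alarm(self, condition: EventFilter, notify: EventSink) -> None:
        self._alarms.append((condition, notify))

    def publish(self, event: MonitoringEvent) -> None:
        self.stats[(event.source, event.kind)] += 1
        for sink, event_filter in self._backends:
            if event_filter is None or event_filter(event):
                sink(event)
        for condition, notify in self._alarms:
            if condition(event):
                notify(event)


# In-memory stand-ins for storage backends and a notification channel.
archive: List[MonitoringEvent] = []
errors_only: List[MonitoringEvent] = []
alarms: List[MonitoringEvent] = []

service = MonitoringService()
service.add_backend(archive.append)
service.add_backend(errors_only.append, lambda e: e.kind.endswith("_error"))
service.add_alarm(lambda e: e.kind == "cache_error", alarms.append)

service.publish(MonitoringEvent("station-a", "job_state", {"state": "running"}))
service.publish(MonitoringEvent("station-a", "cache_error", {"detail": "disk full"}))
```

In this sketch the unfiltered backend archives both events, while the filtered backend and the alarm handler each see only the error event. Treating backends, displays, and exporters uniformly as sinks is one simple way to get the configurability the abstract describes; the actual service may be structured quite differently.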

Primary authors

A. Baranovski (FERMI NATIONAL ACCELERATOR LABORATORY)
A. Kreymer (FERMI NATIONAL ACCELERATOR LABORATORY)
A. Lyon (FERMI NATIONAL ACCELERATOR LABORATORY)
A. Sill (Texas Tech University)
F. Ratnikov (Rutgers University)
G. Garzoglio (FERMI NATIONAL ACCELERATOR LABORATORY)
I. Terekhov (FERMI NATIONAL ACCELERATOR LABORATORY)
J. Trumbo (FERMI NATIONAL ACCELERATOR LABORATORY)
L. Loebel Carpenter (Fermilab)
L. Lueking (FERMI NATIONAL ACCELERATOR LABORATORY)
M. Burgon-Lyon (Glasgow University)
M. Leslie (Oxford University)
R. Herber (FERMI NATIONAL ACCELERATOR LABORATORY)
R. Illingworth (FERMI NATIONAL ACCELERATOR LABORATORY)
R. Kennedy (FERMI NATIONAL ACCELERATOR LABORATORY)
R. St.Denis (Glasgow University)
S. Belforte (INFN/Trieste)
S. Stonjek (FERMI NATIONAL ACCELERATOR LABORATORY/Oxford University)
S. Veseli (FERMI NATIONAL ACCELERATOR LABORATORY)
S. White (FERMI NATIONAL ACCELERATOR LABORATORY)
U. Kerzel (Karlsruhe University)
V. Bartsch (Oxford University)
W. Merritt (FERMI NATIONAL ACCELERATOR LABORATORY)
