21-27 March 2009
Prague
Europe/Prague timezone

On gLite WMS/LB Monitoring and Management through WMSMon

26 Mar 2009, 08:00
1h
Prague

Prague Congress Centre, 5. května 65, 140 00 Prague 4, Czech Republic
Board: Thursday 050
poster
Grid Middleware and Networking Technologies
Poster session

Speaker

Daniele Cesini (INFN CNAF)

Description

The Workload Management System (WMS) is the gLite service supporting the distributed production and analysis activities of various HEP experiments. It is responsible for dispatching computing jobs to remote computing facilities by matching job requirements against the resource status information collected from the Grid information services. Given the distributed and heterogeneous nature of the Grid, monitoring the job lifecycle and the aggregate workflow patterns generated by multiple user communities, as well as the reliability of the service, is of great importance. In this paper we address the problem of WMS monitoring and management. We present the architecture and implementation of WMSMonitor, a tool for WMS monitoring and management designed to meet the needs of various WMS user categories: administrators, developers, advanced Grid users and performance testers. The tool was successfully deployed to monitor the progress of WMS job submission activities during HEP computing challenges. We also describe how, for each WMS in a cluster, WMSMon produces status indexes and a load metric that can be used for automated notification of critical events via Nagios, or for ranking service instances deployed in load balancing mode.
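As an illustration of the last point, the following minimal sketch shows how a per-instance load metric could feed both a Nagios-style status and a ranking of instances. It is not the WMSMon implementation: the `WmsSample` fields, the equal-weight formula in `load_metric`, and the 0.7/0.9 thresholds are all assumptions made for this example. Only the exit codes (0 = OK, 1 = WARNING, 2 = CRITICAL) follow the standard Nagios plugin convention.

```python
from dataclasses import dataclass

# Standard Nagios plugin exit codes.
OK, WARNING, CRITICAL = 0, 1, 2


@dataclass
class WmsSample:
    """Hypothetical snapshot of one WMS instance (field names are illustrative)."""
    host: str
    cpu_load: float       # CPU load normalized to [0, 1]
    queued_jobs: int      # jobs waiting in the instance's input queue
    max_queued_jobs: int  # queue length treated as "fully loaded"


def load_metric(sample: WmsSample) -> float:
    """Assumed metric: equal-weight mean of CPU load and queue fill, in [0, 1]."""
    cpu = min(max(sample.cpu_load, 0.0), 1.0)
    queue_fill = min(sample.queued_jobs / sample.max_queued_jobs, 1.0)
    return 0.5 * cpu + 0.5 * queue_fill


def nagios_status(load: float, warn: float = 0.7, crit: float = 0.9) -> int:
    """Map a load value to a Nagios exit code using assumed thresholds."""
    if load >= crit:
        return CRITICAL
    if load >= warn:
        return WARNING
    return OK


def rank_instances(samples: list[WmsSample]) -> list[WmsSample]:
    """Order instances from least to most loaded, e.g. for load balancing."""
    return sorted(samples, key=load_metric)


# Made-up sample values for three hypothetical instances.
samples = [
    WmsSample("wms-a", cpu_load=0.95, queued_jobs=950, max_queued_jobs=1000),
    WmsSample("wms-b", cpu_load=0.20, queued_jobs=100, max_queued_jobs=1000),
    WmsSample("wms-c", cpu_load=0.80, queued_jobs=700, max_queued_jobs=1000),
]
ranked_hosts = [s.host for s in rank_instances(samples)]
statuses = {s.host: nagios_status(load_metric(s)) for s in samples}
```

In a real deployment the status code would be returned as the exit code of a Nagios check, and the ranked list would be consumed by whatever mechanism distributes submissions across the WMS instances.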
Presentation type (oral | poster) oral

Primary author

Daniele Cesini (INFN CNAF)

Co-authors

Danilo Dongiovanni (INFN CNAF)
Enrico Fattibene (INFN CNAF)
Dr Luciana Carota (INFN CNAF)
Dr Tiziana Ferrari (INFN CNAF)
