21–25 May 2012
New York City, NY, USA
US/Eastern timezone

Towards a global monitoring system for CMS computing operations

22 May 2012, 17:50
25m
Room 914 (Kimmel Center)

Room 914

Kimmel Center

Parallel Distributed Processing and Analysis on Grids and Clouds (track 3) Distributed Processing and Analysis on Grids and Clouds

Speakers

Dr Andrea Sciaba (CERN) Lothar A.T. Bauerdick (Fermi National Accelerator Lab. (US))

Description

The operation of the CMS computing system requires a complex monitoring system to cover all its aspects: central services, databases, the distributed computing infrastructure, production and analysis workflows, the global overview of the CMS computing activities and the related historical information. Several tools are available to provide this information, developed both inside and outside of the collaboration and often used in common with other experiments. Despite the fact that the current monitoring allowed CMS to successfully perform its computing operations, an evolution of the system is clearly required, to adapt to the recent changes in the data and workload management tools and models and to address some shortcomings that make its usage less than optimal. Therefore, a recent and ongoing coordinated effort was started in CMS, aiming at improving the entire monitoring system by identifying its weaknesses and the new requirements from the stakeholders, rationalise and streamline existing components and drive future software development. This contribution gives a complete overview of the CMS monitoring system and a description of all the recent activities that have been started with the goal of providing a more integrated, modern and functional global monitoring system for computing operations.

Primary authors

Dr Andrea Sciaba (CERN) Lothar A.T. Bauerdick (Fermi National Accelerator Lab. (US))

Presentation materials