Shawn Mc Kee (University of Michigan (US))
The WLCG infrastructure has evolved from its original restrictive network topology, based on the MONARC model, to a more interconnected system, where data movement between regions or countries does not necessarily need to involve T1 centers. While this evolution brought obvious advantages, especially in terms of flexibility for the LHC experiment’s data management systems, it also raises the question of how to monitor and troubleshoot the increasing number of possible network paths, in order to provide a global, reliable network service. The perfSONAR network monitoring system (specifically the perfSONAR-PS implementation) has been evaluated and agreed as a proper solution to cover the WLCG network monitoring use cases: it allows WLCG to plan and execute latency and bandwidth tests between any instrumented endpoint through a central scheduling configuration, it allows archiving of the metrics in a local database, it provides a programmatic and a web based interface exposing the tests results; it also provides a graphical interface for remote management operations. In this presentation we will discuss our activity to deploy a perfSONAR-PS based network monitoring infrastructure, in the scope of the WLCG Operations Coordination initiative: we will motivate the main choices we agreed in terms of configuration and management, describe the additional tools we developed to complement the standard packages and present the status of the deployment, together with the possible future evolution.
Provide an overview of perfSONAR-PS monitoring for WLCG