Efficient monitoring system for large-scale federated data storages

chaired by Ignacio Reguero
Friday, 29 November 2013 from to (Europe/Zurich)
at CERN ( 31-3-004 - IT Amphitheatre )
Description
The computing models of the LHC experiments are gradually moving from hierarchical data models towards federated storage such as the XRootD federation. Having a good understanding of the data access pattern is required to provide hints for resource optimization and effective data distribution policy.
This session will deal with the real-time monitoring dataflow which operates at scale without adding any performance overhead. It will focus on monitoring for the ATLAS and CMS XRootD federations: Federated Atlas XRootD (FAX) and Any Data, Any Time, Anywhere (AAA)) implemented in the Experiment Dashboard monitoring framework.
Webcast Please note that this event will be available live via the Webcast Service.
Go to day
  • Friday, 29 November 2013
    • 10:00 - 10:50 Monitoring of the xrootd federations 50'
      The computing models of the LHC experiments are gradually moving from hierarchical data models with centrally managed data pre-placement towards federated storage which provides seamless access to data files independently of their location and dramatically improved recovery due to fail-over mechanisms.
      Enabling loosely coupled data clusters to act as a single storage resource should increase opportunities for data analysis and should enable more effective use of computational resources at sites with limited storage capacities.
      Construction of the data federations and understanding the impact of the new approach to data management on user analysis requires complete and detailed monitoring. Monitoring functionality should cover the status of all components of the federated storage, measuring data traffic and data access performance, as well as being able to detect any kind of inefficiencies and to provide hints for resource optimization and effective data distribution policy. Data mining of the collected monitoring data provides a deep insight into new patterns of usage of the storage resources, beyond that provided by other monitoring strategies. In the WLCG context, there are several federations currently based on the XRootD technology.
      The talk will focus on monitoring for the ATLAS and CMS XRootD federations (Federated Atlas XRootD (FAX) and Any Data, Any Time, Anywhere (AAA)) implemented in the Experiment Dashboard monitoring framework. Both federations consist of many dozens of sites accessed by many hundreds of clients and they continue to grow in size. Handling of the monitoring flow generated by these systems has to be well optimized in order to achieve the required performance.
      The talk will demonstrate that though FAX and AAA Dashboards are being developed for XRootD federations, the implementation is generic and can be easily adapted for other technologies, such as HTTP/WebDAV federations.
      Speaker: Alexandre Beche (CERN)
      Material: Slides powerpoint file pdf file Video in CDS link