25th International Conference on Computing in High Energy & Nuclear Physics

Name: 25th International Conference on Computing in High Energy & Nuclear Physics
Start: 2021-05-17T14:50:00+02:00
End: 2021-05-21T18:10:00+02:00
Location: No location set

17–21 May 2021

Europe/Paris timezone

Contact us

Analysis of data integrity and storage quality of a distributed storage system

20 May 2021, 15:52

13m

Short Talk Distributed Computing, Data Management and Facilities Monitoring

Adrian-Eduard Negru (University Politehnica of Bucharest (RO))

CERN uses the world's largest scientific computing grid, WLCG, for distributed data storage and processing. Monitoring of the CPU and storage resources is an important and essential element to detect operational issues in its systems, for example in the storage elements, and to ensure their proper and efficient function. The processing of experiment data depends strongly on the data access quality, as well as its integrity and both of these key parameters must be assured for the data lifetime. Given the substantial amount of data, O(200PB), already collected by ALICE and kept at various storage elements around the globe, scanning every single data chunk would be a very expensive process, both in terms of computing resources usage and in terms of execution time. In this paper, we describe a distributed file crawler that addresses these natural limits by periodically extracting and analyzing statistically significant samples of files from storage elements, evaluates the results and is integrated with the existing monitoring solution, MonALISA.

Adrian-Eduard Negru (University Politehnica of Bucharest (RO)) Betev Latchezar (CERN) Costin Grigoras (CERN) Mihai Carabas (University Politehnica of Bucharest (RO)) Sergiu Weisz (University Politehnica of Bucharest (RO)) Nicolae Tapus (University Politehnica of Bucharest (RO))

Recording

vCHEP_monitoring_track_data_analysis.pdf

NewVersionPaperCHEP2021 (2).pdf

25th International Conference on Computing in High Energy & Nuclear Physics

Contact us

Analysis of data integrity and storage quality of a distributed storage system

Speaker

Description

Authors

Presentation materials

Proceedings

Paper