Speaker
Dr
Vladimir Sapunenko
(INFN-CNAF (IT))
Description
Data management constitutes one of the major challenges that a geographically-distributed data centre has to face, especially when remote data access is involved. We discuss an integrated solution which enables transparent and efficient access to online and nearline data through high latency networks. This is based on the joint use of the General Parallel File System (GPFS) and of the Tivoli Storage Manager (TSM). Both products, developed by IBM, are well known and extensively used in the HEP computing world. Owing to a new feature introduced in GPFS 3.5, so-called Active File Management (AFM), the definition of a single, geographically-distributed namespace, characterised by automated data flow management between different locations, becomes possible. As a practical example, we present the implementation of AFM-based remote data access between two data centres located in Bologna and Rome, demonstrating the validity of the solution for the use case of the AMS experiment.
Author
Dr
Vladimir Sapunenko
(INFN-CNAF (IT))
Co-authors
Domenico D'Urso
(Universita e INFN (IT))
Luca dell'Agnello
(INFN-CNAF)
Vincenzo Vagnoni
(Universita e INFN (IT))