9–13 Jul 2018
Sofia, Bulgaria
Europe/Sofia timezone

Advancements in data management services for distributed e-infrastructures: the eXtreme-DataCloud project

12 Jul 2018, 14:45
15m
Hall 8 (National Palace of Culture)

Hall 8

National Palace of Culture

presentation Track 4 - Data Handling T4 - Data handling

Speaker

Daniele Cesini (Universita e INFN, Bologna (IT))

Description

The development of data management services capable to cope with very large data resources is a key challenge to allow the future e-infrastructures to address the needs of the next generation extreme scale scientific experiments.
To face this challenge, in November 2017 the H2020 “eXtreme DataCloud - XDC” project has been launched. Lasting for 27 months and combining the expertise of 8 large European research organisations, the project aims at developing scalable technologies for federating storage resources and managing data in highly distributed computing environments. The targeted platforms are the current and next generation e-Infrastructures deployed in Europe, such as the European Open Science Cloud (EOSC), the European Grid Infrastructure (EGI), and the Worldwide LHC Computing Grid (WLCG).
The project is use-case driven with a multidisciplinary approach, addressing requirements from research communities belonging to a wide range of scientific domains: High Energy Physics, Astronomy, Photon and Life Science, Medical research.
XDC will implement data management scalable services to address the following high level topics: policy driven data management based on Quality-of-Service, Data Life-cycle management, smart placement of data with caching mechanisms to reduce access latency, meta-data with no predefined schema handling, execution of pre-processing applications during ingestion, data management and protection of sensitive data in distributed e-infrastructures, intelligent data placement based on access patterns.
Experts from the project consortium will work on combining already established data management and orchestration tools to provide a highly scalable solution supporting the computing models of the current and next generation experiments. The XDC products will be based on tools such as ONEDATA, EOS, FTS, Indigo-Orchestrator, Indigo-CDMI server, Dynafed.
This contribution will introduce the project, present the foreseen overall architecture and the developments that are being carried on to implement the requested functionalities.

Authors

Dr Alessandro Costantini (INFN) Daniele Cesini (Universita e INFN, Bologna (IT)) Giacinto Donvito (INFN-Bari) Doina Cristina Duma (INFN - CNAF) Viljoen Matthew Serena Battaglia (ECRIN) Vincent Poireau (Laboratoire d'Annecy-le-Vieux de Physique des Particules (LAPP)) Luca Dell'Agnello (INFN-CNAF) Oliver Keeble (CERN) Rachid Lemrani (CNRS/IN2P3) Christian Ohmann Mr Jesus Marco de Lucas (Instituto de Física de Cantabria) Lukasz Dutka Patrick Fuhrmann (DESY) Fernando Aguilar Gomez (Universidad de Cantabria (ES))

Presentation materials