Prototype of the Russian Scientific Data Lake

19 May 2021, 10:50
13m
Short Talk Distributed Computing, Data Management and Facilities Storage

Speaker

Mr Andrey Kirianov (NRC Kurchatov Institute PNPI (RU))

Description

The High Luminosity phase of the LHC, which aims for a ten-fold increase in the luminosity of proton-proton collisions, is expected to start operation in eight years. An unprecedented scientific data volume at the multi-exabyte scale will be delivered to particle physics experiments at CERN. This amount of data has to be stored, and the corresponding technology must ensure fast and reliable data delivery for processing by the scientific community all over the world. The present LHC computing model will not be able to provide the required infrastructure growth, even taking into account the expected hardware evolution. To address this challenge, the Data Lake R&D project was launched by the DOMA community in the fall of 2019. State-of-the-art data handling technologies are under active development, and their current status for the Russian Scientific Data Lake prototype is presented here.

Primary authors

Mr Andrey Kirianov (NRC Kurchatov Institute PNPI (RU))
Andrey Zarochentsev (St Petersburg State University (RU))
Alexei Klimentov (Brookhaven National Laboratory (US))
Aleksandr Alekseev (Universidad Andres Bello (CL))
Valeri Mitsyn (Joint Institute for Nuclear Research (RU))
Mr Danila Oleynik (Joint Institute for Nuclear Research (RU))
Xavier Espinal (CERN)
Stephane Jezequel (LAPP-Annecy CNRS/USMB (FR))
Tatiana Korchuganova (Universidad Andres Bello (CL))
Alexander Smirnov (Plekhanov Russian University of Economics, Moscow, 117997, Russia)
Sergei Smirnov (National Research Nuclear University MEPhI, Moscow, 115409, Russia)
