19–25 Oct 2024
Europe/Zurich timezone

ATLAS High-Luminosity LHC demonstrators with Data Carousel: Data-on-Demand and Tape Smart Writing

22 Oct 2024, 14:24
18m
Room 1.B (Medium Hall B)

Room 1.B (Medium Hall B)

Talk Track 1 - Data and Metadata Organization, Management and Access Parallel (Track 1)

Speaker

Xin Zhao (Brookhaven National Laboratory (US))

Description

The High Luminosity upgrade to the LHC (HL-LHC) is expected to generate scientific data on the scale of multiple exabytes. To tackle this unprecedented data storage challenge, the ATLAS experiment initiated the Data Carousel project in 2018. Data Carousel is a tape-driven workflow in which bulk production campaigns with input data resident on tape are executed by staging and promptly processing a sliding window to disk buffer such that only a small fraction of the input files are pinned on disk at any one time. Put in ATLAS production before Run3, Data Carousel continues to be our focus for seeking new opportunities in disk space savings, and enhancing tape usage throughout the ATLAS Distributed Computing (ADC) environment. These efforts are highlighted by two recent ATLAS HL-LHC demonstrator projects: data-on-demand and tape smart writing. We will discuss the recent studies and outcomes from these projects, along with various related improvements across the ATLAS distributed computing software. The research was conducted together with site experts at CERN and Tier-1 centers.

Primary authors

Alexei Klimentov (Brookhaven National Laboratory (US)) Doris Ressmann (Karlsruhe Institute of Technology (KIT)) Haykuhi Musheghyan (Karlsruhe Institute of Technology (KIT)) Julien Leduc (CERN) Mario Lassnig (CERN) Misha Borodin (University of Iowa (US)) Tadashi Maeno (Brookhaven National Laboratory (US)) Tatiana Korchuganova (University of Pittsburgh (US)) Xin Zhao (Brookhaven National Laboratory (US))

Presentation materials