Description
In the current ATLAS Distributed Computing model, available disk capacity is insufficient to store even a single complete copy of all data actively in use. Consequently, tape systems serve not only as long-term backups but also as primary data sources. Efficient utilization of tape at the ATLAS scale requires specialized orchestration mechanisms, as tape access is inherently slower and operationally more complex than disk access. Once data are staged from tape, they must be efficiently shared among all sites requiring them and, when likely to be reused, temporarily retained on disk to avoid redundant recalls. To address these challenges, the Data Carousel system was developed to coordinate large-scale tape staging across the distributed infrastructure. Its core functionality includes automated creation, sharing, retention, and deletion of staging rules based on dataset usage; dynamic staging profiles to balance tape load; dashboards and alert mechanisms for real-time monitoring; and both manual and automated recovery procedures for common tape issues and downtimes. In this paper, we describe the overall architecture of the Data Carousel, provide detailed usage statistics, and present a recent comprehensive refactoring of the system that significantly expands its scope. The refactored implementation integrates more closely with other Distributed Data Management activities, improves scalability and reliability, and prepares the system for the challenges of Run 4 and the HL-LHC era.
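The following sketch illustrates the kind of staging-rule lifecycle described above: a rule created when a dataset is recalled from tape is retained on disk while reuse is likely and deleted once it has gone idle. It is a minimal, hypothetical example in Python; the class, function names, and thresholds are assumptions for illustration and do not reflect the actual Data Carousel implementation.

```python
# Hypothetical illustration of a staging-rule retention decision.
# Names and thresholds are illustrative, not the real Data Carousel code.
from dataclasses import dataclass
from datetime import datetime, timedelta


@dataclass
class StagingRule:
    dataset: str
    created_at: datetime
    last_accessed: datetime
    access_count: int = 0


def decide_rule_action(rule: StagingRule,
                       now: datetime,
                       min_lifetime: timedelta = timedelta(days=7),
                       idle_limit: timedelta = timedelta(days=14)) -> str:
    """Return 'retain' or 'delete' for an existing staging rule.

    A rule is kept while the dataset is newly staged or still being read;
    once it has been idle longer than `idle_limit`, the disk copy is
    released and a later request would trigger a fresh tape recall.
    """
    if now - rule.created_at < min_lifetime:
        return "retain"   # give newly staged data time to be used
    if now - rule.last_accessed < idle_limit:
        return "retain"   # dataset is still actively read
    return "delete"       # free disk space; the tape copy remains


if __name__ == "__main__":
    now = datetime(2024, 6, 1)
    rule = StagingRule(dataset="example.dataset.DAOD",  # placeholder name
                       created_at=now - timedelta(days=30),
                       last_accessed=now - timedelta(days=20),
                       access_count=5)
    print(decide_rule_action(rule, now))  # -> 'delete'
```

In practice such decisions would be driven by recorded dataset popularity and coordinated with the other staging, sharing, and monitoring components mentioned above rather than by fixed time windows alone.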