25–29 May 2026
Chulalongkorn University
Asia/Bangkok timezone

Toward a Sustainable Workload Management Architecture for CMS at HL-LHC

26 May 2026, 14:57
18m
Chulalongkorn University

Oral Presentation Track 4 - Distributed computing

Speaker

Andrea Piccinelli (University of Notre Dame (US))

Description

The Compact Muon Solenoid (CMS) experiment is reassessing its Workload Management (WM) stack to meet the demands of HL-LHC scale and heterogeneity over a 20–25-year sustainability horizon. Over the past year, we surveyed multiple pathways (including reuse of external WM systems, hybrid approaches, and a ground-up redesign) and developed a blueprint that sets out the architectural principles of the HL-LHC WM project.

The blueprint centers on a separation of concerns between request intake/policy and execution, a single workflow specification portable across heterogeneous provisioners, data-locality-aware placement integrated with the data management and caching layers, modern security practices, a modular architecture with clear APIs, strong observability, and automated operations. Scalability targets include hundreds of thousands of workflows per day, horizontal elasticity, priority-aware queuing, and unified retry semantics.
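As an illustration only, two of the scalability targets above (priority-aware queuing and unified retry semantics) can be sketched in a few lines of Python. This is a minimal sketch under stated assumptions, not the CMS design: the class names, the `MAX_ATTEMPTS` budget, the "lower number means higher priority" convention, and the `execute` callback are all hypothetical.

```python
import heapq
import itertools
from dataclasses import dataclass, field

MAX_ATTEMPTS = 3  # hypothetical retry budget shared by every workflow type


@dataclass(order=True)
class QueuedWorkflow:
    priority: int                      # assumption: lower value = served first
    seq: int                           # tie-breaker, keeps FIFO order per priority
    name: str = field(compare=False)
    attempts: int = field(default=0, compare=False)


class PriorityRetryQueue:
    """Toy queue: one priority heap and one retry rule for all workflows."""

    def __init__(self):
        self._heap = []
        self._counter = itertools.count()

    def submit(self, name, priority):
        heapq.heappush(
            self._heap, QueuedWorkflow(priority, next(self._counter), name)
        )

    def run_all(self, execute):
        """Drain the queue; execute(name, attempt) returns True on success."""
        completed, failed = [], []
        while self._heap:
            wf = heapq.heappop(self._heap)
            wf.attempts += 1
            if execute(wf.name, wf.attempts):
                completed.append(wf.name)
            elif wf.attempts < MAX_ATTEMPTS:
                # Requeue behind other work of the same priority.
                wf.seq = next(self._counter)
                heapq.heappush(self._heap, wf)
            else:
                failed.append(wf.name)
        return completed, failed
```

The point of the sketch is that the retry rule lives in one place, independent of who submitted the workflow or where it runs, so central production and user analysis would share the same failure handling.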

We present the evaluation framework (functional parity, operability, cost of change, ecosystem maturity), results from early prototypes, and a phased migration plan covering central production and user analysis. The goal is to converge on an implementation that adheres to these principles, reduces bespoke adapters, and remains adaptable as sites and technologies evolve through Run-4 and Run-5.

Author

Andrea Piccinelli (University of Notre Dame (US))
