Description
The last few years have seen a wide range of developments towards scalable solutions for end-user physics analysis to meet the upcoming HL-LHC computing challenges. The IRIS-HEP software institute has created projects in a “Challenge” format to checkpoint this progress. The “Analysis Grand Challenge” probes analysis workflows and interfaces with a limited dataset size, while the “200 Gbps Challenge” focuses on throughput at large scale. A new Challenge, the “ATLAS Integration Challenge”, has recently been created to complement these by combining aspects of both. It defines a physics analysis task that captures the scale of available ATLAS data and the complexity of ATLAS analysis needs.
This contribution provides an overview of the analysis task in the Integration Challenge and of the pipeline developed for it. The implementation features two stages. Starting from datasets in a lightweight format for ATLAS physics analysis called PHYSLITE, NTuples are produced on the WLCG using ATLAS CP algorithms. These NTuples are then further processed into histograms for statistical inference at the University of Chicago Analysis Facility using the Scikit-HEP ecosystem of libraries. The Challenge focuses on three aspects in particular, which will be discussed: ensuring feature completeness for the physics analysis task, quantifying computational needs and performance to identify and address bottlenecks, and providing solutions for rapid turnaround for physics analysis development.
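As a rough illustration of the second stage and of the performance bookkeeping mentioned above, the sketch below fills a histogram chunk by chunk and reports an event rate. It is a minimal stand-in written with the Python standard library only, not the Challenge's implementation: the names `ntuple_chunks`, `fill_histogram`, and `run` are hypothetical, the "NTuple" is randomly generated toy data rather than a file produced on the WLCG, and the binning is arbitrary. A real pipeline would presumably rely on Scikit-HEP libraries for reading and histogramming instead.

```python
import random
import time
from bisect import bisect_right


def make_edges(low, high, n_bins):
    """Return n_bins + 1 equally spaced bin edges between low and high."""
    width = (high - low) / n_bins
    return [low + i * width for i in range(n_bins + 1)]


def fill_histogram(counts, edges, values):
    """Add values to counts in place; values outside [edges[0], edges[-1]) are dropped."""
    for v in values:
        if edges[0] <= v < edges[-1]:
            # bisect_right gives the index of the first edge strictly above v
            counts[bisect_right(edges, v) - 1] += 1
    return counts


def ntuple_chunks(n_chunks, chunk_size, seed=0):
    """Hypothetical stand-in for chunked NTuple reading: yields lists of toy values."""
    rng = random.Random(seed)
    for _ in range(n_chunks):
        # Toy observable drawn from an exponential distribution (assumption)
        yield [rng.expovariate(1 / 50.0) for _ in range(chunk_size)]


def run(n_chunks=4, chunk_size=1000, n_bins=20):
    """Process all chunks into one histogram and measure the event rate."""
    edges = make_edges(0.0, 200.0, n_bins)
    counts = [0] * n_bins
    n_events = 0
    start = time.perf_counter()
    for chunk in ntuple_chunks(n_chunks, chunk_size):
        fill_histogram(counts, edges, chunk)
        n_events += len(chunk)
    elapsed = time.perf_counter() - start
    rate = n_events / elapsed if elapsed > 0 else float("inf")
    return counts, n_events, rate


if __name__ == "__main__":
    counts, n_events, rate = run()
    print(f"processed {n_events} events at {rate:.0f} events/s")
    print(counts)
```

Processing in chunks keeps memory use bounded and makes the per-chunk work a natural unit for timing or for distributing across workers, which is the kind of measurement needed to locate bottlenecks.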