19–25 Oct 2024
Europe/Zurich timezone

Building a Columnar Analysis Demonstrator for ATLAS PHYSLITE Open Data using the Python Ecosystem

21 Oct 2024, 14:42
18m
Large Hall B

Large Hall B

Talk Track 5 - Simulation and analysis tools Parallel (Track 5)

Speaker

Matthew Feickert (University of Wisconsin Madison (US))

Description

The ATLAS experiment is in the process of developing a columnar analysis demonstrator, which takes advantage of the Python ecosystem of data science tools. This project is inspired by the analysis demonstrator from IRIS-HEP.
The demonstrator employs PHYSLITE OpenData from the ATLAS collaboration, the new Run 3 compact ATLAS analysis data format. The tight integration of ROOT features within PHYSLITE presents unique challenges when integrating with the Python analysis ecosystem. The demonstrator is constructed from ATLAS PHYSLITE OpenData, ensuring the accessibility and reproducibility of the analysis.
The analysis pipeline of the demonstrator incorporates a comprehensive suite of tools and libraries. These include uproot for data reading, awkward-array for data manipulation, Dask for parallel computing, and hist for histogram processing. For the purpose of statistical analysis, the pipeline integrates cabinetry and pyhf, providing a robust toolkit for analysis. A significant component of this project is the custom application of corrections, scale factors, and systematic errors using ATLAS software. Therefore for this component we conduct a comparative analysis of event processing throughput across both the event-loop and columnar analysis environments. The infrastructure and methodology for these applications will be discussed in detail during the presentation, underscoring the adaptability of the Python ecosystem for high-energy physics analysis.

Authors

Alexander Held (University of Wisconsin Madison (US)) Dr Giordon Holtsberg Stark (University of California,Santa Cruz (US)) Gordon Watts (University of Washington (US)) Kyungeon Choi (University of Texas at Austin (US)) Lukas Alexander Heinrich (Technische Universitat Munchen (DE)) Matthew Feickert (University of Wisconsin Madison (US)) Matthias Vigl (Technische Universitat Munchen (DE)) Nikolai Hartmann (Ludwig Maximilians Universitat (DE)) Nils Erik Krumnack (Iowa State University (US)) Vangelis Kourlitis (Technische Universitat Munchen (DE))

Presentation materials