5–9 Jul 2021
Europe/Zurich timezone

Distributed statistical inference with pyhf

6 Jul 2021, 15:00
30m
Notebook talk Plenary session Tuesday

Speaker

Matthew Feickert (Univ. Illinois at Urbana Champaign (US))

Description

pyhf is a pure-python implementation of the HistFactory statistical model for multi-bin histogram-based analysis with interval estimation based on asymptotic formulas. pyhf supports modern computational graph libraries such as JAX, PyTorch, and TensorFlow to leverage features such as auto-differentiation and hardware acceleration on GPUs to reduce the time to inference. pyhf is also well adapted to performing distributed statistical inference across heterogeneous computing resources (clusters, clouds, and supercomputers) and task execution providers (HTCondor, Slurm, Torque, and Kubernetes) when paired with high-performance Function as a Service platforms like funcX or highly scalable systems like Google Cloud Platform that allow for resource bursting. In this notebook talk we will give interactive examples of performing statistical inference on public probability models from ATLAS analyses published to HEPData in which we leverage these resources to reproduce the analyses results with wall times of minutes.

Authors

Giordon Holtsberg Stark (University of California,Santa Cruz (US)) Matthew Feickert (Univ. Illinois at Urbana Champaign (US)) Lukas Alexander Heinrich (CERN)

Presentation materials