8โ€“12 Sept 2025
Hamburg, Germany
Europe/Berlin timezone

Efficient bin by bin profile likelihood minimization for precision measurements

11 Sept 2025, 17:40
20m
ESA B

ESA B

Oral Track 2: Data Analysis - Algorithms and Tools Track 2: Data Analysis - Algorithms and Tools

Speaker

David Walter (Massachusetts Inst. of Technology (US))

Description

The High-Luminosity LHC era will deliver unprecedented data volumes, enabling measurements on fine-grained multidimensional histograms containing millions of bins with thousands of events each. Achieving ultimate precision requires modeling thousands of systematic uncertainty sources, creating computational challenges for likelihood minimization and parameter extraction. Fast minimization is crucial for efficient analysis development.

We present a novel tensorflow-based tool, CombineTF2, that leverages optimized parallelization on CPUs and GPUs for this task. Our implementation interfaces with boost histograms, supporting flexible likelihood configurations with symmetrization options to establish Gaussian approximations. The minimization utilizes automatic differentiation to compute (quasi) second-order derivatives, yielding robust and efficient results. We further provide analytic proof of deterministic solutions within linear approximation limits.

Our tool distinctly focuses on measuring physical observables rather than intrinsic parameters, disentangling likelihood parameterization from quantities of interest and creating a more intuitive, less error prone user experience. Comprehensive benchmarking demonstrates excellent scaling with increased threading and reveals significant efficiency gaps when compared to commonly used frameworks in the field. These performance differences highlight the need for continued development of optimized statistical tools for high-energy physics analyses.

Significance

New tool with consequent application of machine learning libraries with the focus on binned profile likelihood fits in the limit of large data samples (gaussian approximation).
New mathematical development, such as symmetrization procedures, analytic proof of deterministic solutions within linear approximation limits, nonlinear corrections to uncertainty breakdowns.
A comprehensive benchmarking with existing statistical frameworks in the community.

References

CombineTF1 https://indico.cern.ch/event/882824/contributions/3932491/ in context of a CMS analysis
https://github.com/WMass/combinetf2

Experiment context, if any The tool can be used by any experiment. Authors are from the CMS collaboration and use the tool within the experiment.

Authors

David Walter (Massachusetts Inst. of Technology (US)) Josh Bendavid (CERN) Kenneth Long (Massachusetts Inst. of Technology (US))

Presentation materials