ACAT 2025

Name: ACAT 2025
Start: 2025-09-08T08:00:00+02:00
End: 2025-09-12T16:30:00+02:00
Location: Hamburg, Germany

8–12 Sept 2025

Hamburg, Germany

Europe/Berlin timezone

Neural Fake Factor Estimation

8 Sept 2025, 11:00

30m

ESA W 'West Wing'

Poster Track 2: Data Analysis - Algorithms and Tools Poster session with coffee break

Jan Gavranovic (Jozef Stefan Institute (SI)) Jernej Debevc (Jozef Stefan Institute (SI)) Lara Calic (Lund University (SE))

In a physics data analysis, "fake" or non-prompt backgrounds refer to events that would not typically satisfy the selection criteria for a given signal region, but are nonetheless accepted due to misreconstructed particles. This can occur, for example, when particles from secondary decays are incorrectly identified as originating from the hard scatter interaction point (resulting in non-prompt leptons), or when other physics objects, such as hadronic jets, are mistakenly reconstructed as leptons (resulting in fake leptons). These fake particles are taken into account by calculating a scale factor (a fake factor) and applying it as an event weight obtained by a data-driven technique. Traditionally, fake factors have been estimated by histogramming and computing the ratio of two distributions, typically as functions of a few relevant physics variables such as $p_{\mathrm{T}}$, $\eta$, and MET. In this work, we present a novel approach based on density ratio estimation using a transformer neural network trained directly on event data in a high-dimensional feature space. This enables the computation of a continuous, unbinned fake factor on a per-event basis, offering a more flexible, precise and higher-dimensional alternative to the conventional method.

Significance

The method presented is a novel approach on calculating any scale factor (not just a fake factor) using machine learning. The final result of the method is a function (neural network) that takes in an event and calculates an appropriate scale factor in high feature dimensional spaces.

References

Systematic evaluation of generative machine learning capability to simulate distributions of observables at the large hadron collider: https://link.springer.com/article/10.1140/epjc/s10052-024-13284-6
Normalizing Flows for Physics Data Analyses, presented at Conference on Computing in High Energy and Nuclear Physics 2024: https://indico.cern.ch/event/1338689/contributions/6016108/

Experiment context, if any	The method can be used by any HEP analysis and is not experiment specific.

Jan Gavranovic (Jozef Stefan Institute (SI))

Borut Paul Kersevan (Jozef Stefan Institute (SI)) Jernej Debevc (Jozef Stefan Institute (SI)) Lara Calic (Lund University (SE))

Neural_Fake_Factor_Estimation_ACAT2025_FV.pdf

ACAT 2025

Neural Fake Factor Estimation

ESA W 'West Wing'

Speakers

Description

Significance

References

Author

Co-authors

Presentation materials

Choose timezone

ACAT 2025

Speakers

Description

Significance

References

Author

Co-authors

Presentation materials