Transformers excel at symbolic data manipulation, but most of their applications in physics deal with numerical calculations. I present a number of applications of symbolic AI in mathematics, and one in theoretical physics: learning scattering amplitudes.
Informed by the many fields in which machine learning (ML) has made impacts, the coming years promise to see exciting improvements in the discovery and measurement power of LHC experiments. But stepping back from the many exploratory studies ongoing, there are already dozens of concrete and rigorous public LHC results leveraging advanced ML. This review will examine common themes of those...
High-precision simulations based on first principles are a cornerstone of LHC physics research. In view of the HL-LHC era, there is an ever-increasing demand for both accuracy and speed in simulations. In this talk, I will first explain the basic principles of LHC event generation and highlight current methodologies and their bottlenecks. Afterwards, I will delve into the MadNIS journey and...
How can one fully harness the power of physics encoded in relativistic $N$-body phase space? Topologically, phase space is isomorphic to the product space of a simplex and a hypersphere and can be equipped with explicit coordinates and a Riemannian metric. This natural structure that scaffolds the space on which all collider physics events live opens up new directions for machine learning...
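The product-space picture above can be illustrated numerically: a point in a simplex is a set of non-negative fractions summing to one, and a point on a hypersphere is a unit vector. The sketch below is only a toy illustration of this structure (the particle count and angular dimension are placeholders, not the talk's construction):

```python
import numpy as np

rng = np.random.default_rng(0)
N = 4  # number of final-state particles (illustrative)

# A point in the simplex: e.g. energy fractions summing to one,
# sampled uniformly via the flat Dirichlet distribution.
fractions = rng.dirichlet(np.ones(N))

# A point on a unit hypersphere: normalize a standard Gaussian vector.
v = rng.standard_normal(3 * N - 4)  # illustrative angular dimension
direction = v / np.linalg.norm(v)

print(fractions.sum())            # ~1.0: lies in the simplex
print(np.linalg.norm(direction))  # ~1.0: lies on the hypersphere
```

Explicit coordinates of this kind are what allow a Riemannian metric, and hence geometry-aware ML, to be defined on phase space.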
Quantum Generative Models are emerging as a promising tool for modelling complex physical phenomena. In this work, we explore the application of Quantum Boltzmann Machines and Quantum Generative Adversarial Networks to the intricate task of jet substructure modelling in high-energy physics. Specifically, we use these quantum frameworks to model the kinematics and corrections of the leading...
Recent innovations in machine learning allow for unbinned data unfolding that includes correlations across many dimensions. We describe a set of known, upgraded, and new methods for ML-based unfolding. The performance of these approaches is evaluated on two benchmark datasets. We find that all techniques are capable of accurately reproducing the particle-level spectra across complex...
The Fair Universe project is organising the HiggsML Uncertainty Challenge, running from September 2024 to 14 March 2025. It is a NeurIPS 2024 competition.
This HEP and Machine Learning competition is the first to strongly emphasise uncertainties: mastering uncertainties in the input training dataset and outputting credible confidence intervals.
The context is the measurement...
The simplification and reorganization of complex expressions lies at the core of scientific progress, particularly in theoretical high-energy physics. This work explores the application of machine learning to a particular facet of this challenge: the task of simplifying scattering amplitudes expressed in terms of spinor-helicity variables. We demonstrate that an encoder-decoder transformer...
The precise measurement of kinematic features of jets is key to the physics program of the LHC. The determination of the energy and mass of jets containing bottom quarks ($b$-jets) is particularly difficult given their distinct radiation patterns and the production of undetectable neutrinos via leptonic heavy-flavor decays. This talk will describe a novel calibration technique for the $b$-jet kinematics...
The High Luminosity upgrade to the LHC will deliver an unprecedented luminosity to the ATLAS experiment. Ahead of this increase in data the ATLAS trigger and data acquisition system will undergo a comprehensive upgrade. The key function of the trigger system is to maintain a high signal efficiency together with a high background rejection whilst adhering to the throughput constraints of the...
The ATLAS experiment reconstructs electrons and photons from clusters of energy deposits in the electromagnetic calorimeter. The reconstructed electron and photon energy must be corrected relative to the measured energy deposits in the clusters to account for energy loss in passive material upstream of the calorimeter, in passive material within the calorimeter, out-of-cluster energies, and leakage in...
This talk presents a synergy between quark/gluon jet tagging on LHC data and charged-hadron time-of-flight (TOF) regression on ILC data, in the form of one problem-solving mechanism that can address both tasks. Both involve processing data represented as unordered point clouds of varying length, which are optimally handled by permutation-invariant architectures.
A...
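The permutation-invariance idea above can be sketched with a Deep-Sets style toy: apply the same map to each particle, pool with a symmetric operation, then process the pooled summary. The weights here are random stand-ins, not a trained tagger:

```python
import numpy as np

rng = np.random.default_rng(1)

# Toy per-particle features, e.g. (pT, eta, phi), for a variable-length set.
particles = rng.standard_normal((7, 3))

# Fixed random weights for the per-particle map phi and set-level map rho.
W_phi = rng.standard_normal((3, 8))
W_rho = rng.standard_normal((8, 1))

def deep_sets(x):
    """Deep-Sets style score: rho(sum_i phi(x_i)) is permutation invariant."""
    h = np.maximum(x @ W_phi, 0.0)   # phi applied to each particle (one ReLU layer)
    pooled = h.sum(axis=0)           # symmetric pooling over the set
    return float(pooled @ W_rho)     # rho on the pooled representation

out = deep_sets(particles)
shuffled = particles[rng.permutation(len(particles))]
print(np.isclose(out, deep_sets(shuffled)))  # True: particle order is irrelevant
```

Because the pooling is a sum, reordering the input particles leaves the output unchanged, which is the property both tasks exploit.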
Differentiable programming opens exciting new avenues in particle physics, also affecting future event generators. These new techniques boost the performance of current and planned MadGraph implementations. Combining phase-space mappings with a set of very small learnable flow elements, MadNIS-Lite, can improve the sampling efficiency while being physically interpretable. This defines a third...
Deep learning can have a significant impact on the physics performance of electron-positron Higgs factories such as the ILC and FCCee. We are working on two event-reconstruction topics that apply deep learning. One is jet flavor tagging: we apply the Particle Transformer to ILD full simulation to obtain jet flavor, including strange tagging. The other is particle flow, which clusters calorimeter hits...
We attempt to extend the typical stratification of parameter space used during Monte Carlo simulations by considering regions of arbitrary shape. Such regions are defined by directly using their importance for the simulation, for example, a likelihood or scattering amplitude. In particular, we consider the possibility that the parameter space may be high dimensional and the simulation costly...
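Stratification by importance, rather than by a coordinate grid, can be shown in a minimal 1D toy (in higher dimensions a region like {x : f(x) < t} is generally not rectangular, which is the point of the abstract; here the 1D case makes the strata explicit):

```python
import numpy as np

rng = np.random.default_rng(2)
f = lambda x: x**2  # toy "amplitude"; true integral on [0, 1] is 1/3

def stratified_estimate(n):
    """Stratified MC: strata chosen by the integrand's importance.

    Region A is where f(x) < 0.25 (here x < 0.5), region B the rest.
    Each stratum gets its own sample, weighted by its volume.
    """
    xa = rng.uniform(0.0, 0.5, n)   # stratum A, volume 0.5
    xb = rng.uniform(0.5, 1.0, n)   # stratum B, volume 0.5
    return 0.5 * f(xa).mean() + 0.5 * f(xb).mean()

est = stratified_estimate(100_000)
print(est)  # close to 1/3
```

Sampling the high-importance stratum separately reduces the variance relative to a single uniform sample of the same total size.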
Identifying the origin of high-energy hadronic jets (`jet tagging') has been a critical benchmark problem for machine learning in particle physics. Jets are ubiquitous at colliders and are complex objects that serve as prototypical examples of collections of particles to be categorized. Over the last decade, machine learning-based classifiers have replaced classical observables as the state...
Background estimation is already a bottleneck in several analyses at LHCb, and with the upcoming larger datasets, the demand for efficient background simulation will continue to grow. While there are existing tools that can provide quick, rough estimates of background reconstructed distributions (e.g. RapidSim), these cannot account for the effects of common selection criteria. The tool...
Attention-based transformer models have become increasingly prevalent in collider analysis, offering enhanced performance for tasks such as jet tagging. However, they are computationally intensive
and require substantial data for training. In this paper, we introduce a new jet classification network
using an MLP mixer, where two subsequent MLP operations serve to transform particle and...
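The mixer idea can be sketched as two alternating MLP operations on a particles-by-features matrix: one mixes information across particles (tokens), the other across features (channels). The shapes and random weights below are illustrative, not the paper's architecture:

```python
import numpy as np

rng = np.random.default_rng(3)
P, F = 16, 8  # particles (tokens) and features (channels), illustrative sizes

X = rng.standard_normal((P, F))  # one jet as a particles x features matrix

# Two subsequent MLP operations: one applied per feature column across
# particles, the other applied per particle row across features.
W_tok = rng.standard_normal((P, P)) / np.sqrt(P)
W_ch = rng.standard_normal((F, F)) / np.sqrt(F)

def mixer_block(x):
    x = x + np.maximum(W_tok @ x, 0.0)  # token (particle) mixing + residual
    x = x + np.maximum(x @ W_ch, 0.0)   # channel (feature) mixing + residual
    return x

out = mixer_block(X)
print(out.shape)  # (16, 8): the block preserves the input shape
```

Because both mixing steps are plain matrix multiplications, the block avoids the quadratic attention cost that makes transformers expensive.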
In many real-world scenarios, data is hybrid, i.e. described by both continuous and discrete features. At high-energy accelerators like the LHC, jet constituents exhibit discrete properties such as electric charge or particle ID. In this talk, we introduce a novel generative model for discrete features based on continuous-time Markov jump processes. By combining our approach with well-known...
This study introduces an approach to learning augmentation-independent jet representations using a Jet-based Joint Embedding Predictive Architecture (J-JEPA). This approach aims to predict various physical targets from an informative context, using target positions as joint information. We study several methods for defining the targets and context, including grouping subjets within a jet, and...
AI generative models, such as generative adversarial networks (GANs), have been widely used and studied as efficient alternatives to traditional scientific simulations like Geant4. Diffusion models, which have demonstrated great capability in generating high-quality text-to-image translations in industry, have yet to be applied to high-energy heavy-ion physics.
In this talk, we present...
Extracting scientific understanding from particle-physics experiments requires solving diverse learning problems with high precision and good data efficiency. We present the Lorentz Geometric Algebra Transformer (L-GATr), a new multi-purpose architecture for high-energy physics. L-GATr represents high-energy data in a geometric algebra over four-dimensional space-time and is equivariant under...
Efficient jet flavour-tagging is crucial for event reconstruction and particle analyses in high energy physics (HEP). Graph Neural Networks (GNNs) excel in capturing complex relationships within graph-structured data, and we aim to enhance the classification of b-jets using this method of deep learning. Presented in this work is the first application of a novel GNN b-jet tagger using the LHCb...
Monte Carlo (MC) simulations are crucial for collider experiments, enabling the comparison of experimental data with theoretical predictions. However, these simulations are computationally demanding, and future developments, like increased event rates, are expected to surpass available computational resources. Generative modeling can substantially cut computing costs by augmenting MC...
One potential roadblock for the HL-LHC, scheduled to begin in 2029, is the computational demand of traditional collision simulations. Projections suggest current methods will require millions of CPU-years annually, far exceeding existing computational capabilities. Replacing the event-shower module in calorimeters with quantum-assisted deep learning surrogates can help bridge...
The steady progress in machine learning leads to substantial performance improvements in various areas of high-energy physics, especially for object identification. Jet flavor identification (tagging) is a prominent benchmark that profits from elaborate architectures, leveraging information from low-level input variables and their correlations. Throughout the data-taking eras of the Large...
Precise tau identification is a crucial component of many studies targeting the Standard Model or searches for New Physics within the CMS physics program. The DeepTau v2.5 algorithm is a convolutional neural network, an improved version of its predecessor, DeepTau v2.1, deployed for LHC Run 3. This updated version integrates several enhancements to improve classification...
As data sets grow in size and complexity, simulated data play an increasingly important role in analysis. In many fields, two or more distinct simulation software applications are developed that trade off with each other in terms of accuracy and speed. The quality of insights extracted from the data stands to increase if the accuracy of faster, more economical simulation could be improved to...
The phenomenon of jet quenching, a key signature of the Quark-Gluon Plasma (QGP) formed in Heavy-Ion (HI) collisions, provides a window into the properties of this primordial liquid. In this study, we rigorously evaluate the discriminating power of Energy Flow Networks (EFNs), enhanced with substructure observables, in distinguishing between jets stemming from proton-proton (pp) and...
The CMS Fast Simulation chain (FastSim) is roughly 10 times faster than the application based on the GEANT4 detector simulation and full reconstruction referred to as FullSim. This advantage however comes at the price of decreased accuracy in some of the final analysis observables. A machine learning-based technique to refine those observables has been developed and its status is presented...
Fast event and detector simulation in high-energy physics using generative models provides a viable solution for generating sufficient statistics within a constrained computational budget, particularly in preparation for the High Luminosity LHC. However, many of these applications suffer from a quality/speed tradeoff. Diffusion models offer some of the best sampling quality but slow generation...
We improve upon the existing literature on pileup mitigation techniques studied at Large Hadron Collider (LHC) experiments for disentangling proton-proton collisions. Pileup presents a salient problem that, if left unchecked, hinders searches for new physics and Standard Model precision measurements by degrading observables such as jet energy, jet substructure, missing momentum, and lepton isolation. The primary...
Simulating particle physics data is an essential yet computationally intensive process in analyzing data from the LHC. Traditional fast simulation techniques often use a surrogate calorimeter model followed by a reconstruction algorithm to produce reconstructed objects. In this work, we introduce Particle-flow Neural Assisted Simulations (Parnassus), a deep learning-based method for generating...
Supervised deep learning methods have found great success in the field of high energy physics (HEP), and the trend within the field is to move away from high-level reconstructed variables to low-level detector features. However, supervised methods require labelled data, which is typically provided by a simulator. The simulations of HEP datasets become harder to validate and calibrate as we...
We propose a novel framework to obtain asymptotic frequentist uncertainties on machine learned classifier outputs by using model ensembles. With the well-known likelihood trick, this framework can then be applied to the task of density ratio estimation to obtain statistically rigorous frequentist uncertainties on estimated likelihood ratios. As a toy example, we demonstrate that the framework...
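The likelihood trick mentioned above can be sketched in a small toy: a classifier trained to separate samples from two densities yields a score s(x), and s/(1-s) estimates the density ratio; an ensemble over bootstrap resamples gives a spread on that estimate. Everything below (the Gaussian toy data, the hand-rolled logistic regression) is an illustrative stand-in, not the proposed framework:

```python
import numpy as np

rng = np.random.default_rng(4)
n = 2000
x = np.concatenate([rng.normal(-1, 1, n), rng.normal(1, 1, n)])
y = np.concatenate([np.zeros(n), np.ones(n)])

def fit_logistic(xs, ys, steps=1500, lr=0.1):
    """Plain 1D logistic regression trained by gradient descent."""
    w, b = 0.0, 0.0
    for _ in range(steps):
        p = 1.0 / (1.0 + np.exp(-(w * xs + b)))
        w -= lr * np.mean((p - ys) * xs)
        b -= lr * np.mean(p - ys)
    return w, b

# Ensemble over bootstrap resamples; the likelihood trick converts each
# classifier score s into a density-ratio estimate r = s / (1 - s).
ratios_at_zero = []
for _ in range(5):
    idx = rng.integers(0, len(x), len(x))
    w, b = fit_logistic(x[idx], y[idx])
    s = 1.0 / (1.0 + np.exp(-(w * 0.0 + b)))
    ratios_at_zero.append(s / (1.0 - s))

print(np.mean(ratios_at_zero))  # near 1: the two densities agree at x = 0
print(np.std(ratios_at_zero))   # ensemble spread, a proxy for the uncertainty
```

The spread across ensemble members is what the proposed framework turns into asymptotic frequentist uncertainties.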
Observables sensitive to top quark polarization are important for characterizing and discovering new physics. The most powerful spin analyzer in the top decay is the down-type fermion from the W, which in the case of leptonic decay allows for very clean measurements. However, in many applications, it is useful to measure the polarization of hadronically decaying top quarks via an optimal...
Energy correlators have recently shown potential to improve the precision of the top mass measurement. However, existing measurement strategies still use only part of the information in the EEEC distribution and rely on arbitrary shape choices. In this talk, we explore the ability of machine learning to optimize the shape choice and reduce the uncertainty on the top mass. Specifically,...
Analysis of collision data often involves training deep learning classifiers on very specific tasks and in regions of phase-space where the training datasets have limited statistics. Models pre-trained on a larger, more generic, sample may already have a useful representation of collider data which can be leveraged by many independent downstream analysis tasks. We introduce a class of...
ATLAS explores modern neural networks for a multi-dimensional calibration of its calorimeter signal defined by clusters of topologically connected cells (topo-clusters). The Bayesian neural network (BNN) approach yields a continuous and smooth calibration function, including uncertainties on the calibrated energy per topo-cluster. In this talk the performance of this BNN-derived calibration is...
Machine learning is becoming increasingly popular in the context of particle physics. Supervised learning, which uses labeled Monte Carlo simulations, remains one of the most widely used methods for discriminating signals beyond the Standard Model. However, this paper suggests that supervised models may depend excessively on artifacts and approximations from Monte Carlo simulations,...
Evidential Deep Learning (EDL) is an uncertainty-aware deep learning approach designed to provide confidence (or epistemic uncertainty) about test data. It treats learning as an evidence acquisition process where more evidence is interpreted as increased predictive confidence. This talk will provide a brief overview of EDL for uncertainty quantification (UQ) and its application to jet tagging...
Large-scale point cloud and long-sequence processing are crucial for high energy physics applications such as pileup mitigation and track reconstruction. The HL-LHC presents inevitable challenges to machine learning models, requiring both high stability and low computational complexity. Previous studies have primarily focused on graph-based approaches which are generally effective but often...
Galactic dynamics studies often face the challenge of incomplete kinematic information in stellar catalogs.
This incompleteness poses a significant challenge to a complete and model-independent measurement of local galactic dark matter densities using stellar dynamics.
This talk presents two innovative approaches that fuse physics principles with machine learning techniques, specifically...
The next decade will see an order of magnitude increase in data collected by high-energy physics experiments,
driven by the High-Luminosity LHC (HL-LHC). The reconstruction of charged particle trajectories (tracks) has
always been a critical part of offline data processing pipelines. The complexity of HL-LHC data will however
increasingly mandate track finding in all stages of an...
Reconstructing particle tracks from detector hits is computationally intensive due to the large combinatorics involved. Recent work has shown that ML techniques can enhance conventional tracking methods, but complex models are often difficult to implement on heterogeneous trigger systems, such as FPGAs. While deploying neural networks on FPGAs is possible, resource limitations pose challenges....
The dynamics of stars in our galaxy encode crucial information about the Milky Way's dark matter halo. However, extinction from foreground dust can bias studies of stellar populations. By solving the equilibrium collisionless Boltzmann equation with novel machine learning techniques, we estimate the unbiased 6-dimensional phase space density of an equilibrated stellar population and the...
Accurately reconstructing particles from detector data is a critical challenge in experimental particle physics, where the spatial resolution of calorimeters plays a key role. This study explores the integration of super-resolution techniques into the Large Hadron Collider (LHC)-like reconstruction pipeline to enhance the granularity of calorimeter data. By applying super-resolution, we...
We introduce SkyCURTAINs, an adaptation of the CURTAINs method—a weakly supervised technique originally developed for anomaly detection in high-energy physics data—applied to data from the second Gaia Data Release (GDR2). SkyCURTAINs is employed to search for stellar streams, which appear as line-like overdensities against the background of the Milky Way. To validate the feasibility of this...
Recreating realistic parton-level event configurations from jets is a crucial task for various physics analyses. However, hadronization processes cannot be computed using perturbative QCD. Therefore, it has been traditionally intractable to reconstruct parton-level events after hadronization.
We present a generative machine learning approach for reconstructing jet showers at the parton...
The major goal of Imaging Atmospheric Cherenkov Telescopes (IACTs) is the investigation of gamma-ray sources through the detection of their induced air showers. For every detected gamma ray, there are up to 10000 cosmic ray protons present forming the background, which also needs to be studied. For a detailed understanding of the instrument for deriving its response to both gamma rays and...
The Large Hadron Collider (LHC) at CERN pushes the boundaries of particle physics, generating data at unprecedented rates and requiring advanced computational techniques to process information in real time. While experimental environments between LHC experiments can differ, common challenges can be identified in the area of real-time reconstruction including the use of specialized trigger...
A calibration of the ATLAS flavor-tagging algorithms using a new calibration procedure based on optimal transportation maps is presented. Simultaneous, continuous corrections to the $b$-, $c$-, and light flavor classification probabilities from jet tagging algorithms in simulation are derived for $b$-jets using $t\bar t \to b \bar b e \mu \nu \nu$ events. After application of the derived...
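In one dimension, an optimal transport map between two distributions reduces to quantile matching, F_target^{-1}(F_source(x)). The toy below illustrates this mechanism for correcting a simulated score toward data; the Gaussian samples are placeholders, not the ATLAS tagger outputs:

```python
import numpy as np

rng = np.random.default_rng(5)

# Toy "simulation" and "data" scores with different locations and widths.
sim = rng.normal(0.0, 1.0, 20_000)
data = rng.normal(0.3, 1.2, 20_000)

def ot_map_1d(values, source, target):
    """Monotone (quantile-matching) transport map from source to target.

    In 1D the optimal transport map is F_target^{-1}(F_source(x)),
    estimated here from empirical quantiles.
    """
    q = np.searchsorted(np.sort(source), values) / len(source)
    return np.quantile(target, np.clip(q, 0.0, 1.0))

corrected = ot_map_1d(sim, sim, data)
print(corrected.mean(), corrected.std())  # close to the data's 0.3 and 1.2
```

The actual calibration derives simultaneous, continuous corrections for the full set of flavor probabilities; this sketch only shows why a transport map yields a smooth, monotone correction in the 1D case.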
I report the final results of the Fast Calorimeter Challenge 2022: 23 collaborations submitted 59 samples across all 4 datasets. I will show how these rank on various metrics judging shower quality, generation time, and other properties. From these results, I present the current state-of-the-art Pareto fronts for using deep generative models on high-dimensional datasets in high-energy...
The development of analysis methods that can distinguish potential beyond the Standard Model phenomena in a model-agnostic way can significantly enhance the discovery reach in collider experiments. However, the typical machine learning (ML) algorithms employed for this task require fixed length and ordered inputs that break the natural permutation invariance in collider events. To address this...
Generative models are on a fast track to becoming a mainstay in particle physics simulation chains, seeing active work towards adoption by nearly every large experiment and collaboration. However, the question of estimating the uncertainties and statistical expressiveness of samples produced by generative ML models is still far from settled.
Recently, combinations of generative and...
There has been significant work recently in developing machine learning (ML) models in high energy physics (HEP) for tasks such as classification, simulation, and anomaly detection. Often these models are adapted from those designed for datasets in computer vision or natural language processing, which lack inductive biases suited to HEP data, such as equivariance to its inherent symmetries....
Recently, Kolmogorov-Arnold Networks (KANs) have been proposed as an alternative to multilayer perceptrons, suggesting advantages in performance and interpretability. In this talk, we present the first application of KANs in high-energy physics, focusing on a typical binary classification task involving high-level features.
We study KANs with different depths and widths and include a...
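The KAN idea can be sketched schematically: instead of fixed activations on nodes, each edge carries a learnable univariate function, and outputs are sums of these edge functions. The toy layer below uses Gaussian bumps as a stand-in for the splines of actual KAN implementations, with random (untrained) coefficients:

```python
import numpy as np

rng = np.random.default_rng(6)
n_in, n_out, n_basis = 4, 2, 8

centers = np.linspace(-2, 2, n_basis)
coef = rng.standard_normal((n_out, n_in, n_basis)) * 0.1  # learnable in a real KAN

def kan_layer(x):
    """Schematic KAN layer: a learnable univariate function on every edge.

    Each edge function f_{q,p} is a linear combination of fixed Gaussian
    bumps; the q-th output is y_q = sum_p f_{q,p}(x_p).
    """
    basis = np.exp(-(x[:, None] - centers) ** 2)  # (n_in, n_basis)
    return np.einsum("qpb,pb->q", coef, basis)    # (n_out,)

y = kan_layer(rng.standard_normal(n_in))
print(y.shape)  # (2,)
```

Because each edge function is low-dimensional, it can be plotted directly, which is the source of the interpretability claims studied in the talk.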
In the realm of high-energy physics, the use of graph network-based implementations offers the advantage of handling input datasets more closely aligned with their collection process in collider experiments. GNN-based approaches address the graph anomaly detection problem by utilizing information about graph features and structures to effectively learn to score anomalies. We represent a single...
We are presenting the first calibration of the jet pT regression (CMS-DP-2024-064), achieving an expected improvement in jet resolution of up to 17%, and the latest performance results for flavor identification and jet energy resolution estimation using ParticleNet. The pT regression, which focuses on correcting the reconstructed jet pT to the truth-level jet pT, is divided into two...
We introduce a model-agnostic search for new physics in the dijet final state. Other than the requirement of a narrow dijet resonance with a mass in the range of 1.8-6 TeV, minimal additional assumptions are placed on the signal hypothesis. Search regions are obtained by utilizing multivariate machine learning methods to select jets with anomalous substructure. A collection of complementary...
A key step in any resonant anomaly detection search is accurate estimation of the background distribution in each signal region. Data-driven methods like CATHODE accomplish this by training separate density estimators on the complement of each signal region, and interpolating them into their corresponding signal regions. Having to re-train the density estimator on essentially the entire...
We introduce TRANSIT, a conditional adversarial network for continuous interpolation of data. It is designed to construct a background data template for semi-supervised searches for new physics processes at the LHC, by smoothly transforming sideband events to match signal region mass distributions.
We demonstrate the performance of TRANSIT using the LHC Olympics R&D dataset. The method...
To compare collider experiments, measured data must be corrected for detector distortions through a process known as unfolding. As measurements become more sophisticated, the need for higher-dimensional unfolding increases, but traditional techniques have limitations. To address this, machine learning-based unfolding methods were recently introduced. In this work, we introduce OmniFoldHI, an...
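The traditional binned baseline that ML-based methods extend can be shown in a few lines: iterative (D'Agostini / Richardson-Lucy) unfolding of a toy 3-bin spectrum through a known response matrix. The numbers are illustrative, and the noiseless setup hides the regularization issues that arise with real data:

```python
import numpy as np

# Toy 3-bin truth spectrum and a known detector response matrix.
truth = np.array([100.0, 60.0, 30.0])
R = np.array([[0.80, 0.15, 0.05],
              [0.15, 0.70, 0.15],
              [0.05, 0.15, 0.80]])  # R[i, j] = P(measured bin i | true bin j)
measured = R @ truth

# D'Agostini / Richardson-Lucy iterative unfolding from a flat prior.
est = np.full(3, measured.sum() / 3)
for _ in range(500):
    folded = R @ est
    est = est * (R.T @ (measured / folded))

print(est)  # converges back toward the truth spectrum
```

The limitations of this approach, binning and low dimensionality, are exactly what unbinned ML-based unfolding aims to remove.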
Machine learning-based unfolding has started to establish itself as the go-to approach for precise, high-dimensional unfolding tasks. The current state-of-the-art unfolding methods can be divided into reweighting-based and generation-based methods. The latter consists of conditional generative models, which generate new truth-level events from random noise conditioned on...
Measurements of jet substructure are key to probing the energy frontier at colliders, and many of them use track-based observables which take advantage of the angular precision of tracking detectors. Theoretical calculations of track-based observables require “track functions”, which characterize the transverse momentum fraction $r_q$ carried by charged hadrons from a fragmenting quark or...
The measurements performed by particle physics experiments must account for the imperfect response of the detectors used to observe the interactions. One approach, unfolding, statistically adjusts the experimental data for detector effects. Recently, generative machine learning models have shown promise for performing unbinned unfolding in a high number of dimensions. However, all current...
Simulating showers of particles in highly-granular detectors is a key frontier in the application of machine learning to particle physics. Achieving high accuracy and speed with generative machine learning models can enable them to augment traditional simulations and alleviate a major computing constraint.
Recent developments have shown how diffusion based generative shower simulation...
Many physics analyses at the LHC rely on algorithms to remove detector effects, a procedure commonly known as unfolding. Whereas classical methods only work with binned, one-dimensional data, machine learning promises to overcome both limitations. Using a generative unfolding pipeline, we show how it can be built into an existing LHC analysis designed to measure the top mass. We discuss the model-dependence...
We present an application of Simulation-Based Inference (SBI) in collider physics, aiming to constrain anomalous interactions beyond the Standard Model (SM). This is achieved by leveraging Neural Networks to learn otherwise intractable likelihood ratios. We explore methods to incorporate the underlying physics structure into the likelihood estimation process. Specifically, we compare two...
We propose a new approach to learning powerful jet representations directly from unlabelled data. The method employs a Particle Transformer to predict masked particle representations in a latent space, overcoming the need for discrete tokenization and enabling it to extend to arbitrary input features beyond the Lorentz four-vectors. We demonstrate the effectiveness and flexibility of this...
Machine learning has become an essential tool in jet physics. Due to their complex, high-dimensional nature, jets can be explored holistically by neural networks in ways that are not possible manually. However, innovations in all areas of jet physics are proceeding in parallel. We show that large machine learning models trained for a jet classification task can improve the accuracy, precision,...
Correcting for detector effects in experimental data, particularly through unfolding, is critical for enabling precision measurements in high-energy physics. However, traditional unfolding methods face challenges in scalability, flexibility, and dependence on simulations. We introduce a novel approach to multidimensional particle-wise unfolding using conditional Denoising Diffusion...
OmniJet-alpha is the first cross-task foundation model for particle physics, demonstrating transfer learning between an unsupervised problem (jet generation) and a classic supervised task (jet tagging). While OmniJet-alpha is still at a prototype stage, the successful development of foundation models for physics data would represent a major breakthrough, as they have the potential to enhance...
Neural Simulation-Based Inference (NSBI) is a powerful class of machine learning (ML)-based methods for statistical inference that naturally handle high dimensional parameter estimation without the need to bin data into low-dimensional summary histograms. Such methods are promising for a range of measurements at the Large Hadron Collider, where no single observable may be optimal to scan over...
This study proposes a new method for training foundation models designed explicitly for jet-related tasks. Like those seen in large language models, a foundation model is a pre-trained model that can be fine-tuned for various applications and is not limited to a specific task. Previous approaches often involve randomly masking inputs, such as tracks within a jet, and then predicting the masked...
Determining the form of the Higgs potential is one of the most exciting challenges of modern particle physics. Higgs pair production directly probes the Higgs self-coupling and should be observed in the near future at the High-Luminosity LHC. We explore how to improve the sensitivity to physics beyond the Standard Model through per-event kinematics for di-Higgs events. In particular, we employ...
This study introduces an innovative approach to analyzing unlabeled data in high-energy physics (HEP) through the application of self-supervised learning (SSL).
Faced with the increasing computational cost of producing high-quality labeled simulation samples at the CERN LHC, we propose leveraging large volumes of unlabeled data to overcome the limitations of supervised learning methods, which...
The high-luminosity era of the LHC will pose unprecedented challenges to the detectors. To meet these challenges, the CMS detector will undergo several upgrades, including the replacement of the current endcap calorimeters with a novel High-Granularity Calorimeter (HGCAL). To make optimal use of this innovative detector, novel algorithms have to be invented. A dedicated reconstruction framework,...
Weakly supervised anomaly detection has been shown to have great potential for improving traditional resonance searches. We demonstrate that weak supervision offers a unique opportunity to turn a resonance search into a simple cut-and-count experiment, where the potential problem of background sculpting in a traditional bump hunt is absent. Moreover, the cut-and-count setting allows working...
In this talk I will give a biased review of work at the intersection of machine learning and theoretical physics. This includes how we can use transformers to obtain symbolic expressions without having information about the target expression. In turn, I present a benchmark that human physicists have so far failed to solve, namely that of compact Calabi-Yau metrics, and give a short status report on...