A major task in particle physics is the measurement of rare signal processes. These measurements depend strongly on how accurately signal events can be classified against the huge background of other Standard Model processes. Reducing the background by a few tens of percent at the same signal efficiency can already increase the sensitivity considerably.
This study demonstrates...
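(For a rough sense of scale, under a simple counting-experiment assumption not spelled out in the abstract: with significance $Z \approx S/\sqrt{B\,}$, reducing the background by 30% at fixed signal efficiency improves the sensitivity by a factor $1/\sqrt{0.7} \approx 1.20$, i.e. roughly 20%.)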
During the Run 1 and Run 2 data-taking campaigns of the Large Hadron Collider (LHC), the ALICE collaboration collected large samples of proton-proton (pp) collisions across a variety of center-of-mass energies ($\sqrt{s\,}$). This extensive dataset is well suited to studying the energy dependence of particle production. Deep neural networks (DNNs) provide a powerful regression tool to capture...
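As an illustration of the kind of regression involved, a minimal sketch follows; the architecture, training loop, and toy numbers are invented for this example and are not taken from the abstract.

```python
import torch
import torch.nn as nn

# Illustrative sketch only: regress a particle-production observable (here a toy
# "mean multiplicity") from the centre-of-mass energy. All numbers are invented.
model = nn.Sequential(nn.Linear(1, 64), nn.ReLU(),
                      nn.Linear(64, 64), nn.ReLU(),
                      nn.Linear(64, 1))
opt = torch.optim.Adam(model.parameters(), lr=1e-3)

log_sqrt_s = torch.log(torch.tensor([[0.9], [2.76], [5.02], [7.0], [13.0]]))  # TeV
target = torch.tensor([[3.8], [4.8], [5.4], [5.7], [6.5]])                    # toy values

for _ in range(2000):           # plain mean-squared-error regression
    opt.zero_grad()
    loss = nn.functional.mse_loss(model(log_sqrt_s), target)
    loss.backward()
    opt.step()
```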
We propose a new machine-learning-based method to play the devil's advocate and investigate the impact of unknown systematic effects in a quantitative way. The method proceeds by reversing the measurement process, using the physics results to interpret systematic effects under the Standard Model hypothesis. We explore this idea with two alternative approaches: one relies on a...
The Fair Universe project is building a large-compute-scale AI ecosystem for sharing datasets, training large models and hosting challenges and benchmarks. Furthermore, the project is exploiting this ecosystem for an AI challenge series focused on minimizing the effects of systematic uncertainties in High-Energy Physics (HEP), and on predicting accurate confidence intervals. This talk will...
Tracking, the reconstruction of particle trajectories from hits in the inner detector, is a computationally intensive task due to the large combinatorics of detector signals. Recent efforts have shown that ML techniques can be successfully applied to the tracking problem, extending and improving the conventional methods based on feature engineering. However, the inference of complex networks...
Deep learning, especially graph neural networks, has significantly improved tracking performance in modern particle detectors while reducing runtimes compared to previous state-of-the-art approaches. However, training neural networks requires significant amounts of labeled data, usually acquired by performing complex particle simulations. We present first studies of leveraging deep reinforcement...
We propose a differentiable vertex fitting algorithm that can be used for secondary vertex fitting, and that can be seamlessly integrated into neural networks for jet flavour tagging. Vertex fitting is formulated as an optimization problem where gradients of the optimized solution vertex are defined through implicit differentiation and can be passed to upstream or downstream neural network...
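A minimal sketch of the implicit-differentiation idea, assuming straight-line tracks and a quadratic $\chi^2$; all function names are invented and this is not the authors' implementation.

```python
import numpy as np

def chi2_grad_hess(v, points, dirs):
    """Gradient and Hessian of chi2(v) = sum_i |P_i (v - p_i)|^2, where P_i
    projects orthogonally to the (unit) direction d_i of track i."""
    g, H = np.zeros(3), np.zeros((3, 3))
    for p, d in zip(points, dirs):
        P = np.eye(3) - np.outer(d, d)
        g += 2.0 * P @ (v - p)
        H += 2.0 * P
    return g, H

def fit_vertex(points, dirs):
    """Newton fit; the chi2 is quadratic, so a single step reaches the minimum
    (requires at least two non-parallel tracks for an invertible Hessian)."""
    v = points.mean(axis=0)
    g, H = chi2_grad_hess(v, points, dirs)
    return v - np.linalg.solve(H, g)

def dvertex_dpoint(v_star, points, dirs, i, eps=1e-6):
    """dv*/dp_i via the implicit function theorem:
    grad_v chi2(v*, p) = 0  =>  dv*/dp_i = -H^{-1} d(grad_v chi2)/dp_i."""
    g0, H = chi2_grad_hess(v_star, points, dirs)
    J = np.zeros((3, 3))
    for k in range(3):                     # numeric columns of d(grad)/dp_i
        pts = points.copy()
        pts[i, k] += eps
        gk, _ = chi2_grad_hess(v_star, pts, dirs)
        J[:, k] = (gk - g0) / eps
    return -np.linalg.solve(H, J)
```

In a full network, this Jacobian is what lets gradients flow from the fitted vertex back to the track inputs during end-to-end training.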
We have been studying the use of deep neural networks (DNNs) to identify and locate primary vertices (PVs) in proton-proton collisions at the LHC. Earlier work focused on finding primary vertices in simulated LHCb data using a hybrid approach that started with kernel density estimators (KDEs) derived heuristically from the ensemble of charged-track parameters and predicted “target histogram”...
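A toy sketch of the KDE ingredient (the z positions, bandwidth, and binning below are invented for illustration):

```python
import numpy as np
from scipy.stats import gaussian_kde

rng = np.random.default_rng(0)
# Toy z coordinates of track closest approach to the beamline: two PVs plus noise.
z_tracks = np.concatenate([rng.normal(-3.0, 0.05, 40),
                           rng.normal(1.2, 0.05, 25),
                           rng.uniform(-10, 10, 5)])
kde = gaussian_kde(z_tracks, bw_method=0.05)
z_grid = np.linspace(-10, 10, 4000)   # fine bins along the beamline
density = kde(z_grid)                 # this density is the DNN input; the network
                                      # learns to output a target histogram that
                                      # peaks at the true PV positions
```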
We present an end-to-end reconstruction algorithm for highly granular calorimeters that includes track information to aid the reconstruction of charged particles. The algorithm starts from calorimeter hits and reconstructed tracks, and outputs a coordinate transformation in which all shower objects are well separated from each other, and in which clustering becomes trivial. Shower properties...
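Once hits live in such a transformed space, the final step can be as simple as off-the-shelf density-based clustering; a sketch with invented shapes (the clustering actually used by the authors may differ):

```python
import numpy as np
from sklearn.cluster import DBSCAN

learned = np.random.rand(500, 3)      # network output: one learned coordinate per hit
labels = DBSCAN(eps=0.05, min_samples=5).fit_predict(learned)
# labels[i] is the shower index assigned to hit i; -1 marks unclustered noise
```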
The Alpha Magnetic Spectrometer (AMS-02) is a precision high-energy cosmic-ray experiment that has been operating on the ISS since 2011 and has collected more than 228 billion particles. Among them, positrons are important for understanding the particle nature of dark matter. Separating positrons from the cosmic background of protons is challenging above 1 TeV. Therefore, we use state-of-the-art convolutional and...
Full statistical models encapsulate the complete information of an experimental result, including the likelihood function given the observed data. A few years ago, ATLAS started publishing statistical models that can be reused via the pyhf framework, a major step towards fully publishing LHC results. In the case of fast Simplified Model Spectra based reinterpretation, we are often only...
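For context, reusing a published likelihood through pyhf looks roughly like this (the single-bin counts are invented for illustration; real published models are much larger):

```python
import pyhf

# Hypothetical single-bin counting model: 5 expected signal events over a
# background of 50 +/- 7, with 52 events observed.
model = pyhf.simplemodels.uncorrelated_background(
    signal=[5.0], bkg=[50.0], bkg_uncertainty=[7.0]
)
data = [52.0] + model.config.auxdata          # observed count + auxiliary data
cls_obs = pyhf.infer.hypotest(1.0, data, model, test_stat="qtilde")
print("CLs for signal strength mu=1:", cls_obs)
```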
The Data-Directed paradigm (DDP) is a search strategy for efficiently probing new physics in a large number of spectra with smoothly falling SM backgrounds. Unlike the traditional analysis strategy, DDP avoids the need for a simulated or functional-form-based background estimate by directly predicting the statistical significance, using a convolutional neural network trained to regress the...
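The abstract's exact regression target is truncated above; a standard choice for a per-bin significance, shown here as an assumption rather than necessarily theirs, is the asymptotic formula of Cowan, Cranmer, Gross, and Vitells:

```python
import numpy as np

def asymptotic_z(n, b):
    """Asymptotic significance of observing n events given expected background b
    (Cowan et al.); sign encodes excess vs. deficit."""
    n, b = np.asarray(n, float), np.asarray(b, float)
    log_term = np.where(n > 0, n * np.log(np.where(n > 0, n, 1.0) / b), 0.0)
    z2 = 2.0 * (log_term - (n - b))
    return np.sign(n - b) * np.sqrt(np.clip(z2, 0.0, None))

print(asymptotic_z(130, 100))   # ~2.9 sigma excess
```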
Heavy-flavour jets underpin a large part of the ATLAS physics programme, for example in analyses of Higgs boson decays to quarks and in supersymmetry searches with b-jets. The algorithms for identifying jets originating from b- and c-quarks are instrumental in these efforts, with the recently introduced GN2 model [1] showing remarkable improvements in tagging efficiency. Given its complexity and data...
The BERT pretraining paradigm has proven to be highly effective in many domains, including natural language processing, image processing, and biology. To apply the BERT paradigm, the data need to be described as a set of tokens, and each token needs to be labelled. To date, the BERT paradigm has not been explored in the context of HEP. The samples that form the data used in HEP can be described...
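A minimal sketch of what BERT-style masked pretraining could look like on tokenised events (the vocabulary size, shapes, and masking fraction are all invented):

```python
import torch
import torch.nn as nn

vocab, d_model = 512, 64
embed = nn.Embedding(vocab + 1, d_model)          # extra index acts as [MASK]
encoder = nn.TransformerEncoder(
    nn.TransformerEncoderLayer(d_model, nhead=4, batch_first=True), 2)
head = nn.Linear(d_model, vocab)

tokens = torch.randint(0, vocab, (8, 32))         # 8 events, 32 particle tokens each
mask = torch.rand(tokens.shape) < 0.15            # hide 15% of the tokens
inputs = tokens.masked_fill(mask, vocab)          # replace them with the [MASK] id
logits = head(encoder(embed(inputs)))
loss = nn.functional.cross_entropy(logits[mask], tokens[mask])  # predict hidden tokens
```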
Most searches at the LHC employ an analysis pipeline consisting of various discrete components, each individually optimized and later combined to provide relevant features used to discriminate SM background from potential signal. These are typically high-level features constructed from particle four-momenta. However, the combination of individually optimized tasks does not guarantee an optimal...
Self-Supervised Learning (SSL) is at the core of training modern large ML models, providing a scheme for learning powerful representations in base models that can be used in a variety of downstream tasks. However, SSL training strategies must be adapted to the type of training data, thus driving the question: what are powerful SSL strategies for collider physics data? In the talk, we present a...
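As a concrete reference point, one widely used SSL objective is the InfoNCE contrastive loss between two augmented views of the same event; a sketch follows (the talk's actual strategy may differ):

```python
import torch
import torch.nn.functional as F

def info_nce(z1, z2, temperature=0.1):
    """InfoNCE loss for two augmented views, each of shape (batch, dim)."""
    z1, z2 = F.normalize(z1, dim=1), F.normalize(z2, dim=1)
    logits = z1 @ z2.t() / temperature      # cosine similarities across the batch
    labels = torch.arange(z1.size(0))       # matching pairs sit on the diagonal
    return 0.5 * (F.cross_entropy(logits, labels) +
                  F.cross_entropy(logits.t(), labels))
```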
In High Energy Physics, detailed and time-consuming simulations are used to model particle interactions with detectors. Bypassing these simulations with a generative model requires generating large point clouds in a short time, while correctly modelling the complex dependencies between the particles. Particle showers are inherently tree-based processes, as each particle is...
Addressing the challenge of Out-of-Distribution (OOD) multi-set generation, we introduce YonedaVAE, a novel equivariant deep generative model inspired by Category Theory, motivating the Yoneda-Pooling mechanism. This approach presents a learnable Yoneda Embedding to encode the relationships between objects in a category, providing a dynamic and generalizable representation of complex...
Simulating particle physics data is a crucial yet computationally expensive aspect of analyzing data at the LHC. Typically, in fast simulation methods, we rely on a surrogate calorimeter model with a subsequent reconstruction algorithm to generate a set of reconstructed objects. This work demonstrates the potential to generate these reconstructed objects in one shot, effectively replacing both...
We show that employing a sophisticated neural network emulation of QCD multijet matrix elements based on dipole factorisation can lead to a drastic acceleration of unweighted event generation in high-multiplicity LHC production processes. We incorporate these emulations as fast and accurate surrogates in a two-stage rejection sampling algorithm within the SHERPA Monte Carlo that yields...
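The two-stage idea can be sketched with a toy weight function (everything below is invented; the real surrogate is a neural-network emulation of the dipole-factorised matrix element inside SHERPA):

```python
import numpy as np

rng = np.random.default_rng(1)
w_true = lambda x: np.exp(-x) * (1.0 + 0.1 * np.sin(5 * x))   # "expensive" weight
w_surr = lambda x: np.exp(-x)                                  # cheap surrogate
W_SURR_MAX, RATIO_MAX = 1.0, 1.1                               # overestimates

accepted = []
for _ in range(100_000):
    x = rng.uniform(0.0, 5.0)                   # toy phase-space point
    if rng.random() < w_surr(x) / W_SURR_MAX:   # stage 1: surrogate unweighting
        r = w_true(x) / w_surr(x)               # exact weight only for survivors
        if rng.random() < r / RATIO_MAX:        # stage 2: correct surrogate bias
            accepted.append(x)
```

Stage 1 rejects the bulk of points using only the cheap surrogate; the expensive exact weight is evaluated only for the survivors, and stage 2 removes the surrogate's residual bias so the accepted sample follows the true distribution.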
Hadronization is a critical step in the simulation of high-energy particle and nuclear physics experiments. As there is no first-principles understanding of this process, physically inspired hadronization models have a large number of parameters that are fit to data. We propose an alternative approach that uses deep generative models, which are a natural replacement for classical techniques,...
The use of modern ML techniques to automate the search for anomalies in collider physics is a very active and prolific field. Typical cases are the search for signatures of physics beyond the Standard Model and the identification of problems in the detector systems that would lead to bad-quality data, unusable for physics analysis. We are interested in the second type of task, which can...
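For data-quality monitoring, a common unsupervised baseline is an autoencoder trained only on histograms from good runs; a minimal sketch with invented shapes (not necessarily the method of this talk):

```python
import torch
import torch.nn as nn

class AE(nn.Module):
    """Tiny autoencoder over a binned monitoring histogram."""
    def __init__(self, n_bins=100, latent=8):
        super().__init__()
        self.enc = nn.Sequential(nn.Linear(n_bins, 64), nn.ReLU(), nn.Linear(64, latent))
        self.dec = nn.Sequential(nn.Linear(latent, 64), nn.ReLU(), nn.Linear(64, n_bins))
    def forward(self, x):
        return self.dec(self.enc(x))

model = AE()                                    # train on good-quality runs only
hist = torch.rand(1, 100)                       # one monitoring histogram
score = ((model(hist) - hist) ** 2).mean()      # large reconstruction error = anomaly
```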
Accurate knowledge of the longitudinal beam parameters is essential for optimizing the performance and operational efficiency of particle accelerators such as the Large Hadron Collider (LHC). However, conventional methods for determining them, such as fitting techniques and tracking-based longitudinal tomography, are time-consuming and limited to analyzing data from only a few bunches. To address this,...