In the deep learning era, improving neural network performance in jet physics is a rewarding task, as it directly contributes to more accurate physics measurements at the LHC. Recent research has proposed various network designs that take the full Lorentz symmetry into account, but their benefit has not yet been systematically established, given that there remain many successful networks without taking...
During Run 2 of the Large Hadron Collider (LHC), deep-learning-based algorithms were established and led to significantly improved heavy-flavor (b and c) jet tagging performance. For large-radius boosted jets, such as top-quark jets, Graph Neural Network (GNN) based models, e.g. ParticleNet, have reached state-of-the-art performance. As a step further, we present Particle Transformer...
Precise reconstruction of top quark properties is a challenging task at the Large Hadron Collider due to combinatorial backgrounds and missing information. We introduce a physics-informed neural network architecture called the Covariant Particle Transformer (CPT) for directly predicting the top quark kinematic properties from reconstructed final state objects. This approach is permutation...
A lot of attention has been paid to the applications of common machine learning methods in physics experiments and theory. However, much less attention has been paid to the methods themselves and their viability as physics modeling tools. One of the most fundamental aspects of modeling physical phenomena is the identification of the symmetries that govern them. Incorporating symmetries into a model...
Collider searches face the challenge of defining a representation of high-dimensional data such that physical symmetries are manifest, the discriminating features are retained, and the choice of representation is new-physics agnostic. We introduce JetCLR to solve the mapping from low-level data to optimized observables through self-supervised contrastive learning. As an example, we construct a...
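The core of such a contrastive setup is a loss that pulls augmented views of the same jet together in representation space and pushes different jets apart. Below is a minimal NT-Xent-style sketch in PyTorch; the tensor names and the temperature value are illustrative assumptions, not details of JetCLR itself.

    import torch
    import torch.nn.functional as F

    def nt_xent_loss(z1, z2, temperature=0.1):
        """Contrastive loss for a batch of N jet pairs.

        z1, z2: (N, d) embeddings of two augmented views of the same N jets.
        """
        n = z1.shape[0]
        z = F.normalize(torch.cat([z1, z2], dim=0), dim=1)   # (2N, d), unit vectors
        sim = z @ z.t() / temperature                         # cosine similarities
        sim.fill_diagonal_(float('-inf'))                     # drop self-similarity
        # the positive for view i is the other augmented view of the same jet
        targets = torch.cat([torch.arange(n, 2 * n), torch.arange(0, n)])
        return F.cross_entropy(sim, targets)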
In high-energy heavy-ion collisions, the unconfined state of partons known as the Quark Gluon Plasma (QGP) is known to suppress the yield of jets with respect to proton-proton collisions, as well as to modify the structure of jets that traverse it. Nonetheless, samples of heavy-ion jets, even at the highest centralities, will contain a significant fraction of jets that, for one reason or the...
We introduce a novel framework to capture the inherent topological structure of collider events. Using persistent homology, the evolution of various topological features across scales is recorded graphically in a persistence diagram, and further encoded as scalars and vectors amenable to machine learning classifiers, showing excellent performance on both jet tagging and event classification...
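Once a persistence diagram is in hand (e.g. from a TDA library such as ripser or giotto-tda), turning it into fixed-length classifier inputs can be as simple as summarizing the lifetimes of its features. The sketch below is an illustrative vectorization, not the encoding used in the talk.

    import numpy as np

    def persistence_features(diagram):
        """diagram: (n, 2) array of (birth, death) pairs for one homology dimension."""
        finite = diagram[np.isfinite(diagram[:, 1])]
        life = finite[:, 1] - finite[:, 0]                    # lifetimes of features
        if life.size == 0:
            return np.zeros(4)
        p = life / life.sum()
        entropy = -(p * np.log(p)).sum()                      # persistent entropy
        return np.array([life.sum(), life.max(), life.mean(), entropy])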
High-multiplicity signatures at particle colliders can arise in Standard Model processes and beyond. With such signatures, difficulties often arise from the large dimensionality of the kinematic space. For final states containing a single type of particle signature, this results in a combinatorial problem that hides underlying kinematic information. We explore using a neural network that...
With current and future high-energy collider experiments' vast data collecting capabilities comes an increasing demand for computationally efficient simulations. Generative machine learning models allow fast event generation, yet so far are largely constrained to fixed data and detector geometries.
We introduce a Deep Sets-based, permutation-equivariant generative adversarial network (GAN)...
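The building block that makes such a generator permutation equivariant is a Deep Sets layer combining a per-particle transformation with a pooled, set-level one. A minimal PyTorch sketch (layer sizes are placeholders) could look like:

    import torch
    import torch.nn as nn

    class EquivariantLayer(nn.Module):
        """Permutation-equivariant layer: per-particle term plus a pooled set term."""
        def __init__(self, d_in, d_out):
            super().__init__()
            self.per_particle = nn.Linear(d_in, d_out)
            self.pooled = nn.Linear(d_in, d_out, bias=False)

        def forward(self, x):                     # x: (batch, n_particles, d_in)
            return self.per_particle(x) + self.pooled(x.mean(dim=1, keepdim=True))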
Particle Cloud Generation
There has been significant development recently in generative models for accelerating LHC simulations. Work on simulating jets has primarily used image-based representations, which tend to be sparse and of limited resolution. We advocate for the more natural ‘particle cloud’ representation of jets, i.e. as a set of particles in momentum space, and discuss...
Machine-learning-based data generation has become a major topic in particle physics, as the current Monte Carlo simulation approach is computationally challenging for future colliders, which will have a significantly higher luminosity. Generating particles poses challenges similar to those of point-cloud generation. We propose that a transformer setup is well suited to this task....
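The point-cloud analogy suggests a transformer encoder without positional encodings, so the generated set of particles carries no artificial ordering. A rough sketch under that assumption (dimensions and particle multiplicity are illustrative, not the setup presented):

    import torch
    import torch.nn as nn

    d_model, n_particles = 64, 30
    layer = nn.TransformerEncoderLayer(d_model=d_model, nhead=4, batch_first=True)
    encoder = nn.TransformerEncoder(layer, num_layers=3)
    embed = nn.Linear(4, d_model)      # per-particle noise -> latent tokens
    project = nn.Linear(d_model, 3)    # latent tokens -> (pt, eta, phi)

    noise = torch.randn(8, n_particles, 4)         # a batch of 8 generated "jets"
    particles = project(encoder(embed(noise)))      # no positional encoding: set-valued output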
High-precision theory predictions require the numerical integration of high-dimensional phase-space integrals and the simultaneous generation of unweighted events to feed the full simulation chain and subsequent analyses. While current methods are based on first principles and are mathematically guaranteed to converge to the correct answer, the computational cost to decrease the numerical...
I will give an overview of recent progress in less-than-supervised methods for new physics searches at the LHC.
An application of unsupervised machine learning-based anomaly detection to a generic dijet resonance is presented using the full LHC Run 2 dataset collected by ATLAS. A novel variational recurrent neural network (VRNN) is trained on data, specifically large-radius jets that are modeled using a sequence of constituent four-vectors and substructure variables, to identify anomalous jets based...
Anomaly Detection algorithms are crucial tools for identifying unusual decays from proton collisions at the LHC and are efficient methods for seeking out the possibility of new physics. These detection algorithms should be robust against nuisance kinematic variables and detector conditions. To achieve this robustness, popular detection models built via autoencoders, for example, have to go...
I discuss several approaches to anomaly detection in collider physics, including variational autoencoders, which rely on the ability to reconstruct certain types of data (background) but not others (signals), and optimal transport distances, which measure how easily one pT distribution can be changed into another. I discuss advantages and challenges associated with each approach....
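For one-dimensional pT spectra, the optimal transport distance reduces to the 1D Wasserstein distance, which SciPy computes directly; an anomaly score can then be the distance between an event's spectrum and a background reference. The names below are illustrative placeholders.

    import numpy as np
    from scipy.stats import wasserstein_distance

    rng = np.random.default_rng(0)
    background_pt = rng.exponential(scale=30.0, size=10_000)   # stand-in background spectrum
    event_pt = rng.exponential(scale=45.0, size=50)             # constituents of one event

    anomaly_score = wasserstein_distance(event_pt, background_pt)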
"ML connections between industry and HEP"
AtlFast3 is the new, high-precision fast simulation in ATLAS that was deployed by the collaboration to replace AtlFastII, the fast simulation tool that was successfully used for most of Run 2. AtlFast3 combines a parametrization-based Fast Calorimeter Simulation and a new machine-learning-based Fast Calorimeter Simulation based on Generative Adversarial Networks (GANs). The new fast simulation...
Simulating particle detector response is the single most computationally expensive step in the Large Hadron Collider computational pipeline. Recently it was shown that normalizing flows can accelerate this process while achieving unprecedented levels of accuracy (CaloFlow).
Applying CaloFlow to the photon and charged pion GEANT4 showers of Dataset 1 of the Fast Calorimeter Simulation...
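The workhorse of flows such as CaloFlow is the invertible coupling layer, which transforms half of the features conditioned on the other half while keeping the Jacobian tractable. A minimal, unconditional affine-coupling sketch in PyTorch (the real model conditions on incident energy and stacks many such blocks):

    import torch
    import torch.nn as nn

    class AffineCoupling(nn.Module):
        def __init__(self, dim, hidden=64):
            super().__init__()
            self.d = dim // 2
            self.net = nn.Sequential(
                nn.Linear(self.d, hidden), nn.ReLU(),
                nn.Linear(hidden, 2 * (dim - self.d)))

        def forward(self, x):                       # x: (batch, dim)
            x1, x2 = x[:, :self.d], x[:, self.d:]
            s, t = self.net(x1).chunk(2, dim=-1)
            s = torch.tanh(s)                       # keep scale factors bounded
            y2 = x2 * torch.exp(s) + t              # invertible, triangular Jacobian
            log_det = s.sum(dim=-1)                 # contribution to the log-likelihood
            return torch.cat([x1, y2], dim=-1), log_det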
Score-based generative models are a new class of generative algorithms that have been shown to produce realistic images even in high dimensional spaces, currently surpassing other state-of-the-art models for different benchmark categories and applications. In this work we introduce CaloScore, a score-based generative model for collider physics applied to calorimeter shower generation. Three...
The efficient simulation of particle propagation and interaction within the detectors of the Large Hadron Collider is of primary importance for precision measurements and new physics searches. The most computationally expensive simulations involve calorimeter showers, which will become ever more costly and high-dimensional as the Large Hadron Collider moves into its High Luminosity era....
Simulation in High Energy Physics (HEP) places a heavy burden on the available computing resources and is expected to become a major bottleneck for the upcoming high luminosity phase of the LHC and for future Higgs factories, motivating a concerted effort to develop computationally efficient solutions. Generative machine learning methods hold promise to alleviate the...
Simulation of calorimeter response is important for modern high energy physics experiments. With the increasingly large and high granularity design of calorimeters, the computational cost of conventional MC-based simulation of each particle-material interaction is becoming a major bottleneck. We propose a new generative model based on a Vector-Quantized Variational Autoencoder (VQ-VAE) to...
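The characteristic piece of a VQ-VAE is the quantization bottleneck: each encoder output is snapped to its nearest codebook vector, with a straight-through estimator carrying gradients past the non-differentiable argmin. A generic PyTorch sketch (codebook size and loss weights are placeholders, not the model presented):

    import torch
    import torch.nn as nn

    class VectorQuantizer(nn.Module):
        def __init__(self, num_codes=512, dim=64, beta=0.25):
            super().__init__()
            self.codebook = nn.Embedding(num_codes, dim)
            self.beta = beta

        def forward(self, z):                        # z: (batch, dim) encoder outputs
            dist = torch.cdist(z, self.codebook.weight)      # distances to all codes
            idx = dist.argmin(dim=1)                          # nearest-code assignment
            z_q = self.codebook(idx)
            loss = ((z_q - z.detach()) ** 2).mean() + \
                   self.beta * ((z_q.detach() - z) ** 2).mean()
            z_q = z + (z_q - z).detach()             # straight-through gradient
            return z_q, idx, loss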
A realistic detector simulation is an essential component of experimental particle physics. However, it is currently very inefficient computationally since large amounts of resources are required to produce, store, and distribute simulation data. Deep generative models allow for more cost-efficient and faster simulations. Nevertheless, generating detector responses is a highly non-trivial task...
A study of different jet observables in high $Q^{2}$ Deep-Inelastic Scattering events close to the Born kinematics is presented. Differential and multi-differential cross-sections are presented as a function of the jet’s charged constituent multiplicity, momentum dispersion, jet charge, as well as three values of jet angularities. Results are split into multiple $Q^{2}$ intervals, probing the...
CMS has a wide search program making use of ML for jet tagging and event reconstruction. This talk will report recent usage of ML in searches for heavy resonances involving boosted W, Z, H and top quark jets.
This talk will present the performance of constituent-based jet taggers on large-radius boosted top quark jets reconstructed from optimized jet input objects in simulated collisions at $\sqrt{s} = 13$ TeV. Several taggers that use the full kinematic information of the jet constituents are tested and compared to a tagger which relies on high-level summary quantities...
Machine learning (ML) plays a significant role in the physics analyses at the CMS experiment. Many different techniques and strategies have been deployed to a wide range of applications. In this presentation we will illustrate the most advanced techniques used in top quark physics measurements, such as using ML algorithms to improve the extraction of effective field theory contributions, and...
Deep learning is a standard tool in high-energy physics, facilitating identification of physics objects. In particular, complex neural network architectures play a major role for jet flavor tagging. However, these methods are reliant on accurate simulations and a calibration is required to treat non-negligible performance differences with respect to data. In order to reduce residual...
The unfolding of detector effects impacting experimental measurements is crucial for the comparison of data to theory predictions. While traditional methods were limited to low dimensional data, machine learning has enabled new techniques to unfold high-dimensional data. Generative networks like conditional Invertible Neural Networks (cINN) enable a probabilistic unfolding, which map...
In high-energy physics experiments, estimating the efficiency of a process using selection cuts is a widely used technique. However, this method is limited by the number of events that can be simulated in the required analysis phase space. A way to improve this sensitivity is to use efficiency weights instead of selecting events with cuts. This method of efficiency measurement is...
We present a new algorithm that identifies reconstructed jets originating from hadronic decays of tau leptons against those from quarks or gluons. No tau lepton reconstruction algorithm is used. Instead, the algorithm represents jets as heterogeneous graphs using the associated low-level objects such as tracks and energy clusters and trains a Graph Neural Network (GNN) to identify hadronically...
The matrix element method is widely considered the ideal approach to LHC inference, but it is computationally expensive. We show how a combination of two conditional Invertible Neural Networks can be used to learn the transfer function between parton level and reconstructed objects, and to make integrating out the partonic phase space numerically tractable. We illustrate our approach for the...
QCD factorization allows us to model the jet energy-loss in A-A collisions as a convolution between the jet cross section in p-p collisions and an energy loss distribution. Meanwhile, Bayesian inference provides a data-driven way of constraining the energy loss distribution parameterization. Only a few efforts have been made in this direction, and solely using untagged jets. However, gluon and...
Tau leptons are a key ingredient to perform many Standard Model measurements and searches for new physics at LHC. The CMS experiment has released a new algorithm to discriminate hadronic tau lepton decays against jets, electrons, and muons. The algorithm is based on a deep neural network and combines fully connected and convolutional layers. It combines information from all individual...
Uncertainty estimation is a crucial issue when considering the application of deep neural network to problems in high energy physics such as jet energy calibrations.
We introduce and benchmark a novel algorithm that quantifies uncertainties by Monte Carlo sampling from the model's Gibbs posterior distribution. Unlike the established 'Bayes By Backpropagation' training regime, it does not...
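One common way to draw Monte Carlo samples from a posterior over network weights is stochastic gradient Langevin dynamics, which adds calibrated noise to ordinary gradient steps; whether the talk's algorithm uses exactly this scheme is not stated here, so the sketch below is only illustrative of posterior sampling in general.

    import torch

    def sgld_step(params, loss, lr=1e-4):
        """One Langevin update: gradient step plus Gaussian noise of matched scale."""
        grads = torch.autograd.grad(loss, params)
        with torch.no_grad():
            for p, g in zip(params, grads):
                p.add_(-lr * g + torch.randn_like(p) * (2 * lr) ** 0.5)

    # after burn-in, keep every k-th parameter snapshot; the spread of the
    # snapshots' predictions on a jet provides the uncertainty estimate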
New physics searches are usually done by training a supervised classifier to separate a signal model from a background model. However, even when the signal model is correct, systematic errors in the background model can influence supervised classifiers and might adversely affect the signal detection procedure. To tackle this problem, one approach is to find a classifier constrained to be...
Uncertainty quantification is crucial for data analysis and hypothesis testing. Many machine learning algorithms were not designed to provide information about the reliability of their predictions, and the methods for estimating uncertainties from these algorithms can lack transparency. In this talk we demonstrate the Bayesian network framework, which was developed using a rigorous formalism...
We study the benefits of jet- and event-level deep learning methods in distinguishing vector boson fusion (VBF) from gluon-gluon fusion (GGF) Higgs production at the LHC. We show that a variety of classifiers (CNNs, attention-based networks) trained on the complete low-level inputs of the full event achieve significant performance gains over shallow machine learning methods (BDTs) trained on...
In this talk, we explore machine learning-based event and jet identification at the future Electron-Ion Collider (EIC). We study the effectiveness of machine learning-based classifiers at the relatively low EIC energies, focusing on (i) identifying the flavor of the jet, in terms of both quark flavor tagging and quark vs. gluon tagging, and (ii) identifying the hard-scattering process, using...
Jets in heavy-ion collisions contain contributions from a background of soft particles. The kinematic reach into low jet momentum is largely driven by the precision of the method used to subtract this background. This precision also makes a significant contribution to the uncertainties of jet measurements. Previous studies have suggested that deep neural networks can improve momentum resolution at...
The dominant neutrino-nucleon interaction above 100 GeV is Deep Inelastic Scattering (DIS) in which an incoming neutrino scatters off a quark in the nucleon by exchanging a weak boson, producing an outgoing lepton accompanied by a hadron shower. Two sub-dominant processes are expected to produce two high energy charged leptons in the final state. The first one is a subset of DIS where a...
We introduce a new model independent technique for constructing background data templates for use in searches for new physics processes at the LHC.
This method, called CURTAINs, uses invertible neural networks to parametrise the distribution of sideband data as a function of the resonant observable. The network learns a transformation to map any data point from its value of the resonant...
Machine learning-based anomaly detection techniques offer exciting possibilities to significantly extend the search for new physics at the Large Hadron Collider (LHC) and elsewhere by reducing the model dependence. In this work, we focus on resonant anomaly detection, where generative models can be trained in sideband regions and interpolated into a signal region to provide an estimate of the...
Resonant anomaly detection is a promising framework for model-independent searches for new particles. Weakly supervised resonant anomaly detection methods compare data with a potential signal against a template of the Standard Model (SM) background inferred from sideband regions. We propose a means to generate this background template that uses a normalizing flow to create a mapping between...
We introduce a new technique named Latent CATHODE (LaCATHODE) for performing "enhanced bump hunts", a type of resonant anomaly search that combines conventional one-dimensional bump hunts with a model-agnostic anomaly score in an auxiliary feature space where potential signals could also be localized. The main advantage of LaCATHODE over existing methods is that it provides an anomaly score...
At an increasing number of interferometer sites with constantly-changing detector conditions, AI can play an important role in real-time and offline data processing. In this talk, we develop novel algorithms and training schemes that sift through noise and instrumental glitches to detect gravitational waves (GW) from compact binary coalescences (CBCs). For real-time processing, we create...
The Gaia space telescope measures the position and proper motion of a billion stars in the neighborhood of the Sun. This dataset contains stellar streams, tidal debris, and other structures that can cast light on the structure of the Galaxy, its merger history, and its dark matter component. I review the machine learning approaches -- including classifiers, normalizing flows, and anomaly...
I will give an overview of recent progress in ML applications to Astro/Cosmo.
I will give an overview of ML applications to Neutrino Physics.
Hadronic signals of new-physics origin at the Large Hadron Collider can remain hidden within the copiously produced hadronic jets. Unveiling such signatures requires highly performant deep-learning algorithms. We construct a class of Graph Neural Networks (GNN) in the message-passing formalism that makes the network output infrared and collinear (IRC) safe, an important criterion satisfied...
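One way to make a message-passing step infrared and collinear safe is to weight each neighbor's purely angular features by its energy fraction, so that soft emissions contribute nothing and collinear splittings simply add their weights. The snippet below is a simplified illustration of that idea, not the network presented in the talk.

    import torch

    def energy_weighted_messages(pt, h, adj):
        """pt:  (n,)   constituent transverse momenta
        h:   (n, d) angular node features (built from eta, phi only)
        adj: (n, n) 0/1 adjacency, e.g. pairs within some radius R
        """
        w = pt.unsqueeze(0) * adj                            # weight neighbor j by its pt
        w = w / w.sum(dim=1, keepdim=True).clamp(min=1e-9)   # normalize per receiving node
        return w @ h                                         # pt -> 0 particles drop out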
The particle-flow (PF) algorithm is of central importance to event reconstruction at the CMS detector, and has been a focus of developments in light of planned Phase-2 running conditions with an increased pileup and detector granularity. Current rule-based implementations rely on extrapolating tracks to the calorimeters, correlating them with calorimeter clusters, subtracting charged energy...
The reconstruction and calibration of hadronic final states in the ATLAS detector present complex experimental challenges. For isolated pions in particular, classifying $\pi^0$ versus $\pi^{\pm}$ and calibrating pion energy deposits in the ATLAS calorimeters are key steps in the hadronic reconstruction process. The baseline methods for local hadronic calibration were optimized early in the...
Discriminating quark-initiated from gluon-initiated jets is an extremely challenging yet important task in high-energy physics. Recent studies have shown that the discriminating features between quark and gluon jets produced by the Monte Carlo generator Pythia differ significantly from the features produced by Herwig. To understand this simulation-dependent discrepancy, we propose a Bayesian...
Modern architectures designed via geometric deep learning achieve high accuracies by exploiting Lorentz group invariance, but this comes at a high computational cost. Moreover, the framework is restricted to a particular classification scheme and lacks interpretability.
To tackle this issue, we present BIP, an efficient and computationally cheap framework to build rotational,...
Particle reconstruction is a task underlying virtually all analyses of collider-detector data. Recently, the application of deep learning algorithms on graph-structured low-level features has suggested new possibilities beyond the scope of traditional parametric approaches. In particular, we explore the possibility to reconstruct and classify individual neutral particles in a collimated...
We train a network to identify jets with fractional dark decay (semi-visible jets) using the pattern of their low-level jet constituents, and explore the nature of the information used by the network by mapping it to a space of jet substructure observables. Semi-visible jets arise from dark matter particles which decay into a mixture of dark sector (invisible) and Standard Model (visible)...
Hadronic jets and missing transverse energy are key experimental probes when searching for new physics or performing standard model precision measurements in collision events at the LHC. In this work, we propose a graph neural network algorithm for obtaining a global event description that demonstrates greatly improved resolution in the aforementioned objects obtained with a fast simulation of...
We present ν-Flows, a novel method for restricting the likelihood space of neutrino kinematics in high energy collider experiments using conditional normalizing flows and deep invertible neural networks.
This method allows the recovery of the full neutrino momentum, which is usually left as a free parameter, and permits one to sample neutrino values under a learned conditional likelihood...
We have been studying the use of deep neural networks (DNNs) to identify and locate primary vertices (PVs) in proton-proton collisions at the LHC. Earlier work focused on finding primary vertices in simulated LHCb data using a hybrid approach that started with kernel density estimators (KDEs) derived from the ensemble of charged track parameters and predicted “target histograms” from which...
Dimensionality reduction is a crucial aspect of data analysis in high energy physics, even if accompanied by information loss. Several methods, including histogram- and kernel-based analyses, are only computationally feasible for low-dimensional data. Furthermore, simulation models used in HEP can often only be validated for low-dimensional data. We provide several blueprints for using machine...
In a decade from now, the Upgrade II of LHCb experiment will face an instantaneous luminosity ten times higher than in the current Run 3 conditions. This will bring LHCb to a new era, with huge event sizes and typically several signal heavy-hadron decays per event. The trigger scope will shift from deciding ‘which events are interesting?’ to ‘which parts of the event are interesting?’. To...
Calorimetric muon energy estimation in high-energy physics is an example of a likelihood-free inference (LFI) problem, where simulators that implicitly encode the likelihood function are used to mimic complex particle interactions at different values of the physical parameters. Recently, Kieseler et al. (2022) exploited simulated measurements from a dense, finely segmented calorimeter to infer...
We use unlabeled collision data from CMS and weakly-supervised learning to train models which can distinguish prompt muons from non-prompt muons using patterns of low-level particle activity in the vicinity of the muon, and interpret the models in the space of energy flow polynomials. Particle activity associated with muons is a valuable tool for identifying prompt muons, those due to heavy boson...
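In a weakly supervised (CWoLa-style) setup, no per-muon labels are needed: a classifier is trained to separate two mixtures with different prompt-muon fractions, and it converges to the same optimal discriminant as a fully supervised one, up to a monotonic rescaling. A schematic example with scikit-learn; the loader names and sample choices are hypothetical, not the CMS selections.

    import numpy as np
    from sklearn.ensemble import HistGradientBoostingClassifier

    # two data-driven mixtures, e.g. a prompt-enriched and a prompt-depleted selection
    X_rich, X_poor = load_mixture_a(), load_mixture_b()     # hypothetical loaders
    X = np.vstack([X_rich, X_poor])
    y = np.concatenate([np.ones(len(X_rich)), np.zeros(len(X_poor))])

    clf = HistGradientBoostingClassifier().fit(X, y)
    prompt_score = clf.predict_proba(X)[:, 1]               # proxy for prompt vs. non-prompt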
We develop a nearest-neighbor regression algorithm for the problem of estimating the energy of multi-TeV muons in a high-granularity calorimeter, exploiting the pattern of soft photon deposits around the muon track. The algorithm is heavily overparametrized by assigning weights and biases to the training events. Parameters are learnt by batch gradient descent. The performance compares...
We describe a new scale-invariant jet clustering algorithm which does not impose a fixed cone size on the event. The proposed construction unifies fat-jet finding, substructure axis-finding, and recursive filtering of soft wide-angle radiation into a single procedure. The sequential clustering measure history facilitates high-performance substructure tagging with a boosted decision tree. ...
We propose a novel neural architecture that enforces an exact upper bound on the Lipschitz constant of the model by constraining the norm of its weights. This architecture was useful in developing new algorithms for the LHCb trigger which have robustness guarantees as well as powerful inductive biases leveraging the neural network’s ability to be monotonic in any subset of features. A new and...
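A generic way to see the idea of norm-constrained weights is a linear layer whose weight matrix is rescaled whenever its operator norm would exceed the target Lipschitz constant; the specific norm and the monotonicity construction used in the LHCb work differ from this simplified sketch.

    import torch
    import torch.nn as nn
    import torch.nn.functional as F

    class LipschitzLinear(nn.Module):
        def __init__(self, d_in, d_out, lip=1.0):
            super().__init__()
            self.linear = nn.Linear(d_in, d_out)
            self.lip = lip

        def forward(self, x):
            w = self.linear.weight
            norm = torch.linalg.matrix_norm(w, ord=2)        # spectral norm of the weights
            w = w * torch.clamp(self.lip / norm, max=1.0)    # rescale only if the bound is exceeded
            return F.linear(x, w, self.linear.bias)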
Following the previous work of leveraging Standard Model jet classifiers as generic anomalous jet taggers (https://arxiv.org/abs/2201.07199), we present an analysis of regularized SM jet classifiers serving as anti-QCD taggers. In the second part of the presentation, from the perspective of interdisciplinary research, we initiate a discussion on the opportunities and challenges involved in the...
We apply the artificial event variable technique, a deep neural network with an information bottleneck, to strongly coupled hidden sector models. These models of physics beyond the standard model predict collider production of invisible, composite dark matter candidates mixed with regular hadrons in the form of semivisible jets. We explore different resonant production mechanisms to determine...
There is a growing recent interest in endowing the space of collider events with a metric structure calculated directly in the space of its inputs. For quarks and gluons, the recently developed energy mover's distance has allowed for a quantification of what is different between physical events. However, the large number of particles within jets makes using metrics and interpreting these...
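In practice, the energy mover's distance between two jets can be evaluated with an optimal transport solver, treating constituent pT as the "earth" to be moved and the rapidity-azimuth distance as the ground metric. A simplified, balanced-transport sketch using the POT library (the full definition adds a term for the total-pT difference):

    import numpy as np
    import ot   # Python Optimal Transport

    def jet_emd(jet_a, jet_b):
        """jets: (n, 3) arrays of (pt, eta, phi); balanced-transport approximation."""
        wa = jet_a[:, 0] / jet_a[:, 0].sum()
        wb = jet_b[:, 0] / jet_b[:, 0].sum()
        cost = ot.dist(jet_a[:, 1:], jet_b[:, 1:], metric='euclidean')  # angular distances
        return ot.emd2(wa, wb, cost)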
We present a novel computational approach for extracting weak signals, whose exact location and width may be unknown, from complex background distributions with an arbitrary functional form. We focus on datasets that can be naturally presented as binned integer counts, demonstrating our approach on the datasets from the Large Hadron Collider. Our approach is based on Gaussian Process (GP)...
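A minimal version of such a fit treats the binned counts as noisy observations of a smooth background, with per-bin variance approximating the Poisson fluctuations; localized excesses then stand out against the GP's predictive band. The sketch below uses scikit-learn with placeholder binning and a hypothetical data loader.

    import numpy as np
    from sklearn.gaussian_process import GaussianProcessRegressor
    from sklearn.gaussian_process.kernels import RBF, ConstantKernel

    bin_centers = np.linspace(1000.0, 5000.0, 80).reshape(-1, 1)   # e.g. dijet mass bins, GeV
    counts = load_observed_counts()                                 # hypothetical (80,) integer array

    kernel = ConstantKernel(1.0) * RBF(length_scale=300.0)
    gp = GaussianProcessRegressor(kernel=kernel,
                                  alpha=np.maximum(counts, 1.0),    # ~Poisson variance per bin
                                  normalize_y=True)
    gp.fit(bin_centers, counts)
    background, band = gp.predict(bin_centers, return_std=True)     # smooth background + uncertainty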
Measuring the density profile of dark matter in the Solar neighborhood has important implications for both dark matter theory and experiment. In this work, we apply masked autoregressive flows to stars from a realistic simulation of a Milky Way-type galaxy to learn -- in an unsupervised way -- the stellar phase space density and its derivatives. With these as inputs we calculate the...