Procter & Gamble (P&G) is one of the oldest and largest consumer goods companies in the world. It is present in about 180 markets, with operations in 70 countries and almost 100,000 employees. Machine Learning models created by P&G Data Scientists support every aspect of this global business, from R&D to shipping to marketing. The Data Science teams in the company...
A recent branch of what is currently called AI is Topological Data Analysis (TDA). TDA was born as an extension of algebraic topology to discrete data and is therefore a combination of algebraic topology, geometry, statistics, and computational methods. According to E. Munch, TDA comprises “a collection of powerful tools that can quantify shape and structure in data in order to answer...
Interpretable models are a hot topic in neural network research. My talk will look at interpretability from the perspective of inverse problems, where one wants to infer backwards from observations to the hidden characteristics of a system. I will focus on three aspects: reliable uncertainty quantification, outlier detection, and disentanglement into meaningful features. It turns out that...
Six years after the first demonstration of Deep Learning in HEP, the LHC community has explored a broad range of applications aiming for better, cheaper, faster, and easier solutions that ultimately extend the physics reach of the experiments and overcome HL-LHC computing challenges. I’ll present a snapshot of where the ATLAS experiment currently stands in its adoption of Deep Learning and suggest...
With firm evidence of neutrino oscillation and measurements of mixing parameters, neutrino experiments are entering the high-precision measurement era. Detectors are becoming larger and denser to gain higher measurement statistics, and detector technologies are evolving toward particle imaging, essentially a high-resolution "camera", in order to capture every single detail of the particles produced in...
Generative machine learning models have been successfully applied to many problems in particle physics, ranging from event generation to fast calorimeter simulation and beyond. This indicates that generative models have the potential to become a mainstay in many simulation chains. However, one question that still remains is whether a generative model can have increased statistical precision...
Simulating detector response is a crucial task in HEP experiments. Currently employed methods, such as Monte Carlo algorithms, provide high-fidelity results at the price of high computational cost, especially for dense detectors such as the ZDC calorimeter in the ALICE experiment. Multiple attempts have been made to reduce this burden, e.g. using generative approaches based on Generative Adversarial...
Deep learning models are known to be computationally heavy, requiring a lot of memory and bandwidth. A promising approach to make deep learning more efficient and to reduce its hardware workload is to quantize the parameters of the model to lower precision. This approach results in lower inference time, a lower memory footprint, and lower memory bandwidth.
We will research the...
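As a minimal sketch of the kind of post-training quantization described above (the abstract does not specify a framework; PyTorch's dynamic quantization is assumed here purely for illustration):

```python
import torch
import torch.nn as nn

# Toy network standing in for a full deep learning model.
model = nn.Sequential(nn.Linear(128, 64), nn.ReLU(), nn.Linear(64, 10))

# Post-training dynamic quantization: weights are stored as int8,
# activations are quantized on the fly at inference time.
quantized = torch.quantization.quantize_dynamic(
    model, {nn.Linear}, dtype=torch.qint8
)

x = torch.randn(1, 128)
print(quantized(x).shape)  # same interface, smaller footprint, faster inference
```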
Building on the recent success of deep learning algorithms, Generative Adversarial Networks (GANs) are exploited for modelling the response of the ATLAS calorimeter to different particle types, simulating calorimeter showers for photons, electrons, and pions over a range of energies (between 256 MeV and 4 TeV) across the full detector $\eta$ range. The properties of showers in...
Generative Adversarial Networks are usually used to generate images similar to the provided training data. The 3DGAN introduced in Khattak et al. (2019) has the ability to simulate data from High Energy Physics detectors where each shower is represented by a three-dimensional image. To evaluate the results, the generated images were compared to Monte Carlo GEANT4 simulations in terms of physics...
The NICA accelerator complex is currently being assembled at JINR (Dubna) to perform studies of heavy-ion collisions and explore new regions of the QCD phase diagram. Located at one of the two interaction points of the facility, the Multi-Purpose Detector (MPD) will utilize the Time-Projection Chamber (TPC) as the main tracker of the detector’s central barrel. The TPC consists of a gas-filled
Classifying particle types on the basis of detector response is a fundamental task in the ALICE experiment. Methods currently employed for this task are based on linear classifiers built on Monte Carlo simulation data, due to the lack of labels (PDG code) in production data, and require manual fine-tuning to match the latter data set's distribution. This calibration is performed by...
We propose a novel method for gradient-based optimization of black-box simulators using differentiable local surrogate models. In fields such as physics and engineering, many processes are modeled with non-differentiable simulators with intractable likelihoods. Optimization of these forward models is particularly challenging, especially when the simulator is stochastic. To address such cases,...
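A toy sketch of the general idea (not the authors' implementation; the simulator, surrogate architecture, and hyperparameters below are all illustrative): fit a differentiable surrogate to simulator samples drawn near the current parameters, then step the parameters using the surrogate's gradient.

```python
import torch
import torch.nn as nn

def blackbox_simulator(theta):
    # Stand-in for a stochastic, non-differentiable forward model.
    return (theta - 2.0) ** 2 + 0.1 * torch.randn_like(theta)

theta = torch.tensor([5.0])
surrogate = nn.Sequential(nn.Linear(1, 32), nn.ReLU(), nn.Linear(32, 1))
opt = torch.optim.Adam(surrogate.parameters(), lr=1e-2)

for step in range(200):
    # 1) Refit the local surrogate on samples around the current theta.
    local = theta.detach() + 0.5 * torch.randn(64, 1)
    opt.zero_grad()
    nn.functional.mse_loss(surrogate(local), blackbox_simulator(local)).backward()
    opt.step()

    # 2) Gradient step on theta through the differentiable surrogate.
    theta = theta.detach().requires_grad_(True)
    surrogate(theta.unsqueeze(0)).squeeze().backward()
    with torch.no_grad():
        theta = theta - 0.05 * theta.grad
```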
Design of new experiments, as well as the upgrade of ongoing ones, is a continuous process in experimental high energy physics. Frontier R&D is used to squeeze the maximum physics performance out of cutting-edge detector technologies. The evaluation of physics performance for a particular configuration includes sketching this configuration in Geant, simulating typical signals and...
The Matrix Element Method (MEM) is a powerful method to extract information from measured events at collider experiments. Compared to multivariate techniques built on large sets of experimental data, the MEM does not rely on an examples-based learning phase but directly exploits our knowledge of the physics processes. This comes at a price, both in terms of complexity and computing time, since...
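For reference, the MEM likelihood for an observed event $x$ under hypothesis $\alpha$ is typically written as

$$P(x \mid \alpha) = \frac{1}{\sigma_\alpha} \int d\Phi(y)\, \left|\mathcal{M}_\alpha(y)\right|^2\, W(x \mid y),$$

where $|\mathcal{M}_\alpha|^2$ is the squared matrix element evaluated at the parton-level configuration $y$, $W(x \mid y)$ is the transfer function modelling the detector response, and $\sigma_\alpha$ normalizes the probability. The phase-space integral $\int d\Phi$ is what drives the computing cost mentioned above.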
This talk contains two contributions:
1) Adaptive divergence for rapid adversarial optimization.
Adversarial Optimization provides a reliable, practical way to match two implicitly defined distributions, one of which is typically represented by a sample of real data, and the other is represented by a parameterized generator. Matching of the distributions is achieved by minimizing a...
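As a reminder of the setup this contribution builds on, the standard adversarial objective for a parameterized generator $g_\psi$ and a discriminator $D$ reads

$$\min_\psi \max_D \; \mathbb{E}_{x \sim p_{\text{data}}}\left[\log D(x)\right] + \mathbb{E}_{x \sim g_\psi}\left[\log\left(1 - D(x)\right)\right],$$

which, at the optimal discriminator, is equivalent (up to constants and scaling) to minimizing the Jensen-Shannon divergence between the data distribution and the generator; the "adaptive divergence" of this contribution presumably generalizes this fixed choice of divergence.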
Track finding is a critical and computationally expensive step of object reconstruction for the LHC detectors. The current method of track reconstruction is a physics-inspired Kalman Filter guided combinatorial search. This procedure is highly accurate but is sequential and thus scales poorly with increased luminosity like that planned for the HL-LHC. It is therefore necessary to consider new...
Secondary vertex finding is a crucial task for identifying jets containing heavy flavor hadron decays.
Bottom jets in particular have a very distinctive topology of $b \to c \to s$ decay which gives rise to two secondary vertices with high invariant mass and several associated charged tracks.
Existing secondary vertex finding...
For simulations where the forward and the inverse directions have a physics meaning, invertible neural networks are especially useful. A conditional INN can invert a detector simulation in terms of high-level observables, specifically for ZW production at the LHC. It allows for a per-event statistical interpretation. Next, we allow for a variable number of QCD jets. We unfold detector effects...
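Schematically, an INN $f$ implements the change-of-variables formula

$$p_X(x) = p_Z\!\left(f(x)\right)\left|\det \frac{\partial f(x)}{\partial x}\right|,$$

so that, conditioned on detector-level observables, sampling the latent $z$ and applying $f^{-1}$ yields a distribution of parton-level configurations for each event, which is what enables the per-event statistical interpretation mentioned above.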
We present the Hit-reco model for denoising and region-of-interest selection on raw simulation data from the ProtoDUNE experiment. The ProtoDUNE detector is hosted by CERN and aims to test and calibrate technologies for DUNE, a forthcoming experiment in neutrino physics. Hit-reco leverages deep learning algorithms to perform the first step in the reconstruction chain, which consists in converting...
An overarching issue of LHC experiments is the necessity to produce massive numbers of simulated collision events in very restricted regions of phase space. A commonly used approach to tackle the problem is the use of event weighting techniques where the selection cuts are replaced by event weights constructed from efficiency parametrizations. These techniques are however limited by the...
For many top quark measurements, it is essential to reconstruct the top quark from its decay products. For example, the top quark pair production process in the all-jets final state has six jets initiated from daughter partons and additional jets from initial/final state radiation. Due to the many possible permutations, it is very hard to assign jets to partons. We use a deep neural network...
Experimental measurements in high energy physics are primarily designed using the expert knowledge and intuition of the analysers, who define their background rejection cuts, control/signal regions and observables of interest based on their understanding of the physical processes involved. More recently, modern multivariate analysis techniques such as neural density estimation and boosted...
This talk will provide an introduction to the concept of over-parametrization in neural networks and the associated benefits that have been identified from the theoretical and empirical standpoints. It will then present the practice of pruning as both a practical engineering intervention to reduce model size and a scientific tool to investigate the behavior and trainability of compressed...
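For concreteness, one common pruning variant (unstructured magnitude pruning in PyTorch; the talk covers pruning more broadly) looks like this:

```python
import torch.nn as nn
import torch.nn.utils.prune as prune

# A single over-parametrized layer standing in for a full model.
layer = nn.Linear(512, 512)

# Unstructured magnitude pruning: zero out the 80% of weights
# with the smallest absolute value.
prune.l1_unstructured(layer, name="weight", amount=0.8)

sparsity = (layer.weight == 0).float().mean().item()
print(f"sparsity: {sparsity:.0%}")  # ~80%
```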
AutoDQM is an automated monitoring system which implements statistical tests and machine learning (ML) algorithms to compare data runs and flag anomalies for CMS data quality. It is used in conjunction with the existing Data Quality Monitoring (DQM) software to reduce the time and labor required of shifters during collision running by identifying anomalous behavior for further review from...
The Large Hadron Collider (LHC) at the European Organisation for Nuclear Research (CERN) will undergo an upgrade to further increase the instantaneous rate of particle collisions (luminosity) and become the High Luminosity LHC. This increase in luminosity will yield many more detector hits (occupancy), and thus measurements will pose a challenge to track reconstruction algorithms being...
In High Energy Physics (HEP), calorimeter outputs play an essential role in understanding short-distance processes occurring during particle collisions. Due to the complexity of the underlying physics, traditional Monte Carlo simulation is computationally expensive, and thus the HEP community has suggested Generative Adversarial Networks (GAN) for fast simulation. Meanwhile, it has also been...
SWAN (Service for Web-based ANalysis) is CERN’s general-purpose Jupyter notebook service. It offers a pre-configured, fully-fledged, and easy to use environment, integrating CERN-IT compute, GPU, storage, and analytics services, available at a simple mouse click. In this talk, we will describe the currently deployed SWAN service, as well as recent developments and service improvements that can...
Machine Learning has been used in a wide array of areas, and the necessity to make it faster while still maintaining the accuracy and validity of the results is a growing problem for data scientists. This work explores the TensorFlow distributed parallel strategy approach to effectively and efficiently run a Generative Adversarial Network (GAN) model [1] in a parallel environment,...
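A minimal sketch of the TensorFlow distributed-strategy API in question (illustrative only; the actual GAN is the cited model [1]):

```python
import tensorflow as tf

# MirroredStrategy replicates the model across the visible GPUs and
# synchronizes gradients across replicas after each step.
strategy = tf.distribute.MirroredStrategy()
print("replicas:", strategy.num_replicas_in_sync)

with strategy.scope():
    # Variables created in this scope (e.g. generator/discriminator
    # weights) are mirrored on every replica.
    generator = tf.keras.Sequential([
        tf.keras.layers.Dense(64, activation="relu"),
        tf.keras.layers.Dense(1),
    ])
    optimizer = tf.keras.optimizers.Adam(1e-4)
```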
Experiments at the HL-LHC and beyond will have ever higher read-out rates. It is therefore essential to explore new hardware paradigms for large scale computations. In this work we consider the Optical Processing Units (OPU) from LightOn, which compute random matrix multiplications on large datasets in an analog, fast and economic way, fostering faster machine learning results on a dataset of reduced...
Machine Learning is increasingly used in many fields of HEP and will make its contribution to the upcoming High-Luminosity LHC (HL-LHC) program at CERN. The rise in the amount of data produced calls for new approaches to train and use ML models. In this presentation we discuss the Machine Learning as a Service (MLaaS) infrastructure, which allows reading data directly in the ROOT format, exploiting the...
Graph Neural Networks (GNN) are trainable functions that operate on a graph, forming a parameterized message passing by which information is propagated across the graph, ultimately learning sophisticated latent graph attributes. Their application in High Energy Physics has grown rapidly in the past years, ranging from event reconstruction to data analysis, from...
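A generic message-passing update, as a point of reference for the architectures surveyed here, has the form

$$h_v^{(t+1)} = \phi\!\left(h_v^{(t)},\; \sum_{u \in \mathcal{N}(v)} \psi\!\left(h_v^{(t)}, h_u^{(t)}, e_{uv}\right)\right),$$

where $\psi$ is a learned message function over the edge features $e_{uv}$, the sum aggregates messages over the neighbourhood $\mathcal{N}(v)$, and $\phi$ updates the node state $h_v$.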
With the emergence of ever more sophisticated machine learning models in high energy physics, optimising the parameters of the models (hyperparameters) is becoming more and more crucial in order to get the best performance for physics analysis. This requires a lot of computing resources. So far, many of the training results are worked out on a personal computer or at a local institution...
The identification of heavy particles such as top quarks or vector bosons is one of the key issues at the Large Hadron Collider. In this talk, we introduce a novel jet tagging method which relies on graph neural networks and an efficient description of the radiation patterns within a jet to optimally disentangle signatures of boosted objects from background QCD jets. We apply this framework to...
An important step in the analysis of HEP scattering processes is the optimization of the input space for multivariate techniques. We propose a general recipe for forming a set of low-level observables that are sensitive to the differences between hard scattering processes at colliders. It will be demonstrated that, without any sophisticated analysis of the kinematic properties, one can achieve...
We present a new set of neural network architectures: Lorentz-group-covariant architectures for learning the kinematics and properties of complex systems of particles. The novel design of this network, called LGN (Lorentz Group Network), implements activations as vectors that transform according to arbitrary finite-dimensional representations of the underlying symmetry group that governs...
The measurement of the associated production of Higgs boson with a top-quark pair (ttH) at the LHC provides a direct determination of the Higgs-Top Yukawa interaction. The presence of a large number of objects in the final state makes the measurement very challenging. Multivariate Analysis methods such as Boosted Decision Trees (BDT) were used to enhance the analysis sensitivity. However, the...
Higgs Bosons produced via gluon-gluon fusion (ggF) with large transverse momentum ($p_T$) are sensitive probes of physics beyond the Standard Model. However, high $p_T$ Higgs Boson production is contaminated by a diversity of production modes other than ggF: vector boson fusion, production of a Higgs boson in association with a vector boson, and production of a Higgs boson with a top-quark...
One of the goals of current particle physics research is to obtain evidence of physics beyond the Standard Model (BSM) at accelerators such as the Large Hadron Collider (LHC). The searches for new physics are often guided by BSM theories that depend on many unknown parameters, which makes testing their predictions computationally challenging. Bayesian neural networks (BNN) can map the...
We introduce a novel strategy for machine-learning-based predictive simulators, which can be trained in an unsupervised manner using observed data samples to learn a predictive model of the detector response and other difficult-to-model transformations. Particle physics detectors cannot directly probe fundamental particle collisions. Instead, statistical inference must be used to surmise...
Searching for rare physics processes requires a good understanding of the backgrounds involved. This often requires large amounts of simulated data that are computationally expensive to produce. The Belle II collaboration is planning to collect 50 times the amount of data of its predecessor Belle. With the increase in data volume the necessary volume of simulated data increases as well....
The pixel detector (PXD) is an essential part of the Belle II detector recording particle positions. Data from the PXD and other sensors allow us to reconstruct particle tracks and decay vertices. The effect of background noise on track reconstruction for measured data is emulated for simulated data by a mixture of measured background noise and easily-simulated particle decays. This model...
The DeepLearnPhysics open dataset contains thousands of frames of LArTPC detector data. The main problem posed by the dataset is semantic segmentation. This problem has been solved successfully with a modified version of U-Net, as well as with graph networks. The main difficulty of this problem lies in the sparsity of the data (thin tracks inside pixels, or voxels), which makes it difficult to feed classical...
The canonical particle flow algorithm tries to estimate the neutral energy deposition in the calorimeter by first matching calorimeter deposits to track directions and subsequently subtracting the track momenta from the matched cluster energy depositions.
We propose a Deep Learning based method for estimating the energy fraction of individual components for each cell of the...
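Schematically, the canonical subtraction step estimates the neutral component as

$$E_{\text{neutral}} \approx E_{\text{cluster}} - \sum_{\text{matched tracks}} p_{\text{track}},$$

which is the quantity the cell-level Deep Learning estimate described above is meant to refine.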
Super-resolution algorithms are commonly used to enhance the granularity of an imaging system beyond what can be achieved using the measuring device.
We show the first application of super-resolution algorithms using deep learning-based methods for calorimeter reconstruction, using a simplified geometry consisting of overlapping showers originating from charged and neutral pion events.
The...
Calorimetric cluster reconstruction can be performed using deep learning solutions from real-time computer vision by casting the detector readout as a two-dimensional image. The increased luminosity expected in Run III poses unprecedented challenges to shower reconstruction at LHCb. This work seeks to perform shower identification and energy regression under such conditions through both...
High-energy physics detectors, images, and point clouds share many similarities in terms of object detection. However, while detecting an unknown number of objects in an image is well established in computer vision, even machine learning assisted object reconstruction algorithms in particle physics almost exclusively predict properties on an object-by-object basis.
Traditional approaches...
In this talk I will present an unsupervised clustering (UCluster) method in which a neural network is used to reduce the dimensionality of the data while preserving the event information. The reduced representation is then clustered in a k-means-friendly space with a suitable loss function. I will show how this idea can be used for unsupervised multi-class classification and anomaly detection.
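A k-means-friendly embedding of this kind is typically trained with a loss of the schematic form

$$L = L_{\text{emb}} + \lambda \sum_i \left\| z_i - \mu_{c_i} \right\|^2,$$

where $z_i$ is the learned representation of event $i$, $\mu_{c_i}$ its assigned cluster centroid, and $L_{\text{emb}}$ the task loss that preserves the event information; the exact loss used in UCluster may differ.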
Waveform analysis is a crucial first step in the data processing pipeline for any particle physics experiment. Its accuracy, therefore, can limit the overall analysis performance, and waveform analyses often face a variety of challenges, for example overlapping ‘pile-up’ pulses, noise, non-linearities, and floating baselines. Historically, many experiments have viewed template fitting as...
Open Data are a crucial cornerstone of science. Using Open Data brings benefits such as direct access to cutting edge research, tools to promote public understanding of science and training for scientists of the future. This talk will describe the enormous potential of ATLAS Open Data and how it’s used for training, courses and tutorials in machine learning, from undergraduate and postgraduate...
We present the first application of adaptive machine learning to the identification of anomalies in a data set of non-periodic time series. The method follows an active learning strategy where highly informative objects are selected to be labelled. This new information is subsequently used to improve the machine learning model, allowing its accuracy to evolve with the addition of human...
We introduce a generative adversarial network for analyzing the dark matter distribution of a dwarf spheroidal galaxy.
The mock data generator for dwarf spheroidal galaxies in the spherically symmetric case has three functional parameters: the number density of stars, the density of dark matter, and velocity anisotropy.
The generator will be adversarially trained on a mock dataset, which...
The simulation of the passage of particles through the LHC detectors already occupies more than a third of the available computing resources and is predicted to exceed them after 2026, in the case of the ATLAS detector. A significant portion of the runtime of the most prevalent simulation toolkit, Geant4, is spent exploring the geometry of the detector volume in order to calculate a particle instance...
In this talk, we investigate the use of Generative Adversarial Networks (GANs) and a new architecture -- the Bounded Information Bottleneck Autoencoder (Bib-AE) -- for modeling electromagnetic showers in the central region of the Silicon-Tungsten calorimeter of the proposed International Large Detector. An accurate simulation of differential distributions including for the first time the shape...
The PyTorch just-in-time (jit) compiler is a powerful tool for optimizing and serializing neural network models. However, its range is limited by the subset of the python language that it is restricted to and the number of tensor operations implemented in C++. These limitations were a major blocker to using graph neural networks implemented in the geometric deep learning (GDL) library PyTorch...
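A minimal example of the compiler in question (the edge-feature gather below is a stand-in for the kind of graph operation GDL models rely on):

```python
import torch

@torch.jit.script
def edge_features(x: torch.Tensor, edge_index: torch.Tensor) -> torch.Tensor:
    # TorchScript accepts only a typed subset of Python; simple tensor
    # indexing like this compiles, while some custom GDL ops historically
    # did not, which is the limitation described above.
    src, dst = edge_index[0], edge_index[1]
    return x[src] - x[dst]
```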
The data rate may surge after planned upgrades to the high-luminosity Large Hadron Collider (LHC) and accelerator-based neutrino experiments. Since there is not enough storage to save all of the data, there is a challenging demand to process and filter billions of events in real time. Machine learning algorithms are becoming increasingly prevalent in the particle reconstruction pipeline....
With the wide use of deep learning in HEP analyses, answering questions beyond classification performance becomes increasingly important. One crucial aspect is ensuring the robustness of classifier outputs against other observables, typically an invariant mass. So far, superior performance in decorrelation has been achieved by adversarial training. We show that a simple additive term in the...
Invariance of learned representations of neural networks against certain sensitive attributes of the input data is a desirable trait in many modern-day applications of machine learning, such as precision measurements in experimental high-energy physics. We propose to use the ability of variational autoencoders to learn a disentangled latent representation to achieve the desired...
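One standard way to encourage such disentanglement (not necessarily the exact objective of this work) is the $\beta$-VAE loss

$$\mathcal{L} = -\,\mathbb{E}_{q_\phi(z \mid x)}\left[\log p_\theta(x \mid z)\right] + \beta\, D_{\mathrm{KL}}\!\left(q_\phi(z \mid x)\,\|\,p(z)\right),$$

minimized during training, where an up-weighted ($\beta > 1$) KL term pushes the latent posterior toward a factorized prior, trading reconstruction quality for more independent latent factors.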
A key challenge in searches for resonant new physics is that classifiers trained to enhance potential signals must not induce localized structures. Such structures could result in a false signal when the background is estimated from data using sideband methods. A variety of techniques have been developed to construct classifiers which are independent from the resonant feature (often a mass)....
A growing number of weak- and unsupervised machine learning approaches to anomaly detection are being proposed to significantly extend the search program at the Large Hadron Collider and elsewhere. One of the prototypical examples for these methods is the search for resonant new physics, where a bump hunt can be performed in an invariant mass spectrum. A significant challenge to methods that...
In this talk we present a new algorithm called ‘Anomaly Awareness’ (AA) to search for physics beyond the Standard Model (BSM). By making the algorithm aware of the presence of a range of different anomalies, we improve its capability to detect anomalous events, even those it had not been exposed to. As an example, we apply this method to a boosted jet topology for BSM searches at the LHC and use it...
A central goal in experimental high energy physics is to detect new physics signals that are not explained by known physics. In this work, we aim to search for new signals that appear as deviations from known Standard Model physics in high-dimensional particle physics data. To do this, we determine whether there is any statistically significant difference between the distribution of Standard...