Fast Machine Learning for Science Workshop 2023

Name: Fast Machine Learning for Science Workshop 2023
Start: 2023-09-25T08:30:00+01:00
End: 2023-09-28T18:00:00+01:00
Location: Imperial College London

Europe/London timezone

Scalable neural network models and terascale datasets for particle flow reconstruction

25 Sept 2023, 13:45

15m

Blackett Laboratory, Lecture Theatre 1 (Imperial College London)

Standard Talk, Contributed Talks

Farouk Mokhtar (Univ. of California San Diego (US))

Particle flow reconstruction is crucial to analyses performed at general-purpose detectors such as ATLAS and CMS. Recent developments have shown that machine-learned particle flow reconstruction using graph neural networks offers a prospect for computationally efficient event reconstruction [1-2]. Focusing on the scalability of machine-learning-based models for full event reconstruction, we compare two alternative models for particle flow reconstruction that can process full events consisting of tens of thousands of input elements while avoiding quadratic memory allocation and computation cost. We test the models on a newly developed, granular and detailed dataset based on full GEANT4 detector simulation for particle flow reconstruction studies. Using supercomputing, we carry out extensive hyperparameter optimization to choose a model configuration that significantly outperforms the baseline rule-based implementation on a cluster-based dataset, where the inputs are charged particle tracks and calorimeter clusters. We characterize the physics performance of the model, using event-level quantities such as jet and missing transverse energy response, as well as its computational performance, and find that using mixed precision can significantly improve training speed. We further demonstrate that the resulting model architecture and software setup are highly portable across hardware vendors, supporting training on NVIDIA, AMD, and Habana cards. Finally, we show that the model can alternatively be trained on a highly granular dataset consisting of tracks and raw calorimeter hits, resulting in a physics performance that is competitive with baseline particle flow and is currently limited by training throughput. We expect that with additional effort in dataset design, model development, and high-performance training, it will be possible to improve event reconstruction performance over current baselines.
The extensive simulated dataset and model training code are made available under the FAIR principles.

[1] https://arxiv.org/abs/2101.08578
[2] https://arxiv.org/abs/2303.17657
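
To give a sense of why quadratic cost matters at the event sizes quoted above, the following is a minimal back-of-the-envelope sketch. It assumes, purely for illustration, that pairwise interactions are restricted to fixed-size bins of input elements; the abstract does not state which mechanism the two compared models use. The function names, the event size of 20,000 elements, and the bin size of 256 are all hypothetical.

```python
import math


def dense_pair_count(n_elements: int) -> int:
    """Entries in a full pairwise (all-to-all) interaction matrix."""
    return n_elements * n_elements


def binned_pair_count(n_elements: int, bin_size: int) -> int:
    """Entries when elements only interact within fixed-size bins.

    Assumption for illustration: the event is padded up to a whole
    number of bins, and each bin holds a bin_size x bin_size matrix.
    """
    n_bins = math.ceil(n_elements / bin_size)
    return n_bins * bin_size * bin_size


def float32_mib(n_values: int) -> float:
    """Memory in MiB for n_values stored as 4-byte floats."""
    return n_values * 4 / 2**20


if __name__ == "__main__":
    n = 20_000   # hypothetical event size ("tens of thousands" of elements)
    b = 256      # hypothetical bin size
    for label, count in [("dense", dense_pair_count(n)),
                         ("binned", binned_pair_count(n, b))]:
        print(f"{label:>6}: {count:>12,d} entries, "
              f"{float32_mib(count):10.2f} MiB per matrix")
```

Under these assumptions the binned count grows linearly with the number of input elements at fixed bin size, whereas the dense count grows quadratically.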

David Southwick (CERN); Eric Wulff (CERN); Farouk Mokhtar (Univ. of California San Diego (US)); Javier Mauricio Duarte (Univ. of California San Diego (US)); Joosep Pata (National Institute of Chemical Physics and Biophysics (EE)); Dr Maria Girone (CERN); Mengke Zhang (Univ. of California San Diego (US))

FM_mlpf_FastML2023.pdf

FM_mlpf_FastML2023_withbuilds.pdf
