23–28 Oct 2022
Villa Romanazzi Carducci, Bari, Italy
Europe/Rome timezone

Evolutionary algorithms for hyperparameter optimization in machine learning for application in high energy physics

27 Oct 2022, 15:30
20m
Sala Europa (Villa Romanazzi)

Sala Europa

Villa Romanazzi

Oral Track 2: Data Analysis - Algorithms and Tools Track 2: Data Analysis - Algorithms and Tools

Speaker

Laurits Tani (National Institute of Chemical Physics and Biophysics (EE))

Description

In contemporary high energy physics (HEP) experiments the analysis of vast amounts of data represents a major challenge. In order to overcome this challenge various machine learning (ML) methods are employed. However, in addition to the choice of the ML algorithm a multitude of algorithm-specific parameters, referred to as hyperparameters, need to be specified in practical applications of ML methods. The optimization of these hyperparameters, which is often performed manually, has a significant impact on the performance of the ML algorithm. In this talk we explore several evolutionary algorithms that allow to determine optimal hyperparameters for a given ML task in a fully automated way. Additionally, we study the capability of the two most promising hyperparameter optimisation algorithms, particle swarm optimization and bayesian optimization, for utilising the highly parallel computing architecture that is typical for the field of HEP.

References

Tani, L., Rand, D., Veelken, C. et al. Evolutionary algorithms for hyperparameter optimization in machine learning for application in high energy physics. Eur. Phys. J. C 81, 170 (2021). https://doi.org/10.1140/epjc/s10052-021-08950-y

Tani, L. & Veelken, C. Comparison of Bayesian and particle swarm algorithms for hyperparameter optimisation in machine learning applications in high energy physics. arXiv preprint arXiv:2201.06809 (2022) https://arxiv.org/pdf/2201.06809.pdf

Significance

ML methods are in common use in HEP data analyses, but very few studies have been performed of the task of finding optimal hyperparameter values. Algorithms that allow to determine optimal hyperparameter values in a fully automated way are presented in this talk. Furthermore, we present results on how well different hyperparameter optimization algorithms parallelise on modern computing architectures, such as computing clusters and the Worldwide LHC Computing Grid (WLCG).

Experiment context, if any None

Primary authors

Christian Veelken (National Institute of Chemical Physics and Biophysics (EE)) Laurits Tani (National Institute of Chemical Physics and Biophysics (EE)) Diana Rand (National Institute of Chemical Physics and Biophysics (EE)) Mario Kadastik (National Institute of Chemical Physics and Biophysics (EE))

Presentation materials