10-15 March 2019
Steinmatte conference center
Europe/Zurich timezone

Meta-Learning for Artificial Neural Network Hyper-Parameter Optimization for CERN CMS Offline Data Certification

Not scheduled
20m
Steinmatte conference center

Steinmatte conference center

Hotel Allalin, Saas Fee, Switzerland https://allalin.ch/conference/
Poster Track 2: Data Analysis - Algorithms and Tools Poster Session

Speaker

Mantas Stankevicius (Vilnius University, Institute of Data Science and Digital Technologies (LT))

Description

The Compact Muon Solenoid (CMS) is one of the general-purpose detectors at the CERN Large Hadron Collider (LHC) which collects enormous amounts of physics data. Before the final physics analysis can proceed, data has to be checked for quality (certified) by passing a number of automatic (like physics objects reconstruction, histogram preparation) and manual (checking, comparison and decision making) steps. Most important yet involving and error-prone last manual step of decision making is currently under active research as a subject of future automation by applying recent advancements from computer science, specifically, machine learning (ML).

Ultimately, CMS data certification is a binary classification task where various ML techniques are being investigated for applicability. Just like in any other ML task the hyper-parameter tuning is a difficult problem, there is no golden rule and each use case is different. This study explored meta-learning applicability, it is a hyper-parameters finding technique where algorithm learns hyper-parameters from previous training experiments. Evolutionary genetic algorithm has been used to tune hyper-parameters of a neural network, like number of hidden layers, number of neurons per layer, activation functions, dropouts, training batch size and optimizer. Initially, genetic algorithm takes manually specified set of hyper-parameters and then evolves towards the near-optimal solution. Genetic stochastic operators, crossover and mutation, were applied to avoid local optimal solutions.

Study proves that by carefully seeding the initial solution the optimal is likely to be found. Proposed solution has improved AUC score of neural network used for CERN CMS data certification. Similar algorithm can be applied for other machine learning models for hyper-parameter optimization.

Primary authors

Mantas Stankevicius (Vilnius University, Institute of Data Science and Digital Technologies (LT)) Valdas Rapsevicius (Fermi National Accelerator Lab. (US))

Presentation materials

There are no materials yet.

Peer reviewing

Paper