Description
Which particle types and properties (energy, angle) are considered?
How large is the full sim dataset used to extract parametrization/train the network? Is the dataset balanced (is the number of events for the different particle properties almost the same?)
How the structure of the input data is defined (hits, cells, clusters, custom voxels,..)?
Which data structure is used for the ML training (1D vector, images, graphs..)?
Is input data scaled? How?
How do you store the preprocessed data?
How are the condition values for the ML training (energy of the particle, angle,..) encoded?
Michele Faucci Giannelli
(INFN e Universita Roma Tor Vergata (IT))
22/11/2021, 16:20
Session: Data preprocessing
Jan Michal Dubinski
(Warsaw University of Technology (PL))
22/11/2021, 16:30
Session: Data preprocessing