Variable Importance Ranking in the CMS Search for pp to ttt in the Single Lepton Final State at a Center-of-Mass Energy of 13 TeV

12 Jul 2021, 15:00
15m
Track I (Zoom)

Track I

Zoom

talk Top Quark Physics Top Quark

Speaker

Daniel Li (Brown University (US))

Description

Variable Importance is a variable ranking framework that uses machine learning methods, such as neural networks, to construct a quantitative metric for characterizing a variable's discriminatory power in binary classification problems. The Variable Importance framework is presented in the context of the CMS search for the rare Standard Model process of three top quark production to the single lepton final state. The importance metrics for a set of 76 multivariate variables describing a 13 TeV proton-proton collision are determined, and a neural network discriminator is trained for a subset of the ranked variables which will be used in a likelihood analysis. The Variable Importance framework includes hyper parameter optimization and k-fold cross validation training when constructing the final discriminator. Preliminary results for the expected three top signal and cross section using 101 $\mathrm{fb}^{-1}$ of simulated Monte Carlo samples using the Run 2 CMS detector is shown. Additionally, a study on Variable Importance's predictive power of the expected significance using a cumulative importance metric is shown to further validate the accuracy of Variable Importance's quantitative ranking.

Are you are a member of the APS Division of Particles and Fields? Yes

Author

Daniel Li (Brown University (US))

Co-authors

Meenakshi Narain (Brown University) Emanuele Usai (Brown University) Sinan Sagir (Karamanoglu Mehmetbey Universitesi)

Presentation materials