ML4Jets2022

Name: ML4Jets2022
Start: 2022-11-01T08:00:00-04:00
End: 2022-11-04T17:50:00-04:00
Location: Rutgers University

1–4 Nov 2022

US/Eastern timezone

Contact

ml4jets2022@googlegroups.com

Feature selection with Distance Correlation

4 Nov 2022, 10:20

20m

Multipurpose Room (aka Livingston Hall) (Rutgers University)

Livingston Student Center

Interpretability

RANIT DAS

Feature selection algorithms can be an important tool for AI explainability. If the performance of neural networks trained on low-level data can be reproduced by a small set of high-level features, we can hope to understand “what the machine learned”. We present a new algorithm that selects features by ranking their Distance Correlation (DisCo) values with the truth labels. We apply this algorithm to the classification of boosted top quarks, using a set of 7,000 Energy Flow Polynomials (EFPs) as our feature space. We show that our method selects a small set of high-level features whose classification performance is comparable to that of state-of-the-art top taggers.
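The ranking step mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it uses the standard sample distance correlation and scores each feature independently against the labels, whereas the algorithm presented in the talk may differ in its details. The function names (`distance_correlation`, `rank_features_by_disco`) and the `top_k` parameter are hypothetical, chosen only for this example.

```python
import numpy as np


def _centered_distances(v):
    """Pairwise Euclidean distance matrix of v, double-centered."""
    v = np.asarray(v, dtype=float).reshape(len(v), -1)
    d = np.linalg.norm(v[:, None, :] - v[None, :, :], axis=-1)
    # Subtract row and column means, then add back the grand mean.
    return d - d.mean(axis=0, keepdims=True) - d.mean(axis=1, keepdims=True) + d.mean()


def distance_correlation(x, y):
    """Sample distance correlation between x and y (value in [0, 1])."""
    a = _centered_distances(x)
    b = _centered_distances(y)
    dcov2 = (a * b).mean()                     # squared distance covariance
    dvar2 = (a * a).mean() * (b * b).mean()    # product of squared distance variances
    if dvar2 <= 0.0:
        return 0.0                             # a constant input carries no information
    return float(np.sqrt(max(dcov2, 0.0) / np.sqrt(dvar2)))


def rank_features_by_disco(features, labels, top_k):
    """Return indices and scores of the top_k columns of `features`,
    ordered by decreasing distance correlation with `labels`.

    Assumption: each feature is scored on its own, with no conditioning
    on features that were already selected.
    """
    scores = np.array(
        [distance_correlation(features[:, j], labels) for j in range(features.shape[1])]
    )
    order = np.argsort(scores)[::-1][:top_k]
    return order, scores[order]
```

The distance matrices are n × n for n events, so in practice a sketch like this would be evaluated on a subsample of events rather than on a full training set.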

Authors: David Shih, Gregor Kasieczka (Hamburg University (DE)), RANIT DAS

Presentation_ml4jets.pdf
