29 January 2024 to 2 February 2024
CERN
Europe/Zurich timezone

Re-simulation-based self-supervision for representation learning

30 Jan 2024, 16:50
20m
40/S2-D01 - Salle Dirac (CERN)

40/S2-D01 - Salle Dirac

CERN

115
Show room on map
Contributed talk 1 ML for object identification and reconstruction Contributed Talks

Speaker

Jeffrey Krupa (Massachusetts Institute of Technology)

Description

Self-Supervised Learning (SSL) is at the core of training modern large ML models, providing a scheme for learning powerful representations in base models that can be used in a variety of downstream tasks. However, SSL training strategies must be adapted to the type of training data, thus driving the question: what are powerful SSL strategies for collider physics data? In the talk, we present a novel re-simulation-based SSL (RS3L) strategy wherein we develop a method of “re-simulation” to drive data augmentation for contrastive learning. We show how a RS3L-trained base model can learn powerful representations that can be used for downstream discrimination tasks and can help mitigate uncertainties.

Authors

Benedikt Maier (KIT - Karlsruhe Institute of Technology (DE)) Jeffrey Krupa (Massachusetts Institute of Technology) Maurizio Pierini (CERN) Michael Kagan (SLAC National Accelerator Laboratory (US)) Nathaniel Sherlock Woodward (Massachusetts Inst. of Technology (US)) Philip Coleman Harris (Massachusetts Inst. of Technology (US))

Presentation materials