11–15 Mar 2024
Charles B. Wang Center, Stony Brook University
US/Eastern timezone

Towards the construction of Foundational Models at the LHC

12 Mar 2024, 11:30
20m
Lecture Hall 2 ( Charles B. Wang Center, Stony Brook University )

Lecture Hall 2

Charles B. Wang Center, Stony Brook University

100 Circle Rd, Stony Brook, NY 11794
Oral Track 2: Data Analysis - Algorithms and Tools Track 2: Data Analysis - Algorithms and Tools

Speaker

Philip Coleman Harris (Massachusetts Inst. of Technology (US))

Description

The emergence of models pre-trained on simple tasks and then fine-tuned to solve many downstream tasks has become a mainstay for the application of deep learning within a large variety of domains. The models, often referred to as foundation models, aim, through self-supervision, to simplify complex tasks by extracting the most salient features of the data through a careful choice of pre-training strategy. When done effectively, these models can lead to world-leading algorithm performance on a large number of tasks. We present a re-simulation strategy for a model pretraining (R3SL) and show that this strategy applied to quark and gluon jets at the Large Hadron Collider can be used to create a foundation model for hadronically decaying objects. We show R3SL creates a feature space insensitive to the parton shower model uncertainties while retaining the core features of quark and gluon jets. On downstream tasks utilizing the pre-trained feature space, we demonstrate our method achieves comparable, if not better, tagging performance to established benchmarks for jet tagging in Higgs to bottom quarks, but with greatly reduced uncertainties. The algorithm presents a crucial step towards more robust searches for new physics involving particle jets and paves the way for the development of foundation models at the Large Hadron Collider.

Significance

This study is one of the first examples of a foundation model for applications in high-energy physics.

Experiment context, if any LHC

Primary authors

Benedikt Maier (KIT - Karlsruhe Institute of Technology (DE)) Jeffrey Krupa (Massachusetts Institute of Technology) Maurizio Pierini (CERN) Michael Kagan (SLAC National Accelerator Laboratory (US)) Philip Coleman Harris (Massachusetts Inst. of Technology (US))

Presentation materials