Foundation Models for Science Mini Workshop

Timezone: Europe/Zurich
Gaia Grosso (IAIFI, MIT), Javier Mauricio Duarte (Univ. of California San Diego (US)), Pietro Vischia (Universidad de Oviedo and Instituto de Ciencias y Tecnologías Espaciales de Asturias (ICTEA)), Raghav Kansal (Univ. of California San Diego (US))
Description

The CMS ML Innovation group is excited to host a mini-workshop on Foundation Models for Science.

There has been significant research on this topic over the last couple of years, and we are very interested in exploring foundation models on LHC data for a wide variety of physics applications, including classification tasks, simulations, and real-time triggers.

The aim of this workshop is to bring together speakers from CMS, ATLAS, and fields beyond HEP who are working on building powerful, robust foundation models for science.

Register below to stay updated! 

Physical Rooms at CERN

  • Tuesday: 354/1-019
  • Wednesday: 40/S2-B01 (Salle Bohr)

Organizing committee:

  • Javier Duarte (UCSD)
  • Gaia Grosso (IAIFI / MIT)
  • Raghav Kansal (Caltech / FNAL)
  • Pietro Vischia (Universidad de Oviedo / ICTEA)

  • Tuesday 1 October
    • 14:00–15:00
      Foundation Model Mini-Workshop: Day 1, 354/1-019 (CERN)
      • 14:00
        Introduction 10m
        Speakers: Dr Gaia Grosso (IAIFI, MIT), Raghav Kansal (Univ. of California San Diego (US))
      • 14:10
        Towards Foundation Models in HEP with Self-Supervised Learning 50m

        Can Foundation Models, which rely on massive parameter counts, data sets, and compute, and which have proven extremely powerful in computer vision and natural language systems, be built for High Energy Physics? To do so, several challenges must be addressed, including understanding how training strategies, which are often data-type specific, can be developed for HEP data. In this talk, we will discuss our first steps towards building HEP foundation models using self-supervised training methods, such as masking strategies. We will also explore how pre-trained models can encode general knowledge of high utility when adapted for a variety of tasks. A toy sketch of such masked pre-training follows this entry.

        Speaker: Michael Kagan (SLAC National Accelerator Laboratory (US))
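        As a concrete illustration of the masking idea mentioned above, here is a minimal PyTorch sketch of masked self-supervised pre-training on particle-level data. The architecture, the four-feature particle representation, and the masking fraction are illustrative assumptions, not the speaker's actual setup.

        ```python
        # Hedged sketch: BERT-style masked modeling on sets of particles.
        # Shapes and hyperparameters are toy choices for illustration.
        import torch
        import torch.nn as nn

        class MaskedParticlePretrainer(nn.Module):
            def __init__(self, n_features=4, d_model=64, n_heads=4, n_layers=2):
                super().__init__()
                self.embed = nn.Linear(n_features, d_model)
                self.mask_token = nn.Parameter(torch.zeros(d_model))  # learned [MASK]
                layer = nn.TransformerEncoderLayer(d_model, n_heads, batch_first=True)
                self.encoder = nn.TransformerEncoder(layer, n_layers)
                self.head = nn.Linear(d_model, n_features)  # reconstruct hidden features

            def forward(self, x, mask_frac=0.3):
                # x: (batch, n_particles, n_features), e.g. (pt, eta, phi, E)
                h = self.embed(x)
                mask = torch.rand(x.shape[:2], device=x.device) < mask_frac
                h[mask] = self.mask_token          # hide a random subset of particles
                pred = self.head(self.encoder(h))
                # the loss is computed only on the masked positions
                return nn.functional.mse_loss(pred[mask], x[mask])

        model = MaskedParticlePretrainer()
        jets = torch.randn(8, 30, 4)  # toy batch: 8 jets x 30 particles x 4 features
        loss = model(jets)
        loss.backward()
        ```

        After pre-training, the encoder (without the reconstruction head) would serve as the reusable representation for downstream tasks.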
    • 15:00–15:45
      CMS/RooFit team discussion, 354/1-019 (CERN)
      Conveners: Javier Mauricio Duarte (Univ. of California San Diego (US)), Dr Pietro Vischia (Universidad de Oviedo and Instituto de Ciencias y Tecnologías Espaciales de Asturias (ICTEA))
    • 15:45–18:30
      Foundation Model Mini-Workshop: Day 1, 354/1-019 (CERN)
      • 15:45
        Foundation models for HEP 45m
        Speaker: Philip Coleman Harris (Massachusetts Inst. of Technology (US))
      • 16:30
        Invisible Cities: Towards a multi-modal era of fundamental physics research 1h

        To achieve some of the biggest physics discoveries in the last decade -- e.g. finding definitive evidence of the Higgs boson, gravitational waves, and black holes -- physicists had to radically re-imagine the paradigm of working in small teams and instead construct large-scale experimental collaborations of hundreds or even thousands of scientists. The recent success of foundation models in various domains raises the question: could our scientific conventions yet again be restricting our access to major discoveries? In this talk, I propose that an interdisciplinary, multi-modal approach to fundamental physics research will be critical to finally answering the grand scientific mysteries about our Universe that have thus far eluded our usual strategies. In particular, I will present some recent work from my team at Polymathic AI exploring how we might form our first scientific foundation models, and I'll also share my perspectives on how we should strive to shape such models to reflect our highest priorities as scientists.

        Speaker: Mariel Pettee (Lawrence Berkeley National Lab. (US))
      • 17:30
        Foundation Models for Scientific Discovery in the Lab of the Future 1h

        We reflect on the changes imparted by foundation models (FMs) to data-driven exploration and discovery in traditional scientific fields such as chemistry, spanning from hypothesis generation to experimental planning and validation. The emergence of multi-modal FMs also opens up new opportunities for data capture during manual experimentation, for which we present examples and lessons learned. Finally, we discuss recent work on how generative FMs can help accelerate simulation of stochastic particle shower events.

        Speaker: Patrick Ruch
  • Wednesday 2 October
    • 14:00–19:00
      Foundation Model Mini-Workshop: Day 2, 40/S2-B01 Salle Bohr (CERN)
      • 14:00
        Introduction to Day 2 15m
        Speakers: Dr Gaia Grosso (IAIFI, MIT), Raghav Kansal (Univ. of California San Diego (US))
      • 14:15
        Re-simulation-based self-supervised learning 45m

        Self-Supervised Learning (SSL) is at the core of training modern large machine learning models, providing a scheme for learning powerful representations that can be used in a variety of downstream tasks. We propose RS3L ("Re-simulation-based self-supervised representation learning"), a novel simulation-based SSL strategy that employs a method of re-simulation to drive data augmentation for contrastive learning in the physical sciences, particularly in fields that rely on stochastic simulators. By intervening in the middle of the simulation process and re-running simulation components downstream of the intervention, we generate multiple realizations of an event, thus producing a set of augmentations covering all physics-driven variations available in the simulator. Using experiments from high-energy physics, we explore how this strategy may enable the development of a foundation model; we show how RS3L pre-training enables powerful performance in downstream tasks such as discrimination of a variety of objects and uncertainty mitigation. A toy contrastive-learning sketch follows this entry.

        Speaker: Benedikt Maier (Imperial College (GB))
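        To make the augmentation-as-re-simulation idea concrete, here is a hedged PyTorch sketch of contrastive pre-training in which the two "views" of a jet are two simulator runs of the same event. The encoder, the NT-Xent loss, and the noise stand-in for re-simulation are assumptions for illustration only.

        ```python
        # Hedged sketch: SimCLR-style contrastive learning where positives are
        # re-simulated copies of the same event (the RS3L augmentation idea).
        import torch
        import torch.nn.functional as F

        def nt_xent(z1, z2, temperature=0.1):
            """Two embeddings of the same event are positives; all others negatives."""
            B = z1.shape[0]
            z = F.normalize(torch.cat([z1, z2]), dim=1)   # (2B, D) unit vectors
            sim = z @ z.t() / temperature                 # pairwise similarities
            sim = sim.masked_fill(torch.eye(2 * B, dtype=torch.bool), float("-inf"))
            targets = torch.cat([torch.arange(B, 2 * B), torch.arange(B)])
            return F.cross_entropy(sim, targets)

        encoder = torch.nn.Sequential(torch.nn.Linear(16, 64), torch.nn.ReLU(),
                                      torch.nn.Linear(64, 32))
        x_nominal = torch.randn(8, 16)                  # jet features, one simulation
        x_resim = x_nominal + 0.1 * torch.randn(8, 16)  # stand-in for a re-simulation
        loss = nt_xent(encoder(x_nominal), encoder(x_resim))
        loss.backward()
        ```

        In the actual RS3L setup the second view comes from re-running part of the simulation chain, so the augmentations span physics-driven variations rather than the additive noise used here as a placeholder.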
      • 15:00
        Finetuning Foundation Models for Joint Analysis Optimization 45m

        This talk highlights the significant gains in performance and data efficiency that can be achieved in HEP by moving away from the standard paradigm of separate reconstruction and analysis optimization. We introduce the key idea of fine-tuning a foundation model as a generalization of choosing working points in a physics analysis. The sensitivity gains achievable from end-to-end pipelines are demonstrated in an example with a heavy resonance decaying via di-Higgs to four b quarks, using CMS Open Data with a ParT backbone taken as the foundation model. A minimal fine-tuning sketch follows this entry.

        Speaker: Nicole Michelle Hartman (TUM (DE))
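        The fine-tuning idea can be illustrated with a short PyTorch sketch: a pre-trained backbone (a stand-in for a ParT-like model) receives a small analysis head, and the two are optimized with different learning rates so the analysis objective can gently reshape the pre-trained representation. All modules and rates here are toy assumptions, not the speakers' configuration.

        ```python
        # Hedged sketch: fine-tune a pre-trained backbone end-to-end with a task head.
        import torch
        import torch.nn as nn

        backbone = nn.Sequential(nn.Linear(16, 128), nn.ReLU(), nn.Linear(128, 64))
        # backbone.load_state_dict(torch.load("pretrained.pt"))  # hypothetical weights
        head = nn.Linear(64, 2)  # e.g. signal vs background for the analysis

        optimizer = torch.optim.AdamW([
            {"params": backbone.parameters(), "lr": 1e-5},  # small LR: keep pre-training
            {"params": head.parameters(), "lr": 1e-3},      # larger LR: fresh task head
        ])

        x, y = torch.randn(32, 16), torch.randint(0, 2, (32,))
        loss = nn.functional.cross_entropy(head(backbone(x)), y)
        loss.backward()
        optimizer.step()
        ```

        Freezing the backbone entirely would correspond to the traditional fixed working point; letting it train with a small learning rate is the generalization the talk describes.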
      • 15:45
        Coffee Break 15m
      • 16:00
        OmniJet-α: The first cross-task foundation model for particle physics 45m

        Foundation models are multi-dataset and multi-task machine learning methods that, once pre-trained, can be fine-tuned for a large variety of downstream applications. The successful development of such general-purpose models for physics data would be a major breakthrough, as they could improve the achievable physics performance while at the same time drastically reducing the required amount of training time and data.
        We report significant progress on this challenge on several fronts. First, a comprehensive set of evaluation methods is introduced to judge the quality of an encoding from physics data into a representation suitable for the autoregressive generation of particle jets with transformer architectures. These measures motivate the choice of a higher-fidelity tokenization compared to previous works. Second, we demonstrate transfer learning between an unsupervised problem (jet generation) and a classic supervised task (jet tagging) with our new OmniJet-α model. This is the first successful transfer between two different and actively studied classes of tasks and constitutes a major step in the building of foundation models for particle physics. A toy tokenization-and-generation sketch follows this entry.

        Speaker: Joschka Birk (Hamburg University (DE))
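        The two ingredients the abstract describes, tokenization and autoregressive generation, can be sketched as follows. The uniform-binning tokenizer and the tiny transformer below are toy stand-ins, not OmniJet-α's actual higher-fidelity tokenization or architecture.

        ```python
        # Hedged sketch: discretize continuous features into tokens, then train a
        # causal transformer to predict each next token (GPT-style generation).
        import torch
        import torch.nn as nn

        def tokenize(x, n_bins=64, lo=-5.0, hi=5.0):
            """Toy tokenizer: uniform binning of one feature into integer tokens."""
            return torch.bucketize(x.clamp(lo, hi), torch.linspace(lo, hi, n_bins - 1))

        class TinyJetGPT(nn.Module):
            def __init__(self, vocab=64, d_model=64, n_heads=4, n_layers=2):
                super().__init__()
                self.tok = nn.Embedding(vocab, d_model)
                layer = nn.TransformerEncoderLayer(d_model, n_heads, batch_first=True)
                self.body = nn.TransformerEncoder(layer, n_layers)
                self.out = nn.Linear(d_model, vocab)

            def forward(self, tokens):
                T = tokens.shape[1]
                causal = nn.Transformer.generate_square_subsequent_mask(T)
                h = self.body(self.tok(tokens), mask=causal)  # no peeking ahead
                return self.out(h)

        tokens = tokenize(torch.randn(8, 30))   # (batch, sequence) of token ids
        logits = TinyJetGPT()(tokens[:, :-1])   # predict each following token
        loss = nn.functional.cross_entropy(logits.reshape(-1, 64),
                                           tokens[:, 1:].reshape(-1))
        loss.backward()
        ```

        The quality of the tokenization step is exactly what the evaluation methods in the abstract are designed to judge; a crude binning like the one above loses fidelity that a learned tokenizer can retain.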
      • 16:45
        Foundation Models for Astrophysics and Representation Learning 45m

        Given the remarkable success of foundation models in language and vision, it is worth exploring whether a similar approach can be applied to scientific domains. These models have the potential to improve computational efficiency, generalize better to low-data regimes, and significantly amortize training costs. However, many questions remain open regarding architectures, data selection, preprocessing techniques, and evaluation strategies. In this talk, I will focus on two foundation model approaches for astrophysics. The first, AstroCLIP, uses contrastive learning to build a shared latent space by aligning two models representing different views of the same phenomenon. The second, AstroOBS (work in progress), uses latent masked modeling to construct a unified multimodal representation capable of integrating diverse observational data. I will also discuss the importance of representation learning and briefly mention our ongoing work on time-series modeling as a starting point for future modality encoders. A toy cross-modal contrastive sketch follows this entry.

        Speaker: Leopoldo Sarra
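        For a concrete picture of the AstroCLIP-style recipe, here is a hedged sketch of cross-modal contrastive alignment: two encoders, one per view of the same object, trained so that matching pairs are close in a shared latent space. The encoders, dimensions, and choice of modalities are toy assumptions for illustration.

        ```python
        # Hedged sketch: CLIP-style symmetric contrastive alignment of two views
        # of the same object (say, a galaxy image and its spectrum).
        import torch
        import torch.nn.functional as F

        image_enc = torch.nn.Linear(128, 32)     # stand-in for an image encoder
        spectrum_enc = torch.nn.Linear(256, 32)  # stand-in for a spectrum encoder

        def clip_loss(za, zb, temperature=0.07):
            za, zb = F.normalize(za, dim=1), F.normalize(zb, dim=1)
            logits = za @ zb.t() / temperature   # similarity of every pair
            targets = torch.arange(za.shape[0])  # i-th image matches i-th spectrum
            # symmetric InfoNCE: each modality must retrieve its partner
            return 0.5 * (F.cross_entropy(logits, targets) +
                          F.cross_entropy(logits.t(), targets))

        images, spectra = torch.randn(16, 128), torch.randn(16, 256)
        loss = clip_loss(image_enc(images), spectrum_enc(spectra))
        loss.backward()
        ```

        Once aligned, either encoder's latent space can be queried across modalities, which is what makes this construction attractive as a shared representation.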
      • 17:30
        OmniLearn: A Method to Simultaneously Facilitate All Jet Physics Tasks 1h

        Machine learning has become an essential tool in jet physics. Due to their complex, high-dimensional nature, jets can be explored holistically by neural networks in ways that are not possible manually. However, innovations in all areas of jet physics are proceeding in parallel. We show that specially constructed machine learning models trained for a specific jet classification task can improve the accuracy, precision, or speed of all other jet physics tasks. This is demonstrated by training on a particular multiclass generation and classification task and then using the learned representation for different generation and classification tasks, for datasets with a different (full) detector simulation, for jets from a different collision system (pp versus ep), for generative models, for likelihood ratio estimation, and for anomaly detection. Our OmniLearn approach is thus a foundation model and is made publicly available for use in any area where state-of-the-art precision is required for analyses involving jets and their substructure. A toy sketch of this representation reuse follows this entry.

        Speaker: Vinicius Massami Mikuni (Lawrence Berkeley National Lab. (US))
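        The reuse pattern the abstract describes, training on one jet task and transferring the learned representation to others, can be sketched as follows. The network, the freezing choice, and the downstream head are illustrative assumptions, not OmniLearn's actual design.

        ```python
        # Hedged sketch: reuse a classifier's penultimate representation for a new task.
        import torch
        import torch.nn as nn

        body = nn.Sequential(nn.Linear(16, 64), nn.ReLU(), nn.Linear(64, 64), nn.ReLU())
        class_head = nn.Linear(64, 10)   # source task: multiclass jet tagging

        # ... after training body + class_head on the source task, freeze the body ...
        for p in body.parameters():
            p.requires_grad = False

        new_head = nn.Linear(64, 1)      # downstream task, e.g. an anomaly or LR score
        x = torch.randn(32, 16)
        rep = body(x)                    # learned representation, reused as-is
        score = new_head(rep)
        labels = torch.randint(0, 2, (32,)).float()
        loss = nn.functional.binary_cross_entropy_with_logits(score.squeeze(1), labels)
        loss.backward()                  # gradients reach only the new head
        ```

        In practice the frozen body could also be partially unfrozen and fine-tuned, trading some generality of the representation for task-specific performance.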
      • 18:30
        Summary and Roundtable 30m
        Speakers: Dr Gaia Grosso (IAIFI, MIT), Raghav Kansal (Univ. of California San Diego (US))