11–15 Mar 2024
Charles B. Wang Center, Stony Brook University
US/Eastern timezone

CaloDiT: Diffusion with transformers for fast shower simulation

13 Mar 2024, 17:10
20m
Theatre ( Charles B. Wang Center, Stony Brook University )


100 Circle Rd, Stony Brook, NY 11794
Oral, Track 1: Computing Technology for Physics Research

Speaker

Piyush Raikwar (CERN)

Description

Recently, transformers have proven to be a generalised architecture for various data modalities, ranging from text (BERT, GPT-3) and time series (PatchTST) to images (ViT) and even combinations of them (DALL-E 2, OpenAI Whisper). Additionally, given enough data, transformers can learn better representations than other deep learning models, thanks to the absence of inductive bias, better modelling of long-range dependencies, and interpolation and extrapolation capabilities. On the other hand, diffusion models are the state-of-the-art approach for image generation, yet they still rely on conventional U-Net models, consisting mostly of convolution layers and making little use of the advantages of transformers. While these models show good generation performance, they lack the generalisation capabilities of the transformer architecture. Standard diffusion models with a U-Net architecture have already proven able to generate calorimeter showers, while transformer-based models, such as those based on a VQ-VAE architecture, also show promising results. Combining a diffusion model with a transformer architecture should bridge the quality of the generated samples obtained from diffusion with the generalisation capabilities of the transformer. In this paper, we propose CaloDiT, which models calorimeter shower generation as a diffusion process with transformer blocks. Furthermore, we show the ability of the model to generalise to different calorimeter geometries, bringing us closer to a foundation model for calorimeter shower generation.
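The two ingredients combined in CaloDiT can be sketched generically: a DDPM-style forward noising process over shower data, and self-attention over shower "tokens" in place of convolutions. The following is a minimal illustrative sketch only; the linear noise schedule, token shapes, and single-head identity-projection attention are assumptions for brevity, not the paper's actual CaloDiT configuration.

```python
import numpy as np

# DDPM-style forward process (generic sketch, not the CaloDiT implementation).
T = 1000
betas = np.linspace(1e-4, 0.02, T)   # linear noise schedule (assumed)
alphas_bar = np.cumprod(1.0 - betas)

def add_noise(x0, t, rng):
    """Sample x_t ~ q(x_t | x_0) = N(sqrt(a_bar_t) x_0, (1 - a_bar_t) I)."""
    eps = rng.standard_normal(x0.shape)
    xt = np.sqrt(alphas_bar[t]) * x0 + np.sqrt(1.0 - alphas_bar[t]) * eps
    return xt, eps  # a denoising network is trained to predict eps from (xt, t)

def self_attention(tokens):
    """Minimal single-head self-attention over tokens of shape (n_tokens, d).

    Real transformer blocks add learned Q/K/V projections, multiple heads,
    residual connections, layer norm, and an MLP; identity projections are
    used here purely for brevity.
    """
    d = tokens.shape[1]
    scores = tokens @ tokens.T / np.sqrt(d)
    weights = np.exp(scores - scores.max(axis=1, keepdims=True))
    weights /= weights.sum(axis=1, keepdims=True)
    return weights @ tokens

rng = np.random.default_rng(0)
shower = rng.random((64, 16))          # hypothetical: 64 voxel-patch tokens, d=16
noisy, eps = add_noise(shower, t=500, rng=rng)
attended = self_attention(noisy)
print(noisy.shape, attended.shape)     # (64, 16) (64, 16)
```

In a diffusion transformer, a stack of such attention blocks (conditioned on the timestep t and, here, on incident-particle and geometry information) replaces the U-Net as the noise predictor; sampling then runs the learned reverse process from pure noise back to a shower.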

Significance

This contribution presents a novel transformer-based machine learning model for generalised fast shower simulation, fitting the focus theme of ACAT 2024: foundation models.
