Speaker
Description
The simulation of particle interactions with detectors plays a critical role in understanding detector performance and optimizing physics analyses. Lacking guidance from first-principles theory, the current state-of-the-art simulation tool, \textsc{Geant4}, relies on phenomenology-inspired parametric models, which must be combined and carefully tuned to experimental observations. This tuning process is laborious, even with the help of semi-automated tools such as Professor.
Generative language models have shown outstanding performance in predicting the next token for a given prompt. Their capability to learn complex language patterns can potentially be leveraged to learn particle interactions from experimental data.
We introduce a Language Model-based framework for simulating particle detectors. In this framework, the incoming-particle information and detector hits are tokenized into discrete integers, and a transformer is trained to learn the statistical correlations between incoming particles and outgoing detector hits. Instead of predicting the detector hits directly, the transformer predicts outgoing tokens, which are then detokenized back into detector hits. This approach replaces a regression task with a multiclass classification task, at which transformers perform much better.
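As a minimal sketch of this pipeline (the binning scheme, vocabulary size, and model dimensions below are illustrative assumptions, not the trained configuration), continuous hit features can be quantized into token ids, fed through a small causal transformer trained with a next-token classification loss, and decoded back into continuous values:
\begin{verbatim}
import torch
import torch.nn as nn

N_BINS = 512  # assumed vocabulary size: one token id per quantization bin

def tokenize(values, lo=-1.0, hi=1.0):
    """Quantize continuous hit features in [lo, hi] into integer token ids."""
    t = (values.clamp(lo, hi) - lo) / (hi - lo)
    return (t * (N_BINS - 1)).round().long()

def detokenize(tokens, lo=-1.0, hi=1.0):
    """Map token ids back to the centers of their quantization bins."""
    return lo + tokens.float() / (N_BINS - 1) * (hi - lo)

class TinyDecoder(nn.Module):
    """GPT-like causal transformer predicting the next hit token."""
    def __init__(self, vocab=N_BINS, d_model=128, n_head=4, n_layer=2,
                 max_len=1024):
        super().__init__()
        self.tok = nn.Embedding(vocab, d_model)
        self.pos = nn.Embedding(max_len, d_model)
        layer = nn.TransformerEncoderLayer(d_model, n_head, batch_first=True)
        self.blocks = nn.TransformerEncoder(layer, n_layer)
        self.head = nn.Linear(d_model, vocab)  # logits for classification

    def forward(self, tokens):
        pos = torch.arange(tokens.size(1), device=tokens.device)
        x = self.tok(tokens) + self.pos(pos)
        mask = nn.Transformer.generate_square_subsequent_mask(tokens.size(1))
        return self.head(self.blocks(x, mask=mask))

# Next-token training objective: cross-entropy over the hit vocabulary,
# i.e. multiclass classification rather than regression.
hits = torch.rand(4, 16) * 2 - 1          # toy batch of continuous features
tokens = tokenize(hits)
model = TinyDecoder()
logits = model(tokens[:, :-1])
loss = nn.functional.cross_entropy(logits.reshape(-1, N_BINS),
                                   tokens[:, 1:].reshape(-1))
\end{verbatim}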
Beyond the simulation framework itself, our contributions include a particle tokenizer designed for point-cloud data and a pre-trained GPT-like model for simulating particles interacting with detector materials.
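One way such a point-cloud tokenizer might be structured (the feature set, value ranges, hit ordering, and bin counts here are illustrative assumptions) is to impose a canonical ordering on the hits and quantize each feature into its own disjoint block of token ids:
\begin{verbatim}
import numpy as np

# Assumed per-feature ranges for a hit (x, y, z, energy); an actual
# detector geometry would dictate these.
FEATURE_RANGES = [(-100.0, 100.0),   # x [cm]
                  (-100.0, 100.0),   # y [cm]
                  (0.0, 300.0),      # z [cm]
                  (0.0, 50.0)]       # energy [GeV]
BINS = 256  # bins per feature

def tokenize_point_cloud(hits):
    """hits: (n_hits, 4) array with columns (x, y, z, energy)."""
    hits = hits[np.argsort(hits[:, 2])]  # canonical order: along z
    tokens = []
    for hit in hits:
        for i, (lo, hi) in enumerate(FEATURE_RANGES):
            v = float(np.clip(hit[i], lo, hi))
            b = int((v - lo) / (hi - lo) * (BINS - 1))
            tokens.append(i * BINS + b)  # disjoint id block per feature
    return tokens

# One hit at (1, 2, 10) cm with 3.5 GeV becomes four tokens:
print(tokenize_point_cloud(np.array([[1.0, 2.0, 10.0, 3.5]])))
\end{verbatim}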
Significance
We introduce a novel Language Model-based framework for simulating particle interactions with matter. Our contributions include a particle tokenizer designed for point-cloud data and a pre-trained GPT-like model for simulating particles interacting with detector materials.