Description
This study proposes a new method for training foundation models designed explicitly for jet-related tasks. A foundation model, like those used for large language models, is a pre-trained model that is not tied to a specific task and can be fine-tuned for a variety of applications. Previous approaches typically mask random parts of the input, such as tracks within a jet, and train the model to predict the masked parts. However, unlike comparable methods in fields such as image recognition and point clouds, these techniques show smaller accuracy gains on downstream tasks, relative to models trained from scratch, as the amount of training data increases.
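As an illustration only (none of this code is from the study), the masked-prediction pretraining described above can be sketched in PyTorch as follows: a random subset of jet constituents is replaced by a learned mask embedding, and the model is trained to predict the hidden parts. In existing approaches the target is typically a discrete token index produced by a separate tokenizer; the module names, feature count, and hyperparameters below are assumptions.

```python
# Minimal sketch of masked-constituent pretraining (illustrative only; not
# the authors' implementation). Targets are discrete token ids, as in
# existing tokenizer-based approaches.
import torch
import torch.nn as nn

class MaskedJetPretrainer(nn.Module):
    def __init__(self, n_features=7, d_model=128, n_heads=8, n_layers=4, vocab_size=512):
        super().__init__()
        self.embed = nn.Linear(n_features, d_model)
        self.mask_token = nn.Parameter(torch.zeros(d_model))   # learned [MASK] embedding
        layer = nn.TransformerEncoderLayer(d_model, n_heads, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, n_layers)
        self.head = nn.Linear(d_model, vocab_size)              # predicts a discrete token id

    def forward(self, constituents, target_ids, mask_prob=0.4):
        # constituents: (batch, n_constituents, n_features)
        # target_ids:   (batch, n_constituents) token ids from an external tokenizer
        #               (in most existing methods, a vector-quantized codebook)
        x = self.embed(constituents)
        masked = torch.rand(x.shape[:2], device=x.device) < mask_prob
        x = torch.where(masked.unsqueeze(-1), self.mask_token.expand_as(x), x)
        logits = self.head(self.encoder(x))
        # loss is computed only on the masked positions
        return nn.functional.cross_entropy(logits[masked], target_ids[masked])
```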
Most existing methods rely heavily on vector quantization, which plays a decisive role in the resulting accuracy. In High Energy Physics (HEP), input variables often have highly skewed distributions, making them poorly suited to vector quantization. In addition, vector quantization with neural networks is known to be unstable during training.
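To make the skew argument concrete, here is a small standalone illustration (not from the study): nearest-neighbour assignment of a heavily skewed one-dimensional feature, e.g. a pT-like exponential distribution, to a codebook spread over the feature's range leaves most codebook entries with little or no data, which is one way vector quantization becomes wasteful and unstable.

```python
# Standalone illustration of why skewed HEP-style inputs sit awkwardly with
# vector quantization: most codebook entries receive (almost) no assignments.
import numpy as np

rng = np.random.default_rng(0)
x = rng.exponential(scale=1.0, size=100_000)        # heavily skewed, pT-like feature
codebook = np.linspace(x.min(), x.max(), 64)        # 64 codes spread over the range

# nearest-neighbour code assignment (the core VQ step)
codes = np.abs(x[:, None] - codebook[None, :]).argmin(axis=1)
usage = np.bincount(codes, minlength=codebook.size) / x.size

print(f"codes receiving >1% of the data: {(usage > 0.01).sum()} / {codebook.size}")
# The few codes near zero absorb most of the data, while a large fraction of
# the codebook is rarely or never selected and thus rarely updated -- one
# source of codebook collapse and training instability.
```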
To address these challenges, we propose a method that reconstructs the masked inputs without vector quantization. To reduce biases introduced by the model architecture, we use a LLaMA-type Transformer, which lets us evaluate the effectiveness of pre-training methods that do not rely on HEP-specific knowledge. We also discuss results of pre-training and fine-tuning on the JetClass dataset.
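Again as a hedged sketch rather than the authors' implementation, the proposed direction can be pictured as regressing the masked constituents' continuous features directly, with a LLaMA-style backbone (here HuggingFace's LlamaModel fed through inputs_embeds; its default causal attention and all hyperparameters below are assumptions, since the abstract does not specify them).

```python
# Sketch of masked reconstruction WITHOUT vector quantization: the masked
# constituents' continuous features are regressed directly. Hyperparameters
# and the use of HuggingFace's LlamaModel are illustrative assumptions.
import torch
import torch.nn as nn
from transformers import LlamaConfig, LlamaModel

class VQFreeJetPretrainer(nn.Module):
    def __init__(self, n_features=7, d_model=256):
        super().__init__()
        cfg = LlamaConfig(hidden_size=d_model, intermediate_size=4 * d_model,
                          num_hidden_layers=6, num_attention_heads=8,
                          vocab_size=1)              # vocab unused: we feed inputs_embeds
        self.backbone = LlamaModel(cfg)               # LLaMA-type Transformer (causal by default)
        self.embed = nn.Linear(n_features, d_model)
        self.mask_token = nn.Parameter(torch.zeros(d_model))
        self.head = nn.Linear(d_model, n_features)    # regresses the raw features

    def forward(self, constituents, mask_prob=0.4):
        x = self.embed(constituents)                                  # (B, N, d_model)
        masked = torch.rand(x.shape[:2], device=x.device) < mask_prob
        x = torch.where(masked.unsqueeze(-1), self.mask_token.expand_as(x), x)
        h = self.backbone(inputs_embeds=x).last_hidden_state
        pred = self.head(h)
        # reconstruction loss on the masked positions only -- no codebook involved
        return nn.functional.mse_loss(pred[masked], constituents[masked])
```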