7th Inter-Experimental LHC Machine Learning Workshop

Name: 7th Inter-Experimental LHC Machine Learning Workshop
Start: 2025-05-19T09:00:00+02:00
End: 2025-05-23T12:40:00+02:00
Location: CERN

Europe/Zurich timezone

Contact

iml.coordinators@cern.ch

Anomaly preserving contrastive neural embeddings for end-to-end model-independent searches at the LHC

21 May 2025, 15:40

20m

222/R-001 (CERN)


Contributed talk 2 ML for analysis: Event classification, statistical analysis and inference, anomaly detection Contributed Talks

Kyle Sidney Metzger

Anomaly detection, the identification of deviations from Standard Model predictions, is a key challenge at the Large Hadron Collider due to the size and complexity of its datasets. This is typically addressed by transforming high-dimensional detector data into lower-dimensional, physically meaningful features. We tackle feature extraction for anomaly detection by learning powerful low-dimensional representations via contrastive neural embeddings. This approach preserves potential anomalies indicative of new physics and enables rare signal extraction using novel machine-learning-based statistical methods for signal-independent hypothesis testing. We compare supervised and self-supervised contrastive learning methods, for both MLP- and Transformer-based neural embeddings, trained on the kinematic observables of physics objects in LHC collision events. The learned embeddings serve as input representations for signal-agnostic statistical detection methods in inclusive final states, achieving more than tenfold better detection performance than the original feature representation and up to a fourfold improvement over a physics-informed selection of the same dimensionality. We achieve a significant improvement in discovery power for both rare new physics signals and rare Standard Model processes across diverse final states, demonstrating the approach's applicability to searching efficiently for diverse signals simultaneously. We study the impact of architectural choices, contrastive loss formulations, supervision levels, and embedding dimensionality on anomaly detection performance. We show that the optimal representation for background classification does not always maximize sensitivity to new physics signals, revealing an inherent trade-off between background structure preservation and anomaly enhancement.
Our findings demonstrate that foundation models for particle physics data hold significant potential for improving neural feature extraction, enabling scientific discovery in inclusive final states at collider experiments.
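
The abstract does not say which contrastive loss formulations were studied, so the following is only an illustrative sketch of one common supervised contrastive loss (SupCon-style), written in plain Python. The function names, the temperature value, and the toy two-dimensional "embeddings" are assumptions made for this example and are not taken from the talk. The loss pulls embeddings that share a label (for instance, the same background process) together and pushes all other embeddings apart.

```python
import math


def l2_normalize(vector):
    """Scale a vector to unit length so dot products become cosine similarities."""
    norm = math.sqrt(sum(x * x for x in vector))
    return [x / norm for x in vector]


def supcon_loss(embeddings, labels, temperature=0.1):
    """SupCon-style supervised contrastive loss (illustrative, not the authors' code).

    For each anchor, the positives are the other samples with the same label.
    The loss is averaged over anchors that have at least one positive.
    """
    z = [l2_normalize(v) for v in embeddings]
    n = len(z)
    total, n_anchors = 0.0, 0
    for i in range(n):
        positives = [j for j in range(n) if j != i and labels[j] == labels[i]]
        if not positives:
            continue  # an anchor without positives contributes nothing
        # Temperature-scaled cosine similarity to every other sample.
        logits = {
            j: sum(a * b for a, b in zip(z[i], z[j])) / temperature
            for j in range(n)
            if j != i
        }
        # Numerically stable log-sum-exp over all non-anchor samples.
        m = max(logits.values())
        log_denom = m + math.log(sum(math.exp(s - m) for s in logits.values()))
        # Mean negative log-probability of picking a positive.
        total += -sum(logits[j] - log_denom for j in positives) / len(positives)
        n_anchors += 1
    return total / n_anchors if n_anchors else 0.0


# Toy embeddings (hypothetical): two tight clusters in 2D.
toy_embeddings = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
loss_clustered = supcon_loss(toy_embeddings, [0, 0, 1, 1])  # labels match clusters
loss_mixed = supcon_loss(toy_embeddings, [0, 1, 0, 1])      # labels cut across clusters
```

When the labels agree with the geometric clusters the loss is small, and when they cut across the clusters it is large. That difference is the signal a contrastive embedding is trained to reduce.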


Gaia Grosso (IAIFI, MIT)
Katya Govorkova (Massachusetts Inst. of Technology (US))
Kyle Sidney Metzger
Lana Xu (Massachusetts Institute of Technology)
Mia Sodini (Massachusetts Inst. of Technology (US))
Philip Coleman Harris (Massachusetts Inst. of Technology (US))
Thea Aarrestad (ETH Zurich (CH))

IML_2025_KyleMetzger_final.pdf
