7th Inter-Experimental LHC Machine Learning Workshop

Name: 7th Inter-Experimental LHC Machine Learning Workshop
Start: 2025-05-19T09:00:00+02:00
End: 2025-05-23T12:40:00+02:00
Location: CERN

19–23 May 2025

CERN

Europe/Zurich timezone

Contact

iml.coordinators@cern.ch

Efficient data movement for Machine Learning inference in heterogeneous CMS software

Not scheduled

20m

61/1-201 - Pas perdus - Not a meeting room - (CERN)


Poster 6 ML infrastructure: Hardware and software for ML/MLOps Poster Session

Speaker: Lukasz Michalski (Wroclaw University of Science and Technology (PL))

Efficient data processing using machine learning relies on heterogeneous computing approaches, but optimizing input and output data movements remains a challenge. In GPU-based workflows, data already resides in GPU memory, but machine learning models require the input and output data to be provided in a specific tensor format, which often forces unnecessary copies off the GPU device and extra conversion steps. To address this, we present an interface that allows seamless conversion of Structure of Arrays (SoA) data into lists of PyTorch tensors without explicit data movement. Our approach computes the necessary strides for various data types, including scalars and rows of vectors and matrices, allowing PyTorch tensors to directly access the data in GPU memory. The introduced metadata structure provides a flexible mechanism for defining the columns to be used and specifying the order of the resulting tensor list. This user-friendly interface minimizes the amount of code required, allowing direct integration with machine learning models. Implemented within the CMS computing framework and using the Alpaka library for heterogeneous applications, this solution significantly improves GPU efficiency. By avoiding unnecessary CPU-GPU transfers, it accelerates model execution while maintaining flexibility and ease of use.
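The stride computation described in the abstract can be illustrated with a small, CPU-only sketch. It is not the CMS or Alpaka interface: the `Column`, `View`, `describe`, and `read` names, the 128-byte column alignment, and the component-major storage of vector and matrix columns are all assumptions made for illustration. The sketch lays several columns out in one contiguous buffer and derives, for each column, the offset, shape, and strides that let a tensor-like view address the data in place.

```python
import struct
from dataclasses import dataclass
from math import prod

ALIGNMENT = 128  # bytes; assumed column alignment, not taken from the source


@dataclass(frozen=True)
class Column:
    name: str          # hypothetical column name
    fmt: str           # struct format character, e.g. "f" for float32
    shape: tuple = ()  # () scalar, (3,) vector, (2, 2) matrix per element


@dataclass(frozen=True)
class View:
    offset: int        # byte offset of the column inside the buffer
    shape: tuple       # (n_elements, *per-element shape)
    strides: tuple     # byte strides, one per axis of `shape`
    fmt: str


def align_up(nbytes, alignment=ALIGNMENT):
    """Round nbytes up to the next multiple of alignment."""
    return (nbytes + alignment - 1) // alignment * alignment


def describe(columns, n_elements):
    """Compute a zero-copy view description for every column, in list order."""
    views, offset = {}, 0
    for col in columns:
        itemsize = struct.calcsize(col.fmt)
        # Assumed layout: each component is one padded run of n_elements values.
        run = align_up(n_elements * itemsize)
        # Row-major strides over the per-element components, in bytes.
        inner, step = [], run
        for dim in reversed(col.shape):
            inner.insert(0, step)
            step *= dim
        views[col.name] = View(offset, (n_elements, *col.shape),
                               (itemsize, *inner), col.fmt)
        offset += prod(col.shape) * run
    return views, offset


def read(buffer, view, index):
    """Read one value in place through the view, without copying the column."""
    pos = view.offset + sum(i * s for i, s in zip(index, view.strides))
    return struct.unpack_from(view.fmt, buffer, pos)[0]


# The column list plays the role of the metadata: it selects columns and
# fixes the order of the resulting views.
columns = [Column("pt", "f"), Column("position", "f", (3,)), Column("cov", "f", (2, 2))]
n = 5
views, total_bytes = describe(columns, n)
buf = bytearray(total_bytes)

# Fill component k of element i of "position" with 10*i + k, directly in the buffer.
pos = views["position"]
for i in range(n):
    for k in range(3):
        where = pos.offset + i * pos.strides[0] + k * pos.strides[1]
        struct.pack_into("f", buf, where, 10 * i + k)

value = read(buf, pos, (3, 2))  # element 3, component 2
```

PyTorch expresses strides in elements rather than bytes, so dividing each byte stride by the item size would give values in the unit that `torch.as_strided` expects. On a GPU the same description would be applied to device memory instead of a `bytearray`.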

Would you like to be considered for an oral presentation?	Yes

Authors: Christine Zeh (Vienna University of Technology (AT)), Davide Valsecchi (ETH Zurich (CH)), Leonardo Beltrame (Politecnico di Milano (IT)), Lukasz Michalski (Wroclaw University of Science and Technology (PL))

Co-authors: Eric Cano (CERN), Felice Pantaleo (CERN)

IML2025__7th_IML_Workshop_on_Machine_Learning_at_the_LHC.pdf
