ACAT 2024

Name: ACAT 2024
Start: 2024-03-11T08:00:00-04:00
End: 2024-03-15T14:30:00-04:00
Location: Charles B. Wang Center, Stony Brook University

11–15 Mar 2024

Charles B. Wang Center, Stony Brook University

US/Eastern timezone

Contact

acat-loc2024@cern.ch

Accelerating Machine Learning Inference on GPUs with SYCL using SOFIE

12 Mar 2024, 11:50

20m

Theatre ( Charles B. Wang Center, Stony Brook University )

Theatre

Charles B. Wang Center, Stony Brook University

100 Circle Rd, Stony Brook, NY 11794

Oral Track 1: Computing Technology for Physics Research Track 1: Computing Technology for Physics Research

Dr Vincenzo Padulano (CERN)

Recently, machine learning has established itself as a valuable tool for researchers to analyze their data and draw conclusions in vari- ous scientific fields, such as High Energy Physics (HEP). Commonly used machine learning libraries, such as Keras and PyTorch, might provide functionality for inference, but they only support their own models, are constrained by heavy dependencies and often provide only a Python API and not a C++ one. SOFIE [13], which stands for System for Optimized Fast Inference code Emit, a part of the ROOT project developed at CERN, creates standalone C++ inference code from an input model in one of the popular machine learning formats. This code is directly invokable from other C++ projects and has minimal dependencies. We will present the new developments of SOFIE extending the functionality to generate SYCL code for machine learning model inference that can run on various GPU platforms and is only dependent on Intel MKL BLAS and portBLAS libraries, achieving a speedup of up to x258 over plain C++ code for large convolutional models.

Significance

This presentation covers new results coming from new developments that happened last year

Experiment context, if any	Work happening within the ROOT project (CERN EP/SFT) and in collaboration with CERN Openlab

Ioanna Maria Panagou Lorenzo Moneta (CERN) SANJIBAN SENGUPTA Dr Vincenzo Padulano (CERN)

TMVA_SOFIE_ACAT2024.pdf

ACAT 2024

Contact

Accelerating Machine Learning Inference on GPUs with SYCL using SOFIE

Theatre

Charles B. Wang Center, Stony Brook University

Speaker

Description

Significance

Authors

Presentation materials

Choose timezone

ACAT 2024

Contact

Speaker

Description

Significance

Authors

Presentation materials