ACAT 2024

Name: ACAT 2024
Start: 2024-03-11T08:00:00-04:00
End: 2024-03-15T14:30:00-04:00
Location: Charles B. Wang Center, Stony Brook University

11–15 Mar 2024

Charles B. Wang Center, Stony Brook University

US/Eastern timezone

Contact

acat-loc2024@cern.ch

Ahead-of-time (AOT) compilation of Tensorflow models for deployment

13 Mar 2024, 16:15

30m

Charles B. Wang Center, Stony Brook University

100 Circle Rd, Stony Brook, NY 11794

Poster Track 1: Computing Technology for Physics Research Poster session with coffee break

Bogdan Wiederspan (Hamburg University (DE))

In a wide range of high-energy particle physics applications, machine learning methods have proven as powerful tools to enhance various aspects of physics data analysis. In the past years, various ML models were also integrated in central workflows of the CMS experiment, leading to great improvements in reconstruction and object identification efficiencies. However, the continuation of successful deployments might be limited in the future due to memory and processing time constraints of more advanced models evaluated on central infrastructure.

A novel inference approach for models trained with TensorFlow, based on Ahead-of-time (AOT) compilation is presented. This approach offers a substantial reduction in memory footprint while preserving or even improving computational performance. This talk outlines strategies and limitations of this novel approach, and presents integration workflow for deploying AOT models in production.

Significance

The continuation of successful ML model deployments might be limited in the future due to memory and processing time constraints, and this contribution presents a novel approach for inference on central infrastructure that can drastically reduce resource consumption.

Experiment context, if any	CMS

Bogdan Wiederspan (Hamburg University (DE)) Marcel Rieger (Hamburg University (DE))

ahead_of_time_compilation_of_tensorflow_models_for_deplyoment.pdf

Ahead-of-Time_Compilation_of_TensorFlow_Models_in_CMS_Experiment_Software.pdf

ACAT 2024

Contact

Ahead-of-time (AOT) compilation of Tensorflow models for deployment

Charles B. Wang Center, Stony Brook University

Speaker

Description

Significance

Authors

Presentation materials

Peer reviewing

Paper

Choose timezone

ACAT 2024

Contact

Speaker

Description

Significance

Authors

Presentation materials

Peer reviewing

Paper