Fast Machine Learning for Science Workshop 2023

Name: Fast Machine Learning for Science Workshop 2023
Start: 2023-09-25T08:30:00+01:00
End: 2023-09-28T18:00:00+01:00
Location: Imperial College London

25–28 Sept 2023

Imperial College London

Europe/London timezone

Using NVIDIA Triton Server for Inference-as-a-Service at Fermilab

25 Sept 2023, 17:35

Blackett Laboratory, Lecture Theatre 1 (Imperial College London)

Blackett Laboratory, Lecture Theatre 1

Imperial College London

Blackett Laboratory

Lightning Talk Contributed Talks Contributed Talks

Claire Savard (University of Colorado Boulder (US))

With machine learning gaining more and more popularity as a physics analysis tool, physics computing centers, such as the Fermilab LHC Physics Center (LPC), are seeing huge increases in their resources being used for such algorithms. These facilities, however, are not generally set up efficiently for machine learning inference as they rely on slower CPU evaluation, which has a noticeable impact on time-to-insight and is detrimental to computational throughput. In this work, we will discuss how we used the NVIDIA Triton Inference Server to re-optimize Fermilab's resource allocation and computing structure to achieve high throughput for scaling out to multiple users parallelizing their machine learning inference at the same time. We will also demonstrate how this service is used in current physics analyses and provide steps for how others can apply this tool to their analysis code.

Claire Savard (University of Colorado Boulder (US))

FastML_2023.pdf

Fast Machine Learning for Science Workshop 2023

Using NVIDIA Triton Server for Inference-as-a-Service at Fermilab

Blackett Laboratory, Lecture Theatre 1

Imperial College London

Speaker

Description

Author

Presentation materials

Choose timezone

Fast Machine Learning for Science Workshop 2023

Speaker

Description

Author

Presentation materials