Speaker
Description
An Artificial Intelligence (AI) model will spend “90% of its lifetime in inference.” Fully utilizing coprocessors, such as FPGAs or GPUs, for AI inference requires O(10) CPU cores to feed work to the coprocessors. Traditional data analysis pipelines cannot use these coprocessors to their full potential. To allow distributed access to coprocessors for AI inference workloads, the LHC’s Compact Muon Solenoid (CMS) experiment developed the concept of Services for Optimized Network Inference on Coprocessors (SONIC) using NVIDIA’s Triton Inference Server. We have extended this concept to the IceCube Neutrino Observatory by deploying NVIDIA Triton Inference Servers in local and external Kubernetes clusters, integrating an NVIDIA Triton client with IceCube’s data analysis framework, and deploying an OAuth2-based HTTP authentication service in front of the Triton Inference Servers. We will describe the setup and our experience integrating it into IceCube’s offline processing system.
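As a rough illustration of the client side of this setup, the sketch below shows a remote inference request made with NVIDIA's Python `tritonclient` HTTP API, with an OAuth2 bearer token attached as an HTTP header for the authentication service in front of the server. The server URL, model name, tensor names, shapes, and token are placeholders, not details of the actual IceCube deployment.

```python
import numpy as np
import tritonclient.http as httpclient

# Placeholders (not the real deployment): endpoint behind the auth proxy
# and an OAuth2 access token obtained out of band.
TRITON_URL = "triton.example.org"
ACCESS_TOKEN = "..."

# ssl=True because the server sits behind an HTTPS authentication proxy.
client = httpclient.InferenceServerClient(url=TRITON_URL, ssl=True)

# Example input: one batch of float32 features with a made-up shape.
data = np.random.rand(1, 128).astype(np.float32)
inputs = [httpclient.InferInput("INPUT__0", data.shape, "FP32")]
inputs[0].set_data_from_numpy(data)
outputs = [httpclient.InferRequestedOutput("OUTPUT__0")]

# The bearer token is passed as an HTTP header on each inference request.
result = client.infer(
    model_name="example_model",
    inputs=inputs,
    outputs=outputs,
    headers={"Authorization": f"Bearer {ACCESS_TOKEN}"},
)
print(result.as_numpy("OUTPUT__0"))
```

The same pattern applies whether the Triton servers run in a local or an external Kubernetes cluster; only the URL and the credential handling change.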
Focus areas
MMA