Fast Machine Learning for Science Conference 2024

Name: Fast Machine Learning for Science Conference 2024
Start: 2024-10-15T08:00:00-04:00
End: 2024-10-18T21:00:00-04:00
Location: Purdue University

15–18 Oct 2024

Purdue University

America/Indiana/Indianapolis timezone

Inference as a Service for HEP ML Models on AMD GPUs Using the SONIC Framework

Not scheduled

20m

Steward Center 306 (Third floor) (Purdue University)

Steward Center 306 (Third floor)

Purdue University

128 Memorial Mall Dr, West Lafayette, IN 47907

Poster

Ethan Colbert (Purdue University (US))

One potential way to meet the quickly growing computing demands in High Energy Physics (HEP) experiments is by leveraging specialized processors such as GPUs. The “as a service” (AAS) approach helps improve utilization of GPU resources by allowing one GPU to serve a wide range of tasks, significantly reducing idle time. The SONIC project implements the AAS approach for a variety of widely used HEP algorithms and Machine Learning (ML) models by serving them using the NVIDIA Triton Inference Server framework. Focus has been primarily on serving models on NVIDIA GPUs, but the PyTriton package is flexible enough to allow Triton servers to be launched using AMD GPUs as well. This has been implemented, and the inference performance for two HEP ML models is compared across several AMD and NVIDIA GPUs.

Focus areas	HEP

Ethan Colbert (Purdue University (US)) Yongbin Feng (Texas Tech University (US))

Miaoyuan Liu (Purdue University (US))

FastML_poster_AMD_v3.pdf

Fast Machine Learning for Science Conference 2024

Inference as a Service for HEP ML Models on AMD GPUs Using the SONIC Framework

Steward Center 306 (Third floor)

Purdue University

Speaker

Description

Authors

Co-author

Presentation materials

Choose timezone

Fast Machine Learning for Science Conference 2024

Speaker

Description

Authors

Co-author

Presentation materials