Description
Recent studies on ITk data showed that Graph Neural Network (GNN)-based track finding can provide not only satisfactory tracking efficiency but also reasonable track resolution. However, GNN-based track finding is computationally slow on CPUs, demanding the use of coprocessors such as GPUs to speed up the inference. The large graph size, typically 300k nodes and 1M edges, requires significant GPU memory for feasible computation, and not all ATLAS computing sites are equipped with high-end GPUs such as A100s. These challenges must be addressed before GNN-based track finding can be deployed in production. We propose to address them by establishing the GNN-based track-finding algorithm as a service hosted either in clouds or at high-performance computing centers.
In this poster, we will describe the implementation of the GNN-based track-finding workflow as a service using the Nvidia Triton inference server. The pipeline contains three discrete deep-learning models and two CUDA-based algorithms. Because of the heterogeneity of the workflow, we explore different server settings to maximize the track-finding throughput. At the same time, we study the scalability of the inference server using the Perlmutter supercomputer at NERSC and cloud resources such as AWS and Google Cloud. We will present the studies performed with the stand-alone algorithm; integration and optimization of the workflow into ACTS and Athena are in progress.
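For illustration, a client of such a service could look like the minimal Python sketch below, which sends one event's spacepoint features to a Triton server and retrieves track labels. The model name, tensor names, and feature layout are placeholder assumptions, not the actual deployment configuration described in the poster.

```python
# Minimal sketch of a Triton gRPC client for an as-a-service track-finding
# pipeline. "gnn_track_finding", "FEATURES", and "TRACK_LABELS" are
# hypothetical names used only for illustration.
import numpy as np
import tritonclient.grpc as grpcclient

# Connect to a Triton server (e.g. a cloud or HPC endpoint).
client = grpcclient.InferenceServerClient(url="localhost:8001")

# Hypothetical input: one row per spacepoint with a few detector features.
spacepoints = np.random.rand(300_000, 3).astype(np.float32)

infer_input = grpcclient.InferInput("FEATURES", list(spacepoints.shape), "FP32")
infer_input.set_data_from_numpy(spacepoints)

# Hypothetical output: a track-candidate label for each spacepoint.
requested_output = grpcclient.InferRequestedOutput("TRACK_LABELS")

response = client.infer(
    model_name="gnn_track_finding",
    inputs=[infer_input],
    outputs=[requested_output],
)
track_labels = response.as_numpy("TRACK_LABELS")
print(track_labels.shape)
```

In this setup the client only ships spacepoint data and receives track candidates, so the GPU memory demands of the large graphs stay on the server side, which is the motivation for the as-a-service approach.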