4th Inter-experiment Machine Learning Workshop

Name: 4th Inter-experiment Machine Learning Workshop
Start: 2020-10-19T09:00:00+02:00
End: 2020-10-23T18:10:00+02:00
Location: No location set

19–23 Oct 2020

Europe/Zurich timezone

Contact

iml.coordinators@cern.ch

GPU and FPGA as a Service for Machine Learning Inference Accelerations

23 Oct 2020, 15:15

Lightning talk 6 ML infrastructure : Hardware and software for Machine Learning Workshop

Yu Lou (University of Washington (US))

The data rate may surge after some planned upgrades for the high-luminosity Large Hadron Collider (LHC) and accelerator-based neutrino experiments. Since there is no enough storage to save all of the data, there is a challenging demand to process and filter billions of events in real-time. Machine learning algorithms are becoming increasingly prevalent in the particle reconstruction pipeline. Specially designed hardware can significantly accelerate the machine learning inference time compared to CPUs. Thus, we propose a heterogeneous computing framework called the Services for Optimized Network Inference on Coprocessors (SONIC) to accelerate machine learning inferences with various coprocessors. With a unified interface, the framework conveniently provides GPU as a service, using either the Nvidia Triton framework or the Microsoft Brainwave service as the backend. It also features the first open-source FPGA-as-a-service toolkit, using either our hls4ml framework or the Xilinx ML Suite as the backend. We demonstrated that our method could speed up one classification and two regression problems in the LHC experiments and ProtoDUNE-SP. By providing coprocessors as a service, our work may assist various other computing workflows across science.

Yu Lou (University of Washington (US)) Javier Mauricio Duarte (Univ. of California San Diego (US)) Jeffrey Krupa Kelvin Lin (University of Washington (US)) Kevin Pedro (Fermi National Accelerator Lab. (US)) Dr Kyle Knoepfel (Fermi National Accelerator Laboratory) Maria Acosta Flechas (Fermi National Accelerator Lab. (US)) Matthew Trahms (UW ACME Lab) Mia Liu Michael Wang (Fermi National Accelerator Lab. (US)) Natchanon Suaysom (University of Washington (US)) Nhan Viet Tran (Fermi National Accelerator Lab. (US)) Philip Harris (Unknown) Scott Hauck (University of Washington) Shih-Chieh Hsu (University of Washington Seattle (US)) Ta-Wei Ho (National Tsing Hua University (TW)) Thomas Klijnsma (Fermi National Accelerator Lab. (US)) Tingjun Yang (Fermi National Accelerator Lab. (US)) Benjamin Hawks (Fermi National Accelerator Laboratory) Dr Burt Holzman (Fermi National Accelerator Lab. (US)) Dylan Sheldon Rankin (Massachusetts Inst. of Technology (US)) Jack Dinsmore

GPU and FPGA as a Service for Machine Learning Inference Accelerations - IML Workshop.pdf

IML Workshop rehearsal 2 - Tom.mp4

Yu Lou.mp4

4th Inter-experiment Machine Learning Workshop

Contact

GPU and FPGA as a Service for Machine Learning Inference Accelerations

Speaker

Description

Primary authors

Presentation materials

Choose timezone

4th Inter-experiment Machine Learning Workshop

Contact

Speaker

Description

Primary authors

Presentation materials

Share this page

Direct link

Social networks

Calendaring