25–28 Sept 2023
Imperial College London
Europe/London timezone

Portable Acceleration of CMS Mini-AOD Production with Coprocessors as a Service

25 Sept 2023, 14:30
15m
Blackett Laboratory, Lecture Theatre 1 (Imperial College London)

Standard Talk, Contributed Talks

Speaker

William Patrick Mccormack (Massachusetts Inst. of Technology (US))

Description

Computing demands for large scientific experiments, such as the CMS experiment at CERN, will increase dramatically in the coming decades. To complement future performance gains in software running on CPUs, coprocessor usage in data processing is of great interest and holds considerable potential. We explore the novel approach of Services for Optimized Network Inference on Coprocessors (SONIC) and study the deployment of this as-a-service approach in large-scale data processing. In this setup, the main CMS Mini-AOD creation workflow is executed on CPUs, while several machine learning (ML) inference tasks are offloaded to coprocessors, such as GPUs, which may be remote. With experiments performed at Google Cloud, at the Purdue Tier-2 computing center, and on combinations of the two, we demonstrate the acceleration of these ML algorithms individually on coprocessors and the corresponding throughput improvement for the entire workflow. We also show that this approach can be easily generalized to different types of coprocessors, and can even be deployed on local CPUs without a decrease in performance. We emphasize that SONIC enables high coprocessor utilization and provides the portability to run workflows on different types of coprocessors.

Authors

Dr Burt Holzman (Fermi National Accelerator Lab. (US))
Javier Mauricio Duarte (Univ. of California San Diego (US))
Jeffrey Krupa (Massachusetts Institute of Technology)
Kevin Pedro (Fermi National Accelerator Lab. (US))
Lindsey Gray (Fermi National Accelerator Lab. (US))
Maria Acosta Flechas (Fermi National Accelerator Lab. (US))
Miaoyuan Liu (Purdue University (US))
Nhan Tran (Fermi National Accelerator Lab. (US))
Nirmal Thomas
Philip Coleman Harris (Massachusetts Inst. of Technology (US))
Raghav Kansal (Univ. of California San Diego (US))
Simon Rothman (Massachusetts Inst. of Technology (US))
Stefan Piperov (Purdue University (US))
William Patrick Mccormack (Massachusetts Inst. of Technology (US))
Yongbin Feng (Fermi National Accelerator Lab. (US))
