Oct 19 – 23, 2020
Europe/Zurich timezone

Distributed training of graph neural networks at HPC

Oct 22, 2020, 11:25 AM
Lightning talk 6: ML infrastructure (Hardware and Software for Machine Learning Workshop)


Xiangyang Ju (Lawrence Berkeley National Lab. (US))


Graph Neural Networks (GNNs) are trainable functions that operate on a graph: they learn latent graph attributes and form a parameterized message passing by which information is propagated across the graph, ultimately learning sophisticated graph attributes. Their application in High Energy Physics has grown rapidly in recent years, ranging from event reconstruction to data analysis, and from precision measurements to searches for new physics. The size and complexity of the graphs are also growing. Because the graph data structure is irregular and sparse, it imposes non-trivial computational challenges. Current AI hardware primarily focuses on accelerating dense 1D or 2D arrays, to some extent neglecting sparse and irregular tensor calculations. In this talk, we take the GNN architecture used by the Exa.TrkX collaboration for track reconstruction, together with the TrackML challenge dataset, as the benchmark for evaluating distributed training strategies and Artificial Intelligence (AI) accelerators. We study different AI accelerators, both in the cloud and at a High Performance Computing center, as well as different distributed training strategies for GNNs and the scalability of these strategies on the different accelerators. Finally, the talk ends with an outlook on deploying GNNs for real-time data processing.
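To make the message-passing idea concrete, the following is a minimal sketch of one round of message passing on a toy graph, written with plain NumPy. It is an illustration only: the weight matrices, the sum aggregation, and the tanh update are generic assumptions, not the actual Exa.TrkX tracking architecture described in the talk.

```python
import numpy as np

# Toy graph: 4 nodes with 3-dim features, directed edges (src -> dst).
x = np.array([[1., 0., 0.],
              [0., 1., 0.],
              [0., 0., 1.],
              [1., 1., 0.]])
edges = np.array([[0, 1], [1, 2], [2, 3], [3, 0]])  # (src, dst) pairs

rng = np.random.default_rng(0)
W_msg = rng.normal(size=(3, 3))   # "learnable" message weights (random here)
W_upd = rng.normal(size=(6, 3))   # "learnable" update weights

def message_passing(x, edges):
    """One round of message passing: each node sums transformed
    messages from its incoming neighbors, then updates its state."""
    src, dst = edges[:, 0], edges[:, 1]
    msgs = np.zeros_like(x)
    # Messages are a linear transform of the source-node features,
    # scatter-added into the destination nodes (handles repeated dst).
    np.add.at(msgs, dst, x[src] @ W_msg)
    # Update: combine current state with aggregated messages.
    return np.tanh(np.concatenate([x, msgs], axis=1) @ W_upd)

h = message_passing(x, edges)
print(h.shape)  # one new latent attribute vector per node: (4, 3)
```

The irregularity the abstract refers to shows up in the scatter-add (`np.add.at`): unlike a dense matrix multiply, its memory access pattern depends on the edge list, which is why sparse GNN workloads map poorly onto accelerators tuned for dense arrays.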

Primary author

Xiangyang Ju (Lawrence Berkeley National Lab. (US))

Presentation materials