ACAT 2022

Name: ACAT 2022
Start: 2022-10-23T16:30:00+02:00
End: 2022-10-28T17:00:00+02:00
Location: Villa Romanazzi Carducci, Bari, Italy

23–28 Oct 2022

Villa Romanazzi Carducci, Bari, Italy

Europe/Rome timezone

Contact

acat-loc2022@cern.ch

Data transfer to remote GPUs over high performance networks

27 Oct 2022, 11:00

30m

Area Poster (Floor -1) (Villa Romanazzi)

Area Poster (Floor -1)

Villa Romanazzi

Poster Track 1: Computing Technology for Physics Research Poster session with coffee break

Ali Marafi (Kuwait University (KW)) Andrea Bocci (CERN)

In the past years the CMS software framework (CMSSW) has been extended to offload part of the physics reconstruction to NVIDIA GPUs. This can achieve a higher computational efficiency, but it adds extra complexity to the design of dedicated data centres and the use of opportunistic resources, like HPC centres. A possible solution to increase the flexibility of heterogeneous clusters is to offload part of the computations to GPUs installed in external, dedicated nodes.

Our studies on this topic have been able to achieve high-throughput, low-latency data transfers to and from a remote NVIDIA GPU across Mellanox NICs, using the Remote Direct Memory Access (RDMA) technology to access the GPU memory without involving either nodes' operating system.

In this work we present our approach based on the Open MPI framework, and compare the performance of data transfers of local and remote GPUs from different generations, using different communication libraries and network protocols.

Experiment context, if any	CMS

Ali Marafi (Kuwait University (KW))

Andrea Bocci (CERN)

Poster.pdf

ACAT2022.pdf

ACAT 2022

Contact

Data transfer to remote GPUs over high performance networks

Area Poster (Floor -1)

Villa Romanazzi

Speakers

Description

Author

Co-author

Presentation materials

Peer reviewing

Paper

Choose timezone

ACAT 2022

Contact

Speakers

Description

Author

Co-author

Presentation materials

Peer reviewing

Paper