ACAT 2022

Name: ACAT 2022
Start: 2022-10-23T16:30:00+02:00
End: 2022-10-28T17:00:00+02:00
Location: Villa Romanazzi Carducci, Bari, Italy

23–28 Oct 2022

Villa Romanazzi Carducci, Bari, Italy

Europe/Rome timezone

Contact

acat-loc2022@cern.ch

Challenges and opportunities integrating LLAMA into AdePT

27 Oct 2022, 17:00

20m

Sala Federico II (Villa Romanazzi)

Sala Federico II

Villa Romanazzi

Oral Track 1: Computing Technology for Physics Research Track 1: Computing Technology for Physics Research

Bernhard Manfred Gruber (Technische Universitaet Dresden (DE)) Guilherme Amadio (CERN) Stephan Hageboeck (CERN)

Particle transport simulations are a cornerstone of high-energy physics (HEP), constituting almost half of the entire computing workload performed in HEP. To boost the simulation throughput and energy efficiency, GPUs as accelerators have been explored in recent years, further driven by the increasing use of GPUs on HPCs. The Accelerated demonstrator of electromagnetic Particle Transport (AdePT) is an advanced prototype for offloading the simulation of electromagnetic showers in Geant4 to GPUs, and still undergoes continuous development and optimization. Improving memory layout and data access is vital to use modern, massively parallel GPU hardware efficiently, contributing to the challenge of migrating traditional CPU based data structures to GPUs in AdePT. The low-level abstraction of memory access (LLAMA) is a C++ library that provides a zero-runtime-overhead data structure abstraction layer, focusing on multidimensional arrays of nested, structured data. It provides a framework for defining and switching custom memory mappings at compile time to define data layouts and instrument data access, making LLAMA an ideal tool to tackle the memory-related optimization challenges in AdePT. Our contribution shares insights gained with LLAMA when instrumenting data access inside AdePT, complementing traditional GPU profiler outputs. We demonstrate traces of read/write counts to data structure elements as well as memory heatmaps. The acquired knowledge allowed for subsequent data layout optimizations.

Significance

AdePT is central to the current strategy for improving simulation throughput in Geant4. We contribute further optimizations to the project. By coupling these optimizations with LLAMA, a general-purpose library, the demonstrated strategies, insights and optimizations will be transferable to other projects targeting GPUs and heterogeneous systems as well.

References

AdePT at ACAT2021: https://indico.cern.ch/event/855454/contributions/4605037/
LLAMA paper: https://doi.org/10.1002/spe.3077

Bernhard Manfred Gruber (Technische Universitaet Dresden (DE))

AdePT LLAMA.pdf

main.pdf

ACAT 2022

Contact

Challenges and opportunities integrating LLAMA into AdePT

Sala Federico II

Villa Romanazzi

Speakers

Description

Significance

References

Author

Presentation materials

Peer reviewing

Paper

Choose timezone

ACAT 2022

Contact

Speakers

Description

Significance

References

Author

Presentation materials

Peer reviewing

Paper