ACAT 2024

Name: ACAT 2024
Start: 2024-03-11T08:00:00-04:00
End: 2024-03-15T14:30:00-04:00
Location: Charles B. Wang Center, Stony Brook University

11–15 Mar 2024

Charles B. Wang Center, Stony Brook University

US/Eastern timezone

Contact

acat-loc2024@cern.ch

A Microbenchmark Framework for Performance Evaluation of OpenMP Target Offloading

14 Mar 2024, 16:10

30m

Charles B. Wang Center, Stony Brook University

100 Circle Rd, Stony Brook, NY 11794

Poster Track 1: Computing Technology for Physics Research Poster session with coffee break

Mohammad Atif (Brookhaven National Laboratory)

We present a framework based on Catch2 to evaluate performance of OpenMP's target offload model via micro-benchmarks. The compilers supporting OpenMP's target offload model for heterogeneous architectures are currently undergoing rapid development. These developments influence performance of various physics applications in different ways. This framework can be employed to track the impact of compiler upgrades and compare their performance with the native programming models. We use the framework to benchmark performance of a few commonly used operations on leadership class supercomputers such as Perlmutter at National Energy Research Scientific Computing (NERSC) Center and Frontier at Oak Ridge Leadership Computing Facility (OLCF). Such a framework will be useful for compiler developers to gain insights into the overall impact of many small changes, as well as for users to decide which compilers and versions are expected to yield best performance for their applications.

Significance

Several portable programming models have been developed in the last decade as a solution to avoid code duplication and diversion for different GPU backends. This work focuses on OpenMP’s target offload model which has undergone rapid development in the past few years and gained increasing vendor support. Simplified algorithmic performance benchmarks provide a good overview of the current compiler support for OpenMP target offload, however, they often lack the granularity to evaluate the performance of specific, often specialized, operations, such as atomic, memset or scan. We have encountered several issues related to these operations when porting application codes to OpenMP, which motivated us to develop necessary tools for the
better evaluation of specific operations in OpenMP target offload.

Meifeng Lin Mohammad Atif (Brookhaven National Laboratory) Tianle Wang (Brookhaven National Lab) Zhihua Dong

openmp_benchmarks_acat2024.pdf

ACAT 2024

Contact

A Microbenchmark Framework for Performance Evaluation of OpenMP Target Offloading

Charles B. Wang Center, Stony Brook University

Speaker

Description

Significance

Authors

Presentation materials

Choose timezone

ACAT 2024

Contact

Speaker

Description

Significance

Authors

Presentation materials