19–25 Oct 2024
Europe/Zurich timezone

Latest developments of the PUNCH4NFDI compute and storage infrastructures

24 Oct 2024, 14:42
18m
Room 2.A (Seminar Room)

Room 2.A (Seminar Room)

Talk Track 7 - Computing Infrastructure Parallel (Track 7)

Speaker

Benoit Roland (KIT - Karlsruhe Institute of Technology (DE))

Description

The PUNCH4NFDI consortium, funded by the German Research Foundation for an initial period of five years, gathers various physics communities - particle, astro-, astroparticle, hadron and nuclear physics - from different institutions embedded in the National Research Data Infrastructure initiative. The overall goal of PUNCH4NFDI is the establishment and support of FAIR data management solutions for all users of the participating communities.

The federated compute and storage infrastructures made available to the PUNCH4NFDI consortium, Compute4PUNCH and Storage4PUNCH, will be presented. These infrastructures, comprising a variety of heterogeneous compute and storage systems provided by the participating institutions, are managed by an HTCondor overlay batch system and COBalD/TARDIS metaschedulers. The TARDIS manager dynamically integrates the various compute resources into one overlay batch system based on HTCondor, while the COBalD workload balancer optimizes the distribution of the tasks to be performed. The standardized access to the federated compute and storage resources is managed by a token-based authentication and authorization infrastructure. The refreshment of short-lived access tokens is automated in a transparent monitoring and renewal mechanism making use of the HTCondor credential manager in combination with the MyToken service. Login nodes are defining single entry points to the federation, while a virtualized and scalable software environments provisioning is ensured by the use of containers and the CERN Virtual Machine File System.

The latest developments of Compute4PUNCH and Storage4PUNCH will be presented, including the newly developed automated token management using HTCondor and Mytoken. In addition, the integration of Compute4PUNCH as a compute backend into the REANA reproducible analysis platform developed at CERN, with an instance hosted
and managed by the PUNCH4NFDI consortium, will be shown.

Primary authors

Dr Arman Khalatyan (AIP - Leibniz-Institut für Astrophysik Potsdam (DE)) Benoit Roland (KIT - Karlsruhe Institute of Technology (DE)) Christoph Wissing (Deutsches Elektronen-Synchrotron (DE)) Dr Harry Enke (AIP - Leibniz-Institut für Astrophysik Potsdam (DE)) Manuel Giffels (KIT - Karlsruhe Institute of Technology (DE)) Prof. Matthias Hoeft (TLS - Thüringer Landessternwarte (DE)) Michael Huebner (University of Bonn (DE)) Oliver Freyermuth (University of Bonn (DE))

Presentation materials