19–25 Oct 2024
Europe/Zurich timezone

Efficient and fast container execution using image snapshotters

22 Oct 2024, 14:06
18m
Large Hall B

Large Hall B

Talk Track 7 - Computing Infrastructure Parallel (Track 7)

Speaker

Clemens Lange (Paul Scherrer Institute (CH))

Description

A large fraction of computing workloads in high-energy and nuclear physics is executed using software containers. For physics analysis use, such container images often have sizes of several gigabytes. Executing a large number of such jobs in parallel on different compute nodes efficiently, demands the availability and use of caching mechanisms and image loading techniques to prevent network saturation and significantly reduce startup time. Using the industry-standard containerd container runtime for pulling and running containers, enables the use of various so-called snapshotter plugins that “lazily” load container images. We present a quantitative comparison of the performance of the CVMFS, SOCI, and Stargz snapshotter plugins. Furthermore, we also evaluate the user-friendliness of such approaches and discuss how such seamlessly containerised workloads contribute to the reusability and reproducibility of physics analyses.

Primary authors

Amal Santosh Thundiyil (Sardar Patel Institute Technology of Technology (SPIT) (IN)) Clemens Lange (Paul Scherrer Institute (CH)) Derek Feichtinger (Paul Scherrer Institute (CH)) Jakob Blomer (CERN) Max Fatouros (Paul Scherrer Institute (CH)) Valentin Volkl (CERN)

Presentation materials