19–25 Oct 2024
Europe/Zurich timezone

Preparation for Multi-Site Processing at the Vera C. Rubin Observatory

24 Oct 2024, 14:42
18m
Room 2.B (Conference Room)

Room 2.B (Conference Room)

Talk Track 4 - Distributed Computing Parallel (Track 4)

Speaker

Fabio Hernandez (IN2P3 / CNRS computing centre)

Description

After several years of focused work, preparation for Data Release Production (DRP) of the Vera C. Rubin Observatory’s Legacy Survey of Space and Time (LSST) at multiple data facilities is taking its shape. Rubin Observatory DRP features both complex, long workflows with many short jobs, and fewer long jobs with sometimes unpredictably large memory usage. Both of them create scaling issues that need to be addressed in order to meet the annual processing timeline.

This paper summarizes the infrastructure and services deployed at Rubin data facilities to support multi-site data processing. Rubin selected PanDA (Production and Distributed Analysis) to orchestrate its complex workflow and to manage its distributed workload. We address the interface between workflow/workload management system and Rubin’s campaign management system, as well as the associated analytics platform, and the interface to the observatory’s data management system.

Rubin has already exercised this infrastructure to process data from other observatories as well as simulated data. The experience of those processing campaigns is summarized in this paper. Finally, this paper outlines future plans, including providing the campaign management team a higher level view on ongoing campaigns and analyzing finished campaigns as well as using PanDA to support end users' need for batch processing from within a “hybrid” cloud approach to data hosting.

Authors

Brian Yanny (Fermilab) Dr Edward Karavakis (Brookhaven National Laboratory (US)) Fabio Hernandez (IN2P3 / CNRS computing centre) Jennifer Adelman-Mccarthy (Fermi National Accelerator Lab. (Fermilab)-Unknown-Unknown) Michelle Gower Peter Love (Lancaster University (GB)) Stephen R. Pietrowicz (National Center for Supercomputing Applications, USA) Tim Jenness (Vera C. Rubin Observatory, USA) Timothy John Noble (Science and Technology Facilities Council STFC (GB)) Wei Yang (SLAC National Accelerator Laboratory (US)) Wen Guan (Brookhaven National Laboratory (US)) Zhaoyu Yang (Brookhaven National Laboratory (US)) kian tat lim (SLAC National Accelerator Lab/Vera C. Rubin Observatory) richard dubois

Presentation materials