Conference on Computing in High Energy and Nuclear Physics

Name: Conference on Computing in High Energy and Nuclear Physics
Start: 2024-10-19T08:00:00+02:00
End: 2024-10-25T18:30:00+02:00

19–25 Oct 2024

Europe/Zurich timezone

Contact Program Chairs

chep2024-pc@cern.ch

Preparation for Multi-Site Processing at the Vera C. Rubin Observatory

24 Oct 2024, 14:42

18m

Room 2.B (Conference Room)

Talk | Track 4 - Distributed Computing | Parallel (Track 4)

Fabio Hernandez (IN2P3 / CNRS computing centre)

After several years of focused work, preparation for Data Release Production (DRP) of the Vera C. Rubin Observatory’s Legacy Survey of Space and Time (LSST) at multiple data facilities is taking shape. Rubin Observatory DRP features both complex, long workflows with many short jobs and a smaller number of long jobs whose memory usage is sometimes unpredictably large. Both create scaling issues that must be addressed to meet the annual processing timeline.

This paper summarizes the infrastructure and services deployed at Rubin data facilities to support multi-site data processing. Rubin selected PanDA (Production and Distributed Analysis) to orchestrate its complex workflows and to manage its distributed workload. We address the interface between the workflow/workload management system and Rubin’s campaign management system, the associated analytics platform, and the interface to the observatory’s data management system.

Rubin has already exercised this infrastructure to process data from other observatories as well as simulated data, and this paper summarizes the experience gained from those processing campaigns. Finally, it outlines future plans: giving the campaign management team a higher-level view of ongoing campaigns, analyzing finished campaigns, and using PanDA to support end users' batch processing needs within a “hybrid” cloud approach to data hosting.

Brian Yanny (Fermilab)
Dr Edward Karavakis (Brookhaven National Laboratory (US))
Fabio Hernandez (IN2P3 / CNRS computing centre)
Jennifer Adelman-Mccarthy (Fermi National Accelerator Lab. (Fermilab))
Michelle Gower
Peter Love (Lancaster University (GB))
Stephen R. Pietrowicz (National Center for Supercomputing Applications, USA)
Tim Jenness (Vera C. Rubin Observatory, USA)
Timothy John Noble (Science and Technology Facilities Council STFC (GB))
Wei Yang (SLAC National Accelerator Laboratory (US))
Wen Guan (Brookhaven National Laboratory (US))
Zhaoyu Yang (Brookhaven National Laboratory (US))
Kian Tat Lim (SLAC National Accelerator Lab/Vera C. Rubin Observatory)
Richard Dubois

CHEP2024_Preparation of Multi-Site Data Processing at the Vera C. Rubin Observatory.pdf
