4th Rucio Community Workshop (Virtual)
Rucio is a software framework that provides functionality to organize, manage, and access large volumes of scientific data using customisable policies. The data can be spread across globally distributed locations and across heterogeneous data centers, uniting different storage and network technologies as a single federated entity. Rucio offers advanced features such as distributed data recovery or adaptive replication, and is highly scalable, modular, and extensible. Rucio has been originally developed to meet the requirements of the high-energy physics experiment ATLAS, and is continuously extended to support LHC experiments and other diverse scientific communities.
We set up a mailing list to which you can subscribe and where we will send more details about the program in the coming weeks.
We also created a Slack channel dedicated to the workshop discussion on the Rucio Slack workspace (Invitation Link). Join #workshop.
-
-
Opening & Closing
- 1
- 2
- 3
-
Community ReportsConvener: Alastair Dewhurst (Science and Technology Facilities Council STFC (GB))
- 4
-
5
Rucio for the Light Dark Matter eXperiment (LDMX)
The Light Dark Matter eXperiment (LDMX) is a planned small-scale accelerator-based experiment to search for dark matter in the sub-GeV mass region. Finalizing the design of the detector relies on Monte-Carlo simulation of expected physics processes. A distributed computing pilot project was initiated based around existing software used by other communities, including Rucio for data management. Rucio is primarily used as a dataset catalog, and LDMX also makes extensive use of Rucio metadata to record physics properties as well as bookkeeping information on the data produced. In this talk we describe how Rucio is used by LDMX and propose some possible extensions to the metadata functionality.
Speaker: David Cameron (University of Oslo (NO)) -
6
DUNE Rucio Usage and Status
We present the current state of the DUNE Rucio deployment and plans for extension and expansion. Results will include new monitoring and testing that has been
added as well as plans for moving from the current hybrid system to a native
Rucio deployment.Speaker: Steven Timm (Fermi National Accelerator Lab. (US))
-
16:30
Break
-
Community ReportsConvener: Mario Lassnig (CERN)
-
7
Customizing Rucio at LCLS
This talk will present the speaker's onboarding experience as someone new to Rucio, covering the aspects of using documentation, building and standing up the containers, and configuring the Rucio system. Furthermore, specifics will be discussed in regard to the customization in development to meet the project requirements for LCLS, a free electron laser that produces ultra fast X-ray pulses.
Speaker: Kenny Lo (SLAC National Accelerator Laboratory) -
8
IGWNSpeaker: Gabriele Gaetano Fronze' (INFN Torino (IT) and LIGO-Virgo-Kagra Collaboration (US/IT/JP))
-
9
FTS: Updates, Direction and PlansSpeaker: Mihai Patrascoiu (CERN)
-
10
AAI/Tokens IAMSpeaker: Andrea Ceccanti (Universita e INFN, Bologna (IT))
-
7
-
Panels: Rucio in a non-grid environmentConvener: Cedric Serfon (Brookhaven National Laboratory (US))
-
11
Rucio in a non-grid environment panelSpeakers: Alastair Dewhurst (Science and Technology Facilities Council STFC (GB)), Andrea Manzi, David Cameron (University of Oslo (NO)), Ilija Vukotic (University of Chicago (US)), Mario Lassnig (CERN), Oliver Keeble (CERN)
-
11
-
-
-
KeynoteConvener: Mario Lassnig (CERN)
-
12
Pipe Dreams (and Nightmares)
Australia's Academic and Research Network (AARNet) was established in 1989 and is widely regarded as the founder of the Internet in Australia and renowned as the architect, builder and operator of world-class network infrastructure for research and education.
We are Australia's National Research and Education Network (NREN). We connect over one million users—researchers, faculty, staff and students—at institutions across Australia, supporting education and research across a diverse range of disciplines including high energy physics, climate science, genomics, radio astronomy and the arts.
Nationally, AARNet interconnects Australian universities, the CSIRO, and other organisations who have a research and education mission, or with whom the education and research sector interacts. These include hospitals, vocational training providers, schools and museums. Internationally, AARNet interconnects the Australian Research and Education (R & E) community to the world – and continuously develops new capabilities and partnerships to facilitate seamless data access and transfer.
Today, we'll talk about some of the work we've done in the data access and transfer space, challenges we've faced, and our plans for the future.Speaker: Crystal Michelle Chua
-
12
-
Community Reports: WFMSConvener: Cedric Serfon (Brookhaven National Laboratory (US))
-
13
PandaSpeaker: Paul Nilsson (Brookhaven National Laboratory (US))
- 14
-
15
CMS Workload Management and Rucio integrationSpeaker: Todor Trendafilov Ivanov (University of Sofia - St. Kliment Ohridski (BG))
-
16
Discussion
-
13
-
11:05
Break
-
Community ReportsConvener: Martin Barisits (CERN)
-
17
Belle IISpeaker: Cedric Serfon (Brookhaven National Laboratory (US))
-
18
Rucio and ScienceMesh: Enabling data management for the CS3 communitySpeakers: Giuseppe Lo Presti (CERN), Rahul Chauhan
- 19
-
17
-
Community Reports: AstronomyConvener: Rosie Bolton (SKA Organisation)
- 20
-
21
SKA Rucio deployment and metadata/ data lifecycle use caseSpeaker: Rohini Joshi (SKA Organisation)
- 22
-
23
CTA rucio use case and development with cloud storageSpeaker: Frederic Gillardo (Centre National de la Recherche Scientifique (FR))
-
24
LOFAR Use Cases in ESCAPE - Experience and Future Directions
We will be presenting our experiences and results for LOFAR use case using ESCAPE data lake and Rucio. Reflections on future requirements including for other data sets (e.g. APERTIF) shall also be deliberated.
Speaker: Yan Grange (ASTRON) - 25
- 26
-
16:40
Break
-
Community ReportsConvener: Eric Vaandering (Fermi National Accelerator Lab. (US))
-
27
ESCAPE Data Lake as a Service
Experiments and scientists, whether in the process of designing and building up a data management system or managing multi-petabyte data historically, gather in the European Science Cluster of Astronomy & Particle physics ESFRI research infrastructures (ESCAPE) project to address computing challenges by developing common solutions in the context of the EOSC.
A modular ecosystem of services and tools constitutes the ESCAPE Data Lake, which is exploited by flagship ESFRIs in Astro-particle Physics, Electromagnetic and Gravitational-Wave Astronomy, Particle Physics, and Nuclear Physics to pursue together the FAIR and open-access data principles.
This infrastructure fulfils the needs of the ESCAPE community in terms of data organisation, management, and access, and dedicated assessment exercises demonstrated its robustness.
As a result, collaborating sciences are choosing their reference implementations of the various technologies among the proposed solutions.
A variety of challenges and specific use cases boost ESCAPE to carefully take into account both user and infrastructure perspectives, and contributed to successfully conclude the pilot phase beyond expectations, embarking on a like-production prototype stage.
The ongoing phase of the project aims at consolidating the functionalities of the services, e.g. integrating token-based AuthN/Z or deploying a tailored content delivery and caching layer, and at simplifying the user experience. Specifically for this reason, a considerable effort is being devoted towards a DataLake-as-a-Service whose goal is to provide the end-user with a Notebook ready-to-be-used and fully integrated with the Data Lake.
ESCAPE milestones achieved during the length of the project represent a fundamental accomplishment under both sociological and computing model aspects for different scientific communities that should address upcoming data management and computing challenges in the next decade.Speaker: Dr Riccardo Di Maria (CERN) -
28
XENONSpeaker: Paschalis Paschos (University of Chicago)
-
29
GO/QoS/MASSpeaker: Matt Snyder (Brookhaven National Laboratory)
-
27
-
Panels: Astronomy & MetadataConvener: Rosie Bolton (SKA Organisation)
-
30
Astronomy & MetadataSpeakers: Cedric Serfon (Brookhaven National Laboratory (US)), Dave Morris, Greg Daues (NCSA), Pandey Vishambhar (ASTRON), Rob Barnsley (SKAO)
-
30
-
-
-
Community Reports: Long tail of scienceConvener: Alastair Dewhurst (Science and Technology Facilities Council STFC (GB))
-
31
Introduction - What is the 'Long Tail of Science'?Speaker: Alastair Dewhurst (Science and Technology Facilities Council STFC (GB))
-
32
Multi-VO RucioSpeaker: Timothy John Noble (Science and Technology Facilities Council STFC (GB))
- 33
-
34
Fermilab viewSpeaker: Brandon White (Fermi National Accelerator Lab. (US))
-
31
-
16:00
Break
-
Panels: Transfer & StorageConvener: Mario Lassnig (CERN)
-
35
Transfer & Storage panelSpeakers: Andrea Ceccanti (Universita e INFN, Bologna (IT)), Andrew Bohdan Hanushevsky (SLAC National Accelerator Laboratory (US)), Hannah Short (CERN), Martin Barisits (CERN), Mihai Patrascoiu (CERN), Paul Millar
-
35
-
Opening & Closing: Discussion, Photo and ClosingConvener: Martin Barisits (CERN)
-
36
Discussion
-
37
Photo
-
38
Closing
-
36
-