REDWOOD Summer Workshop

Europe/Zurich
Tour Lombarde, Conthey Switzerland Salle des Presidents/Salle Des Lombardes
Description

Workshop Scope and Goals: REDWOOD is a small collaboration of investigators from diverse fields of physical sciences combining their efforts to solve the very difficult challenge of complex workflows on large volumes of data distributed across multiple systems. The collaboration includes the University of Pittsburgh, Carnegie Mellon University, The University of Massachussets (Amherst), Brookhaven National Laboratory, Oak Ridge National Laboratory, and SLAC. Their efforts are set to benefit the ATLAS experiment, the Vera Rubin Observatory, and Fusion experiments at ORNL. Workflow management involves the software infrastructure to manage the processing steps which may include simulation, reconstruction, data analysis, or near real-time acquisition and processing of experimental data. Currently, for example, in ATLAS, over 1 million computational tasks execute concurrently on thousands of compute nodes. The collaboration operates on four tracks: Workflow algorithms, near real-time applications, workflow monitoring, and modeling & simulation of workflow. The geographically dispersed collaboration meets 2-3 times per year to insure sufficient contact between the four subgroups.

Dates: July 21-25 2025. A mini-workshop on distributed computing in ATLAS and CMS experiments with discussion sessions with experts, intended for co-PIs is held at CERN in parallel with a hackathon for junior collaborations at the workshop site in Conthey during the first two days; this is followed by an All-Hands meeting during the final three days, in Conthey.ย 

Location: Tour Lombarde, Conthey Switzerland

Mini-Workshop: CERN, Geneva Switzerland

ย 

Banquet venue: TBA

Local Organizers: Joe Boudreau, Raees Khan, Tania Korchuganova

External Organizers: Alexei Klimentov

The Zoom room link for the conference is: https://cern.zoom.us/j/68411518927

Passcode: 89864368

To acquire the Zoom room password, kindly send an email to any of the individuals listed in the Contact section.

A registration fee of CHF 200/Person will be collected at the workshop

ย 

Registration
Registration
    • 09:00 12:00
      Hackathon

      For collaborative coding, documentation, and other writing

      Convener: Raees Ahmad Khan (University of Pittsburgh (US))
    • 12:00 13:00
      Lunch 1h
    • 13:00 16:30
      Hackathon

      For collaborative coding, documentation, and other writing

      Convener: Raees Ahmad Khan (University of Pittsburgh (US))
    • 13:00 16:30
      Mini workshop: REDWOOD - ATLAS Distributed Computing mini-Workshop and Technical interchange meeting 61/1-009 - Room C (CERN)

      61/1-009 - Room C

      CERN

      22
      Show room on map

      Expert input from Rucio, ATLAS and CMS computing.

      Conveners: Adolfy Hoisie (Brookhaven National Laboratory (US)), Alexei Klimentov (Brookhaven National Laboratory (US)), Joseph Boudreau (University of Pittsburgh), Tadashi Maeno (Brookhaven National Laboratory (US))
    • 09:00 12:00
      Hackathon

      For collaborative coding, documentation, and other writing

      Convener: Raees Ahmad Khan (University of Pittsburgh (US))
    • 10:00 12:00
      Mini workshop: REDWOOD - ATLAS Distributed Computing mini-Workshop and Technical Interchange Meeting 40/R-D10 (CERN)

      40/R-D10

      CERN

      20
      Show room on map

      Expert input from Rucio, ATLAS and CMS computing.

      Conveners: Adolfy Hoisie (Brookhaven National Laboratory (US)), Alexei Klimentov (Brookhaven National Laboratory (US)), Joseph Boudreau (University of Pittsburgh), Tadashi Maeno (Brookhaven National Laboratory (US))
      • 10:00
        Discussion 2h

        Topics:

        - Workflow and data management
        - Modeling results (SimGrid & AI/ML)
        

        This mini workshop brings together the ATLAS Distributed Computing experts and REDWOOD Co-PIs from CMU, BNL, SLAC, ORNL, UPItt and UMass

    • 12:00 13:30
      Lunch 1h 30m
    • 13:00 16:30
      Mini workshop: REDWOOD - ATLAS Distributed Computing mini-Workshop and Technical Interchange Meeting

      Expert input from Rucio, ATLAS and CMS computing.

      Conveners: Joseph Boudreau (University of Pittsburgh (US)), Joseph Boudreau (University of Pittsburgh), Tadashi Maeno (Brookhaven National Laboratory (US)), Verena Ingrid Martinez Outschoorn (University of Massachusetts (US)), Wei Yang (SLAC National Accelerator Laboratory (US))
      • 13:00
        Discussion 2h 30m

        Topics:

        • Workflow and data management

        • Modeling results (SimGrid & AI/ML)

        This mini workshop brings together the ATLAS Distributed Computing experts and REDWOOD Co-PIs from CMU, BNL, SLAC, ORNL, UPItt and UMass

    • 13:30 17:00
      Hackathon

      For collaborative coding, documentation, and other writing

      Convener: Raees Ahmad Khan (University of Pittsburgh (US))
    • 09:30 11:00
      Round table discussion 1h 30m
    • 12:30 15:30
      Topical Presentations and Discussion Sessions
      • 12:30
        Workflow Management 1h
        • AI for Error Analysis 30m
          Speaker: Paul Nilsson (Brookhaven National Laboratory (US))
        • Categorization of Errors 30m
          Speaker: Tatiana Korchuganova (University of Pittsburgh (US))
      • 13:30
        Track 3: Monitoring and Integration 1h
        • Technology Components of the LLM Applications 30m
          Speaker: Wei Yang (SLAC National Accelerator Laboratory (US))
    • 15:30 18:30
      Topical Presentations and Discussion Sessions
      • 15:30
        Simulation and Modeling 2h
        • Framework Updates & Documentation 20m
          Speaker: Raees Ahmad Khan (University of Pittsburgh (US))
        • Site Data Collection 20m
          Speaker: Paul Nilsson (Brookhaven National Laboratory (US))
        • Simulator Calibration 20m
          Speaker: Sairam Sri Vatsavai (Brookhaven National Laboratory (US))
        • ML Surrogate Model for Simulation 20m
          Speaker: Sairam Sri Vatsavai (Brookhaven National Laboratory (US))
        • Updates in Allocation Algorithms 20m
          Speaker: Fatih Furkan Akman (University of Massachusetts (US))
        • Rucio Study/Data Transfer Model 20m
          Speakers: Kuan-Chieh Hsu (Brookhaven National Laboratory (US)), Kuan-Chieh Hsu (Brookhaven National Laboratory)
        • Monitoring and Visualization 20m
          Speaker: John Rembrandt Steele (University of Massachusetts (US))
    • 09:00 12:10
      Track Status Reports
      • 09:00
        Track 1: High Throughput computing/Workflow Management 40m
        Speaker: Tadashi Maeno (Brookhaven National Laboratory (US))
      • 09:40
        Track 2: Near Real Time Computing 40m
        Speakers: Frederic Suter, Norbert Podhorszki (Oak Ridge National Laboratory), Norbert Podhorszki (Unknown), Scott Klasky
      • 10:20
        Coffee Break 25m
      • 10:45
        Track 3: Monitoring and Integration 40m
        Speaker: Wei Yang (SLAC National Accelerator Laboratory (US))
      • 11:25
        Track 4: Modeling and Simulation 45m
        Speaker: Adolfy Hoisie (Brookhaven National Laboratory (US))
    • 12:10 14:00
      Lunch 1h 50m
    • 14:00 17:05
      Topical Presentations and Discussion Sessions
      • 14:00
        High Throughput Computing/Workflow Management 1h 30m
        • Scout prediction in PanDA with AI/ML 20m
          Speaker: Dr Tasnuva Chowdhury (Brookhaven National Laboratory (US))
        • Error Categorization at Scale: An LLM-MCP System for 8 Million Rubin PanDA Jobs 20m
          Speakers: Kenny Lo, Kenny Weng Kong Lo
    • 09:00 11:00
      Executive Session
    • 11:30 12:00
      Closeout 30m