RAL Tier1 Experiments Liaison Meeting

Europe/London
Access Grid (RAL R89)

Access Grid

RAL R89

Videoconference
RAL Tier1 Experiments Liaison Meeting
Zoom Meeting ID
66811541532
Host
Alastair Dewhurst
Useful links
Join via phone
Zoom URL

ACTIONS (From last meeting):

DM - Add brief current status to GGUS tickets to aid understanding of non-regular attendees - COMPLETE

TBr - Increase the number of WN's  running XrootD 5.2 from 10 to 20 to help facilitate the investigation of low I/O rates. - COMPLETE

ACTIONS (New)

TBr - Liasions have decided that the lower rate of I/O with XrootD 5.2 is acceptabel and have authorised TBr to roll-out to the complete batch farm.

 

There are minutes attached to this event. Show them.
    • 1:18 PM 1:19 PM
      Major Incidents Changes 1m
    • 1:19 PM 1:20 PM
      Summary of Operational Status and Issues 1m
      Speakers: Brian Davies (Lancaster University (GB)) , Darren Moore (Science and Technology Facilities Council STFC (GB))
    • 1:20 PM 1:21 PM
      GGUS /RT Tickets 1m

      https://tinyurl.com/T1-GGUS-Open
      https://tinyurl.com/T1-GGUS-Closed

    • 1:21 PM 1:22 PM
      Site Availability 1m

      https://lcgwww.gridpp.rl.ac.uk/utils/availchart/

      https://cms-site-readiness.web.cern.ch/cms-site-readiness/SiteReadiness/HTML/SiteReadinessReport.html#T1_UK_RAL

      http://hammercloud.cern.ch/hc/app/atlas/siteoverview/?site=RAL-LCG2&startTime=2020-01-29&endTime=2020-02-06&templateType=isGolden

    • 1:22 PM 1:23 PM
      Experiment Operational Issues 1m
    • 1:24 PM 1:25 PM
      VO-Liaison ATLAS 1m
      Speakers: James William Walder (Science and Technology Facilities Council STFC (GB)) , Dr Tim Adye (Science and Technology Facilities Council STFC (GB))

      RAL batch queues set offline last night (Triggered by CASTOR downtime).

      Online briefly, but now set offline for the weekend (48Hr prior to Downtime). 

       - Could it be easily done to have a managed release of jobs when downtime released? e.g. remove surplus options / cap the VOs to their pledge? 

       

      External Gateways running with Webdav enabled. Input from ATLAS experts needed before running TPC passive mode tests.

       

    • 1:25 PM 1:26 PM
      VO Liaison CMS 1m
      Speaker: Katy Ellis (Science and Technology Facilities Council STFC (GB))
    • 1:30 PM 1:31 PM
      VO Liaison LHCb 1m
      Speaker: Raja Nandakumar (Science and Technology Facilities Council STFC (GB))
      LHCb:
       
      1. arc-ce-test02 : LHCbDirac jobs run fine there (tested in the LHCb certification system).
       
      2. https://ggus.eu/index.php?mode=ticket_info&ticket_id=151955
      Title : Checksums using xrootd on ECHO taking a long time.
      Issue : As in title
      Status : Open - possibly implemented next week after code review?
       
      3. https://ggus.eu/index.php?mode=ticket_info&ticket_id=152009
      Title : FTS3 transfers Failed (RAL => CERN) RAL-LCG2
      Issue : Data corruption at RAL again. Updated with as much information as available about the file in question.
      Note: Similar issue in old ticket https://ggus.eu/index.php?mode=ticket_info&ticket_id=150898
      Status : Need to understand why the original write failure occurs - so, it is likely a CEPH / ECHO issue as the error is seen both in gsiftp and xrootd writes. The problem was first seen on 8 March 2021 and not seen before then. So, some maybe related to some change done before then.
       
      4. https://ggus.eu/index.php?mode=ticket_info&ticket_id=150653
      Title : FTS error: libX509SciTokensIssuer.so ...
      Issue : RAL FTS does not support macaroons. FTS upgrade needed at RAL.
      Resolution : Upgraded - to be tested by LHCb when Brian gets some information about authentication from FTS developers
      Status : Waiting for upgrade to be scheduled.
       
      5. https://ggus.eu/index.php?mode=ticket_info&ticket_id=142350
      Title : Proble accessing some LHCb files at RAL
      Issue : xrootd vector reads from CEPH (ECHO@RAL) not properly implemented.
      Status : Waiting for fix.
       
       
      DUNE:
       
      Looking forward to http(s) access for ECHO (xrootd5)
      Waiting for CTA to go into production at RAL for testing tape access.
       
    • 1:35 PM 1:38 PM
      VO Liaison LSST 3m
      Speaker: Joshua Kitenge
    • 1:40 PM 1:41 PM
      VO Liaison Others 1m
    • 1:45 PM 1:46 PM
      Experiment Planning 1m
    • 1:50 PM 1:51 PM
      Euclid 1m
    • 1:55 PM 1:56 PM
      SKA 1m
    • 2:00 PM 2:10 PM
      Dune/protoDune 10m
    • 2:10 PM 2:11 PM
      AOB 1m
    • 2:15 PM 2:16 PM
      Any other Business 1m
      Speakers: Brian Davies (Lancaster University (GB)) , Darren Moore (Science and Technology Facilities Council STFC (GB))