RAL Tier1 Experiments Liaison Meeting

Europe/London
Access Grid (RAL R89)

Description
    • 13:38 13:39
      Major Incidents Changes 1m
    • 13:39 13:40
      Summary of Operational Status and Issues 1m
      Speakers: Brian Davies (Lancaster University (GB)), Darren Moore (Science and Technology Facilities Council STFC (GB))
    • 13:40 13:41
      GGUS/RT Tickets 1m

      https://tinyurl.com/T1-GGUS-Open
      https://tinyurl.com/T1-GGUS-Closed

    • 13:41 13:42
      Site Availability 1m

      https://lcgwww.gridpp.rl.ac.uk/utils/availchart/

      https://cms-site-readiness.web.cern.ch/cms-site-readiness/SiteReadiness/HTML/SiteReadinessReport.html#T1_UK_RAL

      http://hammercloud.cern.ch/hc/app/atlas/siteoverview/?site=RAL-LCG2&startTime=2020-01-29&endTime=2020-02-06&templateType=isGolden

    • 13:42 13:43
      Experiment Operational Issues 1m
    • 13:44 13:45
      VO-Liaison ATLAS 1m
      Speakers: James William Walder (Science and Technology Facilities Council STFC (GB)), Dr Tim Adye (Science and Technology Facilities Council STFC (GB))

      MCTape deletions are slow and are holding down the overall deletion rate:
       - https://helpdesk.gridpp.rl.ac.uk/Ticket/Display.html?id=382696
      * A dedicated RAL Rucio Reaper has been set up for these deletions
       - Number of threads adjusted to give a deletion rate of ~3.3 Hz
       - Completion expected around the beginning of next week
       - It would still be good to understand whether 3 s (asynchronous) is a usual time for a deletion request.
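
The relation between the thread count, the ~3 s per request and the ~3.3 Hz rate quoted above can be sketched with Little's law (throughput = requests in flight / latency). This is only a rough sizing aid, assuming each thread holds one deletion request for the full ~3 s; the function names and the backlog size below are hypothetical and do not come from the ticket.

```python
# Rough sizing sketch for the dedicated reaper, using Little's law:
#   throughput (Hz) = concurrent requests in flight / latency per request (s)
# Assumption: each thread is occupied by one deletion request for the full latency.

def threads_for_rate(target_rate_hz: float, latency_s: float) -> float:
    """Concurrent in-flight requests needed to sustain target_rate_hz."""
    return target_rate_hz * latency_s


def hours_to_drain(n_files: int, rate_hz: float) -> float:
    """Time in hours to delete n_files at a steady rate of rate_hz."""
    return n_files / rate_hz / 3600.0


LATENCY_S = 3.0        # from the notes: ~3 s per deletion request
TARGET_RATE_HZ = 3.3   # from the notes: ~3.3 Hz overall rate

threads = threads_for_rate(TARGET_RATE_HZ, LATENCY_S)
print(f"~{threads:.0f} concurrent requests needed")

# Hypothetical backlog, for illustration only (the real number is not in the notes).
HYPOTHETICAL_BACKLOG = 1_000_000
hours = hours_to_drain(HYPOTHETICAL_BACKLOG, TARGET_RATE_HZ)
print(f"~{hours:.0f} h to drain {HYPOTHETICAL_BACKLOG} files")
```

Under these assumptions, roughly ten requests in flight account for the quoted rate; if the real latency differs from 3 s, the thread count scales in proportion.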

      WN slots - modification to the slot count for the test tranche
       - https://helpdesk.gridpp.rl.ac.uk/Ticket/Display.html?id=381995
      Jose hardcoded 48 slots for the test tranche, but the change may not be propagating from Quattor to Condor (still to be confirmed).

       

    • 13:46 13:47
      VO Liaison CMS 1m
      Speaker: Katy Ellis (Science and Technology Facilities Council STFC (GB))

      Tape is still full, so no new transfers are coming in; this is why the FTS status is currently at zero. CMS does not currently need additional tape to be provided.

      I thought I had accidentally deleted part of the CSA07 folder on CASTOR. After investigation, I found that the data (2007 training data) is no longer needed by CMS and had in fact already been deleted, so I only removed empty folders. I sent a list of the deleted empty folders to AD and DM.

      A network intervention on jumbo frames was scheduled for today, but we don't think it happened. The OPN was, however, switched over to the new 100Gb link (the OPN links RAL to CERN and the T1s).

      CMS job efficiency has been up and down. Some jobs were probably running purely on on-site data, although I cannot confirm this.

    • 13:48 13:49
      VO Liaison LHCb 1m
      Speaker: Raja Nandakumar (Science and Technology Facilities Council STFC (GB))

      LHCb:

      1. ECHO streaming issue
        • Waiting for development update
        • Testing smaller block sizes on WNs
          • Studying a couple of features in the current job failure rates
          • More information hopefully on Friday
      2. This morning's OPN update: no specific issue seen by LHCb

      DUNE:

      Averaging ~100 concurrent jobs over the last few days and ~300 concurrent jobs over the last few hours. Jobs run in 4-core slots.
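
As a quick conversion of the DUNE job counts into core usage, the arithmetic can be sketched as below. It assumes every job occupies exactly one 4-core slot, as stated in the notes; the function name is hypothetical.

```python
# Convert concurrent DUNE job counts into cores in use.
# Assumption: every job occupies exactly one 4-core slot (from the notes).

CORES_PER_JOB = 4


def cores_in_use(concurrent_jobs: int, cores_per_job: int = CORES_PER_JOB) -> int:
    """Total cores occupied by the given number of concurrent jobs."""
    return concurrent_jobs * cores_per_job


for label, jobs in [("last few days", 100), ("last few hours", 300)]:
    print(f"{label}: ~{jobs} jobs -> ~{cores_in_use(jobs)} cores")
```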

       

    • 13:52 13:53
      VO Liaison Others 1m
    • 13:53 13:54
      Experiment Planning 1m
    • 13:54 13:55
      Dune/protoDune 1m
    • 13:55 13:56
      Euclid 1m
    • 13:56 13:57
      SKA 1m
    • 13:57 13:58
      AOB 1m
    • 13:58 13:59
      Any Other Business 1m
      Speakers: Brian Davies (Lancaster University (GB)), Darren Moore (Science and Technology Facilities Council STFC (GB))