RAL Tier1 Experiments Liaison Meeting

Europe/London
Access Grid (RAL R89)

Description
    • 13:38 13:39
      Major Incidents Changes 1m
    • 13:39 13:40
      Summary of Operational Status and Issues 1m
      Speakers: Brian Davies (Lancaster University (GB)), Darren Moore (Science and Technology Facilities Council STFC (GB))
    • 13:40 13:41
      GGUS/RT Tickets 1m

      https://tinyurl.com/T1-GGUS-Open
      https://tinyurl.com/T1-GGUS-Closed

    • 13:41 13:42
      Site Availability 1m

      https://lcgwww.gridpp.rl.ac.uk/utils/availchart/

      https://cms-site-readiness.web.cern.ch/cms-site-readiness/SiteReadiness/HTML/SiteReadinessReport.html#T1_UK_RAL

      http://hammercloud.cern.ch/hc/app/atlas/siteoverview/?site=RAL-LCG2&startTime=2020-01-29&endTime=2020-02-06&templateType=isGolden

    • 13:42 13:43
      Experiment Operational Issues 1m
    • 13:44 13:45
      VO-Liaison ATLAS 1m
      Speakers: James William Walder (Science and Technology Facilities Council STFC (GB)), Dr Tim Adye (Science and Technology Facilities Council STFC (GB))

      NTR

    • 13:46 13:47
      VO Liaison CMS 1m
      Speaker: Katy Ellis (Science and Technology Facilities Council STFC (GB))

      I have been trying to fix the AAA service, which for the last week or more has been causing SAM tests to fail, both directly (the AAA tests) and indirectly on the WNs (xrootd-access). RALPP and other UK T2s were also affected at times. I found that the redirector based at RAL was full of logs; Chris and I cleaned it up and restarted services. I also changed the throttling value on gw10 (not gw11) to try to allow more traffic and hence give the tests a better chance of reaching our AAA. All services, from the proxies to the redirector, then had to be restarted again. As of now things are looking better, but we have seen periods of passing tests before, so the improvement may just be another of those. The Vande monitoring shows no increase in throughput from the changed throttling, so I don't know whether that config change is effective.

    • 13:48 13:49
      VO Liaison LHCb 1m
      Speaker: Raja Nandakumar (Science and Technology Facilities Council STFC (GB))

      LHCb:

      1. Low number of running jobs at RAL
        • Jose aware and looking at it
      2. Streaming issue from ECHO ongoing
        • Development ongoing (no news?)
        • Looking at mitigation with Tom

      DUNE:

      1. Normal operations
    • 13:52 13:53
      VO Liaison Others 1m
    • 13:53 13:54
      Experiment Planning 1m
    • 13:54 13:55
      DUNE/ProtoDUNE 1m
    • 13:55 13:56
      Euclid 1m
    • 13:56 13:57
      SKA 1m
    • 13:57 13:58
      AOB 1m
    • 13:58 13:59
      Any Other Business 1m
      Speakers: Brian Davies (Lancaster University (GB)), Darren Moore (Science and Technology Facilities Council STFC (GB))