RAL Tier1 Experiments Liaison Meeting

Europe/London
Access Grid (RAL R89)

Access Grid

RAL R89

Description
    • 13:38
      Major Incidents Changes
    • 1
      Summary of Operational Status and Issues
      Speakers: Brian Davies (Lancaster University (GB)), Darren Moore (Science and Technology Facilities Council STFC (GB))
    • 2
      GGUS /RT Tickets

      https://tinyurl.com/T1-GGUS-Open
      https://tinyurl.com/T1-GGUS-Closed

    • 3
      Site Availability

      https://lcgwww.gridpp.rl.ac.uk/utils/availchart/

      https://cms-site-readiness.web.cern.ch/cms-site-readiness/SiteReadiness/HTML/SiteReadinessReport.html#T1_UK_RAL

      http://hammercloud.cern.ch/hc/app/atlas/siteoverview/?site=RAL-LCG2&startTime=2020-01-29&endTime=2020-02-06&templateType=isGolden

    • 13:42
      Experiment Operational Issues
    • 4
      VO-Liaison ATLAS
      Speakers: James William Walder (Science and Technology Facilities Council STFC (GB)), Dr Tim Adye (Science and Technology Facilities Council STFC (GB))

      NTR

    • 5
      VO Liaison CMS
      Speaker: Katy Ellis (Science and Technology Facilities Council STFC (GB))

      I have been trying to fix the AAA service which has been causing SAM tests both directly for AAA and indirectly on the WN (xrootd-access) to fail over the last week or more. Also RALPP and other T2s in the UK were affected at times. I found the redirector based at RAL was full of logs. Chris and I cleaned up and restarted services. I also changed the throttling value on gw10 (not gw11) to try to allow more traffic, and hence give the tests a better chance of reaching our AAA. Restarts of all services from the proxies to the redirector (again) were then required. As of now, things are looking better, although we do see periods of passing tests, so it could just be that. I see no effect in the Vande monitoring of increased throughput from the changed throttling, so I don't know if that config is effective.

       

    • 6
      VO Liaison LHCb
      Speaker: Raja Nandakumar (Science and Technology Facilities Council STFC (GB))

      LHCb:

      1. Low number of running jobs at RAL
        • Jose aware and looking at it
      2. Streaming issue from ECHO ongoing
        • Development ongoing (? no news)
        • Looking at mitigation with Tom

      DUNE

      1. Normal operations
    • 7
      VO Liaison Others
    • 13:53
      Experiment Planning
    • 8
      Dune/protoDune
    • 9
      Euclid
    • 10
      SKA
    • 13:57
      AOB
    • 11
      Any other Business
      Speakers: Brian Davies (Lancaster University (GB)), Darren Moore (Science and Technology Facilities Council STFC (GB))