RAL Tier1 Experiments Liaison Meeting

Europe/London
Access Grid (RAL R89)

Access Grid

RAL R89

    • 13:38 13:39
      Major Incidents Changes 1m
    • 13:39 13:40
      Summary of Operational Status and Issues 1m
      Speakers: Brian Davies (Lancaster University (GB)), Darren Moore (Science and Technology Facilities Council STFC (GB))

      Kernel and errata reboot campaign ongoing

      SL6to SL7 migration ongoing

      Migration campaign from nagger and ganglia ongoing

    • 13:40 13:41
      GGUS /RT Tickets 1m

      https://tinyurl.com/T1-GGUS-Open
      https://tinyurl.com/T1-GGUS-Closed

    • 13:41 13:42
      Site Availability 1m

      https://lcgwww.gridpp.rl.ac.uk/utils/availchart/

      https://cms-site-readiness.web.cern.ch/cms-site-readiness/SiteReadiness/HTML/SiteReadinessReport.html#T1_UK_RAL

      http://hammercloud.cern.ch/hc/app/atlas/siteoverview/?site=RAL-LCG2&startTime=2020-01-29&endTime=2020-02-06&templateType=isGolden

    • 13:42 13:43
      Experiment Operational Issues 1m
    • 13:44 13:45
      VO-Liaison ATLAS 1m
      Speakers: James William Walder (Science and Technology Facilities Council STFC (GB)), Dr Tim Adye (Science and Technology Facilities Council STFC (GB))

      May:   181k vs 156k (pledged)
      CPU wall time ratio 56%

      TPC: running in stress-test and on correct (test)gateway.
      Transfers from RAL working well; Transfers to RAL not working (previously had)
       

      Analysis jobs now set to use DirectIO; will collect some stats and report back.

        

    • 13:46 13:47
      VO Liaison CMS 1m
      Speaker: Katy Ellis (Science and Technology Facilities Council STFC (GB))
    • 13:48 13:49
      VO Liaison LHCb 1m
      Speaker: Raja Nandakumar (Science and Technology Facilities Council STFC (GB))

      LHCb:

      1. Singularity issue of last week solved after James Adams re-enabled namespaces
      2. ECHO streaming testing
        • Some issues with the LHCb test system monitoring
        • Verified that XRD_STREAMTIMEOUT variable has no effect on error rate
        • Next : move to using mrmory proxy on the test WNs
          • First run for a day ot two with the XRD_STREAMTIMEOUT variable reverted to get a baseline failure rate again and be sure nothing changed.
          • Then switch to memory proxy
        • LHCb week next week - national computing board discussion on this is probable
      3. Some reduction in running jobs at RAL today. Trying to understand if it is an LHCb effect.

      DUNE

      1. User space to be enabled some time this week.
      2. Country-wide fraction of pledged resources available with Andrew McNab.
        • Part of a much bigger document?
        • Will forward when made available
      3. Low aactivity from DUNE
        • No production jobs. A few user jobs
        • Clear indication that even at low load fewer jobs are coming to RAL vs other sites.
          • Investigating again.

      Other stuff

      • Following up on arc-v6 technical meeting. RAL-PPD already running arc-v6.
    • 13:52 13:53
      VO Liaison Others 1m
    • 13:53 13:54
      Experiment Planning 1m
    • 13:54 13:55
      Dune/protoDune 1m
    • 13:55 13:56
      Euclid 1m
    • 13:56 13:57
      SKA 1m
    • 13:57 13:58
      AOB 1m
    • 13:58 13:59
      Any other Business 1m
      Speakers: Brian Davies (Lancaster University (GB)), Darren Moore (Science and Technology Facilities Council STFC (GB))