ATLAS UK Cloud Support

Europe/London
Zoom

Tim Adye (Science and Technology Facilities Council STFC (GB)), James William Walder (Science and Technology Facilities Council STFC (GB))
Description

https://cern.zoom.us/j/98434450232

Password protected (same password as the new Ops meeting)

Videoconference: ATLAS UK Cloud Support
Zoom Meeting ID: 98434450232
Host: James William Walder

● Outstanding tickets

  • 151141 | UKI-NORTHGRID-LANCS-HEP | less urgent | on hold | 2021-04-01 08:24:00 | UKI-NORTHGRID-LANCS-HEP deletion errors
    • On hold, waiting for Matt to return
  • 151098 | RAL-LCG2 | urgent | in progress | 2021-04-07 15:14:00 | High failure rate at RAL-LCG2_TEST
    • Problem likely understood: caused by the docker config and the pilot killing orphaned processes.
      • An environment variable exists that should mitigate this effect.
  • 146651 | RAL-LCG2 | urgent | on hold | 2021-04-07 13:04:00 | singularity and user NS setup at RAL
    • Remains on hold; ticket recently updated

 


● CPU

  • RAL

    • No issues; it is still to be understood whether corepower values are propagated through.
  • Northgrid

    • MAN downtime due to loss of core router (BHAM affected)
  • London

    • No issues
  • SouthGrid

    • No issues; however, Susx remains in test
  • Scotgrid

    • No issues

● Ongoing Items

  • CentOS7 - Sussex
    • Problems remain; to follow up with Patrick.
  • TPC with http
    • RAL moved to new TPC test endpoint machine; problems remain.
    • Memory leaks in xrootd 5.1.1; fixes are aimed for 5.1.2
  • Storageless Site test / storage decommissioning (Oxford)
    • Some urgency now as hardware is brittle
  • Glasgow DPM Decommissioning
    • Appears to have had some progress behind the scenes.
    • GLA would still want to have a downtime, to coordinate with Dimitrios
  • ATLAS: Site Availability/Reliability reports: Glasgow
    • New SNOW ticket opened, to understand current situation

● News round-table

  • Vip
    • Need to move the data soon from OX
    • JW to contact Dimitrios.
  • Dan
    • Sorted (closed) GGUS for file access problems.
  • Peter
    • Noted the lack of issues over the break
  • Sam
    • NTR
  • Gareth
    • Final ATLAS meeting; moves to a new role tomorrow.
    • Will still be supporting Scotgrid (indirectly)
    • All wished best of luck.
  • JW
    • NTR
  • Emanuelle
    • NTR
    • 10:00 10:20
      Status 20m
      • Outstanding tickets 10m
      • CPU 5m

        New link for the site-oriented dashboard

      • Other new issues / tasks 5m
    • 10:20 10:40
      Ongoing Items 20m
    • 10:40 10:50
      News round-table 10m
    • 10:50 11:00
      AOB 10m