ATLAS UK Cloud Support

Europe/London
Zoom

Zoom

James William Walder (Science and Technology Facilities Council STFC (GB)), Jyoti Prakash Biswal (Rutherford Appleton Laboratory)
Description

https://cern.zoom.us/j/98434450232

Password protected (same as (new) OPs Mtg)

There are minutes attached to this event. Show them.
    • 10:00 10:20
    • 10:20 10:40
      Ongoing Items 20m
      • New Issue(s) 20m
        • Partial network issue between RAL-LCG2 and KIT-FZK:

          • GGUS-164115 --> solved.
          • FZK added new ipv6 subnet of RAL to their edge routers, and things started to improve on 14 November.
        • RAL:

          • Lack of jobs since the past two days or so. Above the pledge starting early morning today.
          • There were several HammerCloud exclusions for RAL recently.
            • Echo/XRootD-related (21XMA gen); for short time periods; none since 14 November.
            • The number of failed jobs has been more than 50% of the number of finished jobs lately.
        • News on SLURM/EL8 @LANCS.

        • LANCS HammerCloud exclusions are more frequent lately.

          • File transfer timed out during stage-out: / Failed to stage-out file:
        • ECDFs HammerCloud failures are now back!

        • RALPP HammerCloud failures at the 1013 template.

        • DC24 updates.

        • Discussions on Nuclei:
          - A table per site with WLCG reliability, Total disk size, Connectivity (as per Cedric’s script), Current status (Nucleus/not Nucleus) will be prepared. Based on this we can come up with a formula for definition of a Nucleus.
          - Connectivity source is not available. To be looked at.
          - Already have a table, Fabio will move it to google sheets and share it.
          - We need to find a metric that would point us to the optimal number of nuclei. This would define the minimal requirements for a nucleus.

        • Reminder: quarterly report deadline -- 20 November.

        • Pending: Storage Resource Reporting (SRR) checks.

      • Storageless Site tests/Decommissioning 20m

        MAN

      • Localgroupdisk cleanup and migrations 20m
      • Regular file dumps 20m
    • 10:40 10:50
      News round-table 10m
    • 10:50 11:00