ATLAS UK Cloud Support

Europe/London
Zoom

Tim Adye (Science and Technology Facilities Council STFC (GB)), James William Walder (Science and Technology Facilities Council STFC (GB))
Description

https://cern.zoom.us/j/98434450232

Password protected (same password as the new Ops meeting)

Zoom Meeting ID: 98434450232
Host: James William Walder

● Outstanding tickets

  • 155889 TEAM atlas UKI-NORTHGRID-LANCS-HEP less urgent NGI_UK in progress 2022-02-08 17:39:00 UK UKI-NORTHGRID-LANCS-HEP: Huge transfer failures as destination EGI

    • Probably caused by issues in the XrootD XrdMacaroons code, exposed by the stricter code checking on RH8
  • 155856 TEAM atlas RAL-LCG2 less urgent NGI_UK in progress 2022-02-04 09:09:00 RAL-LCG2: deletion errors EGI

    • Now closed; the problems have not recurred
  • 155430 TEAM atlas UKI-SCOTGRID-ECDF less urgent NGI_UK in progress 2022-02-09 22:45:00 UKI-SCOTGRID-ECDF transfer and deletion errors EGI

    • To be followed up; still assuming that the lost data needs to be processed.
  • 155141 TEAM atlas UKI-LT2-Brunel less urgent NGI_UK in progress 2022-02-08 12:38:00 Transfers from UKI-LT2-Brunel fail with “Internal Server Error” EGI

    • Data declared lost; much less was registered in Rucio than appears in the file list.
  • 154806 TEAM atlas UKI-LT2-QMUL less urgent NGI_UK in progress 2022-02-07 06:07:00 UKI-LT2-QMUL SOURCE transfer failures EGI

    • Restarting webdav once a day; situation looking much improved.
  • 154543 TEAM atlas UKI-SCOTGRID-ECDF urgent NGI_UK in progress 2022-02-08 14:01:00 DPM storage ACL configuration EGI

    • Site updated ticket
  • 154436 TEAM atlas RAL-LCG2 very urgent NGI_UK in progress 2022-02-03 16:46:00 RAL Echo Davs developments EGI

    • In progress; additional VOs are using the endpoints now
  • 153367 TEAM atlas RAL-LCG2 urgent NGI_UK on hold 2021-12-01 15:37:00 HTTPS on RAL CTA EGI

    • CTA migration expected in the week of Feb 28th

● CPU

    • RAL

      • Looking good; may be underreporting its corepower to ATLAS
    • Northgrid

      • generally fine
    • London

      • QMUL looking better with the daily webdav restarts
    • SouthGrid

      • OK
    • Scotgrid

      • Durham: appears to have had problems with its CEs; better after the restarts
      • Glasgow: XCache blip and stage-in problems; working on the XRootD tuning

● Ongoing Items

  • TPC with http

    • Continuing. Plan for Glasgow to be decided.
  • Storageless Site test (Oxford)

    • Switched to prefetch 1 at 1100 today
  • LANCS Storage migration

    • Apparent problem with xrootd/libmacaroon on CentOS 8
    • Issue submitted on GitHub for the developers
    • Best current solution is to use CentOS 7 for the xrootd box

● News round-table

  • Alessandra

    • NTR
  • Dan

    • Total of 7 PB (up from ~3.5 PB) now deployed
  • Gerard

    • NTR
  • Matt

    • NTR
  • Gordon

    • NTR
  • Peter

    • NTR
  • Sam

    • NTR
  • Stephen

    • NTR
  • Vip

    • NTR

● AOB

  • HS06 talk from S&C week: try to get correct Corepowers by next Q1

    • 10:00 AM 10:20 AM
      Status 20m
      • Outstanding tickets 10m

      • CPU 5m

        New link for the site-oriented dashboard

      • Other new issues / tasks 5m
    • 10:20 AM 10:40 AM
      Ongoing Items 20m
    • 10:40 AM 10:50 AM
      News round-table 10m
    • 10:50 AM 11:00 AM
      AOB 10m