ATLAS UK Cloud Support

Europe/London
Zoom

Zoom

Tim Adye (Science and Technology Facilities Council STFC (GB)), James William Walder (Science and Technology Facilities Council STFC (GB))
Description

https://cern.zoom.us/j/98434450232

Password protected (same as (new) OPs Mtg)

● Outstanding tickets

    • 156204 TEAM atlas UKI-NORTHGRID-MAN-HEP less urgent NGI_UK in progress 2022-03-01 09:39:00 UKI-NORTHGRID-MAN-HEP deletion errors due to unexpected server error EGI

      • Now should be fixed; following up on HC test failure
    • 156179 TEAM atlas UKI-SCOTGRID-DURHAM less urgent NGI_UK assigned 2022-03-01 16:42:00 Stage-out timeouts at UKI-SCOTGRID-DURHAM_SL7_UCORE EGI

      • tmp directory cleaning might be removing files before transfer is completing. Being investigated
      • Still some cause to check consistency
    • 155881 TEAM atlas UKI-SCOTGRID-GLASGOW less urgent NGI_UK in progress 2022-03-01 09:40:00 Stage-in at UKI-SCOTGRID-GLASGOW_CEPH timeouts EGI

      • Unclear, seems intermittent
    • 154806 TEAM atlas UKI-LT2-QMUL less urgent NGI_UK in progress 2022-03-01 11:33:00 UKI-LT2-QMUL SOURCE (and DESTINATION) transfer failures EGI

      • Increase No. connections
      • Dune using gridFTP to do large transfers
    • 154543 TEAM atlas UKI-SCOTGRID-ECDF urgent NGI_UK in progress 2022-02-08 14:01:00 DPM storage ACL configuration EGI

      • Might not now break the cache, and be rather straightforward to do; but needs care and time to monitor
    • 154436 TEAM atlas RAL-LCG2 very urgent NGI_UK in progress 2022-03-02 11:47:00 RAL Echo Davs developments EGI

      • Enabled webdavs for writes; transfers generally ok, untill xrootd issues, which are increasingly common
      • moved back to gridFTP to reduce backlog
    • 153367 TEAM atlas RAL-LCG2 urgent NGI_UK on hold 2021-12-01 15:37:00 HTTPS on RAL CTA EGI

      • Castor -> CTA migration ongoing; some ACL settings are causing additional delays

● CPU

    • RAL

      • Largely ok; a drop yesterday as arc ce and condor got out of sync
    • Northgrid

      • Ok; Lancs not yet managed to consider changes to fairshare
    • London

      • NTR
    • SouthGrid

      • OX Some Xcache updates; Vip migrated to 5.4.1 in the meeting
    • Scotgrid

      • GLA Xcache restart
      • Durham stage-out timeouts; possibly due to scripts that clean up the space on the wns.

● Ongoing Items

  • TPC with http

    • RAL wedavs for writes not so succeful
    • GLA still looking
  • Storageless Site tests

    • OX; upgraded to 5.4.1
    • BHAM; continued working on VP
  • Storage migrations

    • How is the interaction of StorM
      • Is it Storm that resolves the URL / path, (and not )

 

 


● News round-table

  • Alessandra

    • NTR
  • Dan

    • NTR
  • Gerard

    • NTR
  • Stefen

  • Matt

    • NTR;
  • Peter

    • NTR
  • Sam

    • NTR
  • Vip

    • NTR

 

 

There are minutes attached to this event. Show them.
    • 10:00 10:20
      Status 20m
      • Outstanding tickets 10m
          • 156204 TEAM atlas UKI-NORTHGRID-MAN-HEP less urgent NGI_UK in progress 2022-03-01 09:39:00 UKI-NORTHGRID-MAN-HEP deletion errors due to unexpected server error EGI

            • Now should be fixed; following up on HC test failure
          • 156179 TEAM atlas UKI-SCOTGRID-DURHAM less urgent NGI_UK assigned 2022-03-01 16:42:00 Stage-out timeouts at UKI-SCOTGRID-DURHAM_SL7_UCORE EGI

            • tmp directory cleaning might be removing files before transfer is completing. Being investigated
            • Still some cause to check consistency
          • 155881 TEAM atlas UKI-SCOTGRID-GLASGOW less urgent NGI_UK in progress 2022-03-01 09:40:00 Stage-in at UKI-SCOTGRID-GLASGOW_CEPH timeouts EGI

            • Unclear, seems intermittent
          • 154806 TEAM atlas UKI-LT2-QMUL less urgent NGI_UK in progress 2022-03-01 11:33:00 UKI-LT2-QMUL SOURCE (and DESTINATION) transfer failures EGI

            • Increase No. connections
            • Dune using gridFTP to do large transfers
          • 154543 TEAM atlas UKI-SCOTGRID-ECDF urgent NGI_UK in progress 2022-02-08 14:01:00 DPM storage ACL configuration EGI

            • Might not now break the cache, and be rather straightforward to do; but needs care and time to monitor
          • 154436 TEAM atlas RAL-LCG2 very urgent NGI_UK in progress 2022-03-02 11:47:00 RAL Echo Davs developments EGI

            • Enabled webdavs for writes; transfers generally ok, untill xrootd issues, which are increasingly common
            • moved back to gridFTP to reduce backlog
          • 153367 TEAM atlas RAL-LCG2 urgent NGI_UK on hold 2021-12-01 15:37:00 HTTPS on RAL CTA EGI

            • Castor -> CTA migration ongoing; some ACL settings are causing additional delays
      • CPU 5m

        New link for the site-oriented dashboard

          • RAL

            • Largely ok; a drop yesterday as arc ce and condor got out of sync
          • Northgrid

            • Ok; Lancs not yet managed to consider changes to fairshare
          • London

            • NTR
          • SouthGrid

            • OX Some Xcache updates; Vip migrated to 5.4.1 in the meeting
          • Scotgrid

            • GLA Xcache restart
            • Durham stage-out timeouts; possibly due to scripts that clean up the space on the wns.
      • Other new issues / tasks 5m
    • 10:20 10:40
      Ongoing Items 20m
      • TPC with http

        • RAL wedavs for writes not so succeful
        • GLA still looking
      • Storageless Site tests

        • OX; upgraded to 5.4.1
        • BHAM; continued working on VP
      • Storage migrations

        • How is the interaction of StorM
          • Is it Storm that resolves the URL / path, (and not )

       

       

    • 10:40 10:50
      News round-table 10m
      • Alessandra

        • NTR
      • Dan

        • NTR
      • Gerard

        • NTR
      • Stefen

      • Matt

        • NTR;
      • Peter

        • NTR
      • Sam

        • NTR
      • Vip

        • NTR

       

       

    • 10:50 11:00