ATLAS UK Cloud Support

Europe/London
Zoom

Zoom

Tim Adye (Science and Technology Facilities Council STFC (GB)), James William Walder (Science and Technology Facilities Council STFC (GB))
Description

https://cern.zoom.us/j/98434450232

Password protected (same as (new) OPs Mtg)

Videoconference
ATLAS UK Cloud Support
Zoom Meeting ID
98434450232
Host
James William Walder
Useful links
Join via phone
Zoom URL

● Outstanding tickets

    • 156050 TEAM atlas UKI-NORTHGRID-LANCS-HEP urgent NGI_UK in progress 2022-02-23 09:56:00 Production failures due to LRMS error

      • vmem (old recommendation is 3*2GiB) currently at 5GB,
    • 155881 TEAM atlas UKI-SCOTGRID-GLASGOW less urgent NGI_UK reopened 2022-02-22 16:31:00 Stage-in at UKI-SCOTGRID-GLASGOW_CEPH timeouts EGI

      • timouts in staging
    • 155430 TEAM atlas UKI-SCOTGRID-ECDF less urgent NGI_UK in progress 2022-02-23 14:25:00 UKI-SCOTGRID-ECDF transfer and deletion errors EGI

      • Should be resolved soon
    • 154806 TEAM atlas UKI-LT2-QMUL less urgent NGI_UK in progress 2022-02-21 10:36:00 UKI-LT2-QMUL SOURCE (and DESTINATION) transfer failures EGI

      • Ongoing
    • 154543 TEAM atlas UKI-SCOTGRID-ECDF urgent NGI_UK in progress 2022-02-08 14:01:00 DPM storage ACL configuration EGI

      • Doing it the obvious way breaks how Xcache/DPM works at the site; under investigation
    • 154436 TEAM atlas RAL-LCG2 very urgent NGI_UK in progress 2022-02-03 16:46:00 RAL Echo Davs developments EGI

      • Ongoing work, writes with davs now a prioirty
    • 153367 TEAM atlas RAL-LCG2 urgent NGI_UK on hold 2021-12-01 15:37:00 HTTPS on RAL CTA EGI

      • Likely to now be resolved with production traffic / Tape challenge

● CPU

    • RAL

      • NTR
    • Northgrid

      • LANCS scheduling discussion and how ATLAS priority can be
        • Lancs sees 8-core LHCb jobs (same as Oxford)
          • Do jobs from LHCb understand how long they have available?
        • To investigate job shares
    • London

      • QMUL some draining compute racks,
    • SouthGrid

      • OX; some WNs down from storm effect
    • Scotgrid

      • some blips
  •  


 


● Ongoing Items

  • TPC with http

    • Glasgow progressing with xrootd updates
    • RAL to test writes with davs once migriation of Castor underway
  • Storageless Site test

    • BHAM following up with VP
  • LANCS Storage migration

    • In progress looking at Posix file access

 

 


● News round-table


  - Alessandra
      - NTR

  - Dan
      - NTR

  - Gerard
      - NTR

  - Gordon 
      - NTR

  - Matt
      - NTR

  - Peter
      - NTR

  - Sam
      - NTR

  - Vip
      - NTR

  - Steven
      - NTR

There are minutes attached to this event. Show them.
    • 10:00 10:20
      Status 20m
      • Outstanding tickets 10m
          • 156050 TEAM atlas UKI-NORTHGRID-LANCS-HEP urgent NGI_UK in progress 2022-02-23 09:56:00 Production failures due to LRMS error

            • vmem (old recommendation is 3*2GiB) currently at 5GB,
          • 155881 TEAM atlas UKI-SCOTGRID-GLASGOW less urgent NGI_UK reopened 2022-02-22 16:31:00 Stage-in at UKI-SCOTGRID-GLASGOW_CEPH timeouts EGI

            • timouts in staging
          • 155430 TEAM atlas UKI-SCOTGRID-ECDF less urgent NGI_UK in progress 2022-02-23 14:25:00 UKI-SCOTGRID-ECDF transfer and deletion errors EGI

            • Should be resolved soon
          • 154806 TEAM atlas UKI-LT2-QMUL less urgent NGI_UK in progress 2022-02-21 10:36:00 UKI-LT2-QMUL SOURCE (and DESTINATION) transfer failures EGI

            • Ongoing
          • 154543 TEAM atlas UKI-SCOTGRID-ECDF urgent NGI_UK in progress 2022-02-08 14:01:00 DPM storage ACL configuration EGI

            • Doing it the obvious way breaks how Xcache/DPM works at the site; under investigation
          • 154436 TEAM atlas RAL-LCG2 very urgent NGI_UK in progress 2022-02-03 16:46:00 RAL Echo Davs developments EGI

            • Ongoing work, writes with davs now a prioirty
          • 153367 TEAM atlas RAL-LCG2 urgent NGI_UK on hold 2021-12-01 15:37:00 HTTPS on RAL CTA EGI

            • Likely to now be resolved with production traffic / Tape challenge
      • CPU 5m

        New link for the site-oriented dashboard

          • RAL

            • NTR
          • Northgrid

            • LANCS scheduling discussion and how ATLAS priority can be
              • Lancs sees 8-core LHCb jobs (same as Oxford)
                • Do jobs from LHCb understand how long they have available?
              • To investigate job shares
          • London

            • QMUL some draining compute racks,
          • SouthGrid

            • OX; some WNs down from storm effect
          • Scotgrid

            • some blips
        •  


         

      • Other new issues / tasks 5m
    • 10:20 10:40
      Ongoing Items 20m
      • TPC with http

        • Glasgow progressing with xrootd updates
        • RAL to test writes with davs once migriation of Castor underway
      • Storageless Site test

        • BHAM following up with VP
      • LANCS Storage migration

        • In progress looking at Posix file access

       

       

    • 10:40 10:50
      News round-table 10m


        - Alessandra
            - NTR

        - Dan
            - NTR

        - Gerard
            - NTR

        - Gordon 
            - NTR

        - Matt
            - NTR

        - Peter
            - NTR

        - Sam
            - NTR

        - Vip
            - NTR

        - Steven
            - NTR

    • 10:50 11:00