ATLAS UK Cloud Support

Europe/London
Zoom

Zoom

Tim Adye (Science and Technology Facilities Council STFC (GB)), James William Walder (Science and Technology Facilities Council STFC (GB))
Description

https://cern.zoom.us/j/98434450232

Password protected (same as (new) OPs Mtg)

Videoconference
ATLAS UK Cloud Support
Zoom Meeting ID
98434450232
Host
James William Walder
Useful links
Join via phone
Zoom URL

● Outstanding tickets

  • 154605 USER atlas UKI-SCOTGRID-ECDF less urgent NGI_UK in progress 2021-11-02 10:48:00 Failovers from UKI-SCOTGRID-ECDF to DESY CVMFS startum 1 EGI

      • Cloud resources; under investigation
    • 154543 TEAM atlas UKI-SCOTGRID-ECDF urgent NGI_UK in progress 2021-11-03 17:51:00 DPM storage ACL configuration EGI

      • needs a response
    • 154436 TEAM atlas RAL-LCG2 very urgent NGI_UK in progress 2021-11-04 09:37:00 slow transfers CERN - RAL EGI

      • Transfers are ok, but ticket should stay open for davs improvement tests
    • 154200 TEAM atlas RAL-LCG2 less urgent NGI_UK on hold 2021-11-04 09:34:00 RAL-LCG2 deletion issues with error “The requested service is not available at the moment” EGI

      • Needs to push with PR
    • 153367 TEAM atlas RAL-LCG2 urgent NGI_UK in progress 2021-10-20 09:54:00 HTTPS on RAL CTA EGI

      • Awaiting further tests

● CPU

  • CPU

    • RAL

      • Generally ok
    • Northgrid

      • ntr
    • London

      • QMUL Still flapping a bit for HC tests, might be CPU limitations for davs; will be load balancing at somepoint
      • Brunel has bad disk
    • Scotgrid

      • Occasional issue with XrootD instances; needs a restart of both the redirector and server


 


● Other new issues / tasks

    • GPUs
      • GPU scheduling for ATLAS can be trickly, and to get new users
    • SRR
      • RALPP will be upgrading dCache shortly
    • BRUNEL
      • Awaiting hardware to fix disk server

 

 


● Ongoing Items

  • CentOS7 - Sussex

  • TPC with http

    • RAL and Glasgow still to have full production with davs; activily followed
  • Storageless Site test (Oxford)

    • Short time without Xcache

● News round-table

  • Alessandra

    • NTR
  • Dan

    • NTR
  • Gerard

  • Matt

    • NTR
  • Patrick

    • NTR
  • Peter

    • NTR;
    • Singularity from the wrapper to be tested
  • Sam

    • Interested in the Storm setup config

 

 

There are minutes attached to this event. Show them.
    • 10:00 10:20
      Status 20m
      • Outstanding tickets 10m
        • Outstanding tickets

          • 154605 USER atlas UKI-SCOTGRID-ECDF less urgent NGI_UK in progress 2021-11-02 10:48:00 Failovers from UKI-SCOTGRID-ECDF to DESY CVMFS startum 1 EGI

            • Cloud resources; under investigation
          • 154543 TEAM atlas UKI-SCOTGRID-ECDF urgent NGI_UK in progress 2021-11-03 17:51:00 DPM storage ACL configuration EGI

            • needs a response
          • 154436 TEAM atlas RAL-LCG2 very urgent NGI_UK in progress 2021-11-04 09:37:00 slow transfers CERN - RAL EGI

            • Transfers are ok, but ticket should stay open for davs improvement tests
          • 154200 TEAM atlas RAL-LCG2 less urgent NGI_UK on hold 2021-11-04 09:34:00 RAL-LCG2 deletion issues with error “The requested service is not available at the moment” EGI

            • Needs to push with PR
          • 153367 TEAM atlas RAL-LCG2 urgent NGI_UK in progress 2021-10-20 09:54:00 HTTPS on RAL CTA EGI

            • Awaiting further tests
      • CPU 5m

        New link for the site-oriented dashboard

        • CPU

          • RAL

            • Generally ok
          • Northgrid

            • ntr
          • London

            • QMUL Still flapping a bit for HC tests, might be CPU limitations for davs; will be load balancing at somepoint
            • Brunel has bad disk
          • Scotgrid

            • Occasional issue with XrootD instances; needs a restart of both the redirector and server


         

      • Other new issues / tasks 5m

        UKI-LT2-BRUNEL_DATADISK (offline, needing new backplane)

        Renabling GPU queue for QMUL

          • GPUs
            • GPU scheduling for ATLAS can be trickly, and to get new users
          • SRR
            • RALPP will be upgrading dCache shortly
          • BRUNEL
            • Awaiting hardware to fix disk server

         

         

    • 10:20 10:40
      Ongoing Items 20m
      • CentOS7 - Sussex

      • TPC with http

        • RAL and Glasgow still to have full production with davs; activily followed
      • Storageless Site test (Oxford)

        • Short time without Xcache
    • 10:40 10:50
      News round-table 10m
      • Alessandra

        • NTR
      • Dan

        • NTR
      • Gerard

      • Matt

        • NTR
      • Patrick

        • NTR
      • Peter

        • NTR;
        • Singularity from the wrapper to be tested
      • Sam

        • Interested in the Storm setup config

       

       

    • 10:50 11:00