ADC Weekly

Europe/Zurich
3162/2-E01 (CERN)

3162/2-E01

CERN

20
Show room on map
I Ueda (Department of Particle Physics-University of Tokyo)
    • 15:40 16:15
      Hot topics
      • 15:40
        SW Installation system status 5m
        The releases tagging is now done by the new one with panda jobs since June 11.
        mail to atlas-adc-operations
      • 15:45
        reproducing lost files 5m
        Speakers: David Cameron (University of Oslo (NO)), Sasha Vanyashin (Argonne National Laboratory (US))
        Slides
      • 15:55
        T1_datadisk 5m
        Speaker: Tomas Kouba (Acad. of Sciences of the Czech Rep. (CZ))
        Slides
      • 16:05
        Storage Area Automatic Blacklisting (SAAB) 5m
        Speaker: Dr Salvatore Tupputi (Universita e INFN (IT))
        Slides
        SAAB
        • to be activated tomorrow with an elog and a link to the document
        • SSB major views to be updated to include the SAAB column
        • 'SAAB actions' log file to be prepared (similar to panda 'incidents' page)
          • the per-site history view does not fulfill the request
    • 16:15 16:25
      rucio naming convention for Panda jobs 10m
      Speaker: Stephane Jezequel (Centre National de la Recherche Scientifique (FR))
      Slides
    • 16:25 16:40
      AMOD report 15m
    • 16:40 16:50
      Shifters procedures on issues related to FAX 10m
      Speaker: Robert William Gardner Jr (University of Chicago (US))
      more information
      • A.DiGirolamo, R.Gardner: discussions on the procedures for the shifters on-going
      • R.Gardner: activation of 'allowfax' for the US and some UK sites should have negligible impact. only small scale tests is planned that should not pull attention of shifters. no plan such as to set data server down and trigger a large scale failover to FAX.
      • What would happen if a site storage goes down (not by intention as mentioned)? Then, with 'allowfax', jobs after the first failures (usually 2, set on schedconfig) will start accessing files via FAX. FAX will try to get another copy of that file first within the cloud if they are available, and if not, outside the cloud, again if available, only on sites which participate to FAX.
        If the storage is not completely available (i.e. not even for writing), then the jobs will fail while trying to upload the output, but if the storage had just a glitch, then the jobs will be successful.
         
    • 16:50 16:55
      AOB 5m
      • network reports (if any)
        - for T2(D)s against T1s - for T1s against T2Ds
      • Analysis Availability Reports
      • Draft reccomendation for T2 space reservation