Tier1 Service Coordination Meeting / Call

Europe/Zurich
513 R-068 (CERN)

Maria Girone (CERN)
Description

To join the call, do one of the following:

  • Dial +41227676000 (Main) and enter access code 0119168, or
  • To have the system call you, click here.

Mailing list: wlcg-service-coordination@cern.ch

Time at WLCG Tier1 sites

Minutes
    • 15:30 - 15:35
      Minutes of last meeting and matters arising 5m
    • 15:35 - 15:45
      Service Interventions during LHC Technical Stop 10m
      • PIC
        I will not be able to connect to the next T1SCM, so this is just a reminder that at PIC we are planning a Scheduled Intervention next Tuesday, 20th July, during the LHC technical stop. Several interventions are planned, most of them firmware and OS upgrades affecting the Storage, Computing and Oracle (3D, FTS, LFC) services. The whole site will therefore be declared in scheduled downtime (SD) from 6 am until 6 pm on that day. Batch queues and FTS channels will be drained accordingly.

        Gonzalo

      • RAL
        Monday - Thursday 19-22 July. Site at Risk for transformer work (TBC)
        • Monday 19th July (08:00-14:00 UTC) - Outage on tape system for swap to spare controller.
        • Tuesday 20th July (07:00-13:00 UTC) - Outage on tape system for microcode update on tape robots.
        The transformer work is looking increasingly unlikely but remains scheduled for now.

        Not in the GOC DB yet - but proposed:

        • Tuesday 20th July. At Risk on ATLAS 3D (ogma) for SAN multipath configuration update.
        • Wednesday 21st July. At Risk on LHCb 3D/FTS (lugh) for SAN multipath configuration update.
        • Thursday 22nd July. At Risk on LFC/FTS (somnus) for SAN multipath configuration update.
    • 15:45 - 15:55
      Status of open GGUS tickets 10m
      Speaker: Maria Dimou (CERN)
      Slides
      • ATLAS ongoing issues
    • 15:55 - 16:10
      Review of recent / open SIRs 15m
      Speaker: Dr Jamie Shiers (CERN)
      • Partial failure of DNS for .de (affected GGUS)
        SIR
      • RAL data loss
        SIR
      • CASTOR + SRM service degradation due to logging issues
        SIR
    • 16:10 - 16:20
      Actions from WLCG Collaboration workshop 10m
      Speaker: Dr Jamie Shiers (CERN)
      Agenda
      • SIR template & MoU terminology for service degradations / interruptions
        The categories in the MoU are:
        • Service interruption
        • Degradation of the capacity of the service by more than 50%
        • Degradation of the capacity of the service by more than 20%
        CERN IT-FIO Incident Template
        RAL Incident Template
        WLCG MoU
      • Squid / FroNTier as WLCG services
      • Monitoring enhancements
        Slides
    • 16:20 - 16:30
      Deployment / Rollout Issues 10m
      • glexec
        Speaker: Maarten Litmaath (CERN)
      • WLCG Release Planning
        Speaker: Maria Alandes Pradillo (Unknown)
        Document
    • 16:30 - 16:35
      Data Management & Other Tier1 Service Issues 5m
      Speaker: Dr Andrea Sciabà (CERN)
      • FTS logs 5m
        Speaker: Dr Flavia Donno (CERN)
        Slides
    • 16:35 - 16:45
      Conditions Data Access and related services 10m
      FroNTier, CORAL server, ...
      Speakers: Dr Andrea Valassi (CERN), Dr Flavia Donno (CERN)
    • 16:45 - 17:05
      Experiment Database Service Issues 20m
    • 17:05 - 17:10
      AOB 5m