WLCG-OSG-EGEE Operations meeting

Europe/Zurich
28-R-15 (CERN conferencing service (joining details below))

Nick Thackray
Description
grid-operations-meeting@cern.ch
Weekly OSG, EGEE, WLCG infrastructure coordination meeting.
We discuss the weekly running of the production grid infrastructure based on weekly reports from the attendees. The reported issues are discussed, assigned to the relevant teams, followed up, and escalated when needed. The meeting is also the forum for sites to get a summary of the weekly WLCG activities and plans.
Attendees:
  • OSG operations team
  • EGEE operations team
  • EGEE ROC managers
  • WLCG coordination representatives
  • WLCG Tier-1 representatives
  • other site representatives (optional)
  • GGUS representatives
  • VO representatives
To dial in to the conference:
    a. Dial +41227676000
    b. Enter access code 0148141

    OR click HERE
    (Please specify your name & affiliation in the web-interface)

    Click here for minutes of all meetings

    Click here for the List of Actions

    Recording of the meeting
      • 16:00 16:00
        Feedback on last meeting's minutes
      • 16:01 16:30
        EGEE Items 29m
        • Grid-Operator-on-Duty handover
          From: Russia and RAL
          To: Central Europe and Taiwan


          Report from Russia:
          • opened: 27
          • closed: 29
          • 2nd mail: 9
          • extended: 36
          total: 101
          1. GGUS:40700 on BEIJING-CNIC-LCG2-IA64 (CERN-ROC): APEL problem not solved yet; case opened on Sept. 10th. Site is not suspended yet.

          Report from RAL: Tickets escalated to Operations Meeting
          1. GGUS:42958 MK-01-UKIM-II, ROC SE
          2. GGUS:42124 WEIZMANN-LCG2, ROC SE
          3. GGUS:43001 BEgrid-KULeuven, ROC North
          4. GGUS:42469 BEgrid-UGent, ROC North
          5. GGUS:42015 ITPA-LCG2 is not suspended, ROC North. See action item.
        • PPS Report & Issues
          Please find Issues from EGEE ROCs and general info in:

          https://twiki.cern.ch/twiki/bin/view/LCG/OpsMeetingPps
        • gLite Release News
        • EGEE issues coming from ROC reports
          • None.
      • 16:30 17:00
        WLCG Items 30m
        • WLCG issues coming from ROC reports
          1. None
        • Service type change and SAM LFC Tests
          https://savannah.cern.ch/task/?7108

          On Wednesday, when Gilles implements the new, sanitized GOCDB service names, SAM will submit LFC_LOCAL and LFC_CENTRAL tests instead of the current LFC tests. The only differences are that an LFC-ping test has been added and that the tests will no longer attempt to write to read-only LFCs. An EGEE broadcast will be sent after the meeting.

        • WLCG Service Interventions (with dates / times where known)
          Link to CIC Portal (broadcasts/news), scheduled downtimes (GOCDB) and CERN IT Status Board

          Many interventions scheduled this week. Please consult the URLs above for details.

          Time at WLCG T0 and T1 sites.

        • WLCG Operational Review
          Speaker: Harry Renshall / Jamie Shiers
        • Alice report
          1. Item
        • Atlas report
          1. Item
        • CMS report
          1. Item
          Speaker: Daniele Bonacorsi
        • LHCb report
          1. Some modifications to the critical SAM tests for LHCb
            New critical tests have been set to test the infrastructure from the LHCb perspective: JS Shared Area and CSH tests from ops. This gives a view of the site's health status that is disentangled from DIRAC, and guarantees that SAM test results are always available for the CE (which for LHCb have lately suffered from several internal problems that prevented the CE from being properly rated).
            In addition, the condition DB test (valid only for T1s) has been set critical after a month of testing.
            For the SE, two more LHCb-specific tests are also coming soon (they are in a test phase before becoming critical).

          2. Sites are warned that LHCb is going to broadcast the request for the pilot role, as discussed at the GDB and the last workshop.

          3. A non-critical GGUS ticket submitted last week against GridKA was escalated today at the daily ops meeting. From the answer, it looks like streams to GridKA have been off for half a year now. Some words from GridKA are welcome.
        • Storage services: Recommended base versions
          The recommended baseline versions for the storage solutions can be found here: https://twiki.cern.ch/twiki/bin/view/LCG/GSSDCCRCBaseVersions

        • Storage services: this week's updates
      • 17:00 17:30
        OSG Items 30m
        Speaker: Rob Quick (OSG - Indiana University)
        • Discussion of open tickets for OSG
      • 17:30 17:35
        Review of action items 5m
      • 17:35 17:35
        AOB