WLCG-OSG-EGEE Operations meeting

Europe/Zurich
28-R-15 (CERN conferencing service (joining details below))

Nick Thackray
Description
grid-operations-meeting@cern.ch
Weekly OSG, EGEE, WLCG infrastructure coordination meeting.
We discuss the weekly running of the production grid infrastructure based on weekly reports from the attendees. The reported issues are discussed, assigned to the relevant teams, followed up and escalated when needed. The meeting is also the forum for the sites to get a summary of the weekly WLCG activities and plans.
Attendees:
  • OSG operations team
  • EGEE operations team
  • EGEE ROC managers
  • WLCG coordination representatives
  • WLCG Tier-1 representatives
  • other site representatives (optional)
  • GGUS representatives
  • VO representatives
To dial in to the conference:
    a. Dial +41227676000
    b. Enter access code 0148141

    OR click HERE
    (Please specify your name & affiliation in the web-interface)

    Click here for minutes of all meetings

    Click here for the List of Actions

      • 16:01 16:30
        EGEE Items 29m
        • Grid-Operator-on-Duty handover
          From: CH-DE and CERN
          To: France and Italy


          Report from CERN: Due to the network intervention at CERN on March 19th, a number of alarms were raised because either lxdpm104.cern.ch (the central SE for SAM) or lcg-bdii.cern.ch (the top-level BDII) was unavailable. These alarms were switched off without raising tickets.
          Report from CH-DE:
          General information: Dear C-COD, the Handover log has been reviewed: https://cic.gridops.org/index.php?section=roc&page=dashboard&subpage=handover_new This tool now allows you to send a message to a Regional Team. The message will be visible in the summary on the right, only in your scope and in the scope of the Team contacted. If you notice a bug or have any feedback, please don't hesitate to let us know.
        • PPS Report & Issues
          Please find Issues from EGEE ROCs and general info in: https://twiki.cern.ch/twiki/bin/view/LCG/OpsMeetingPps

        • gLite Release News
          Please find gLite release news in: https://twiki.cern.ch/twiki/bin/view/LCG/OpsMeetingGliteReleases

          HIGHLIGHT: gLite 3.2 Update 01 was released to production.
          This is the first release of the 3.2 baseline and contains:

          • WNs + Torque clients for SLC5
          • signed rpm repository
          Please note that this version is supported only on 64-bit machines.
          Release notes:
          http://glite.web.cern.ch/glite/packages/R3.2/x86_64/updates.asp

        • EGEE issues coming from ROC reports
          • None this week
        • Grid Service Interventions
          CERN: Significant network disruption, 19th March, 06:00 to 08:00 CET. Details

          To make use of this intervention, the UK has suggested that regions and sites record the problems seen during this time. Please report back next week so that the results can be considered.
          Any major problems to be discussed this week?

          Link to CIC Portal (broadcasts/news), scheduled downtimes (GOCDB) and CERN IT Status Board

          Please consult the URLs above for details.

      • 16:30 17:00
        WLCG Items 30m
        • WLCG issues coming from ROC reports
          None
        • Wiki page containing FTM Endpoints
          Can all Tier-1 sites please keep the list of FTM endpoints up to date? The list is here: https://twiki.cern.ch/twiki/bin/view/LCG/LCGFTMEndpoints
        • WLCG Operational Review
          The minutes of the daily WLCG Operations meetings (one file per week) are available here: https://twiki.cern.ch/twiki/bin/view/LCG/WLCGOperationsMeetings
          Speaker: Harry Renshall / Jamie Shiers
        • Alice items
        • Atlas items

        • CMS items
          1. Please have a look at the daily reports given at WLCG daily calls here.
        • LHCb items
        • WLCG service recommended baseline versions
          FTS Configuration
          The current FTS tries SRM v1 unless endpoints are published correctly.
          To force SRM v2.2, use:

          FTA_GLOBAL_ACTIONS_SRMVERSION="2.2"

          Once glite-data-transfer-agents 3_3_4_1 is released, this will be the default anyway.
          Given that all sites are now using SRM v2.2, we recommend adding this configuration to all current FTS instances now, before this upgrade reaches production.
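
For sites that want to confirm the setting is in place, the check can be sketched as below. This is a minimal illustration only: it assumes the FTS configuration is a text file of shell-style KEY="value" lines, and the helper names parse_settings and forces_srm_v22 are hypothetical, not part of any gLite tool.

```python
import re

# Name of the setting quoted in the agenda; the value "2.2" forces SRM v2.2.
SETTING = "FTA_GLOBAL_ACTIONS_SRMVERSION"

# Matches shell-style assignments such as KEY="value" or KEY=value.
# Assumption: one assignment per line, no trailing comments on the line.
ASSIGNMENT = re.compile(r'^\s*([A-Za-z_][A-Za-z0-9_]*)=(["\']?)(.*?)\2\s*$')


def parse_settings(text):
    """Return a dict of KEY -> value from shell-style assignment lines."""
    settings = {}
    for line in text.splitlines():
        if line.lstrip().startswith("#"):
            continue  # skip commented-out lines
        match = ASSIGNMENT.match(line)
        if match:
            # A later assignment of the same key overrides an earlier one.
            settings[match.group(1)] = match.group(3)
    return settings


def forces_srm_v22(text):
    """True if the configuration text sets the SRM version to 2.2."""
    return parse_settings(text).get(SETTING) == "2.2"
```

To use it, read the site's own configuration file into a string and pass it to forces_srm_v22; the file location depends on the installation and is not given here.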

          The recommended baseline versions can be found here: https://twiki.cern.ch/twiki/bin/view/LCG/WLCGBaselineVersions

      • 17:00 17:30
        OSG Items 30m
        Speaker: Rob Quick (OSG - Indiana University)
        • Discussion of open tickets for OSG
          OSG
          Exactly this was discussed last week, and Rob had an action to check. GGUS #46647: Duplicate of the above? If 'yes', is this human error?
          Guenter is also asked to check on the GGUS side via https://savannah.cern.ch/support/?107511#comment4
      • 17:30 17:35
        Review of action items 5m
      • 17:35 17:35
        AOB

      • It's show time folks, again! Next week: LHC experiment VOs to perform an ALARM ticket test (full round from opening to ticket closing) to Tier1s. [savannah ticket #107452] and [testing rules]. Summary reports must be sent to wlcg-operations@cern.ch by April 3rd at the latest!
      • USAG meeting on 1st level User Support Strategy 2009-04-02