WLCG-OSG-EGEE Operations meeting

28-R-15 (CERN conferencing service (joining details below))


CERN conferencing service (joining details below)

Nick Thackray
Weekly OSG, EGEE, WLCG infrastructure coordination meeting.
We discuss the weekly running of the production grid infrastructure based on weekly reports from the attendees. The reported issues are discussed, assigned to the relevant teams, followed up and escalated when needed. The meeting is also the forum for the sites to get a summary of the weekly WLCG activities and plans
  • OSG operations team
  • EGEE operations team
  • EGEE ROC managers
  • WLCG coordination representatives
  • WLCG Tier-1 representatives
  • other site representatives (optional)
  • GGUS representatives
  • VO representatives
  • To dial in to the conference:
    a. Dial +41227676000
    b. Enter access code 0148141

    OR click HERE
    (Please specify your name & affiliation in the web-interface)

    Click here for minutes of all meetings

    Click here for the List of Actions

      • 4:00 PM 4:00 PM
        EGEE Items
        • <big> Grid-Operator-on-Duty handover </big>
          "Old" COD: Germany/Switzerland (DECH) => Russia

          Report from "old style" COD:
          No unresponsive sites. Nothing to raise.

          cCOD: North Europe (NE) => Asia Pacific (AP)

          Report from cCOD:
        • Vera: There are a number of ROC tickets that are well overdue. Also, please switch off alarms that are in OK state.

  • <big> PPS Report & Issues </big>
    Please find Issues from EGEE ROCs and general info in:
  • <big> gLite Release News</big>
  • <big> EGEE issues coming from ROC reports </big>
    • IT-ROC: Most of the errors (lcg-cr test for CE) at Italian sites (last night until early this morning), were due to our top-bdii egee-bdii.cnaf.infn.it: one of the dns configured on it was unreachable, so the bdii has been emptied.

    • SEE ROC:Some middleware components do not like 3 years period logs (which it is a requirement) due to system limits, please see the corresponding ticket at https://gus.fzk.de/ws/ticket_info.php?ticket=48291 .

    • SWE ROC: 32bit binaries overwrite 64bit binaries for lcg-utils (installation of WNs)

  • <big>Grid Service Interventions </big>


    Downtimes effecting the WLCG tier-1 sites:

    Link to CIC Portal (broadcasts/news), scheduled downtimes (GOCDB) and CERN IT Status Board

    Please consult the URLs above for details.

  • 4:01 PM 4:01 PM
    OSG Items
    Speakers: Maria Dimou, Rob Quick (OSG - Indiana University)
    • Discussion of open tickets for OSG
      Information on GOCdb-to-OIM migration for USA wLCG T1 sites
      • ggus #44104. This ticket is waiting on the OSG GOC to roll out changes to their production BDII that will publish entries by their OSG resource group, not the OSG resource name. This will remove this issue before it gets to the BDII. Next action deadline in OIM is in Feb 2010. Should we close as unsolved to free the escalation reports?
      • ggus #37059. Urgent ticket re-opened. Please have a look.
      • ggus #47786. Site concerned is Nebraska. Urgent. Submitted 2009-04-08! Some OSG reminders remain unanswered by the site (?) The submitter arbitrarily decided no LHCb jobs should be submitted at the Nebraska site but this is not the opinion of the VO management. A generic queue to be used when resources are spare would be appreciated.
  • 4:02 PM 4:02 PM
    Review of action items
  • 4:03 PM 4:03 PM