WLCG-OSG-EGEE Operations meeting

Europe/Zurich
28-R-15 (CERN conferencing service (joining details below))

28-R-15

CERN conferencing service (joining details below)

Maite Barroso Lopez (CERN)
Description
grid-operations-meeting@cern.ch
Weekly OSG, EGEE, WLCG infrastructure coordination meeting.
We discuss the weekly running of the production grid infrastructure based on weekly reports from the attendees. The reported issues are discussed, assigned to the relevant teams, followed up and escalated when needed. The meeting is also the forum for the sites to get a summary of the weekly WLCG activities and plans
Attendees:
  • OSG operations team
  • EGEE operations team
  • EGEE ROC managers
  • WLCG coordination representatives
  • WLCG Tier-1 representatives
  • other site representatives (optional)
  • GGUS representatives
  • VO representatives
  • To dial in to the conference:
    a. Dial +41227676000
    b. Enter access code 0148141

    OR click HERE
    (Please specify your name & affiliation in the web-interface)

    Click here for minutes of all meetings

    Click here for the List of Actions

      • 1
        EGEE Items
        • a) <big>Central Grid-Operator-on-Duty (c-COD) handover</big>
          From Italy to France

          Last week a lot of alarms appeared for Germany/Switzerland sites - most of them should be switched off - as requested on 2009/09/03 - but nobody responded from the Germany/Switzerland ROC The problems with Asia Pacific have now been resolved.
          Cristina Aiftimiei
          IT C-COD

        • b) <big> PPS Report & Issues </big>
          Please find Issues from EGEE ROCs and general info in:
          https://twiki.cern.ch/twiki/bin/view/LCG/OpsMeetingPps
          1. gLite 3.2.0 PPS Update 06 released on 3rd September.
          2. Highlights include: GFAL, yaim core and clients, BDII, lcg-infosites, Myproxy and Torque.
        • c) <big> gLite Release News</big>
          Please find gLite release news in:
          https://twiki.cern.ch/twiki/bin/view/LCG/OpsMeetingGliteReleases

          1. Nothing yet.
        • d) <big> EGEE issues coming from ROC reports </big>
          Italy had not completed their ROC report at 14:00 CEST today.
          • SouthEast: Concerning the MPI + glite-3.2 issue, we propose to enable the MPI SAM tests only after the MPI it is fully supported by the glite-3.2 middleware
        • e) <big>Grid Service Interventions </big>
          Link to CIC Portal (broadcasts/news), scheduled downtimes (GOCDB) and CERN IT Status Board
          Please consult the URLs above for details.

        • f) <big>Miscellaneous</big>
          • SAM default DPM upgrade

            Last reminder that the default DPM used for SAM tests will be upgraded to SL4 today Monday 7th of September, and that sites with obsolete client S/W will start failing tests.

            SAM is using lxdpm101 since 15:00 CET with lxdpm103 as backup. Both are running the SL4 DPM, which is incompatible with old versions of GFAL. A few sites still have not upgraded their WNs, despite repeated warnings and extensions of the grace period. They have started failing the SAM tests now.

            We intend to reinstall lxdpm104 tomorrow when no more SAM job should be alive that still refers to that SE.

          • 6 Sites running legacy gLite releases, those not upgraded this week will be moved to suspended/uncertified till they do so:
            Site Host Version
            EENet kriit.eenet.ee 3.0.2
            HK-HKU-CC-01 ce.grid.hku.hk 3.0.2
            JP-KEK-CRC-01 dg10.cc.kek.jp 3.0.2
            Taiwan-IPAS-LCG2 atlasce.phys.sinica.edu.tw 3.0.2
            Taiwan-NCUCC-LCG2 ce.cc.ncu.edu.tw 3.0.2
            TW-NTCU-HPC-01 host001.hpc.ntcu.edu.tw 3.0.2

          Should we do this now. thankyou A-P for your comments.

      • 2
        OSG Items
        Speakers: Maria Dimou, Rob Quick
      • 3
        Review of Action Items
      • 4
        AOB