SA1 coordination meeting

Europe/Zurich
28/R-015 (CERN)

28/R-015

CERN

15
Show room on map
Description

Minutes, https://edms.cern.ch/document/923879

Actions, https://edms.cern.ch/document/923881

This meeting is 10:00 to 11:30 UTC+1
Phone number is: +41 22 767 6000
Access code is: 0191881
Or click here: https://audioconf.cern.ch/call/0191881

The conference call opens 15 minutes before the meeting starts.
    • 10:00 10:10
      Admin matters
      1. Deadline for QR3 contributions (partner reports, task reports, ROC aggregations): Friday 6th February
      2. Proposal for next F2F SA1 coordination meeting:Tuesday 9th or Thursday 11th June @ CERN (WLCG GDB on Wednesday 10th). What to do with associated meetings, AOT, COD, etc? if CERN, confirmation needed asap to book rooms.
      3. Installed capacity: a request from the EGEE project office for all ROC Managers to “convert” the installed capacity commitments (for computing) that appear in the Description of Work, from kSpecInt2000 values, to numbers of cores.
    • 10:10 10:25
    • 10:25 10:45
      Grid Configuration Data

      What should be on the grid?

      slides
    • 10:45 10:55
      Report from the All-Activity meeting
    • 10:55 11:10
      ROC regional models
    • 11:10 11:25
      Nagios Deployment Update

      The EGEE SA1 Nagios bundle was released this week with significant updates.

      • GOCDB Integration: A list of Sites can now be collected using the GOCDB's new API. In particular a list of sites in a ROC or in a Country can be monitored extending the previous LDAP filter on Sites.
      • GOCDB Downtimes: Downtimes entered in the GOCDB are now also pulled into and inserted as NAGIOS downtimes for your services.
      • Integration with Existing NAGIOS/Apache: YAIM variables now exist to enable NAGIOS, apache or NCG configuration. By default off. All must be set to true to restore old behaviour. The backwards incompatible bit...
      • HGSM Integration: HGSM is the SouthEast Europe equivalent to the GOCDB.
      • NDOUtils Installed: NDOUtils sits behind NAGIOS and fills in a MySQL database with NAGIOS's configuration and metric and test results.
      • New SRM Tests: These mimic some of the logic of the existing SAM SRM tests. The eventual replacement to the SAM SRM tests. In NAGIOS speak we now have an active check that submits scripts and returns passive results for each of steps of the lcg-cr, lcg-rep, lcg-del seem before.
      • NSCA Installed: Especially for the case where two nodes are used, a NAGIOS node and NRPE triggered UI then passive test results are submitted back via NSCA from the NRPE-UI. (Known Issue)
      • New BDII Checks: These are the checks taken directly from the gstat2 work but now running against your services.
      • New msg-to-queue Service: Running on a NAGIOS box this subscribes to externally executed test results for your Site or ROC from the ActiveMQ messaging system. Currently nothing is actually coming in but much of the infrastructure is now there.
      As before installation can still be done completely via YAIM both for a site or ROC. New packages can be followed for i386 or x86_64 via repoview. And of course bug reports and feedback are always welcome.

      Steve Traylen on behalf of EGEE SA1 - OAT.
    • 11:25 11:40
      Operational Security update
      slides
    • 11:40 11:55
      Post-mortem of release process
      slides
    • 11:55 12:05
      AOB
      1. TPM assignment time and quality of their assignment: The wrong assignemnts are going up even in case of simple assignments, and now that only the most difficult cases are going to the TPMs their training is essential, so we would like to remind the ROCs of that.
      2. Roadmap summary for regionalisation of operations in 2009: input needed from RU, DE-CH and SEE by next SA1 phone meeting at the latest.
      3. All ROCs: Please identify your underperforming sites in the January Availability report, and provide a short explanation.