From Italy to France
Handover Log:
1) several tickets about APEL problems have been unsolved for a month: we should involve the APEL experts (at least in the tickets they are not investigating yet) to understand whether there is a common cause or each site has a different problem.
(RO-15-NIPNE) https://gus.fzk.de/ws/ticket_info.php?ticket=54784&from=ID
(MA-01-CNRST) https://gus.fzk.de/ws/ticket_info.php?ticket=54115&from=ID
(IN-DAE-VECC-02) https://gus.fzk.de/ws/ticket_info.php?ticket=54839&from=ID
(RO-11-NIPNE) https://gus.fzk.de/ws/ticket_info.php?ticket=54771&from=ID
(CA-SCINET-T2) https://gus.fzk.de/ws/ticket_info.php?ticket=54764&from=ID
(VN-HPCC-HUT-HN) https://gus.fzk.de/ws/ticket_info.php?ticket=54731&from=ID
(CA-ALBERTA-WESTGRID-T2) https://gus.fzk.de/ws/ticket_info.php?ticket=54707&from=ID
(CERN-PROD) https://gus.fzk.de/ws/ticket_info.php?ticket=54424&from=ID
2) last week very old alarms appeared on the dashboard for monitored nodes marked "not production" (such as grid-ce2.physik.rwth-aachen.de, still present): some sites want to keep a host monitored for test purposes and do not want to receive tickets in case of problems. Moreover, nodes registered in GOC-DB in this way are taken into account for site availability metrics (aren't they?).
We should ask the SAM/NAGIOS developers not to trigger alarms for nodes marked as "not production" and to exclude them from availability calculations.
As a temporary solution, these nodes are put in downtime (the relevant items will still appear in the dashboard in case of problems).
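To make the request to the developers concrete, the intended behaviour could be sketched roughly as follows. This is only an illustration: the Node record, its fields, the function names and the example hostnames are all hypothetical, and the availability figure is a simple pass fraction, not the real SAM/NAGIOS algorithm.

```python
from dataclasses import dataclass


@dataclass
class Node:
    hostname: str
    production: bool  # hypothetical flag mirroring the GOC-DB "production" setting
    monitored: bool   # hypothetical flag mirroring the GOC-DB "monitored" setting
    ok: bool          # hypothetical outcome of the latest test on this node


def nodes_to_alarm(nodes):
    """Failing nodes that should raise an alarm: monitored AND production."""
    return [n for n in nodes if n.monitored and n.production and not n.ok]


def availability(nodes):
    """Fraction of passing nodes, counting only monitored production nodes.

    Simplified stand-in for the real metric; returns None if nothing counts.
    """
    counted = [n for n in nodes if n.monitored and n.production]
    if not counted:
        return None
    return sum(n.ok for n in counted) / len(counted)


# Made-up hosts: two production nodes and one monitored test node.
nodes = [
    Node("ce-prod.example.org", production=True, monitored=True, ok=True),
    Node("se-prod.example.org", production=True, monitored=True, ok=False),
    Node("ce-test.example.org", production=False, monitored=True, ok=False),
]

alarms = [n.hostname for n in nodes_to_alarm(nodes)]
print(alarms)               # the failing test node is not listed
print(availability(nodes))  # the test node does not lower the figure
```

With this rule a site can keep a test host monitored without receiving tickets for it and without affecting its availability figure, so the downtime workaround would no longer be needed.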
3) a brief report on MPI test status:
last Friday: 14 CEs in error, 4 in maintenance
- general failures on kg-ce01.cc.kuleuven.be (BEgrid-KULeuven), creamce.reef.man.poznan.pl (PSNC), gilda-ce.rediris.es (RedIRIS_GILDA) and cedric.scai.fraunhofer.de (SCAI)
- 1 is failing OPENMPI test (PSNC)
- 6 are failing MPICH test (BEgrid-ULB-VUB, INFN-NAPOLI-PAMELA, prague_cesnet_lcg2, PSNC, SZTAKI, Taiwan-LCG2, UFRJ-IF)
- 2 are failing MPICH2 test (BEgrid-ULB-VUB, PSNC, Taiwan-LCG2)