Hypervisor issue yesterday: the site caches were affected but survived, possibly with some performance degradation that nobody noticed.
Alarms, fault tolerance, Telegram: all worked beautifully, apparently. The system came back to 100% without any intervention. I'll double-check anyway, as it's Friday and it seems almost too good to be true.
Another issue popped up around 21h00 yesterday, 19/03. It affected the acron job that runs the whitelist and hostcount checks. It resolved by itself around 22h00. Cause: unknown, possibly the same network/hypervisor issue as in the previous points.
Apparently I am not able to prevent cvmfsdata20-4455801563 from swapping. This causes alarms, as those machines are not supposed to swap. I set vm.swappiness to 0, but it does not seem to work. What can I do?
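
As a first diagnostic step, here is a minimal sketch (not a fix) for checking what the machine is actually doing. It assumes a Linux host with the standard /proc layout; the function names are hypothetical helpers made up for illustration. It prints the live vm.swappiness value, the swap currently in use, and the processes holding the most swap according to the VmSwap field in /proc/<pid>/status. As far as I understand, vm.swappiness=0 only tells the kernel to avoid swapping as long as it can; it does not disable swap, so pages can still be swapped out under memory pressure.

```python
import os
import re


def parse_kb_field(text, field):
    """Return the kB value of a 'Field:   123 kB' line, or None if absent."""
    match = re.search(rf"^{re.escape(field)}:\s+(\d+)\s*kB", text, re.MULTILINE)
    return int(match.group(1)) if match else None


def swap_used_kb(meminfo_text):
    """Compute swap in use (kB) from the contents of /proc/meminfo."""
    total = parse_kb_field(meminfo_text, "SwapTotal")
    free = parse_kb_field(meminfo_text, "SwapFree")
    if total is None or free is None:
        return None
    return total - free


def top_swappers(proc_root="/proc", limit=10):
    """List (swap_kb, pid, name) for the processes holding the most swap."""
    results = []
    for entry in os.listdir(proc_root):
        if not entry.isdigit():
            continue  # not a PID directory
        try:
            with open(os.path.join(proc_root, entry, "status")) as fh:
                status = fh.read()
        except OSError:
            continue  # process exited or is not readable
        swap = parse_kb_field(status, "VmSwap")
        if swap:  # skip kernel threads (no VmSwap) and zero values
            name = re.search(r"^Name:\s+(.*)$", status, re.MULTILINE)
            results.append((swap, int(entry), name.group(1) if name else "?"))
    return sorted(results, reverse=True)[:limit]


def report():
    """Print the live swappiness setting, swap in use, and top swap users."""
    try:
        with open("/proc/sys/vm/swappiness") as fh:
            print("vm.swappiness =", fh.read().strip())
        with open("/proc/meminfo") as fh:
            print("swap used (kB):", swap_used_kb(fh.read()))
        for swap, pid, name in top_swappers():
            print(f"{swap:>10} kB  pid {pid}  {name}")
    except OSError as exc:
        print("could not read /proc:", exc)


if __name__ == "__main__":
    report()
```

Run as root on the affected node so that every /proc/<pid>/status is readable. If the printed swappiness is not 0, the setting did not persist; if it is 0 and swap is still in use, the list of top swappers should at least show which process to look at next.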