News:
- LHCb drained last Sunday due to a problem with one of the LHCbDIRAC VMs
  - Recovered by Monday morning
Issues:
- Network outages [GGUS ticket]:
  - All has looked ~OK since Friday afternoon
  - The issue was fully fixed today; can we close the ticket?
- ceph-svc24 was intermittently crashing yesterday
  - Due to a bug in the new checksum dump code
  - This caused a (minor) increase in upload failures
- Spike of deletion failures this morning
  - All failures were due to timeouts
CVMFS:
Stratum-1 servers were rebooted yesterday. This stopped snapshots because the reboot removed the /run/cvmfs.local
directory. The directory was recreated manually yesterday evening, which enabled snapshots again. A proper fix is being prepared.