- We upgraded Kubernetes two major versions from 1.21 to 1.23
- We upgraded HTCondor to 9.0.13 with OSG 3.6 on head/login nodes
- We did a
yum update of all packages on the AF login and head nodes, including the latest mainline Kernel from ELRepo
- We are still upgrading workers in the background
- We deferred the CephFS upgrade from v16 (Pacific) to v17 (Quincy) - we found 1 node (c001) with what seems to be a hardware error - all disks are reporting "I/O error" trying to mount them. Need to get the cluster clean before we upgrade major version.