Briefly blacklisted on 8/13 due to failure of jobs from two servers to resolve Varnish host name.  Restarted the hosts in question and haven't seen this error since.  Nobody seems to have seen it before so for now we are just keeping an eye out in case it comes back.

Also blacklisted on 8/12 due to a network instability affecting the connection between servers in the campus data center and those in the main data center.  (These are different servers than those that had the Varnish issue.)  UMass IT fixed the issue pretty quickly and it hasn't recurred.

Finally, blacklisted on 8/6 due to SCRATCHDISK becoming unavailable thanks to extremely high load on a dcache pool, owing to a user trying to stage a file on that pool for all of his thousands of jobs.  Using LRU as for load balancing policy, cost based p2p migration is tricky.

Also, saw an unusual XCache problem, where xrootd crashed inside the XCache: as a result, caching wasn't working, but jobs continued to be submitted to the VP queue because the XCache itself was up.

OKD upgrade preparation is almost done. Tests on baremetal is done and we are aiming to shcedule a downtime to next Thusday.