Updated CVMFS to use a Varnish-style cache server from SLATE.
Each site (MSU/UM) now has its own Varnish instance, running on the site's own SLATE cluster.

 

3/10/2023 
MSU upgraded the Juniper OS on all rack data switches to fix a bug that prevented access to a read-only account on those switches.
Some MSU worker nodes had a burst of failed jobs, from a couple of sources:
Some failures were expected, because part of the redundant/bonded cabling was missing (solved; everything is cabled now).
Some were unexpected and related to the force-on setting needed for provisioning (avoidable in the future).

 

3/16/2023

A new security kernel update became available. We applied it to all of our worker nodes and interactive login nodes and rebooted them into the new kernel. We also took the opportunity to update the firmware on the worker nodes. This process required draining the HTCondor cluster, and for most of that time BOINC backfill jobs filled the draining job slots. Only during a 2-day period, while we were draining a small batch of worker nodes, did the BOINC queue have no available jobs, so some draining job slots went unfilled.

 

2023 Equipment orders placed at MSU and UM  
18x R740xd2 with 20 TB drives, for an estimated 6.8 PB in dCache (minus retirements)
12x R6525 with AMD 7443 for 1152 cores, or an estimated ~2k HS06 (minus retirements)
a second NVMe storage node for the MSU VMware cluster
a second NVMe storage node for MSU SLATE
another NVMe storage node for UM 
also storage and GPU node for UM T3
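
The core count on the R6525 line can be sanity-checked with a little arithmetic. The sketch below assumes each R6525 has both sockets populated with an AMD EPYC 7443 (24 physical cores per CPU) and that SMT is enabled; neither assumption is stated in these notes, and the names used are illustrative only.

```python
# Sanity check of the logical core count for the R6525 order.
# Assumptions (not stated in the notes): dual-socket servers, SMT enabled.
SERVERS = 12             # from the order: 12x R6525
SOCKETS_PER_SERVER = 2   # assumption: both sockets populated
CORES_PER_CPU = 24       # AMD EPYC 7443 physical cores
THREADS_PER_CORE = 2     # assumption: SMT (hyperthreading) enabled


def logical_cores(servers, sockets, cores, threads):
    """Return the total number of logical cores across all servers."""
    return servers * sockets * cores * threads


total = logical_cores(SERVERS, SOCKETS_PER_SERVER, CORES_PER_CPU, THREADS_PER_CORE)
print(total)
```

Under these assumptions the total comes to 1152 logical cores; with SMT disabled it would be half that.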