- Diagnosing Coffea Casa deployment issues ongoing
- Duplicate resource errors when spawning notebooks
- EOS deployment ongoing
- Cluster up with 2PB of storage across several 90TB arrays (retired MWT2 storage)
- Need to understand options available for authentication. Don't really want to run Kerberos
- Gathering a list of issues/tweaks/workarounds with the Helm charts, would like to meet with the developers at some point to discuss further
- Experimenting with WireGuard 'routing node' features
- Don't have to install WireGuard on all nodes, but a node can be a NAT between the WG network and a private LAN
- Demonstrated connectivity from, e.g. umich001 to UChicago AF NFS server via the WG network[1]
- Was also able to mount /home and it seems to work. 200MB/s read/write - not great but probably due to MTU ~1500 as Aidan/Judith observed
- Sent Wei a demonstration `podman` command to create a pod to join the WG network
- Tested HTCondor glidein on NET2 (ostensibly to connect back to UChicago AF), caused HTCondor to segfault :)
- Kuantifier discussion tomorrow, use Facility R&D link:
[1]
[root@umich001 ~]# tracepath 192.168.240.133
1?: [LOCALHOST] pmtu 1280
1: 100.81.190.82 6.515ms
1: 100.81.190.82 6.275ms
2: 192.168.240.133 6.750ms reached
Resume: pmtu 1280 hops 2 back 2