- Compact style
- Indico style
- Indico style - inline minutes
- Indico style - numbered
- Indico style - numbered + minutes
- Indico Weeks View
Multiple crashes on Sat caused by (per Andreas)
Working on putting the authentication proxies in front of the MGM: very exotic issue with iptables... (was working for EOSCMS, but somehow not for EOSALICE).
Roberto is working on EOSALICE scheduling group unbalance (got random FST crashes, now going slowly = 25 filesystems/day, but need to do O(3000) filesystems in 30 groups-> 160 days ETA at current rate.. scripted but launched manually). Andreas might have suggestion on how to speed this (or take 1 FS/node in parallel?).
Started to work on EOSCMS (which reached critical levels, up to 99% full) and EOSPUBLIC.
Note: have filesystems of different sizes (2TB..6TB), should take into account for groups.
EOSATLAS (Cristi): similar - drain filesystems, add them to groups based on fullness (add until <90% full).
Investigating "strange" 1min-delays (also seen by probe, also by WOPI - stat() takes minute(s)). Could have been Backup launching in a "storm", but unlikely.
Andreas suggests a better probe: mkdir() on established connection (vs "mkdir" on a new/separate connection)? Might be different, seems to come from "xrdcp -f" waiting for redirection.
Not reaching the max number of threads (4k).
Might capture the latency in MGM - have this but would need to reset every hour.
Compiled 4.2.0-3 on el7/el6 for koji. el6 repo has a new dependency, hiredis.
el7 testing: http://linuxsoft.cern.ch/internal/repos/eos7-testing/x86_64/os/Packages/
el6 testing: http://linuxsoft.cern.ch/internal/repos/eos6-testing/x86_64/os/Packages/
Dan's basic tests are passing, but these have *not* been pushed to qa.
Also, eos-fusex 4.2.0-3 can be found in the above repos, but puppet eosclient integration incomplete.
Q: what needs to be done - should not be blocked for 3 weeks.
Q: who can push this to "qa" since fixes 4.1.30 session binding crash? see brand-new EOSops procedure.
They confirmed the preferred slot for migrating to Citrine would be after the Christmas shutdown
Meeting on friday about Batch on EOS, hence also about CentOS 7 and Citrine migration
SWAN had "spontaneous" update to EOSFUSE 4.1.30 (which crashes on LXPLUS, when used with per-session bindings.. might not affect).
new FUSE
todo
numeric UIDs: done, clients resolve, converter handles
protobuf
Have 2 old ALICE headnodes, now doing EOSBACKUP namespace conversion tests - found issues with orphans and name conflicts (done on-the fly during boot) . To be fixed today, will then convert+validate.
Rollout: EOSBACKUP. Does it need CC7? yes, only on MGM and QuarkDB". Luca: "mhmmmh.."
Task 263925 starts at Mon Oct 23 16:39:20 2017 and ends at Mon Oct 23 17:07:32 2017 (28.2 minutes)
Analysed jobs: 100
Correct jobs: 100
Maximum concurrency: 3
Execution hosts (top 5): b69586e854 [#43] b64972dff9 [#28] b674d8742c [#19] b6163cf2d6 [#10]
Execution environments (top 5): eos-client-4.1.30-1.el6.x86_64, eos-fuse-core-4.1.30-1.el6.x86_64, xrootd-client-libs-4.6.1-1.el6.i686, xrootd-client-libs-4.6.1-1.el6.x86_64 [#100]
Q: why still running xrootd-4.6 (has "empty buffer retry" issue) - should be 4.7 - where is this version coming from?