Rucio Meeting

Europe/Zurich
Martin Barisits (CERN)
Host
Martin Barisits
Alternative hosts
Mario Lassnig, Cedric Serfon, Dimitrios Christidis
    • 15:00 15:05
      News 5m
      • One Google Summer of Code project proposal for Rucio submitted
      • Mattermost move proposal
        • https://mattermost.web.cern.ch/rucio
        • Send an email to the usual mailing lists announcing that the move will happen on March 6
        • On March 6 official announcement on Slack that we are now only on Mattermost
          • Disable all Slack invitation links (no new members)
        • On March 10 archive ALL channels
          • Maybe not possible for #general
        • Sometime in the future delete the workspace completely
      • Next week release checkpoint
    • 15:05 15:25
      Community News & DevOps roundtable 20m
      • ATLAS
        • signed_url issue still a problem (see last week's notes)
          • more monitoring fields to understand the issue
        • Alma9 testing
          • Servers worked well
          • Reaper issue with gfal2
            • CPU Consumption very high
            • Worked correctly, but much slower
            • Did not investigate further (yet)
            • Standard RPM was used
            • Needs investigation by GFAL team
      • CMS
        • Lag between FTS and Rucio
          • Delay between a transfer success / failure in FTS and its appearance in Rucio
            • 8h gap, 24h gap, ... seen
            • Is conveyor-receiver running?
              • Radu: Yes, but not in the full_update mode, so it will still wait for finisher #5532
            • Need to understand if it is held by finisher or by poller/receiver
          • Recovery from the accidental mark-as-lost issue
            • finisher took a morning to work through 1M requests
            • CMS runs 4 x 5 = 20 finisher threads
            • ATLAS runs 42 finisher threads (=6 x 7 or so)
        • Lag in staging from tape
          • Probably not a Rucio thing?
            • FTS issue?
      • Fermilab / DUNE / RUBIN / ...
        • DUNE
          • Protocol priorities
            • delete-protocol reshuffles existing protocol priorities - no like :-)
          • Issue with renewing tokens not working
            • Fixed in next week's LTS release
          • rucio upload fails to DPM sites without the full parent directory tree
            • gfal does not trigger parent directory creation?
            • Noticed in move from 1.26 to 1.29
        • RUBIN
          • data ingest test successful
      • ESCAPE / InterTwin
        • Not much news
      • MultiVO / RAL
    • 15:25 15:55
      Developers roundtable 30m
      • 1.31 "Donkeys of the Caribbean" Priority followup
        • In Progress
          • Create ongoing token architecture document [Dimitrios]
          • Increase WebUI Test coverage [Mayank]
            • 2 / 4 suites implemented 
              • React Component tests
              • API tests
            • Remaining are integration tests
          • Collect feedback from running ATLAS webui beta [Mayank]
          • Move WebUI Core to Clean + Hexagonal Architecture + Domain Driven Design #117 [Mayank]
            • UserPass login workflow migrated
            • 7 more rest-calls need to be moved
            • UI Components need to be moved too
            • Change of branches on repo
          • Merge list_dids and list_dids_extended methods #5448 [Rob]
          • Track SQLAlchemy 2.0 migration progress [Yuyi, Martin]
            • Probably hard to auto-generate
            • Maybe just an overview of which modules are migrated and which are missing
            • Radu went through test failures on the 2.0 migration
              • Fixed some, but far from complete (Radu is not actively working on this)
              • The tricky ones are left
        • In Review
          • Unable to Delete File DID via Undertaker #5154 [Martin, Anton]
        • Todo
          • Exchange of tombstone function-based indices with normal indices #5440 [Martin]
          • foreign key error on deleting dids in reaper #5733 [Martin, Cedric]
          • Rules on containers in state OK but not all the files from the containers have locks #5447 [Martin]
          • Reduce rule tickets to <13 [Martin]
          • Create a server/daemon installation howto #5445 [Mayank]
          • Create developers testing guide in the documentation #5452 [Mayank]
          • WebUI release process #121 [Mayank]
          • Replace hermes1 by hermes2 [Mario, Cedric] -> ticket
            • Rename hermes1 -> hermes_legacy
            • hermes2 -> hermes
            • Need to check hermes2 if everything is there
              • Would need to check ActiveMQ and eMail in e.g. ATLAS
        • Done
          • Add python 3.10 tests to CI framework. #5145 [Mayank]
          • STOMP connections not closed by Hermes? #5894 [Eric, Yuyi]
            • Ran a simulation with hermes but could not trigger the connection problem
            • Needs more investigation
            • This one can be closed, but Kronos does cause connection issues
      • Documentation corner
        • Ambiguity of "developing" and "developer" #25 [Eraldo]
          • PR out, needs review
        • Document environmental variables affecting the client #171 [Dimitrios]
        • Documentation and dev guidelines for mypy type annotations #116 [Mayank, Martin]
        • Create developers testing guide in the documentation #177 [Mayank] [1.31 priority]
        • Create a server/daemon installation howto #178 [Mayank] [1.31 priority]
      • Other topics
    • 15:55 16:00
      AOB 5m