CTA deployment meeting

Europe/Zurich
600/R-001 (CERN)

600/R-001

CERN

15
Show room on map
Michael Davis (CERN)

Getting LHCb into Production

  • eos-xrootd-4.12.7 and eos-4.8.39 fix the XRoot "write to files opened read-only" bug and also the "missing check of targetsizeerror" bug fixed by Steve.
  • Julien has deployed these versions on eosctalhcbpps (15/02/2021).
  • Michael followed up with Chris Haen after the meeting about final tests and schedule.

DAQ Tests

  • Chris does not have time right now to do the functional test of DAQ-to-CTA with check file safely on tape check. This is not a blocker for migration, we will do these tests in production.

HTTP Tests

  • On the other hand, Chris does want to test CTA to/from all the other sites before migration: "I am currently deploying HTTPS in production in all our sites to be able to talk with CTA but to have the full coverage, we will need the token configuration, to accomodate for sites using Storm."
  • Michael, Chris and Julien will meet on Monday 22/02/2021 @ 11.00 to finalise details of the HTTP tests.

Revised Schedule

  • w/c 22 February: HTTP tests. Prepare OTGs.
  • 1 March: disable write access to CASTOR LHCb in preparation for migration. Check all queues are flushed and everything is written to tape.
  • 8 March: disable CASTOR LHCb. Migrate to CTA.
  • 15 March: EOSCTA LHCb in production

To Do

  • HTTP+TPC tests
  • Publish OTGs

 

Getting PUBLIC into Production

  • AMS tests are in progress. xrdcp transfers work, but when transferring with FTS there is an error, "Operation not supported: Destination does not support delegation." Vova is following up.
  • NA61/Shine: Meeting on Thu 18/02/2021 @ 14.00 with EOS+FTS+CTA teams to discuss "best practice" for their new DAQ storage endpoint.
  • DUNE: no news, we are not pushing as we have enough to do and there is no rush (data taking starts in 2022).
  • n_TOF, COMPASS: need to test with spinner space. Julien will configure this this week so Vova can proceed with testing.
  • Repack of public_user is continuing with tapes containing LEP data in preparation for migrating the legacy tapepool.
  • Draft KB article for users to access files in CTA is in CodiMD.

NA62 Schedule

  • Repack of NA62 tapes is complete and has been checked that all NA62 fileclasses are now in the correct tapepool.
  • Vova and Michael met with Barbara (17/02/2021). Barbara confirms that their code will clean up files from EOS PUBLIC for their recall workflow (CTA → EOS PUBLIC → process batch in HT CONDOR → delete files from EOS PUBLIC). Therefore spinner space is not required for the NA62 case. (It is still required for n_TOF and COMPASS).
  • This means that NA62 tests are complete and migration to CTA can proceed on the original schedule.
  • Mon 22 Feb: remove migration routes for NA62
  • Check all archive queues are flushed to tape
  • Disable NA62 directories in CASTOR
  • Migrate NA62 tapepools to CTA
  • Mon 1 March: NA62 in production

 

There are minutes attached to this event. Show them.
    • 14:00 14:10
      Getting LHCb into Production 10m

      Status

      • HTTP+TPC: XRoot issue #1373 was fixed upstream and Michal was to backport to v4.12. Status?
      • Tests with CNAF using delegated X.509 credentials, see #209
      • Token authorisation configuration (not a blocker for LHCb migration)
      • LHCb DAQ tests: Maria has been testing DAQ to EOS with xrdcp (no DIRAC yet), 10 GB/s.

      Schedule

      • 15 February: green light, if all tests are OK we proceed. If there are unresolved problems we can postpone. Prepare OTGs.
      • 22 February: disable write access to CASTOR LHCb in preparation for migration. Check all queues are flushed and everything is written to tape.
      • 1 March: disable CASTOR LHCb. Migrate to CTA.
      • 8 March: EOSCTA LHCb in production

      To Do

      • Finish testing HTTP+TPC
      • DAQ functional test
      • Publish OTGs
    • 14:10 14:20
      Getting PUBLIC into Production 10m

      Status Update

      • AMS tests
      • NA61/SHINE: Meeting this week (probably Thu) to discuss DAQ storage
      • DUNE. See #213
      • NA62: Test with spinner space, see #72.
      • Repack of NA62 tapes is complete, Michael will check for any loose ends.
      • n_TOF: test offline workflow with spinner space.
      • COMPASS: Recall testing with spinner space.

      TO DO

      • "ALICE-like" spinner space on EOS PUBLIC PPS, to test with NA62 and n_TOF (#161)
      • Finish repack of public_user
      • Knowledge Base article for users to access files in CTA, see #214 Test and document workflow for retrieving user data from CTA (draft written)
      • Migrate data from legacy experiments (Aleph, Chorus, Delphi, Nomad, Opal, ...).
      • Bartek (NA61) is following up on issue of experiment data in CASTOR /user part of the namespace

      Schedule

      • 22 Feb ? : migrate NA62 to CTA
    • 14:20 14:30
      CTA Software 10m
      • #959 Cedric's departure preparation
      • CTA v4.0 deployment
      • Review of cta-admin output
    • 14:30 14:35
      AOB 5m