CTA deployment meeting

Europe/Zurich
600/R-001 (CERN)

Michael Davis (CERN)

Getting LHCb into Production

  • Chris is asking for help debugging transfers to T1s, which are failing intermittently. We should open a ticket for each case so that we can track how it is resolved.
  • We will try to resolve any teething problems this week, then ask Chris to sign off on the migration. Any further issues will be handled in the normal way for production, by opening a ticket in ServiceNow.
  • In the next week or so, Michael will check on the status of LHCb DAQ integration and schedule when we will integrate and test FTS Archive Monitoring.
  • Also to be checked: what role T1s will have in the reprocessing campaign in May.
  • Cleaning up /castor/cern.ch/grid/test: at some point we will create an OTG announcing that this data will be deleted, and notify all experiment data managers.

Getting PUBLIC into Production

  • Main focus this week is NA62 migration, and continuing testing with DUNE and n_TOF.

CTA Software

  • Reviewed CTA tickets tagged "EOS".
  • We will continue reviewing CTA tickets over the next weeks in order to identify high-priority tasks and plan for the arrival of new fellows.

AOB

  • The potential stagiaire indicated they were not interested in a C++/build systems project. We won't pursue this further.
  • Aleph lost files: last week we discovered that some files are missing from Aleph. An initial investigation showed that the files were lost before 2015; our repack logs only go back that far, so there is no way to recover them. Subsequent investigation by Steve indicates that the files were probably lost before June 2001, shortly after they were imported from SHIFT. Oliver will follow up with German to see if he knows anything about this incident.
    • 14:00 - 14:10
      Getting LHCb into Production 10m

      Post Migration

      • T1 debugging
      • DAQ integration and testing, including FTS Archive Monitoring
      • April: LHCb will stage small amounts of data for validation prior to reprocessing, then start pre-staging for their reprocessing campaign which will begin in May.
      • Clean up /castor/cern.ch/grid/test?
    • 14:10 - 14:20
      Getting PUBLIC into Production 10m

      Status Updates: Experiments

      • n_TOF (spinner space) #143
      • COMPASS (spinner space) #69
      • DUNE: needs XRootD TPC to be configured on PUBLIC instance #213
      • AMS: auth issues #82
      • NA61/SHINE: setting up their DAQ system + storage. No schedule for testing yet.
      • TOTEM #277. 24 tapes from public_user need to be repacked. 0.5 PB of files in EOS need a tape copy. VM with SLC5 needs to access data on tape.
      • CLIC (Dirac) - will set up a meeting in April

      Status Updates: Other Use Cases

      • Repack of LEP-era data. A further 247 tapes need to be repacked (engineering and nomad_delayed) #240
      • UNOSAT: trying to clarify if data in CASTOR is needed
      • /afsmigration #282
      • CASTOR backup use cases

      TO DO

      • ACLs and workflow links for top-level directories
      • Finish repack of r_public_user
      • Migrate data from legacy experiments
      • Schedule next SME migrations after NA62

      Schedule

      • 17 March: migrate NA62 to CTA
    • 14:20 - 14:30
      CTA Software 10m
      • Vlado and Julien: highlight high-priority issues which need development effort
      • Broad review of open tickets to set priorities over the next few weeks, in preparation for the arrival of fellows from May onwards
    • 14:30 - 14:35
      AOB 5m
      • Stagiaire