RAL Tier1 Experiments Liaison Meeting

Europe/London
Access Grid (RAL R89)

Zoom Meeting ID
66811541532
Host
Alastair Dewhurst
    • 13:30 13:34
      Site Operations 4m
    • 13:34 13:35
      Experiment Operational Issues 1m
    • 13:35 13:40
      ATLAS Operations Report 5m
      Speakers: Dr Brij Kishor Jashal (Rutherford Appleton Laboratory), Jyoti Prakash Biswal (Rutherford Appleton Laboratory)
    • 13:40 13:45
      CMS Operations Report 5m
      Speaker: Katy Ellis (Science and Technology Facilities Council STFC (GB))

      Production jobs and SAM tests: BAU. Nice and quiet over the Easter weekend.

      Just a couple of periods of SAM test failures on the AAA system; general improvements to this service are still outstanding.

      Issue with /store/unmerged/ on Echo not being cleaned up for all files: CMS keeps files awaiting Merge jobs in this 'directory'. These files are not managed by Rucio. We run Cleanup jobs, which delete unmerged files and typically work well at RAL. However, some files remain, and CMS has a mechanism to delete these after a certain period, once it has checked that they are no longer needed. That mechanism relies on ls of directories, which does not work on Echo. The test is always green though! Files have built up over the years; we can delete the majority of them. We are considering a long-term solution.
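A minimal sketch of the age-based selection described above, driven by a storage dump instead of a directory ls. Everything here is an assumption for illustration and not the CMS tooling: the dump line format (`<path> <mtime as Unix seconds>`), the function name `stale_unmerged`, and the `still_needed` set standing in for the "no longer needed" check.

```python
from datetime import datetime, timedelta, timezone

UNMERGED_PREFIX = "/store/unmerged/"


def stale_unmerged(dump_lines, now, max_age, still_needed=frozenset()):
    """Return unmerged paths older than max_age that are not still needed.

    Each dump line is assumed to look like '<path> <mtime as Unix seconds>'.
    """
    cutoff = now - max_age
    stale = []
    for line in dump_lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines in the dump
        path, mtime = line.rsplit(maxsplit=1)
        if not path.startswith(UNMERGED_PREFIX) or path in still_needed:
            continue  # outside the unmerged area, or still required
        if datetime.fromtimestamp(int(mtime), tz=timezone.utc) < cutoff:
            stale.append(path)
    return stale
```

The returned list is only a set of deletion candidates; a real cleanup would still need a separate check that each file is no longer required before anything is removed.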

      DC27: 50% of HL-LHC challenge. Proposed for last week of Feb and first week of March. 

    • 13:45 13:50
      LHCb Operations Report 5m
      Speaker: Alexander Rogovskiy (Rutherford Appleton Laboratory)
      • Quiet Easter break, no noticeable issues @RAL.
      • Corrupted Echo files found (GGUS:1002197)
        • Affected files re-replicated from other sites (where possible)
        • Cause of corruption for most of the user files (~270) understood: the user was uploading the std.out of the jobs
      • Resource confirmation request (GGUS:1002186)
        • Any news on the SRR update?
    • 13:50 13:55
      ALICE Operations Report 5m
      Speaker: Alexander Rogovskiy (Rutherford Appleton Laboratory)
    • 13:55 14:00
      LSST Operations Report 5m
      Speakers: Thomas Birkett, Timothy Noble (Science and Technology Facilities Council STFC (GB))
      • Mapping of LSST files in Echo using the dumps: LSST have left behind a lot of files that are not tracked by Rucio. We are producing file lists so that we can check when files are no longer needed and delete them
      • Repo (hsc_pdr2_multisite) was modified from the US side, but IngestD did not ingest the changes
        • Investigation led to an x509 issue
        • Cloud issue over the weekend delayed investigation
        • Issue with moving x509 into the pod; it was copied in with the correct permissions
          • Issue corrected and should not recur
      • After ingestion of the data new jobs:

       

      • 20 TB of RAW data moved to RAL from IN2P3
      • LSST is running data movement tests: RAL showed nearly 10GB/s, twice the mean speed of LANCS
        • One tenth of the speed of IN2P3
      • FTS transfers analysed for small files (200 bytes)
        • 42 seconds total transfer time from SLAC to RAL
        • 21 of those seconds were spent getting the checksum from SLAC
        • RAL checksum times were a fraction of that, or the same as SLAC's
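The file mapping in the first item of this report amounts to comparing a storage dump against the files Rucio knows about. A minimal sketch of that comparison follows, assuming both inputs are plain lists of paths with one path per entry; the function name `untracked_files` and the example paths are hypothetical and not taken from the LSST or RAL tooling.

```python
def untracked_files(dump_paths, rucio_paths):
    """Return paths that appear in the storage dump but not in the Rucio listing."""
    known = {p.strip() for p in rucio_paths if p.strip()}
    on_disk = {p.strip() for p in dump_paths if p.strip()}
    return sorted(on_disk - known)


# Hypothetical example paths, for illustration only.
dump = ["lsst/raw/file1.fits", "lsst/raw/file2.fits", "lsst/tmp/leftover.fits"]
rucio = ["lsst/raw/file1.fits", "lsst/raw/file2.fits"]
candidates = untracked_files(dump, rucio)
```

Files in `candidates` are only untracked, not necessarily unneeded, so the resulting list would still be reviewed before any deletion.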
    • 14:00 14:01
      Tier-1 Projects 1m
    • 14:02 14:07
      Antares Upgrade 5m
      Speakers: George Patargias, Thomas Byrne
    • 14:08 14:13
      XRootD Development 5m
      Speakers: Alexander Rogovskiy (Rutherford Appleton Laboratory), Jyothish Thomas (STFC)
    • 14:14 14:19
      Utilizing GPUs 5m
      Speakers: Dr Brij Kishor Jashal (Rutherford Appleton Laboratory), Thomas Birkett
    • 14:25 14:26
      AOB 1m
    • 14:27 14:36
      Summary of Operational Status and Issues 9m
      Speakers: Brian Davies (Science and Technology Facilities Council STFC (GB)), Darren Moore, Thomas Birkett
    • 14:45 14:50
      Any other Business 5m
      Speakers: Brian Davies (Science and Technology Facilities Council STFC (GB)), Darren Moore