* tools to perform data/MC comparison at trigger and reconstruction level: at trigger level there is JRA and JMRA in Jpp, do they miss any useful plots?. At reconstruction level, in which framework would be best to include it? Which are the variables to look at?
* Data/MC vs time
* tools to check the quality of the data and of the MC: for the data there is JRA, that can be used also for MC. For MC, some sanity checks are needed: how many jobs finished/crashed, is the number of generated events ok, is the can size ok, is the number of MC_hits reasonable, etc..
ARCA7/8 - DATA/MC comparisons (Vasileios):
Studied the effect of cuts on WR and BR events for ARCA6 and ARCA7 looking at atmospheric muon MC files.
Differences are found so the question is if they are expected. They should be if we only look at atmospheric muons. The differences should be due to the events reconstructed as up-going. To get an answer, it is also necessary to look at the neutrino files.
Another cause could be the dynamic calibration not available for ARCA6 and ARCA7 but only for ARCA8. It is suggested to see the effect of dynamic calibration versus static one.
JShowerFit DATA/MC (Chiara):
Update on JShowerFit processing (in which 3 steps have been automated) and DATA/MC comparison (better agreement with a small asymmetry for high-current runs). The work has been done on a ORCA6 test sample.
We need to split the files into subruns for the data and the atm. muon MC runs. Splitting and re-merging files can be problematic. There were not failures of the new procedure runnig on the test sample but checks are needed for the mass productions to prevent and take care of possible failures during splitting and merging.
Chaira suggests a set of safety checks to be done as intermediate steps during the processing.
JDAQSplit can fail if the file is not on irods, what to do to prevent?
problems with JSF if a subrun is empty, how to prevent?
suggestion to check if the output file is provided but also check the number of events in the file: a file can be present but contain an empty tree.
are all the files in the run list on irods? a part of the MC files may be missing (e.g. they cannot be processed because the calibrations are missing). Missing file are documented.
Running time remains high with an average duration of 12 hours. A speeding up of the code is expected which would also allow a reduction in the number of subruns, reducing the criticalities of splitting.
We have DQ tools to run in the pre-analysis phase that list runs based on run id.
It is also required to add step-by-step checks in the data_processing. Each code must provide the checks to be implemented,