Measure start and stop time of SAG-RECO
ACADA is required to re-start in less than 2 minutes. This requirement is not yet enforced but will have to be in the final delivered ACADA software. Many ACADA sub-systems are started in parallel, but we are probably one of the last as we're started by SAG-supervisor. Its not clear what our budget in the 2 minutes is going to be. Because we rely on slurm, we should make sure we don't get delays due to the scheduler too.
To get an idea of where we stand, we should measure the start and stop time of SAG-RECO, that is the time between the hiperta_stream_start
command and the R0-DL1
job running, for instance recording the the time before hiperta_stream_start
and have a script check the running jobs to get the time of R0-DL1
running. For shutdown we could measure the time between when we receive the shutdown command and the RECO process exiting.
We might also want to measure what happens with several hiperta_stream_start
command at the same time, which will happen when several telescope will be observing.