Running NESTOR sometimes fails
When using spirit_intel
machine configuration.
On spirit:
Note (UNread): units not found for
SH
slurmstepd-spirit64-07: error: StepId=204519.0 exceeded memory limit (4802486272 > 4194304000), being killed
srun: Exceeded job memory limit
slurmstepd-spirit64-07: error: *** STEP 204519.0 ON spirit64-07 CANCELLED AT 2023-01-24T17:27:41 ***
srun: Job step aborted: Waiting up to 32 seconds for job step to finish.
slurmstepd-spirit64-07: error: StepId=204519.0 exceeded memory limit (4802486272 > 4194304000), being killed
srun: Exceeded job memory limit
srun: error: spirit64-07: task 0: Killed
Error while running NESTOR
On spiritx:
slurmstepd-spiritx64-5: error: StepId=138233.0 exceeded memory limit (4355932160 > 4194304000), being killed
srun: Exceeded job memory limit
srun: Job step aborted: Waiting up to 32 seconds for job step to finish.
slurmstepd-spiritx64-5: error: *** STEP 138233.0 ON spiritx64-5 CANCELLED AT 2023-01-24T17:28:32 ***
slurmstepd-spiritx64-5: error: StepId=138233.0 exceeded memory limit (4355932160 > 4194304000), being killed
srun: Exceeded job memory limit
srun: error: spiritx64-5: task 0: Killed
Error while running NESTOR