Remove default values in modes-slurm.sh
modes-slurm.sh currently contains default values for its --partition, --nodes and --cores parameters. This makes certain requests impossible, e.g. using all cores on all nodes of a specified partition without knowing or wanting to specify how many nodes and/or cores there are, or using all cores on the selected nodes, which may be inhomogeneous.
Example:
$ ./modes-slurm.sh ../database.modest --partition r930 --nodes 1 -E "RED=3" -M MA -N 100000000 -R Uniform
SLURM configuration:
- partition = r930
- nodes = 1
- cores = 8
- timeout-slurm =
Using mono binary /home/hartmannsa/mono/bin/mono
Using modes binary ./modes.exe
Allocating SLURM nodes for the modes slaves
SLURM job name: i9_modeslurm
Output file (server nodes): servers_i9_modeslurm.out
salloc: Pending job allocation 36489
salloc: job 36489 queued and waiting for resources
................salloc: job 36489 has been allocated resources
salloc: Granted job allocation 36489
.Running distributed modes in the slaves of the SLURM batch 36489
Host IP addresses: 130.89.7.57
modes invocation arguments: ../database.modest -E RED=3 -M MA -N 100000000 -R Uniform
(00:00) 0······
Preparing simulation...
database.modest: info: Using default value of 0.95 for the confidence parameter.
(00:00) 0·······
Connecting to hosts for RED=3...
database.modest: info: Connection established to host 130.89.7.57 (8 threads).
database.modest: info: All connections established (8 total threads).
Actual behaviour: only 8 cores are used.
Expected behaviour: all 192 cores of the r930 node (caserta) are used.
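One possible direction, as a minimal sketch only: pass an option to salloc only when the user actually supplied it, instead of falling back to a built-in default. The function name build_salloc_args is hypothetical and not taken from modes-slurm.sh. The sketch assumes that --cores would map to salloc's --cpus-per-task and that requesting whole nodes via --exclusive is acceptable when --cores is omitted. The real script may need a different mapping.

```shell
# Hypothetical helper, not part of modes-slurm.sh.
# Arguments: partition, nodes, cores (each may be an empty string).
# Prints the salloc options for exactly those values that were given.
build_salloc_args() {
    partition="$1"
    nodes="$2"
    cores="$3"
    args=""

    # No default partition: let SLURM choose if none was requested.
    if [ -n "$partition" ]; then
        args="$args --partition=$partition"
    fi

    # No default node count: let SLURM decide if none was requested.
    if [ -n "$nodes" ]; then
        args="$args --nodes=$nodes"
    fi

    if [ -n "$cores" ]; then
        # Assumption: --cores corresponds to CPUs per task.
        args="$args --cpus-per-task=$cores"
    else
        # Assumption: without --cores, request whole nodes.
        args="$args --exclusive"
    fi

    # Strip the leading space before printing.
    echo "${args# }"
}
```

With this, the invocation from the example above (partition r930, one node, no --cores) would yield the options --partition=r930 --nodes=1 --exclusive. The number of threads started on each host would then have to be taken from the allocation rather than from a fixed default of 8.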