This Partition is for the old DGX-Users, which have been migrated from M2 to MOGON-KI.
GPUs are only useable, when your Project requested them.
Billing Weights: CPU=1.5*Num Mem=0.25*GB GPU=10*Num
(ki)-smallcpu is only available when 1 Node is choosen. mogondoks
(ki)-smallcpu is the common queue for most users on MOGON, lowest bill overall.
Billing Weights: CPU=1.0*Num Mem=1*GB
(ki)-Parallel is an exclusive queue,
you must pay for all the resources of your allocated node,
even if you do not use them. mogondoks
Over 173 Nodes only 256GB per Node is available.
Billing Weights: CPU=128 Mem=1*256/512
(ki)-Longtime is a special queue for jobs,
that exceed the 6-day walltime limit of the other CPU partitions.
If your Job has less than 6 Days Walltime,
Slurm will not schedule your Job or accept your Script. mogondoks
Billing Weights: CPU=1.25*Num Mem=1.0*GB
(ki)-Largemem is a high memory queue,
for jobs that exceed the standard node's 512GB mem limit.
If your Job need more then 1TB RAM,
only Hugemem is available. mogondoks
If your Job requires less than 512GB,
Slurm will not schedule your Job or accept your Script.
Billing Weights: CPU=1.0*Num Mem=1.6*GB
(ki)-Hugemem is a high memory queue, for jobs that
exceed the 1TB mem limit of Largemem. mogondoks
If your Job requires less than 1TB,
Slurm will not schedule your Job or accept your Script.
Billing Weights: CPU=1.0*Num Mem=2.8*GB
For the MI250 Queue, you must compile your Application with SYCL/HIP/OpenCL
on the System with ROCm, not CUDA. mogondoks
GPUs are only useable, when your Project requested them.
The MI250 are Dual-GPUs, for best Performace often 2 MPI-Prozesses per GPU is best.
Billing Weights: CPU=1.0*Num Mem=1.5*GB GPU=9*Num
For the SmallGPU Queue with A40, load CUDA-Modules. mogondoks
GPUs are only useable, when your Project requested them.
Billing Weights: CPU=1.0*Num Mem=1.5*GB GPU=7*Num
For the A100DL Queue, load CUDA-Modules. mogondoks
GPUs are only useable, when your Project requested them.
Billing Weights: CPU=1.0*Num Mem=1.5*GB GPU=9*Num
For the A100AI Queue, load CUDA-Modules. mogondoks
GPUs are only useable, when your Project requested them.
Billing Weights: CPU=1.0*Num Mem=3.0*GB GPU=17*Num
Modules
Currently only Toolchain 2024a is supported with no logic.
Some combinations will fail, take attention !
Executable Commands
srun / mpirun
Use mpirun/mpiexec — NOT srun — for OpenMPI 5.x.x jobs.