dpgen run machine parameters#
Note
One can load, modify, and export the input file by using our effective web-based tool DP-GUI online or hosted using the command line interface dpgen gui. All parameters below can be set in DP-GUI. By clicking “SAVE JSON”, one can download the input file.
- run_mdata:#
- type:
dictargument path:run_mdatamachine.json file
- api_version:#
- type:
str, optional, default:1.0argument path:run_mdata/api_versionPlease set to 1.0
- deepmd_version:#
- type:
str, optional, default:2argument path:run_mdata/deepmd_versionDeePMD-kit version, e.g. 2.1.3
- train:#
- type:
dictargument path:run_mdata/trainParameters of command, machine, and resources for train
- command:#
- type:
strargument path:run_mdata/train/commandCommand of a program.
- machine:#
- type:
dictargument path:run_mdata/train/machine- batch_type:#
- type:
strargument path:run_mdata/train/machine/batch_typeBatch backend used to execute jobs. Option: DistributedShell, Torque, Slurm, PBS, Bohrium, OpenAPI, SGE, LSF, Shell, Fugaku, JH_UniScheduler, SlurmJobArray
- local_root:#
- type:
str|NoneTypeargument path:run_mdata/train/machine/local_rootLocal project root used by DPDispatcher to find task directories and local files. If submission.work_base is a relative path, it is resolved inside this directory; if submission.work_base is absolute, it is used as-is and local_root is ignored.
- remote_root:#
- type:
str|NoneType, optionalargument path:run_mdata/train/machine/remote_rootRemote root directory used by non-local contexts such as SSH. DPDispatcher creates and uses a submission-specific working directory beneath this root on the remote side. For SSHContext, this path should be absolute.
- clean_asynchronously:#
- type:
bool, optional, default:Falseargument path:run_mdata/train/machine/clean_asynchronouslyClean the remote working directory asynchronously after the job finishes. Avoid enabling this while debugging, because it can remove remote artifacts before you inspect them.
- retry_count:#
- type:
int, optional, default:3argument path:run_mdata/train/machine/retry_countHow many times DPDispatcher will retry a failed job before raising an error.
Depending on the value of context_type, different sub args are accepted.
- context_type:#
- type:
str(flag key)argument path:run_mdata/train/machine/context_typepossible choices:OpenAPIContext,LazyLocalContext,LocalContext,HDFSContext,BohriumContext,SSHContextExecution context / connection type used to reach the execution environment. Option: OpenAPIContext, LocalContext, SSHContext, LazyLocalContext, HDFSContext, BohriumContext
When context_type is set to
OpenAPIContext(or its aliasesopenapicontext,OpenAPI,openapi):- remote_profile:#
- type:
dict, optionalargument path:run_mdata/train/machine[OpenAPIContext]/remote_profileThe information used to maintain the connection with remote machine. This field is empty for this context.
When context_type is set to
LazyLocalContext(or its aliaseslazylocalcontext,LazyLocal,lazylocal):- remote_profile:#
- type:
dict, optionalargument path:run_mdata/train/machine[LazyLocalContext]/remote_profileThe information used to maintain the connection with remote machine. This field is empty for this context.
When context_type is set to
LocalContext(or its aliaseslocalcontext,Local,local):- remote_profile:#
- type:
dict, optionalargument path:run_mdata/train/machine[LocalContext]/remote_profileOptions controlling how files are staged between local_root and remote_root when both paths are on the local filesystem.
- symlink:#
- type:
bool, optional, default:Trueargument path:run_mdata/train/machine[LocalContext]/remote_profile/symlinkWhether to use symbolic links instead of copying files from local_root into remote_root. Disable this when the execution side cannot access the original local path through the same filesystem view.
When context_type is set to
HDFSContext(or its aliaseshdfscontext,HDFS,hdfs):- remote_profile:#
- type:
dict, optionalargument path:run_mdata/train/machine[HDFSContext]/remote_profileThe information used to maintain the connection with remote machine. This field is empty for this context.
When context_type is set to
BohriumContext(or its aliasesbohriumcontext,Bohrium,bohrium,DpCloudServerContext,dpcloudservercontext,DpCloudServer,dpcloudserver,LebesgueContext,lebesguecontext,Lebesgue,lebesgue):- remote_profile:#
- type:
dictargument path:run_mdata/train/machine[BohriumContext]/remote_profileConfiguration for Bohrium submission, including login credentials, project selection, and job-handling behavior.
- email:#
- type:
str, optionalargument path:run_mdata/train/machine[BohriumContext]/remote_profile/emailEmail address used to log in to Bohrium.
- password:#
- type:
str, optionalargument path:run_mdata/train/machine[BohriumContext]/remote_profile/passwordPassword used together with email or phone login. If BOHR_TICKET is set, password-based login can be skipped.
- phone:#
- type:
str, optionalargument path:run_mdata/train/machine[BohriumContext]/remote_profile/phonePhone number used to log in when email is not used.
- program_id:#
- type:
int, alias: project_idargument path:run_mdata/train/machine[BohriumContext]/remote_profile/program_idProgram / project ID used to place uploaded jobs under the correct Bohrium project namespace.
- retry_count:#
- type:
NoneType|int, optional, default:2argument path:run_mdata/train/machine[BohriumContext]/remote_profile/retry_countHow many times a terminated remote job is retried on the platform side before giving up.
- ignore_exit_code:#
- type:
bool, optional, default:Trueargument path:run_mdata/train/machine[BohriumContext]/remote_profile/ignore_exit_codeWhether a non-zero exit code from the remote platform is still treated as finished. If False, such jobs are marked as terminated.
- keep_backup:#
- type:
bool, optionalargument path:run_mdata/train/machine[BohriumContext]/remote_profile/keep_backupWhether to keep uploaded/downloaded zip archives in the local backup directory after transfer.
- input_data:#
- type:
dictargument path:run_mdata/train/machine[BohriumContext]/remote_profile/input_dataPlatform-specific job configuration passed through to the Bohrium API.
When context_type is set to
SSHContext(or its aliasessshcontext,SSH,ssh):- remote_profile:#
- type:
dictargument path:run_mdata/train/machine[SSHContext]/remote_profileSSH connection settings for the remote machine, including authentication, timeouts, and optional proxy/jump-host behavior.
- hostname:#
- type:
strargument path:run_mdata/train/machine[SSHContext]/remote_profile/hostnameHostname or IP address of the SSH target machine.
- username:#
- type:
strargument path:run_mdata/train/machine[SSHContext]/remote_profile/usernameUsername used to log in to the target system.
- password:#
- type:
str, optionalargument path:run_mdata/train/machine[SSHContext]/remote_profile/password(deprecated) password of linux system. Please use SSH keys instead to improve security.
- port:#
- type:
int, optional, default:22argument path:run_mdata/train/machine[SSHContext]/remote_profile/portSSH port of the target machine. Usually 22.
- key_filename:#
- type:
str|NoneType, optional, default:Noneargument path:run_mdata/train/machine[SSHContext]/remote_profile/key_filenamePath to the private key file used for SSH authentication. If left None, DPDispatcher can try discoverable keys in ~/.ssh or fall back to password-based login if configured.
- passphrase:#
- type:
str|NoneType, optional, default:Noneargument path:run_mdata/train/machine[SSHContext]/remote_profile/passphrasePassphrase for the SSH private key, if the key is encrypted.
- timeout:#
- type:
int, optional, default:10argument path:run_mdata/train/machine[SSHContext]/remote_profile/timeoutTimeout in seconds for establishing the SSH connection.
- totp_secret:#
- type:
str|NoneType, optional, default:Noneargument path:run_mdata/train/machine[SSHContext]/remote_profile/totp_secretTime-based one-time-password secret used for keyboard-interactive 2FA. It should be a base32-encoded string.
- tar_compress:#
- type:
bool, optional, default:Trueargument path:run_mdata/train/machine[SSHContext]/remote_profile/tar_compressWhether upload/download tar archives are compressed. Keeping this True usually reduces transfer size at the cost of extra CPU time.
- look_for_keys:#
- type:
bool, optional, default:Trueargument path:run_mdata/train/machine[SSHContext]/remote_profile/look_for_keysWhether to search for discoverable private key files in ~/.ssh when key_filename is not provided.
- execute_command:#
- type:
str|NoneType, optional, default:Noneargument path:run_mdata/train/machine[SSHContext]/remote_profile/execute_commandOptional command executed immediately after the SSH connection is established.
- proxy_command:#
- type:
str|NoneType, optional, default:Noneargument path:run_mdata/train/machine[SSHContext]/remote_profile/proxy_commandOptional SSH ProxyCommand used to reach the target through an intermediate host or tunnel.
- resources:#
- type:
dictargument path:run_mdata/train/resources- number_node:#
- type:
int, optional, default:1argument path:run_mdata/train/resources/number_nodeNumber of nodes requested for each scheduler job generated by DPDispatcher.
- cpu_per_node:#
- type:
int, optional, default:1argument path:run_mdata/train/resources/cpu_per_nodeNumber of CPUs requested on each node for each scheduler job.
- gpu_per_node:#
- type:
int, optional, default:0argument path:run_mdata/train/resources/gpu_per_nodeNumber of GPUs requested on each node for each scheduler job.
- queue_name:#
- type:
str, optional, default: (empty string)argument path:run_mdata/train/resources/queue_nameQueue or partition name used by the selected batch system. For local Shell runs this is usually an empty string; for Slurm it typically maps to a partition.
- group_size:#
- type:
intargument path:run_mdata/train/resources/group_sizeHow many tasks are packed into one scheduler job. For example, 20 tasks with group_size=5 are typically split into 4 jobs. Use 1 for the simplest one-task workflow. 0 means no explicit upper limit in the grouping logic.
- custom_flags:#
- type:
typing.List[str], optionalargument path:run_mdata/train/resources/custom_flagsExtra scheduler-header lines inserted into the generated submission script, typically for backend-specific options that are not covered by the standard fields.
- strategy:#
- type:
dict, optionalargument path:run_mdata/train/resources/strategyStrategy options that affect how DPDispatcher generates and evaluates submission scripts.
- if_cuda_multi_devices:#
- type:
bool, optional, default:Falseargument path:run_mdata/train/resources/strategy/if_cuda_multi_devicesIf a node has multiple NVIDIA GPUs, assign different tasks inside the same job to different GPUs by setting CUDA_VISIBLE_DEVICES automatically. Usually used together with para_deg > 1 and task-level resource awareness.
- ratio_unfinished:#
- type:
float, optional, default:0.0argument path:run_mdata/train/resources/strategy/ratio_unfinishedMaximum fraction of tasks allowed to remain unfinished when evaluating job completion. Use 0.0 for the strict default that requires every task to finish.
- customized_script_header_template_file:#
- type:
str, optionalargument path:run_mdata/train/resources/strategy/customized_script_header_template_fileCustom template file for the scheduler-header portion of generated submission scripts. Overrides the default template.
- para_deg:#
- type:
int, optional, default:1argument path:run_mdata/train/resources/para_degHow many tasks inside one generated job are run in parallel. This is different from group_size: group_size controls how many tasks are bundled into a job, while para_deg controls concurrency within that job. Keep para_deg=1 for the safest default.
- source_list:#
- type:
typing.List[str], optional, default:[]argument path:run_mdata/train/resources/source_listShell scripts or environment files sourced before task commands run. Useful on HPC systems for activating software stacks explicitly instead of relying on login-shell defaults.
- module_purge:#
- type:
bool, optional, default:Falseargument path:run_mdata/train/resources/module_purgeWhether to run ‘module purge’ before applying module_unload_list and module_list. Mainly useful on HPC systems.
- module_unload_list:#
- type:
typing.List[str], optional, default:[]argument path:run_mdata/train/resources/module_unload_listModules to unload before loading the requested modules. Mainly relevant on HPC systems with environment modules.
- module_list:#
- type:
typing.List[str], optional, default:[]argument path:run_mdata/train/resources/module_listModules to load before executing tasks. Mainly relevant on HPC systems with environment modules.
- envs:#
- type:
dict, optional, default:{}argument path:run_mdata/train/resources/envsEnvironment variables exported before executing tasks.
- prepend_script:#
- type:
typing.List[str], optional, default:[]argument path:run_mdata/train/resources/prepend_scriptOptional shell lines inserted before task commands in the generated job script.
- append_script:#
- type:
typing.List[str], optional, default:[]argument path:run_mdata/train/resources/append_scriptOptional shell lines inserted after task commands in the generated job script.
- wait_time:#
- type:
float|int, optional, default:0argument path:run_mdata/train/resources/wait_timeDelay in seconds inserted after a job is submitted or resubmitted. Usually keep 0 unless the scheduler/site asks you to throttle submission pace.
Depending on the value of batch_type, different sub args are accepted.
- batch_type:#
- type:
str(flag key)argument path:run_mdata/train/resources/batch_typepossible choices:SlurmJobArray,JH_UniScheduler,SGE,Torque,Fugaku,Bohrium,LSF,DistributedShell,OpenAPI,PBS,Shell,SlurmThe batch job system type loaded from machine/batch_type.
When batch_type is set to
SlurmJobArray(or its aliasslurmjobarray):- kwargs:#
- type:
dict, optionalargument path:run_mdata/train/resources[SlurmJobArray]/kwargsSlurm-specific extra arguments.
- custom_gpu_line:#
- type:
str|NoneType, optional, default:Noneargument path:run_mdata/train/resources[SlurmJobArray]/kwargs/custom_gpu_lineCustom GPU header line starting with #SBATCH. When set, it overrides DPDispatcher’s default Slurm GPU line generated from gpu_per_node.
- slurm_job_size:#
- type:
int, optional, default:1argument path:run_mdata/train/resources[SlurmJobArray]/kwargs/slurm_job_sizeFor SlurmJobArray, how many DPDispatcher tasks are grouped into one array element / Slurm job script branch.
When batch_type is set to
JH_UniScheduler(or its aliasjh_unischeduler):- kwargs:#
- type:
dictargument path:run_mdata/train/resources[JH_UniScheduler]/kwargsJH_UniScheduler-specific extra arguments.
- custom_gpu_line:#
- type:
str|NoneType, optional, default:Noneargument path:run_mdata/train/resources[JH_UniScheduler]/kwargs/custom_gpu_lineCustom GPU header line starting with #JSUB. When set, it overrides the default UniScheduler GPU line generated from gpu_per_node.
When batch_type is set to
SGE(or its aliassge):- kwargs:#
- type:
dictargument path:run_mdata/train/resources[SGE]/kwargsSGE-specific extra arguments.
- pe_name:#
- type:
str, optional, default:mpi, alias: sge_pe_nameargument path:run_mdata/train/resources[SGE]/kwargs/pe_nameParallel environment name used by SGE, for example mpi. This controls the #$ -pe … header line in SGE mode.
- job_name:#
- type:
str, optional, default:wDPjobargument path:run_mdata/train/resources[SGE]/kwargs/job_nameJob name shown by SGE for this submission.
When batch_type is set to
Torque(or its aliastorque):- kwargs:#
- type:
dict, optionalargument path:run_mdata/train/resources[Torque]/kwargsThis field is empty for this batch.
When batch_type is set to
Fugaku(or its aliasfugaku):- kwargs:#
- type:
dict, optionalargument path:run_mdata/train/resources[Fugaku]/kwargsThis field is empty for this batch.
When batch_type is set to
Bohrium(or its aliasesbohrium,Lebesgue,lebesgue,DpCloudServer,dpcloudserver):- kwargs:#
- type:
dict, optionalargument path:run_mdata/train/resources[Bohrium]/kwargsThis field is empty for this batch.
When batch_type is set to
LSF(or its aliaslsf):- kwargs:#
- type:
dictargument path:run_mdata/train/resources[LSF]/kwargsLSF-specific extra arguments.
- gpu_usage:#
- type:
bool, optional, default:Falseargument path:run_mdata/train/resources[LSF]/kwargs/gpu_usageWhether DPDispatcher should emit an LSF GPU request line at all. If False, no GPU request header is added.
- gpu_new_syntax:#
- type:
bool, optional, default:Falseargument path:run_mdata/train/resources[LSF]/kwargs/gpu_new_syntaxWhether to use the newer #BSUB -gpu syntax instead of the older resource string syntax. This is typically used on newer LSF versions.
- gpu_exclusive:#
- type:
bool, optional, default:Trueargument path:run_mdata/train/resources[LSF]/kwargs/gpu_exclusiveOnly meaningful when gpu_new_syntax is enabled. Controls whether the submitted job requests GPUs in exclusive mode.
- custom_gpu_line:#
- type:
str|NoneType, optional, default:Noneargument path:run_mdata/train/resources[LSF]/kwargs/custom_gpu_lineCustom GPU header line starting with #BSUB. When set, it overrides the GPU-related LSF header generated from the other GPU kwargs.
When batch_type is set to
DistributedShell(or its aliasdistributedshell):- kwargs:#
- type:
dict, optionalargument path:run_mdata/train/resources[DistributedShell]/kwargsThis field is empty for this batch.
When batch_type is set to
OpenAPI(or its aliasopenapi):- kwargs:#
- type:
dict, optionalargument path:run_mdata/train/resources[OpenAPI]/kwargsThis field is empty for this batch.
When batch_type is set to
PBS(or its aliaspbs):- kwargs:#
- type:
dict, optionalargument path:run_mdata/train/resources[PBS]/kwargsThis field is empty for this batch.
When batch_type is set to
Shell(or its aliasshell):- kwargs:#
- type:
dict, optionalargument path:run_mdata/train/resources[Shell]/kwargsThis field is empty for this batch.
When batch_type is set to
Slurm(or its aliasslurm):- kwargs:#
- type:
dict, optionalargument path:run_mdata/train/resources[Slurm]/kwargsSlurm-specific extra arguments.
- custom_gpu_line:#
- type:
str|NoneType, optional, default:Noneargument path:run_mdata/train/resources[Slurm]/kwargs/custom_gpu_lineCustom GPU header line starting with #SBATCH. When set, it overrides DPDispatcher’s default Slurm GPU line generated from gpu_per_node.
- user_forward_files:#
- type:
list, optionalargument path:run_mdata/train/user_forward_filesFiles to be forwarded to the remote machine.
- user_backward_files:#
- type:
list, optionalargument path:run_mdata/train/user_backward_filesFiles to be backwarded from the remote machine.
- model_devi:#
- type:
dictargument path:run_mdata/model_deviParameters of command, machine, and resources for model_devi
- command:#
- type:
strargument path:run_mdata/model_devi/commandCommand of a program.
- machine:#
- type:
dictargument path:run_mdata/model_devi/machine- batch_type:#
- type:
strargument path:run_mdata/model_devi/machine/batch_typeBatch backend used to execute jobs. Option: DistributedShell, Torque, Slurm, PBS, Bohrium, OpenAPI, SGE, LSF, Shell, Fugaku, JH_UniScheduler, SlurmJobArray
- local_root:#
- type:
str|NoneTypeargument path:run_mdata/model_devi/machine/local_rootLocal project root used by DPDispatcher to find task directories and local files. If submission.work_base is a relative path, it is resolved inside this directory; if submission.work_base is absolute, it is used as-is and local_root is ignored.
- remote_root:#
- type:
str|NoneType, optionalargument path:run_mdata/model_devi/machine/remote_rootRemote root directory used by non-local contexts such as SSH. DPDispatcher creates and uses a submission-specific working directory beneath this root on the remote side. For SSHContext, this path should be absolute.
- clean_asynchronously:#
- type:
bool, optional, default:Falseargument path:run_mdata/model_devi/machine/clean_asynchronouslyClean the remote working directory asynchronously after the job finishes. Avoid enabling this while debugging, because it can remove remote artifacts before you inspect them.
- retry_count:#
- type:
int, optional, default:3argument path:run_mdata/model_devi/machine/retry_countHow many times DPDispatcher will retry a failed job before raising an error.
Depending on the value of context_type, different sub args are accepted.
- context_type:#
- type:
str(flag key)argument path:run_mdata/model_devi/machine/context_typepossible choices:OpenAPIContext,LazyLocalContext,LocalContext,HDFSContext,BohriumContext,SSHContextExecution context / connection type used to reach the execution environment. Option: OpenAPIContext, LocalContext, SSHContext, LazyLocalContext, HDFSContext, BohriumContext
When context_type is set to
OpenAPIContext(or its aliasesopenapicontext,OpenAPI,openapi):- remote_profile:#
- type:
dict, optionalargument path:run_mdata/model_devi/machine[OpenAPIContext]/remote_profileThe information used to maintain the connection with remote machine. This field is empty for this context.
When context_type is set to
LazyLocalContext(or its aliaseslazylocalcontext,LazyLocal,lazylocal):- remote_profile:#
- type:
dict, optionalargument path:run_mdata/model_devi/machine[LazyLocalContext]/remote_profileThe information used to maintain the connection with remote machine. This field is empty for this context.
When context_type is set to
LocalContext(or its aliaseslocalcontext,Local,local):- remote_profile:#
- type:
dict, optionalargument path:run_mdata/model_devi/machine[LocalContext]/remote_profileOptions controlling how files are staged between local_root and remote_root when both paths are on the local filesystem.
- symlink:#
- type:
bool, optional, default:Trueargument path:run_mdata/model_devi/machine[LocalContext]/remote_profile/symlinkWhether to use symbolic links instead of copying files from local_root into remote_root. Disable this when the execution side cannot access the original local path through the same filesystem view.
When context_type is set to
HDFSContext(or its aliaseshdfscontext,HDFS,hdfs):- remote_profile:#
- type:
dict, optionalargument path:run_mdata/model_devi/machine[HDFSContext]/remote_profileThe information used to maintain the connection with remote machine. This field is empty for this context.
When context_type is set to
BohriumContext(or its aliasesbohriumcontext,Bohrium,bohrium,DpCloudServerContext,dpcloudservercontext,DpCloudServer,dpcloudserver,LebesgueContext,lebesguecontext,Lebesgue,lebesgue):- remote_profile:#
- type:
dictargument path:run_mdata/model_devi/machine[BohriumContext]/remote_profileConfiguration for Bohrium submission, including login credentials, project selection, and job-handling behavior.
- email:#
- type:
str, optionalargument path:run_mdata/model_devi/machine[BohriumContext]/remote_profile/emailEmail address used to log in to Bohrium.
- password:#
- type:
str, optionalargument path:run_mdata/model_devi/machine[BohriumContext]/remote_profile/passwordPassword used together with email or phone login. If BOHR_TICKET is set, password-based login can be skipped.
- phone:#
- type:
str, optionalargument path:run_mdata/model_devi/machine[BohriumContext]/remote_profile/phonePhone number used to log in when email is not used.
- program_id:#
- type:
int, alias: project_idargument path:run_mdata/model_devi/machine[BohriumContext]/remote_profile/program_idProgram / project ID used to place uploaded jobs under the correct Bohrium project namespace.
- retry_count:#
- type:
NoneType|int, optional, default:2argument path:run_mdata/model_devi/machine[BohriumContext]/remote_profile/retry_countHow many times a terminated remote job is retried on the platform side before giving up.
- ignore_exit_code:#
- type:
bool, optional, default:Trueargument path:run_mdata/model_devi/machine[BohriumContext]/remote_profile/ignore_exit_codeWhether a non-zero exit code from the remote platform is still treated as finished. If False, such jobs are marked as terminated.
- keep_backup:#
- type:
bool, optionalargument path:run_mdata/model_devi/machine[BohriumContext]/remote_profile/keep_backupWhether to keep uploaded/downloaded zip archives in the local backup directory after transfer.
- input_data:#
- type:
dictargument path:run_mdata/model_devi/machine[BohriumContext]/remote_profile/input_dataPlatform-specific job configuration passed through to the Bohrium API.
When context_type is set to
SSHContext(or its aliasessshcontext,SSH,ssh):- remote_profile:#
- type:
dictargument path:run_mdata/model_devi/machine[SSHContext]/remote_profileSSH connection settings for the remote machine, including authentication, timeouts, and optional proxy/jump-host behavior.
- hostname:#
- type:
strargument path:run_mdata/model_devi/machine[SSHContext]/remote_profile/hostnameHostname or IP address of the SSH target machine.
- username:#
- type:
strargument path:run_mdata/model_devi/machine[SSHContext]/remote_profile/usernameUsername used to log in to the target system.
- password:#
- type:
str, optionalargument path:run_mdata/model_devi/machine[SSHContext]/remote_profile/password(deprecated) password of linux system. Please use SSH keys instead to improve security.
- port:#
- type:
int, optional, default:22argument path:run_mdata/model_devi/machine[SSHContext]/remote_profile/portSSH port of the target machine. Usually 22.
- key_filename:#
- type:
str|NoneType, optional, default:Noneargument path:run_mdata/model_devi/machine[SSHContext]/remote_profile/key_filenamePath to the private key file used for SSH authentication. If left None, DPDispatcher can try discoverable keys in ~/.ssh or fall back to password-based login if configured.
- passphrase:#
- type:
str|NoneType, optional, default:Noneargument path:run_mdata/model_devi/machine[SSHContext]/remote_profile/passphrasePassphrase for the SSH private key, if the key is encrypted.
- timeout:#
- type:
int, optional, default:10argument path:run_mdata/model_devi/machine[SSHContext]/remote_profile/timeoutTimeout in seconds for establishing the SSH connection.
- totp_secret:#
- type:
str|NoneType, optional, default:Noneargument path:run_mdata/model_devi/machine[SSHContext]/remote_profile/totp_secretTime-based one-time-password secret used for keyboard-interactive 2FA. It should be a base32-encoded string.
- tar_compress:#
- type:
bool, optional, default:Trueargument path:run_mdata/model_devi/machine[SSHContext]/remote_profile/tar_compressWhether upload/download tar archives are compressed. Keeping this True usually reduces transfer size at the cost of extra CPU time.
- look_for_keys:#
- type:
bool, optional, default:Trueargument path:run_mdata/model_devi/machine[SSHContext]/remote_profile/look_for_keysWhether to search for discoverable private key files in ~/.ssh when key_filename is not provided.
- execute_command:#
- type:
str|NoneType, optional, default:Noneargument path:run_mdata/model_devi/machine[SSHContext]/remote_profile/execute_commandOptional command executed immediately after the SSH connection is established.
- proxy_command:#
- type:
str|NoneType, optional, default:Noneargument path:run_mdata/model_devi/machine[SSHContext]/remote_profile/proxy_commandOptional SSH ProxyCommand used to reach the target through an intermediate host or tunnel.
- resources:#
- type:
dictargument path:run_mdata/model_devi/resources- number_node:#
- type:
int, optional, default:1argument path:run_mdata/model_devi/resources/number_nodeNumber of nodes requested for each scheduler job generated by DPDispatcher.
- cpu_per_node:#
- type:
int, optional, default:1argument path:run_mdata/model_devi/resources/cpu_per_nodeNumber of CPUs requested on each node for each scheduler job.
- gpu_per_node:#
- type:
int, optional, default:0argument path:run_mdata/model_devi/resources/gpu_per_nodeNumber of GPUs requested on each node for each scheduler job.
- queue_name:#
- type:
str, optional, default: (empty string)argument path:run_mdata/model_devi/resources/queue_nameQueue or partition name used by the selected batch system. For local Shell runs this is usually an empty string; for Slurm it typically maps to a partition.
- group_size:#
- type:
intargument path:run_mdata/model_devi/resources/group_sizeHow many tasks are packed into one scheduler job. For example, 20 tasks with group_size=5 are typically split into 4 jobs. Use 1 for the simplest one-task workflow. 0 means no explicit upper limit in the grouping logic.
- custom_flags:#
- type:
typing.List[str], optionalargument path:run_mdata/model_devi/resources/custom_flagsExtra scheduler-header lines inserted into the generated submission script, typically for backend-specific options that are not covered by the standard fields.
- strategy:#
- type:
dict, optionalargument path:run_mdata/model_devi/resources/strategyStrategy options that affect how DPDispatcher generates and evaluates submission scripts.
- if_cuda_multi_devices:#
- type:
bool, optional, default:Falseargument path:run_mdata/model_devi/resources/strategy/if_cuda_multi_devicesIf a node has multiple NVIDIA GPUs, assign different tasks inside the same job to different GPUs by setting CUDA_VISIBLE_DEVICES automatically. Usually used together with para_deg > 1 and task-level resource awareness.
- ratio_unfinished:#
- type:
float, optional, default:0.0argument path:run_mdata/model_devi/resources/strategy/ratio_unfinishedMaximum fraction of tasks allowed to remain unfinished when evaluating job completion. Use 0.0 for the strict default that requires every task to finish.
- customized_script_header_template_file:#
- type:
str, optionalargument path:run_mdata/model_devi/resources/strategy/customized_script_header_template_fileCustom template file for the scheduler-header portion of generated submission scripts. Overrides the default template.
- para_deg:#
- type:
int, optional, default:1argument path:run_mdata/model_devi/resources/para_degHow many tasks inside one generated job are run in parallel. This is different from group_size: group_size controls how many tasks are bundled into a job, while para_deg controls concurrency within that job. Keep para_deg=1 for the safest default.
- source_list:#
- type:
typing.List[str], optional, default:[]argument path:run_mdata/model_devi/resources/source_listShell scripts or environment files sourced before task commands run. Useful on HPC systems for activating software stacks explicitly instead of relying on login-shell defaults.
- module_purge:#
- type:
bool, optional, default:Falseargument path:run_mdata/model_devi/resources/module_purgeWhether to run ‘module purge’ before applying module_unload_list and module_list. Mainly useful on HPC systems.
- module_unload_list:#
- type:
typing.List[str], optional, default:[]argument path:run_mdata/model_devi/resources/module_unload_listModules to unload before loading the requested modules. Mainly relevant on HPC systems with environment modules.
- module_list:#
- type:
typing.List[str], optional, default:[]argument path:run_mdata/model_devi/resources/module_listModules to load before executing tasks. Mainly relevant on HPC systems with environment modules.
- envs:#
- type:
dict, optional, default:{}argument path:run_mdata/model_devi/resources/envsEnvironment variables exported before executing tasks.
- prepend_script:#
- type:
typing.List[str], optional, default:[]argument path:run_mdata/model_devi/resources/prepend_scriptOptional shell lines inserted before task commands in the generated job script.
- append_script:#
- type:
typing.List[str], optional, default:[]argument path:run_mdata/model_devi/resources/append_scriptOptional shell lines inserted after task commands in the generated job script.
- wait_time:#
- type:
float|int, optional, default:0argument path:run_mdata/model_devi/resources/wait_timeDelay in seconds inserted after a job is submitted or resubmitted. Usually keep 0 unless the scheduler/site asks you to throttle submission pace.
Depending on the value of batch_type, different sub args are accepted.
- batch_type:#
- type:
str(flag key)argument path:run_mdata/model_devi/resources/batch_typepossible choices:SlurmJobArray,JH_UniScheduler,SGE,Torque,Fugaku,Bohrium,LSF,DistributedShell,OpenAPI,PBS,Shell,SlurmThe batch job system type loaded from machine/batch_type.
When batch_type is set to
SlurmJobArray(or its aliasslurmjobarray):- kwargs:#
- type:
dict, optionalargument path:run_mdata/model_devi/resources[SlurmJobArray]/kwargsSlurm-specific extra arguments.
- custom_gpu_line:#
- type:
str|NoneType, optional, default:Noneargument path:run_mdata/model_devi/resources[SlurmJobArray]/kwargs/custom_gpu_lineCustom GPU header line starting with #SBATCH. When set, it overrides DPDispatcher’s default Slurm GPU line generated from gpu_per_node.
- slurm_job_size:#
- type:
int, optional, default:1argument path:run_mdata/model_devi/resources[SlurmJobArray]/kwargs/slurm_job_sizeFor SlurmJobArray, how many DPDispatcher tasks are grouped into one array element / Slurm job script branch.
When batch_type is set to
JH_UniScheduler(or its aliasjh_unischeduler):- kwargs:#
- type:
dictargument path:run_mdata/model_devi/resources[JH_UniScheduler]/kwargsJH_UniScheduler-specific extra arguments.
- custom_gpu_line:#
- type:
str|NoneType, optional, default:Noneargument path:run_mdata/model_devi/resources[JH_UniScheduler]/kwargs/custom_gpu_lineCustom GPU header line starting with #JSUB. When set, it overrides the default UniScheduler GPU line generated from gpu_per_node.
When batch_type is set to
SGE(or its aliassge):- kwargs:#
- type:
dictargument path:run_mdata/model_devi/resources[SGE]/kwargsSGE-specific extra arguments.
- pe_name:#
- type:
str, optional, default:mpi, alias: sge_pe_nameargument path:run_mdata/model_devi/resources[SGE]/kwargs/pe_nameParallel environment name used by SGE, for example mpi. This controls the #$ -pe … header line in SGE mode.
- job_name:#
- type:
str, optional, default:wDPjobargument path:run_mdata/model_devi/resources[SGE]/kwargs/job_nameJob name shown by SGE for this submission.
When batch_type is set to
Torque(or its aliastorque):- kwargs:#
- type:
dict, optionalargument path:run_mdata/model_devi/resources[Torque]/kwargsThis field is empty for this batch.
When batch_type is set to
Fugaku(or its aliasfugaku):- kwargs:#
- type:
dict, optionalargument path:run_mdata/model_devi/resources[Fugaku]/kwargsThis field is empty for this batch.
When batch_type is set to
Bohrium(or its aliasesbohrium,Lebesgue,lebesgue,DpCloudServer,dpcloudserver):- kwargs:#
- type:
dict, optionalargument path:run_mdata/model_devi/resources[Bohrium]/kwargsThis field is empty for this batch.
When batch_type is set to
LSF(or its aliaslsf):- kwargs:#
- type:
dictargument path:run_mdata/model_devi/resources[LSF]/kwargsLSF-specific extra arguments.
- gpu_usage:#
- type:
bool, optional, default:Falseargument path:run_mdata/model_devi/resources[LSF]/kwargs/gpu_usageWhether DPDispatcher should emit an LSF GPU request line at all. If False, no GPU request header is added.
- gpu_new_syntax:#
- type:
bool, optional, default:Falseargument path:run_mdata/model_devi/resources[LSF]/kwargs/gpu_new_syntaxWhether to use the newer #BSUB -gpu syntax instead of the older resource string syntax. This is typically used on newer LSF versions.
- gpu_exclusive:#
- type:
bool, optional, default:Trueargument path:run_mdata/model_devi/resources[LSF]/kwargs/gpu_exclusiveOnly meaningful when gpu_new_syntax is enabled. Controls whether the submitted job requests GPUs in exclusive mode.
- custom_gpu_line:#
- type:
str|NoneType, optional, default:Noneargument path:run_mdata/model_devi/resources[LSF]/kwargs/custom_gpu_lineCustom GPU header line starting with #BSUB. When set, it overrides the GPU-related LSF header generated from the other GPU kwargs.
When batch_type is set to
DistributedShell(or its aliasdistributedshell):- kwargs:#
- type:
dict, optionalargument path:run_mdata/model_devi/resources[DistributedShell]/kwargsThis field is empty for this batch.
When batch_type is set to
OpenAPI(or its aliasopenapi):- kwargs:#
- type:
dict, optionalargument path:run_mdata/model_devi/resources[OpenAPI]/kwargsThis field is empty for this batch.
When batch_type is set to
PBS(or its aliaspbs):- kwargs:#
- type:
dict, optionalargument path:run_mdata/model_devi/resources[PBS]/kwargsThis field is empty for this batch.
When batch_type is set to
Shell(or its aliasshell):- kwargs:#
- type:
dict, optionalargument path:run_mdata/model_devi/resources[Shell]/kwargsThis field is empty for this batch.
When batch_type is set to
Slurm(or its aliasslurm):- kwargs:#
- type:
dict, optionalargument path:run_mdata/model_devi/resources[Slurm]/kwargsSlurm-specific extra arguments.
- custom_gpu_line:#
- type:
str|NoneType, optional, default:Noneargument path:run_mdata/model_devi/resources[Slurm]/kwargs/custom_gpu_lineCustom GPU header line starting with #SBATCH. When set, it overrides DPDispatcher’s default Slurm GPU line generated from gpu_per_node.
- user_forward_files:#
- type:
list, optionalargument path:run_mdata/model_devi/user_forward_filesFiles to be forwarded to the remote machine.
- user_backward_files:#
- type:
list, optionalargument path:run_mdata/model_devi/user_backward_filesFiles to be backwarded from the remote machine.
- fp:#
- type:
dictargument path:run_mdata/fpParameters of command, machine, and resources for fp
- command:#
- type:
strargument path:run_mdata/fp/commandCommand of a program.
- machine:#
- type:
dictargument path:run_mdata/fp/machine- batch_type:#
- type:
strargument path:run_mdata/fp/machine/batch_typeBatch backend used to execute jobs. Option: DistributedShell, Torque, Slurm, PBS, Bohrium, OpenAPI, SGE, LSF, Shell, Fugaku, JH_UniScheduler, SlurmJobArray
- local_root:#
- type:
str|NoneTypeargument path:run_mdata/fp/machine/local_rootLocal project root used by DPDispatcher to find task directories and local files. If submission.work_base is a relative path, it is resolved inside this directory; if submission.work_base is absolute, it is used as-is and local_root is ignored.
- remote_root:#
- type:
str|NoneType, optionalargument path:run_mdata/fp/machine/remote_rootRemote root directory used by non-local contexts such as SSH. DPDispatcher creates and uses a submission-specific working directory beneath this root on the remote side. For SSHContext, this path should be absolute.
- clean_asynchronously:#
- type:
bool, optional, default:Falseargument path:run_mdata/fp/machine/clean_asynchronouslyClean the remote working directory asynchronously after the job finishes. Avoid enabling this while debugging, because it can remove remote artifacts before you inspect them.
- retry_count:#
- type:
int, optional, default:3argument path:run_mdata/fp/machine/retry_countHow many times DPDispatcher will retry a failed job before raising an error.
Depending on the value of context_type, different sub args are accepted.
- context_type:#
- type:
str(flag key)argument path:run_mdata/fp/machine/context_typepossible choices:OpenAPIContext,LazyLocalContext,LocalContext,HDFSContext,BohriumContext,SSHContextExecution context / connection type used to reach the execution environment. Option: OpenAPIContext, LocalContext, SSHContext, LazyLocalContext, HDFSContext, BohriumContext
When context_type is set to
OpenAPIContext(or its aliasesopenapicontext,OpenAPI,openapi):- remote_profile:#
- type:
dict, optionalargument path:run_mdata/fp/machine[OpenAPIContext]/remote_profileThe information used to maintain the connection with remote machine. This field is empty for this context.
When context_type is set to
LazyLocalContext(or its aliaseslazylocalcontext,LazyLocal,lazylocal):- remote_profile:#
- type:
dict, optionalargument path:run_mdata/fp/machine[LazyLocalContext]/remote_profileThe information used to maintain the connection with remote machine. This field is empty for this context.
When context_type is set to
LocalContext(or its aliaseslocalcontext,Local,local):- remote_profile:#
- type:
dict, optionalargument path:run_mdata/fp/machine[LocalContext]/remote_profileOptions controlling how files are staged between local_root and remote_root when both paths are on the local filesystem.
- symlink:#
- type:
bool, optional, default:Trueargument path:run_mdata/fp/machine[LocalContext]/remote_profile/symlinkWhether to use symbolic links instead of copying files from local_root into remote_root. Disable this when the execution side cannot access the original local path through the same filesystem view.
When context_type is set to
HDFSContext(or its aliaseshdfscontext,HDFS,hdfs):- remote_profile:#
- type:
dict, optionalargument path:run_mdata/fp/machine[HDFSContext]/remote_profileThe information used to maintain the connection with remote machine. This field is empty for this context.
When context_type is set to
BohriumContext(or its aliasesbohriumcontext,Bohrium,bohrium,DpCloudServerContext,dpcloudservercontext,DpCloudServer,dpcloudserver,LebesgueContext,lebesguecontext,Lebesgue,lebesgue):- remote_profile:#
- type:
dictargument path:run_mdata/fp/machine[BohriumContext]/remote_profileConfiguration for Bohrium submission, including login credentials, project selection, and job-handling behavior.
- email:#
- type:
str, optionalargument path:run_mdata/fp/machine[BohriumContext]/remote_profile/emailEmail address used to log in to Bohrium.
- password:#
- type:
str, optionalargument path:run_mdata/fp/machine[BohriumContext]/remote_profile/passwordPassword used together with email or phone login. If BOHR_TICKET is set, password-based login can be skipped.
- phone:#
- type:
str, optionalargument path:run_mdata/fp/machine[BohriumContext]/remote_profile/phonePhone number used to log in when email is not used.
- program_id:#
- type:
int, alias: project_idargument path:run_mdata/fp/machine[BohriumContext]/remote_profile/program_idProgram / project ID used to place uploaded jobs under the correct Bohrium project namespace.
- retry_count:#
- type:
NoneType|int, optional, default:2argument path:run_mdata/fp/machine[BohriumContext]/remote_profile/retry_countHow many times a terminated remote job is retried on the platform side before giving up.
- ignore_exit_code:#
- type:
bool, optional, default:Trueargument path:run_mdata/fp/machine[BohriumContext]/remote_profile/ignore_exit_codeWhether a non-zero exit code from the remote platform is still treated as finished. If False, such jobs are marked as terminated.
- keep_backup:#
- type:
bool, optionalargument path:run_mdata/fp/machine[BohriumContext]/remote_profile/keep_backupWhether to keep uploaded/downloaded zip archives in the local backup directory after transfer.
- input_data:#
- type:
dictargument path:run_mdata/fp/machine[BohriumContext]/remote_profile/input_dataPlatform-specific job configuration passed through to the Bohrium API.
When context_type is set to
SSHContext(or its aliasessshcontext,SSH,ssh):- remote_profile:#
- type:
dictargument path:run_mdata/fp/machine[SSHContext]/remote_profileSSH connection settings for the remote machine, including authentication, timeouts, and optional proxy/jump-host behavior.
- hostname:#
- type:
strargument path:run_mdata/fp/machine[SSHContext]/remote_profile/hostnameHostname or IP address of the SSH target machine.
- username:#
- type:
strargument path:run_mdata/fp/machine[SSHContext]/remote_profile/usernameUsername used to log in to the target system.
- password:#
- type:
str, optionalargument path:run_mdata/fp/machine[SSHContext]/remote_profile/password(deprecated) password of linux system. Please use SSH keys instead to improve security.
- port:#
- type:
int, optional, default:22argument path:run_mdata/fp/machine[SSHContext]/remote_profile/portSSH port of the target machine. Usually 22.
- key_filename:#
- type:
str|NoneType, optional, default:Noneargument path:run_mdata/fp/machine[SSHContext]/remote_profile/key_filenamePath to the private key file used for SSH authentication. If left None, DPDispatcher can try discoverable keys in ~/.ssh or fall back to password-based login if configured.
- passphrase:#
- type:
str|NoneType, optional, default:Noneargument path:run_mdata/fp/machine[SSHContext]/remote_profile/passphrasePassphrase for the SSH private key, if the key is encrypted.
- timeout:#
- type:
int, optional, default:10argument path:run_mdata/fp/machine[SSHContext]/remote_profile/timeoutTimeout in seconds for establishing the SSH connection.
- totp_secret:#
- type:
str|NoneType, optional, default:Noneargument path:run_mdata/fp/machine[SSHContext]/remote_profile/totp_secretTime-based one-time-password secret used for keyboard-interactive 2FA. It should be a base32-encoded string.
- tar_compress:#
- type:
bool, optional, default:Trueargument path:run_mdata/fp/machine[SSHContext]/remote_profile/tar_compressWhether upload/download tar archives are compressed. Keeping this True usually reduces transfer size at the cost of extra CPU time.
- look_for_keys:#
- type:
bool, optional, default:Trueargument path:run_mdata/fp/machine[SSHContext]/remote_profile/look_for_keysWhether to search for discoverable private key files in ~/.ssh when key_filename is not provided.
- execute_command:#
- type:
str|NoneType, optional, default:Noneargument path:run_mdata/fp/machine[SSHContext]/remote_profile/execute_commandOptional command executed immediately after the SSH connection is established.
- proxy_command:#
- type:
str|NoneType, optional, default:Noneargument path:run_mdata/fp/machine[SSHContext]/remote_profile/proxy_commandOptional SSH ProxyCommand used to reach the target through an intermediate host or tunnel.
- resources:#
- type:
dictargument path:run_mdata/fp/resources- number_node:#
- type:
int, optional, default:1argument path:run_mdata/fp/resources/number_nodeNumber of nodes requested for each scheduler job generated by DPDispatcher.
- cpu_per_node:#
- type:
int, optional, default:1argument path:run_mdata/fp/resources/cpu_per_nodeNumber of CPUs requested on each node for each scheduler job.
- gpu_per_node:#
- type:
int, optional, default:0argument path:run_mdata/fp/resources/gpu_per_nodeNumber of GPUs requested on each node for each scheduler job.
- queue_name:#
- type:
str, optional, default: (empty string)argument path:run_mdata/fp/resources/queue_nameQueue or partition name used by the selected batch system. For local Shell runs this is usually an empty string; for Slurm it typically maps to a partition.
- group_size:#
- type:
intargument path:run_mdata/fp/resources/group_sizeHow many tasks are packed into one scheduler job. For example, 20 tasks with group_size=5 are typically split into 4 jobs. Use 1 for the simplest one-task workflow. 0 means no explicit upper limit in the grouping logic.
- custom_flags:#
- type:
typing.List[str], optionalargument path:run_mdata/fp/resources/custom_flagsExtra scheduler-header lines inserted into the generated submission script, typically for backend-specific options that are not covered by the standard fields.
- strategy:#
- type:
dict, optionalargument path:run_mdata/fp/resources/strategyStrategy options that affect how DPDispatcher generates and evaluates submission scripts.
- if_cuda_multi_devices:#
- type:
bool, optional, default:Falseargument path:run_mdata/fp/resources/strategy/if_cuda_multi_devicesIf a node has multiple NVIDIA GPUs, assign different tasks inside the same job to different GPUs by setting CUDA_VISIBLE_DEVICES automatically. Usually used together with para_deg > 1 and task-level resource awareness.
- ratio_unfinished:#
- type:
float, optional, default:0.0argument path:run_mdata/fp/resources/strategy/ratio_unfinishedMaximum fraction of tasks allowed to remain unfinished when evaluating job completion. Use 0.0 for the strict default that requires every task to finish.
- customized_script_header_template_file:#
- type:
str, optionalargument path:run_mdata/fp/resources/strategy/customized_script_header_template_fileCustom template file for the scheduler-header portion of generated submission scripts. Overrides the default template.
- para_deg:#
- type:
int, optional, default:1argument path:run_mdata/fp/resources/para_degHow many tasks inside one generated job are run in parallel. This is different from group_size: group_size controls how many tasks are bundled into a job, while para_deg controls concurrency within that job. Keep para_deg=1 for the safest default.
- source_list:#
- type:
typing.List[str], optional, default:[]argument path:run_mdata/fp/resources/source_listShell scripts or environment files sourced before task commands run. Useful on HPC systems for activating software stacks explicitly instead of relying on login-shell defaults.
- module_purge:#
- type:
bool, optional, default:Falseargument path:run_mdata/fp/resources/module_purgeWhether to run ‘module purge’ before applying module_unload_list and module_list. Mainly useful on HPC systems.
- module_unload_list:#
- type:
typing.List[str], optional, default:[]argument path:run_mdata/fp/resources/module_unload_listModules to unload before loading the requested modules. Mainly relevant on HPC systems with environment modules.
- module_list:#
- type:
typing.List[str], optional, default:[]argument path:run_mdata/fp/resources/module_listModules to load before executing tasks. Mainly relevant on HPC systems with environment modules.
- envs:#
- type:
dict, optional, default:{}argument path:run_mdata/fp/resources/envsEnvironment variables exported before executing tasks.
- prepend_script:#
- type:
typing.List[str], optional, default:[]argument path:run_mdata/fp/resources/prepend_scriptOptional shell lines inserted before task commands in the generated job script.
- append_script:#
- type:
typing.List[str], optional, default:[]argument path:run_mdata/fp/resources/append_scriptOptional shell lines inserted after task commands in the generated job script.
- wait_time:#
- type:
float|int, optional, default:0argument path:run_mdata/fp/resources/wait_timeDelay in seconds inserted after a job is submitted or resubmitted. Usually keep 0 unless the scheduler/site asks you to throttle submission pace.
Depending on the value of batch_type, different sub args are accepted.
- batch_type:#
- type:
str(flag key)argument path:run_mdata/fp/resources/batch_typepossible choices:SlurmJobArray,JH_UniScheduler,SGE,Torque,Fugaku,Bohrium,LSF,DistributedShell,OpenAPI,PBS,Shell,SlurmThe batch job system type loaded from machine/batch_type.
When batch_type is set to
SlurmJobArray(or its aliasslurmjobarray):- kwargs:#
- type:
dict, optionalargument path:run_mdata/fp/resources[SlurmJobArray]/kwargsSlurm-specific extra arguments.
- custom_gpu_line:#
- type:
str|NoneType, optional, default:Noneargument path:run_mdata/fp/resources[SlurmJobArray]/kwargs/custom_gpu_lineCustom GPU header line starting with #SBATCH. When set, it overrides DPDispatcher’s default Slurm GPU line generated from gpu_per_node.
- slurm_job_size:#
- type:
int, optional, default:1argument path:run_mdata/fp/resources[SlurmJobArray]/kwargs/slurm_job_sizeFor SlurmJobArray, how many DPDispatcher tasks are grouped into one array element / Slurm job script branch.
When batch_type is set to
JH_UniScheduler(or its aliasjh_unischeduler):- kwargs:#
- type:
dictargument path:run_mdata/fp/resources[JH_UniScheduler]/kwargsJH_UniScheduler-specific extra arguments.
- custom_gpu_line:#
- type:
str|NoneType, optional, default:Noneargument path:run_mdata/fp/resources[JH_UniScheduler]/kwargs/custom_gpu_lineCustom GPU header line starting with #JSUB. When set, it overrides the default UniScheduler GPU line generated from gpu_per_node.
When batch_type is set to
SGE(or its aliassge):- kwargs:#
- type:
dictargument path:run_mdata/fp/resources[SGE]/kwargsSGE-specific extra arguments.
- pe_name:#
- type:
str, optional, default:mpi, alias: sge_pe_nameargument path:run_mdata/fp/resources[SGE]/kwargs/pe_nameParallel environment name used by SGE, for example mpi. This controls the #$ -pe … header line in SGE mode.
- job_name:#
- type:
str, optional, default:wDPjobargument path:run_mdata/fp/resources[SGE]/kwargs/job_nameJob name shown by SGE for this submission.
When batch_type is set to
Torque(or its aliastorque):- kwargs:#
- type:
dict, optionalargument path:run_mdata/fp/resources[Torque]/kwargsThis field is empty for this batch.
When batch_type is set to
Fugaku(or its aliasfugaku):- kwargs:#
- type:
dict, optionalargument path:run_mdata/fp/resources[Fugaku]/kwargsThis field is empty for this batch.
When batch_type is set to
Bohrium(or its aliasesbohrium,Lebesgue,lebesgue,DpCloudServer,dpcloudserver):- kwargs:#
- type:
dict, optionalargument path:run_mdata/fp/resources[Bohrium]/kwargsThis field is empty for this batch.
When batch_type is set to
LSF(or its aliaslsf):- kwargs:#
- type:
dictargument path:run_mdata/fp/resources[LSF]/kwargsLSF-specific extra arguments.
- gpu_usage:#
- type:
bool, optional, default:Falseargument path:run_mdata/fp/resources[LSF]/kwargs/gpu_usageWhether DPDispatcher should emit an LSF GPU request line at all. If False, no GPU request header is added.
- gpu_new_syntax:#
- type:
bool, optional, default:Falseargument path:run_mdata/fp/resources[LSF]/kwargs/gpu_new_syntaxWhether to use the newer #BSUB -gpu syntax instead of the older resource string syntax. This is typically used on newer LSF versions.
- gpu_exclusive:#
- type:
bool, optional, default:Trueargument path:run_mdata/fp/resources[LSF]/kwargs/gpu_exclusiveOnly meaningful when gpu_new_syntax is enabled. Controls whether the submitted job requests GPUs in exclusive mode.
- custom_gpu_line:#
- type:
str|NoneType, optional, default:Noneargument path:run_mdata/fp/resources[LSF]/kwargs/custom_gpu_lineCustom GPU header line starting with #BSUB. When set, it overrides the GPU-related LSF header generated from the other GPU kwargs.
When batch_type is set to
DistributedShell(or its aliasdistributedshell):- kwargs:#
- type:
dict, optionalargument path:run_mdata/fp/resources[DistributedShell]/kwargsThis field is empty for this batch.
When batch_type is set to
OpenAPI(or its aliasopenapi):- kwargs:#
- type:
dict, optionalargument path:run_mdata/fp/resources[OpenAPI]/kwargsThis field is empty for this batch.
When batch_type is set to
PBS(or its aliaspbs):- kwargs:#
- type:
dict, optionalargument path:run_mdata/fp/resources[PBS]/kwargsThis field is empty for this batch.
When batch_type is set to
Shell(or its aliasshell):- kwargs:#
- type:
dict, optionalargument path:run_mdata/fp/resources[Shell]/kwargsThis field is empty for this batch.
When batch_type is set to
Slurm(or its aliasslurm):- kwargs:#
- type:
dict, optionalargument path:run_mdata/fp/resources[Slurm]/kwargsSlurm-specific extra arguments.
- custom_gpu_line:#
- type:
str|NoneType, optional, default:Noneargument path:run_mdata/fp/resources[Slurm]/kwargs/custom_gpu_lineCustom GPU header line starting with #SBATCH. When set, it overrides DPDispatcher’s default Slurm GPU line generated from gpu_per_node.
- user_forward_files:#
- type:
list, optionalargument path:run_mdata/fp/user_forward_filesFiles to be forwarded to the remote machine.
- user_backward_files:#
- type:
list, optionalargument path:run_mdata/fp/user_backward_filesFiles to be backwarded from the remote machine.