deepmd.train package
Submodules
deepmd.train.run_options module
Module taking care of important package constants.
- class deepmd.train.run_options.RunOptions(init_model: str | None = None, init_frz_model: str | None = None, finetune: str | None = None, restart: str | None = None, log_path: str | None = None, log_level: int = 0, mpi_log: str = 'master')[source]
Bases: object
Class with info on how to run training (cluster, MPI and GPU config).
- Attributes:
- gpus: Optional[List[int]]
list of GPUs if any are present else None
- is_chief: bool
in distributed training it is true for the main MPI process; in serial it is always true
- world_size: int
total worker count
- my_rank: int
index of the MPI task
- nodename: str
name of the node
- node_list_: List[str]
the list of nodes of the current mpirun
- my_device: str
device type - gpu or cpu
Methods
Print build and current running cluster configuration summary.
- property is_chief
Whether my rank is 0.
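The relationship between my_rank, world_size and is_chief described above can be illustrated with a minimal sketch. The class RunOptionsSketch below is a hypothetical stand-in written only for illustration; it is not the deepmd implementation, does not import deepmd or MPI, and its default values are assumptions. It mirrors the documented attribute names and the documented rule that is_chief is true when the rank is 0.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class RunOptionsSketch:
    """Hypothetical stand-in mirroring the documented attributes (not deepmd code)."""

    my_rank: int = 0          # index of the MPI task
    world_size: int = 1       # total worker count
    nodename: str = "localhost"  # assumed default, for illustration only
    node_list_: List[str] = field(default_factory=lambda: ["localhost"])
    gpus: Optional[List[int]] = None  # None when no GPUs are present
    my_device: str = "cpu"    # "gpu" or "cpu"

    @property
    def is_chief(self) -> bool:
        # Documented behaviour: true when this process has rank 0.
        return self.my_rank == 0


# Serial run: a single process with rank 0, so it is always the chief.
serial = RunOptionsSketch()

# Distributed run: rank 2 of 4 workers is not the chief.
worker = RunOptionsSketch(my_rank=2, world_size=4)

print(serial.is_chief, worker.is_chief)
```

In a pattern like this, work that should happen only once per job (for example, writing logs or checkpoints) would typically be guarded by a check on is_chief.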
deepmd.train.trainer module
- class deepmd.train.trainer.DPTrainer(jdata, run_opt, is_compress=False)[source]
Bases: object
Methods
Save the compressed graph.
build
eval_single_list
get_evaluation_results
get_feed_dict
get_global_step
print_header
print_on_training
save_checkpoint
train
valid_on_the_fly