5.3. Training Parameters
Note
One can load, modify, and export the input file by using our effective web-based tool DP-GUI online or hosted using the command line interface dp gui. All training parameters below can be set in DP-GUI. By clicking “SAVE JSON”, one can download the input file for further training.
Note
One can benefit from IntelliSense and validation when writing JSON files using Visual Studio Code. See here to learn how to configure.
- model:
- type:
dictargument path:model- type_map:
- type:
typing.List[str], optionalargument path:model/type_mapA list of strings. Give the name to each type of atoms. It is noted that the number of atom type of training system must be less than 128 in a GPU environment. If not given, type.raw in each system should use the same type indexes, and type_map.raw will take no effect.
- data_stat_nbatch:
- type:
int, optional, default:10argument path:model/data_stat_nbatchThe model determines the normalization from the statistics of the data. This key specifies the number of frames in each system used for statistics.
- data_stat_protect:
- type:
float, optional, default:0.01argument path:model/data_stat_protectProtect parameter for atomic energy regression.
- data_bias_nsample:
- type:
int, optional, default:10argument path:model/data_bias_nsampleThe number of training samples in a system to compute and change the energy bias.
- use_srtab:
- type:
str, optionalargument path:model/use_srtabThe table for the short-range pairwise interaction added on top of DP. The table is a text data file with (N_t + 1) * N_t / 2 + 1 columes. The first colume is the distance between atoms. The second to the last columes are energies for pairs of certain types. For example we have two atom types, 0 and 1. The columes from 2nd to 4th are for 0-0, 0-1 and 1-1 correspondingly.
- smin_alpha:
- type:
float, optionalargument path:model/smin_alphaThe short-range tabulated interaction will be swithed according to the distance of the nearest neighbor. This distance is calculated by softmin. This parameter is the decaying parameter in the softmin. It is only required when use_srtab is provided.
- sw_rmin:
- type:
float, optionalargument path:model/sw_rminThe lower boundary of the interpolation between short-range tabulated interaction and DP. It is only required when use_srtab is provided.
- sw_rmax:
- type:
float, optionalargument path:model/sw_rmaxThe upper boundary of the interpolation between short-range tabulated interaction and DP. It is only required when use_srtab is provided.
- pair_exclude_types:
- type:
list, optional, default:[]argument path:model/pair_exclude_types(Supported Backend: PyTorch) The atom pairs of the listed types are not treated to be neighbors, i.e. they do not see each other.
- atom_exclude_types:
- type:
list, optional, default:[]argument path:model/atom_exclude_types(Supported Backend: PyTorch) Exclude the atomic contribution of the listed atom types
- srtab_add_bias:
- type:
bool, optional, default:Trueargument path:model/srtab_add_bias(Supported Backend: TensorFlow) Whether add energy bias from the statistics of the data to short-range tabulated atomic energy. It only takes effect when use_srtab is provided.
- type_embedding:
- type:
dict, optionalargument path:model/type_embedding(Supported Backend: TensorFlow) The type embedding.
- neuron:
- type:
typing.List[int], optional, default:[8]argument path:model/type_embedding/neuronNumber of neurons in each hidden layers of the embedding net. When two layers are of the same size or one layer is twice as large as the previous layer, a skip connection is built.
- activation_function:
- type:
str, optional, default:tanhargument path:model/type_embedding/activation_functionThe activation function in the embedding net. Supported activation functions are “relu”, “tanh”, “softplus”, “linear”, “relu6”, “none”, “gelu”, “gelu_tf”, “sigmoid”. Note that “gelu” denotes the custom operator version, and “gelu_tf” denotes the TF standard version. If you set “None” or “none” here, no activation function will be used.
- resnet_dt:
- type:
bool, optional, default:Falseargument path:model/type_embedding/resnet_dtWhether to use a “Timestep” in the skip connection
- precision:
- type:
str, optional, default:defaultargument path:model/type_embedding/precisionThe precision of the embedding net parameters, supported options are “float32”, “default”, “float16”, “float64”. Default follows the interface precision.
- trainable:
- type:
bool, optional, default:Trueargument path:model/type_embedding/trainableIf the parameters in the embedding net are trainable
- seed:
- type:
NoneType|int, optional, default:Noneargument path:model/type_embedding/seedRandom seed for parameter initialization
- use_econf_tebd:
- type:
bool, optional, default:Falseargument path:model/type_embedding/use_econf_tebdWhether to use electronic configuration type embedding.
- use_tebd_bias:
- type:
bool, optional, default:Falseargument path:model/type_embedding/use_tebd_biasWhether to use bias in the type embedding layer.
- modifier:
- type:
dict, optionalargument path:model/modifier(Supported Backend: TensorFlow) The modifier of model output.
Depending on the value of type, different sub args are accepted.
- type:
The type of modifier. See explanation below.
-dipole_charge: Use WFCC to model the electronic structure of the system. Correct the long-range interaction
When type is set to
dipole_charge:- model_name:
- type:
strargument path:model/modifier[dipole_charge]/model_nameThe name of the frozen dipole model file.
- model_charge_map:
- type:
typing.List[float]argument path:model/modifier[dipole_charge]/model_charge_mapThe charge of the WFCC. The list length should be the same as the `sel_type <model/fitting_net[dipole]/sel_type_>`_.
- sys_charge_map:
- type:
typing.List[float]argument path:model/modifier[dipole_charge]/sys_charge_mapThe charge of real atoms. The list length should be the same as the type_map
- ewald_beta:
- type:
float, optional, default:0.4argument path:model/modifier[dipole_charge]/ewald_betaThe splitting parameter of Ewald sum. Unit is A^-1
- ewald_h:
- type:
float, optional, default:1.0argument path:model/modifier[dipole_charge]/ewald_hThe grid spacing of the FFT grid. Unit is A
- compress:
- type:
dict, optionalargument path:model/compress(Supported Backend: TensorFlow) Model compression configurations
- spin:
- type:
dict, optionalargument path:model/spinThe settings for systems with spin.
- use_spin:
- type:
typing.List[int]|typing.List[bool]argument path:model/spin/use_spinWhether to use atomic spin model for each atom type. List of boolean values with the shape of [ntypes] to specify which types use spin, or a list of integer values (Supported Backend: PyTorch) to indicate the index of the type that uses spin.
- spin_norm:
- type:
typing.List[float], optionalargument path:model/spin/spin_norm(Supported Backend: TensorFlow) The magnitude of atomic spin for each atom type with spin
- virtual_len:
- type:
typing.List[float], optionalargument path:model/spin/virtual_len(Supported Backend: TensorFlow) The distance between virtual atom representing spin and its corresponding real atom for each atom type with spin
- virtual_scale:
- type:
float|typing.List[float], optionalargument path:model/spin/virtual_scale(Supported Backend: PyTorch) The scaling factor to determine the virtual distance between a virtual atom representing spin and its corresponding real atom for each atom type with spin. This factor is defined as the virtual distance divided by the magnitude of atomic spin for each atom type with spin. The virtual coordinate is defined as the real coordinate plus spin * virtual_scale. List of float values with shape of [ntypes] or [ntypes_spin] or one single float value for all types, only used when use_spin is True for each atom type.
- finetune_head:
- type:
str, optionalargument path:model/finetune_head(Supported Backend: PyTorch) The chosen fitting net to fine-tune on, when doing multi-task fine-tuning. If not set or set to ‘RANDOM’, the fitting net will be randomly initialized.
Depending on the value of type, different sub args are accepted.
- type:
- type:
str(flag key), default:standardargument path:model/type
When type is set to
standard:Stardard model, which contains a descriptor and a fitting.
- descriptor:
- type:
dictargument path:model[standard]/descriptorThe descriptor of atomic environment.
Depending on the value of type, different sub args are accepted.
- type:
- type:
str(flag key)argument path:model[standard]/descriptor/typeThe type of the descritpor. See explanation below.
loc_frame: Defines a local frame at each atom, and the compute the descriptor as local coordinates under this frame.
se_e2_a: Used by the smooth edition of Deep Potential. The full relative coordinates are used to construct the descriptor.
se_e2_r: Used by the smooth edition of Deep Potential. Only the distance between atoms is used to construct the descriptor.
se_e3: Used by the smooth edition of Deep Potential. The full relative coordinates are used to construct the descriptor. Three-body embedding will be used by this descriptor.
se_a_tpe: Used by the smooth edition of Deep Potential. The full relative coordinates are used to construct the descriptor. Type embedding will be used by this descriptor.
se_atten: Used by the smooth edition of Deep Potential. The full relative coordinates are used to construct the descriptor. Attention mechanism will be used by this descriptor.
se_atten_v2: Used by the smooth edition of Deep Potential. The full relative coordinates are used to construct the descriptor. Attention mechanism with new modifications will be used by this descriptor.
se_a_mask: Used by the smooth edition of Deep Potential. It can accept a variable number of atoms in a frame (Non-PBC system). aparam are required as an indicator matrix for the real/virtual sign of input atoms.
hybrid: Concatenate of a list of descriptors as a new descriptor.
When type is set to
loc_frame:(Supported Backend: TensorFlow)
- sel_a:
- type:
typing.List[int]argument path:model[standard]/descriptor[loc_frame]/sel_aA list of integers. The length of the list should be the same as the number of atom types in the system. sel_a[i] gives the selected number of type-i neighbors. The full relative coordinates of the neighbors are used by the descriptor.
- sel_r:
- type:
typing.List[int]argument path:model[standard]/descriptor[loc_frame]/sel_rA list of integers. The length of the list should be the same as the number of atom types in the system. sel_r[i] gives the selected number of type-i neighbors. Only relative distance of the neighbors are used by the descriptor. sel_a[i] + sel_r[i] is recommended to be larger than the maximally possible number of type-i neighbors in the cut-off radius.
- rcut:
- type:
float, optional, default:6.0argument path:model[standard]/descriptor[loc_frame]/rcutThe cut-off radius. The default value is 6.0
- axis_rule:
- type:
typing.List[int]argument path:model[standard]/descriptor[loc_frame]/axis_ruleA list of integers. The length should be 6 times of the number of types.
axis_rule[i*6+0]: class of the atom defining the first axis of type-i atom. 0 for neighbors with full coordinates and 1 for neighbors only with relative distance.
axis_rule[i*6+1]: type of the atom defining the first axis of type-i atom.
axis_rule[i*6+2]: index of the axis atom defining the first axis. Note that the neighbors with the same class and type are sorted according to their relative distance.
axis_rule[i*6+3]: class of the atom defining the second axis of type-i atom. 0 for neighbors with full coordinates and 1 for neighbors only with relative distance.
axis_rule[i*6+4]: type of the atom defining the second axis of type-i atom.
axis_rule[i*6+5]: index of the axis atom defining the second axis. Note that the neighbors with the same class and type are sorted according to their relative distance.
When type is set to
se_e2_a(or its aliasse_a):- sel:
- type:
typing.List[int]|str, optional, default:autoargument path:model[standard]/descriptor[se_e2_a]/selThis parameter set the number of selected neighbors for each type of atom. It can be:
List[int]. The length of the list should be the same as the number of atom types in the system. sel[i] gives the selected number of type-i neighbors. sel[i] is recommended to be larger than the maximally possible number of type-i neighbors in the cut-off radius. It is noted that the total sel value must be less than 4096 in a GPU environment.
str. Can be “auto:factor” or “auto”. “factor” is a float number larger than 1. This option will automatically determine the sel. In detail it counts the maximal number of neighbors with in the cutoff radius for each type of neighbor, then multiply the maximum by the “factor”. Finally the number is wraped up to 4 divisible. The option “auto” is equivalent to “auto:1.1”.
- rcut:
- type:
float, optional, default:6.0argument path:model[standard]/descriptor[se_e2_a]/rcutThe cut-off radius.
- rcut_smth:
- type:
float, optional, default:0.5argument path:model[standard]/descriptor[se_e2_a]/rcut_smthWhere to start smoothing. For example the 1/r term is smoothed from rcut to rcut_smth
- neuron:
- type:
typing.List[int], optional, default:[10, 20, 40]argument path:model[standard]/descriptor[se_e2_a]/neuronNumber of neurons in each hidden layers of the embedding net. When two layers are of the same size or one layer is twice as large as the previous layer, a skip connection is built.
- axis_neuron:
- type:
int, optional, default:4, alias: n_axis_neuronargument path:model[standard]/descriptor[se_e2_a]/axis_neuronSize of the submatrix of G (embedding matrix).
- activation_function:
- type:
str, optional, default:tanhargument path:model[standard]/descriptor[se_e2_a]/activation_functionThe activation function in the embedding net. Supported activation functions are “relu”, “tanh”, “softplus”, “linear”, “relu6”, “none”, “gelu”, “gelu_tf”, “sigmoid”. Note that “gelu” denotes the custom operator version, and “gelu_tf” denotes the TF standard version. If you set “None” or “none” here, no activation function will be used.
- resnet_dt:
- type:
bool, optional, default:Falseargument path:model[standard]/descriptor[se_e2_a]/resnet_dtWhether to use a “Timestep” in the skip connection
- type_one_side:
- type:
bool, optional, default:Falseargument path:model[standard]/descriptor[se_e2_a]/type_one_sideIf true, the embedding network parameters vary by types of neighbor atoms only, so there will be $N_text{types}$ sets of embedding network parameters. Otherwise, the embedding network parameters vary by types of centric atoms and types of neighbor atoms, so there will be $N_text{types}^2$ sets of embedding network parameters.
- precision:
- type:
str, optional, default:defaultargument path:model[standard]/descriptor[se_e2_a]/precisionThe precision of the embedding net parameters, supported options are “float32”, “default”, “float16”, “float64”. Default follows the interface precision.
- trainable:
- type:
bool, optional, default:Trueargument path:model[standard]/descriptor[se_e2_a]/trainableIf the parameters in the embedding net is trainable
- seed:
- type:
NoneType|int, optionalargument path:model[standard]/descriptor[se_e2_a]/seedRandom seed for parameter initialization
- exclude_types:
- type:
typing.List[typing.List[int]], optional, default:[]argument path:model[standard]/descriptor[se_e2_a]/exclude_typesThe excluded pairs of types which have no interaction with each other. For example, [[0, 1]] means no interaction between type 0 and type 1.
- env_protection:
- type:
float, optional, default:0.0argument path:model[standard]/descriptor[se_e2_a]/env_protection(Supported Backend: PyTorch) Protection parameter to prevent division by zero errors during environment matrix calculations. For example, when using paddings, there may be zero distances of neighbors, which may make division by zero error during environment matrix calculations without protection.
- set_davg_zero:
- type:
bool, optional, default:Falseargument path:model[standard]/descriptor[se_e2_a]/set_davg_zeroSet the normalization average to zero. This option should be set when atom_ener in the energy fitting is used
When type is set to
se_e3(or its aliasesse_at,se_a_3be,se_t):- sel:
- type:
typing.List[int]|str, optional, default:autoargument path:model[standard]/descriptor[se_e3]/selThis parameter set the number of selected neighbors for each type of atom. It can be:
List[int]. The length of the list should be the same as the number of atom types in the system. sel[i] gives the selected number of type-i neighbors. sel[i] is recommended to be larger than the maximally possible number of type-i neighbors in the cut-off radius. It is noted that the total sel value must be less than 4096 in a GPU environment.
str. Can be “auto:factor” or “auto”. “factor” is a float number larger than 1. This option will automatically determine the sel. In detail it counts the maximal number of neighbors with in the cutoff radius for each type of neighbor, then multiply the maximum by the “factor”. Finally the number is wraped up to 4 divisible. The option “auto” is equivalent to “auto:1.1”.
- rcut:
- type:
float, optional, default:6.0argument path:model[standard]/descriptor[se_e3]/rcutThe cut-off radius.
- rcut_smth:
- type:
float, optional, default:0.5argument path:model[standard]/descriptor[se_e3]/rcut_smthWhere to start smoothing. For example the 1/r term is smoothed from rcut to rcut_smth
- neuron:
- type:
typing.List[int], optional, default:[10, 20, 40]argument path:model[standard]/descriptor[se_e3]/neuronNumber of neurons in each hidden layers of the embedding net. When two layers are of the same size or one layer is twice as large as the previous layer, a skip connection is built.
- activation_function:
- type:
str, optional, default:tanhargument path:model[standard]/descriptor[se_e3]/activation_functionThe activation function in the embedding net. Supported activation functions are “relu”, “tanh”, “softplus”, “linear”, “relu6”, “none”, “gelu”, “gelu_tf”, “sigmoid”. Note that “gelu” denotes the custom operator version, and “gelu_tf” denotes the TF standard version. If you set “None” or “none” here, no activation function will be used.
- resnet_dt:
- type:
bool, optional, default:Falseargument path:model[standard]/descriptor[se_e3]/resnet_dtWhether to use a “Timestep” in the skip connection
- precision:
- type:
str, optional, default:defaultargument path:model[standard]/descriptor[se_e3]/precisionThe precision of the embedding net parameters, supported options are “float32”, “default”, “float16”, “float64”. Default follows the interface precision.
- trainable:
- type:
bool, optional, default:Trueargument path:model[standard]/descriptor[se_e3]/trainableIf the parameters in the embedding net are trainable
- seed:
- type:
NoneType|int, optionalargument path:model[standard]/descriptor[se_e3]/seedRandom seed for parameter initialization
- set_davg_zero:
- type:
bool, optional, default:Falseargument path:model[standard]/descriptor[se_e3]/set_davg_zeroSet the normalization average to zero. This option should be set when atom_ener in the energy fitting is used
- exclude_types:
- type:
typing.List[typing.List[int]], optional, default:[]argument path:model[standard]/descriptor[se_e3]/exclude_typesThe excluded pairs of types which have no interaction with each other. For example, [[0, 1]] means no interaction between type 0 and type 1.
- env_protection:
- type:
float, optional, default:0.0argument path:model[standard]/descriptor[se_e3]/env_protection(Supported Backend: PyTorch) Protection parameter to prevent division by zero errors during environment matrix calculations. For example, when using paddings, there may be zero distances of neighbors, which may make division by zero error during environment matrix calculations without protection.
When type is set to
se_a_tpe(or its aliasse_a_ebd):(Supported Backend: TensorFlow)
- sel:
- type:
typing.List[int]|str, optional, default:autoargument path:model[standard]/descriptor[se_a_tpe]/selThis parameter set the number of selected neighbors for each type of atom. It can be:
List[int]. The length of the list should be the same as the number of atom types in the system. sel[i] gives the selected number of type-i neighbors. sel[i] is recommended to be larger than the maximally possible number of type-i neighbors in the cut-off radius. It is noted that the total sel value must be less than 4096 in a GPU environment.
str. Can be “auto:factor” or “auto”. “factor” is a float number larger than 1. This option will automatically determine the sel. In detail it counts the maximal number of neighbors with in the cutoff radius for each type of neighbor, then multiply the maximum by the “factor”. Finally the number is wraped up to 4 divisible. The option “auto” is equivalent to “auto:1.1”.
- rcut:
- type:
float, optional, default:6.0argument path:model[standard]/descriptor[se_a_tpe]/rcutThe cut-off radius.
- rcut_smth:
- type:
float, optional, default:0.5argument path:model[standard]/descriptor[se_a_tpe]/rcut_smthWhere to start smoothing. For example the 1/r term is smoothed from rcut to rcut_smth
- neuron:
- type:
typing.List[int], optional, default:[10, 20, 40]argument path:model[standard]/descriptor[se_a_tpe]/neuronNumber of neurons in each hidden layers of the embedding net. When two layers are of the same size or one layer is twice as large as the previous layer, a skip connection is built.
- axis_neuron:
- type:
int, optional, default:4, alias: n_axis_neuronargument path:model[standard]/descriptor[se_a_tpe]/axis_neuronSize of the submatrix of G (embedding matrix).
- activation_function:
- type:
str, optional, default:tanhargument path:model[standard]/descriptor[se_a_tpe]/activation_functionThe activation function in the embedding net. Supported activation functions are “relu”, “tanh”, “softplus”, “linear”, “relu6”, “none”, “gelu”, “gelu_tf”, “sigmoid”. Note that “gelu” denotes the custom operator version, and “gelu_tf” denotes the TF standard version. If you set “None” or “none” here, no activation function will be used.
- resnet_dt:
- type:
bool, optional, default:Falseargument path:model[standard]/descriptor[se_a_tpe]/resnet_dtWhether to use a “Timestep” in the skip connection
- type_one_side:
- type:
bool, optional, default:Falseargument path:model[standard]/descriptor[se_a_tpe]/type_one_sideIf true, the embedding network parameters vary by types of neighbor atoms only, so there will be $N_text{types}$ sets of embedding network parameters. Otherwise, the embedding network parameters vary by types of centric atoms and types of neighbor atoms, so there will be $N_text{types}^2$ sets of embedding network parameters.
- precision:
- type:
str, optional, default:defaultargument path:model[standard]/descriptor[se_a_tpe]/precisionThe precision of the embedding net parameters, supported options are “float32”, “default”, “float16”, “float64”. Default follows the interface precision.
- trainable:
- type:
bool, optional, default:Trueargument path:model[standard]/descriptor[se_a_tpe]/trainableIf the parameters in the embedding net is trainable
- seed:
- type:
NoneType|int, optionalargument path:model[standard]/descriptor[se_a_tpe]/seedRandom seed for parameter initialization
- exclude_types:
- type:
typing.List[typing.List[int]], optional, default:[]argument path:model[standard]/descriptor[se_a_tpe]/exclude_typesThe excluded pairs of types which have no interaction with each other. For example, [[0, 1]] means no interaction between type 0 and type 1.
- env_protection:
- type:
float, optional, default:0.0argument path:model[standard]/descriptor[se_a_tpe]/env_protection(Supported Backend: PyTorch) Protection parameter to prevent division by zero errors during environment matrix calculations. For example, when using paddings, there may be zero distances of neighbors, which may make division by zero error during environment matrix calculations without protection.
- set_davg_zero:
- type:
bool, optional, default:Falseargument path:model[standard]/descriptor[se_a_tpe]/set_davg_zeroSet the normalization average to zero. This option should be set when atom_ener in the energy fitting is used
- type_nchanl:
- type:
int, optional, default:4argument path:model[standard]/descriptor[se_a_tpe]/type_nchanlnumber of channels for type embedding
- type_nlayer:
- type:
int, optional, default:2argument path:model[standard]/descriptor[se_a_tpe]/type_nlayernumber of hidden layers of type embedding net
- numb_aparam:
- type:
int, optional, default:0argument path:model[standard]/descriptor[se_a_tpe]/numb_aparamdimension of atomic parameter. if set to a value > 0, the atomic parameters are embedded.
When type is set to
se_e2_r(or its aliasse_r):- sel:
- type:
typing.List[int]|str, optional, default:autoargument path:model[standard]/descriptor[se_e2_r]/selThis parameter set the number of selected neighbors for each type of atom. It can be:
List[int]. The length of the list should be the same as the number of atom types in the system. sel[i] gives the selected number of type-i neighbors. sel[i] is recommended to be larger than the maximally possible number of type-i neighbors in the cut-off radius. It is noted that the total sel value must be less than 4096 in a GPU environment.
str. Can be “auto:factor” or “auto”. “factor” is a float number larger than 1. This option will automatically determine the sel. In detail it counts the maximal number of neighbors with in the cutoff radius for each type of neighbor, then multiply the maximum by the “factor”. Finally the number is wraped up to 4 divisible. The option “auto” is equivalent to “auto:1.1”.
- rcut:
- type:
float, optional, default:6.0argument path:model[standard]/descriptor[se_e2_r]/rcutThe cut-off radius.
- rcut_smth:
- type:
float, optional, default:0.5argument path:model[standard]/descriptor[se_e2_r]/rcut_smthWhere to start smoothing. For example the 1/r term is smoothed from rcut to rcut_smth
- neuron:
- type:
typing.List[int], optional, default:[10, 20, 40]argument path:model[standard]/descriptor[se_e2_r]/neuronNumber of neurons in each hidden layers of the embedding net. When two layers are of the same size or one layer is twice as large as the previous layer, a skip connection is built.
- activation_function:
- type:
str, optional, default:tanhargument path:model[standard]/descriptor[se_e2_r]/activation_functionThe activation function in the embedding net. Supported activation functions are “relu”, “tanh”, “softplus”, “linear”, “relu6”, “none”, “gelu”, “gelu_tf”, “sigmoid”. Note that “gelu” denotes the custom operator version, and “gelu_tf” denotes the TF standard version. If you set “None” or “none” here, no activation function will be used.
- resnet_dt:
- type:
bool, optional, default:Falseargument path:model[standard]/descriptor[se_e2_r]/resnet_dtWhether to use a “Timestep” in the skip connection
- type_one_side:
- type:
bool, optional, default:Falseargument path:model[standard]/descriptor[se_e2_r]/type_one_sideIf true, the embedding network parameters vary by types of neighbor atoms only, so there will be $N_text{types}$ sets of embedding network parameters. Otherwise, the embedding network parameters vary by types of centric atoms and types of neighbor atoms, so there will be $N_text{types}^2$ sets of embedding network parameters.
- precision:
- type:
str, optional, default:defaultargument path:model[standard]/descriptor[se_e2_r]/precisionThe precision of the embedding net parameters, supported options are “float32”, “default”, “float16”, “float64”. Default follows the interface precision.
- trainable:
- type:
bool, optional, default:Trueargument path:model[standard]/descriptor[se_e2_r]/trainableIf the parameters in the embedding net are trainable
- seed:
- type:
NoneType|int, optionalargument path:model[standard]/descriptor[se_e2_r]/seedRandom seed for parameter initialization
- exclude_types:
- type:
typing.List[typing.List[int]], optional, default:[]argument path:model[standard]/descriptor[se_e2_r]/exclude_typesThe excluded pairs of types which have no interaction with each other. For example, [[0, 1]] means no interaction between type 0 and type 1.
- set_davg_zero:
- type:
bool, optional, default:Falseargument path:model[standard]/descriptor[se_e2_r]/set_davg_zeroSet the normalization average to zero. This option should be set when atom_ener in the energy fitting is used
- env_protection:
- type:
float, optional, default:0.0argument path:model[standard]/descriptor[se_e2_r]/env_protection(Supported Backend: PyTorch) Protection parameter to prevent division by zero errors during environment matrix calculations. For example, when using paddings, there may be zero distances of neighbors, which may make division by zero error during environment matrix calculations without protection.
When type is set to
hybrid:- list:
- type:
listargument path:model[standard]/descriptor[hybrid]/listA list of descriptor definitions
When type is set to
se_atten(or its aliasdpa1):- sel:
- type:
typing.List[int]|str|int, optional, default:autoargument path:model[standard]/descriptor[se_atten]/selThis parameter set the number of selected neighbors. Note that this parameter is a little different from that in other descriptors. Instead of separating each type of atoms, only the summation matters. And this number is highly related with the efficiency, thus one should not make it too large. Usually 200 or less is enough, far away from the GPU limitation 4096. It can be:
int. The maximum number of neighbor atoms to be considered. We recommend it to be less than 200.
List[int]. The length of the list should be the same as the number of atom types in the system. sel[i] gives the selected number of type-i neighbors. Only the summation of sel[i] matters, and it is recommended to be less than 200. - str. Can be “auto:factor” or “auto”. “factor” is a float number larger than 1. This option will automatically determine the sel. In detail it counts the maximal number of neighbors with in the cutoff radius for each type of neighbor, then multiply the maximum by the “factor”. Finally the number is wraped up to 4 divisible. The option “auto” is equivalent to “auto:1.1”.
- rcut:
- type:
float, optional, default:6.0argument path:model[standard]/descriptor[se_atten]/rcutThe cut-off radius.
- rcut_smth:
- type:
float, optional, default:0.5argument path:model[standard]/descriptor[se_atten]/rcut_smthWhere to start smoothing. For example the 1/r term is smoothed from rcut to rcut_smth
- neuron:
- type:
typing.List[int], optional, default:[10, 20, 40]argument path:model[standard]/descriptor[se_atten]/neuronNumber of neurons in each hidden layers of the embedding net. When two layers are of the same size or one layer is twice as large as the previous layer, a skip connection is built.
- axis_neuron:
- type:
int, optional, default:4, alias: n_axis_neuronargument path:model[standard]/descriptor[se_atten]/axis_neuronSize of the submatrix of G (embedding matrix).
- activation_function:
- type:
str, optional, default:tanhargument path:model[standard]/descriptor[se_atten]/activation_functionThe activation function in the embedding net. Supported activation functions are “relu”, “tanh”, “softplus”, “linear”, “relu6”, “none”, “gelu”, “gelu_tf”, “sigmoid”. Note that “gelu” denotes the custom operator version, and “gelu_tf” denotes the TF standard version. If you set “None” or “none” here, no activation function will be used.
- resnet_dt:
- type:
bool, optional, default:Falseargument path:model[standard]/descriptor[se_atten]/resnet_dtWhether to use a “Timestep” in the skip connection
- type_one_side:
- type:
bool, optional, default:Falseargument path:model[standard]/descriptor[se_atten]/type_one_sideIf ‘False’, type embeddings of both neighbor and central atoms are considered. If ‘True’, only type embeddings of neighbor atoms are considered. Default is ‘False’.
- precision:
- type:
str, optional, default:defaultargument path:model[standard]/descriptor[se_atten]/precisionThe precision of the embedding net parameters, supported options are “float32”, “default”, “float16”, “float64”. Default follows the interface precision.
- trainable:
- type:
bool, optional, default:Trueargument path:model[standard]/descriptor[se_atten]/trainableIf the parameters in the embedding net is trainable
- seed:
- type:
NoneType|int, optionalargument path:model[standard]/descriptor[se_atten]/seedRandom seed for parameter initialization
- exclude_types:
- type:
typing.List[typing.List[int]], optional, default:[]argument path:model[standard]/descriptor[se_atten]/exclude_typesThe excluded pairs of types which have no interaction with each other. For example, [[0, 1]] means no interaction between type 0 and type 1.
- env_protection:
- type:
float, optional, default:0.0argument path:model[standard]/descriptor[se_atten]/env_protection(Supported Backend: PyTorch) Protection parameter to prevent division by zero errors during environment matrix calculations. For example, when using paddings, there may be zero distances of neighbors, which may make division by zero error during environment matrix calculations without protection.
- attn:
- type:
int, optional, default:128argument path:model[standard]/descriptor[se_atten]/attnThe length of hidden vectors in attention layers
- attn_layer:
- type:
int, optional, default:2argument path:model[standard]/descriptor[se_atten]/attn_layerThe number of attention layers. Note that model compression of se_atten is only enabled when attn_layer==0 and tebd_input_mode==’strip’
- attn_dotr:
- type:
bool, optional, default:Trueargument path:model[standard]/descriptor[se_atten]/attn_dotrWhether to do dot product with the normalized relative coordinates
- attn_mask:
- type:
bool, optional, default:Falseargument path:model[standard]/descriptor[se_atten]/attn_maskWhether to do mask on the diagonal in the attention matrix
- stripped_type_embedding:
- type:
bool|NoneType, optional, default:Noneargument path:model[standard]/descriptor[se_atten]/stripped_type_embedding(Deprecated, kept only for compatibility.) Whether to strip the type embedding into a separate embedding network. Setting this parameter to True is equivalent to setting tebd_input_mode to ‘strip’. Setting it to False is equivalent to setting tebd_input_mode to ‘concat’.The default value is None, which means the tebd_input_mode setting will be used instead.
- smooth_type_embedding:
- type:
bool, optional, default:False, alias: smooth_type_embddingargument path:model[standard]/descriptor[se_atten]/smooth_type_embeddingWhether to use smooth process in attention weights calculation. (Supported Backend: TensorFlow) When using stripped type embedding, whether to dot smooth factor on the network output of type embedding to keep the network smooth, instead of setting set_davg_zero to be True.
- set_davg_zero:
- type:
bool, optional, default:Trueargument path:model[standard]/descriptor[se_atten]/set_davg_zeroSet the normalization average to zero. This option should be set when se_atten descriptor or atom_ener in the energy fitting is used
- trainable_ln:
- type:
bool, optional, default:Trueargument path:model[standard]/descriptor[se_atten]/trainable_lnWhether to use trainable shift and scale weights in layer normalization.
- ln_eps:
- type:
float|NoneType, optional, default:Noneargument path:model[standard]/descriptor[se_atten]/ln_epsThe epsilon value for layer normalization. The default value for TensorFlow is set to 1e-3 to keep consistent with keras while set to 1e-5 in PyTorch and DP implementation.
- tebd_dim:
- type:
int, optional, default:8argument path:model[standard]/descriptor[se_atten]/tebd_dim(Supported Backend: PyTorch) The dimension of atom type embedding.
- use_econf_tebd:
- type:
bool, optional, default:Falseargument path:model[standard]/descriptor[se_atten]/use_econf_tebd(Supported Backend: PyTorch) Whether to use electronic configuration type embedding. For TensorFlow backend, please set use_econf_tebd in type_embedding block instead.
- use_tebd_bias:
- type:
bool, optional, default:Falseargument path:model[standard]/descriptor[se_atten]/use_tebd_biasWhether to use bias in the type embedding layer.
- tebd_input_mode:
- type:
str, optional, default:concatargument path:model[standard]/descriptor[se_atten]/tebd_input_modeThe input mode of the type embedding. Supported modes are [‘concat’, ‘strip’].- ‘concat’: Concatenate the type embedding with the smoothed radial information as the union input for the embedding network. When type_one_side is False, the input is input_ij = concat([r_ij, tebd_j, tebd_i]). When type_one_side is True, the input is input_ij = concat([r_ij, tebd_j]). The output is out_ij = embeding(input_ij) for the pair-wise representation of atom i with neighbor j.- ‘strip’: Use a separated embedding network for the type embedding and combine the output with the radial embedding network output. When type_one_side is False, the input is input_t = concat([tebd_j, tebd_i]). (Supported Backend: PyTorch) When type_one_side is True, the input is input_t = tebd_j. The output is out_ij = embeding_t(input_t) * embeding_s(r_ij) + embeding_s(r_ij) for the pair-wise representation of atom i with neighbor j.
- scaling_factor:
- type:
float, optional, default:1.0argument path:model[standard]/descriptor[se_atten]/scaling_factor(Supported Backend: PyTorch) The scaling factor of normalization in calculations of attention weights, which is used to scale the matmul(Q, K). If temperature is None, the scaling of attention weights is (N_hidden_dim * scaling_factor)**0.5. Else, the scaling of attention weights is setting to temperature.
- normalize:
- type:
bool, optional, default:Trueargument path:model[standard]/descriptor[se_atten]/normalize(Supported Backend: PyTorch) Whether to normalize the hidden vectors during attention calculation.
- temperature:
- type:
float, optionalargument path:model[standard]/descriptor[se_atten]/temperature(Supported Backend: PyTorch) The scaling factor of normalization in calculations of attention weights, which is used to scale the matmul(Q, K).
- concat_output_tebd:
- type:
bool, optional, default:Trueargument path:model[standard]/descriptor[se_atten]/concat_output_tebd(Supported Backend: PyTorch) Whether to concat type embedding at the output of the descriptor.
When type is set to
se_atten_v2:- sel:
- type:
typing.List[int]|str|int, optional, default:autoargument path:model[standard]/descriptor[se_atten_v2]/selThis parameter set the number of selected neighbors. Note that this parameter is a little different from that in other descriptors. Instead of separating each type of atoms, only the summation matters. And this number is highly related with the efficiency, thus one should not make it too large. Usually 200 or less is enough, far away from the GPU limitation 4096. It can be:
int. The maximum number of neighbor atoms to be considered. We recommend it to be less than 200.
List[int]. The length of the list should be the same as the number of atom types in the system. sel[i] gives the selected number of type-i neighbors. Only the summation of sel[i] matters, and it is recommended to be less than 200. - str. Can be “auto:factor” or “auto”. “factor” is a float number larger than 1. This option will automatically determine the sel. In detail it counts the maximal number of neighbors with in the cutoff radius for each type of neighbor, then multiply the maximum by the “factor”. Finally the number is wraped up to 4 divisible. The option “auto” is equivalent to “auto:1.1”.
- rcut:
- type:
float, optional, default:6.0argument path:model[standard]/descriptor[se_atten_v2]/rcutThe cut-off radius.
- rcut_smth:
- type:
float, optional, default:0.5argument path:model[standard]/descriptor[se_atten_v2]/rcut_smthWhere to start smoothing. For example the 1/r term is smoothed from rcut to rcut_smth
- neuron:
- type:
typing.List[int], optional, default:[10, 20, 40]argument path:model[standard]/descriptor[se_atten_v2]/neuronNumber of neurons in each hidden layers of the embedding net. When two layers are of the same size or one layer is twice as large as the previous layer, a skip connection is built.
- axis_neuron:
- type:
int, optional, default:4, alias: n_axis_neuronargument path:model[standard]/descriptor[se_atten_v2]/axis_neuronSize of the submatrix of G (embedding matrix).
- activation_function:
- type:
str, optional, default:tanhargument path:model[standard]/descriptor[se_atten_v2]/activation_functionThe activation function in the embedding net. Supported activation functions are “relu”, “tanh”, “softplus”, “linear”, “relu6”, “none”, “gelu”, “gelu_tf”, “sigmoid”. Note that “gelu” denotes the custom operator version, and “gelu_tf” denotes the TF standard version. If you set “None” or “none” here, no activation function will be used.
- resnet_dt:
- type:
bool, optional, default:Falseargument path:model[standard]/descriptor[se_atten_v2]/resnet_dtWhether to use a “Timestep” in the skip connection
- type_one_side:
- type:
bool, optional, default:Falseargument path:model[standard]/descriptor[se_atten_v2]/type_one_sideIf ‘False’, type embeddings of both neighbor and central atoms are considered. If ‘True’, only type embeddings of neighbor atoms are considered. Default is ‘False’.
- precision:
- type:
str, optional, default:defaultargument path:model[standard]/descriptor[se_atten_v2]/precisionThe precision of the embedding net parameters, supported options are “float32”, “default”, “float16”, “float64”. Default follows the interface precision.
- trainable:
- type:
bool, optional, default:Trueargument path:model[standard]/descriptor[se_atten_v2]/trainableIf the parameters in the embedding net is trainable
- seed:
- type:
NoneType|int, optionalargument path:model[standard]/descriptor[se_atten_v2]/seedRandom seed for parameter initialization
- exclude_types:
- type:
typing.List[typing.List[int]], optional, default:[]argument path:model[standard]/descriptor[se_atten_v2]/exclude_typesThe excluded pairs of types which have no interaction with each other. For example, [[0, 1]] means no interaction between type 0 and type 1.
- env_protection:
- type:
float, optional, default:0.0argument path:model[standard]/descriptor[se_atten_v2]/env_protection(Supported Backend: PyTorch) Protection parameter to prevent division by zero errors during environment matrix calculations. For example, when using paddings, there may be zero distances of neighbors, which may make division by zero error during environment matrix calculations without protection.
- attn:
- type:
int, optional, default:128argument path:model[standard]/descriptor[se_atten_v2]/attnThe length of hidden vectors in attention layers
- attn_layer:
- type:
int, optional, default:2argument path:model[standard]/descriptor[se_atten_v2]/attn_layerThe number of attention layers. Note that model compression of se_atten is only enabled when attn_layer==0 and tebd_input_mode==’strip’
- attn_dotr:
- type:
bool, optional, default:Trueargument path:model[standard]/descriptor[se_atten_v2]/attn_dotrWhether to do dot product with the normalized relative coordinates
- attn_mask:
- type:
bool, optional, default:Falseargument path:model[standard]/descriptor[se_atten_v2]/attn_maskWhether to do mask on the diagonal in the attention matrix
- set_davg_zero:
- type:
bool, optional, default:Falseargument path:model[standard]/descriptor[se_atten_v2]/set_davg_zeroSet the normalization average to zero. This option should be set when se_atten descriptor or atom_ener in the energy fitting is used
- trainable_ln:
- type:
bool, optional, default:Trueargument path:model[standard]/descriptor[se_atten_v2]/trainable_lnWhether to use trainable shift and scale weights in layer normalization.
- ln_eps:
- type:
float|NoneType, optional, default:Noneargument path:model[standard]/descriptor[se_atten_v2]/ln_epsThe epsilon value for layer normalization. The default value for TensorFlow is set to 1e-3 to keep consistent with keras while set to 1e-5 in PyTorch and DP implementation.
- tebd_dim:
- type:
int, optional, default:8argument path:model[standard]/descriptor[se_atten_v2]/tebd_dim(Supported Backend: PyTorch) The dimension of atom type embedding.
- use_econf_tebd:
- type:
bool, optional, default:Falseargument path:model[standard]/descriptor[se_atten_v2]/use_econf_tebd(Supported Backend: PyTorch) Whether to use electronic configuration type embedding. For TensorFlow backend, please set use_econf_tebd in type_embedding block instead.
- use_tebd_bias:
- type:
bool, optional, default:Falseargument path:model[standard]/descriptor[se_atten_v2]/use_tebd_biasWhether to use bias in the type embedding layer.
- scaling_factor:
- type:
float, optional, default:1.0argument path:model[standard]/descriptor[se_atten_v2]/scaling_factor(Supported Backend: PyTorch) The scaling factor of normalization in calculations of attention weights, which is used to scale the matmul(Q, K). If temperature is None, the scaling of attention weights is (N_hidden_dim * scaling_factor)**0.5. Else, the scaling of attention weights is setting to temperature.
- normalize:
- type:
bool, optional, default:Trueargument path:model[standard]/descriptor[se_atten_v2]/normalize(Supported Backend: PyTorch) Whether to normalize the hidden vectors during attention calculation.
- temperature:
- type:
float, optionalargument path:model[standard]/descriptor[se_atten_v2]/temperature(Supported Backend: PyTorch) The scaling factor of normalization in calculations of attention weights, which is used to scale the matmul(Q, K).
- concat_output_tebd:
- type:
bool, optional, default:Trueargument path:model[standard]/descriptor[se_atten_v2]/concat_output_tebd(Supported Backend: PyTorch) Whether to concat type embedding at the output of the descriptor.
When type is set to
dpa2:(Supported Backend: PyTorch)
- repinit:
- type:
dictargument path:model[standard]/descriptor[dpa2]/repinitThe arguments used to initialize the repinit block.
- rcut:
- type:
floatargument path:model[standard]/descriptor[dpa2]/repinit/rcutThe cut-off radius.
- rcut_smth:
- type:
floatargument path:model[standard]/descriptor[dpa2]/repinit/rcut_smthWhere to start smoothing. For example the 1/r term is smoothed from rcut to rcut_smth.
- nsel:
- type:
intargument path:model[standard]/descriptor[dpa2]/repinit/nselMaximally possible number of selected neighbors.
- neuron:
- type:
list, optional, default:[25, 50, 100]argument path:model[standard]/descriptor[dpa2]/repinit/neuronNumber of neurons in each hidden layers of the embedding net.When two layers are of the same size or one layer is twice as large as the previous layer, a skip connection is built.
- axis_neuron:
- type:
int, optional, default:16argument path:model[standard]/descriptor[dpa2]/repinit/axis_neuronSize of the submatrix of G (embedding matrix).
- tebd_dim:
- type:
int, optional, default:8argument path:model[standard]/descriptor[dpa2]/repinit/tebd_dimThe dimension of atom type embedding.
- tebd_input_mode:
- type:
str, optional, default:concatargument path:model[standard]/descriptor[dpa2]/repinit/tebd_input_modeThe input mode of the type embedding. Supported modes are [‘concat’, ‘strip’].- ‘concat’: Concatenate the type embedding with the smoothed radial information as the union input for the embedding network. When type_one_side is False, the input is input_ij = concat([r_ij, tebd_j, tebd_i]). When type_one_side is True, the input is input_ij = concat([r_ij, tebd_j]). The output is out_ij = embeding(input_ij) for the pair-wise representation of atom i with neighbor j.- ‘strip’: Use a separated embedding network for the type embedding and combine the output with the radial embedding network output. When type_one_side is False, the input is input_t = concat([tebd_j, tebd_i]). (Supported Backend: PyTorch) When type_one_side is True, the input is input_t = tebd_j. The output is out_ij = embeding_t(input_t) * embeding_s(r_ij) + embeding_s(r_ij) for the pair-wise representation of atom i with neighbor j.
- set_davg_zero:
- type:
bool, optional, default:Trueargument path:model[standard]/descriptor[dpa2]/repinit/set_davg_zeroSet the normalization average to zero. This option should be set when atom_ener in the energy fitting is used.
- activation_function:
- type:
str, optional, default:tanhargument path:model[standard]/descriptor[dpa2]/repinit/activation_functionThe activation function in the embedding net. Supported activation functions are “relu”, “tanh”, “softplus”, “linear”, “relu6”, “none”, “gelu”, “gelu_tf”, “sigmoid”..
- type_one_side:
- type:
bool, optional, default:Falseargument path:model[standard]/descriptor[dpa2]/repinit/type_one_sideIf true, the embedding network parameters vary by types of neighbor atoms only, so there will be $N_text{types}$ sets of embedding network parameters. Otherwise, the embedding network parameters vary by types of centric atoms and types of neighbor atoms, so there will be $N_text{types}^2$ sets of embedding network parameters.
- resnet_dt:
- type:
bool, optional, default:Falseargument path:model[standard]/descriptor[dpa2]/repinit/resnet_dtWhether to use a “Timestep” in the skip connection.
- repformer:
- type:
dictargument path:model[standard]/descriptor[dpa2]/repformerThe arguments used to initialize the repformer block.
- rcut:
- type:
floatargument path:model[standard]/descriptor[dpa2]/repformer/rcutThe cut-off radius.
- rcut_smth:
- type:
floatargument path:model[standard]/descriptor[dpa2]/repformer/rcut_smthWhere to start smoothing. For example the 1/r term is smoothed from rcut to rcut_smth.
- nsel:
- type:
intargument path:model[standard]/descriptor[dpa2]/repformer/nselMaximally possible number of selected neighbors.
- nlayers:
- type:
int, optional, default:3argument path:model[standard]/descriptor[dpa2]/repformer/nlayersThe number of repformer layers.
- g1_dim:
- type:
int, optional, default:128argument path:model[standard]/descriptor[dpa2]/repformer/g1_dimThe dimension of invariant single-atom representation.
- g2_dim:
- type:
int, optional, default:16argument path:model[standard]/descriptor[dpa2]/repformer/g2_dimThe dimension of invariant pair-atom representation.
- axis_neuron:
- type:
int, optional, default:4argument path:model[standard]/descriptor[dpa2]/repformer/axis_neuronThe number of dimension of submatrix in the symmetrization ops.
- direct_dist:
- type:
bool, optional, default:Falseargument path:model[standard]/descriptor[dpa2]/repformer/direct_distWhether or not use direct distance as input for the embedding net to get g2 instead of smoothed 1/r.
- update_g1_has_conv:
- type:
bool, optional, default:Trueargument path:model[standard]/descriptor[dpa2]/repformer/update_g1_has_convUpdate the g1 rep with convolution term.
- update_g1_has_drrd:
- type:
bool, optional, default:Trueargument path:model[standard]/descriptor[dpa2]/repformer/update_g1_has_drrdUpdate the g1 rep with the drrd term.
- update_g1_has_grrg:
- type:
bool, optional, default:Trueargument path:model[standard]/descriptor[dpa2]/repformer/update_g1_has_grrgUpdate the g1 rep with the grrg term.
- update_g1_has_attn:
- type:
bool, optional, default:Trueargument path:model[standard]/descriptor[dpa2]/repformer/update_g1_has_attnUpdate the g1 rep with the localized self-attention.
- update_g2_has_g1g1:
- type:
bool, optional, default:Trueargument path:model[standard]/descriptor[dpa2]/repformer/update_g2_has_g1g1Update the g2 rep with the g1xg1 term.
- update_g2_has_attn:
- type:
bool, optional, default:Trueargument path:model[standard]/descriptor[dpa2]/repformer/update_g2_has_attnUpdate the g2 rep with the gated self-attention.
- update_h2:
- type:
bool, optional, default:Falseargument path:model[standard]/descriptor[dpa2]/repformer/update_h2Update the h2 rep.
- attn1_nhead:
- type:
int, optional, default:4argument path:model[standard]/descriptor[dpa2]/repformer/attn1_nheadThe number of heads in localized self-attention to update the g1 rep.
- attn2_nhead:
- type:
int, optional, default:4argument path:model[standard]/descriptor[dpa2]/repformer/attn2_nheadThe number of heads in gated self-attention to update the g2 rep.
- attn2_has_gate:
- type:
bool, optional, default:Falseargument path:model[standard]/descriptor[dpa2]/repformer/attn2_has_gateWhether to use gate in the gated self-attention to update the g2 rep.
- activation_function:
- type:
str, optional, default:tanhargument path:model[standard]/descriptor[dpa2]/repformer/activation_functionThe activation function in the embedding net. Supported activation functions are “relu”, “tanh”, “softplus”, “linear”, “relu6”, “none”, “gelu”, “gelu_tf”, “sigmoid”..
- update_style:
- type:
str, optional, default:res_avgargument path:model[standard]/descriptor[dpa2]/repformer/update_styleStyle to update a representation. Supported options are: -‘res_avg’: Updates a rep u with: u = 1/sqrt{n+1} (u + u_1 + u_2 + … + u_n) -‘res_incr’: Updates a rep u with: u = u + 1/sqrt{n} (u_1 + u_2 + … + u_n)-‘res_residual’: Updates a rep u with: u = u + (r1*u_1 + r2*u_2 + … + r3*u_n) where r1, r2 … r3 are residual weights defined by update_residual and update_residual_init.
- update_residual:
- type:
float, optional, default:0.001argument path:model[standard]/descriptor[dpa2]/repformer/update_residualWhen update using residual mode, the initial std of residual vector weights.
- update_residual_init:
- type:
str, optional, default:normargument path:model[standard]/descriptor[dpa2]/repformer/update_residual_initWhen update using residual mode, the initialization mode of residual vector weights.Supported modes are: [‘norm’, ‘const’].
- set_davg_zero:
- type:
bool, optional, default:Trueargument path:model[standard]/descriptor[dpa2]/repformer/set_davg_zeroSet the normalization average to zero. This option should be set when atom_ener in the energy fitting is used.
- trainable_ln:
- type:
bool, optional, default:Trueargument path:model[standard]/descriptor[dpa2]/repformer/trainable_lnWhether to use trainable shift and scale weights in layer normalization.
- ln_eps:
- type:
float|NoneType, optional, default:Noneargument path:model[standard]/descriptor[dpa2]/repformer/ln_epsThe epsilon value for layer normalization. The default value for TensorFlow is set to 1e-3 to keep consistent with keras while set to 1e-5 in PyTorch and DP implementation.
- concat_output_tebd:
- type:
bool, optional, default:Trueargument path:model[standard]/descriptor[dpa2]/concat_output_tebdWhether to concat type embedding at the output of the descriptor.
- precision:
- type:
str, optional, default:defaultargument path:model[standard]/descriptor[dpa2]/precisionThe precision of the embedding net parameters, supported options are “float32”, “default”, “float16”, “float64”. Default follows the interface precision.
- smooth:
- type:
bool, optional, default:Trueargument path:model[standard]/descriptor[dpa2]/smoothWhether to use smoothness in processes such as attention weights calculation.
- exclude_types:
- type:
typing.List[typing.List[int]], optional, default:[]argument path:model[standard]/descriptor[dpa2]/exclude_typesThe excluded pairs of types which have no interaction with each other. For example, [[0, 1]] means no interaction between type 0 and type 1.
- env_protection:
- type:
float, optional, default:0.0argument path:model[standard]/descriptor[dpa2]/env_protection(Supported Backend: PyTorch) Protection parameter to prevent division by zero errors during environment matrix calculations. For example, when using paddings, there may be zero distances of neighbors, which may make division by zero error during environment matrix calculations without protection.
- trainable:
- type:
bool, optional, default:Trueargument path:model[standard]/descriptor[dpa2]/trainableIf the parameters in the embedding net is trainable.
- seed:
- type:
NoneType|int, optionalargument path:model[standard]/descriptor[dpa2]/seedRandom seed for parameter initialization.
- add_tebd_to_repinit_out:
- type:
bool, optional, default:False, alias: repformer_add_type_ebd_to_seqargument path:model[standard]/descriptor[dpa2]/add_tebd_to_repinit_outAdd type embedding to the output representation from repinit before inputting it into repformer.
- use_econf_tebd:
- type:
bool, optional, default:Falseargument path:model[standard]/descriptor[dpa2]/use_econf_tebd(Supported Backend: PyTorch) Whether to use electronic configuration type embedding.
- use_tebd_bias:
- type:
bool, optional, default:Falseargument path:model[standard]/descriptor[dpa2]/use_tebd_biasWhether to use bias in the type embedding layer.
When type is set to
se_a_ebd_v2(or its aliasse_a_tpe_v2):(Supported Backend: TensorFlow)
- sel:
- type:
typing.List[int]|str, optional, default:autoargument path:model[standard]/descriptor[se_a_ebd_v2]/selThis parameter set the number of selected neighbors for each type of atom. It can be:
List[int]. The length of the list should be the same as the number of atom types in the system. sel[i] gives the selected number of type-i neighbors. sel[i] is recommended to be larger than the maximally possible number of type-i neighbors in the cut-off radius. It is noted that the total sel value must be less than 4096 in a GPU environment.
str. Can be “auto:factor” or “auto”. “factor” is a float number larger than 1. This option will automatically determine the sel. In detail it counts the maximal number of neighbors with in the cutoff radius for each type of neighbor, then multiply the maximum by the “factor”. Finally the number is wraped up to 4 divisible. The option “auto” is equivalent to “auto:1.1”.
- rcut:
- type:
float, optional, default:6.0argument path:model[standard]/descriptor[se_a_ebd_v2]/rcutThe cut-off radius.
- rcut_smth:
- type:
float, optional, default:0.5argument path:model[standard]/descriptor[se_a_ebd_v2]/rcut_smthWhere to start smoothing. For example the 1/r term is smoothed from rcut to rcut_smth
- neuron:
- type:
typing.List[int], optional, default:[10, 20, 40]argument path:model[standard]/descriptor[se_a_ebd_v2]/neuronNumber of neurons in each hidden layers of the embedding net. When two layers are of the same size or one layer is twice as large as the previous layer, a skip connection is built.
- axis_neuron:
- type:
int, optional, default:4, alias: n_axis_neuronargument path:model[standard]/descriptor[se_a_ebd_v2]/axis_neuronSize of the submatrix of G (embedding matrix).
- activation_function:
- type:
str, optional, default:tanhargument path:model[standard]/descriptor[se_a_ebd_v2]/activation_functionThe activation function in the embedding net. Supported activation functions are “relu”, “tanh”, “softplus”, “linear”, “relu6”, “none”, “gelu”, “gelu_tf”, “sigmoid”. Note that “gelu” denotes the custom operator version, and “gelu_tf” denotes the TF standard version. If you set “None” or “none” here, no activation function will be used.
- resnet_dt:
- type:
bool, optional, default:Falseargument path:model[standard]/descriptor[se_a_ebd_v2]/resnet_dtWhether to use a “Timestep” in the skip connection
- type_one_side:
- type:
bool, optional, default:Falseargument path:model[standard]/descriptor[se_a_ebd_v2]/type_one_sideIf true, the embedding network parameters vary by types of neighbor atoms only, so there will be $N_text{types}$ sets of embedding network parameters. Otherwise, the embedding network parameters vary by types of centric atoms and types of neighbor atoms, so there will be $N_text{types}^2$ sets of embedding network parameters.
- precision:
- type:
str, optional, default:defaultargument path:model[standard]/descriptor[se_a_ebd_v2]/precisionThe precision of the embedding net parameters, supported options are “float32”, “default”, “float16”, “float64”. Default follows the interface precision.
- trainable:
- type:
bool, optional, default:Trueargument path:model[standard]/descriptor[se_a_ebd_v2]/trainableIf the parameters in the embedding net is trainable
- seed:
- type:
NoneType|int, optionalargument path:model[standard]/descriptor[se_a_ebd_v2]/seedRandom seed for parameter initialization
- exclude_types:
- type:
typing.List[typing.List[int]], optional, default:[]argument path:model[standard]/descriptor[se_a_ebd_v2]/exclude_typesThe excluded pairs of types which have no interaction with each other. For example, [[0, 1]] means no interaction between type 0 and type 1.
- env_protection:
- type:
float, optional, default:0.0argument path:model[standard]/descriptor[se_a_ebd_v2]/env_protection(Supported Backend: PyTorch) Protection parameter to prevent division by zero errors during environment matrix calculations. For example, when using paddings, there may be zero distances of neighbors, which may make division by zero error during environment matrix calculations without protection.
- set_davg_zero:
- type:
bool, optional, default:Falseargument path:model[standard]/descriptor[se_a_ebd_v2]/set_davg_zeroSet the normalization average to zero. This option should be set when atom_ener in the energy fitting is used
When type is set to
se_a_mask:(Supported Backend: TensorFlow)
- sel:
- type:
typing.List[int]|str, optional, default:autoargument path:model[standard]/descriptor[se_a_mask]/selThis parameter sets the number of selected neighbors for each type of atom. It can be:
List[int]. The length of the list should be the same as the number of atom types in the system. sel[i] gives the selected number of type-i neighbors. sel[i] is recommended to be larger than the maximally possible number of type-i neighbors in the cut-off radius. It is noted that the total sel value must be less than 4096 in a GPU environment.
str. Can be “auto:factor” or “auto”. “factor” is a float number larger than 1. This option will automatically determine the sel. In detail it counts the maximal number of neighbors with in the cutoff radius for each type of neighbor, then multiply the maximum by the “factor”. Finally the number is wraped up to 4 divisible. The option “auto” is equivalent to “auto:1.1”.
- neuron:
- type:
typing.List[int], optional, default:[10, 20, 40]argument path:model[standard]/descriptor[se_a_mask]/neuronNumber of neurons in each hidden layers of the embedding net. When two layers are of the same size or one layer is twice as large as the previous layer, a skip connection is built.
- axis_neuron:
- type:
int, optional, default:4, alias: n_axis_neuronargument path:model[standard]/descriptor[se_a_mask]/axis_neuronSize of the submatrix of G (embedding matrix).
- activation_function:
- type:
str, optional, default:tanhargument path:model[standard]/descriptor[se_a_mask]/activation_functionThe activation function in the embedding net. Supported activation functions are “relu”, “tanh”, “softplus”, “linear”, “relu6”, “none”, “gelu”, “gelu_tf”, “sigmoid”. Note that “gelu” denotes the custom operator version, and “gelu_tf” denotes the TF standard version. If you set “None” or “none” here, no activation function will be used.
- resnet_dt:
- type:
bool, optional, default:Falseargument path:model[standard]/descriptor[se_a_mask]/resnet_dtWhether to use a “Timestep” in the skip connection
- type_one_side:
- type:
bool, optional, default:Falseargument path:model[standard]/descriptor[se_a_mask]/type_one_sideIf true, the embedding network parameters vary by types of neighbor atoms only, so there will be $N_text{types}$ sets of embedding network parameters. Otherwise, the embedding network parameters vary by types of centric atoms and types of neighbor atoms, so there will be $N_text{types}^2$ sets of embedding network parameters.
- exclude_types:
- type:
typing.List[typing.List[int]], optional, default:[]argument path:model[standard]/descriptor[se_a_mask]/exclude_typesThe excluded pairs of types which have no interaction with each other. For example, [[0, 1]] means no interaction between type 0 and type 1.
- precision:
- type:
str, optional, default:defaultargument path:model[standard]/descriptor[se_a_mask]/precisionThe precision of the embedding net parameters, supported options are “float32”, “default”, “float16”, “float64”. Default follows the interface precision.
- trainable:
- type:
bool, optional, default:Trueargument path:model[standard]/descriptor[se_a_mask]/trainableIf the parameters in the embedding net is trainable
- seed:
- type:
NoneType|int, optionalargument path:model[standard]/descriptor[se_a_mask]/seedRandom seed for parameter initialization
- fitting_net:
- type:
dictargument path:model[standard]/fitting_netThe fitting of physical properties.
Depending on the value of type, different sub args are accepted.
- type:
- type:
str(flag key), default:enerargument path:model[standard]/fitting_net/typeThe type of the fitting. See explanation below.
ener: Fit an energy model (potential energy surface).
dos : Fit a density of states model. The total density of states / site-projected density of states labels should be provided by dos.npy or atom_dos.npy in each data system. The file has number of frames lines and number of energy grid columns (times number of atoms in atom_dos.npy). See loss parameter.
dipole: Fit an atomic dipole model. Global dipole labels or atomic dipole labels for all the selected atoms (see sel_type) should be provided by dipole.npy in each data system. The file either has number of frames lines and 3 times of number of selected atoms columns, or has number of frames lines and 3 columns. See loss parameter.
polar: Fit an atomic polarizability model. Global polarizazbility labels or atomic polarizability labels for all the selected atoms (see sel_type) should be provided by polarizability.npy in each data system. The file eith has number of frames lines and 9 times of number of selected atoms columns, or has number of frames lines and 9 columns. See loss parameter.
When type is set to
ener:- numb_fparam:
- type:
int, optional, default:0argument path:model[standard]/fitting_net[ener]/numb_fparamThe dimension of the frame parameter. If set to >0, file fparam.npy should be included to provided the input fparams.
- numb_aparam:
- type:
int, optional, default:0argument path:model[standard]/fitting_net[ener]/numb_aparamThe dimension of the atomic parameter. If set to >0, file aparam.npy should be included to provided the input aparams.
- neuron:
- type:
typing.List[int], optional, default:[120, 120, 120], alias: n_neuronargument path:model[standard]/fitting_net[ener]/neuronThe number of neurons in each hidden layers of the fitting net. When two hidden layers are of the same size, a skip connection is built.
- activation_function:
- type:
str, optional, default:tanhargument path:model[standard]/fitting_net[ener]/activation_functionThe activation function in the fitting net. Supported activation functions are “relu”, “tanh”, “softplus”, “linear”, “relu6”, “none”, “gelu”, “gelu_tf”, “sigmoid”. Note that “gelu” denotes the custom operator version, and “gelu_tf” denotes the TF standard version. If you set “None” or “none” here, no activation function will be used.
- precision:
- type:
str, optional, default:defaultargument path:model[standard]/fitting_net[ener]/precisionThe precision of the fitting net parameters, supported options are “float32”, “default”, “float16”, “float64”. Default follows the interface precision.
- resnet_dt:
- type:
bool, optional, default:Trueargument path:model[standard]/fitting_net[ener]/resnet_dtWhether to use a “Timestep” in the skip connection
- trainable:
- type:
bool|typing.List[bool], optional, default:Trueargument path:model[standard]/fitting_net[ener]/trainableWhether the parameters in the fitting net are trainable. This option can be
bool: True if all parameters of the fitting net are trainable, False otherwise.
list of bool(Supported Backend: TensorFlow) : Specifies if each layer is trainable. Since the fitting net is composed by hidden layers followed by a output layer, the length of this list should be equal to len(neuron)+1.
- rcond:
- type:
float|NoneType, optional, default:Noneargument path:model[standard]/fitting_net[ener]/rcondThe condition number used to determine the inital energy shift for each type of atoms. See rcond in
numpy.linalg.lstsq()for more details.
- seed:
- type:
NoneType|int, optionalargument path:model[standard]/fitting_net[ener]/seedRandom seed for parameter initialization of the fitting net
- atom_ener:
- type:
typing.List[typing.Optional[float]], optional, default:[]argument path:model[standard]/fitting_net[ener]/atom_enerSpecify the atomic energy in vacuum for each type
- layer_name:
- type:
typing.List[str], optionalargument path:model[standard]/fitting_net[ener]/layer_nameThe name of the each layer. The length of this list should be equal to n_neuron + 1. If two layers, either in the same fitting or different fittings, have the same name, they will share the same neural network parameters. The shape of these layers should be the same. If null is given for a layer, parameters will not be shared.
- use_aparam_as_mask:
- type:
bool, optional, default:Falseargument path:model[standard]/fitting_net[ener]/use_aparam_as_maskWhether to use the aparam as a mask in input.If True, the aparam will not be used in fitting net for embedding.When descrpt is se_a_mask, the aparam will be used as a mask to indicate the input atom is real/virtual. And use_aparam_as_mask should be set to True.
When type is set to
dos:- numb_fparam:
- type:
int, optional, default:0argument path:model[standard]/fitting_net[dos]/numb_fparamThe dimension of the frame parameter. If set to >0, file fparam.npy should be included to provided the input fparams.
- numb_aparam:
- type:
int, optional, default:0argument path:model[standard]/fitting_net[dos]/numb_aparamThe dimension of the atomic parameter. If set to >0, file aparam.npy should be included to provided the input aparams.
- neuron:
- type:
typing.List[int], optional, default:[120, 120, 120]argument path:model[standard]/fitting_net[dos]/neuronThe number of neurons in each hidden layers of the fitting net. When two hidden layers are of the same size, a skip connection is built.
- activation_function:
- type:
str, optional, default:tanhargument path:model[standard]/fitting_net[dos]/activation_functionThe activation function in the fitting net. Supported activation functions are “relu”, “tanh”, “softplus”, “linear”, “relu6”, “none”, “gelu”, “gelu_tf”, “sigmoid”. Note that “gelu” denotes the custom operator version, and “gelu_tf” denotes the TF standard version. If you set “None” or “none” here, no activation function will be used.
- precision:
- type:
str, optional, default:float64argument path:model[standard]/fitting_net[dos]/precisionThe precision of the fitting net parameters, supported options are “float32”, “default”, “float16”, “float64”. Default follows the interface precision.
- resnet_dt:
- type:
bool, optional, default:Trueargument path:model[standard]/fitting_net[dos]/resnet_dtWhether to use a “Timestep” in the skip connection
- trainable:
- type:
bool|typing.List[bool], optional, default:Trueargument path:model[standard]/fitting_net[dos]/trainableWhether the parameters in the fitting net are trainable. This option can be
bool: True if all parameters of the fitting net are trainable, False otherwise.
list of bool: Specifies if each layer is trainable. Since the fitting net is composed by hidden layers followed by a output layer, the length of tihs list should be equal to len(neuron)+1.
- rcond:
- type:
float|NoneType, optional, default:Noneargument path:model[standard]/fitting_net[dos]/rcondThe condition number used to determine the inital energy shift for each type of atoms. See rcond in
numpy.linalg.lstsq()for more details.
- seed:
- type:
NoneType|int, optionalargument path:model[standard]/fitting_net[dos]/seedRandom seed for parameter initialization of the fitting net
- numb_dos:
- type:
int, optional, default:300argument path:model[standard]/fitting_net[dos]/numb_dosThe number of gridpoints on which the DOS is evaluated (NEDOS in VASP)
When type is set to
polar:- neuron:
- type:
typing.List[int], optional, default:[120, 120, 120], alias: n_neuronargument path:model[standard]/fitting_net[polar]/neuronThe number of neurons in each hidden layers of the fitting net. When two hidden layers are of the same size, a skip connection is built.
- activation_function:
- type:
str, optional, default:tanhargument path:model[standard]/fitting_net[polar]/activation_functionThe activation function in the fitting net. Supported activation functions are “relu”, “tanh”, “softplus”, “linear”, “relu6”, “none”, “gelu”, “gelu_tf”, “sigmoid”. Note that “gelu” denotes the custom operator version, and “gelu_tf” denotes the TF standard version. If you set “None” or “none” here, no activation function will be used.
- resnet_dt:
- type:
bool, optional, default:Trueargument path:model[standard]/fitting_net[polar]/resnet_dtWhether to use a “Timestep” in the skip connection
- precision:
- type:
str, optional, default:defaultargument path:model[standard]/fitting_net[polar]/precisionThe precision of the fitting net parameters, supported options are “float32”, “default”, “float16”, “float64”. Default follows the interface precision.
- fit_diag:
- type:
bool, optional, default:Trueargument path:model[standard]/fitting_net[polar]/fit_diagFit the diagonal part of the rotational invariant polarizability matrix, which will be converted to normal polarizability matrix by contracting with the rotation matrix.
- scale:
- type:
float|typing.List[float], optional, default:1.0argument path:model[standard]/fitting_net[polar]/scaleThe output of the fitting net (polarizability matrix) will be scaled by
scale
- shift_diag:
- type:
bool, optional, default:Trueargument path:model[standard]/fitting_net[polar]/shift_diagWhether to shift the diagonal of polar, which is beneficial to training. Default is true.
- sel_type:
- type:
NoneType|typing.List[int]|int, optional, alias: pol_typeargument path:model[standard]/fitting_net[polar]/sel_typeThe atom types for which the atomic polarizability will be provided. If not set, all types will be selected.(Supported Backend: TensorFlow)
- seed:
- type:
NoneType|int, optionalargument path:model[standard]/fitting_net[polar]/seedRandom seed for parameter initialization of the fitting net
When type is set to
dipole:- neuron:
- type:
typing.List[int], optional, default:[120, 120, 120], alias: n_neuronargument path:model[standard]/fitting_net[dipole]/neuronThe number of neurons in each hidden layers of the fitting net. When two hidden layers are of the same size, a skip connection is built.
- activation_function:
- type:
str, optional, default:tanhargument path:model[standard]/fitting_net[dipole]/activation_functionThe activation function in the fitting net. Supported activation functions are “relu”, “tanh”, “softplus”, “linear”, “relu6”, “none”, “gelu”, “gelu_tf”, “sigmoid”. Note that “gelu” denotes the custom operator version, and “gelu_tf” denotes the TF standard version. If you set “None” or “none” here, no activation function will be used.
- resnet_dt:
- type:
bool, optional, default:Trueargument path:model[standard]/fitting_net[dipole]/resnet_dtWhether to use a “Timestep” in the skip connection
- precision:
- type:
str, optional, default:defaultargument path:model[standard]/fitting_net[dipole]/precisionThe precision of the fitting net parameters, supported options are “float32”, “default”, “float16”, “float64”. Default follows the interface precision.
- sel_type:
- type:
NoneType|typing.List[int]|int, optional, alias: dipole_typeargument path:model[standard]/fitting_net[dipole]/sel_typeThe atom types for which the atomic dipole will be provided. If not set, all types will be selected.(Supported Backend: TensorFlow)
- seed:
- type:
NoneType|int, optionalargument path:model[standard]/fitting_net[dipole]/seedRandom seed for parameter initialization of the fitting net
When type is set to
frozen:- model_file:
- type:
strargument path:model[frozen]/model_filePath to the frozen model file.
When type is set to
pairtab:(Supported Backend: TensorFlow) Pairwise tabulation energy model.
- tab_file:
- type:
strargument path:model[pairtab]/tab_filePath to the tabulation file.
- rcut:
- type:
floatargument path:model[pairtab]/rcutThe cut-off radius.
- sel:
- type:
typing.List[int]|str|intargument path:model[pairtab]/selThis parameter set the number of selected neighbors. Note that this parameter is a little different from that in other descriptors. Instead of separating each type of atoms, only the summation matters. And this number is highly related with the efficiency, thus one should not make it too large. Usually 200 or less is enough, far away from the GPU limitation 4096. It can be:
int. The maximum number of neighbor atoms to be considered. We recommend it to be less than 200.
List[int]. The length of the list should be the same as the number of atom types in the system. sel[i] gives the selected number of type-i neighbors. Only the summation of sel[i] matters, and it is recommended to be less than 200. - str. Can be “auto:factor” or “auto”. “factor” is a float number larger than 1. This option will automatically determine the sel. In detail it counts the maximal number of neighbors with in the cutoff radius for each type of neighbor, then multiply the maximum by the “factor”. Finally the number is wraped up to 4 divisible. The option “auto” is equivalent to “auto:1.1”.
When type is set to
pairwise_dprc:(Supported Backend: TensorFlow)
- qm_model:
- type:
dictargument path:model[pairwise_dprc]/qm_model
- qmmm_model:
- type:
dictargument path:model[pairwise_dprc]/qmmm_model
When type is set to
linear_ener:(Supported Backend: TensorFlow)
- models:
- type:
dict|listargument path:model[linear_ener]/modelsThe sub-models.
- weights:
- type:
list|strargument path:model[linear_ener]/weightsIf the type is list of float, a list of weights for each model. If “mean”, the weights are set to be 1 / len(models). If “sum”, the weights are set to be 1.
- learning_rate:
- type:
dict, optionalargument path:learning_rateThe definitio of learning rate
- scale_by_worker:
- type:
str, optional, default:linearargument path:learning_rate/scale_by_workerWhen parallel training or batch size scaled, how to alter learning rate. Valid values are linear`(default), `sqrt or none.
Depending on the value of type, different sub args are accepted.
- type:
The type of the learning rate.
When type is set to
exp:- start_lr:
- type:
float, optional, default:0.001argument path:learning_rate[exp]/start_lrThe learning rate at the start of the training.
- stop_lr:
- type:
float, optional, default:1e-08argument path:learning_rate[exp]/stop_lrThe desired learning rate at the end of the training. When decay_rate (Supported Backend: PyTorch) is explicitly set, this value will serve as the minimum learning rate during training. In other words, if the learning rate decays below stop_lr, stop_lr will be applied instead.
- decay_steps:
- type:
int, optional, default:5000argument path:learning_rate[exp]/decay_stepsThe learning rate is decaying every this number of training steps.
- decay_rate:
- type:
float|NoneType, optional, default:Noneargument path:learning_rate[exp]/decay_rate(Supported Backend: PyTorch) The decay rate for the learning rate. If this is provided, it will be used directly as the decay rate for learning rate instead of calculating it through interpolation between start_lr and stop_lr.
- loss:
- type:
dict, optionalargument path:lossThe definition of loss function. The loss type should be set to tensor, ener or left unset.
Depending on the value of type, different sub args are accepted.
- type:
When type is set to
ener:- start_pref_e:
- type:
float|int, optional, default:0.02argument path:loss[ener]/start_pref_eThe prefactor of energy loss at the start of the training. Should be larger than or equal to 0. If set to none-zero value, the energy label should be provided by file energy.npy in each data system. If both start_pref_e and limit_pref_e are set to 0, then the energy will be ignored.
- limit_pref_e:
- type:
float|int, optional, default:1.0argument path:loss[ener]/limit_pref_eThe prefactor of energy loss at the limit of the training, Should be larger than or equal to 0. i.e. the training step goes to infinity.
- start_pref_f:
- type:
float|int, optional, default:1000argument path:loss[ener]/start_pref_fThe prefactor of force loss at the start of the training. Should be larger than or equal to 0. If set to none-zero value, the force label should be provided by file force.npy in each data system. If both start_pref_f and limit_pref_f are set to 0, then the force will be ignored.
- limit_pref_f:
- type:
float|int, optional, default:1.0argument path:loss[ener]/limit_pref_fThe prefactor of force loss at the limit of the training, Should be larger than or equal to 0. i.e. the training step goes to infinity.
- start_pref_v:
- type:
float|int, optional, default:0.0argument path:loss[ener]/start_pref_vThe prefactor of virial loss at the start of the training. Should be larger than or equal to 0. If set to none-zero value, the virial label should be provided by file virial.npy in each data system. If both start_pref_v and limit_pref_v are set to 0, then the virial will be ignored.
- limit_pref_v:
- type:
float|int, optional, default:0.0argument path:loss[ener]/limit_pref_vThe prefactor of virial loss at the limit of the training, Should be larger than or equal to 0. i.e. the training step goes to infinity.
- start_pref_ae:
- type:
float|int, optional, default:0.0argument path:loss[ener]/start_pref_aeThe prefactor of atomic energy loss at the start of the training. Should be larger than or equal to 0. If set to none-zero value, the atom_ener label should be provided by file atom_ener.npy in each data system. If both start_pref_ae and limit_pref_ae are set to 0, then the atomic energy will be ignored.
- limit_pref_ae:
- type:
float|int, optional, default:0.0argument path:loss[ener]/limit_pref_aeThe prefactor of atomic energy loss at the limit of the training, Should be larger than or equal to 0. i.e. the training step goes to infinity.
- start_pref_pf:
- type:
float|int, optional, default:0.0argument path:loss[ener]/start_pref_pfThe prefactor of atomic prefactor force loss at the start of the training. Should be larger than or equal to 0. If set to none-zero value, the atom_pref label should be provided by file atom_pref.npy in each data system. If both start_pref_pf and limit_pref_pf are set to 0, then the atomic prefactor force will be ignored.
- limit_pref_pf:
- type:
float|int, optional, default:0.0argument path:loss[ener]/limit_pref_pfThe prefactor of atomic prefactor force loss at the limit of the training, Should be larger than or equal to 0. i.e. the training step goes to infinity.
- relative_f:
- type:
float|NoneType, optionalargument path:loss[ener]/relative_fIf provided, relative force error will be used in the loss. The difference of force will be normalized by the magnitude of the force in the label with a shift given by relative_f, i.e. DF_i / ( || F || + relative_f ) with DF denoting the difference between prediction and label and || F || denoting the L2 norm of the label.
- enable_atom_ener_coeff:
- type:
bool, optional, default:Falseargument path:loss[ener]/enable_atom_ener_coeffIf true, the energy will be computed as sum_i c_i E_i. c_i should be provided by file atom_ener_coeff.npy in each data system, otherwise it’s 1.
- start_pref_gf:
- type:
float, optional, default:0.0argument path:loss[ener]/start_pref_gfThe prefactor of generalized force loss at the start of the training. Should be larger than or equal to 0. If set to none-zero value, the drdq label should be provided by file drdq.npy in each data system. If both start_pref_gf and limit_pref_gf are set to 0, then the generalized force will be ignored.
- limit_pref_gf:
- type:
float, optional, default:0.0argument path:loss[ener]/limit_pref_gfThe prefactor of generalized force loss at the limit of the training, Should be larger than or equal to 0. i.e. the training step goes to infinity.
- numb_generalized_coord:
- type:
int, optional, default:0argument path:loss[ener]/numb_generalized_coordThe dimension of generalized coordinates. Required when generalized force loss is used.
When type is set to
ener_spin:- start_pref_e:
- type:
float|int, optional, default:0.02argument path:loss[ener_spin]/start_pref_eThe prefactor of energy loss at the start of the training. Should be larger than or equal to 0. If set to none-zero value, the energy label should be provided by file energy.npy in each data system. If both start_pref_energy and limit_pref_energy are set to 0, then the energy will be ignored.
- limit_pref_e:
- type:
float|int, optional, default:1.0argument path:loss[ener_spin]/limit_pref_eThe prefactor of energy loss at the limit of the training, Should be larger than or equal to 0. i.e. the training step goes to infinity.
- start_pref_fr:
- type:
float|int, optional, default:1000argument path:loss[ener_spin]/start_pref_frThe prefactor of force_real_atom loss at the start of the training. Should be larger than or equal to 0. If set to none-zero value, the force_real_atom label should be provided by file force_real_atom.npy in each data system. If both start_pref_force_real_atom and limit_pref_force_real_atom are set to 0, then the force_real_atom will be ignored.
- limit_pref_fr:
- type:
float|int, optional, default:1.0argument path:loss[ener_spin]/limit_pref_frThe prefactor of force_real_atom loss at the limit of the training, Should be larger than or equal to 0. i.e. the training step goes to infinity.
- start_pref_fm:
- type:
float|int, optional, default:10000argument path:loss[ener_spin]/start_pref_fmThe prefactor of force_magnetic loss at the start of the training. Should be larger than or equal to 0. If set to none-zero value, the force_magnetic label should be provided by file force_magnetic.npy in each data system. If both start_pref_force_magnetic and limit_pref_force_magnetic are set to 0, then the force_magnetic will be ignored.
- limit_pref_fm:
- type:
float|int, optional, default:10.0argument path:loss[ener_spin]/limit_pref_fmThe prefactor of force_magnetic loss at the limit of the training, Should be larger than or equal to 0. i.e. the training step goes to infinity.
- start_pref_v:
- type:
float|int, optional, default:0.0argument path:loss[ener_spin]/start_pref_vThe prefactor of virial loss at the start of the training. Should be larger than or equal to 0. If set to none-zero value, the virial label should be provided by file virial.npy in each data system. If both start_pref_virial and limit_pref_virial are set to 0, then the virial will be ignored.
- limit_pref_v:
- type:
float|int, optional, default:0.0argument path:loss[ener_spin]/limit_pref_vThe prefactor of virial loss at the limit of the training, Should be larger than or equal to 0. i.e. the training step goes to infinity.
- start_pref_ae:
- type:
float|int, optional, default:0.0argument path:loss[ener_spin]/start_pref_aeThe prefactor of atom_ener loss at the start of the training. Should be larger than or equal to 0. If set to none-zero value, the atom_ener label should be provided by file atom_ener.npy in each data system. If both start_pref_atom_ener and limit_pref_atom_ener are set to 0, then the atom_ener will be ignored.
- limit_pref_ae:
- type:
float|int, optional, default:0.0argument path:loss[ener_spin]/limit_pref_aeThe prefactor of atom_ener loss at the limit of the training, Should be larger than or equal to 0. i.e. the training step goes to infinity.
- start_pref_pf:
- type:
float|int, optional, default:0.0argument path:loss[ener_spin]/start_pref_pfThe prefactor of atom_pref loss at the start of the training. Should be larger than or equal to 0. If set to none-zero value, the atom_pref label should be provided by file atom_pref.npy in each data system. If both start_pref_atom_pref and limit_pref_atom_pref are set to 0, then the atom_pref will be ignored.
- limit_pref_pf:
- type:
float|int, optional, default:0.0argument path:loss[ener_spin]/limit_pref_pfThe prefactor of atom_pref loss at the limit of the training, Should be larger than or equal to 0. i.e. the training step goes to infinity.
- relative_f:
- type:
float|NoneType, optionalargument path:loss[ener_spin]/relative_fIf provided, relative force error will be used in the loss. The difference of force will be normalized by the magnitude of the force in the label with a shift given by relative_f, i.e. DF_i / ( || F || + relative_f ) with DF denoting the difference between prediction and label and || F || denoting the L2 norm of the label.
- enable_atom_ener_coeff:
- type:
bool, optional, default:Falseargument path:loss[ener_spin]/enable_atom_ener_coeffIf true, the energy will be computed as sum_i c_i E_i. c_i should be provided by file atom_ener_coeff.npy in each data system, otherwise it’s 1.
When type is set to
dos:- start_pref_dos:
- type:
float|int, optional, default:0.0argument path:loss[dos]/start_pref_dosThe prefactor of Density of State (DOS) loss at the start of the training. Should be larger than or equal to 0. If set to none-zero value, the Density of State (DOS) label should be provided by file Density of State (DOS).npy in each data system. If both start_pref_Density of State (DOS) and limit_pref_Density of State (DOS) are set to 0, then the Density of State (DOS) will be ignored.
- limit_pref_dos:
- type:
float|int, optional, default:0.0argument path:loss[dos]/limit_pref_dosThe prefactor of Density of State (DOS) loss at the limit of the training, Should be larger than or equal to 0. i.e. the training step goes to infinity.
- start_pref_cdf:
- type:
float|int, optional, default:0.0argument path:loss[dos]/start_pref_cdfThe prefactor of Cumulative Distribution Function (cumulative intergral of DOS) loss at the start of the training. Should be larger than or equal to 0. If set to none-zero value, the Cumulative Distribution Function (cumulative intergral of DOS) label should be provided by file Cumulative Distribution Function (cumulative intergral of DOS).npy in each data system. If both start_pref_Cumulative Distribution Function (cumulative intergral of DOS) and limit_pref_Cumulative Distribution Function (cumulative intergral of DOS) are set to 0, then the Cumulative Distribution Function (cumulative intergral of DOS) will be ignored.
- limit_pref_cdf:
- type:
float|int, optional, default:0.0argument path:loss[dos]/limit_pref_cdfThe prefactor of Cumulative Distribution Function (cumulative intergral of DOS) loss at the limit of the training, Should be larger than or equal to 0. i.e. the training step goes to infinity.
- start_pref_ados:
- type:
float|int, optional, default:1.0argument path:loss[dos]/start_pref_adosThe prefactor of atomic DOS (site-projected DOS) loss at the start of the training. Should be larger than or equal to 0. If set to none-zero value, the atomic DOS (site-projected DOS) label should be provided by file atomic DOS (site-projected DOS).npy in each data system. If both start_pref_atomic DOS (site-projected DOS) and limit_pref_atomic DOS (site-projected DOS) are set to 0, then the atomic DOS (site-projected DOS) will be ignored.
- limit_pref_ados:
- type:
float|int, optional, default:1.0argument path:loss[dos]/limit_pref_adosThe prefactor of atomic DOS (site-projected DOS) loss at the limit of the training, Should be larger than or equal to 0. i.e. the training step goes to infinity.
- start_pref_acdf:
- type:
float|int, optional, default:0.0argument path:loss[dos]/start_pref_acdfThe prefactor of Cumulative integral of atomic DOS loss at the start of the training. Should be larger than or equal to 0. If set to none-zero value, the Cumulative integral of atomic DOS label should be provided by file Cumulative integral of atomic DOS.npy in each data system. If both start_pref_Cumulative integral of atomic DOS and limit_pref_Cumulative integral of atomic DOS are set to 0, then the Cumulative integral of atomic DOS will be ignored.
- limit_pref_acdf:
- type:
float|int, optional, default:0.0argument path:loss[dos]/limit_pref_acdfThe prefactor of Cumulative integral of atomic DOS loss at the limit of the training, Should be larger than or equal to 0. i.e. the training step goes to infinity.
When type is set to
tensor:- pref:
- type:
float|intargument path:loss[tensor]/prefThe prefactor of the weight of global loss. It should be larger than or equal to 0. If controls the weight of loss corresponding to global label, i.e. ‘polarizability.npy` or dipole.npy, whose shape should be #frames x [9 or 3]. If it’s larger than 0.0, this npy should be included.
- pref_atomic:
- type:
float|intargument path:loss[tensor]/pref_atomicThe prefactor of the weight of atomic loss. It should be larger than or equal to 0. If controls the weight of loss corresponding to atomic label, i.e. atomic_polarizability.npy or atomic_dipole.npy, whose shape should be #frames x ([9 or 3] x #selected atoms). If it’s larger than 0.0, this npy should be included. Both pref and pref_atomic should be provided, and either can be set to 0.0.
- training:
- type:
dictargument path:trainingThe training options.
- training_data:
- type:
dict, optionalargument path:training/training_dataConfigurations of training data.
- systems:
- type:
typing.List[str]|strargument path:training/training_data/systemsThe data systems for training. This key can be provided with a list that specifies the systems, or be provided with a string by which the prefix of all systems are given and the list of the systems is automatically generated.
- batch_size:
- type:
typing.List[int]|str|int, optional, default:autoargument path:training/training_data/batch_sizeThis key can be
list: the length of which is the same as the systems. The batch size of each system is given by the elements of the list.
int: all systems use the same batch size.
string “auto”: automatically determines the batch size so that the batch_size times the number of atoms in the system is no less than 32.
string “auto:N”: automatically determines the batch size so that the batch_size times the number of atoms in the system is no less than N.
string “mixed:N”: the batch data will be sampled from all systems and merged into a mixed system with the batch size N. Only support the se_atten descriptor for Tensorflow backend.
If MPI is used, the value should be considered as the batch size per task.
- auto_prob:
- type:
str, optional, default:prob_sys_size, alias: auto_prob_styleargument path:training/training_data/auto_probDetermine the probability of systems automatically. The method is assigned by this key and can be
“prob_uniform” : the probability all the systems are equal, namely 1.0/self.get_nsystems()
“prob_sys_size” : the probability of a system is proportional to the number of batches in the system
“prob_sys_size;stt_idx:end_idx:weight;stt_idx:end_idx:weight;…” : the list of systems is devided into blocks. A block is specified by stt_idx:end_idx:weight, where stt_idx is the starting index of the system, end_idx is then ending (not including) index of the system, the probabilities of the systems in this block sums up to weight, and the relatively probabilities within this block is proportional to the number of batches in the system.
- sys_probs:
- type:
NoneType|typing.List[float], optional, default:None, alias: sys_weightsargument path:training/training_data/sys_probsA list of float if specified. Should be of the same length as systems, specifying the probability of each system.
- validation_data:
- type:
NoneType|dict, optional, default:Noneargument path:training/validation_dataConfigurations of validation data. Similar to that of training data, except that a numb_btch argument may be configured
- systems:
- type:
typing.List[str]|strargument path:training/validation_data/systemsThe data systems for validation. This key can be provided with a list that specifies the systems, or be provided with a string by which the prefix of all systems are given and the list of the systems is automatically generated.
- batch_size:
- type:
typing.List[int]|str|int, optional, default:autoargument path:training/validation_data/batch_sizeThis key can be
list: the length of which is the same as the systems. The batch size of each system is given by the elements of the list.
int: all systems use the same batch size.
string “auto”: automatically determines the batch size so that the batch_size times the number of atoms in the system is no less than 32.
string “auto:N”: automatically determines the batch size so that the batch_size times the number of atoms in the system is no less than N.
- auto_prob:
- type:
str, optional, default:prob_sys_size, alias: auto_prob_styleargument path:training/validation_data/auto_probDetermine the probability of systems automatically. The method is assigned by this key and can be
“prob_uniform” : the probability all the systems are equal, namely 1.0/self.get_nsystems()
“prob_sys_size” : the probability of a system is proportional to the number of batches in the system
“prob_sys_size;stt_idx:end_idx:weight;stt_idx:end_idx:weight;…” : the list of systems is devided into blocks. A block is specified by stt_idx:end_idx:weight, where stt_idx is the starting index of the system, end_idx is then ending (not including) index of the system, the probabilities of the systems in this block sums up to weight, and the relatively probabilities within this block is proportional to the number of batches in the system.
- sys_probs:
- type:
NoneType|typing.List[float], optional, default:None, alias: sys_weightsargument path:training/validation_data/sys_probsA list of float if specified. Should be of the same length as systems, specifying the probability of each system.
- numb_btch:
- type:
int, optional, default:1, alias: numb_batchargument path:training/validation_data/numb_btchAn integer that specifies the number of batches to be sampled for each validation period.
- stat_file:
- type:
str, optionalargument path:training/stat_file(Supported Backend: PyTorch) The file path for saving the data statistics results. If set, the results will be saved and directly loaded during the next training session, avoiding the need to recalculate the statistics. If the file extension is .h5 or .hdf5, an HDF5 file is used to store the statistics; otherwise, a directory containing NumPy binary files are used.
- mixed_precision:
- type:
dict, optionalargument path:training/mixed_precisionConfigurations of mixed precision.
- output_prec:
- type:
str, optional, default:float32argument path:training/mixed_precision/output_precThe precision for mixed precision params. “ “The trainable variables precision during the mixed precision training process, “ “supported options are float32 only currently.
- compute_prec:
- type:
strargument path:training/mixed_precision/compute_precThe precision for mixed precision compute. “ “The compute precision during the mixed precision training process, “” “supported options are float16 and bfloat16 currently.
- numb_steps:
- type:
int, alias: stop_batchargument path:training/numb_stepsNumber of training batch. Each training uses one batch of data.
- seed:
- type:
NoneType|int, optionalargument path:training/seedThe random seed for getting frames from the training data set.
- disp_file:
- type:
str, optional, default:lcurve.outargument path:training/disp_fileThe file for printing learning curve.
- disp_freq:
- type:
int, optional, default:1000argument path:training/disp_freqThe frequency of printing learning curve.
- save_freq:
- type:
int, optional, default:1000argument path:training/save_freqThe frequency of saving check point.
- save_ckpt:
- type:
str, optional, default:model.ckptargument path:training/save_ckptThe path prefix of saving check point files.
- max_ckpt_keep:
- type:
int, optional, default:5argument path:training/max_ckpt_keepThe maximum number of checkpoints to keep. The oldest checkpoints will be deleted once the number of checkpoints exceeds max_ckpt_keep. Defaults to 5.
- change_bias_after_training:
- type:
bool, optional, default:Falseargument path:training/change_bias_after_trainingWhether to change the output bias after the last training step, by performing predictions using trained model on training data and doing least square on the errors to add the target shift on the bias.
- disp_training:
- type:
bool, optional, default:Trueargument path:training/disp_trainingDisplaying verbose information during training.
- time_training:
- type:
bool, optional, default:Trueargument path:training/time_trainingTiming durining training.
- profiling:
- type:
bool, optional, default:Falseargument path:training/profilingExport the profiling results to the Chrome JSON file for performance analysis, driven by the legacy TensorFlow profiling API or PyTorch Profiler. The output file will be saved to profiling_file.
- profiling_file:
- type:
str, optional, default:timeline.jsonargument path:training/profiling_fileOutput file for profiling.
- enable_profiler:
- type:
bool, optional, default:Falseargument path:training/enable_profilerExport the profiling results to the TensorBoard log for performance analysis, driven by TensorFlow Profiler (available in TensorFlow 2.3) or PyTorch Profiler. The log will be saved to tensorboard_log_dir.
- tensorboard:
- type:
bool, optional, default:Falseargument path:training/tensorboardEnable tensorboard
- tensorboard_log_dir:
- type:
str, optional, default:logargument path:training/tensorboard_log_dirThe log directory of tensorboard outputs
- tensorboard_freq:
- type:
int, optional, default:1argument path:training/tensorboard_freqThe frequency of writing tensorboard events.
- warmup_steps:
- type:
int, optionalargument path:training/warmup_steps(Supported Backend: PyTorch) The number of steps for learning rate warmup. During warmup, the learning rate begins at zero and progressively increases linearly to start_lr, rather than starting directly from start_lr
- gradient_max_norm:
- type:
float, optionalargument path:training/gradient_max_norm(Supported Backend: PyTorch) Clips the gradient norm to a maximum value. If the gradient norm exceeds this value, it will be clipped to this limit. No gradient clipping will occur if set to 0.
Depending on the value of opt_type, different sub args are accepted.
- opt_type:
When opt_type is set to
Adam:When opt_type is set to
LKF:- kf_blocksize:
- type:
int, optionalargument path:training[LKF]/kf_blocksize(Supported Backend: PyTorch) The blocksize for the Kalman filter.
- nvnmd:
- type:
dict, optionalargument path:nvnmdThe nvnmd options.
- version:
- type:
intargument path:nvnmd/versionconfiguration the nvnmd version (0 | 1), 0 for 4 types, 1 for 32 types
- max_nnei:
- type:
intargument path:nvnmd/max_nneiconfiguration the max number of neighbors, 128|256 for version 0, 128 for version 1
- net_size:
- type:
intargument path:nvnmd/net_sizeconfiguration the number of nodes of fitting_net, just can be set as 128
- map_file:
- type:
strargument path:nvnmd/map_fileA file containing the mapping tables to replace the calculation of embedding nets
- config_file:
- type:
strargument path:nvnmd/config_fileA file containing the parameters about how to implement the model in certain hardware
- weight_file:
- type:
strargument path:nvnmd/weight_filea *.npy file containing the weights of the model
- enable:
- type:
boolargument path:nvnmd/enableenable the nvnmd training
- restore_descriptor:
- type:
boolargument path:nvnmd/restore_descriptorenable to restore the parameter of embedding_net from weight.npy
- restore_fitting_net:
- type:
boolargument path:nvnmd/restore_fitting_netenable to restore the parameter of fitting_net from weight.npy
- quantize_descriptor:
- type:
boolargument path:nvnmd/quantize_descriptorenable the quantizatioin of descriptor
- quantize_fitting_net:
- type:
boolargument path:nvnmd/quantize_fitting_netenable the quantizatioin of fitting_net
5.3.1. Writing JSON files using Visual Studio Code
When writing JSON files using Visual Studio Code, one can benefit from IntelliSense and validation by adding a JSON schema. To do so, in a VS Code workspace, one can generate a JSON schema file for the input file by running the following command:
dp doc-train-input --out-type json_schema > deepmd.json
Then one can map the schema by updating the workspace settings in the .vscode/settings.json file as follows:
{
"json.schemas": [
{
"fileMatch": [
"/**/*.json"
],
"url": "./deepmd.json"
}
]
}