All descriptors require to set
sel, which means the expected maximum number of type-i neighbors of an atom. DeePMD-kit will allocate memory according to
sel should not be too large or too small. If
sel is too large, the computing will become much slower and cost more memory. If
sel is not enough, the energy will be not conserved, making the accuracy of the model worse.
To determine a proper
sel, one can calculate the neighbor stat of the training data before training:
dp neighbor-stat -s data -r 6.0 -t O H
data is the directory of data,
6.0 is the cutoff radius, and
H is the type map. The program will give the
max_nbor_size. For example,
max_nbor_size of the water example is
[38, 72], meaning an atom may have 38 O neighbors and 72 H neighbors in the training data.
sel should be set to a higher value than that of the training data, considering there may be some extreme geometries during MD simulations. As a result, we set to
[46, 92] in the water example.