Backend

1. Backend#

1.1. Supported backends#

DeePMD-kit supports multiple backends: TensorFlow and PyTorch. To use DeePMD-kit, you must install at least one backend. Each backend does not support all features. In the documentation, TensorFlow , PyTorch , and Paddle icons are used to mark whether a backend supports a feature.

1.1.1. TensorFlow #

Model filename extension: .pb
Checkpoint filename extension: .meta, .index, .data-00000-of-00001

TensorFlow 2.8 is the first version to support Python 3.10. DeePMD-kit does not use the TensorFlow v2 API but uses the TensorFlow v1 API (tf.compat.v1) in the graph mode.

1.1.2. PyTorch #

Model filename extension: .pth
Checkpoint filename extension: .pt

PyTorch 2.1 or above is required. While .pth and .pt are the same in the PyTorch package, they have different meanings in the DeePMD-kit to distinguish the model and the checkpoint.

1.1.3. JAX #

Model filename extension: .xlo, .savedmodel
Checkpoint filename extension: .jax

JAX 0.4.33 or above is required. Both .xlo and .jax are customized format extensions defined in DeePMD-kit, since JAX has no convention for file extensions. .savedmodel is the TensorFlow SavedModel format generated by JAX2TF, which needs the installation of TensorFlow. Only the .savedmodel format supports C++ inference, which needs the TensorFlow C++ interface. The model is device-specific, so that the model generated on the GPU device cannot be run on the CPUs.

1.1.4. Paddle #

Model filename extensions: .json and .pdiparams
Checkpoint filename extension: .pd

Paddle version 3.0 or above is required.

The .pd extension is used for model checkpoint storage, which is commonly utilized during training and testing in Python. The .json extension is for the model’s computational graph in PIR representation, while the .pdiparams extension stores model parameters. Both .json and .pdiparams files are exported together and used in model freezing and Python/C++ inference.

1.1.5. DP #

Note

This backend is only for development and should not take into production.

Model filename extension: .dp, .yaml, .yml

DP is a reference backend for development, which uses pure NumPy to implement models without using any heavy deep-learning frameworks. Due to the limitation of NumPy, it doesn’t support gradient calculation and thus cannot be used for training. As a reference backend, it is not aimed at the best performance, but only the correct results. The DP backend has two formats, both of which are backend-independent: The .dp format uses HDF5 to store model serialization data, which has good performance. The .yaml or .yml use YAML to save the data as plain texts, which is easy to read for human beings. Only Python inference interface can load these formats.

NumPy 1.21 or above is required.

1.2. Switch the backend#

1.2.1. Training#

When training and freezing a model, you can use dp --tf, dp --pt or dp --pd in the command line to switch the backend.

1.2.2. Inference#

When doing inference, DeePMD-kit detects the backend from the model filename. For example, when the model filename ends with .pb (the ProtoBuf file), DeePMD-kit will consider it using the TensorFlow backend.

1.3. Convert model files between backends#

If a model is supported by two backends, one can use dp convert-backend to convert the model file between these two backends.