CREST

Conformer-Rotamer Ensemble Sampling Tool

CREST-MLIP — This fork adds Machine Learning Interatomic Potential (MLIP) support to CREST via three backends: libtorch (direct C++ TorchScript), pymlip (embedded Python for UMA/MACE), and ASE socket (TCP to any ASE calculator). Based on CREST 3.0.2. See CHANGES.md for a detailed list of modifications.

MLIP Backends

Backend	Method keyword	When to Use
`libtorch`	`libtorch`	Fastest GPU inference, no Python runtime needed
`pymlip` (UMA)	`uma`	Meta's Universal Model for Atoms (fairchem)
`pymlip` (MACE)	`mace`	MACE foundation models (mace-torch)
`ase-socket`	`ase-socket`	Any ASE-compatible calculator via TCP socket

Building with MLIP Support

Prerequisites

CMake >= 3.17 and gfortran >= 10 (same as upstream CREST)
For libtorch: PyTorch C++ (libtorch) installed
For pymlip: Python 3.10+ with fairchem-core (UMA) or mace-torch (MACE)

Conda environment files are provided in environments/:

conda env create -f environments/uma-cuda.yml
conda activate crest-uma

CMake Build

For pymlip (UMA/MACE):

cmake -B build \
  -DWITH_PYMLIP=true \
  -DPython3_EXECUTABLE=$(which python) \
  -DCMAKE_BUILD_TYPE=Release
cmake --build build -j$(nproc)

For libtorch:

cmake -B build \
  -DWITH_LIBTORCH=true \
  -DCMAKE_PREFIX_PATH=/path/to/libtorch \
  -DCMAKE_BUILD_TYPE=Release
cmake --build build -j$(nproc)

Note: WITH_GFNFF is auto-enabled when building with MLIP support — GFN-FF provides lightweight topology-based WBOs needed by SHAKE bond constraints and the flexibility measure.

Quick Start

# UMA conformer search (GPU)
crest molecule.xyz --input examples/conformer_search_uma.toml

# MACE optimization via embedded Python
crest molecule.xyz --input examples/optimization_mace_pymlip.toml

# MACE conformer search via libtorch (TorchScript)
crest molecule.xyz --input examples/conformer_search_mace_libtorch.toml

Key Changes from Upstream CREST 3.0.2

Three MLIP calculator backends — each targeting a different use case:
- libtorch: loads TorchScript-exported models (.pt) and runs inference entirely in C++ via the PyTorch C++ API (libtorch). No Python runtime is needed. This is the fastest path for production GPU runs. Models are exported with scripts/export_model.py or scripts/export_mace.py, which bake graph construction, neighbor lists, and unit conversions into the TorchScript module.
- pymlip: embeds the CPython interpreter inside the Fortran process and calls UMA (fairchem-core) or MACE (mace-torch) calculators directly in-memory. Avoids TCP socket overhead while reusing all Python model infrastructure. The GIL is acquired per-call; for parallel MD, separate worker processes are spawned to bypass this limitation (see process-based parallel MD below).
- ASE socket: connects over TCP to an external Python server (src/python_server/crest_ase_server.py) wrapping any ASE-compatible calculator. The most flexible option — works with any model or code that has an ASE interface, at the cost of serialization overhead.
GPU batched inference — CREST's conformer search generates hundreds of independent structures per iteration. Instead of evaluating them one-by-one through OpenMP threads (which would serialize on a single GPU anyway), the batched path in parallel.f90 collects all structures, pads them to equal length, and sends them through the model in a single batched forward pass. Two sub-paths exist: single-GPU (one batch) and multi-GPU (round-robin distribution across ngpus devices). Batch size is auto-tuned from atom count if not set explicitly. This typically gives 5-20x speedup over per-structure evaluation.
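The padding and round-robin steps described above can be sketched in plain Python. This is only an illustration, not the code in parallel.f90: the function names (`pad_batch`, `round_robin`, `auto_batch_size`) and the atom-budget heuristic are assumptions made for this example.

```python
from typing import List, Sequence, Tuple

Coords = List[Tuple[float, float, float]]


def pad_batch(structures: Sequence[Coords]) -> Tuple[List[Coords], List[List[bool]]]:
    """Pad every structure to the length of the largest one.

    Returns the padded coordinates and a mask that is True for real atoms
    and False for padding, so padded entries can be ignored downstream.
    """
    n_max = max(len(s) for s in structures)
    padded, mask = [], []
    for s in structures:
        n_pad = n_max - len(s)
        padded.append(list(s) + [(0.0, 0.0, 0.0)] * n_pad)
        mask.append([True] * len(s) + [False] * n_pad)
    return padded, mask


def round_robin(n_structures: int, ngpus: int) -> List[List[int]]:
    """Assign structure indices to devices in round-robin order."""
    return [list(range(dev, n_structures, ngpus)) for dev in range(ngpus)]


def auto_batch_size(n_atoms: int, atom_budget: int = 4096) -> int:
    """Hypothetical heuristic: cap the total number of atoms per batch."""
    return max(1, atom_budget // max(1, n_atoms))


structures = [
    [(0.0, 0.0, 0.0), (0.0, 0.0, 1.1)],
    [(0.0, 0.0, 0.0), (0.0, 0.0, 1.0), (0.0, 1.0, 0.0)],
]
padded, mask = pad_batch(structures)
assignment = round_robin(n_structures=5, ngpus=2)
```

Here `mask[0]` marks the third entry of the first structure as padding, and `assignment` places structures 0, 2, 4 on the first device and 1, 3 on the second.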
Shared model loading — MLIP models can be 100 MB–2 GB. Without sharing, each OpenMP thread would load its own copy, quickly exhausting GPU memory. Instead, the master thread loads the model once and broadcasts the handle to all workers. For libtorch this uses a mutex-protected shared pointer; for pymlip, the GIL naturally serializes access to a single Python object. Workers that receive a shared handle skip cleanup on exit (only the owner deallocates).
Process-based parallel MD for pymlip — Python's Global Interpreter Lock (GIL) serializes all embedded Python calls, making OpenMP thread parallelism ineffective for pymlip calculators. To achieve true parallelism, CREST automatically detects pymlip and switches from OpenMP threads to a process-based model: the parent writes per-worker binary configs (molecule + MD settings + calculator parameters), then spawns N independent CREST worker processes (crest --worker <config> <index>), each with its own Python interpreter and CUDA context. Workers load the MLIP model, run their assigned metadynamics trajectory, write the trajectory file, and exit. The parent waits for all workers and merges trajectories as before. With 8 worker processes on an A100 GPU, 14 MTD simulations complete in 2 batches instead of 14 serial runs — a ~7× speedup. Non-pymlip calculators (libtorch, xTB, GFN-FF) continue using the standard OpenMP path.
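The batch count and speedup quoted above follow from simple arithmetic, worked through here in Python. The function names are illustrative, and the speedup is an upper bound that assumes equal-length runs and no contention between workers sharing the GPU.

```python
import math


def mtd_batches(n_runs: int, n_workers: int) -> int:
    """Number of sequential batches when n_workers processes run concurrently."""
    return math.ceil(n_runs / n_workers)


def ideal_speedup(n_runs: int, n_workers: int) -> float:
    """Upper-bound speedup over running all simulations one after another."""
    return n_runs / mtd_batches(n_runs, n_workers)


print(mtd_batches(14, 8))    # 2
print(ideal_speedup(14, 8))  # 7.0
```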
WBO fallback cascade for MLIP calculators: CREST's metadynamics relies on Wiberg Bond Orders (WBOs) in two places. (1) SHAKE bond constraints use WBOs to identify which bonds to constrain (threshold > 0.5), and (2) the flexi() function uses WBOs to estimate molecular flexibility, which controls metadynamics simulation length. MLIP calculators provide only energy and gradient, not WBOs. To fill this gap, the fork implements a cascade: first try GFN2-xTB (most accurate; produces continuous WBOs, roughly 1.0/1.5/2.0 for single/aromatic/double bonds), then fall back to GFN-FF topology (binary 0/1 WBOs from the neighbor list only, ~0.01 s, no singlepoint needed), then fall back to a size-based default (flexibility = 0.5). SHAKE has an additional safety net: if no WBOs are available at all, it degrades from mode 2 (all-bond constraints) to mode 1 (X-H bonds only, which need no WBOs). GFN-FF is auto-enabled in the build system (WITH_GFNFF) whenever MLIP support is compiled in.
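The control flow of such a cascade can be sketched as follows. This is an illustration of the pattern only, not the Fortran implementation: the provider functions are stand-ins, and every name below is hypothetical apart from the 0.5 threshold, the two SHAKE modes, and the 0.5 default flexibility taken from the text.

```python
from typing import Callable, List, Optional, Sequence, Tuple

WBO = List[List[float]]  # square bond-order matrix


def first_available(providers: Sequence[Tuple[str, Callable[[], Optional[WBO]]]]):
    """Return (name, wbo) from the first provider that succeeds, else (None, None)."""
    for name, provider in providers:
        try:
            wbo = provider()
        except Exception:
            wbo = None  # treat any failure as "not available" and move on
        if wbo is not None:
            return name, wbo
    return None, None


def shake_mode(wbo: Optional[WBO]) -> int:
    """Mode 2 (all bonds) needs WBOs; without them fall back to mode 1 (X-H only)."""
    return 2 if wbo is not None else 1


def constrained_bonds(wbo: WBO, threshold: float = 0.5) -> List[Tuple[int, int]]:
    """Atom pairs whose bond order exceeds the SHAKE threshold."""
    n = len(wbo)
    return [(i, j) for i in range(n) for j in range(i + 1, n) if wbo[i][j] > threshold]


DEFAULT_FLEXIBILITY = 0.5  # last-resort value when no WBOs can be obtained


# Stand-in providers: the GFN2-xTB step fails here, the GFN-FF topology succeeds.
def gfn2_wbo() -> Optional[WBO]:
    raise RuntimeError("singlepoint failed")


def gfnff_topology_wbo() -> Optional[WBO]:
    return [[0.0, 1.0, 0.0], [1.0, 0.0, 1.0], [0.0, 1.0, 0.0]]


source, wbo = first_available([("gfn2", gfn2_wbo), ("gfnff", gfnff_topology_wbo)])
```

With these stand-ins, `source` is `"gfnff"` and `constrained_bonds(wbo)` selects the two bonds of the three-atom chain.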
MLIP resource cleanup — GPU memory, Python interpreter state, and TCP sockets must be released after each algorithm step (MD, optimization, singlepoint, scan, numerical Hessian). Every algorithm endpoint calls idempotent cleanup routines that free GPU tensors, close socket connections, and release Python objects. This prevents GPU memory leaks during multi-step workflows (e.g., conformer search → optimization → frequency calculation).
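A minimal sketch of the idempotent, owner-only cleanup pattern is given below. The `ModelHandle` class and its counter are hypothetical stand-ins for the real GPU, Python, and socket resources; the point is only that repeated cleanup calls are harmless and that a shared (non-owning) handle never frees the underlying resource.

```python
class ModelHandle:
    """Toy handle for a loaded model; `owner` marks who may free it."""

    def __init__(self, resource: dict, owner: bool = True):
        self._resource = resource
        self.owner = owner
        self.released = False

    def share(self) -> "ModelHandle":
        """Hand out a non-owning view of the same resource to a worker."""
        return ModelHandle(self._resource, owner=False)

    def cleanup(self) -> None:
        """Idempotent: safe to call at every algorithm endpoint."""
        if self.released:
            return
        if self.owner:
            # Stand-in for freeing GPU tensors or closing a socket.
            self._resource["free_calls"] += 1
        self.released = True


resource = {"free_calls": 0}
master = ModelHandle(resource)
worker = master.share()

worker.cleanup()   # non-owner: marks itself released, frees nothing
master.cleanup()   # owner: frees the resource once
master.cleanup()   # second call is a no-op
```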
Conda environments — ready-to-use YAML files in environments/ for UMA and MACE, with both CPU and CUDA variants. These pin compatible versions of PyTorch, fairchem-core/mace-torch, and all dependencies.

TOML Configuration Reference

MLIP Keywords

All keys go inside [[calculation.level]] blocks.

Key	Values	Default	Description
`method`	`uma`, `mace`, `libtorch`, `pymlip`, `ase-socket`	—	Calculator backend selection
`device`	`cpu`, `cuda`, `cuda:0`–`cuda:3`, `mps`	`cpu`	Compute device for inference
`model_path`	file path	—	Path to model checkpoint (`.pt` or `.model`)
`model_type`	`uma`, `mace`	—	Model family (pymlip backend only)
`model_format`	`generic`, `mace`	`generic`	TorchScript output format (libtorch only)
`cutoff`	float (Angstrom)	`6.0`	Neighbor list cutoff (libtorch only)
`task`	string	—	UMA task name, e.g. `omol`
`atom_refs`	file path	—	Per-element energy references YAML (UMA only)
`compile_mode`	`""`, `reduce-overhead`, `max-autotune`	`""`	torch.compile mode (pymlip only)
`dtype`	`float64`, `float32`	`float64`	Floating-point precision (MACE only)
`turbo`	`true`/`false`	`false`	UMA turbo-mode: tf32 + compile + merge_mole
`batch_size`	integer	`0` (auto)	Structures per GPU batch
`aten_threads`	integer	`0` (auto)	ATen intra-op threads
`shared_model`	`true`/`false`	`false`	Share one model across threads (libtorch)
`ngpus`	integer	`0` (auto)	GPUs for multi-GPU batching
`host`	string	`127.0.0.1`	ASE socket server hostname
`port`	integer	`6789`	ASE socket server TCP port
`debug`	`true`/`false`	`false`	Per-call timing output
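To make the defaults in the table concrete, the following sketch merges a user-supplied level block over them. It is not CREST's TOML parser: `resolve_level` and the validation rules are assumptions for illustration, and only the key names, allowed `method` values, and default values come from the table.

```python
DEFAULTS = {
    "device": "cpu",
    "model_format": "generic",
    "cutoff": 6.0,
    "compile_mode": "",
    "dtype": "float64",
    "turbo": False,
    "batch_size": 0,      # 0 = auto
    "aten_threads": 0,    # 0 = auto
    "shared_model": False,
    "ngpus": 0,           # 0 = auto
    "host": "127.0.0.1",
    "port": 6789,
    "debug": False,
}
METHODS = {"uma", "mace", "libtorch", "pymlip", "ase-socket"}
# Keys listed in the table without a default value.
NO_DEFAULT = {"method", "model_path", "model_type", "task", "atom_refs"}


def resolve_level(user: dict) -> dict:
    """Merge user keys over the documented defaults after basic validation."""
    unknown = set(user) - set(DEFAULTS) - NO_DEFAULT
    if unknown:
        raise KeyError(f"unknown keys: {sorted(unknown)}")
    if user.get("method") not in METHODS:
        raise ValueError(f"method must be one of {sorted(METHODS)}")
    return {**DEFAULTS, **user}


level = resolve_level(
    {"method": "libtorch", "model_path": "model.pt", "device": "cuda:0"}
)
```

Keys the user does not set, such as `cutoff`, keep their table default, while `device` is overridden.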

Optimization with MLIPs

Note: The default ANCOPT optimizer uses a model Hessian in internal coordinates that can oscillate with MLIP gradient noise. For MLIP calculators, use the RFO optimizer:

[calculation]
opt_engine = "rfo"

Example TOML

runtype = "imtd-gc"
threads = 8

[calculation]
opt_engine = "rfo"

[[calculation.level]]
method     = "uma"
device     = "cuda"
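For scripted workflows, the example above can be written to disk and paired with the invocation form shown under Quick Start. The sketch below only assembles the command and does not run it; the file name `uma_search.toml` and the helper `crest_command` are hypothetical.

```python
import shlex
import tempfile
from pathlib import Path

EXAMPLE_TOML = '''\
runtype = "imtd-gc"
threads = 8

[calculation]
opt_engine = "rfo"

[[calculation.level]]
method = "uma"
device = "cuda"
'''


def crest_command(xyz: str, toml_path: Path) -> list:
    """Argument list for: crest <structure> --input <file>."""
    return ["crest", xyz, "--input", str(toml_path)]


workdir = Path(tempfile.mkdtemp())
toml_path = workdir / "uma_search.toml"  # hypothetical file name
toml_path.write_text(EXAMPLE_TOML)

cmd = crest_command("molecule.xyz", toml_path)
print(shlex.join(cmd))
# To launch for real, pass the list to subprocess.run(cmd, check=True).
```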

CREST (abbreviated from Conformer-Rotamer Ensemble Sampling Tool) is a program for the automated exploration of the low-energy molecular chemical space. It functions as an OMP scheduler for calculations with efficient force-field and semiempirical quantum mechanical methods such as xTB, and provides a variety of capabilities for creation and analysis of structure ensembles.
See our recent publication in J. Chem. Phys. for a feature overview: https://doi.org/10.1063/5.0197592

Documentation

The CREST documentation with installation instructions and application examples is hosted at:

Installation quick guide

There are multiple possible ways of installing CREST. Detailed build instructions can be found at https://crest-lab.github.io/crest-docs/page/installation.

Warning

For any installation make sure that you have correctly installed and sourced the xtb program before attempting any calculations with CREST. While xtb is technically not needed for the primary runtypes of CREST versions >3.0 thanks to an integration of tblite, some functionalities, like QCG, still require it!

Option 1: Precompiled binaries

The statically linked binaries can be found on the release page of this repository. The most recent program version is automatically built (both Meson/Intel and CMake/GNU) from the main branch and can be found on the continuous release page.

Simply unpack the binary and add it to your PATH variable.

tar -xf crest-gnu-12-ubuntu-latest.tar.xz

or

tar -xf crest-intel-2023.1.0-ubuntu-latest.tar.xz

The program should be directly executable.

Option 2: Conda

A conda-forge feedstock is maintained at https://github.com/conda-forge/crest-feedstock.

Installing CREST from the conda-forge channel can be done via:

conda install conda-forge::crest

The conda-forge distribution is based on a dynamically linked CMake/GNU build.

Warning

When using OpenBLAS as the shared-library backend for linear algebra in CREST, please set the environment variable via export OPENBLAS_NUM_THREADS=1; otherwise the concurrent (nested) parallel code parts may print warnings.

Option 3: Compiling from source

Tested builds

Working and tested builds of CREST (mostly on Ubuntu 20.04 LTS):

Build System	Compiler	Linear Algebra Backend	Build type	Status	Note
CMake 3.30.2	GNU (gcc 14.1.0)	libopenblas 0.3.27	dynamic	✅
CMake 3.30.2	GNU (gcc 12.3.0)	libopenblas-dev	static	✅
CMake 3.28.3	Intel (`ifort`/`icc` 2021.9.0)	MKL static (oneAPI 2023.1)	dynamic	⚠️	OpenMP/MKL problem (#285)
Meson 1.2.0	Intel (`ifort`/`icx` 2023.1.0)	MKL static (oneAPI 2023.1)	static	✅

Generally, subprojects should be initialized for the default build options, which can be done by

git submodule init
git submodule update

For more information about builds including subprojects see here.

Some basic build instructions can be found in the following dropdown tabs:

`CMake` build

Building CREST with CMake works with the following chain of commands (in this example with gfortran/gcc compilers):

export FC=gfortran CC=gcc
cmake -B _build

and then to build the CREST binary

make -C _build

Optionally, the build can be tested via

make test -C _build

The CMake build typically requires access to shared libraries of LAPACK and OpenMP. They must be present in the library paths at compile time and at runtime. Alternatively, a static build can be selected by using -DSTATICBUILD=true in the CMake setup step. The current static build with GNU compilers is available from the continuous release page.

`meson` build

For the setup and configuration of meson, see also the meson setup page hosted at the xtb repository. The chain of commands to build CREST with meson is:

export FC=ifort CC=icc
meson setup _build --prefix=$PWD/_dist
meson install -C _build

The meson build of CREST is mainly focused on and tested with the Intel ifort/icc compilers. When using newer versions of Intel's oneAPI, replacing icc with icx should work. Please refrain from using ifx instead of ifort, however. When attempting to build with gfortran and gcc, add -Dla_backend=mkl to the meson setup command. Compatibility with the GNU compilers might be limited. We recommend the CMake build (see the corresponding section) in this instance.

By default the meson build will create a statically linked binary.

Citations

P. Pracht, F. Bohle, S. Grimme, Phys. Chem. Chem. Phys., 2020, 22, 7169-7192. DOI: 10.1039/C9CP06869D
S. Grimme, J. Chem. Theory Comput., 2019, 15, 2847-2862. DOI: 10.1021/acs.jctc.9b00143
P. Pracht, S. Grimme, Chem. Sci., 2021, 12, 6551-6568. DOI: 10.1039/d1sc00621e
P. Pracht, C.A. Bauer, S. Grimme, J. Comput. Chem., 2017, 38, 2618-2631. DOI: 10.1002/jcc.24922
S. Spicher, C. Plett, P. Pracht, A. Hansen, S. Grimme, J. Chem. Theory Comput., 2022, 18, 3174-3189. DOI: 10.1021/acs.jctc.2c00239
P. Pracht, C. Bannwarth, J. Chem. Theory Comput., 2022, 18 (10), 6370-6385. DOI: 10.1021/acs.jctc.2c00578
P. Pracht, S. Grimme, C. Bannwarth, F. Bohle, S. Ehlert, G. Feldmann, J. Gorges, M. Müller, T. Neudecker, C. Plett, S. Spicher, P. Steinbach, P. Wesołowski, F. Zeller, J. Chem. Phys., 2024, 160, 114110. DOI: 10.1063/5.0197592

If you use the MLIP backends in this fork, please also cite:

UMA: Meta Fundamental AI Research, fairchem-core, https://github.com/FAIR-Chem/fairchem
MACE: I. Batatia, D.P. Kovacs, G.N.C. Simm, C. Ortner, G. Csanyi, DOI: 10.48550/arXiv.2206.07697

BibTeX entries

@article{Pracht2020,
  author ="Pracht, Philipp and Bohle, Fabian and Grimme, Stefan",
  title  ="Automated exploration of the low-energy chemical space with fast quantum chemical methods",
  journal  ="Phys. Chem. Chem. Phys.",
  year  ="2020",
  volume  ="22",
  issue  ="14",
  pages  ="7169-7192",
  doi  ="10.1039/C9CP06869D"
}

@article{Grimme2019,
  author = {Grimme, Stefan},
  title = {Exploration of Chemical Compound, Conformer, and Reaction Space with Meta-Dynamics Simulations Based on Tight-Binding Quantum Chemical Calculations},
  journal = {J. Chem. Theory Comput.},
  volume = {15},
  number = {5},
  pages = {2847-2862},
  year = {2019},
  doi = {10.1021/acs.jctc.9b00143}
}

@article{Pracht2021,
  author ="Pracht, Philipp and Grimme, Stefan",
  title  ="Calculation of absolute molecular entropies and heat capacities made simple",
  journal  ="Chem. Sci.",
  year  ="2021",
  volume  ="12",
  issue  ="19",
  pages  ="6551-6568",
  doi  ="10.1039/D1SC00621E",
  url  ="http://dx.doi.org/10.1039/D1SC00621E"
}

@article{Pracht2017,
  author = {Pracht, Philipp and Bauer, Christoph Alexander and Grimme, Stefan},
  title = {Automated and efficient quantum chemical determination and energetic ranking of molecular protonation sites},
  journal = {J. Comput. Chem.},
  volume = {38},
  number = {30},
  pages = {2618-2631},
  doi = {https://doi.org/10.1002/jcc.24922},
  url = {https://onlinelibrary.wiley.com/doi/abs/10.1002/jcc.24922},
  year = {2017}
}

@article{Spicher2022,
  author = {Spicher, Sebastian and Plett, Christoph and Pracht, Philipp and Hansen, Andreas and Grimme, Stefan},
  title = {Automated Molecular Cluster Growing for Explicit Solvation by Efficient Force Field and Tight Binding Methods},
  journal = {J. Chem. Theory Comput.},
  volume = {18},
  number = {5},
  pages = {3174-3189},
  year = {2022},
  doi = {10.1021/acs.jctc.2c00239}
}

@article{Pracht2022,
  author = {Pracht, Philipp and Bannwarth, Christoph},
  title = {Fast Screening of Minimum Energy Crossing Points with Semiempirical Tight-Binding Methods},
  journal = {J. Chem. Theory Comput.},
  volume = {18},
  number = {10},
  pages = {6370-6385},
  year = {2022},
  doi = {10.1021/acs.jctc.2c00578}
}

@article{Pracht2024,
  author = {Pracht, Philipp and Grimme, Stefan and Bannwarth, Christoph and Bohle, Fabian and Ehlert, Sebastian and Feldmann, Gereon and Gorges, Johannes and M\"uller, Marcel and Neudecker, Tim and Plett, Christoph and Spicher, Sebastian and Steinbach, Pit and Weso{\l}owski, Patryk A. and Zeller, Felix},
  title = "{CREST - A program for the exploration of low-energy molecular chemical space}",
  journal = {J. Chem. Phys.},
  volume = {160},
  number = {11},
  pages = {114110},
  year = {2024},
  month = {03},
  issn = {0021-9606},
  doi = {10.1063/5.0197592},
  url = {https://doi.org/10.1063/5.0197592}
}

License

CREST is free software: you can redistribute it and/or modify it under the terms of the GNU Lesser General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.

CREST is distributed in the hope that it will be useful, but without any warranty; without even the implied warranty of merchantability or fitness for a particular purpose. See the GNU Lesser General Public License for more details.

Unless you explicitly state otherwise, any contribution intentionally submitted for inclusion in CREST by you, as defined in the GNU Lesser General Public License, shall be licensed as above, without any additional terms or conditions.
