Minimal standalone PromisE inference package (no dependency on upper-level project code).
promisE/- Minimal model code (
pl_trigo + mpnn + multitask head + wrapper) - Sequence embedder (ESM)
- Inference entry (
promisE/inference.py)
- Minimal model code (
sample_input.json: example inputrequirements.txt: minimal runtime dependenciesparams/: checkpoint location (params/params.pt)
The model checkpoint is not stored in git history because it is large (~375MB).
Place the file at:
params/params.pt
Recommended release asset name:
params.pt
Example (replace v0.1.0 with your release tag):
curl -L -o params/params.pt \
https://github.com/ChemBioHTP/PromisE/releases/download/v0.1.0/params.ptInput JSON fields:
enzyme_sequence: protein sequence stringsubstrate_panels: list of substrate iso-smiles (multiple entries)protein_structure: optional path string (accepted but not required by this minimal model)
Example (sample_input.json):
{
"enzyme_sequence": "MNNRIT...",
"substrate_panels": ["CC(=O)O", "CCO", "CC(C)O"],
"protein_structure": "/path/to/optional_structure.pdb"
}pip install -r requirements.txtpython -m promisE.inference \
--input sample_input.json \
--output predictions.jsonOptional flags:
--device cudaor--device cpu--cache-dir ./esm_cache_msi_local--esm-model esm2_t30_150M_UR50D--checkpoint /custom/path/params.pt
predictions.json includes:
- Panel-level summary (
msi_mean/std,log_kcat_mean/std) - Per-substrate predictions:
pred_msipred_log_kcat