RFdiffusion
A deep learning-based protein design tool for motif scaffolding and sequence design.
RFdiffusion requires an environment module
In order to use RFdiffusion, you must first load the appropriate environment module:
module load rfdiffusion
RFdiffusion is an open source method for structure generation developed by the Rosetta Commons community for protein design and motif scaffolding. It generates new protein backbones and sequences based on diffusion models trained on large protein structure datasets.
Warning
RFdiffusion runs only inside an Apptainer container on GPU-capable compute nodes. Make sure to request a GPU node in your job submission or via Open OnDemand.
Loading the RFFdiffusion Module#
You can load the RFFdiffusion module in two equivalent ways:
or explicitly:
This will make the rfdiffusion.sif container image and supporting environment variables (e.g., $MODELS) available in your session.
Running RFdiffusion#
To run RFdiffusion within Apptainer on an HPC GPU node:
Notes#
--nvenables GPU access inside the container.- Replace
/path/to/your/output_dirand/path/to/your/input_dirwith your actual directories. - The
contigmap.contigsargument defines residue ranges and scaffold positions.
Using Non-Home Directories#
If your input or output directories are located outside your home directory (for example under /gpfs/research/), you must bind those directories manually using the -B option:
Note
Without directory binding, the container will not have access to files outside your home directory.
Accessing the Pretrained Models#
RFdiffusion includes several pretrained models that are required for inference.
These are stored at:
This path is automatically accessible via the environment variable:
You can reference it in your commands, for example:
```bash inference.model_directory_path=$MODELS