Changes between Version 12 and Version 13 of FAQ
- Timestamp:
- 11/02/21 12:25:48 (3 years ago)
Legend:
- Unmodified
- Added
- Removed
- Modified
-
FAQ
v12 v13 210 210 1. [=#alphafold How can I run '''AlphaFold 2.0''' here at Whitehead?] 211 211 * You'll need access to a system with a GPU, or the new GPU queue. IT can help you choose and obtain access to GPU systems if required. 212 * Modify the environment with a path to the required data:213 * export ALPHAFOLD_DATA_PATH=/alphafold/data214 * [Optional] Get information about the 'alphafold' command:212 * For a working command to be executed on fry.wi.mit.edu, which then sends the AlphaFold command to the GPU node on the slurm cluster, see /nfs/BaRC_Public/BaRC_code/shell/run_AlphaFold/Commands.sh 213 * Start by copying RunAlphaFold_slurm.sh to your project directory. 214 * The main command looks like 215 215 {{{ 216 singularity run -B $ALPHAFOLD_DATA_PATH:/data -B .:/etc \ 217 --pwd /app/alphafold \ 218 --nv /alphafold/alphafold_2.0.0.sif --helpshort 216 sbatch -J AF_1 --export=ALL,FASTA_NAME=Sample_protein_1.fa,USERNAME=myUsername,FASTA_PATH=proteins,AF2_WORK_DIR=/nfs/BaRC_Public/BaRC_code/shell/run_AlphaFold ./RunAlphaFold_slurm.sh 219 217 }}} 220 * Run the main AlphaFold 2.0 command (replacing query.fasta with your input protein sequence), like 221 {{{ 222 singularity run -B $ALPHAFOLD_DATA_PATH:/data -B .:/etc \ 223 --pwd /app/alphafold \ 224 --nv /alphafold/alphafold_2.0.0.sif \ 225 --fasta_paths=/data/query.fasta \ 226 --output_dir=/tmp/AlphaFold_output \ 227 --model_names=model_1 \ 228 --preset=full_dbs \ 229 --data_dir=/data \ 230 --max_template_date=2020-05-14 \ 231 --uniref90_database_path=/data/uniref90/uniref90.fasta \ 232 --mgnify_database_path=/data/mgnify/mgy_clusters.fa \ 233 --uniclust30_database_path=/data/uniclust30/uniclust30_2018_08/uniclust30_2018_08 \ 234 --bfd_database_path=/data/bfd/bfd_metaclust_clu_complete_id30_c90_final_seq.sorted_opt \ 235 --pdb70_database_path=/data/pdb70/pdb70 \ 236 --template_mmcif_dir=/data/pdb_mmcif/mmcif_files \ 237 --obsolete_pdbs_path=/data/pdb_mmcif/obsolete.dat 238 }}} 239 218 where the inputs are 219 * AF2_WORK_DIR => project working directory 220 * FASTA_PATH => directory within AF2_WORK_DIR with input protein sequence 221 * FASTA_NAME => name of input protein sequence file within $AF2_WORK_DIR/$FASTA_PATH 222 * USERNAME => Username for job submission and email 240 223 * More information, including an explanation of the output files, is here: https://github.com/deepmind/alphafold