Changes between Version 12 and Version 13 of FAQ


Ignore:
Timestamp:
11/02/21 12:25:48 (3 years ago)
Author:
gbell
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • FAQ

    v12 v13  
    2102101. [=#alphafold How can I run '''AlphaFold 2.0''' here at Whitehead?]
    211211  * You'll need access to a system with a GPU, or the new GPU queue.  IT can help you choose and obtain access to GPU systems if required.
    212   * Modify the environment with a path to the required data:
    213     * export ALPHAFOLD_DATA_PATH=/alphafold/data
    214   * [Optional] Get information about the 'alphafold' command:
     212  * For a working command to be executed on fry.wi.mit.edu, which then sends the AlphaFold command to the GPU node on the slurm cluster, see /nfs/BaRC_Public/BaRC_code/shell/run_AlphaFold/Commands.sh
     213  * Start by copying RunAlphaFold_slurm.sh to your project directory.
     214  * The main command looks like
    215215{{{
    216 singularity run -B $ALPHAFOLD_DATA_PATH:/data -B .:/etc \
    217 --pwd /app/alphafold \
    218 --nv /alphafold/alphafold_2.0.0.sif --helpshort
     216sbatch -J AF_1 --export=ALL,FASTA_NAME=Sample_protein_1.fa,USERNAME=myUsername,FASTA_PATH=proteins,AF2_WORK_DIR=/nfs/BaRC_Public/BaRC_code/shell/run_AlphaFold ./RunAlphaFold_slurm.sh
    219217}}}
    220   * Run the main AlphaFold 2.0 command (replacing query.fasta with your input protein sequence), like
    221    {{{
    222 singularity run -B $ALPHAFOLD_DATA_PATH:/data -B .:/etc \
    223 --pwd /app/alphafold \
    224 --nv /alphafold/alphafold_2.0.0.sif \
    225 --fasta_paths=/data/query.fasta \
    226 --output_dir=/tmp/AlphaFold_output \
    227 --model_names=model_1 \
    228 --preset=full_dbs \
    229 --data_dir=/data \
    230 --max_template_date=2020-05-14 \
    231 --uniref90_database_path=/data/uniref90/uniref90.fasta \
    232 --mgnify_database_path=/data/mgnify/mgy_clusters.fa \
    233 --uniclust30_database_path=/data/uniclust30/uniclust30_2018_08/uniclust30_2018_08 \
    234 --bfd_database_path=/data/bfd/bfd_metaclust_clu_complete_id30_c90_final_seq.sorted_opt \
    235 --pdb70_database_path=/data/pdb70/pdb70 \
    236 --template_mmcif_dir=/data/pdb_mmcif/mmcif_files \
    237 --obsolete_pdbs_path=/data/pdb_mmcif/obsolete.dat
    238 }}}
    239 
     218    where the inputs are
     219    * AF2_WORK_DIR => project working directory
     220    * FASTA_PATH   => directory within AF2_WORK_DIR with input protein sequence
     221    * FASTA_NAME   => name of input protein sequence file within $AF2_WORK_DIR/$FASTA_PATH
     222    * USERNAME     => Username for job submission and email
    240223  * More information, including an explanation of the output files, is here: https://github.com/deepmind/alphafold