Version 15 (modified by 4 years ago) ( diff ) | ,
---|
Identifying enriched biological themes in gene sets
DAVID
- DAVID is generally the best place to start your enrichment analysis.
- Instructions for using DAVID can be found under Functional Annotation on the DAVID web site.
- You'll probably end up running DAVID multiple times, with different types of annotations, to get the more informative combination.
- Full output can be downloaded and viewed as a spreadsheet.
Gene Set Enrichment Analysis (GSEA)
Explore the Molecular Signatures Database (MSigDB), a collection of annotated gene sets for use with GSEA software.
GSEA and MSigDB documentation
Guidelines for using RNA-seq datasets with GSEA
Ranked List
- Create a two column file with gene names as first column and numeric values for second column (eg. log2 fold change, log2 ratio). The file does not need to be sorted and it should have extension ".rnk".
- The second column, used to rank genes, could be log2 fold change, t-statistic, or another scoring scheme that takes into account both log ratio and p-value.
- To run using the GUI
- 1. Upload your ranked file "file.rnk". Click on "Steps in GSEA analysis -> Load data"
- 2. Click on "Tools -> GseaPreranked"
- 3. Select one of the gene sets from the "Gene sets database". We recommend starting with the Hallmarks set (h.all). You can find more information about the sets here
- 4. Select your uploaded ranked list and click the run button.
- To run the same type of analysis on the command line, use a command like
java -Xmx512m -cp /usr/lib/share/gsea2/gsea2-2.2.2.jar xtools.gsea.GseaPreranked -gmx gseaftp.broadinstitute.org://pub/gsea/gene_sets/h.all.v5.2.symbols.gmt -collapse false -mode Max_probe -norm meandiv -nperm 1000 -rnk ./MY_COMPARISON.rnk -scoring_scheme weighted -rpt_label GSEA_out_v1 -chip gseaftp.broadinstitute.org://pub/gsea/annotations/GENE_SYMBOL.chip -include_only_symbols true -make_sets true -plot_top_x 20 -rnd_seed timestamp -set_max 500 -set_min 2 -zip_report false -out GSEA_OUT.TEST_v1 -gui false
Fast gene set enrichment analysis (fgsea)
Unranked List
GSEA will rank the genes
- Create necessary files in correct format for expression, phenotype and chip annotation ( see GSEA wiki)
- Use MSigDB for gene sets or create custom gene sets in correct format
- Run GSEA, use default options to start
Single-sample GSEA (ssGSEA)
An extension of GSEA that can be used to determine enrichment of gene sets in individual samples.
- NOTE: GSEA should be run on the entire dataset, and not a subset of genes as this may bias the results. See GSEA pre-ranked questions (Google Groups)
BiNGO
BiNGO Plugin
You need to have Cytoscape installed to use BiNGO
- Start BiNGO via Cytoscape , Plugins->Start BiNGO
- Get genes from cluster/network or paste gene list
- Select the correct options (eg. species)
- Run BiNGO
GeneGO
GeneGO Login (Password Required)
- Upload gene list and activate
- One-click analysis -> Select GeneGo Pathway Maps
Other/Useful Links
GO Term Finder : significant GO terms shared among a list of genes from your organism.
GO Term Mapper : maps the granular GO annotations for genes in a list to a set of GO slim terms, allowing you to bin your genes into broad categories.
Ingenuity IPA, subscription required.
Advaita iPathwayGuide, login required - subscription required for downloading.
More Information
Hot Topics: Gene List Enrichment