Identifying enriched biological themes in gene sets
DAVID
- DAVID is generally the best place to start your enrichment analysis.
- Instructions for using DAVID can be found under Functional Annotation on the DAVID web site.
- You'll probably end up running DAVID multiple times, with different types of annotations, to find the most informative combination.
- Full output can be downloaded and viewed as a spreadsheet.
Gene Set Enrichment Analysis (GSEA)
Ranked List
- Create a two-column file with gene names in the first column and numeric values in the second (e.g. weight, p-value). The file does not need to be sorted.
- Assigning weights: There is no standard way to assign weights, but they should reflect a logical ordering of the genes. When the list is not pre-ordered or ranked, GSEA uses the correlation between expression and phenotype to assign weights. A similar scheme can be used to rank genes by log2 ratio, t-statistic, or a score that takes into account both log ratio and p-value.
- If a gene list is not unique, duplicate genes can be given a shared weight: e.g. a gene that occurs four times in the list is given a weight of 0.25 per occurrence, while a unique gene is given a weight of 1.
- Run GSEA: Tools -> GseaPreranked
- To run the same type of analysis on the command line, use a command like
java -Xmx512m -cp /usr/lib/share/gsea2/gsea2-2.2.2.jar xtools.gsea.GseaPreranked \
  -gmx gseaftp.broadinstitute.org://pub/gsea/gene_sets/h.all.v5.2.symbols.gmt \
  -collapse false -mode Max_probe -norm meandiv -nperm 1000 \
  -rnk ./MY_COMPARISON.rnk -scoring_scheme weighted -rpt_label GSEA_out_v1 \
  -chip gseaftp.broadinstitute.org://pub/gsea/annotations/GENE_SYMBOL.chip \
  -include_only_symbols true -make_sets true -plot_top_x 20 -rnd_seed timestamp \
  -set_max 500 -set_min 2 -zip_report false -out GSEA_OUT.TEST_v1 -gui false
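The ranked-list steps above can be sketched in Python. This is a minimal illustration, not a standard method: the function names, the example genes and values, and the choice of sign(log2 ratio) * -log10(p-value) as a score combining log ratio and p-value are all assumptions made for the example. The output is a tab-delimited two-column file of the kind GseaPreranked reads via -rnk.

```python
import csv
import math
from collections import Counter


def signed_score(log2_ratio, p_value, floor=1e-300):
    """One possible score: sign(log2 ratio) * -log10(p-value)."""
    p = max(p_value, floor)  # guard against log10(0)
    direction = (log2_ratio > 0) - (log2_ratio < 0)  # -1, 0 or 1
    return direction * -math.log10(p)


def duplicate_weights(genes):
    """Give each entry a weight of 1 / (number of times its gene occurs)."""
    counts = Counter(genes)
    return [1.0 / counts[gene] for gene in genes]


def write_rnk(rows, path):
    """Write (gene, score) pairs as a tab-delimited two-column file."""
    with open(path, "w", newline="") as handle:
        writer = csv.writer(handle, delimiter="\t", lineterminator="\n")
        for gene, score in rows:
            writer.writerow([gene, f"{score:.6g}"])


if __name__ == "__main__":
    # Hypothetical results: (gene, log2 ratio, p-value)
    results = [("TP53", 1.8, 0.001), ("MYC", -2.1, 0.0001), ("GAPDH", 0.05, 0.9)]
    rows = [(gene, signed_score(lfc, p)) for gene, lfc, p in results]
    write_rnk(rows, "MY_COMPARISON.rnk")  # file name matches the -rnk argument above
```

The file is written unsorted, since the ranked list does not need to be sorted. duplicate_weights(["A", "A", "A", "A", "B"]) gives 0.25 for each copy of A and 1 for B, matching the shared-weight scheme described above.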
Fast gene set enrichment analysis (fgsea)
Unranked List
For an unranked list, GSEA will rank the genes itself.
- Create necessary files in correct format for expression, phenotype and chip annotation (see GSEA wiki)
- Use MSigDB for gene sets or create custom gene sets in correct format
- Run GSEA, use default options to start
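As a rough sketch of the file-preparation step, the following Python writes an expression file in GCT layout and a phenotype file in categorical CLS layout. The function names and the example samples, genes and values are hypothetical; check the layouts against the GSEA wiki before relying on them, and note that the chip annotation file is not covered here.

```python
def write_gct(path, samples, expression):
    """Write a GCT expression file.

    samples: list of sample names (column order).
    expression: dict mapping gene name -> list of values, one per sample.
    """
    with open(path, "w") as handle:
        handle.write("#1.2\n")  # GCT version line
        handle.write(f"{len(expression)}\t{len(samples)}\n")  # rows, columns
        handle.write("\t".join(["NAME", "Description"] + list(samples)) + "\n")
        for gene, values in expression.items():
            if len(values) != len(samples):
                raise ValueError(f"{gene}: expected {len(samples)} values")
            fields = [gene, "na"] + [str(v) for v in values]
            handle.write("\t".join(fields) + "\n")


def write_cls(path, labels):
    """Write a categorical CLS phenotype file.

    labels: one class label per sample, in the same order as the GCT columns.
    """
    classes = list(dict.fromkeys(labels))  # unique labels, first-seen order
    with open(path, "w") as handle:
        handle.write(f"{len(labels)} {len(classes)} 1\n")
        handle.write("# " + " ".join(classes) + "\n")
        handle.write(" ".join(labels) + "\n")


if __name__ == "__main__":
    # Hypothetical two-gene, three-sample dataset
    samples = ["ctrl_1", "ctrl_2", "treated_1"]
    expression = {"TP53": [5.1, 4.9, 8.2], "MYC": [7.0, 7.3, 3.1]}
    write_gct("expression.gct", samples, expression)
    write_cls("phenotype.cls", ["ctrl", "ctrl", "treated"])
```

Keeping the label order in write_cls identical to the sample column order in the GCT file matters, because the phenotype labels are matched to samples by position.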
Single-sample GSEA (ssGSEA)
An extension of GSEA that can be used to determine enrichment of gene sets in individual samples.
- NOTE: GSEA should be run on the entire dataset, not on a subset of genes, as subsetting may bias the results. See GSEA pre-ranked questions (Google Groups)
BiNGO
BiNGO Plugin
You need to have Cytoscape installed to use BiNGO.
- Start BiNGO from Cytoscape: Plugins -> Start BiNGO
- Get genes from cluster/network or paste gene list
- Select the correct options (e.g. species)
- Run BiNGO
GeneGO
GeneGO Login (Password Required)
- Upload gene list and activate
- One-click analysis -> Select GeneGo Pathway Maps
Other/Useful Links
GO Term Finder: finds significant GO terms shared among a list of genes from your organism.
GO Term Mapper: maps the granular GO annotations for genes in a list to a set of GO slim terms, allowing you to bin your genes into broad categories.
Ingenuity IPA, subscription required.
Advaita iPathwayGuide, login required - subscription required for downloading.
More Information
Hot Topics: Gene List Enrichment