Changes between Version 15 and Version 16 of SOP/scRNA-seq

Timestamp:: 08/04/20 11:49:55 (5 years ago)
Author:: twhitfie
Comment:: --

SOP/scRNA-seq

-              v15
+              v16
 Many experiments are especially informative when compared to other experiments, whether performed by the same laboratory or by different ones.  This is challenging, however, especially when the different experiments profile different types of cells.  In these cases, biological and technical differences are confounded, and one needs to make thoughtful assumptions about how to perform batch correction and achieve "success" during dataset integration.
+As of 2020 there are more than a dozen algorithms available for integrating single-cell RNA-seq datasets.  Three such methods are canonical correlation analysis (CCA, implemented in Seurat), iterative linear correction based on soft clustering (implemented in Harmony) and integrative nonnegative matrix factorization (iNMF, implemented in LIGER).  Commands for using each of these methods from within a Seurat workflow are given below.  Each example assumes that the Seurat objects being merged carry a metadata column named batch that identifies the experiment each cell came from.
+    * Using CCA in Seurat (please see T. Stuart ''et al''. “Comprehensive Integration of Single-Cell Data”, ''Cell'' '''177''', 1888-1902 (2019), the associated Seurat v.3 vignette and the documentation for the FindIntegrationAnchors function):
+{{{
+library(Seurat)
+# Merge two or more Seurat objects, objA and objB, from different batches.
+all <- merge(x=objA,y=objB,add.cell.ids=c("A","B"))
+# Split the merged object into one Seurat object per value of the "batch" metadata column.
+s3.list <- SplitObject(all, split.by = "batch")
+# Normalize each batch and select its variable features separately, before integration.
+for (i in seq_along(s3.list)) {
+    s3.list[[i]] <- NormalizeData(s3.list[[i]], verbose = FALSE)
+    s3.list[[i]] <- FindVariableFeatures(s3.list[[i]], selection.method = "vst", nfeatures = 2000, verbose = FALSE)
+}
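+# Find integration anchors (pairs of similar cells across batches), then use them to integrate the data.
+s3.anchors <- FindIntegrationAnchors(object.list = s3.list)
+s3.integrated <- IntegrateData(anchorset = s3.anchors)
+# Use the batch-corrected "integrated" assay for downstream scaling, PCA and clustering.
+DefaultAssay(s3.integrated) <- "integrated"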
+}}}
+    * Using Harmony from within Seurat (please see I. Korsunsky ''et al.'' “Fast, sensitive and accurate integration of single-cell data with Harmony”, ''Nature Methods'' '''16''', 1289-1296 (2019) and the documentation for the RunHarmony function):
+{{{
+library(Seurat)
+library(harmony)
+# Merge two or more Seurat objects, objA and objB, from different batches.
+all <- merge(x=objA,y=objB,add.cell.ids=c("A","B"))
+# In anticipation of using Harmony to integrate data-sets below, first use Seurat to run PCA on the un-corrected data.
+all <- NormalizeData(all, normalization.method = "LogNormalize", scale.factor = 10000)
+all <- FindVariableFeatures(all, selection.method = "vst", nfeatures = 2000)
+all <- ScaleData(all, features = rownames(all))
+all <- RunPCA(all, features = VariableFeatures(object = all))
+# Do the integration using Harmony, indexing samples by the batch slot:
+all <- RunHarmony(all, "batch")
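+# When generating UMAP or another embedding, be sure to use the integrated "harmony" reduction.
+# RunUMAP needs an explicit dims argument; here all Harmony dimensions are used.
+all <- RunUMAP(all, reduction = "harmony", dims = 1:ncol(all[["harmony"]]))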
+}}}
+    * Using LIGER (v 0.4.2.9000) from within Seurat (please see J.D. Welch ''et al.'' “Single-Cell Multi-omic Integration Compares and Contrasts Features of Brain Cell Identity”, ''Cell'' '''177''', 1873-1887 (2019) and the documentation for the RunOptimizeALS and RunQuantileAlignSNF functions):
+{{{
+library(Seurat)
+library(SeuratWrappers)
+library(liger)
+# Merge two or more Seurat objects, objA and objB, from different batches.
+all <- merge(x=objA,y=objB,add.cell.ids=c("A","B"))
+# In anticipation of using LIGER to integrate data-sets below, first use Seurat to scale the data without centering.
+all <- NormalizeData(all, normalization.method = "LogNormalize", scale.factor = 10000)
+all <- FindVariableFeatures(all, selection.method = "vst", nfeatures = 2000)
+all <- ScaleData(all, do.center=FALSE, split.by = "batch")
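+# Do the integration using LIGER, indexing samples by the batch slot.
+# k (the number of factors) must be supplied; k = 20 is only a placeholder and should be tuned for your data.
+all <- RunOptimizeALS(all, k = 20, split.by = "batch")
+all <- RunQuantileAlignSNF(all, split.by = "batch")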
+# When generating UMAP or another embedding, be sure to use the reduction from integrated nonnegative factorization ("iNMF").
+all <- RunUMAP(all, dims = 1:ncol(all[["iNMF"]]), reduction = "iNMF")
+}}}
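Whichever method is used, it helps to check how well the batches mix after integration.  The following is a minimal sketch in base R, not part of the workflows above: it assumes a per-cell metadata table with a batch column and a cluster column, and the toy data frame, the batch_entropy helper and the entropy score itself are illustrative assumptions.  With a real Seurat object, the same table could presumably be taken from the object's meta.data after clustering.

```r
# Hypothetical per-cell metadata standing in for a clustered, integrated object.
# All values are made up for illustration.
meta <- data.frame(
  batch   = c("A", "A", "A", "B", "B", "B", "A", "B"),
  cluster = c("0", "0", "1", "0", "1", "1", "2", "2")
)

# Number of cells in each (cluster, batch) combination.
counts <- table(meta$cluster, meta$batch)

# Fraction of each cluster contributed by each batch (each row sums to 1).
batch_fraction <- prop.table(counts, margin = 1)

# Shannon entropy of a cluster's batch composition, scaled to [0, 1]:
# 0 means the cluster comes from a single batch, 1 means batches are evenly mixed.
batch_entropy <- function(p, n_batches) {
  p <- p[p > 0]
  -sum(p * log(p)) / log(n_batches)
}
mixing <- apply(batch_fraction, 1, batch_entropy, n_batches = ncol(counts))

print(round(batch_fraction, 2))
print(round(mixing, 2))
```

A low score is not necessarily a failure: when the experiments profile different types of cells, a cluster that comes from a single batch may reflect real biology rather than an uncorrected batch effect.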
 === Export expression and dimensional analysis data for interactive viewing ===