Raphael Gottardo

Raphael Gottardo
Full Member
Vaccine and Infectious Disease & Public Health Sciences Divisions
Fred Hutchinson Cancer Research Center

MAST: A novel statistical framework for assessing transcriptional changes and characterizing heterogeneity in single-cell RNA-seq data

Single-cell transcriptomic profiling enables the unprecedented interrogation of gene expression heterogeneity in rare cell populations that would otherwise be obscured in bulk RNA sequencing experiments. The stochastic nature of transcription is revealed in the bimodality of single-cell transcriptomic data, a feature shared across many single-cell expression platforms. There is, however, a paucity of computational tools that properly handle this unique characteristic. We present a new methodology to analyze single-cell transcriptomic data that models this bimodality within a coherent generalized linear modeling framework. We propose a two-part, generalized linear model that allows one to characterize biological changes in the proportions of cells that are expressing each gene, and in the positive mean expression level of that gene. We introduce the cellular detection rate, the fraction of genes turned on in a cell, and show how it can be used to simultaneously adjust for technical variation and so-called “extrinsic noise” at the single-cell level without the use of control genes. Our model permits direct inference on statistics formed by collections of genes, facilitating gene set enrichment analysis. The residuals defined by such models can be manipulated to interrogate cellular heterogeneity and gene-gene correlation across cells and conditions, providing insights into the temporal evolution of networks of co-expressed genes at the single-cell level. I will illustrate this novel approach using several RNA-seq datasets that we have recently generated to characterize specific human immune cell subsets.