PQG Short Courses

July 16-18, 2018

Introduction to R & Differential Gene Expression Analysis

  • using R to make data analysis more efficient
  • visualizing data using R (ggplot2)
  • using R to perform statistical analysis on RNA-seq count data and obtain lists of differentially expressed genes (DESeq2)

October 31, 2018

Scale with Hail: Genomic Analysis in the Biobank Era

Hail (http://hail.is) is a free, open-source, general-purpose, Python-based data analysis tool with additional data types and methods for working with genomic data. Unlike the Python and R scientific computing stacks, Hail:

  • scales from laptop to large compute cluster or cloud, with the same code
  • is designed to work with datasets that do not fit in memory
  • has first-class support for multi-dimensional structured data, such as genomic data