Scale with Hail: Genomic Analysis in the Biobank Era

Led by Jon Bloom and Tim Poterba  from the Hail Team at the Broad Institute

Do you work with genomic data? 

Then grab your laptop and join us for a hands-on tutorial, where we’ll teach you how to use Hail to accelerate your research.

Hail ( is a free, open-source, general-purpose, Python-based data analysis tool with additional data types and methods for working with genomic data. Unlike the Python and R scientific computing stacks, Hail:

  • scales from laptop to large compute cluster or cloud, with the same code
  • is designed to work with datasets that do not fit in memory
  • has first-class support for multi-dimensional structured data, like genomic data

At Broad, Hail is the analytical engine behinds dozens of studies, the Genome Aggregation Database (, and the Neale lab mega-GWAS ( Beyond Broad, Hail is used by academia and industry, on data ranging from mouse models to GTEx. We welcome the scientific community to leverage Hail to develop, share, and apply new methods at scale!

Note: We will assume participants have some experience using Python for data analysis; exposure to Jupyter notebooks is helpful but not required. You will connect through your laptop browser to a Hail instance running in the cloud, so do not need to pre-install any software.

Wednesday, October 31, 2018  |  2:00-4:00 pm
Kresge Building, Room G2  |  Harvard T.H. Chan School of Public Health
677 Huntington Ave, Boston MA

Students: $40
Faculty / Postdocs / Research Scientists: $60
Corporate: $80
*fee is to cover administrative costs

Logistical and tuition assistance inquiries can be directed to Amanda King