Tianxi Cai
Primary Faculty

John Rock Professor of Population and Translational Data Sciences




Dr. Cai’s current research interests are mainly in the areas of biomarker evaluation; model selection and validation; prediction methods; personalized medicine in disease diagnosis, prognosis, and treatment; statistical inference with high-dimensional data; and survival analysis.

In addition to her methodological research, Dr. Cai collaborates with the I2B2 (Informatics for Integrating Biology and the Bedside) center on developing a scalable informatics framework that will bridge clinical research data and the vast data banks arising from basic science research in order to better understand the genetic bases of complex diseases.

Sc.D., 1999
Harvard University


Informative missingness: What can we learn from patterns in missing laboratory data in the electronic health record?

Tan ALM, Getzen EJ, Hutch MR, Strasser ZH, Gutiérrez-Sacristán A, Le TT, Dagliati A, Morris M, Hanauer DA, Moal B, Bonzel CL, Yuan W, Chiudinelli L, Das P, Zhang HG, Aronow BJ, Avillach P, Brat GA, Cai T, Hong C, La Cava WG, Hooi Will Loh H, Luo Y, Murphy SN, Yuan Ngiam K, Omenn GS, Patel LP, Jebathilagam Samayamuthu M, Schriver ER, Shakeri Hossein Abad Z, Tan BWL, Visweswaran S, Wang X, Weber GM, Xia Z, Verdy B, Long Q, Mowery DL, Holmes JH.

J Biomed Inform. 2023 Mar. 139:104306. PMID: 36738870

Acute respiratory distress syndrome after SARS-CoV-2 infection on young adult population: International observational federated study based on electronic health records through the 4CE consortium.

Moal B, Orieux A, Ferté T, Neuraz A, Brat GA, Avillach P, Bonzel CL, Cai T, Cho K, Cossin S, Griffier R, Hanauer DA, Haverkamp C, Ho YL, Hong C, Hutch MR, Klann JG, Le TT, Loh NHW, Luo Y, Makoudjou A, Morris M, Mowery DL, Olson KL, Patel LP, Samayamuthu MJ, Sanz Vidorreta FJ, Schriver ER, Schubert P, Verdy G, Visweswaran S, Wang X, Weber GM, Xia Z, Yuan W, Zhang HG, Zöller D, Kohane IS, Boyer A, Jouhet V.

PLoS One. 2023. 18(1):e0266985. PMID: 36598895

SurvMaximin: Robust federated approach to transporting survival risk prediction models.

Wang X, Zhang HG, Xiong X, Hong C, Weber GM, Brat GA, Bonzel CL, Luo Y, Duan R, Palmer NP, Hutch MR, Gutiérrez-Sacristán A, Bellazzi R, Chiovato L, Cho K, Dagliati A, Estiri H, García-Barrio N, Griffier R, Hanauer DA, Ho YL, Holmes JH, Keller MS, Klann MEng JG, L'Yi S, Lozano-Zahonero S, Maidlow SE, Makoudjou A, Malovini A, Moal B, Moore JH, Morris M, Mowery DL, Murphy SN, Neuraz A, Yuan Ngiam K, Omenn GS, Patel LP, Pedrera-Jiménez M, Prunotto A, Jebathilagam Samayamuthu M, Sanz Vidorreta FJ, Schriver ER, Schubert P, Serrano-Balazote P, South AM, Tan ALM, Tan BWL, Tibollo V, Tippmann P, Visweswaran S, Xia Z, Yuan W, Zöller D, Kohane IS, Avillach P, Guo Z, Cai T.

J Biomed Inform. 2022 Oct. 134:104176. PMID: 36007785


Digging for research gold in electronic medical records

September 25, 2014 — For scientists who study rare diseases, hospitals’ vast data banks hold tantalizing potential. Access to anonymized electronic medical records allows researchers to track the progress of a larger group of patients than would be…