Environmental Statistics Seminar

Please direct any logistical questions to Amanda King

Environmental Statistics Mailing List – sign up to receive emails on the Environmental Statistics Seminar Series and other news and events

The Environmental Statistics training group is currently co-sponsoring seminars with the National Studies on Air Pollution and Health (NSAPH) group. These are listed in the schedule below.

Upcoming Seminar


Environmental Statistics seminar meetings will be held by Zoom.  The link to each meeting will be posted along with the talk information.

Thursday, April 6, 2023
NSAPH Working Group
1:00-2:00 PM
Location: FXB G11
Join Zoom meeting: https://harvard.zoom.us/j/93590617508?pwd=eUZNRzFUaTZEMThwNzhBa29TeU8yUT09

Kevin Josey

TITLE: Causal Inference in an Ecological Regression Setting ABSTRACT:  Ecological regression is an important tool for analyzing relationships with aggregate data, especially in environmental health studies where only group-level data, rather than individual-level data, are available. By combining ecological regression and causal inference, researchers can overcome many limitations inherent with traditional ecological studies, and produce more reliable and valid results. However, there is limited understanding on how to perform a causal analysis with aggregate data. In this presentation, I will demonstrate how to use aggregate data to estimate causal effects, specifically the exposure response curve relating annual average fine particulate matter (PM2.5) with mortality, enabling inferences on the individual-level and further overcoming issues with ecological bias. We cover minor differences in the indentifying assumptions required to obtain individual-level parameter estimates as opposed to group-level parameter estimates supposing that the exposure assignment is determined at the group-level. We compare the group-level causal estimates in a simulation study and in a real world data-analysis examining the association PM2.5 with mortality

Jenny Lee

TITLE: Air Pollution Epidemiology using Medicaid Claims: Causal Exposure-Response Function Estimation in Clustered Data with Surrogate Measures 

ABSTRACT: The methods development of this project is motivated by a case study where we are interested in estimating the causal exposure-response function (ERF) in the context of long-term exposure to (PM$_{2.5}$) and respiratory hospitalizations in socioeconomically disadvantaged children enrolled in the Medicaid program. We need to develop new methods to overcome the following challenges: 1) socioeconomic information, which is known to be a confounder and an effect modifier, is not available for the Medicaid population under study; 2) data are clustered, that is zip codes (our unit of the analysis) are nested within states; and 3) we have available two useful surrogates: the median household income at the zip code of residence of each Medicare enrollee and the state-level Medicaid family income eligibility thresholds for children. In this paper, we introduce a customized approach that leverages these two surrogates for addressing our motivating question. More specifically, we introduce \textit{MedMatch} that builds on generalized propensity score (GPS) matching methods for causal inference in the context of clustered data. We conduct simulation studies to evaluate the performance of our proposed method. We apply our method to estimate the average causal ERF between long-term PM$_{2.5}$ and first respiratory hospitalization among low-income children in Medicaid from 2000 to 2012.

ModeratorEllen Considine

 

2022-2023 Dates


October 6, 2022 - NSAPH: Edgar Castro, HSPH

Edgar Castro
PhD Candidate, HSPH

November 17, 2022 - NSAPH: Naeem Khoshnevis, Riccardo Cadei

Naeem Khoshnevis – Software Engineer
Pycausalgps – Matching on generalized propensity scores with continuous exposures (Python Package – Phase I: Design Proposal

Riccardo Cadei – Visiting Master’s Student
Tutorial on Causal Rule Ensemble R Package: Interpretable Inference of Heterogenous Treatment Effects

December 1, 2022 - NSAPH: Sarika Aggarwal

Presenter: Sarika Aggarwal (PhD Student)
Title: Associations of Floods with Zipcode-Level Hospitalization Rates in the United States

February 2, 2023 - NSAPH: Ellen Considine, Kevin Chen, HSPH

Presenter #1: Ellen Considine – PhD Student
Title: Can Reinforcement Learning Improve Strategies for Issuing Heat Alerts?

Presenter #2: Kevin Lee Chen – PhD Student
Title: Estimating Heterogenous Causal Effects Under Bipartite Network Interference: An Application to Air Pollution Regulatory Policy

March 2, 2023 - NSAPH: Michael Cork, Sofia Vega, HSPH

Presenter #1: Michael Cork – PhD Student
Title: Curving Emissions – Comparing Methods for Evaluating Response Curves

Presenter #2: Sofia Vega – PhD Student
Title: Expanding Bayesian Methods to Estimate Causal Effects of Environmental Exposures on Childhood Cancers

 

April 6, 2023 - NSAPH: Jenny Lee, HSPH

Kevin Josey

TITLE: Causal Inference in an Ecological Regression Setting ABSTRACT:  Ecological regression is an important tool for analyzing relationships with aggregate data, especially in environmental health studies where only group-level data, rather than individual-level data, are available. By combining ecological regression and causal inference, researchers can overcome many limitations inherent with traditional ecological studies, and produce more reliable and valid results. However, there is limited understanding on how to perform a causal analysis with aggregate data. In this presentation, I will demonstrate how to use aggregate data to estimate causal effects, specifically the exposure response curve relating annual average fine particulate matter (PM2.5) with mortality, enabling inferences on the individual-level and further overcoming issues with ecological bias. We cover minor differences in the indentifying assumptions required to obtain individual-level parameter estimates as opposed to group-level parameter estimates supposing that the exposure assignment is determined at the group-level. We compare the group-level causal estimates in a simulation study and in a real world data-analysis examining the association PM2.5 with mortality

Jenny Lee

TITLE: Air Pollution Epidemiology using Medicaid Claims: Causal Exposure-Response Function Estimation in Clustered Data with Surrogate Measures 

ABSTRACT: The methods development of this project is motivated by a case study where we are interested in estimating the causal exposure-response function (ERF) in the context of long-term exposure to (PM$_{2.5}$) and respiratory hospitalizations in socioeconomically disadvantaged children enrolled in the Medicaid program. We need to develop new methods to overcome the following challenges: 1) socioeconomic information, which is known to be a confounder and an effect modifier, is not available for the Medicaid population under study; 2) data are clustered, that is zip codes (our unit of the analysis) are nested within states; and 3) we have available two useful surrogates: the median household income at the zip code of residence of each Medicare enrollee and the state-level Medicaid family income eligibility thresholds for children. In this paper, we introduce a customized approach that leverages these two surrogates for addressing our motivating question. More specifically, we introduce \textit{MedMatch} that builds on generalized propensity score (GPS) matching methods for causal inference in the context of clustered data. We conduct simulation studies to evaluate the performance of our proposed method. We apply our method to estimate the average causal ERF between long-term PM$_{2.5}$ and first respiratory hospitalization among low-income children in Medicaid from 2000 to 2012.

ModeratorEllen Considine

 

May 4, 2023 - NSAPH: Kate Hu, Robbie Parks

Kate Hu
Research Associate, Department of Biostatistics, HSPH

Robbie Parks
Post-Doctoral Research Fellow

May 5, 2023 - TBD

Environmental Statistics Seminar Archive