Department of Biostatistics Seminar/Workshop Series

Desing and Analysis of Two-Phase Studies, with Applications to Genetic Association Studies

Ran Tao

PhD Biostatistics, University of North Carolina, Chapel Hill

In modern epidemiological and medical studies, the covariates of interest may involve genome sequencing or biomarker assay and thus are prohibitively expensive to measure on a large number of subjects. A cost-effective solution is the two-phase design, under which the outcome and inexpensive covariates are observed for all subjects during the first phase and this information is used to select subjects for measurements of expensive covariates during the second phase. Herein, we consider general two-phase designs, where the outcome can be continuous or discrete, and inexpensive covariates can be continuous and correlated with expensive covariates. We propose a semiparametric approach to regression analysis and establish the consistency, asymptotic normality, and asymptotic efficiency of the estimators. In addition, we demonstrate the usefulness of the proposed methods through extensive simulation studies and applications to the National Heart, Lung, and Blood Institute (NHLBI) Exome Sequencing Project (ESP). Finally, we derive optimal two-phase designs, which can be substantially more efficient than the current designs.
Topic revision: r1 - 11 May 2016, AshleeBartley
 

This site is powered by FoswikiCopyright © 2013-2022 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding Vanderbilt Biostatistics Wiki? Send feedback