Department of Biostatistics Seminar/Workshop Series
An efficient hierarchical generalized linear mixed model for pathway analysis of genome-wide association studies
Lily Wang, PhD
Assistant Professor, Department of Biostatistics, Vanderbilt University School of Medicine
Wednesday, April 13, 1:30-2:30pm, MRBIII Conference Room 1220
In Genome-wide Association Studies (GWAS) of complex diseases, genetic variants having real but weak associations often fail to be detected at the stringent genome-wide significance level. Pathway analysis, which tests disease association with combined association signals from a group of variants in the same pathway, has become increasingly popular. However, because of the complexities in genetic data and the large sample sizes in typical GWAS, pathway analysis remains to be
challenging. We propose a new statistical model for pathway analysis of GWAS. This model includes a fixed effects component that models mean disease association for a group of genes, and a random effects component that models how each gene’s association with disease varies about the gene group mean, thus belongs to the class of mixed effects models. The proposed model is computational efficient and uses only summary statistics. In addition, it corrects for the presence of overlapping genes and Linkage Disequilibrium (LD). Via simulated and real GWAS data, we show our model improves power over currently available pathway analysis methods while preserving type I error rate. Furthermore, using the WTCCC T1D dataset, we demonstrate mixed model analysis identifies meaningful biological processes that agree well with previous reports on T1D. Therefore, the proposed methodology provides an efficient statistical modeling framework for systems analysis of GWAS.