You are here:
Vanderbilt Biostatistics Wiki
>
Main Web
>
TWikiUsers
>
ChunLi
>
SNPChipCostEfficiency
(revision 7) (raw view)
Edit
Attach
---+++ Comparison of cost efficiency of SNP chips Li C, Li M, Long J-R, Cai Q, Zheng W (2007) Evaluating cost efficiency of SNP chips in genome-wide association studies. (revision submitted) *Abstract:* Genome-wide association (GWA) studies have recently emerged as a major approach to gene discovery for many complex diseases. Since GWA scans are expensive, cost efficiency is an important factor to consider in study design. However, it often requires extensive and time consuming computer simulations to compare cost efficiency across different SNP chips. Here we propose two simulation-free approaches to cost efficiency comparisons across SNP chips. In the first method, the overall power under a given disease model is calculated for each SNP chip under various sample sizes. Then SNP chips can be compared with respect to the sample sizes required to achieve the same level of power. In the second method, for a desired level of genomic coverage, the effective r<sup>2</sup> threshold values are calculated for each SNP chip. Since r<sup>2</sup> is inversely proportional to the sample size to achieve the same power, the required sample sizes can then be compared among SNP chips. These methods are complementary. The first approach provides direct power comparisons, but it requires information on disease model and may not be reliable for SNP chips that contain many non-HapMap SNPs. The second approach allows sample size comparisons based on the coverage of SNP chips, and it can be modified for SNP chips that contain non-HapMap SNPs. These methods are particularly relevant for large epidemiological studies in which enough subjects are available for GWA screening and follow-up stages. We illustrate these approaches using five currently available whole genome SNP chips. *Power* for SNP chips: [[%ATTACHURL%/CEUpower.tar.gz][CEU]], [[%ATTACHURL%/CHBpower.tar.gz][CHB]], [[%ATTACHURL%/JPTpower.tar.gz][JPT]], [[%ATTACHURL%/YRIpower.tar.gz][YRI]]. Each package has nine files, with names like CEUpower_##_$$.txt, where ## can be 01, 05, 10 for prevalence 0.01, 0.05, and 0.10, and $$ is the genotype relative risk (1.25, 1.50, 1.75). Note the power for SNP Array 6.0 and Human1M may be underestimated because each chip contains about 10% non-HapMap SNPs. *Genomic coverage* for SNP chips at various r<sup>2</sup> thresholds: [[%ATTACHURL%/CEUr2coverage.txt.gz][CEU]], [[%ATTACHURL%/CHBr2coverage.txt.gz][CHB]], [[%ATTACHURL%/JPTr2coverage.txt.gz][JPT]], [[%ATTACHURL%/YRIr2coverage.txt.gz][YRI]]. The columns whose name has a "b" at the end are the alternative coverage estimate that takes into account the non-HapMap SNPs on the SNP Array 6.0 and Human1M. *HapMap SNP tabulation* for a fine grid of maximum r<sup>2</sup> and MAF: [[%ATTACHURL%/CEUtables.tar.gz][CEU]], [[%ATTACHURL%/CHBtables.tar.gz][CHB]], [[%ATTACHURL%/JPTtables.tar.gz][JPT]], [[%ATTACHURL%/YRItables.tar.gz][YRI]]. Each package has five files for the five SNP chips (SNP Array 5.0, SNP Array 6.0, <nop>HumanHap300, <nop>HumanHap550, Human1M). Each file has 100 rows and 50 columns. The number in cell (i,j) is the number of <nop>HapMap common SNPs (MAF ≥ .05) that have maximum r<sup>2</sup> in ((i-1)/100, i/100] and MAF in ((j-1)/100, j/100]. Note the maximum r<sup>2</sup> for SNP Array 6.0 and Human1M may be underestimated because each chip contains about 10% non-HapMap SNPs. *[[%ATTACHURL%/GWApower_web.r][R function for power calculation]]* ---++++ Other papers of relevance * [[SNPChipCoverage][Coverage variation of SNP chips]]: Coverage maps of SNP chips and their coverage variation across the human genome and over known genes * [[GWAsimulator][GWAsimulator]]: A rapid whole genome simulation program Please contact Chun Li (chun.li@vanderbilt.edu) if you have any questions. <!-- * Set ALLOWTOPICCHANGE = ChunLi * Set TOPICLAYOUTURL = /twiki/pub/Main/ChunLi/ChunLiLayout.css * Set TOPICSTYLEURL = /twiki/pub/Main/ChunLi/ChunLiStyle.css -->
Edit
|
Attach
|
P
rint version
|
H
istory
:
r9
|
r8
<
r7
<
r6
<
r5
|
B
acklinks
|
V
iew topic
|
Edit WikiText
|
More topic actions...
Topic revision: r7 - 20 Dec 2007,
ChunLi
Main
Department Home Page
Biostatistics Graduate Program
Vanderbilt University Medical Center
Main Web
Main Web Home
Search
Recent Changes
Changes
Topic list
Biostatistics Webs
Archive
Main
Sandbox
System
Register
|
Log In
Copyright © 2013-2022 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding Vanderbilt Biostatistics Wiki?
Send feedback