|Title||A variable selection method for genome-wide association studies.|
|Publication Type||Journal Article|
|Year of Publication||2011|
|Authors||He, Qianchuan, and Dan-Yu Lin|
|Date Published||2011 Jan 01|
|Keywords||Computer Simulation, Genome-Wide Association Study, Linkage Disequilibrium, Logistic Models, Polymorphism, Single Nucleotide, Software|
MOTIVATION: Genome-wide association studies (GWAS) involving half a million or more single nucleotide polymorphisms (SNPs) allow genetic dissection of complex diseases in a holistic manner. The common practice of analyzing one SNP at a time does not fully realize the potential of GWAS to identify multiple causal variants and to predict risk of disease. Existing methods for joint analysis of GWAS data tend to miss causal SNPs that are marginally uncorrelated with disease and have high false discovery rates (FDRs).RESULTS: We introduce GWASelect, a statistically powerful and computationally efficient variable selection method designed to tackle the unique challenges of GWAS data. This method searches iteratively over the potential SNPs conditional on previously selected SNPs and is thus capable of capturing causal SNPs that are marginally correlated with disease as well as those that are marginally uncorrelated with disease. A special resampling mechanism is built into the method to reduce false positive findings. Simulation studies demonstrate that the GWASelect performs well under a wide spectrum of linkage disequilibrium patterns and can be substantially more powerful than existing methods in capturing causal variants while having a lower FDR. In addition, the regression models based on the GWASelect tend to yield more accurate prediction of disease risk than existing methods. The advantages of the GWASelect are illustrated with the Wellcome Trust Case-Control Consortium (WTCCC) data.AVAILABILITY: The software implementing GWASelect is available at http://www.bios.unc.edu/~lin. Access to WTCCC data: http://www.wtccc.org.uk/.
|Original Publication||A variable selection method for genome-wide association studies.|
|PubMed Central ID||PMC3025714|
|Grant List||P01 CA142538 / CA / NCI NIH HHS / United States |
R01 CA082659 / CA / NCI NIH HHS / United States
R01 CA082659-13 / CA / NCI NIH HHS / United States
1-P01-CA142538-01 / CA / NCI NIH HHS / United States
A variable selection method for genome-wide association studies.