Reexamining Dis/Similarity-Based Tests for Rare-Variant Association with Case-Control Samples.

TitleReexamining Dis/Similarity-Based Tests for Rare-Variant Association with Case-Control Samples.
Publication TypeJournal Article
Year of Publication2018
AuthorsWang, Charlotte, Jung-Ying Tzeng, Pei-Zhen Wu, Martin Preisig, and Chuhsing Kate Hsiao
Date Published2018 May
KeywordsAlgorithms, Case-Control Studies, Computer Simulation, Genetic Association Studies, Genetic Variation, Humans, Models, Genetic, Models, Statistical, Phenotype, Sample Size

A properly designed distance-based measure can capture informative genetic differences among individuals with different phenotypes and can be used to detect variants responsible for the phenotypes. To detect associated variants, various tests have been designed to contrast genetic dissimilarity or similarity scores of certain subject groups in different ways, among which the most widely used strategy is to quantify the difference between the within-group genetic dissimilarity/similarity (, case-case and control-control similarities) and the between-group dissimilarity/similarity (, case-control similarities). While it has been noted that for common variants, the within-group and the between-group measures should all be included; in this work, we show that for rare variants, comparison based on the two within-group measures can more effectively quantify the genetic difference between cases and controls. The between-group measure tends to overlap with one of the two within-group measures for rare variants, although such overlap is not present for common variants. Consequently, a dissimilarity or similarity test that includes the between-group information tends to attenuate the association signals and leads to power loss. Based on these findings, we propose a dissimilarity test that compares the degree of SNP dissimilarity within cases to that within controls to better characterize the difference between two disease phenotypes. We provide the statistical properties, asymptotic distribution, and computation details for a small sample size of the proposed test. We use simulated and real sequence data to assess the performance of the proposed test, comparing it with other rare-variant methods including those similarity-based tests that use both within-group and between-group information. As similarity-based approaches serve as one of the dominating approaches in rare-variant analysis, our results provide some insight for the effective detection of rare variants.

Alternate JournalGenetics
Original PublicationReexamining dis/similarity-based tests for rare-variant association with case-control samples.
PubMed ID29545466
PubMed Central IDPMC5937191
Grant ListP01 CA142538 / CA / NCI NIH HHS / United States