Publications
Provider-based research networks and diffusion of surgical technologies among patients with early-stage kidney cancer." Cancer 121, no. 6 (2015): 836-43.
"Optimal estimation for regression models on τ-year survival probability." J Biopharm Stat 25, no. 3 (2015): 539-47.
"Variable selection for covariate-adjusted semiparametric inference in randomized clinical trials." Stat Med 31, no. 29 (2012): 3789-804.
"Robust kernel association testing (RobKAT)." Genet Epidemiol 44, no. 3 (2020): 272-282.
"Estimation of treatment effect for the sequential parallel design." Stat Med 30, no. 30 (2011): 3496-506.
"Comparison of operational characteristics for binary tests with clustered data." Stat Med 34, no. 15 (2015): 2325-33.
"Assessing the dependence of sensitivity and specificity on prevalence in meta-analysis." Biostatistics 12, no. 4 (2011): 710-22.
"Simulation of brain mass effect with an arbitrary Lagrangian and Eulerian FEM." Med Image Comput Comput Assist Interv 13, no. Pt 2 (2010): 274-81.
"A composite likelihood method for bivariate meta-analysis in diagnostic systematic reviews." Stat Methods Med Res 26, no. 2 (2017): 914-930.
"Performance of rapid influenza H1N1 diagnostic tests: a meta-analysis." Influenza Other Respir Viruses 6, no. 2 (2012): 80-6.
"A moment-adjusted imputation method for measurement error models." Biometrics 67, no. 4 (2011): 1461-70.
"Multivariate varying coefficient models for DTI tract statistics." Med Image Comput Comput Assist Interv 13, no. Pt 1 (2010): 690-7.
"Statistical methods for multivariate meta-analysis of diagnostic tests: An overview and tutorial." Stat Methods Med Res 25, no. 4 (2016): 1596-619.
"Bayesian design of superiority clinical trials for recurrent events data with applications to bleeding and transfusion events in myelodyplastic syndrome." Biometrics 70, no. 4 (2014): 1003-13.
"Detection of gene-gene interactions using multistage sparse and low-rank regression." Biometrics 72, no. 1 (2016): 85-94.
"Efficient semiparametric estimation of short-term and long-term hazard ratios with right-censored data." Biometrics 69, no. 4 (2013): 840-9.
"Bayesian path specific frailty models for multi-state survival data with applications." Biometrics 71, no. 3 (2015): 760-71.
"Accelerated intensity frailty model for recurrent events data." Biometrics 70, no. 3 (2014): 579-87.
"A hybrid Bayesian hierarchical model combining cohort and case-control studies for meta-analysis of diagnostic tests: Accounting for partial verification bias." Stat Methods Med Res 25, no. 6 (2016): 3015-3037.
"Basic concepts and methods for joint models of longitudinal and survival data." J Clin Oncol 28, no. 16 (2010): 2796-801.
"Doubly-robust estimators of treatment-specific survival distributions in observational studies with stratified sampling." Biometrics 69, no. 4 (2013): 830-9.
"A longitudinal functional analysis framework for analysis of white matter tract statistics." Inf Process Med Imaging 23 (2013): 220-31.
"Diseased region detection of longitudinal knee MRI data." Inf Process Med Imaging 23 (2013): 632-43.
"Multivariate network-level approach to detect interactions between large-scale functional systems." Med Image Comput Comput Assist Interv 13, no. Pt 2 (2010): 298-305.
"Multiscale adaptive marginal analysis of longitudinal neuroimaging data with time-varying covariates." Biometrics 68, no. 4 (2012): 1083-92.
"A global sensitivity test for evaluating statistical hypotheses with nonidentifiable models." Biometrics 66, no. 2 (2010): 558-66.
"A genome-wide association study identifies novel loci for paclitaxel-induced sensory peripheral neuropathy in CALGB 40101." Clin Cancer Res 18, no. 18 (2012): 5099-109.
"Alignment and mapping methodology influence transcript abundance estimation." Genome Biol 21, no. 1 (2020): 239.
"Improved detection of epigenomic marks with mixed-effects hidden Markov models." Biometrics 75, no. 4 (2019): 1401-1413.
"A general framework for integrative analysis of incomplete multiomics data." Genet Epidemiol 44, no. 7 (2020): 646-664.
"Allele-specific copy-number discovery from whole-genome and whole-exome sequencing." Nucleic Acids Res 43, no. 14 (2015): e90.
"Identifying individual risk rare variants using protein structure guided local tests (POINT)." PLoS Comput Biol 15, no. 2 (2019): e1006722.
"Global copy number profiling of cancer genomes." Bioinformatics 32, no. 6 (2016): 926-8.
"DiNAMIC: a method to identify recurrent DNA copy number aberrations in tumors." Bioinformatics 27, no. 5 (2011): 678-85.
"SynthEx: a synthetic-normal-based DNA sequencing tool for copy number alteration detection and tumor heterogeneity profiling." Genome Biol 18, no. 1 (2017): 66.
"SCOPE: A Normalization and Copy-Number Estimation Method for Single-Cell DNA Sequencing." Cell Syst 10, no. 5 (2020): 445-452.e6.
"Meta-analysis of sequencing studies with heterogeneous genetic associations." Genet Epidemiol 38, no. 5 (2014): 389-401.
"Integrative pipeline for profiling DNA copy number and inferring tumor phylogeny." Bioinformatics 34, no. 12 (2018): 2126-2128.
"Quantitative trait analysis in sequencing studies under trait-dependent sampling." Proc Natl Acad Sci U S A 110, no. 30 (2013): 12247-52.
"Single-nucleotide resolution analysis of nucleotide excision repair of ribosomal DNA in humans and mice." J Biol Chem 294, no. 1 (2019): 210-217.
"MASS: meta-analysis of score statistics for sequencing studies." Bioinformatics 29, no. 14 (2013): 1803-5.
"Alignment and mapping methodology influence transcript abundance estimation." Genome Biol 21, no. 1 (2020): 239.
"Differential damage and repair of DNA-adducts induced by anti-cancer drug cisplatin across mouse organs." Nat Commun 10, no. 1 (2019): 309.
"Swimming downstream: statistical analysis of differential transcript usage following Salmon quantification." F1000Res 7 (2018): 952.
"Integrative gene set analysis of multi-platform data with sample heterogeneity." Bioinformatics 30, no. 11 (2014): 1501-7.
"Heavy-tailed prior distributions for sequence count data: removing the noise and preserving large differences." Bioinformatics 35, no. 12 (2019): 2084-2092.
"A general framework for integrative analysis of incomplete multiomics data." Genet Epidemiol 44, no. 7 (2020): 646-664.
"SAFE-clustering: Single-cell Aggregated (from Ensemble) clustering for single-cell RNA-seq data." Bioinformatics 35, no. 8 (2019): 1269-1277.
"Observation weights unlock bulk RNA-seq tools for zero inflation and single-cell applications." Genome Biol 19, no. 1 (2018): 24.
"Sex differences in grey matter atrophy patterns among AD and aMCI patients: results from ADNI." Neuroimage 56, no. 3 (2011): 890-906.
"FADTTS: functional analysis of diffusion tensor tract statistics." Neuroimage 56, no. 3 (2011): 1412-25.
"Impact of sex and gonadal steroids on neonatal brain structure." Cereb Cortex 24, no. 10 (2014): 2721-31.
"Clinical prognostic model for older patients with advanced non-small cell lung cancer." J Geriatr Oncol 10, no. 4 (2019): 555-559.
"Inference on phenotype-specific effects of genes using multivariate kernel machine regression." Genet Epidemiol 42, no. 1 (2018): 64-79.
"Enrollment Trends and Disparity Among Patients With Lung Cancer in National Clinical Trials, 1990 to 2012." J Clin Oncol 34, no. 33 (2016): 3992-3999.
"A new Bayesian joint model for longitudinal count data with many zeros, intermittent missingness, and dropout with applications to HIV prevention trials." Stat Med 38, no. 30 (2019): 5565-5586.
"Block-diagonal discriminant analysis and its bias-corrected rules." Stat Appl Genet Mol Biol 12, no. 3 (2013): 347-59.
"Regression analysis for secondary response variable in a case-cohort study." Biometrics 74, no. 3 (2018): 1014-1022.
"Confident difference criterion: a new Bayesian differentially expressed gene selection algorithm with applications." BMC Bioinformatics 16 (2015): 245.
"Age and African-American race impact the validity and reliability of the asthma control test in persistent asthmatics." Respir Res 19, no. 1 (2018): 152.
"Single cell analysis reveals distinct immune landscapes in transplant and primary sarcomas that determine response or resistance to immunotherapy." Nat Commun 11, no. 1 (2020): 6410.
"SAFE-clustering: Single-cell Aggregated (from Ensemble) clustering for single-cell RNA-seq data." Bioinformatics 35, no. 8 (2019): 1269-1277.
"Observation weights unlock bulk RNA-seq tools for zero inflation and single-cell applications." Genome Biol 19, no. 1 (2018): 24.
"SCOPE: A Normalization and Copy-Number Estimation Method for Single-Cell DNA Sequencing." Cell Syst 10, no. 5 (2020): 445-452.e6.
"A composite likelihood method for bivariate meta-analysis in diagnostic systematic reviews." Stat Methods Med Res 26, no. 2 (2017): 914-930.
"Component-wise gradient boosting and false discovery control in survival analysis with high-dimensional covariates." Bioinformatics 32, no. 1 (2016): 50-7.
"Bayesian methods in clinical trials: a Bayesian analysis of ECOG trials E1684 and E1690." BMC Med Res Methodol 12 (2012): 183.
"Multitrial Evaluation of Progression-Free Survival as a Surrogate End Point for Overall Survival in First-Line Extensive-Stage Small-Cell Lung Cancer." J Thorac Oncol 10, no. 7 (2015): 1099-106.
" "A general framework for studying genetic effects and gene-environment interactions with missing data." Biostatistics 11, no. 4 (2010): 583-98.
"Latent class model characterization of neighborhood socioeconomic status." Cancer Causes Control 27, no. 3 (2016): 445-52.
"Spatial regression with covariate measurement error: A semiparametric approach." Biometrics 72, no. 3 (2016): 678-86.
"Enrollment Trends and Disparity Among Patients With Lung Cancer in National Clinical Trials, 1990 to 2012." J Clin Oncol 34, no. 33 (2016): 3992-3999.
"SNPpy--database management for SNP data from genome wide association studies." PLoS One 6, no. 10 (2011): e24982.
"Meta-analysis for Discovering Rare-Variant Associations: Statistical Methods and Software Programs." Am J Hum Genet 97, no. 1 (2015): 35-53.
"TEAM: efficient two-locus epistasis tests in human genome-wide association study." Bioinformatics 26, no. 12 (2010): i217-27.
"permGPU: Using graphics processing units in RNA microarray association studies." BMC Bioinformatics 11 (2010): 329.
"Swimming downstream: statistical analysis of differential transcript usage following Salmon quantification." F1000Res 7 (2018): 952.
"Analysis of untyped SNPs: maximum likelihood and imputation methods." Genet Epidemiol 34, no. 8 (2010): 803-15.
"Rapid and robust resampling-based multiple-testing correction with application in a genome-wide expression quantitative trait loci study." Genetics 190, no. 4 (2012): 1511-20.
"Observation weights unlock bulk RNA-seq tools for zero inflation and single-cell applications." Genome Biol 19, no. 1 (2018): 24.
"CRM2DIM: A SAS macro for implementing the dual-agent Bayesian continual reassessment method." Comput Methods Programs Biomed 176 (2019): 211-223.
"Meta-analysis of sequencing studies with heterogeneous genetic associations." Genet Epidemiol 38, no. 5 (2014): 389-401.
"Biclustering via sparse clustering." Biometrics 76, no. 1 (2020): 348-358.
"Complete effect-profile assessment in association studies with multiple genetic and multiple environmental factors." Genet Epidemiol 39, no. 2 (2015): 122-33.
"Empirical pathway analysis, without permutation." Biostatistics 14, no. 3 (2013): 573-85.
"I-Boost: an integrative boosting approach for predicting survival time with multiple genomics platforms." Genome Biol 20, no. 1 (2019): 52.
"A general framework for detecting disease associations with rare variants in sequencing studies." Am J Hum Genet 89, no. 3 (2011): 354-67.
"Meta-analysis of gene-level associations for rare variants based on single-variant statistics." Am J Hum Genet 93, no. 2 (2013): 236-48.
"Integrative pipeline for profiling DNA copy number and inferring tumor phylogeny." Bioinformatics 34, no. 12 (2018): 2126-2128.
"MASS: meta-analysis of score statistics for sequencing studies." Bioinformatics 29, no. 14 (2013): 1803-5.
"Incorporating covariates in skewed functional data models." Biostatistics 16, no. 3 (2015): 413-26.
"Power and sample size calculations for SNP association studies with censored time-to-event outcomes." Genet Epidemiol 36, no. 6 (2012): 538-48.
"PreMeta: a tool to facilitate meta-analysis of rare-variant associations." BMC Genomics 18, no. 1 (2017): 160.
"bcSeq: an R package for fast sequence mapping in high-throughput shRNA and CRISPR screens." Bioinformatics 34, no. 20 (2018): 3581-3583.
"A variable selection method for genome-wide association studies." Bioinformatics 27, no. 1 (2011): 1-8.
"DiNAMIC: a method to identify recurrent DNA copy number aberrations in tumors." Bioinformatics 27, no. 5 (2011): 678-85.
"Statistical considerations for analysis of microarray experiments." Clin Transl Sci 4, no. 6 (2011): 466-77.
"SynthEx: a synthetic-normal-based DNA sequencing tool for copy number alteration detection and tumor heterogeneity profiling." Genome Biol 18, no. 1 (2017): 66.
"Joint modeling of survival and longitudinal non-survival data: current methods and issues. Report of the DIA Bayesian joint modeling working group." Stat Med 34, no. 14 (2015): 2181-95.
"