Publications
The use of Bayesian hierarchical models for adaptive randomization in biomarker-driven phase II studies." J Biopharm Stat 25, no. 1 (2015): 66-88.
"Bayesian gamma frailty models for survival data with semi-competing risks and treatment switching." Lifetime Data Anal 20, no. 1 (2014): 76-105.
"Bayesian design of noninferiority trials for medical devices using historical data." Biometrics 67, no. 3 (2011): 1163-70.
"Differential losses to follow-up that are outcome-dependent can vitiate a clinical trial: Simulation results." J Biopharm Stat 28, no. 4 (2018): 633-644.
"Incorporating covariates in skewed functional data models." Biostatistics 16, no. 3 (2015): 413-26.
"Semiparametric estimation of the accelerated failure time model with partly interval-censored data." Biometrics 73, no. 4 (2017): 1161-1168.
"Quantifying the average of the time-varying hazard ratio via a class of transformations." Lifetime Data Anal 21, no. 2 (2015): 259-79.
"Marginal additive hazards model for case-cohort studies with multiple disease outcomes: an application to the Atherosclerosis Risk in Communities (ARIC) study." Biostatistics 14, no. 1 (2013): 28-41.
"Using decision lists to construct interpretable and parsimonious treatment regimes." Biometrics 71, no. 4 (2015): 895-904.
"Incorporating Functional Information in Tests of Excess De Novo Mutational Load." Am J Hum Genet 97, no. 2 (2015): 272-83.
"Testing and estimation in marker-set association study using semiparametric quantile regression kernel machine." Biometrics 72, no. 2 (2016): 364-71.
"Predictive accuracy of markers or risk scores for interval censored survival data." Stat Med 39, no. 18 (2020): 2437-2446.
"Semiparametric regression for the weighted composite endpoint of recurrent and terminal events." Biostatistics 17, no. 2 (2016): 390-403.
"Meta-analysis methods and models with applications in evaluation of cholesterol-lowering drugs." Stat Med 31, no. 28 (2012): 3597-616.
"Sparse meta-analysis with high-dimensional data." Biostatistics 17, no. 2 (2016): 205-20.
"Ascertaining properties of weighting in the estimation of optimal treatment regimes under monotone missingness." Stat Med 39, no. 25 (2020): 3503-3520.
"Proportional exponentiated link transformed hazards (ELTH) models for discrete time survival data with application." Lifetime Data Anal 22, no. 1 (2016): 38-62.
"Additive mixed effect model for clustered failure time data." Biometrics 67, no. 4 (2011): 1340-51.
"A general framework for studying genetic effects and gene-environment interactions with missing data." Biostatistics 11, no. 4 (2010): 583-98.
"Sample size and power determination in joint modeling of longitudinal and survival data." Stat Med 30, no. 18 (2011): 2295-309.
"Inference on treatment effects from a randomized clinical trial in the presence of premature treatment discontinuation: the SYNERGY trial." Biostatistics 12, no. 2 (2011): 258-69.
"SCOPE: A Normalization and Copy-Number Estimation Method for Single-Cell DNA Sequencing." Cell Syst 10, no. 5 (2020): 445-452.e6.
"Fixed and random effects selection in mixed effects models." Biometrics 67, no. 2 (2011): 495-503.
"Look before you leap: systematic evaluation of tree-based statistical methods in subgroup identification." J Biopharm Stat 29, no. 6 (2019): 1082-1102.
"A spatial dirichlet process mixture model for clustering population genetics data." Biometrics 67, no. 2 (2011): 381-90.
"Kappa statistic for clustered dichotomous responses from physicians and patients." Stat Med 32, no. 21 (2013): 3700-19.
"Spatial regression with covariate measurement error: A semiparametric approach." Biometrics 72, no. 3 (2016): 678-86.
"A global logrank test for adaptive treatment strategies based on observational studies." Stat Med 33, no. 5 (2014): 760-71.
"Genetic association analysis under complex survey sampling: the Hispanic Community Health Study/Study of Latinos." Am J Hum Genet 95, no. 6 (2014): 675-88.
"Bayesian lasso for semiparametric structural equation models." Biometrics 68, no. 2 (2012): 567-77.
"Using pilot data to size a two-arm randomized trial to find a nearly optimal personalized treatment strategy." Stat Med 35, no. 8 (2016): 1245-56.
"Sample size estimation in educational intervention trials with subgroup heterogeneity in only one arm." Stat Med 32, no. 12 (2013): 2140-54.
"Efficient methods for signal detection from correlated adverse events in clinical trials." Biometrics 75, no. 3 (2019): 1000-1008.
"Assessing temporal agreement between central and local progression-free survival times." Stat Med 34, no. 5 (2015): 844-58.
"Analysis of untyped SNPs: maximum likelihood and imputation methods." Genet Epidemiol 34, no. 8 (2010): 803-15.
"Optimal treatment regimes for survival endpoints using a locally-efficient doubly-robust estimator from a classification perspective." Lifetime Data Anal 23, no. 4 (2017): 585-604.
"Block-diagonal discriminant analysis and its bias-corrected rules." Stat Appl Genet Mol Biol 12, no. 3 (2013): 347-59.
"Component-wise gradient boosting and false discovery control in survival analysis with high-dimensional covariates." Bioinformatics 32, no. 1 (2016): 50-7.
"Optimal estimation for regression models on τ-year survival probability." J Biopharm Stat 25, no. 3 (2015): 539-47.
"On optimal treatment regimes selection for mean survival time." Stat Med 34, no. 7 (2015): 1169-84.
"Bayesian design of superiority clinical trials for recurrent events data with applications to bleeding and transfusion events in myelodyplastic syndrome." Biometrics 70, no. 4 (2014): 1003-13.
"Tests of trend between disease outcomes and ordinal covariates discretized from underlying continuous variables: simulation studies and applications to NHANES 2007-2008." BMC Med Res Methodol 19, no. 1 (2019): 2.
"Multivariate phenotype association analysis by marker-set kernel machine regression." Genet Epidemiol 36, no. 7 (2012): 686-95.
"Semiparametric inference for a two-stage outcome-dependent sampling design with interval-censored failure time data." Lifetime Data Anal 26, no. 1 (2020): 85-108.
"Analysis of secondary phenotypes in multigroup association studies." Biometrics 76, no. 2 (2020): 606-618.
"A novel test to compare two treatments based on endpoints involving both nonfatal and fatal events." Pharm Stat 14, no. 4 (2015): 273-83.
"Meta-analysis of gene-level associations for rare variants based on single-variant statistics." Am J Hum Genet 93, no. 2 (2013): 236-48.
"Analysis of multiple survival events in generalized case-cohort designs." Biometrics 74, no. 4 (2018): 1250-1260.
"Phase II cancer clinical trials with heterogeneous patient populations." J Biopharm Stat 22, no. 2 (2012): 312-28.
"Hypothesis testing at the extremes: fast and robust association for high-throughput data." Biostatistics 16, no. 3 (2015): 611-25.
"Pathway-based identification of SNPs predictive of survival." Eur J Hum Genet 19, no. 6 (2011): 704-9.
"Estimating the subgroup and testing for treatment effect in a post-hoc analysis of a clinical trial with a biomarker." J Biopharm Stat 29, no. 4 (2019): 685-695.
"Inference for optimal dynamic treatment regimes using an adaptive m-out-of-n bootstrap scheme." Biometrics 69, no. 3 (2013): 714-23.
"On model selections for repeated measurement data in clinical studies." Stat Med 34, no. 10 (2015): 1621-33.
"Ten Simple Rules for Effective Statistical Practice." PLoS Comput Biol 12, no. 6 (2016): e1004961.
"Improved detection of epigenomic marks with mixed-effects hidden Markov models." Biometrics 75, no. 4 (2019): 1401-1413.
"A variable selection method for genome-wide association studies." Bioinformatics 27, no. 1 (2011): 1-8.
"A smoothing-based goodness-of-fit test of covariance for functional data." Biometrics 75, no. 2 (2019): 562-571.
"A penalized likelihood approach for investigating gene-drug interactions in pharmacogenetic studies." Biometrics 71, no. 2 (2015): 529-37.
"Simulation of brain mass effect with an arbitrary Lagrangian and Eulerian FEM." Med Image Comput Comput Assist Interv 13, no. Pt 2 (2010): 274-81.
"TwinMARM: two-stage multiscale adaptive regression methods for twin neuroimaging data." IEEE Trans Med Imaging 31, no. 5 (2012): 1100-12.
"Mapping eQTL by leveraging multiple tissues and DNA methylation." BMC Bioinformatics 18, no. 1 (2017): 455.
"Improving the efficiency of estimation in the additive hazards model for stratified case-cohort design with multiple diseases." Stat Med 35, no. 2 (2016): 282-93.
"Continual reassessment method with regularization in phase I clinical trials." J Biopharm Stat 30, no. 6 (2020): 964-978.
"Efficient estimation of the distribution of time to composite endpoint when some endpoints are only partially observed." Lifetime Data Anal 19, no. 4 (2013): 513-46.
"Hypothesis testing for two-stage designs with over or under enrollment." Stat Med 34, no. 16 (2015): 2417-26.
"Multiple kernel learning with random effects for predicting longitudinal outcomes and data integration." Biometrics 71, no. 4 (2015): 918-28.
"Independence screening for high dimensional nonlinear additive ODE models with applications to dynamic gene regulatory networks." Stat Med 37, no. 17 (2018): 2630-2644.
"Robust test method for time-course microarray experiments." BMC Bioinformatics 11 (2010): 391.
"Pathway-guided identification of gene-gene interactions." Ann Hum Genet 78, no. 6 (2014): 478-91.
"A new Bayesian joint model for longitudinal count data with many zeros, intermittent missingness, and dropout with applications to HIV prevention trials." Stat Med 38, no. 30 (2019): 5565-5586.
"Sample size calculation for cluster randomization trials with a time-to-event endpoint." Stat Med 39, no. 25 (2020): 3608-3623.
"Improving efficiency of parameter estimation in case-cohort studies with multivariate failure time data." Biometrics 73, no. 3 (2017): 1042-1052.
"Time-varying latent effect model for longitudinal data with informative observation times." Biometrics 68, no. 4 (2012): 1093-102.
"Multicategory reclassification statistics for assessing improvements in diagnostic accuracy." Biostatistics 14, no. 2 (2013): 382-94.
"Multiscale adaptive marginal analysis of longitudinal neuroimaging data with time-varying covariates." Biometrics 68, no. 4 (2012): 1083-92.
"Module-based association analysis for omics data with network structure." PLoS One 10, no. 3 (2015): e0122309.
"Effective SNP ranking improves the performance of eQTL mapping." Genet Epidemiol 44, no. 6 (2020): 611-619.
"Pattern mixture models for clinical validation of biomarkers in the presence of missing data." Stat Med 36, no. 19 (2017): 2994-3004.
"Sensitivity analysis for missing outcomes in time-to-event data with covariate adjustment." J Biopharm Stat 26, no. 2 (2016): 269-79.
"Semiparametric regression analysis of interval-censored competing risks data." Biometrics 73, no. 3 (2017): 857-865.
"Data for cancer comparative effectiveness research: past, present, and future potential." Cancer 118, no. 21 (2012): 5186-97.
"Joint skeleton estimation of multiple directed acyclic graphs for heterogeneous population." Biometrics 75, no. 1 (2019): 36-47.
"Association test using Copy Number Profile Curves (CONCUR) enhances power in rare copy number variant analysis." PLoS Comput Biol 16, no. 5 (2020): e1007797.
"Purity Independent Subtyping of Tumors (PurIST), A Clinically Robust, Single-sample Classifier for Tumor Subtyping in Pancreatic Cancer." Clin Cancer Res 26, no. 1 (2020): 82-92.
"SynthEx: a synthetic-normal-based DNA sequencing tool for copy number alteration detection and tumor heterogeneity profiling." Genome Biol 18, no. 1 (2017): 66.
"Rapid and robust resampling-based multiple-testing correction with application in a genome-wide expression quantitative trait loci study." Genetics 190, no. 4 (2012): 1511-20.
"Statistical considerations for analysis of microarray experiments." Clin Transl Sci 4, no. 6 (2011): 466-77.
"Tximeta: Reference sequence checksums for provenance identification in RNA-seq." PLoS Comput Biol 16, no. 2 (2020): e1007664.
"PreMeta: a tool to facilitate meta-analysis of rare-variant associations." BMC Genomics 18, no. 1 (2017): 160.
"Ten Simple Rules for Effective Statistical Practice." PLoS Comput Biol 12, no. 6 (2016): e1004961.
"DiNAMIC: a method to identify recurrent DNA copy number aberrations in tumors." Bioinformatics 27, no. 5 (2011): 678-85.
"Module-based association analysis for omics data with network structure." PLoS One 10, no. 3 (2015): e0122309.
"Swimming downstream: statistical analysis of differential transcript usage following Salmon quantification." F1000Res 7 (2018): 952.
"Consistency and overfitting of multi-omics methods on experimental data." Brief Bioinform 21, no. 4 (2020): 1277-1284.
"Identifying individual risk rare variants using protein structure guided local tests (POINT)." PLoS Comput Biol 15, no. 2 (2019): e1006722.
"A framework for understanding cancer comparative effectiveness research data needs." J Clin Epidemiol 65, no. 11 (2012): 1150-8.
"Comparative effectiveness of oxaliplatin vs non-oxaliplatin-containing adjuvant chemotherapy for stage III colon cancer." J Natl Cancer Inst 104, no. 3 (2012): 211-27.
"Data for cancer comparative effectiveness research: past, present, and future potential." Cancer 118, no. 21 (2012): 5186-97.
" "