Publications
ROC curve estimation under test-result-dependent sampling." Biostatistics 14, no. 1 (2013): 160-72.
"Improving the efficiency of estimation in the additive hazards model for stratified case-cohort design with multiple diseases." Stat Med 35, no. 2 (2016): 282-93.
"A general framework for studying genetic effects and gene-environment interactions with missing data." Biostatistics 11, no. 4 (2010): 583-98.
"Meta-analysis of sequencing studies with heterogeneous genetic associations." Genet Epidemiol 38, no. 5 (2014): 389-401.
"Development and Validation of a Natural Language Processing Tool to Generate the CONSORT Reporting Checklist for Randomized Clinical Trials." JAMA Netw Open 3, no. 10 (2020): e2014661.
"Analysis of untyped SNPs: maximum likelihood and imputation methods." Genet Epidemiol 34, no. 8 (2010): 803-15.
"Estimating individualized treatment regimes from crossover designs." Biometrics 76, no. 3 (2020): 778-788.
"Semiparametric regression for the weighted composite endpoint of recurrent and terminal events." Biostatistics 17, no. 2 (2016): 390-403.
"Whole-Genome and Epigenomic Landscapes of Etiologically Distinct Subtypes of Cholangiocarcinoma." Cancer Discov 7, no. 10 (2017): 1116-1135.
"Across-Platform Imputation of DNA Methylation Levels Incorporating Nonlocal Information Using Penalized Functional Regression." Genet Epidemiol 40, no. 4 (2016): 333-40.
"The association between copy number aberration, DNA methylation and gene expression in tumor samples." Nucleic Acids Res 46, no. 6 (2018): 3009-3018.
"Optimizing delivery of a behavioral pain intervention in cancer patients using a sequential multiple assignment randomized trial SMART." Contemp Clin Trials 57 (2017): 51-57.
"Auxiliary variable-enriched biomarker-stratified design." Stat Med 37, no. 30 (2018): 4610-4635.
"Kappa statistic for clustered dichotomous responses from physicians and patients." Stat Med 32, no. 21 (2013): 3700-19.
"Improving the efficiency of estimation in the additive hazards model for stratified case-cohort design with multiple diseases." Stat Med 35, no. 2 (2016): 282-93.
"Sample size/power calculation for stratified case-cohort design." Stat Med 33, no. 23 (2014): 3973-85.
"Marginal additive hazards model for case-cohort studies with multiple disease outcomes: an application to the Atherosclerosis Risk in Communities (ARIC) study." Biostatistics 14, no. 1 (2013): 28-41.
"Marginal hazard regression for correlated failure time data with auxiliary covariates." Lifetime Data Anal 18, no. 1 (2012): 116-38.
"Doubly-robust estimators of treatment-specific survival distributions in observational studies with stratified sampling." Biometrics 69, no. 4 (2013): 830-9.
"A unification of models for meta-analysis of diagnostic accuracy studies without a gold standard." Biometrics 71, no. 2 (2015): 538-47.
"Marginal additive hazards model for case-cohort studies with multiple disease outcomes: an application to the Atherosclerosis Risk in Communities (ARIC) study." Biostatistics 14, no. 1 (2013): 28-41.
"Optimal treatment regimes for survival endpoints using a locally-efficient doubly-robust estimator from a classification perspective." Lifetime Data Anal 23, no. 4 (2017): 585-604.
"Optimal treatment regimes for survival endpoints using a locally-efficient doubly-robust estimator from a classification perspective." Lifetime Data Anal 23, no. 4 (2017): 585-604.
"A trivariate meta-analysis of diagnostic studies accounting for prevalence and non-evaluable subjects: re-evaluation of the meta-analysis of coronary CT angiography studies." BMC Med Res Methodol 14 (2014): 128.
"A unification of models for meta-analysis of diagnostic accuracy studies without a gold standard." Biometrics 71, no. 2 (2015): 538-47.
"Subgroup detection and sample size calculation with proportional hazards regression for survival data." Stat Med 36, no. 29 (2017): 4646-4659.
"A practical Bayesian adaptive design incorporating data from historical controls." Stat Med 37, no. 27 (2018): 4054-4070.
"Medical records-based postmarketing safety evaluation of rare events with uncertain status." J Biopharm Stat 23, no. 4 (2013): 744-55.
"Secondary outcome analysis for data from an outcome-dependent sampling design." Stat Med 37, no. 15 (2018): 2321-2337.
"Network meta-analysis of margin threshold for women with ductal carcinoma in situ." J Natl Cancer Inst 104, no. 7 (2012): 507-16.
"On variance estimate for covariate adjustment by propensity score analysis." Stat Med 35, no. 20 (2016): 3537-48.
"On the substructure controls in rare variant analysis: Principal components or variance components?" Genet Epidemiol 42, no. 3 (2018): 276-287.
"Comparative effectiveness of oxaliplatin vs non-oxaliplatin-containing adjuvant chemotherapy for stage III colon cancer." J Natl Cancer Inst 104, no. 3 (2012): 211-27.
"Analysis of untyped SNPs: maximum likelihood and imputation methods." Genet Epidemiol 34, no. 8 (2010): 803-15.
"Assessing temporal agreement between central and local progression-free survival times." Stat Med 34, no. 5 (2015): 844-58.
"Inference for optimal dynamic treatment regimes using an adaptive m-out-of-n bootstrap scheme." Biometrics 69, no. 3 (2013): 714-23.
"Using pilot data to size a two-arm randomized trial to find a nearly optimal personalized treatment strategy." Stat Med 35, no. 8 (2016): 1245-56.
"Hypothesis testing for two-stage designs with over or under enrollment." Stat Med 34, no. 16 (2015): 2417-26.
"Improving the efficiency of estimation in the additive hazards model for stratified case-cohort design with multiple diseases." Stat Med 35, no. 2 (2016): 282-93.
"Continual reassessment method with regularization in phase I clinical trials." J Biopharm Stat 30, no. 6 (2020): 964-978.
"Simulation of brain mass effect with an arbitrary Lagrangian and Eulerian FEM." Med Image Comput Comput Assist Interv 13, no. Pt 2 (2010): 274-81.
"TwinMARM: two-stage multiscale adaptive regression methods for twin neuroimaging data." IEEE Trans Med Imaging 31, no. 5 (2012): 1100-12.
"Mapping eQTL by leveraging multiple tissues and DNA methylation." BMC Bioinformatics 18, no. 1 (2017): 455.
"Pathway-guided identification of gene-gene interactions." Ann Hum Genet 78, no. 6 (2014): 478-91.
"Efficient estimation of the distribution of time to composite endpoint when some endpoints are only partially observed." Lifetime Data Anal 19, no. 4 (2013): 513-46.
"A robust method for estimating optimal treatment regimes." Biometrics 68, no. 4 (2012): 1010-8.
"Hypothesis testing for two-stage designs with over or under enrollment." Stat Med 34, no. 16 (2015): 2417-26.
"Multiple kernel learning with random effects for predicting longitudinal outcomes and data integration." Biometrics 71, no. 4 (2015): 918-28.
"Independence screening for high dimensional nonlinear additive ODE models with applications to dynamic gene regulatory networks." Stat Med 37, no. 17 (2018): 2630-2644.
"Empirical pathway analysis, without permutation." Biostatistics 14, no. 3 (2013): 573-85.
"A new Bayesian joint model for longitudinal count data with many zeros, intermittent missingness, and dropout with applications to HIV prevention trials." Stat Med 38, no. 30 (2019): 5565-5586.
"Semiparametric additive marginal regression models for multiple type recurrent events." Lifetime Data Anal 18, no. 4 (2012): 504-27.
"Sample size calculation for cluster randomization trials with a time-to-event endpoint." Stat Med 39, no. 25 (2020): 3608-3623.
"Improving efficiency of parameter estimation in case-cohort studies with multivariate failure time data." Biometrics 73, no. 3 (2017): 1042-1052.
"Multicategory reclassification statistics for assessing improvements in diagnostic accuracy." Biostatistics 14, no. 2 (2013): 382-94.
"Sensitivity analysis for missing outcomes in time-to-event data with covariate adjustment." J Biopharm Stat 26, no. 2 (2016): 269-79.
"Semiparametric regression analysis of interval-censored competing risks data." Biometrics 73, no. 3 (2017): 857-865.
"Bayesian meta-experimental design: evaluating cardiovascular risk in new antidiabetic therapies to treat type 2 diabetes." Biometrics 68, no. 2 (2012): 578-86.
"Multiscale adaptive marginal analysis of longitudinal neuroimaging data with time-varying covariates." Biometrics 68, no. 4 (2012): 1083-92.
"Module-based association analysis for omics data with network structure." PLoS One 10, no. 3 (2015): e0122309.
"Effective SNP ranking improves the performance of eQTL mapping." Genet Epidemiol 44, no. 6 (2020): 611-619.
"A general framework for detecting disease associations with rare variants in sequencing studies." Am J Hum Genet 89, no. 3 (2011): 354-67.
"Pattern mixture models for clinical validation of biomarkers in the presence of missing data." Stat Med 36, no. 19 (2017): 2994-3004.
"Identifying individual risk rare variants using protein structure guided local tests (POINT)." PLoS Comput Biol 15, no. 2 (2019): e1006722.
"Factor selection and structural identification in the interaction ANOVA model." Biometrics 69, no. 1 (2013): 70-9.
"Joint modeling of survival time and longitudinal outcomes with flexible random effects." Lifetime Data Anal 24, no. 1 (2018): 126-152.
"Meta-analysis for Discovering Rare-Variant Associations: Statistical Methods and Software Programs." Am J Hum Genet 97, no. 1 (2015): 35-53.
"Inference on phenotype-specific effects of genes using multivariate kernel machine regression." Genet Epidemiol 42, no. 1 (2018): 64-79.
"Partial correlation matrix estimation using ridge penalty followed by thresholding and re-estimation." Biometrics 70, no. 3 (2014): 765-73.
"Accelerated intensity frailty model for recurrent events data." Biometrics 70, no. 3 (2014): 579-87.
"Biomarker threshold adaptive designs for survival endpoints." J Biopharm Stat 28, no. 6 (2018): 1038-1054.
"A composite likelihood method for bivariate meta-analysis in diagnostic systematic reviews." Stat Methods Med Res 26, no. 2 (2017): 914-930.
"Joint skeleton estimation of multiple directed acyclic graphs for heterogeneous population." Biometrics 75, no. 1 (2019): 36-47.
"A model-based conditional power assessment for decision making in randomized controlled trial studies." Stat Med 36, no. 30 (2017): 4765-4776.
"Regression analysis for secondary response variable in a case-cohort study." Biometrics 74, no. 3 (2018): 1014-1022.
"Semiparametric regression analysis of interval-censored data with informative dropout." Biometrics 74, no. 4 (2018): 1213-1222.
"Multiplicative rates model for recurrent events in case-cohort studies." Lifetime Data Anal 26, no. 1 (2020): 134-157.
"Detection of gene-gene interactions using multistage sparse and low-rank regression." Biometrics 72, no. 1 (2016): 85-94.
"Sample size considerations of prediction-validation methods in high-dimensional data for survival outcomes." Genet Epidemiol 37, no. 3 (2013): 276-82.
"Receiver operating characteristic curves and confidence bands for support vector machines." Biometrics 77, no. 4 (2021): 1422-1430.
"Comparison of operational characteristics for binary tests with clustered data." Stat Med 34, no. 15 (2015): 2325-33.
"Sample size/power calculation for stratified case-cohort design." Stat Med 33, no. 23 (2014): 3973-85.
"Analysis of gene-gene interactions using gene-trait similarity regression." Hum Hered 78, no. 1 (2014): 17-26.
"ROC curve estimation under test-result-dependent sampling." Biostatistics 14, no. 1 (2013): 160-72.
"Efficient estimation for accelerated failure time model under case-cohort and nested case-control sampling." Biometrics 73, no. 1 (2017): 114-123.
"Probability-enhanced sufficient dimension reduction for binary classification." Biometrics 70, no. 3 (2014): 546-55.
"A moment-adjusted imputation method for measurement error models." Biometrics 67, no. 4 (2011): 1461-70.
"Studying gene and gene-environment effects of uncommon and common variants on continuous traits: a marker-set approach using gene-trait similarity regression." Am J Hum Genet 89, no. 2 (2011): 277-88.
"fastJT: An R package for robust and efficient feature selection for machine learning and genome-wide association studies." BMC Bioinformatics 20, no. 1 (2019): 333.
"Bayesian clinical trial design using historical data that inform the treatment effect." Biostatistics 20, no. 3 (2019): 400-415.
"A counterfactual p-value approach for benefit-risk assessment in clinical trials." J Biopharm Stat 25, no. 3 (2015): 508-24.
"Bayesian spatial transformation models with applications in neuroimaging data." Biometrics 69, no. 4 (2013): 1074-83.
"Maximum likelihood estimation in generalized linear models with multiple covariates subject to detection limits." Stat Med 30, no. 20 (2011): 2551-61.
"Optimal two-stage dynamic treatment regimes from a classification perspective with censored survival data." Biometrics 74, no. 4 (2018): 1180-1192.
"Multiscale adaptive generalized estimating equations for longitudinal neuroimaging data." Neuroimage 72 (2013): 91-105.
"Subgroup detection and sample size calculation with proportional hazards regression for survival data." Stat Med 36, no. 29 (2017): 4646-4659.
"Rapid enrollment design for finding the optimal dose in immunotherapy trials with ordered groups." J Biopharm Stat 29, no. 4 (2019): 625-634.
"Assessing gene-environment interactions for common and rare variants with binary traits using gene-trait similarity regression." Genetics 199, no. 3 (2015): 695-710.
"Bayesian design of biosimilars clinical programs involving multiple therapeutic indications." Biometrics 76, no. 2 (2020): 630-642.
"SGPP: spatial Gaussian predictive process models for neuroimaging data." Neuroimage 89 (2014): 70-80.
"