Publications
"
Improving the efficiency of estimation in the additive hazards model for stratified case-cohort design with multiple diseases." Stat Med 35, no. 2 (2016): 282-93.
"Development and Validation of a Natural Language Processing Tool to Generate the CONSORT Reporting Checklist for Randomized Clinical Trials." JAMA Netw Open 3, no. 10 (2020): e2014661.
"Meta-analysis of sequencing studies with heterogeneous genetic associations." Genet Epidemiol 38, no. 5 (2014): 389-401.
"Analysis of untyped SNPs: maximum likelihood and imputation methods." Genet Epidemiol 34, no. 8 (2010): 803-15.
"A general framework for studying genetic effects and gene-environment interactions with missing data." Biostatistics 11, no. 4 (2010): 583-98.
"Estimating individualized treatment regimes from crossover designs." Biometrics 76, no. 3 (2020): 778-788.
"Semiparametric regression for the weighted composite endpoint of recurrent and terminal events." Biostatistics 17, no. 2 (2016): 390-403.
"Whole-Genome and Epigenomic Landscapes of Etiologically Distinct Subtypes of Cholangiocarcinoma." Cancer Discov 7, no. 10 (2017): 1116-1135.
"Across-Platform Imputation of DNA Methylation Levels Incorporating Nonlocal Information Using Penalized Functional Regression." Genet Epidemiol 40, no. 4 (2016): 333-40.
"The association between copy number aberration, DNA methylation and gene expression in tumor samples." Nucleic Acids Res 46, no. 6 (2018): 3009-3018.
"Optimizing delivery of a behavioral pain intervention in cancer patients using a sequential multiple assignment randomized trial SMART." Contemp Clin Trials 57 (2017): 51-57.
"Auxiliary variable-enriched biomarker-stratified design." Stat Med 37, no. 30 (2018): 4610-4635.
"Sample size/power calculation for stratified case-cohort design." Stat Med 33, no. 23 (2014): 3973-85.
"Marginal additive hazards model for case-cohort studies with multiple disease outcomes: an application to the Atherosclerosis Risk in Communities (ARIC) study." Biostatistics 14, no. 1 (2013): 28-41.
"Marginal hazard regression for correlated failure time data with auxiliary covariates." Lifetime Data Anal 18, no. 1 (2012): 116-38.
"Kappa statistic for clustered dichotomous responses from physicians and patients." Stat Med 32, no. 21 (2013): 3700-19.
"Improving the efficiency of estimation in the additive hazards model for stratified case-cohort design with multiple diseases." Stat Med 35, no. 2 (2016): 282-93.
"A unification of models for meta-analysis of diagnostic accuracy studies without a gold standard." Biometrics 71, no. 2 (2015): 538-47.
"Marginal additive hazards model for case-cohort studies with multiple disease outcomes: an application to the Atherosclerosis Risk in Communities (ARIC) study." Biostatistics 14, no. 1 (2013): 28-41.
"Optimal treatment regimes for survival endpoints using a locally-efficient doubly-robust estimator from a classification perspective." Lifetime Data Anal 23, no. 4 (2017): 585-604.
"Doubly-robust estimators of treatment-specific survival distributions in observational studies with stratified sampling." Biometrics 69, no. 4 (2013): 830-9.
"Optimal treatment regimes for survival endpoints using a locally-efficient doubly-robust estimator from a classification perspective." Lifetime Data Anal 23, no. 4 (2017): 585-604.
"A trivariate meta-analysis of diagnostic studies accounting for prevalence and non-evaluable subjects: re-evaluation of the meta-analysis of coronary CT angiography studies." BMC Med Res Methodol 14 (2014): 128.
"A unification of models for meta-analysis of diagnostic accuracy studies without a gold standard." Biometrics 71, no. 2 (2015): 538-47.
"A practical Bayesian adaptive design incorporating data from historical controls." Stat Med 37, no. 27 (2018): 4054-4070.
"Subgroup detection and sample size calculation with proportional hazards regression for survival data." Stat Med 36, no. 29 (2017): 4646-4659.
"Medical records-based postmarketing safety evaluation of rare events with uncertain status." J Biopharm Stat 23, no. 4 (2013): 744-55.
"Secondary outcome analysis for data from an outcome-dependent sampling design." Stat Med 37, no. 15 (2018): 2321-2337.
"Comparative effectiveness of oxaliplatin vs non-oxaliplatin-containing adjuvant chemotherapy for stage III colon cancer." J Natl Cancer Inst 104, no. 3 (2012): 211-27.
"Network meta-analysis of margin threshold for women with ductal carcinoma in situ." J Natl Cancer Inst 104, no. 7 (2012): 507-16.
"On variance estimate for covariate adjustment by propensity score analysis." Stat Med 35, no. 20 (2016): 3537-48.
"On the substructure controls in rare variant analysis: Principal components or variance components?" Genet Epidemiol 42, no. 3 (2018): 276-287.
"Hypothesis testing for two-stage designs with over or under enrollment." Stat Med 34, no. 16 (2015): 2417-26.
"Assessing temporal agreement between central and local progression-free survival times." Stat Med 34, no. 5 (2015): 844-58.
"Analysis of untyped SNPs: maximum likelihood and imputation methods." Genet Epidemiol 34, no. 8 (2010): 803-15.
"Inference for optimal dynamic treatment regimes using an adaptive m-out-of-n bootstrap scheme." Biometrics 69, no. 3 (2013): 714-23.
"Using pilot data to size a two-arm randomized trial to find a nearly optimal personalized treatment strategy." Stat Med 35, no. 8 (2016): 1245-56.
"Meta-analysis methods and models with applications in evaluation of cholesterol-lowering drugs." Stat Med 31, no. 28 (2012): 3597-616.
"Parameter estimation in Cox models with missing failure indicators and the OPPERA study." Stat Med 34, no. 30 (2015): 3984-96.
"Complete effect-profile assessment in association studies with multiple genetic and multiple environmental factors." Genet Epidemiol 39, no. 2 (2015): 122-33.
"Assessing model fit in joint models of longitudinal and survival data with applications to cancer clinical trials." Stat Med 33, no. 27 (2014): 4715-33.
"Additive mixed effect model for clustered failure time data." Biometrics 67, no. 4 (2011): 1340-51.
"FADTTS: functional analysis of diffusion tensor tract statistics." Neuroimage 56, no. 3 (2011): 1412-25.
"Bayesian probability of success for clinical trials using historical data." Stat Med 34, no. 2 (2015): 249-64.
"SCOPE: A Normalization and Copy-Number Estimation Method for Single-Cell DNA Sequencing." Cell Syst 10, no. 5 (2020): 445-452.e6.
"Look before you leap: systematic evaluation of tree-based statistical methods in subgroup identification." J Biopharm Stat 29, no. 6 (2019): 1082-1102.
"Efficient methods for signal detection from correlated adverse events in clinical trials." Biometrics 75, no. 3 (2019): 1000-1008.
"Sample size and power determination in joint modeling of longitudinal and survival data." Stat Med 30, no. 18 (2011): 2295-309.
"Inference on treatment effects from a randomized clinical trial in the presence of premature treatment discontinuation: the SYNERGY trial." Biostatistics 12, no. 2 (2011): 258-69.
"Multivariate varying coefficient models for DTI tract statistics." Med Image Comput Comput Assist Interv 13, no. Pt 1 (2010): 690-7.
"Rare Variants Association Analysis in Large-Scale Sequencing Studies at the Single Locus Level." PLoS Comput Biol 12, no. 6 (2016): e1004993.
"Genetic association analysis under complex survey sampling: the Hispanic Community Health Study/Study of Latinos." Am J Hum Genet 95, no. 6 (2014): 675-88.
"The use of Bayesian hierarchical models for adaptive randomization in biomarker-driven phase II studies." J Biopharm Stat 25, no. 1 (2015): 66-88.
"Bayesian lasso for semiparametric structural equation models." Biometrics 68, no. 2 (2012): 567-77.
"Sample size estimation in educational intervention trials with subgroup heterogeneity in only one arm." Stat Med 32, no. 12 (2013): 2140-54.
"Bayesian gamma frailty models for survival data with semi-competing risks and treatment switching." Lifetime Data Anal 20, no. 1 (2014): 76-105.
"A spatial dirichlet process mixture model for clustering population genetics data." Biometrics 67, no. 2 (2011): 381-90.
"Kappa statistic for clustered dichotomous responses from physicians and patients." Stat Med 32, no. 21 (2013): 3700-19.
"Incorporating covariates in skewed functional data models." Biostatistics 16, no. 3 (2015): 413-26.
"Analysis of secondary phenotypes in multigroup association studies." Biometrics 76, no. 2 (2020): 606-618.
"Incorporating Functional Information in Tests of Excess De Novo Mutational Load." Am J Hum Genet 97, no. 2 (2015): 272-83.
"Tests of trend between disease outcomes and ordinal covariates discretized from underlying continuous variables: simulation studies and applications to NHANES 2007-2008." BMC Med Res Methodol 19, no. 1 (2019): 2.
"Quantifying the average of the time-varying hazard ratio via a class of transformations." Lifetime Data Anal 21, no. 2 (2015): 259-79.
"Analysis of untyped SNPs: maximum likelihood and imputation methods." Genet Epidemiol 34, no. 8 (2010): 803-15.
"Using decision lists to construct interpretable and parsimonious treatment regimes." Biometrics 71, no. 4 (2015): 895-904.
"Semiparametric inference for a two-stage outcome-dependent sampling design with interval-censored failure time data." Lifetime Data Anal 26, no. 1 (2020): 85-108.
"Testing and estimation in marker-set association study using semiparametric quantile regression kernel machine." Biometrics 72, no. 2 (2016): 364-71.
"Semiparametric regression for the weighted composite endpoint of recurrent and terminal events." Biostatistics 17, no. 2 (2016): 390-403.
"Sparse meta-analysis with high-dimensional data." Biostatistics 17, no. 2 (2016): 205-20.
"Analysis of multiple survival events in generalized case-cohort designs." Biometrics 74, no. 4 (2018): 1250-1260.
"Multivariate phenotype association analysis by marker-set kernel machine regression." Genet Epidemiol 36, no. 7 (2012): 686-95.
"Proportional exponentiated link transformed hazards (ELTH) models for discrete time survival data with application." Lifetime Data Anal 22, no. 1 (2016): 38-62.
"A general framework for studying genetic effects and gene-environment interactions with missing data." Biostatistics 11, no. 4 (2010): 583-98.
"Estimating the subgroup and testing for treatment effect in a post-hoc analysis of a clinical trial with a biomarker." J Biopharm Stat 29, no. 4 (2019): 685-695.
"Phase II cancer clinical trials with heterogeneous patient populations." J Biopharm Stat 22, no. 2 (2012): 312-28.
"A smoothing-based goodness-of-fit test of covariance for functional data." Biometrics 75, no. 2 (2019): 562-571.
"Fixed and random effects selection in mixed effects models." Biometrics 67, no. 2 (2011): 495-503.
"Improved detection of epigenomic marks with mixed-effects hidden Markov models." Biometrics 75, no. 4 (2019): 1401-1413.
"Inference for optimal dynamic treatment regimes using an adaptive m-out-of-n bootstrap scheme." Biometrics 69, no. 3 (2013): 714-23.
"Mapping eQTL by leveraging multiple tissues and DNA methylation." BMC Bioinformatics 18, no. 1 (2017): 455.
"Assessing temporal agreement between central and local progression-free survival times." Stat Med 34, no. 5 (2015): 844-58.
"Continual reassessment method with regularization in phase I clinical trials." J Biopharm Stat 30, no. 6 (2020): 964-978.
"Spatial regression with covariate measurement error: A semiparametric approach." Biometrics 72, no. 3 (2016): 678-86.
"A global logrank test for adaptive treatment strategies based on observational studies." Stat Med 33, no. 5 (2014): 760-71.
"Using pilot data to size a two-arm randomized trial to find a nearly optimal personalized treatment strategy." Stat Med 35, no. 8 (2016): 1245-56.
"Optimal treatment regimes for survival endpoints using a locally-efficient doubly-robust estimator from a classification perspective." Lifetime Data Anal 23, no. 4 (2017): 585-604.
"Block-diagonal discriminant analysis and its bias-corrected rules." Stat Appl Genet Mol Biol 12, no. 3 (2013): 347-59.
"Independence screening for high dimensional nonlinear additive ODE models with applications to dynamic gene regulatory networks." Stat Med 37, no. 17 (2018): 2630-2644.
"Optimal estimation for regression models on τ-year survival probability." J Biopharm Stat 25, no. 3 (2015): 539-47.
"A new Bayesian joint model for longitudinal count data with many zeros, intermittent missingness, and dropout with applications to HIV prevention trials." Stat Med 38, no. 30 (2019): 5565-5586.
"Sample size calculation for cluster randomization trials with a time-to-event endpoint." Stat Med 39, no. 25 (2020): 3608-3623.
"Component-wise gradient boosting and false discovery control in survival analysis with high-dimensional covariates." Bioinformatics 32, no. 1 (2016): 50-7.
"Effective SNP ranking improves the performance of eQTL mapping." Genet Epidemiol 44, no. 6 (2020): 611-619.
"Pattern mixture models for clinical validation of biomarkers in the presence of missing data." Stat Med 36, no. 19 (2017): 2994-3004.
"On optimal treatment regimes selection for mean survival time." Stat Med 34, no. 7 (2015): 1169-84.
"Bayesian design of superiority clinical trials for recurrent events data with applications to bleeding and transfusion events in myelodyplastic syndrome." Biometrics 70, no. 4 (2014): 1003-13.
"Robust test method for time-course microarray experiments." BMC Bioinformatics 11 (2010): 391.
"Time-varying latent effect model for longitudinal data with informative observation times." Biometrics 68, no. 4 (2012): 1093-102.
"Identifying individual risk rare variants using protein structure guided local tests (POINT)." PLoS Comput Biol 15, no. 2 (2019): e1006722.
"