Publications
Accelerated failure time model for data from outcome-dependent sampling." Lifetime Data Anal 27, no. 1 (2021): 15-37.
"Ascertaining properties of weighting in the estimation of optimal treatment regimes under monotone missingness." Stat Med 39, no. 25 (2020): 3503-3520.
"Association test using Copy Number Profile Curves (CONCUR) enhances power in rare copy number variant analysis." PLoS Comput Biol 16, no. 5 (2020): e1007797.
"Combining Multiple Observational Data Sources to Estimate Causal Effects." J Am Stat Assoc 115, no. 531 (2020): 1540-1554.
"Doubly robust inference when combining probability and non-probability samples with high dimensional data." J R Stat Soc Series B Stat Methodol 82, no. 2 (2020): 445-465.
"The Long Noncoding RNA Promotes Sarcoma Metastasis by Regulating RNA Splicing Pathways." Mol Cancer Res 18, no. 10 (2020): 1534-1544.
"Modeling Between-Study Heterogeneity for Improved Replicability in Gene Signature Selection and Clinical Prediction." J Am Stat Assoc 115, no. 531 (2020): 1125-1138.
"Purity Independent Subtyping of Tumors (PurIST), A Clinically Robust, Single-sample Classifier for Tumor Subtyping in Pancreatic Cancer." Clin Cancer Res 26, no. 1 (2020): 82-92.
"Radiomics analysis using stability selection supervised component analysis for right-censored survival data." Comput Biol Med 124 (2020): 103959.
"Robust kernel association testing (RobKAT)." Genet Epidemiol 44, no. 3 (2020): 272-282.
"Semiparametric estimation of structural failure time models in continuous-time processes." Biometrika 107, no. 1 (2020): 123-136.
"Application of a sequential multiple assignment randomized trial (SMART) design in older patients with chronic lymphocytic leukemia." Ann Oncol 30, no. 4 (2019): 542-550.
"Bayesian Variable Selection for Pareto Regression Models with Latent Multivariate Log Gamma Process with Applications to Earthquake Magnitudes." Geosciences (Basel) 9, no. 4 (2019).
"Differential damage and repair of DNA-adducts induced by anti-cancer drug cisplatin across mouse organs." Nat Commun 10, no. 1 (2019): 309.
"Genetic analyses of diverse populations improves discovery for complex traits." Nature 570, no. 7762 (2019): 514-518.
"Genetic analyses of diverse populations improves discovery for complex traits." Nature 570, no. 7762 (2019): 514-518.
"Genome analysis and pleiotropy assessment using causal networks with loss of function mutation and metabolomics." BMC Genomics 20, no. 1 (2019): 395.
"Genome analysis and pleiotropy assessment using causal networks with loss of function mutation and metabolomics." BMC Genomics 20, no. 1 (2019): 395.
"Impact of Esophageal Motion on Dosimetry and Toxicity With Thoracic Radiation Therapy." Technol Cancer Res Treat 18 (2019): 1533033819849073.
"SAFE-clustering: Single-cell Aggregated (from Ensemble) clustering for single-cell RNA-seq data." Bioinformatics 35, no. 8 (2019): 1269-1277.
"Single-nucleotide resolution analysis of nucleotide excision repair of ribosomal DNA in humans and mice." J Biol Chem 294, no. 1 (2019): 210-217.
"Single-nucleotide resolution analysis of nucleotide excision repair of ribosomal DNA in humans and mice." J Biol Chem 294, no. 1 (2019): 210-217.
"ASSESSING ROBUSTNESS OF CLASSIFICATION USING ANGULAR BREAKDOWN POINT." Ann Stat 46, no. 6B (2018): 3362-3389.
"Modeling survival distribution as a function of time to treatment discontinuation: A dynamic treatment regime approach." Biometrics 74, no. 3 (2018): 900-909.
"Online updating method with new variables for big data streams." Can J Stat 46, no. 1 (2018): 123-146.
"Improving efficiency of parameter estimation in case-cohort studies with multivariate failure time data." Biometrics 73, no. 3 (2017): 1042-1052.
"Reporting and Guidelines in Propensity Score Analysis: A Systematic Review of Cancer and Cancer Surgical Studies." J Natl Cancer Inst 109, no. 8 (2017).
"Whole-Genome and Epigenomic Landscapes of Etiologically Distinct Subtypes of Cholangiocarcinoma." Cancer Discov 7, no. 10 (2017): 1116-1135.
"Whole-Genome and Epigenomic Landscapes of Etiologically Distinct Subtypes of Cholangiocarcinoma." Cancer Discov 7, no. 10 (2017): 1116-1135.
"Whole-Genome and Epigenomic Landscapes of Etiologically Distinct Subtypes of Cholangiocarcinoma." Cancer Discov 7, no. 10 (2017): 1116-1135.
"Whole-Genome and Epigenomic Landscapes of Etiologically Distinct Subtypes of Cholangiocarcinoma." Cancer Discov 7, no. 10 (2017): 1116-1135.
"Global copy number profiling of cancer genomes." Bioinformatics 32, no. 6 (2016): 926-8.
"Online Updating of Statistical Inference in the Big Data Setting." Technometrics 58, no. 3 (2016): 393-403.
"Onset of persistent pseudomonas aeruginosa infection in children with cystic fibrosis with interval censored data." BMC Med Res Methodol 16, no. 1 (2016): 122.
"Outcome-Dependent Sampling Design and Inference for Cox's Proportional Hazards Model." J Stat Plan Inference 178 (2016): 24-36.
"PIK3CA mutations enable targeting of a breast tumor dependency through mTOR-mediated MCL-1 translation." Sci Transl Med 8, no. 369 (2016): 369ra175.
"Seamless Phase IIa/IIb and enhanced dose-finding adaptive design." J Biopharm Stat 26, no. 5 (2016): 912-23.
"SR-HARDI: Spatially Regularizing High Angular Resolution Diffusion Imaging." J Comput Graph Stat 25, no. 4 (2016): 1195-1211.
"Statistical methods and computing for big data." Stat Interface 9, no. 4 (2016): 399-414.
"Ten Simple Rules for Effective Statistical Practice." PLoS Comput Biol 12, no. 6 (2016): e1004961.
"Bayesian Inference for Multivariate Meta-regression with a Partially Observed Within-Study Sample Covariance Matrix." J Am Stat Assoc 110, no. 510 (2015): 528-544.
"Confident difference criterion: a new Bayesian differentially expressed gene selection algorithm with applications." BMC Bioinformatics 16 (2015): 245.
"Doubly Robust Learning for Estimating Individualized Treatment with Censored Data." Biometrika 102, no. 1 (2015): 151-168.
"Effective dimension reduction for sparse functional data." Biometrika 102, no. 2 (2015): 421-437.
"On Sparse representation for Optimal Individualized Treatment Selection with Penalized Outcome Weighted Learning." Stat 4, no. 1 (2015): 59-68.
"Statistical inference for the additive hazards model under outcome-dependent sampling." Can J Stat 43, no. 3 (2015): 436-453.
"Statistical Significance of Clustering using Soft Thresholding." J Comput Graph Stat 24, no. 4 (2015): 975-993.
"Environmental and genetic contributors to salivary testosterone levels in infants." Front Endocrinol (Lausanne) 5 (2014): 187.
"FMEM: functional mixed effects modeling for the analysis of longitudinal white matter Tract data." Neuroimage 84 (2014): 753-64.
"Multivariate longitudinal shape analysis of human lateral ventricles during the first twenty-four months of life." PLoS One 9, no. 9 (2014): e108306.
" On Varying-coefficient Independence Screening for High-dimensional Varying-coefficient Models." Stat Sin 24, no. 4 (2014): 1735-1752.
"Efficient semiparametric estimation of short-term and long-term hazard ratios with right-censored data." Biometrics 69, no. 4 (2013): 840-9.
"Evaluating Statistical Hypotheses Using Weakly-Identifiable Estimating Functions." Scand Stat Theory Appl 40, no. 2 (2013): 256-273.
"A longitudinal functional analysis framework for analysis of white matter tract statistics." Inf Process Med Imaging 23 (2013): 220-31.
"Nomenclature for alleles of the thiopurine methyltransferase gene." Pharmacogenet Genomics 23, no. 4 (2013): 242-8.
"Semiparametric inference on the penetrances of rare genetic mutations based on a case-family design." J Stat Plan Inference 143, no. 2 (2013): 368-377.
"snplist: Tools to create gene sets (R).. 0.12 ed., 2013.
VARYING COEFFICIENT MODEL FOR MODELING DIFFUSION TENSORS ALONG WHITE MATTER TRACTS." Ann Appl Stat 7, no. 1 (2013): 102-125.
"Local Polynomial Regression for Symmetric Positive Definite Matrices." J R Stat Soc Series B Stat Methodol 74, no. 4 (2012): 697-719.
"Marginal hazard regression for correlated failure time data with auxiliary covariates." Lifetime Data Anal 18, no. 1 (2012): 116-38.
"Meta-analysis methods and models with applications in evaluation of cholesterol-lowering drugs." Stat Med 31, no. 28 (2012): 3597-616.
"Power and sample size calculation for microarray studies." J Biopharm Stat 22, no. 1 (2012): 30-42.
" Variable selection for covariate-adjusted semiparametric inference in randomized clinical trials." Stat Med 31, no. 29 (2012): 3789-804.
"Bayesian design of noninferiority trials for medical devices using historical data." Biometrics 67, no. 3 (2011): 1163-70.
"Bayesian estimation of semiparametric nonlinear dynamic factor analysis models using the Dirichlet process prior." Br J Math Stat Psychol 64, no. Pt 1 (2011): 69-106.
" SNPpy--database management for SNP data from genome wide association studies." PLoS One 6, no. 10 (2011): e24982.
"