Publications
Observation weights unlock bulk RNA-seq tools for zero inflation and single-cell applications." Genome Biol 19, no. 1 (2018): 24.
"Integrative pipeline for profiling DNA copy number and inferring tumor phylogeny." Bioinformatics 34, no. 12 (2018): 2126-2128.
"A New Method for Detecting Associations with Rare Copy-Number Variants." PLoS Genet 11, no. 10 (2015): e1005403.
"GENE-LEVEL PHARMACOGENETIC ANALYSIS ON SURVIVAL OUTCOMES USING GENE-TRAIT SIMILARITY REGRESSION." Ann Appl Stat 8, no. 2 (2014): 1232-1255.
" Studying gene and gene-environment effects of uncommon and common variants on continuous traits: a marker-set approach using gene-trait similarity regression." Am J Hum Genet 89, no. 2 (2011): 277-88.
"Dynamic Treatment Regimes: Statistical Methods for Precision Medicine In Chapman & Hall/CRC Monographs on Statistics and Applied Probability. Boca Raton: Chapman and Hall/CRC, 2019.
Discussion of "Connections Between Survey Calibration Estimators and Semiparametric Models for Incomplete Data" by T. Lumley, P.A. Shaw & J.Y. Dai." Int Stat Rev 79, no. 2 (2011): 221-223.
"doublyRobust: Doubly robust estimation for monotonely coarsened data in longitudinal studies with dropout and/or incomplete data (R).. 2nd ed., 2013.
Missing data methods: A semiparametric perspective. In Handbook of Missing Data Methodology. Boca Raton: Chapman and Hall/CRC, 2014.
Improved doubly robust estimation when data are monotonely coarsened, with application to longitudinal studies with dropout." Biometrics 67, no. 2 (2011): 536-45.
"A global sensitivity test for evaluating statistical hypotheses with nonidentifiable models." Biometrics 66, no. 2 (2010): 558-66.
"Semiparametric regression models and sensitivity analysis of longitudinal data with nonrandom dropouts." Stat Neerl 64, no. 2 (2010): 133-156.
"Moment Adjusted Imputation for Multivariate Measurement Error Data with Applications to Logistic Regression." Comput Stat Data Anal 67 (2013): 15-24.
"A moment-adjusted imputation method for measurement error models." Biometrics 67, no. 4 (2011): 1461-70.
" Efficient Semiparametric Inference Under Two-Phase Sampling, With Applications to Genetic Association Studies." J Am Stat Assoc 112, no. 520 (2017): 1468-1476.
" Analysis of Sequence Data Under Multivariate Trait-Dependent Sampling." J Am Stat Assoc 110, no. 510 (2015): 560-572.
"Meta-analysis of sequencing studies with heterogeneous genetic associations." Genet Epidemiol 38, no. 5 (2014): 389-401.
"Meta-analysis for Discovering Rare-Variant Associations: Statistical Methods and Software Programs." Am J Hum Genet 97, no. 1 (2015): 35-53.
"PreMeta: a tool to facilitate meta-analysis of rare-variant associations." BMC Genomics 18, no. 1 (2017): 160.
"MASS: Meta-analysis of sequencing studies (C).. 5.0 ed., 2013.
PreMeta: Facilitates the Exchange of Information Between Software Packages for Meta-Analysis (C++)., 2017.
MASS: meta-analysis of score statistics for sequencing studies." Bioinformatics 29, no. 14 (2013): 1803-5.
"Empirical Likelihood for Estimating Equations with Nonignorably Missing Data." Stat Sin 24, no. 2 (2014): 723-747.
"Provider-based research networks and diffusion of surgical technologies among patients with early-stage kidney cancer." Cancer 121, no. 6 (2015): 836-43.
"Estimation of a partially linear additive model for data from an outcome-dependent sampling design with a continuous outcome." Biostatistics 17, no. 4 (2016): 663-76.
"Estimation of treatment effect for the sequential parallel design." Stat Med 30, no. 30 (2011): 3496-506.
"SPReM: Sparse Projection Regression Model For High-dimensional Linear Regression." J Am Stat Assoc 110, no. 509 (2015): 289-302.
"The association between copy number aberration, DNA methylation and gene expression in tumor samples." Nucleic Acids Res 46, no. 6 (2018): 3009-3018.
"Modelling and estimation for optimal treatment decision with interference." Stat (Int Stat Inst) 8, no. 1 (2019).
"Surface Estimation, Variable Selection, and the Nonparametric Oracle Property." Stat Sin 21, no. 2 (2011): 679-705.
"Pooled Analysis of Individual Patient Data on Concurrent Chemoradiotherapy for Stage III Non-Small-Cell Lung Cancer in Elderly Patients Compared With Younger Patients Who Participated in US National Cancer Institute Cooperative Group Studies." J Clin Oncol 35, no. 25 (2017): 2885-2892.
"Effect of Erlotinib Plus Bevacizumab vs Erlotinib Alone on Progression-Free Survival in Patients With Advanced EGFR-Mutant Non-Small Cell Lung Cancer: A Phase 2 Randomized Clinical Trial." JAMA Oncol 5, no. 10 (2019): 1448-1455.
"skda: Sparse (multicategory) kernel discriminant analysis (R).. 0.1 ed., 2013.
Variable Selection in Nonparametric Classification via Measurement Error Model Selection Likelihoods." J Am Stat Assoc 109, no. 506 (2014): 574-589.
"Longitudinal dynamic functional regression." J R Stat Soc Ser C Appl Stat 69, no. 1 (2020): 25-46.
"Alignment and mapping methodology influence transcript abundance estimation." Genome Biol 21, no. 1 (2020): 239.
"Censored Rank Independence Screening for High-dimensional Survival Data." Biometrika 101, no. 4 (2014): 799-814.
"ASYMPTOTICS FOR CHANGE-POINT MODELS UNDER VARYING DEGREES OF MIS-SPECIFICATION." Ann Stat 44, no. 1 (2016): 153-182.
" Penalized Q-Learning for Dynamic Treatment Regimens." Stat Sin 25, no. 3 (2015): 901-920.
"Semiparametric Single-Index Model for Estimating Optimal Individualized Treatment Strategy." Electron J Stat 11, no. 1 (2017): 364-384.
"On Sparse representation for Optimal Individualized Treatment Selection with Penalized Outcome Weighted Learning." Stat 4, no. 1 (2015): 59-68.
"On Varying-coefficient Independence Screening for High-dimensional Varying-coefficient Models." Stat Sin 24, no. 4 (2014): 1735-1752.
" Enrollment and Stopping Rules for Managing Toxicity Requiring Long Follow-Up in Phase II Oncology Trials." J Biopharm Stat 25, no. 6 (2015): 1206-14.
"A junction coverage compatibility score to quantify the reliability of transcript abundance estimates and annotation catalogs." Life Sci Alliance 2, no. 1 (2019).
"Robust test method for time-course microarray experiments." BMC Bioinformatics 11 (2010): 391.
"Multiple testing for gene sets from microarray experiments." BMC Bioinformatics 12 (2011): 209.
"Multiscale adaptive marginal analysis of longitudinal neuroimaging data with time-varying covariates." Biometrics 68, no. 4 (2012): 1083-92.
"Sex differences in grey matter atrophy patterns among AD and aMCI patients: results from ADNI." Neuroimage 56, no. 3 (2011): 890-906.
"BAYESIAN INFERENCE OF HIDDEN GAMMA WEAR PROCESS MODEL FOR SURVIVAL DATA WITH TIES." Stat Sin 25, no. 4 (2015): 1613-1635.
" SynthEx: a synthetic-normal-based DNA sequencing tool for copy number alteration detection and tumor heterogeneity profiling." Genome Biol 18, no. 1 (2017): 66.
"Facilitating the Calculation of the Efficient Score Using Symbolic Computing." Am Stat 72, no. 2 (2018): 199-205.
" permGPU: Using graphics processing units in RNA microarray association studies." BMC Bioinformatics 11 (2010): 329.
"A multiple imputation strategy for sequential multiple assignment randomized trials." Stat Med 33, no. 24 (2014): 4202-14.
" Using Structural Equation Modeling to Assess the Links between Tobacco Smoke Exposure, Volatile Organic Compounds, and Respiratory Function for Adolescents Aged 6 to 18 in the United States." Int J Environ Res Public Health 14, no. 10 (2017).
"Adaptive Estimation with Partially Overlapping Models." Stat Sin 26, no. 1 (2016): 235-253.
"Two-Dimensional Solution Surface for Weighted Support Vector Machines." J Comput Graph Stat 23, no. 2 (2014): 383-402.
"Probability-enhanced sufficient dimension reduction for binary classification." Biometrics 70, no. 3 (2014): 546-55.
"Determining the Number of Latent Factors in Statistical Multi-Relational Learning." J Mach Learn Res 20 (2019).
"Intrinsic Regression Models for Medial Representation of Subcortical Structures." J Am Stat Assoc 107, no. 497 (2012): 12-23.
"Diffusion tensor imaging-based characterization of brain neurodevelopment in primates." Cereb Cortex 23, no. 1 (2013): 36-48.
"A Sparse Random Projection-based Test for Overall Qualitative Treatment Effects." J Am Stat Assoc 115, no. 531 (2020): 1201-1213.
"LINEAR HYPOTHESIS TESTING FOR HIGH DIMENSIONAL GENERALIZED LINEAR MODELS." Ann Stat 47, no. 5 (2019): 2671-2703.
"Maximin Projection Learning for Optimal Treatment Decision with Heterogeneous Individualized Treatment Effects." J R Stat Soc Series B Stat Methodol 80, no. 4 (2018): 681-702.
"Robust learning for optimal treatment decision with NP-dimensionality." Electron J Stat 10 (2016): 2894-2921.
"ON TESTING CONDITIONAL QUALITATIVE TREATMENT EFFECTS." Ann Stat 47, no. 4 (2019): 2348-2377.
"HIGH-DIMENSIONAL A-LEARNING FOR OPTIMAL DYNAMIC TREATMENT REGIMES." Ann Stat 46, no. 3 (2018): 925-957.
"A Massive Data Framework for M-Estimators with Cubic-Rate." J Am Stat Assoc 113, no. 524 (2018): 1698-1709.
"Consistent Group Identification and Variable Selection in Regression with Correlated Predictors." J Comput Graph Stat 22, no. 2 (2013): 319-340.
"Predictive Blood-Based Biomarkers in Patients with Epithelial Ovarian Cancer Treated with Carboplatin and Paclitaxel with or without Bevacizumab: Results from GOG-0218." Clin Cancer Res 26, no. 6 (2020): 1288-1296.
"Q- and A-learning Methods for Estimating Optimal Dynamic Treatment Regimes." Stat Sci 29, no. 4 (2014): 640-661.
""Genome-wide association study identifies five new schizophrenia loci." Nat Genet 43, no. 10 (2011): 969-76.
Toxicity Related to Radiotherapy Dose and Targeting Strategy: A Pooled Analysis of Cooperative Group Trials of Combined Modality Therapy for Locally Advanced Non-Small Cell Lung Cancer." J Thorac Oncol 14, no. 2 (2019): 298-303.
"Online Updating of Statistical Inference in the Big Data Setting." Technometrics 58, no. 3 (2016): 393-403.
" Biomarker-based clinical trials., 2012.
Clinical trials data collection: when less is more." J Clin Oncol 28, no. 34 (2010): 5019-21.
"Comparative effectiveness of oxaliplatin vs non-oxaliplatin-containing adjuvant chemotherapy for stage III colon cancer." J Natl Cancer Inst 104, no. 3 (2012): 211-27.
"Ten-year experience with extended criteria cardiac transplantation." Circ Heart Fail 6, no. 6 (2013): 1230-8.
"Application of a sequential multiple assignment randomized trial (SMART) design in older patients with chronic lymphocytic leukemia." Ann Oncol 30, no. 4 (2019): 542-550.
" The Closure Principle Revisited., 2014.
Ascertainment, classification, and impact of neoplasm detection during prolonged treatment with dual antiplatelet therapy with prasugrel vs. clopidogrel following acute coronary syndrome." Eur Heart J 37, no. 4 (2016): 412-22.
"Research methods for clinical trials in personalized medicine: A systematic review." In Lost In Translation: Barriers to Incentives for Translational Research in Medical Sciences. Singapore: World Scientific, 2014.
"A nonparametric spatial model for periodontal data with non-random missingness." J Am Stat Assoc 108, no. 503 (2013).
"A spatial dirichlet process mixture model for clustering population genetics data." Biometrics 67, no. 2 (2011): 381-90.
"Sufficient dimension reduction via bayesian mixture modeling." Biometrics 67, no. 3 (2011): 886-95.
"Modeling Between-Study Heterogeneity for Improved Replicability in Gene Signature Selection and Clinical Prediction." J Am Stat Assoc 115, no. 531 (2020): 1125-1138.
"Purity Independent Subtyping of Tumors (PurIST), A Clinically Robust, Single-sample Classifier for Tumor Subtyping in Pancreatic Cancer." Clin Cancer Res 26, no. 1 (2020): 82-92.
"SR-HARDI: Spatially Regularizing High Angular Resolution Diffusion Imaging." J Comput Graph Stat 25, no. 4 (2016): 1195-1211.
"