Affiliation:
1. Department of Population Medicine Harvard Pilgrim Health Care Institute and Harvard Medical School Boston Massachusetts U.S.A.
2. Biostatistics Shared Resource Knight Cancer Institute, Oregon Health and Science University Portland Oregon U.S.A.
3. Department of Biostatistics Harvard T.H. Chan School of Public Health Boston Massachusetts U.S.A.
Abstract
AbstractWhen analyzing data combined from multiple sources (e.g., hospitals, studies), the heterogeneity across different sources must be accounted for. In this article, we consider high‐dimensional linear regression models for integrative data analysis. We propose a new adaptive clustering penalty (ACP) method to simultaneously select variables and cluster source‐specific regression coefficients with subhomogeneity. We show that the estimator based on the ACP method enjoys a strong oracle property under certain regularity conditions. We also develop an efficient algorithm based on the alternating direction method of multipliers (ADMM) for parameter estimation. We conduct simulation studies to compare the performance of the proposed method to three existing methods (a fused LASSO with adjacent fusion, a pairwise fused LASSO and a multidirectional shrinkage penalty method). Finally, we apply the proposed method to the multicentre Childhood Adenotonsillectomy Trial to identify subhomogeneity in the treatment effects across different study sites.
Funder
Agency for Healthcare Research and Quality
National Heart, Lung, and Blood Institute
National Institute of Allergy and Infectious Diseases
Subject
Statistics, Probability and Uncertainty,Statistics and Probability
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献