Your browser doesn't support javascript.
loading
Estimation of multiple networks with common structures in heterogeneous subgroups.
Qin, Xing; Hu, Jianhua; Ma, Shuangge; Wu, Mengyun.
Afiliação
  • Qin X; School of Statistics and Information, Shanghai University of International Business and Economics, Shanghai, China.
  • Hu J; School of Statistics and Management, Shanghai University of Finance and Economics, Shanghai, China.
  • Ma S; Department of Biostatistics, Yale School of Public Health, New Haven, USA.
  • Wu M; School of Statistics and Management, Shanghai University of Finance and Economics, Shanghai, China.
J Multivar Anal ; 2022024 Jul.
Article em En | MEDLINE | ID: mdl-38433779
ABSTRACT
Network estimation has been a critical component of high-dimensional data analysis and can provide an understanding of the underlying complex dependence structures. Among the existing studies, Gaussian graphical models have been highly popular. However, they still have limitations due to the homogeneous distribution assumption and the fact that they are only applicable to small-scale data. For example, cancers have various levels of unknown heterogeneity, and biological networks, which include thousands of molecular components, often differ across subgroups while also sharing some commonalities. In this article, we propose a new joint estimation approach for multiple networks with unknown sample heterogeneity, by decomposing the Gaussian graphical model (GGM) into a collection of sparse regression problems. A reparameterization technique and a composite minimax concave penalty are introduced to effectively accommodate the specific and common information across the networks of multiple subgroups, making the proposed estimator significantly advancing from the existing heterogeneity network analysis based on the regularized likelihood of GGM directly and enjoying scale-invariant, tuning-insensitive, and optimization convexity properties. The proposed analysis can be effectively realized using parallel computing. The estimation and selection consistency properties are rigorously established. The proposed approach allows the theoretical studies to focus on independent network estimation only and has the significant advantage of being both theoretically and computationally applicable to large-scale data. Extensive numerical experiments with simulated data and the TCGA breast cancer data demonstrate the prominent performance of the proposed approach in both subgroup and network identifications.
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Idioma: En Revista: J Multivar Anal Ano de publicação: 2024 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Idioma: En Revista: J Multivar Anal Ano de publicação: 2024 Tipo de documento: Article