Your browser doesn't support javascript.
loading
Leveraging pleiotropic association using sparse group variable selection in genomics data.
Sutton, Matthew; Sugier, Pierre-Emmanuel; Truong, Therese; Liquet, Benoit.
Afiliación
  • Sutton M; Queensland University of Technology Centre for Data Science, Brisbane, Australia. matt.sutton.stat@gmail.com.
  • Sugier PE; Laboratoire De Mathématiques et de leurs Applications de PAU E2S UPPA, CNRS, Pau, France.
  • Truong T; University Paris-Saclay, UVSQ, Inserm, Gustave Roussy, CESP, Team "Exposome and Heredity", Villejuif, France.
  • Liquet B; University Paris-Saclay, UVSQ, Inserm, Gustave Roussy, CESP, Team "Exposome and Heredity", Villejuif, France.
BMC Med Res Methodol ; 22(1): 9, 2022 01 07.
Article en En | MEDLINE | ID: mdl-34996381
ABSTRACT

BACKGROUND:

Genome-wide association studies (GWAS) have identified genetic variants associated with multiple complex diseases. We can leverage this phenomenon, known as pleiotropy, to integrate multiple data sources in a joint analysis. Often integrating additional information such as gene pathway knowledge can improve statistical efficiency and biological interpretation. In this article, we propose statistical methods which incorporate both gene pathway and pleiotropy knowledge to increase statistical power and identify important risk variants affecting multiple traits.

METHODS:

We propose novel feature selection methods for the group variable selection in multi-task regression problem. We develop penalised likelihood methods exploiting different penalties to induce structured sparsity at a gene (or pathway) and SNP level across all studies. We implement an alternating direction method of multipliers (ADMM) algorithm for our penalised regression methods. The performance of our approaches are compared to a subset based meta analysis approach on simulated data sets. A bootstrap sampling strategy is provided to explore the stability of the penalised methods.

RESULTS:

Our methods are applied to identify potential pleiotropy in an application considering the joint analysis of thyroid and breast cancers. The methods were able to detect eleven potential pleiotropic SNPs and six pathways. A simulation study found that our method was able to detect more true signals than a popular competing method while retaining a similar false discovery rate.

CONCLUSION:

We developed feature selection methods for jointly analysing multiple logistic regression tasks where prior grouping knowledge is available. Our method performed well on both simulation studies and when applied to a real data analysis of multiple cancers.
Asunto(s)
Palabras clave

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Genómica / Estudio de Asociación del Genoma Completo Tipo de estudio: Prognostic_studies / Risk_factors_studies / Systematic_reviews Límite: Humans Idioma: En Revista: BMC Med Res Methodol Asunto de la revista: MEDICINA Año: 2022 Tipo del documento: Article País de afiliación: Australia

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Genómica / Estudio de Asociación del Genoma Completo Tipo de estudio: Prognostic_studies / Risk_factors_studies / Systematic_reviews Límite: Humans Idioma: En Revista: BMC Med Res Methodol Asunto de la revista: MEDICINA Año: 2022 Tipo del documento: Article País de afiliación: Australia
...