Your browser doesn't support javascript.
loading
PopCluster: an algorithm to identify genetic variants with ethnicity-dependent effects.
Gurinovich, Anastasia; Bae, Harold; Farrell, John J; Andersen, Stacy L; Monti, Stefano; Puca, Annibale; Atzmon, Gil; Barzilai, Nir; Perls, Thomas T; Sebastiani, Paola.
Afiliação
  • Gurinovich A; Bioinformatics Program, Boston University, Boston, MA, USA.
  • Bae H; College of Public Health and Human Sciences, Oregon State University, Corvallis, OR, USA.
  • Farrell JJ; Department of Medicine, Boston University School of Medicine, Boston, MA, USA.
  • Andersen SL; Department of Medicine, Boston University School of Medicine, Boston, MA, USA.
  • Monti S; Department of Medicine, Boston University School of Medicine, Boston, MA, USA.
  • Puca A; Department of Medicine and Surgery, University of Salerno, Fisciano, Italy.
  • Atzmon G; Cardiovascular Research Unit, IRCCS MultiMedica, Sesto San Giovanni, Italy.
  • Barzilai N; Department of Medicine and Department of Genetics, Albert Einstein College of Medicine, Bronx, NY, USA.
  • Perls TT; Department of Medicine and Department of Genetics, Albert Einstein College of Medicine, Bronx, NY, USA.
  • Sebastiani P; Department of Medicine, Boston University School of Medicine, Boston, MA, USA.
Bioinformatics ; 35(17): 3046-3054, 2019 09 01.
Article em En | MEDLINE | ID: mdl-30624692
ABSTRACT
MOTIVATION Over the last decade, more diverse populations have been included in genome-wide association studies. If a genetic variant has a varying effect on a phenotype in different populations, genome-wide association studies applied to a dataset as a whole may not pinpoint such differences. It is especially important to be able to identify population-specific effects of genetic variants in studies that would eventually lead to development of diagnostic tests or drug discovery.

RESULTS:

In this paper, we propose PopCluster an algorithm to automatically discover subsets of individuals in which the genetic effects of a variant are statistically different. PopCluster provides a simple framework to directly analyze genotype data without prior knowledge of subjects' ethnicities. PopCluster combines logistic regression modeling, principal component analysis, hierarchical clustering and a recursive bottom-up tree parsing procedure. The evaluation of PopCluster suggests that the algorithm has a stable low false positive rate (∼4%) and high true positive rate (>80%) in simulations with large differences in allele frequencies between cases and controls. Application of PopCluster to data from genetic studies of longevity discovers ethnicity-dependent heterogeneity in the association of rs3764814 (USP42) with the phenotype. AVAILABILITY AND IMPLEMENTATION PopCluster was implemented using the R programming language, PLINK and Eigensoft software, and can be found at the following GitHub repository https//github.com/gurinovich/PopCluster with instructions on its installation and usage. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Assuntos

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Etnicidade / Estudo de Associação Genômica Ampla Tipo de estudo: Prognostic_studies Limite: Humans Idioma: En Revista: Bioinformatics Assunto da revista: INFORMATICA MEDICA Ano de publicação: 2019 Tipo de documento: Article País de afiliação: Estados Unidos

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Etnicidade / Estudo de Associação Genômica Ampla Tipo de estudo: Prognostic_studies Limite: Humans Idioma: En Revista: Bioinformatics Assunto da revista: INFORMATICA MEDICA Ano de publicação: 2019 Tipo de documento: Article País de afiliação: Estados Unidos