Your browser doesn't support javascript.
loading
DGH-GO: dissecting the genetic heterogeneity of complex diseases using gene ontology.
Asif, Muhammad; Martiniano, Hugo F M C; Lamurias, Andre; Kausar, Samina; Couto, Francisco M.
Afiliação
  • Asif M; Biomedical Data Science Lab, Department of Bioinformatics and Biotechnology, Government College University Faisalabad, Faisalabad, 38000, Pakistan. muhasif123@gmail.com.
  • Martiniano HFMC; LASIGE, Departamento de Informática, Faculdade de Ciências, Universidade de Lisboa, Lisboa, Portugal. muhasif123@gmail.com.
  • Lamurias A; Instituto Nacional de Saúde Doutor Ricardo Jorge, Avenida Padre Cruz, 1649-016, Lisbon, Portugal.
  • Kausar S; BioISI - Instituto de Biosistemas e Ciências Integrativas, Faculdade de Ciências, Universidade de Lisboa, 1749-016, Lisboa, Portugal.
  • Couto FM; Department of Computer Science, Aalborg University, Ålborg, Denmark.
BMC Bioinformatics ; 24(1): 171, 2023 Apr 26.
Article em En | MEDLINE | ID: mdl-37101154
ABSTRACT

BACKGROUND:

Complex diseases such as neurodevelopmental disorders (NDDs) exhibit multiple etiologies. The multi-etiological nature of complex-diseases emerges from distinct but functionally similar group of genes. Different diseases sharing genes of such groups show related clinical outcomes that further restrict our understanding of disease mechanisms, thus, limiting the applications of personalized medicine approaches to complex genetic disorders.

RESULTS:

Here, we present an interactive and user-friendly application, called DGH-GO. DGH-GO allows biologists to dissect the genetic heterogeneity of complex diseases by stratifying the putative disease-causing genes into clusters that may contribute to distinct disease outcome development. It can also be used to study the shared etiology of complex-diseases. DGH-GO creates a semantic similarity matrix for the input genes by using Gene Ontology (GO). The resultant matrix can be visualized in 2D plots using different dimension reduction methods (T-SNE, Principal component analysis, umap and Principal coordinate analysis). In the next step, clusters of functionally similar genes are identified from genes functional similarities assessed through GO. This is achieved by employing four different clustering methods (K-means, Hierarchical, Fuzzy and PAM). The user may change the clustering parameters and explore their effect on stratification immediately. DGH-GO was applied to genes disrupted by rare genetic variants in Autism Spectrum Disorder (ASD) patients. The analysis confirmed the multi-etiological nature of ASD by identifying four clusters of genes that were enriched for distinct biological mechanisms and clinical outcome. In the second case study, the analysis of genes shared by different NDDs showed that genes causing multiple disorders tend to aggregate in similar clusters, indicating a possible shared etiology.

CONCLUSION:

DGH-GO is a user-friendly application that allows biologists to study the multi-etiological nature of complex diseases by dissecting their genetic heterogeneity. In summary, functional similarities, dimension reduction and clustering methods, coupled with interactive visualization and control over analysis allows biologists to explore and analyze their datasets without requiring expert knowledge on these methods. The source code of proposed application is available at https//github.com/Muh-Asif/DGH-GO.
Assuntos
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Heterogeneidade Genética / Transtorno do Espectro Autista Limite: Humans Idioma: En Ano de publicação: 2023 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Heterogeneidade Genética / Transtorno do Espectro Autista Limite: Humans Idioma: En Ano de publicação: 2023 Tipo de documento: Article