Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 3 de 3
Filtrar
Mais filtros








Base de dados
Intervalo de ano de publicação
1.
Res Sq ; 2023 Feb 14.
Artigo em Inglês | MEDLINE | ID: mdl-36824788

RESUMO

Genetic dissection of neuropsychiatric disorders can potentially reveal novel therapeutic targets. While genome-wide association studies (GWAS) have tremendously advanced our understanding, we approach a sample size bottleneck (i.e., the number of cases needed to identify >90% of all loci is impractical). Therefore, computationally enhancing GWAS on existing samples may be particularly valuable. Here, we describe DeepGWAS, a deep neural network-based method to enhance GWAS by integrating GWAS results with linkage disequilibrium and brain-related functional annotations. DeepGWAS enhanced schizophrenia (SCZ) loci by ~3X when applied to the largest European GWAS, and 21.3% enhanced loci were validated by the latest multi-ancestry GWAS. Importantly, DeepGWAS models can be transferred to other neuropsychiatric disorders. Transferring SCZ-trained models to Alzheimer's disease and major depressive disorder, we observed 1.3-17.6X detected loci compared to standard GWAS, among which 27-40% were validated by other GWAS studies. We anticipate DeepGWAS to be a powerful tool in GWAS studies.

2.
PLoS Comput Biol ; 18(5): e1010011, 2022 05.
Artigo em Inglês | MEDLINE | ID: mdl-35576194

RESUMO

Genomewide association studies (GWAS) have identified a large number of loci associated with neuropsychiatric traits, however, understanding the molecular mechanisms underlying these loci remains difficult. To help prioritize causal variants and interpret their functions, computational methods have been developed to predict regulatory effects of non-coding variants. An emerging approach to variant annotation is deep learning models that predict regulatory functions from DNA sequences alone. While such models have been trained on large publicly available dataset such as ENCODE, neuropsychiatric trait-related cell types are under-represented in these datasets, thus there is an urgent need of better tools and resources to annotate variant functions in such cellular contexts. To fill this gap, we collected a large collection of neurodevelopment-related cell/tissue types, and trained deep Convolutional Neural Networks (ResNet) using such data. Furthermore, our model, called MetaChrom, borrows information from public epigenomic consortium to improve the accuracy via transfer learning. We show that MetaChrom is substantially better in predicting experimentally determined chromatin accessibility variants than popular variant annotation tools such as CADD and delta-SVM. By combining GWAS data with MetaChrom predictions, we prioritized 31 SNPs for Schizophrenia, suggesting potential risk genes and the biological contexts where they act. In summary, MetaChrom provides functional annotations of any DNA variants in the neuro-development context and the general method of MetaChrom can also be extended to other disease-related cell or tissue types.


Assuntos
Estudo de Associação Genômica Ampla , Polimorfismo de Nucleotídeo Único , Epigenômica/métodos , Aprendizado de Máquina , Redes Neurais de Computação , Polimorfismo de Nucleotídeo Único/genética
3.
Brief Bioinform ; 23(1)2022 01 17.
Artigo em Inglês | MEDLINE | ID: mdl-34882195

RESUMO

Experimental protein function annotation does not scale with the fast-growing sequence databases. Only a tiny fraction (<0.1%) of protein sequences has experimentally determined functional annotations. Computational methods may predict protein function very quickly, but their accuracy is not very satisfactory. Based upon recent breakthroughs in protein structure prediction and protein language models, we develop GAT-GO, a graph attention network (GAT) method that may substantially improve protein function prediction by leveraging predicted structure information and protein sequence embedding. Our experimental results show that GAT-GO greatly outperforms the latest sequence- and structure-based deep learning methods. On the PDB-mmseqs testset where the train and test proteins share <15% sequence identity, our GAT-GO yields Fmax (maximum F-score) 0.508, 0.416, 0.501, and area under the precision-recall curve (AUPRC) 0.427, 0.253, 0.411 for the MFO, BPO, CCO ontology domains, respectively, much better than the homology-based method BLAST (Fmax 0.117, 0.121, 0.207 and AUPRC 0.120, 0.120, 0.163) that does not use any structure information. On the PDB-cdhit testset where the training and test proteins are more similar, although using predicted structure information, our GAT-GO obtains Fmax 0.637, 0.501, 0.542 for the MFO, BPO, CCO ontology domains, respectively, and AUPRC 0.662, 0.384, 0.481, significantly exceeding the just-published method DeepFRI that uses experimental structures, which has Fmax 0.542, 0.425, 0.424 and AUPRC only 0.313, 0.159, 0.193.


Assuntos
Biologia Computacional , Proteínas , Sequência de Aminoácidos , Área Sob a Curva , Biologia Computacional/métodos , Bases de Dados de Proteínas , Anotação de Sequência Molecular , Proteínas/química
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA