Search | BVS Aleitamento Materno

The STRING database in 2023: protein-protein association networks and functional enrichment analyses for any sequenced genome of interest.

Szklarczyk, Damian; Kirsch, Rebecca; Koutrouli, Mikaela; Nastou, Katerina; Mehryary, Farrokh; Hachilif, Radja; Gable, Annika L; Fang, Tao; Doncheva, Nadezhda T; Pyysalo, Sampo; Bork, Peer; Jensen, Lars J; von Mering, Christian.

Nucleic Acids Res ; 51(D1): D638-D646, 2023 01 06.

Article in English | MEDLINE | ID: mdl-36370105

ABSTRACT

Much of the complexity within cells arises from functional and regulatory interactions among proteins. The core of these interactions is increasingly known, but novel interactions continue to be discovered, and the information remains scattered across different database resources, experimental modalities and levels of mechanistic detail. The STRING database (https://string-db.org/) systematically collects and integrates protein-protein interactions, both physical interactions and functional associations. The data originate from a number of sources: automated text mining of the scientific literature, computational interaction predictions from co-expression, conserved genomic context, databases of interaction experiments and known complexes/pathways from curated sources. All of these interactions are critically assessed, scored, and subsequently automatically transferred to less well-studied organisms using hierarchical orthology information. The data can be accessed via the website, but also programmatically and via bulk downloads. The most recent developments in STRING (version 12.0) are: (i) it is now possible to create, browse and analyze a full interaction network for any novel genome of interest, by submitting its complement of encoded proteins; (ii) the co-expression channel now uses variational auto-encoders to predict interactions, and it covers two new sources, single-cell RNA-seq and experimental proteomics data; and (iii) the confidence in each experimentally derived interaction is now estimated based on the detection method used, and communicated to the user in the web interface. Furthermore, STRING continues to enhance its facilities for functional enrichment analysis, which are now also fully available for user-submitted genomes.

Subjects

Protein Interaction Mapping; Proteins; Protein Interaction Mapping/methods; Databases, Protein; Proteins/genetics; Proteins/metabolism; Genomics; Proteomics; User-Computer Interface
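
The programmatic access mentioned in the abstract above can be sketched as follows. This is a minimal illustration, not the authors' code: it assumes the public STRING REST interface at https://string-db.org/api with its `network` method and the `identifiers`, `species`, `required_score` and `caller_identity` parameters. The gene names, the score cutoff of 400 and the caller identity string are placeholders chosen for this example, and the live endpoint may serve a later release than version 12.0.

```python
import csv
import io
import urllib.request
from urllib.parse import urlencode

# Base URL of the STRING REST interface (assumed to be the current public release).
STRING_API_BASE = "https://string-db.org/api"


def build_network_url(identifiers, species, required_score=400, output_format="tsv"):
    """Build a request URL for the STRING 'network' method.

    No network access happens here; the function only assembles the URL.
    """
    params = {
        # STRING expects multiple identifiers separated by carriage returns.
        "identifiers": "\r".join(identifiers),
        # NCBI taxonomy identifier, e.g. 9606 for human.
        "species": species,
        # Combined-score cutoff on a 0-1000 scale (placeholder value).
        "required_score": required_score,
        # Hypothetical label identifying this script to the server.
        "caller_identity": "example_script",
    }
    return f"{STRING_API_BASE}/{output_format}/network?{urlencode(params)}"


def fetch_network(url):
    """Download a tab-separated network table and return its rows as dicts.

    Requires internet access; it is defined here but not called.
    """
    with urllib.request.urlopen(url) as response:
        text = response.read().decode("utf-8")
    return list(csv.DictReader(io.StringIO(text), delimiter="\t"))
```

Calling `build_network_url(["TP53", "MDM2"], 9606)` yields a URL that can be passed to `fetch_network` or pasted into a browser. Keeping URL construction separate from the download makes the request easy to inspect before any traffic is sent.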

Enhancing coevolutionary signals in protein-protein interaction prediction through clade-wise alignment integration.

Fang, Tao; Szklarczyk, Damian; Hachilif, Radja; von Mering, Christian.

Sci Rep ; 14(1): 6009, 2024 03 12.

Article in English | MEDLINE | ID: mdl-38472223

ABSTRACT

Protein-protein interactions (PPIs) play essential roles in most biological processes. The binding interfaces between interacting proteins impose evolutionary constraints that have successfully been employed to predict PPIs from multiple sequence alignments (MSAs). To construct MSAs, critical choices have to be made: how to ensure the reliable identification of orthologs, and how to optimally balance the need for large alignments versus sufficient alignment quality. Here, we propose a divide-and-conquer strategy for MSA generation: instead of building a single, large alignment for each protein, multiple distinct alignments are constructed under distinct clades in the tree of life. Coevolutionary signals are searched separately within these clades, and are only subsequently integrated using machine learning techniques. We find that this strategy markedly improves overall prediction performance, concomitant with better alignment quality. Using the popular DCA algorithm to systematically search pairs of such alignments, a genome-wide all-against-all interaction scan in a bacterial genome is demonstrated. Given the recent successes of AlphaFold in predicting direct PPIs at atomic detail, a discover-and-refine approach is proposed: our method could provide a fast and accurate strategy for pre-screening the entire genome, submitting to AlphaFold only promising interaction candidates, thus reducing false positives as well as computation time.

Subjects

Algorithms; Proteins; Sequence Alignment; Proteins/genetics; Biological Evolution; Phylogeny; Computational Biology/methods
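
The integrate-then-pre-screen idea in the abstract above can be illustrated with a toy sketch. It is not the authors' method: the abstract does not specify the machine learning model, so a fixed logistic combination stands in for it here, and the clade names, weights, bias and threshold are all invented for illustration. The per-clade input scores are assumed to come from some coevolution analysis (for example DCA) that was run separately within each clade.

```python
import math

# Hypothetical clades and weights; a trained model would learn these from data.
CLADE_WEIGHTS = {
    "Proteobacteria": 1.2,
    "Firmicutes": 0.8,
    "Actinobacteria": 0.5,
}
BIAS = -1.0  # hypothetical intercept


def integrate_clade_scores(scores):
    """Combine per-clade coevolution scores into one value between 0 and 1.

    `scores` maps clade name -> score for one protein pair. Clades without
    a usable alignment are simply absent and contribute nothing.
    """
    z = BIAS + sum(
        weight * scores[clade]
        for clade, weight in CLADE_WEIGHTS.items()
        if clade in scores
    )
    return 1.0 / (1.0 + math.exp(-z))


def prescreen(candidates, threshold=0.5):
    """Keep protein pairs whose integrated score reaches the threshold.

    `candidates` maps (protein_a, protein_b) -> per-clade score dict.
    Returns (pair, score) tuples sorted from highest to lowest score, i.e.
    the short list that would be handed to a slower refinement step.
    """
    scored = [(pair, integrate_clade_scores(s)) for pair, s in candidates.items()]
    kept = [(pair, p) for pair, p in scored if p >= threshold]
    return sorted(kept, key=lambda item: item[1], reverse=True)
```

In this sketch only the pairs returned by `prescreen` would be passed on to an expensive structure-prediction step, which is the source of the computational savings the abstract describes.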
