1.
NAR Genom Bioinform ; 5(4): lqad088, 2023 Dec.
Article in English | MEDLINE | ID: mdl-37850036

ABSTRACT

When splitting biological sequence data for the development and testing of predictive models, it is necessary to avoid too-closely related pairs of sequences ending up in different partitions. If this is ignored, performance of prediction methods will tend to be overestimated. Several algorithms have been proposed for homology reduction, where sequences are removed until no too-closely related pairs remain. We present GraphPart, an algorithm for homology partitioning that divides the data such that closely related sequences always end up in the same partition, while keeping as many sequences as possible in the dataset. Evaluation of GraphPart on Protein, DNA and RNA datasets shows that it is capable of retaining a larger number of sequences per dataset, while providing homology separation on a par with reduction approaches.
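The core constraint described above (closely related sequences must share a partition) can be illustrated with a small sketch. This is not GraphPart's actual procedure, only a simplified illustration of the idea: it assumes the set of too-closely related pairs has already been computed by some alignment tool, groups sequences into connected components with union-find, and then greedily assigns whole components to partitions. The function name `partition_by_homology` and its parameters are hypothetical.

```python
from collections import defaultdict


def partition_by_homology(ids, related_pairs, n_parts=2):
    """Assign sequences to partitions so that no related pair is split.

    ids           -- iterable of sequence identifiers (hypothetical input)
    related_pairs -- iterable of (id_a, id_b) pairs judged too closely related,
                     assumed to come from a separate alignment step
    n_parts       -- number of partitions to produce
    """
    ids = list(ids)
    parent = {i: i for i in ids}

    def find(x):
        # Follow parent links to the component root, shortening the path as we go.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    # Merge the components of every related pair.
    for a, b in related_pairs:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_a] = root_b

    # Collect the members of each connected component.
    clusters = defaultdict(list)
    for i in ids:
        clusters[find(i)].append(i)

    # Greedy balancing: place the largest components first,
    # each into whichever partition is currently smallest.
    parts = [[] for _ in range(n_parts)]
    for members in sorted(clusters.values(), key=len, reverse=True):
        min(parts, key=len).extend(members)
    return parts


if __name__ == "__main__":
    example = partition_by_homology(
        ["a", "b", "c", "d", "e"],
        [("a", "b"), ("b", "c")],
        n_parts=2,
    )
    print(example)
```

This sketch keeps every sequence, so a single very large component can leave the partitions badly unbalanced. The abstract indicates that GraphPart aims to keep as many sequences as possible rather than all of them, which suggests some removal may occur in such cases; that step is not modelled here.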

2.
Nat Biotechnol ; 40(7): 1023-1025, 2022 07.
Article in English | MEDLINE | ID: mdl-34980915

ABSTRACT

Signal peptides (SPs) are short amino acid sequences that control protein secretion and translocation in all living organisms. SPs can be predicted from sequence data, but existing algorithms are unable to detect all known types of SPs. We introduce SignalP 6.0, a machine learning model that detects all five SP types and is applicable to metagenomic data.


Subjects
Language, Protein Sorting Signals, Algorithms, Amino Acid Sequence, Protein Sorting Signals/genetics, Proteins