Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 1 de 1
Filtrar
Mais filtros

Base de dados
Ano de publicação
Tipo de documento
Intervalo de ano de publicação
1.
PLoS Comput Biol ; 16(7): e1008104, 2020 07.
Artigo em Inglês | MEDLINE | ID: mdl-32735589

RESUMO

High levels of heterozygosity present a unique genome assembly challenge and can adversely impact downstream analyses, yet is common in sequencing datasets obtained from non-model organisms. Here we show that by re-assembling a heterozygous dataset with variant parameters and different assembly algorithms, we are able to generate assemblies whose protein annotations are statistically enriched for specific gene ontology categories. While total assembly length was not significantly affected by assembly methodologies tested, the assemblies generated varied widely in fragmentation level and we show local assembly collapse or expansion underlying the enrichment or depletion of specific protein functional groups. We show that these statistically significant deviations in gene ontology groups can occur in seemingly high-quality assemblies, and result from difficult-to-detect local sequence expansion or contractions. Given the unpredictable interplay between assembly algorithm, parameter, and biological sequence data heterozygosity, we highlight the need for better measures of assembly quality than N50 value, including methods for assessing local expansion and collapse.


Assuntos
Mapeamento de Sequências Contíguas , Genoma Helmíntico , Heterozigoto , Anotação de Sequência Molecular/métodos , Nematoides/genética , Membro 1 da Subfamília B de Cassetes de Ligação de ATP/metabolismo , Algoritmos , Animais , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Funções Verossimilhança , Proteoma , Análise de Sequência de DNA
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA