Your browser doesn't support javascript.
loading
Estimating gene gain and loss rates in the presence of error in genome assembly and annotation using CAFE 3.
Han, Mira V; Thomas, Gregg W C; Lugo-Martinez, Jose; Hahn, Matthew W.
Afiliação
  • Han MV; National Evolutionary Synthesis Center, Durham, North Carolina, USA.
Mol Biol Evol ; 30(8): 1987-97, 2013 Aug.
Article em En | MEDLINE | ID: mdl-23709260
ABSTRACT
Current sequencing methods produce large amounts of data, but genome assemblies constructed from these data are often fragmented and incomplete. Incomplete and error-filled assemblies result in many annotation errors, especially in the number of genes present in a genome. This means that methods attempting to estimate rates of gene duplication and loss often will be misled by such errors and that rates of gene family evolution will be consistently overestimated. Here, we present a method that takes these errors into account, allowing one to accurately infer rates of gene gain and loss among genomes even with low assembly and annotation quality. The method is implemented in the newest version of the software package CAFE, along with several other novel features. We demonstrate the accuracy of the method with extensive simulations and reanalyze several previously published data sets. Our results show that errors in genome annotation do lead to higher inferred rates of gene gain and loss but that CAFE 3 sufficiently accounts for these errors to provide accurate estimates of important evolutionary parameters.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Software / Genoma / Análise de Sequência de DNA / Anotação de Sequência Molecular Idioma: En Ano de publicação: 2013 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Software / Genoma / Análise de Sequência de DNA / Anotação de Sequência Molecular Idioma: En Ano de publicação: 2013 Tipo de documento: Article