Your browser doesn't support javascript.
loading
CompositeSearch: A Generalized Network Approach for Composite Gene Families Detection.
Pathmanathan, Jananan Sylvestre; Lopez, Philippe; Lapointe, François-Joseph; Bapteste, Eric.
Afiliação
  • Pathmanathan JS; Institut de Biologie Paris-Seine (IBPS), UPMC Université Paris 06, Sorbonne Universités, Paris, France.
  • Lopez P; Institut de Biologie Paris-Seine (IBPS), UPMC Université Paris 06, Sorbonne Universités, Paris, France.
  • Lapointe FJ; Département de Sciences Biologiques, Université de Montréal, Montréal, QC, Canada.
  • Bapteste E; Institut de Biologie Paris-Seine (IBPS), UPMC Université Paris 06, Sorbonne Universités, Paris, France.
Mol Biol Evol ; 35(1): 252-255, 2018 01 01.
Article em En | MEDLINE | ID: mdl-29092069
Genes evolve by point mutations, but also by shuffling, fusion, and fission of genetic fragments. Therefore, similarity between two sequences can be due to common ancestry producing homology, and/or partial sharing of component fragments. Disentangling these processes is especially challenging in large molecular data sets, because of computational time. In this article, we present CompositeSearch, a memory-efficient, fast, and scalable method to detect composite gene families in large data sets (typically in the range of several million sequences). CompositeSearch generalizes the use of similarity networks to detect composite and component gene families with a greater recall, accuracy, and precision than recent programs (FusedTriplets and MosaicFinder). Moreover, CompositeSearch provides user-friendly quality descriptions regarding the distribution and primary sequence conservation of these gene families allowing critical biological analyses of these data.
Assuntos
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Alinhamento de Sequência / Análise de Sequência de DNA / Biologia Computacional Tipo de estudo: Diagnostic_studies Idioma: En Ano de publicação: 2018 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Alinhamento de Sequência / Análise de Sequência de DNA / Biologia Computacional Tipo de estudo: Diagnostic_studies Idioma: En Ano de publicação: 2018 Tipo de documento: Article