Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 4 de 4
Filtrar
Mais filtros

Base de dados
Tipo de documento
Intervalo de ano de publicação
1.
Mol Biol Evol ; 39(2)2022 02 03.
Artigo em Inglês | MEDLINE | ID: mdl-35021210

RESUMO

Species tree inference from gene family trees is becoming increasingly popular because it can account for discordance between the species tree and the corresponding gene family trees. In particular, methods that can account for multiple-copy gene families exhibit potential to leverage paralogy as informative signal. At present, there does not exist any widely adopted inference method for this purpose. Here, we present SpeciesRax, the first maximum likelihood method that can infer a rooted species tree from a set of gene family trees and can account for gene duplication, loss, and transfer events. By explicitly modeling events by which gene trees can depart from the species tree, SpeciesRax leverages the phylogenetic rooting signal in gene trees. SpeciesRax infers species tree branch lengths in units of expected substitutions per site and branch support values via paralogy-aware quartets extracted from the gene family trees. Using both empirical and simulated data sets we show that SpeciesRax is at least as accurate as the best competing methods while being one order of magnitude faster on large data sets at the same time. We used SpeciesRax to infer a biologically plausible rooted phylogeny of the vertebrates comprising 188 species from 31,612 gene families in 1 h using 40 cores. SpeciesRax is available under GNU GPL at https://github.com/BenoitMorel/GeneRax and on BioConda.


Assuntos
Algoritmos , Duplicação Gênica , Modelos Genéticos , Linhagem , Filogenia
2.
Mol Biol Evol ; 37(9): 2763-2774, 2020 09 01.
Artigo em Inglês | MEDLINE | ID: mdl-32502238

RESUMO

Inferring phylogenetic trees for individual homologous gene families is difficult because alignments are often too short, and thus contain insufficient signal, while substitution models inevitably fail to capture the complexity of the evolutionary processes. To overcome these challenges, species-tree-aware methods also leverage information from a putative species tree. However, only few methods are available that implement a full likelihood framework or account for horizontal gene transfers. Furthermore, these methods often require expensive data preprocessing (e.g., computing bootstrap trees) and rely on approximations and heuristics that limit the degree of tree space exploration. Here, we present GeneRax, the first maximum likelihood species-tree-aware phylogenetic inference software. It simultaneously accounts for substitutions at the sequence level as well as gene level events, such as duplication, transfer, and loss relying on established maximum likelihood optimization algorithms. GeneRax can infer rooted phylogenetic trees for multiple gene families, directly from the per-gene sequence alignments and a rooted, yet undated, species tree. We show that compared with competing tools, on simulated data GeneRax infers trees that are the closest to the true tree in 90% of the simulations in terms of relative Robinson-Foulds distance. On empirical data sets, GeneRax is the fastest among all tested methods when starting from aligned sequences, and it infers trees with the highest likelihood score, based on our model. GeneRax completed tree inferences and reconciliations for 1,099 Cyanobacteria families in 8 min on 512 CPU cores. Thus, its parallelization scheme enables large-scale analyses. GeneRax is available under GNU GPL at https://github.com/BenoitMorel/GeneRax (last accessed June 17, 2020).


Assuntos
Duplicação Gênica , Técnicas Genéticas , Filogenia , Software , Cianobactérias/genética , Deleção de Genes , Transferência Genética Horizontal
3.
Methods Mol Biol ; 2757: 461-490, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38668979

RESUMO

Understanding gene evolution across genomes and organisms, including ctenophores, can provide unexpected biological insights. It enables powerful integrative approaches that leverage sequence diversity to advance biomedicine. Sequencing and bioinformatic tools can be inexpensive and user-friendly, but numerous options and coding can intimidate new users. Distinct challenges exist in working with data from diverse species but may go unrecognized by researchers accustomed to gold-standard genomes. Here, we provide a high-level workflow and detailed pipeline to enable animal collection, single-molecule sequencing, and phylogenomic analysis of gene and species evolution. As a demonstration, we focus on (1) PacBio RNA-seq of the genome-sequenced ctenophore Mnemiopsis leidyi, (2) diversity and evolution of the mechanosensitive ion channel Piezo in genetic models and basal-branching animals, and (3) associated challenges and solutions to working with diverse species and genomes, including gene model updating and repair using single-molecule RNA-seq. We provide a Python Jupyter Notebook version of our pipeline (GitHub Repository: Ctenophore-Ocean-To-Tree-2023 https://github.com/000generic/Ctenophore-Ocean-To-Tree-2023 ) that can be run for free in the Google Colab cloud to replicate our findings or modified for specific or greater use. Our protocol enables users to design new sequencing projects in ctenophores, marine invertebrates, or other novel organisms. It provides a simple, comprehensive platform that can ease new user entry into running their evolutionary sequence analyses.


Assuntos
Ctenóforos , Evolução Molecular , Filogenia , RNA-Seq , Animais , RNA-Seq/métodos , Ctenóforos/genética , Ctenóforos/classificação , Genoma/genética , Biologia Computacional/métodos , Software , Genômica/métodos , Modelos Genéticos
4.
Front Genet ; 13: 1031705, 2022.
Artigo em Inglês | MEDLINE | ID: mdl-36406110

RESUMO

Hybridization is common and considered as an important evolutionary force to increase intraspecific genetic diversity. Detecting hybridization events is crucial for understanding the evolutionary history of species and further improving molecular breeding. The studies on identifying hybridization events through the phylogenomic approach are still limited. We proposed the conception and method of identifying allopolyploidy events by phylogenomics. The reconciliation and summary of nuclear multi-labeled gene family trees were adopted to untangle hybridization events from next-generation data in our novel phylogenomic approach. Given horticulturalists' relatively clear cultivated crossbreeding history, the water lily family is a suitable case for examining recent allopolyploidy events. Here, we reconstructed and confirmed the well-resolved nuclear phylogeny for the Nymphaeales family in the context of geological time as a framework for identifying hybridization signals. We successfully identified two possible allopolyploidy events with the parental lineages for the hybrids in the family Nymphaeaceae based on summarization from multi-labeled gene family trees of Nymphaeales. The lineages where species Nymphaea colorata and Nymphaea caerulea are located may be the progenitors of horticultural cultivated species Nymphaea 'midnight' and Nymphaea 'Woods blue goddess'. The proposed hybridization hypothesis is also supported by horticultural breeding records. Our methodology can be widely applied to identify hybridization events and theoretically facilitate the genome breeding design of hybrid plants.

SELEÇÃO DE REFERÊNCIAS
Detalhe da pesquisa