Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 64
Filtrar
1.
Syst Biol ; 72(3): 723-737, 2023 Jun 17.
Artigo em Inglês | MEDLINE | ID: mdl-35713492

RESUMO

Common molecular phylogenetic characteristics such as long branches and compositional heterogeneity can be problematic for phylogenetic reconstruction when using amino acid data. Recoding alignments to reduced alphabets before phylogenetic analysis has often been used both to explore and potentially decrease the effect of such problems. We tested the effectiveness of this strategy on topological accuracy using simulated data on four-taxon trees. We simulated alignments in phylogenetically challenging ways to test the phylogenetic accuracy of analyses using various recoding strategies together with commonly used homogeneous models. We tested three recoding methods based on amino acid exchangeability, and another recoding method based on lowering the compositional heterogeneity among alignment sequences as measured by the Chi-squared statistic. Our simulation results show that on trees with long branches where sequences approach saturation, accuracy was not greatly affected by exchangeability-based recodings, but Chi-squared-based recoding decreased accuracy. We then simulated sequences with different kinds of compositional heterogeneity over the tree. Recoding often increased accuracy on such alignments. Exchangeability-based recoding was rarely worse than not recoding, and often considerably better. Recoding based on lowering the Chi-squared value improved accuracy in some cases but not in others, suggesting that low compositional heterogeneity by itself is not sufficient to increase accuracy in the analysis of these alignments. We also simulated alignments using site-specific amino acid profiles, making sequences that had compositional heterogeneity over alignment sites. Exchangeability-based recoding coupled with site-homogeneous models had poor accuracy for these data sets but Chi-squared-based recoding on these alignments increased accuracy. We then simulated data sets that were compositionally both site- and tree-heterogeneous, like many real data sets. The effect on the accuracy of recoding such doubly problematic data sets varied widely, depending on the type of compositional tree heterogeneity and on the recoding scheme. Interestingly, analysis of unrecoded compositionally heterogeneous alignments with the NDCH or CAT models was generally more accurate than homogeneous analysis, whether recoded or not. Overall, our results suggest that making trees for recoded amino acid data sets can be useful, but they need to be interpreted cautiously as part of a more comprehensive analysis. The use of better-fitting models like NDCH and CAT, which directly account for the patterns in the data, may offer a more promising long-term solution for analyzing empirical data. [Compositional heterogeneity; models of evolution; phylogenetic methods; recoding amino acid data sets.].


Assuntos
Aminoácidos , Filogenia , Simulação por Computador
2.
Proc Biol Sci ; 290(2010): 20231458, 2023 Nov 08.
Artigo em Inglês | MEDLINE | ID: mdl-37909081

RESUMO

Parental care is considered crucial for the enhanced survival of offspring and evolutionary success of many metazoan groups. Most bryozoans incubate their young in brood chambers or intracoelomically. Based on the drastic morphological differences in incubation chambers across members of the order Cheilostomatida (class Gymnolaemata), multiple origins of incubation were predicted in this group. This hypothesis was tested by constructing a molecular phylogeny based on mitogenome data and nuclear rRNA genes 18S and 28S with the most complete sampling of taxa with various incubation devices to date. Ancestral character estimation suggested that distinct types of brood chambers evolved at least 10 times in Cheilostomatida. In Eucratea loricata and Aetea spp. brooding evolved unambiguously from a zygote-spawning ancestral state, as it probably did in Tendra zostericola, Neocheilostomata, and 'Carbasea' indivisa. In two further instances, brooders with different incubation chamber types, skeletal and non-skeletal, formed clades (Scruparia spp., Leiosalpinx australis) and (Catenicula corbulifera (Steginoporella spp. (Labioporella spp., Thalamoporella californica))), each also probably evolved from a zygote-spawning ancestral state. The modular nature of bryozoans probably contributed to the evolution of such a diverse array of embryonic incubation chambers, which included complex constructions made of polymorphic heterozooids, and maternal zooidal invaginations and outgrowths.


Assuntos
Briozoários , Invertebrados , Animais , Filogenia , Reprodução/genética
3.
Proc Natl Acad Sci U S A ; 116(12): 5613-5622, 2019 03 19.
Artigo em Inglês | MEDLINE | ID: mdl-30842288

RESUMO

Many microbes acquire metabolites in a "feeding" process where complex polymers are broken down in the environment to their subunits. The subsequent uptake of soluble metabolites by a cell, sometimes called osmotrophy, is facilitated by transporter proteins. As such, the diversification of osmotrophic microorganisms is closely tied to the diversification of transporter functions. Horizontal gene transfer (HGT) has been suggested to produce genetic variation that can lead to adaptation, allowing lineages to acquire traits and expand niche ranges. Transporter genes often encode single-gene phenotypes and tend to have low protein-protein interaction complexity and, as such, are potential candidates for HGT. Here we test the idea that HGT has underpinned the expansion of metabolic potential and substrate utilization via transfer of transporter-encoding genes. Using phylogenomics, we identify seven cases of transporter-gene HGT between fungal phyla, and investigate compatibility, localization, function, and fitness consequences when these genes are expressed in Saccharomyces cerevisiae Using this approach, we demonstrate that the transporters identified can alter how fungi utilize a range of metabolites, including peptides, polyols, and sugars. We then show, for one model gene, that transporter gene acquisition by HGT can significantly alter the fitness landscape of S. cerevisiae We therefore provide evidence that transporter HGT occurs between fungi, alters how fungi can acquire metabolites, and can drive gain in fitness. We propose a "transporter-gene acquisition ratchet," where transporter repertoires are continually augmented by duplication, HGT, and differential loss, collectively acting to overwrite, fine-tune, and diversify the complement of transporters present in a genome.


Assuntos
Transferência Genética Horizontal/genética , Aptidão Genética/genética , Saccharomyces cerevisiae/genética , Evolução Biológica , Evolução Molecular , Fungos/genética , Genoma , Proteínas de Membrana Transportadoras/genética , Fenótipo , Filogenia , Proteínas de Saccharomyces cerevisiae/genética
4.
Proc Natl Acad Sci U S A ; 114(23): E4602-E4611, 2017 06 06.
Artigo em Inglês | MEDLINE | ID: mdl-28533395

RESUMO

A root for the archaeal tree is essential for reconstructing the metabolism and ecology of early cells and for testing hypotheses that propose that the eukaryotic nuclear lineage originated from within the Archaea; however, published studies based on outgroup rooting disagree regarding the position of the archaeal root. Here we constructed a consensus unrooted archaeal topology using protein concatenation and a multigene supertree method based on 3,242 single gene trees, and then rooted this tree using a recently developed model of genome evolution. This model uses evidence from gene duplications, horizontal transfers, and gene losses contained in 31,236 archaeal gene families to identify the most likely root for the tree. Our analyses support the monophyly of DPANN (Diapherotrites, Parvarchaeota, Aenigmarchaeota, Nanoarchaeota, Nanohaloarchaea), a recently discovered cosmopolitan and genetically diverse lineage, and, in contrast to previous work, place the tree root between DPANN and all other Archaea. The sister group to DPANN comprises the Euryarchaeota and the TACK Archaea, including Lokiarchaeum, which our analyses suggest are monophyletic sister lineages. Metabolic reconstructions on the rooted tree suggest that early Archaea were anaerobes that may have had the ability to reduce CO2 to acetate via the Wood-Ljungdahl pathway. In contrast to proposals suggesting that genome reduction has been the predominant mode of archaeal evolution, our analyses infer a relatively small-genomed archaeal ancestor that subsequently increased in complexity via gene duplication and horizontal gene transfer.


Assuntos
Archaea/genética , Evolução Molecular , Genoma Arqueal , Modelos Genéticos , Algoritmos , Archaea/classificação , Archaea/metabolismo , Eucariotos/classificação , Eucariotos/genética , Duplicação Gênica , Transferência Genética Horizontal , Redes e Vias Metabólicas/genética , Família Multigênica , Filogenia , Temperatura
5.
New Phytol ; 222(1): 565-575, 2019 04.
Artigo em Inglês | MEDLINE | ID: mdl-30411803

RESUMO

Unraveling the phylogenetic relationships between the four major lineages of terrestrial plants (mosses, liverworts, hornworts, and vascular plants) is essential for an understanding of the evolution of traits specific to land plants, such as their complex life cycles, and the evolutionary development of stomata and vascular tissue. Well supported phylogenetic hypotheses resulting from different data and methods are often incongruent due to processes of nucleotide evolution that are difficult to model, for example substitutional saturation and composition heterogeneity. We reanalysed a large published dataset of nuclear data and modelled these processes using degenerate-codon recoding and tree-heterogeneous composition substitution models. Our analyses resolved bryophytes as a monophyletic group and showed that the nonnonmonophyly of the clade that is supported by the analysis of nuclear nucleotide data is due solely to fast-evolving synonymous substitutions. The current congruence among phylogenies of both nuclear and chloroplast analyses lent considerable support to the conclusion that the bryophytes are a monophyletic group. An initial split between bryophytes and vascular plants implies that the bryophyte life cycle (with a dominant gametophyte nurturing an unbranched sporophyte) may not be ancestral to all land plants and that stomata are likely to be a symplesiomorphy among embryophytes.


Assuntos
Briófitas/metabolismo , Proteínas Nucleares/metabolismo , Filogenia , Aminoácidos/genética , Nucleotídeos/genética
6.
Nature ; 504(7479): 231-6, 2013 Dec 12.
Artigo em Inglês | MEDLINE | ID: mdl-24336283

RESUMO

The discovery of the Archaea and the proposal of the three-domains 'universal' tree, based on ribosomal RNA and core genes mainly involved in protein translation, catalysed new ideas for cellular evolution and eukaryotic origins. However, accumulating evidence suggests that the three-domains tree may be incorrect: evolutionary trees made using newer methods place eukaryotic core genes within the Archaea, supporting hypotheses in which an archaeon participated in eukaryotic origins by founding the host lineage for the mitochondrial endosymbiont. These results provide support for only two primary domains of life--Archaea and Bacteria--because eukaryotes arose through partnership between them.


Assuntos
Archaea/classificação , Eucariotos/classificação , Modelos Biológicos , Filogenia , Archaea/citologia , Archaea/genética , Bactérias/classificação , Bactérias/genética , Membrana Celular/metabolismo , Eucariotos/citologia , Eucariotos/genética , Mitocôndrias/genética , RNA Ribossômico/genética , Simbiose
7.
BMC Genomics ; 16: 831, 2015 Oct 21.
Artigo em Inglês | MEDLINE | ID: mdl-26489754

RESUMO

BACKGROUND: The Coronator Group currently encompasses six morphologically similar species (Culex camposi Dyar, Culex coronator Dyar and Knab, Culex covagarciai Forattini, Culex usquatus Dyar, Culex usquatissimus Dyar, and Culex ousqua Dyar). Culex coronator has been incriminated as a potential vector of West Nile Virus (WNV), Saint Louis Encephalitis Virus (SLEV), and Venezuelan Equine Encephalitis Virus (VEEV). The complete mitochondrial genome of Cx. coronator, Cx. usquatus, Cx.usquatissimus, and Cx. camposi was sequenced, annotated, and analyzed to provide genetic information about these species. RESULTS: The mitochondrial genomes of Cx. coronator, Cx. usquatus, Cx.usquatissimus, and Cx. camposi varied from 15,573 base pairs in Cx. usquatus to 15,576 in Cx. coronator. They contained 37 genes (13 protein-encoding genes, 2 rRNA genes, and 22 tRNA genes) and the AT-rich control region. Comparative analyses of the 37 genes demonstrated the mitochondrial genomes to be composed of variable and conserved genes. Despite the small size, the ATP8, ATP6 plus NADH5 protein-encoding genes were polymorphic, whereas tRNAs and rRNAs were conserved. The control region contained some poly-T stretch. The Bayesian phylogenetic tree corroborated that both the Coronator Group and the Culex pipens complex are monophyletic taxa. CONCLUSIONS: The mitochondrial genomes of Cx. coronator, Cx. usquatus, Cx. usquatissimus and Cx. camposi share the same gene composition and arrangement features that match to those reported for most Culicidae species. They are composed of the same 37 genes and the AT-rich control region, which contains poly-T stretches that may be involved in the functional role of the mitochondrial genome. Taken together, results of the dN/dS ratios, the sliding window analyses and the Bayesian phylogenetic analyses suggest that ATP6, ATP8 and NADH5 are promising genes to be employed in phylogenetic studies involving species of the Coronator Group, and probably other species groups of the subgenus Culex. Bayesian topology corroborated the morphological hypothesis of the Coronator Group as monophyletic lineage within the subgenus Culex.


Assuntos
Culex/genética , Genoma de Inseto , Genoma Mitocondrial , Animais , Composição de Bases , Brasil , Códon , Biologia Computacional , Culex/classificação , Genes de Insetos , Genes Mitocondriais , Genômica/métodos , Sequenciamento de Nucleotídeos em Larga Escala , Insetos Vetores , Anotação de Sequência Molecular , Fases de Leitura Aberta , Filogenia
8.
Mol Biol Evol ; 31(7): 1697-709, 2014 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-24795089

RESUMO

Archaeplastida (=Kingdom Plantae) are primary plastid-bearing organisms that evolved via the endosymbiotic association of a heterotrophic eukaryote host cell and a cyanobacterial endosymbiont approximately 1,400 Ma. Here, we present analyses of cyanobacterial and plastid genomes that show strongly conflicting phylogenies based on 75 plastid (or nuclear plastid-targeted) protein-coding genes and their direct translations to proteins. The conflict between genes and proteins is largely robust to the use of sophisticated data- and tree-heterogeneous composition models. However, by using nucleotide ambiguity codes to eliminate synonymous substitutions due to codon-degeneracy, we identify a composition bias, and dependent codon-usage bias, resulting from synonymous substitutions at all third codon positions and first codon positions of leucine and arginine, as the main cause for the conflicting phylogenetic signals. We argue that the protein-coding gene data analyses are likely misleading due to artifacts induced by convergent composition biases at first codon positions of leucine and arginine and at all third codon positions. Our analyses corroborate previous studies based on gene sequence analysis that suggest Cyanobacteria evolved by the early paraphyletic splitting of Gloeobacter and a specific Synechococcus strain (JA33Ab), with all other remaining cyanobacterial groups, including both unicellular and filamentous species, forming the sister-group to the Archaeplastida lineage. In addition, our analyses using better-fitting models suggest (but without statistically strong support) an early divergence of Glaucophyta within Archaeplastida, with the Rhodophyta (red algae), and Viridiplantae (green algae and land plants) forming a separate lineage.


Assuntos
Cianobactérias/classificação , Cianobactérias/genética , Plastídeos/genética , Aminoácidos/genética , Viés , Códon/genética , Evolução Molecular , Genoma Bacteriano , Genomas de Plastídeos , Filogenia
9.
Mol Biol Evol ; 30(9): 2145-56, 2013 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-23813979

RESUMO

Heterogeneity among life traits in mammals has resulted in considerable phylogenetic conflict, particularly concerning the position of the placental root. Layered upon this are gene- and lineage-specific variation in amino acid substitution rates and compositional biases. Life trait variations that may impact upon mutational rates are longevity, metabolic rate, body size, and germ line generation time. Over the past 12 years, three main conflicting hypotheses have emerged for the placement of the placental root. These hypotheses place the Atlantogenata (common ancestor of Xenarthra plus Afrotheria), the Afrotheria, or the Xenarthra as the sister group to all other placental mammals. Model adequacy is critical for accurate tree reconstruction and by failing to account for these compositional and character exchange heterogeneities across the tree and data set, previous studies have not provided a strongly supported hypothesis for the placental root. For the first time, models that accommodate both tree and data set heterogeneity have been applied to mammal data. Here, we show the impact of accurate model assignment and the importance of data sets in accommodating model parameters while maintaining the power to reject competing hypotheses. Through these sophisticated methods, we demonstrate the importance of model adequacy, data set power and provide strong support for the Atlantogenata over other competing hypotheses for the position of the placental root.


Assuntos
Evolução Molecular , Mamíferos/classificação , Modelos Genéticos , Filogenia , Substituição de Aminoácidos , Animais , Feminino , Heterogeneidade Genética , Mamíferos/genética , Taxa de Mutação , Placenta/fisiologia , Gravidez , Recombinação Genética
10.
Proc Natl Acad Sci U S A ; 108(37): 15258-63, 2011 Sep 13.
Artigo em Inglês | MEDLINE | ID: mdl-21878562

RESUMO

Horizontal gene transfer (HGT) can radically alter the genomes of microorganisms, providing the capacity to adapt to new lifestyles, environments, and hosts. However, the extent of HGT between eukaryotes is unclear. Using whole-genome, gene-by-gene phylogenetic analysis we demonstrate an extensive pattern of cross-kingdom HGT between fungi and oomycetes. Comparative genomics, including the de novo genome sequence of Hyphochytrium catenoides, a free-living sister of the oomycetes, shows that these transfers largely converge within the radiation of oomycetes that colonize plant tissues. The repertoire of HGTs includes a large number of putatively secreted proteins; for example, 7.6% of the secreted proteome of the sudden oak death parasite Phytophthora ramorum has been acquired from fungi by HGT. Transfers include gene products with the capacity to break down plant cell walls and acquire sugars, nucleic acids, nitrogen, and phosphate sources from the environment. Predicted HGTs also include proteins implicated in resisting plant defense mechanisms and effector proteins for attacking plant cells. These data are consistent with the hypothesis that some oomycetes became successful plant parasites by multiple acquisitions of genes from fungi.


Assuntos
Evolução Biológica , Transferência Genética Horizontal/genética , Interações Hospedeiro-Parasita/genética , Phytophthora/genética , Plantas/parasitologia , Fungos/genética , Filogenia , Proteoma/metabolismo
11.
Mol Phylogenet Evol ; 68(2): 218-20, 2013 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-23567023

RESUMO

We introduce a new empirical amino-acid substitution model intended for use with for green plant (Viridiplantae) chloroplast protein data, which we call gcpREV (green chloroplast reversible model). Relative exchange rates and base composition frequencies among amino-acids were calculated using a Markov-chain Monte Carlo analysis on a combined data set of 34 proteins sampled among 27 taxa of green plants. The gcpREV model is a better fit than the commonly-used cpREV model to five previously published chloroplast protein test data sets of green plants, but is not a better fit to test data consisting mostly of non-green plant chloroplasts. Consequently, our analyses suggest a degree of specificity of the new model to green plant chloroplast data. The new model is implemented in the software P4, and model description files are available for other popular phylogenetic analysis software.


Assuntos
Proteínas de Cloroplastos/genética , Modelos Genéticos , Filogenia , Plantas/genética , Sequência de Aminoácidos/genética , Substituição de Aminoácidos , Evolução Molecular , Funções Verossimilhança , Cadeias de Markov , Método de Monte Carlo , Homologia de Sequência de Aminoácidos , Software
12.
PeerJ ; 11: e15716, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-37576497

RESUMO

Calculating amino-acid substitution models that are specific for individual protein data sets is often difficult due to the computational burden of estimating large numbers of rate parameters. In this study, we tested the computational efficiency and accuracy of five methods used to estimate substitution models, namely Codeml, FastMG, IQ-TREE, P4 (maximum likelihood), and P4 (Bayesian inference). Data-specific substitution models were estimated from simulated alignments (with different lengths) that were generated from a known simulation model and simulation tree. Each of the resulting data-specific substitution models was used to calculate the maximum likelihood score of the simulation tree and simulated data that was used to calculate the model, and compared with the maximum likelihood scores of the known simulation model and simulation tree on the same simulated data. Additionally, the commonly-used empirical models, cpREV and WAG, were assessed similarly. Data-specific models performed better than the empirical models, which under-fitted the simulated alignments, had the highest difference to the simulation model maximum-likelihood score, clustered further from the simulation model in principal component analysis ordination, and inferred less accurate trees. Data-specific models and the simulation model shared statistically indistinguishable maximum-likelihood scores, indicating that the five methods were reasonably accurate at estimating substitution models by this measure. Nevertheless, tree statistics showed differences between optimal maximum likelihood trees. Unlike other model estimating methods, trees inferred using data-specific models generated with IQ-TREE and P4 (maximum likelihood) were not significantly different from the trees derived from the simulation model in each analysis, indicating that these two methods alone were the most accurate at estimating data-specific models. To show the benefits of using data-specific protein models several published data sets were reanalysed using IQ-TREE-estimated models. These newly estimated models were a better fit to the data than the empirical models that were used by the original authors, often inferred longer trees, and resulted in different tree topologies in more than half of the re-analysed data sets. The results of this study show that software availability and high computation burden are not limitations to generating better-fitting data-specific amino-acid substitution models for phylogenetic analyses.


Assuntos
Classificação , Modelos Genéticos , Filogenia , Proteínas , Substituição de Aminoácidos , Teorema de Bayes , Simulação por Computador , Proteínas/genética , Classificação/métodos
13.
Front Microbiol ; 14: 1242275, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-37808290

RESUMO

Introduction: The protozoan parasite Trichomonas vaginalis is the most common cellular sexually transmitted disease in humans, and the closely related species Trichomonas gallinae is an avian parasite of ecological and economic importance. Phylogenetic evidence suggests T. vaginalis arose during bird to human transmission of a T. gallinae-like ancestor. Trichomonas vaginalis shares a strong clinical association with the independent sexually transmitted pathogen Metamycoplasma (formerly Mycoplasma) hominis, and the uncultured bacterium "Candidatus Malacoplasma (formerly Mycoplasma) girerdii," with the latter association being an order of magnitude stronger. Both bacterial species have been shown to profoundly influence T. vaginalis growth, energy production and virulence-associated mechanisms. Methods: Evidence for a novel Malacoplasma sp. was discovered by in vivo Illumina metatranscriptomics sequencing of the T. gallinae-infected pigeon mouth. We leveraged published 16S rDNA profiling data from digestive tract of 12 healthy and 24 T. gallinae-infected pigeons to investigate association between the novel Malacoplasma sp. and T. gallinae. We utilised Illumina metagenomics sequencing targeted to pigeon oral and crop samples infected with the novel Malacoplasma sp. to generate its full-length genome sequence. Sequence similarity network analysis was used to compare annotated proteins from the novel Malacoplasma sp. with a range of other related species. Results: Here we present evidence for a novel Malacoplasma species, related to "Ca. M. girerdii," that is strongly associated with T. gallinae in the upper digestive tract of domestic pigeons. Analysis of the genome sequence revealed gene features apparently specific to a Trichomonas-symbiotic Malacoplasma lineage. Discussion: These data support a model of long-term association between Trichomonas and Malacoplasma spp. that has been conserved across diversification of the Trichomonas lineage and the host species barrier from birds to human.

14.
BMC Genomics ; 13: 496, 2012 Sep 19.
Artigo em Inglês | MEDLINE | ID: mdl-22992290

RESUMO

BACKGROUND: Mitochondrial genomes comprise a small but critical component of the total DNA in eukaryotic organisms. They encode several key proteins for the cell's major energy producing apparatus, the mitochondrial respiratory chain. Additionally, their nucleotide and amino acid sequences are of great utility as markers for systematics, molecular ecology and forensics. Their characterization through nucleotide sequencing is a fundamental starting point in mitogenomics. Methods to amplify complete mitochondrial genomes rapidly and efficiently from microgram quantities of tissue of single individuals are, however, not always available. Here we validate two approaches, which combine long-PCR with Roche 454 pyrosequencing technology, to obtain two complete mitochondrial genomes from individual amphibian species. RESULTS: We obtained two new xenopus frogs (Xenopus borealis and X. victorianus) complete mitochondrial genome sequences by means of long-PCR followed by 454 of individual genomes (approach 1) or of multiple pooled genomes (approach 2), the mean depth of coverage per nucleotide was 9823 and 186, respectively. We also characterised and compared the new mitogenomes against their sister taxa; X. laevis and Silurana tropicalis, two of the most intensely studied amphibians. Our results demonstrate how our approaches can be used to obtain complete amphibian mitogenomes with depths of coverage that far surpass traditional primer-walking strategies, at either the same cost or less. Our results also demonstrate: that the size, gene content and order are the same among xenopus mitogenomes and that S. tropicalis form a separate clade to the other xenopus, among which X. laevis and X. victorianus were most closely related. Nucleotide and amino acid diversity was found to vary across the xenopus mitogenomes, with the greatest diversity observed in the Complex 1 gene nad4l and the least diversity observed in Complex 4 genes (cox1-3). All protein-coding genes were shown to be under strong negative (purifying selection), with genes under the strongest pressure (Complex 4) also being the most highly expressed, highlighting their potentially crucial functions in the mitochondrial respiratory chain. CONCLUSIONS: Next generation sequencing of long-PCR amplicons using single taxon or multi-taxon approaches enabled two new species of Xenopus mtDNA to be fully characterized. We anticipate our complete mitochondrial genome amplification methods to be applicable to other amphibians, helpful for identifying the most appropriate markers for differentiating species, populations and resolving phylogenies, a pressing need since amphibians are undergoing drastic global decline. Our mtDNAs also provide templates for conserved primer design and the assembly of RNA and DNA reads following high throughput "omic" techniques such as RNA- and ChIP-seq. These could help us better understand how processes such mitochondrial replication and gene expression influence xenopus growth and development, as well as how they evolved and are regulated.


Assuntos
Variação Genética , Genoma Mitocondrial/genética , Filogenia , Seleção Genética , Xenopus/genética , Animais , Sequência de Bases , Teorema de Bayes , Etiquetas de Sequências Expressas , Marcadores Genéticos/genética , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Modelos Genéticos , Anotação de Sequência Molecular , Dados de Sequência Molecular , Especificidade da Espécie
15.
Proc Biol Sci ; 279(1749): 4870-9, 2012 Dec 22.
Artigo em Inglês | MEDLINE | ID: mdl-23097517

RESUMO

Determining the relationships among the major groups of cellular life is important for understanding the evolution of biological diversity, but is difficult given the enormous time spans involved. In the textbook 'three domains' tree based on informational genes, eukaryotes and Archaea share a common ancestor to the exclusion of Bacteria. However, some phylogenetic analyses of the same data have placed eukaryotes within the Archaea, as the nearest relatives of different archaeal lineages. We compared the support for these competing hypotheses using sophisticated phylogenetic methods and an improved sampling of archaeal biodiversity. We also employed both new and existing tests of phylogenetic congruence to explore the level of uncertainty and conflict in the data. Our analyses suggested that much of the observed incongruence is weakly supported or associated with poorly fitting evolutionary models. All of our phylogenetic analyses, whether on small subunit and large subunit ribosomal RNA or concatenated protein-coding genes, recovered a monophyletic group containing eukaryotes and the TACK archaeal superphylum comprising the Thaumarchaeota, Aigarchaeota, Crenarchaeota and Korarchaeota. Hence, while our results provide no support for the iconic three-domain tree of life, they are consistent with an extended eocyte hypothesis whereby vital components of the eukaryotic nuclear lineage originated from within the archaeal radiation.


Assuntos
Archaea/classificação , Archaea/genética , Eucariotos/classificação , Eucariotos/genética , Evolução Molecular , Genes de RNAr , Filogenia , Proteínas/genética , Análise de Sequência de RNA , Homologia de Sequência
16.
Methods Mol Biol ; 2569: 119-135, 2022.
Artigo em Inglês | MEDLINE | ID: mdl-36083446

RESUMO

Molecular sequences in a phylogenetic analysis can differ in composition, and that shows that the process of evolution can change over time. However, models of evolution in common use are homogeneous over the tree, and if used in a phylogenetic analysis with compositionally tree-heterogeneous datasets these models can recover incorrect trees. The NDCH or Node-Discrete Compositional Heterogeneity model is able to model such data by accommodating differences in composition over the tree. Usage, problems, and limitations of this model are discussed, and a modification, the NDCH2 model, is described that can ameliorate some of these problems and limitations. Using these models can greatly increase the fit of the model to the data and can find better tree topologies. These models and various statistical tests are illustrated using a bacterial SSU rRNA dataset. These models are implemented in the software P4, and files for the analyses described here are made available.


Assuntos
Evolução Molecular , Modelos Genéticos , Teorema de Bayes , Filogenia
17.
Nature ; 433(7028): 865-8, 2005 Feb 24.
Artigo em Inglês | MEDLINE | ID: mdl-15729342

RESUMO

Entamoeba histolytica is an intestinal parasite and the causative agent of amoebiasis, which is a significant source of morbidity and mortality in developing countries. Here we present the genome of E. histolytica, which reveals a variety of metabolic adaptations shared with two other amitochondrial protist pathogens: Giardia lamblia and Trichomonas vaginalis. These adaptations include reduction or elimination of most mitochondrial metabolic pathways and the use of oxidative stress enzymes generally associated with anaerobic prokaryotes. Phylogenomic analysis identifies evidence for lateral gene transfer of bacterial genes into the E. histolytica genome, the effects of which centre on expanding aspects of E. histolytica's metabolic repertoire. The presence of these genes and the potential for novel metabolic pathways in E. histolytica may allow for the development of new chemotherapeutic agents. The genome encodes a large number of novel receptor kinases and contains expansions of a variety of gene families, including those associated with virulence. Additional genome features include an abundance of tandemly repeated transfer-RNA-containing arrays, which may have a structural function in the genome. Analysis of the genome provides new insights into the workings and genome evolution of a major human pathogen.


Assuntos
Entamoeba histolytica/genética , Genoma de Protozoário , Parasitos/genética , Animais , Entamoeba histolytica/metabolismo , Entamoeba histolytica/patogenicidade , Evolução Molecular , Fermentação , Transferência Genética Horizontal/genética , Glicólise , Estresse Oxidativo/genética , Parasitos/metabolismo , Parasitos/patogenicidade , Filogenia , Transdução de Sinais , Virulência/genética
18.
Proc Natl Acad Sci U S A ; 105(51): 20356-61, 2008 Dec 23.
Artigo em Inglês | MEDLINE | ID: mdl-19073919

RESUMO

The origin of the eukaryotic genetic apparatus is thought to be central to understanding the evolution of the eukaryotic cell. Disagreement about the source of the relevant genes has spawned competing hypotheses for the origins of the eukaryote nuclear lineage. The iconic rooted 3-domains tree of life shows eukaryotes and archaebacteria as separate groups that share a common ancestor to the exclusion of eubacteria. By contrast, the eocyte hypothesis has eukaryotes originating within the archaebacteria and sharing a common ancestor with a particular group called the Crenarchaeota or eocytes. Here, we have investigated the relative support for each hypothesis from analysis of 53 genes spanning the 3 domains, including essential components of the eukaryotic nucleic acid replication, transcription, and translation apparatus. As an important component of our analysis, we investigated the fit between model and data with respect to composition. Compositional heterogeneity is a pervasive problem for reconstruction of ancient relationships, which, if ignored, can produce an incorrect tree with strong support. To mitigate its effects, we used phylogenetic models that allow for changing nucleotide or amino acid compositions over the tree and data. Our analyses favor a topology that supports the eocyte hypothesis rather than archaebacterial monophyly and the 3-domains tree of life.


Assuntos
Archaea , Evolução Biológica , Crenarchaeota/genética , Células Eucarióticas , Modelos Genéticos , Filogenia , Genes Arqueais/genética , Genes Bacterianos/genética , Genômica , Ácidos Nucleicos/genética , Biossíntese de Proteínas/genética , Transcrição Gênica/genética
19.
Genome Biol Evol ; 13(5)2021 05 07.
Artigo em Inglês | MEDLINE | ID: mdl-33772552

RESUMO

There is an expectation that analyses of molecular sequences might be able to distinguish between alternative hypotheses for ancient relationships, but the phylogenetic methods used and types of data analyzed are of critical importance in any attempt to recover historical signal. Here, we discuss some common issues that can influence the topology of trees obtained when using overly simple models to analyze molecular data that often display complicated patterns of sequence heterogeneity. To illustrate our discussion, we have used three examples of inferred relationships which have changed radically as models and methods of analysis have improved. In two of these examples, the sister-group relationship between thermophilic Thermus and mesophilic Deinococcus, and the position of long-branch Microsporidia among eukaryotes, we show that recovering what is now generally considered to be the correct tree is critically dependent on the fit between model and data. In the third example, the position of eukaryotes in the tree of life, the hypothesis that is currently supported by the best available methods is fundamentally different from the classical view of relationships between major cellular domains. Since heterogeneity appears to be pervasive and varied among all molecular sequence data, and even the best available models can still struggle to deal with some problems, the issues we discuss are generally relevant to phylogenetic analyses. It remains essential to maintain a critical attitude to all trees as hypotheses of relationship that may change with more data and better methods.


Assuntos
Evolução Biológica , Modelos Genéticos , Filogenia , Deinococcus/classificação , Microsporídios/classificação , Thermus/classificação
20.
J Med Virol ; 82(1): 175-85, 2010 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-19950229

RESUMO

Nucleotide sequences of two regions of the genomes of 11 yellow fever virus (YFV) samples isolated from monkeys or humans with symptomatic yellow fever (YF) in Brazil in 2000, 2004, and 2008 were determined with the objective of establishing the genotypes and studying the genetic variation. Results of the Bayesian phylogenetic analysis showed that sequences generated from strains from 2004 and 2008 formed a new subclade within the clade 1 of the South American genotype I. The new subgroup is here designated as 1E. Sequences of YFV strains recovered in 2000 belong to the subclade 1D, which comprises previously characterized YFV strains from Brazil. Molecular dating analyses suggested that the new subclade 1E started diversifying from 1D about 1975 and that the most recent 2004-2008 isolates arose about 1985.


Assuntos
Variação Genética , Doenças dos Macacos/epidemiologia , Filogenia , Febre Amarela/epidemiologia , Vírus da Febre Amarela , Regiões 3' não Traduzidas/genética , Animais , Teorema de Bayes , Brasil/epidemiologia , Evolução Molecular , Genótipo , Humanos , Dados de Sequência Molecular , Doenças dos Macacos/virologia , Análise de Sequência de DNA , América do Sul , Proteínas do Envelope Viral , Febre Amarela/veterinária , Febre Amarela/virologia , Vírus da Febre Amarela/classificação , Vírus da Febre Amarela/genética , Vírus da Febre Amarela/isolamento & purificação
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA