Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 33
Filtrar
1.
PLoS Biol ; 22(7): e3002696, 2024 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-38959200

RESUMEN

Sterile alpha motif domain-containing proteins 9 and 9-like (SAMD9/9L) are associated with life-threatening genetic diseases in humans and are restriction factors of poxviruses. Yet, their cellular function and the extent of their antiviral role are poorly known. Here, we found that interferon-stimulated human SAMD9L restricts HIV-1 in the late phases of replication, at the posttranscriptional and prematuration steps, impacting viral translation and, possibly, endosomal trafficking. Surprisingly, the paralog SAMD9 exerted an opposite effect, enhancing HIV-1. More broadly, we showed that SAMD9L restricts primate lentiviruses, but not a gammaretrovirus (MLV), nor 2 RNA viruses (arenavirus MOPV and rhabdovirus VSV). Using structural modeling and mutagenesis of SAMD9L, we identified a conserved Schlafen-like active site necessary for HIV-1 restriction by human and a rodent SAMD9L. By testing a gain-of-function constitutively active variant from patients with SAMD9L-associated autoinflammatory disease, we determined that SAMD9L pathogenic functions also depend on the Schlafen-like active site. Finally, we found that the constitutively active SAMD9L strongly inhibited HIV, MLV, and, to a lesser extent, MOPV. This suggests that the virus-specific effect of SAMD9L may involve its differential activation/sensing and the virus ability to evade from SAMD9L restriction. Overall, our study identifies SAMD9L as an HIV-1 antiviral factor from the cell autonomous immunity and deciphers host determinants underlying the translational repression. This provides novel links and therapeutic avenues against viral infections and genetic diseases.


Asunto(s)
VIH-1 , Lentivirus de los Primates , Replicación Viral , Humanos , VIH-1/genética , VIH-1/fisiología , Animales , Lentivirus de los Primates/genética , Lentivirus de los Primates/metabolismo , Células HEK293 , Biosíntesis de Proteínas , Factores de Restricción Antivirales , Péptidos y Proteínas de Señalización Intracelular/metabolismo , Péptidos y Proteínas de Señalización Intracelular/genética , Infecciones por VIH/virología , Infecciones por VIH/tratamiento farmacológico , Proteínas Supresoras de Tumor
2.
Proc Natl Acad Sci U S A ; 119(35): e2206610119, 2022 08 30.
Artículo en Inglés | MEDLINE | ID: mdl-35947637

RESUMEN

The coronavirus disease 19 (COVID-19) pandemic is caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), a coronavirus that spilled over from the bat reservoir. Despite numerous clinical trials and vaccines, the burden remains immense, and the host determinants of SARS-CoV-2 susceptibility and COVID-19 severity remain largely unknown. Signatures of positive selection detected by comparative functional genetic analyses in primate and bat genomes can uncover important and specific adaptations that occurred at virus-host interfaces. We performed high-throughput evolutionary analyses of 334 SARS-CoV-2-interacting proteins to identify SARS-CoV adaptive loci and uncover functional differences between modern humans, primates, and bats. Using DGINN (Detection of Genetic INNovation), we identified 38 bat and 81 primate proteins with marks of positive selection. Seventeen genes, including the ACE2 receptor, present adaptive marks in both mammalian orders, suggesting common virus-host interfaces and past epidemics of coronaviruses shaping their genomes. Yet, 84 genes presented distinct adaptations in bats and primates. Notably, residues involved in ubiquitination and phosphorylation of the inflammatory RIPK1 have rapidly evolved in bats but not primates, suggesting different inflammation regulation versus humans. Furthermore, we discovered residues with typical virus-host arms race marks in primates, such as in the entry factor TMPRSS2 or the autophagy adaptor FYCO1, pointing to host-specific in vivo interfaces that may be drug targets. Finally, we found that FYCO1 sites under adaptation in primates are those associated with severe COVID-19, supporting their importance in pathogenesis and replication. Overall, we identified adaptations involved in SARS-CoV-2 infection in bats and primates, enlightening modern genetic determinants of virus susceptibility and severity.


Asunto(s)
COVID-19 , Quirópteros , Evolución Molecular , Adaptación al Huésped , Primates , SARS-CoV-2 , Glicoproteína de la Espiga del Coronavirus , Animales , COVID-19/genética , Quirópteros/virología , Predisposición Genética a la Enfermedad , Adaptación al Huésped/genética , Humanos , Pandemias , Primates/genética , Primates/virología , SARS-CoV-2/genética , Selección Genética , Glicoproteína de la Espiga del Coronavirus/genética
3.
Syst Biol ; 70(3): 608-622, 2021 04 15.
Artículo en Inglés | MEDLINE | ID: mdl-33252676

RESUMEN

Detecting the signature of selection in coding sequences and associating it with shifts in phenotypic states can unveil genes underlying complex traits. Of the various signatures of selection exhibited at the molecular level, changes in the pattern of selection at protein-coding genes have been of main interest. To this end, phylogenetic branch-site codon models are routinely applied to detect changes in selective patterns along specific branches of the phylogeny. Many of these methods rely on a prespecified partition of the phylogeny to branch categories, thus treating the course of trait evolution as fully resolved and assuming that phenotypic transitions have occurred only at speciation events. Here, we present TraitRELAX, a new phylogenetic model that alleviates these strong assumptions by explicitly accounting for the uncertainty in the evolution of both trait and coding sequences. This joint statistical framework enables the detection of changes in selection intensity upon repeated trait transitions. We evaluated the performance of TraitRELAX using simulations and then applied it to two case studies. Using TraitRELAX, we found an intensification of selection in the primate SEMG2 gene in polygynandrous species compared to species of other mating forms, as well as changes in the intensity of purifying selection operating on sixteen bacterial genes upon transitioning from a free-living to an endosymbiotic lifestyle.[Evolutionary selection; intensification; $\gamma $-proteobacteria; genotype-phenotype; relaxation; SEMG2.].


Asunto(s)
Evolución Molecular , Fenotipo , Selección Genética , Animales , Codón , Modelos Genéticos , Filogenia , Primates/genética
4.
Nucleic Acids Res ; 48(18): e103, 2020 10 09.
Artículo en Inglés | MEDLINE | ID: mdl-32941639

RESUMEN

Adaptive evolution has shaped major biological processes. Finding the protein-coding genes and the sites that have been subjected to adaptation during evolutionary time is a major endeavor. However, very few methods fully automate the identification of positively selected genes, and widespread sources of genetic innovations such as gene duplication and recombination are absent from most pipelines. Here, we developed DGINN, a highly-flexible and public pipeline to Detect Genetic INNovations and adaptive evolution in protein-coding genes. DGINN automates, from a gene's sequence, all steps of the evolutionary analyses necessary to detect the aforementioned innovations, including the search for homologs in databases, assignation of orthology groups, identification of duplication and recombination events, as well as detection of positive selection using five methods to increase precision and ranking of genes when a large panel is analyzed. DGINN was validated on nineteen genes with previously-characterized evolutionary histories in primates, including some engaged in host-pathogen arms-races. Our results confirm and also expand results from the literature, including novel findings on the Guanylate-binding protein family, GBPs. This establishes DGINN as an efficient tool to automatically detect genetic innovations and adaptive evolution in diverse datasets, from the user's gene of interest to a large gene list in any species range.


Asunto(s)
Bases de Datos Genéticas , Primates/genética , Proteínas/genética , Animales , Evolución Molecular , Variación Genética , Selección Genética
5.
Bioinformatics ; 36(18): 4822-4824, 2020 09 15.
Artículo en Inglés | MEDLINE | ID: mdl-33085745

RESUMEN

MOTIVATION: Gene and species tree reconciliation methods are used to interpret gene trees, root them and correct uncertainties that are due to scarcity of signal in multiple sequence alignments. So far, reconciliation tools have not been integrated in standard phylogenetic software and they either lack performance on certain functions, or usability for biologists. RESULTS: We present Treerecs, a phylogenetic software based on duplication-loss reconciliation. Treerecs is simple to install and to use. It is fast and versatile, has a graphic output, and can be used along with methods for phylogenetic inference on multiple alignments like PLL and Seaview. AVAILABILITY AND IMPLEMENTATION: Treerecs is open-source. Its source code (C++, AGPLv3) and manuals are available from https://project.inria.fr/treerecs/.


Asunto(s)
Algoritmos , Evolución Molecular , Filogenia , Alineación de Secuencia , Programas Informáticos
6.
Genome Res ; 27(6): 1016-1028, 2017 06.
Artículo en Inglés | MEDLINE | ID: mdl-28424354

RESUMEN

The evolutionary origin of the striking genome size variations found in eukaryotes remains enigmatic. The effective size of populations, by controlling selection efficacy, is expected to be a key parameter underlying genome size evolution. However, this hypothesis has proved difficult to investigate using empirical data sets. Here, we tested this hypothesis using 22 de novo transcriptomes and low-coverage genomes of asellid isopods, which represent 11 independent habitat shifts from surface water to resource-poor groundwater. We show that these habitat shifts are associated with higher transcriptome-wide [Formula: see text] After ruling out the role of positive selection and pseudogenization, we show that these transcriptome-wide [Formula: see text] increases are the consequence of a reduction in selection efficacy imposed by the smaller effective population size of subterranean species. This reduction is paralleled by an important increase in genome size (25% increase on average), an increase also confirmed in subterranean decapods and mollusks. We also control for an adaptive impact of genome size on life history traits but find no correlation between body size, or growth rate, and genome size. We show instead that the independent increases in genome size measured in subterranean isopods are the direct consequence of increasing invasion rates by repeat elements, which are less efficiently purged out by purifying selection. Contrary to selection efficacy, polymorphism is not correlated to genome size. We propose that recent demographic fluctuations and the difficulty of observing polymorphism variation in polymorphism-poor species can obfuscate the link between effective population size and genome size when polymorphism data are used alone.


Asunto(s)
Especiación Genética , Tamaño del Genoma , Isópodos/genética , Filogenia , Selección Genética , Animales , Decápodos/clasificación , Decápodos/genética , Secuenciación de Nucleótidos de Alto Rendimiento , Isópodos/clasificación , Repeticiones de Microsatélite , Moluscos/clasificación , Moluscos/genética , Polimorfismo Genético , Transcriptoma
7.
Mol Biol Evol ; 35(3): 734-742, 2018 Mar 01.
Artículo en Inglés | MEDLINE | ID: mdl-29220511

RESUMEN

The measurement of synonymous and nonsynonymous substitution rates (dS and dN) is useful for assessing selection operating on protein sequences or for investigating mutational processes affecting genomes. In particular, the ratio dNdS is expected to be a good proxy for ω, the ratio of fixation probabilities of nonsynonymous mutations relative to that of neutral mutations. Standard methods for estimating dN, dS, or ω rely on the assumption that the base composition of sequences is at the equilibrium of the evolutionary process. In many clades, this assumption of stationarity is in fact incorrect, and we show here through simulations and analyses of empirical data that nonstationarity biases the estimate of dN, dS, and ω. We show that the bias in the estimate of ω can be fixed by explicitly taking into consideration nonstationarity in the modeling of codon evolution, in a maximum likelihood framework. Moreover, we propose an exact method for estimating dN and dS on branches, based on stochastic mapping, that can take into account nonstationarity. This method can be directly applied to any kind of codon evolution model, as long as neutrality is clearly parameterized.

8.
Mol Biol Evol ; 35(9): 2296-2306, 2018 09 01.
Artículo en Inglés | MEDLINE | ID: mdl-29986048

RESUMEN

In the history of life, some phenotypes have been acquired several times independently, through convergent evolution. Recently, lots of genome-scale studies have been devoted to identify nucleotides or amino acids that changed in a convergent manner when the convergent phenotypes evolved. These efforts have had mixed results, probably because of differences in the detection methods, and because of conceptual differences about the definition of a convergent substitution. Some methods contend that substitutions are convergent only if they occur on all branches where the phenotype changed toward the exact same state at a given nucleotide or amino acid position. Others are much looser in their requirements and define a convergent substitution as one that leads the site at which they occur to prefer a phylogeny in which species with the convergent phenotype group together. Here, we suggest to look for convergent shifts in amino acid preferences instead of convergent substitutions to the exact same amino acid. We define as convergent shifts substitutions that occur on all branches where the phenotype changed and such that they correspond to a change in the type of amino acid preferred at this position. We implement the corresponding model into a method named PCOC. We show on simulations that PCOC better recovers convergent shifts than existing methods in terms of sensitivity and specificity. We test it on a plant protein alignment where convergent evolution has been studied in detail and find that our method recovers several previously identified convergent substitutions and proposes credible new candidates.


Asunto(s)
Sustitución de Aminoácidos , Evolución Molecular , Técnicas Genéticas , Modelos Genéticos , Animales , Cyperaceae/genética , Mamíferos/genética
9.
Mol Biol Evol ; 35(12): 2900-2912, 2018 12 01.
Artículo en Inglés | MEDLINE | ID: mdl-30247705

RESUMEN

The rate of molecular evolution varies widely among species. Life history traits (LHTs) have been proposed as a major driver of these variations. However, the relative contribution of each trait is poorly understood. Here, we test the influence of metabolic rate (MR), longevity, and generation time (GT) on the nuclear and mitochondrial synonymous substitution rates using a group of isopod species that have made multiple independent transitions to subterranean environments. Subterranean species have repeatedly evolved a lower MR, a longer lifespan and a longer GT. We assembled the nuclear transcriptomes and the mitochondrial genomes of 13 pairs of closely related isopods, each pair composed of one surface and one subterranean species. We found that subterranean species have a lower rate of nuclear synonymous substitution than surface species whereas the mitochondrial rate remained unchanged. We propose that this decoupling between nuclear and mitochondrial rates comes from different DNA replication processes in these two compartments. In isopods, the nuclear rate is probably tightly controlled by GT alone. In contrast, mitochondrial genomes appear to replicate and mutate at a rate independent of LHTs. These results are incongruent with previous studies, which were mostly devoted to vertebrates. We suggest that this incongruence can be explained by developmental differences between animal clades, with a quiescent period during female gametogenesis in mammals and birds which imposes a nuclear and mitochondrial rate coupling, as opposed to the continuous gametogenesis observed in most arthropods.


Asunto(s)
Evolución Molecular , Genoma Mitocondrial , Isópodos/genética , Rasgos de la Historia de Vida , Animales , Replicación del ADN , Ecosistema , Transporte de Electrón , Isópodos/metabolismo , Isópodos/efectos de la radiación , Biosíntesis de Proteínas , Selección Genética
10.
BMC Bioinformatics ; 16 Suppl 14: S7, 2015.
Artículo en Inglés | MEDLINE | ID: mdl-26451469

RESUMEN

We study statistical estimators of the number of genomic events separating two genomes under a Double Cut-and Join (DCJ) rearrangement model, by a method of moment estimation. We first propose an exact, closed, analytically invertible formula for the expected number of breakpoints after a given number of DCJs. This improves over the heuristic, recursive and computationally slower previously proposed one. Then we explore the analogies of genome evolution by DCJ with evolution of binary sequences under substitutions, permutations under transpositions, and random graphs. Each of these are presented in the literature with intuitive justifications, and are used to import results from better known fields. We formalize the relations by proving a correspondence between moments in sequence and genome evolution, provided substitutions appear four by four in the corresponding model. Eventually we prove a bounded error on two estimators of the number of cycles in the breakpoint graph after a given number of rearrangements, by an analogy with cycles in permutations and components in random graphs.


Asunto(s)
Algoritmos , Evolución Molecular , Reordenamiento Génico , Genoma , Genómica/métodos , Modelos Genéticos , Simulación por Computador , Humanos
11.
BMC Bioinformatics ; 16 Suppl 14: S5, 2015.
Artículo en Inglés | MEDLINE | ID: mdl-26452018

RESUMEN

BACKGROUND: Most models of genome evolution concern either genetic sequences, gene content or gene order. They sometimes integrate two of the three levels, but rarely the three of them. Probabilistic models of gene order evolution usually have to assume constant gene content or adopt a presence/absence coding of gene neighborhoods which is blind to complex events modifying gene content. RESULTS: We propose a probabilistic evolutionary model for gene neighborhoods, allowing genes to be inserted, duplicated or lost. It uses reconciled phylogenies, which integrate sequence and gene content evolution. We are then able to optimize parameters such as phylogeny branch lengths, or probabilistic laws depicting the diversity of susceptibility of syntenic regions to rearrangements. We reconstruct a structure for ancestral genomes by optimizing a likelihood, keeping track of all evolutionary events at the level of gene content and gene synteny. Ancestral syntenies are associated with a probability of presence.


Asunto(s)
Drosophila/genética , Evolución Molecular , Genoma , Modelos Genéticos , Filogenia , Sintenía , Algoritmos , Animales , Drosophila/clasificación , Orden Génico
13.
Mol Biol Evol ; 30(8): 1745-50, 2013 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-23699471

RESUMEN

Efficient algorithms and programs for the analysis of the ever-growing amount of biological sequence data are strongly needed in the genomics era. The pace at which new data and methodologies are generated calls for the use of pre-existing, optimized-yet extensible-code, typically distributed as libraries or packages. This motivated the Bio++ project, aiming at developing a set of C++ libraries for sequence analysis, phylogenetics, population genetics, and molecular evolution. The main attractiveness of Bio++ is the extensibility and reusability of its components through its object-oriented design, without compromising the computer-efficiency of the underlying methods. We present here the second major release of the libraries, which provides an extended set of classes and methods. These extensions notably provide built-in access to sequence databases and new data structures for handling and manipulating sequences from the omics era, such as multiple genome alignments and sequencing reads libraries. More complex models of sequence evolution, such as mixture models and generic n-tuples alphabets, are also included.


Asunto(s)
Biología Computacional , Evolución Molecular , Programas Informáticos , Algoritmos , Biología Computacional/métodos , Genómica/métodos , Humanos , Internet
14.
Genome Biol Evol ; 16(4)2024 04 02.
Artículo en Inglés | MEDLINE | ID: mdl-38518756

RESUMEN

Ancestral reconstruction is a widely used technique that has been applied to understand the evolutionary history of gain and loss of gene families. Ancestral gene content can be reconstructed via different phylogenetic methods, but many current and previous studies employ Dollo parsimony. We hypothesize that Dollo parsimony is not appropriate for ancestral gene content reconstruction inferences based on sequence homology, as Dollo parsimony is derived from the assumption that a complex character cannot be regained. This premise does not accurately model molecular sequence evolution, in which false orthology can result from sequence convergence or lateral gene transfer. The aim of this study is to test Dollo parsimony's suitability for ancestral gene content reconstruction and to compare its inferences with a maximum likelihood-based approach that allows a gene family to be gained more than once within a tree. We first compared the performance of the two approaches on a series of artificial data sets each of 5,000 genes that were simulated according to a spectrum of evolutionary rates without gene gain or loss, so that inferred deviations from the true gene count would arise only from errors in orthology inference and ancestral reconstruction. Next, we reconstructed protein domain evolution on a phylogeny representing known eukaryotic diversity. We observed that Dollo parsimony produced numerous ancestral gene content overestimations, especially at nodes closer to the root of the tree. These observations led us to the conclusion that, confirming our hypothesis, Dollo parsimony is not an appropriate method for ancestral reconstruction studies based on sequence homology.


Asunto(s)
Evolución Molecular , Filogenia , Funciones de Verosimilitud
15.
Syst Biol ; 61(3): 510-21, 2012 May.
Artículo en Inglés | MEDLINE | ID: mdl-22331438

RESUMEN

Most models and algorithms developed to perform statistical inference from DNA data make the assumption that substitution processes affecting distinct nucleotide sites are stochastically independent. This assumption ensures both mathematical and computational tractability but is in disagreement with observed data in many situations--one well-known example being CpG dinucleotide hypermutability in mammalian genomes. In this paper, we consider the class of RN95 + YpR substitution models, which allows neighbor-dependent effects--including CpG hypermutability--to be taken into account, through transitions between pyrimidine-purine dinucleotides. We show that it is possible to adapt inference methods originally developed under the assumption of independence between sites to RN95 + YpR models, using a mathematically rigorous framework provided by specific structural properties of this class of models. We assess how efficient this approach is at inferring the CpG hypermutability rate from aligned DNA sequences. The method is tested on simulated data and compared against several alternatives; the results suggest that it delivers a high degree of accuracy at a low computational cost. We then apply our method to an alignment of 10 DNA sequences from primate species. Model comparisons within the RN95 + YpR class show the importance of taking into account neighbor-dependent effects. An application of the method to the detection of hypomethylated islands is discussed.


Asunto(s)
Clasificación/métodos , Modelos Biológicos , Animales , Simulación por Computador , Humanos , Filogenia , Primates/clasificación , Primates/genética
16.
Curr Biol ; 32(10): 2325-2333.e6, 2022 05 23.
Artículo en Inglés | MEDLINE | ID: mdl-35483362

RESUMEN

Cytoplasmic male sterility (CMS) is a form of genetic conflict over sex determination that results from differences in modes of inheritance between genomic compartments.1-3 Indeed, maternally transmitted (usually mitochondrial) genes sometimes enhance their transmission by suppressing the male function in a hermaphroditic organism to the detriment of biparentally inherited nuclear genes. Therefore, these hermaphrodites become functionally female and may coexist with regular hermaphrodites in so-called gynodioecious populations.3 CMS has been known in plants since Darwin's times4 but is previously unknown in the animal kingdom.5-8 We relate the first observation of CMS in animals. It occurs in a freshwater snail population, where some individuals appear unable to sire offspring in controlled crosses and show anatomical, physiological, and behavioral characters consistent with a suppression of the male function. Male sterility is associated with a mitochondrial lineage that underwent a spectacular acceleration of DNA substitution rates, affecting the entire mitochondrial genome-this acceleration concerns both synonymous and non-synonymous substitutions and therefore results from increased mitogenome mutation rates. Consequently, mitochondrial haplotype divergence within the population is exceptionally high, matching that observed between snail taxa that diverged 475 million years ago. This result is reminiscent of similar accelerations in mitogenome evolution observed in plant clades where gynodioecy is frequent,9,10 both being consistent with arms-race evolution of genome regions implicated in CMS.11,12 Our study shows that genomic conflicts can trigger independent evolution of similar sex-determination systems in plants and animals and dramatically accelerate molecular evolution.


Asunto(s)
ADN Mitocondrial , Genoma Mitocondrial , Animales , ADN Mitocondrial/genética , Evolución Molecular , Femenino , Haplotipos , Mitocondrias/genética
17.
Genome Biol ; 20(1): 5, 2019 01 07.
Artículo en Inglés | MEDLINE | ID: mdl-30616647

RESUMEN

BACKGROUND: The nearly neutral theory of molecular evolution predicts that the efficacy of natural selection increases with the effective population size. This prediction has been verified by independent observations in diverse taxa, which show that life-history traits are strongly correlated with measures of the efficacy of selection, such as the dN/dS ratio. Surprisingly, avian taxa are an exception to this theory because correlations between life-history traits and dN/dS are apparently absent. Here we explore the role of GC-biased gene conversion on estimates of substitution rates as a potential driver of these unexpected observations. RESULTS: We analyze the relationship between dN/dS estimated from alignments of 47 avian genomes and several proxies for effective population size. To distinguish the impact of GC-biased gene conversion from selection, we use an approach that accounts for non-stationary base composition and estimate dN/dS separately for changes affected or unaffected by GC-biased gene conversion. This analysis shows that the impact of GC-biased gene conversion on substitution rates can explain the lack of correlations between life-history traits and dN/dS. Strong correlations between life-history traits and dN/dS are recovered after accounting for GC-biased gene conversion. The correlations are robust to variation in base composition and genomic location. CONCLUSIONS: Our study shows that gene sequence evolution across a wide range of avian lineages meets the prediction of the nearly neutral theory, the efficacy of selection increases with effective population size. Moreover, our study illustrates that accounting for GC-biased gene conversion is important to correctly estimate the strength of selection.


Asunto(s)
Aves/genética , Conversión Génica , Flujo Genético , Selección Genética , Animales , Composición de Base , Aves/crecimiento & desarrollo , Cromosomas
18.
Philos Trans R Soc Lond B Biol Sci ; 374(1777): 20180234, 2019 07 22.
Artículo en Inglés | MEDLINE | ID: mdl-31154974

RESUMEN

In evolutionary genomics, researchers have taken an interest in identifying substitutions that subtend convergent phenotypic adaptations. This is a difficult question that requires distinguishing foreground convergent substitutions that are involved in the convergent phenotype from background convergent substitutions. Those may be linked to other adaptations, may be neutral or may be the consequence of mutational biases. Furthermore, there is no generally accepted definition of convergent substitutions. Various methods that use different definitions have been proposed in the literature, resulting in different sets of candidate foreground convergent substitutions. In this article, we first describe the processes that can generate foreground convergent substitutions in coding sequences, separating adaptive from non-adaptive processes. Second, we review methods that have been proposed to detect foreground convergent substitutions in coding sequences and expose the assumptions that underlie them. Finally, we examine their power on simulations of convergent changes-including in the presence of a change in the efficacy of selection-and on empirical alignments. This article is part of the theme issue 'Convergent evolution in the genomics era: new insights and directions'.


Asunto(s)
Aminoácidos/genética , Evolución Molecular , Proteínas/genética , Aminoácidos/metabolismo , Animales , Genómica , Humanos , Modelos Genéticos , Filogenia , Proteínas/metabolismo
19.
BMC Evol Biol ; 8: 272, 2008 Oct 03.
Artículo en Inglés | MEDLINE | ID: mdl-18834516

RESUMEN

BACKGROUND: Despite a large agreement between ribosomal RNA and concatenated protein phylogenies, the phylogenetic tree of the bacterial domain remains uncertain in its deepest nodes. For instance, the position of the hyperthermophilic Aquificales is debated, as their commonly observed position close to Thermotogales may proceed from horizontal gene transfers, long branch attraction or compositional biases, and may not represent vertical descent. Indeed, another view, based on the analysis of rare genomic changes, places Aquificales close to epsilon-Proteobacteria. RESULTS: To get a whole genome view of Aquifex relationships, all trees containing sequences from Aquifex in the HOGENOM database were surveyed. This study revealed that Aquifex is most often found as a neighbour to Thermotogales. Moreover, informational genes, which appeared to be less often transferred to the Aquifex lineage than non-informational genes, most often placed Aquificales close to Thermotogales. To ensure these results did not come from long branch attraction or compositional artefacts, a subset of carefully chosen proteins from a wide range of bacterial species was selected for further scrutiny. Among these genes, two phylogenetic hypotheses were found to be significantly more likely than the others: the most likely hypothesis placed Aquificales as a neighbour to Thermotogales, and the second one with epsilon-Proteobacteria. We characterized the genes that supported each of these two hypotheses, and found that differences in rates of evolution or in amino-acid compositions could not explain the presence of two incongruent phylogenetic signals in the alignment. Instead, evidence for a large Horizontal Gene Transfer between Aquificales and epsilon-Proteobacteria was found. CONCLUSION: Methods based on concatenated informational proteins and methods based on character cladistics led to different conclusions regarding the position of Aquificales because this lineage has undergone many horizontal gene transfers. However, if a tree of vertical descent can be reconstructed for Bacteria, our results suggest Aquificales should be placed close to Thermotogales.


Asunto(s)
Bacterias/clasificación , Bacterias/genética , Transferencia de Gen Horizontal/genética , Genoma Bacteriano/genética , Filogenia , Clasificación/métodos
20.
Gene ; 411(1-2): 87-93, 2008 Mar 31.
Artículo en Inglés | MEDLINE | ID: mdl-18281162

RESUMEN

Transposable elements, which are major components of most genomes, are known to accumulate in heterochromatic regions in which they have progressively diverged in sequence by mutations and internal deletions and insertions (indels) during the course of evolution. They therefore provide a record of the genomic events that have shaped the genomes, some of which could correspond to speciation events. Using the sequence divergence between the long terminal repeats (LTRs), we estimated the date of the insertion events of the LTR retrotransposon copies embedded within the heterochromatin regions of the Drosophila melanogaster genome. We did not detect traces of any specific waves of mobilization of retrotransposons within heterochromatin, apart from a very recent wave, which corresponds to the numerous LTR retrotransposon copies found in euchromatin.


Asunto(s)
Drosophila melanogaster/genética , Genoma de los Insectos , Heterocromatina , Retroelementos , Secuencias Repetidas Terminales , Animales
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA