Your browser doesn't support javascript.
loading
: 20 | 50 | 100
1 - 20 de 2.611
1.
Mol Biol Evol ; 41(5)2024 May 03.
Article En | MEDLINE | ID: mdl-38667829

Different frequencies amongst codons that encode the same amino acid (i.e. synonymous codons) have been observed in multiple species. Studies focused on uncovering the forces that drive such codon usage showed that a combined effect of mutational biases and translational selection works to produce different frequencies of synonymous codons. However, only few have been able to measure and distinguish between these forces that may leave similar traces on the coding regions. Here, we have developed a codon model that allows the disentangling of mutation, selection on amino acids and synonymous codons, and GC-biased gene conversion (gBGC) which we employed on an extensive dataset of 415 chordates and 191 arthropods. We found that chordates need 15 more synonymous codon categories than arthropods to explain the empirical codon frequencies, which suggests that the extent of codon usage can vary greatly between animal phyla. Moreover, methylation at CpG sites seems to partially explain these patterns of codon usage in chordates but not in arthropods. Despite the differences between the two phyla, our findings demonstrate that in both, GC-rich codons are disfavored when mutations are GC-biased, and the opposite is true when mutations are AT-biased. This indicates that selection on the genomic coding regions might act primarily to stabilize its GC/AT content on a genome-wide level. Our study shows that the degree of synonymous codon usage varies considerably among animals, but is likely governed by a common underlying dynamic.


Arthropods , Codon Usage , Selection, Genetic , Animals , Arthropods/genetics , Chordata/genetics , Mutation , Evolution, Molecular , Codon , Models, Genetic , Base Composition , Gene Conversion
2.
Am J Hum Genet ; 111(4): 691-700, 2024 Apr 04.
Article En | MEDLINE | ID: mdl-38513668

We present a method for efficiently identifying clusters of identical-by-descent haplotypes in biobank-scale sequence data. Our multi-individual approach enables much more computationally efficient inference of identity by descent (IBD) than approaches that infer pairwise IBD segments and provides locus-specific IBD clusters rather than IBD segments. Our method's computation time, memory requirements, and output size scale linearly with the number of individuals in the dataset. We also present a method for using multi-individual IBD to detect alleles changed by gene conversion. Application of our methods to the autosomal sequence data for 125,361 White British individuals in the UK Biobank detects more than 9 million converted alleles. This is 2,900 times more alleles changed by gene conversion than were detected in a previous analysis of familial data. We estimate that more than 250,000 sequenced probands and a much larger number of additional genomes from multi-generational family members would be required to find a similar number of alleles changed by gene conversion using a family-based approach. Our IBD clustering method is implemented in the open-source ibd-cluster software package.


Biological Specimen Banks , Gene Conversion , Humans , Software , Haplotypes/genetics , Chromosomes , Polymorphism, Single Nucleotide
3.
PLoS Biol ; 22(3): e3002507, 2024 Mar.
Article En | MEDLINE | ID: mdl-38451924

While the malaria parasite Plasmodium falciparum has low average genome-wide diversity levels, likely due to its recent introduction from a gorilla-infecting ancestor (approximately 10,000 to 50,000 years ago), some genes display extremely high diversity levels. In particular, certain proteins expressed on the surface of human red blood cell-infecting merozoites (merozoite surface proteins (MSPs)) possess exactly 2 deeply diverged lineages that have seemingly not recombined. While of considerable interest, the evolutionary origin of this phenomenon remains unknown. In this study, we analysed the genetic diversity of 2 of the most variable MSPs, DBLMSP and DBLMSP2, which are paralogs (descended from an ancestral duplication). Despite thousands of available Illumina WGS datasets from malaria-endemic countries, diversity in these genes has been hard to characterise as reads containing highly diverged alleles completely fail to align to the reference genome. To solve this, we developed a pipeline leveraging genome graphs, enabling us to genotype them at high accuracy and completeness. Using our newly- resolved sequences, we found that both genes exhibit 2 deeply diverged lineages in a specific protein domain (DBL) and that one of the 2 lineages is shared across the genes. We identified clear evidence of nonallelic gene conversion between the 2 genes as the likely mechanism behind sharing, leading us to propose that gene conversion between diverged paralogs, and not recombination suppression, can generate this surprising genealogy; a model that is furthermore consistent with high diversity levels in these 2 genes despite the strong historical P. falciparum transmission bottleneck.


Hominidae , Malaria, Falciparum , Malaria , Parasites , Animals , Humans , Plasmodium falciparum/metabolism , Parasites/metabolism , Gene Conversion , Antigens, Surface , Malaria/parasitology , Protozoan Proteins/genetics , Protozoan Proteins/metabolism , Genetic Variation
4.
Nat Commun ; 15(1): 1915, 2024 Mar 01.
Article En | MEDLINE | ID: mdl-38429336

Artificial biomolecular condensates are emerging as a versatile approach to organize molecular targets and reactions without the need for lipid membranes. Here we ask whether the temporal response of artificial condensates can be controlled via designed chemical reactions. We address this general question by considering a model problem in which a phase separating component participates in reactions that dynamically activate or deactivate its ability to self-attract. Through a theoretical model we illustrate the transient and equilibrium effects of reactions, linking condensate response and reaction parameters. We experimentally realize our model problem using star-shaped DNA motifs known as nanostars to generate condensates, and we take advantage of strand invasion and displacement reactions to kinetically control the capacity of nanostars to interact. We demonstrate reversible dissolution and growth of DNA condensates in the presence of specific DNA inputs, and we characterize the role of toehold domains, nanostar size, and nanostar valency. Our results will support the development of artificial biomolecular condensates that can adapt to environmental changes with prescribed temporal dynamics.


Biomolecular Condensates , DNA Packaging , DNA Replication , Gene Conversion , Nucleotide Motifs
5.
J Evol Biol ; 37(4): 383-400, 2024 Apr 14.
Article En | MEDLINE | ID: mdl-38367009

Population genetic inference of selection on the nucleotide sequence level often proceeds by comparison to a reference sequence evolving only under mutation and population demography. Among the few candidates for such a reference sequence is the 5' part of short introns (5SI) in Drosophila. In addition to mutation and population demography, however, there is evidence for a weak force favouring GC bases, likely due to GC-biased gene conversion (gBGC), and for the effect of linked selection. Here, we use polymorphism and divergence data of Drosophila melanogaster to detect and describe the forces affecting the evolution of the 5SI. We separately analyse mutation classes, compare them between chromosomes, and relate them to recombination rate frequencies. GC-conservative mutations seem to be mainly influenced by mutation and drift, with linked selection mostly causing differences between the central and the peripheral (i.e., telomeric and centromeric) regions of the chromosome arms. Comparing GC-conservative mutation patterns between autosomes and the X chromosome showed differences in mutation rates, rather than linked selection, in the central chromosomal regions after accounting for differences in effective population sizes. On the other hand, GC-changing mutations show asymmetric site frequency spectra, indicating the presence of gBGC, varying among mutation classes and in intensity along chromosomes, but approximately equal in strength in autosomes and the X chromosome.


Drosophila melanogaster , Gene Conversion , Animals , Drosophila melanogaster/genetics , Introns , Evolution, Molecular , Mutation , Drosophila/genetics , X Chromosome/genetics , Selection, Genetic
6.
HLA ; 103(2): e15386, 2024 Feb.
Article En | MEDLINE | ID: mdl-38342852

Identification of novel HLA-A*23:128 allele generated by interlocus gene conversion in Brazilian bone marrow donor.


Bone Marrow , Gene Conversion , Humans , Brazil , Alleles , Tissue Donors , HLA-A Antigens/genetics
7.
BMC Plant Biol ; 23(1): 608, 2023 Dec 01.
Article En | MEDLINE | ID: mdl-38036992

BACKGROUND: Despite GC variation constitutes a fundamental element of genome and species diversity, the precise mechanisms driving it remain unclear. The abundant sequence data available for the ITS2, a commonly employed phylogenetic marker in plants, offers an exceptional resource for exploring the GC variation across angiosperms. RESULTS: A comprehensive selection of 8666 species, comprising 165 genera, 63 families, and 30 orders were used for the analyses. The alignment of ITS2 sequence-structures and partitioning of secondary structures into paired and unpaired regions were performed using 4SALE. Substitution rates and frequencies among GC base-pairs in the paired regions of ITS2 were calculated using RNA-specific models in the PHASE package. The results showed that the distribution of ITS2 GC contents on the angiosperm phylogeny was heterogeneous, but their increase was generally associated with ITS2 sequence homogenization, thereby supporting the occurrence of GC-biased gene conversion (gBGC) during the concerted evolution of ITS2. Additionally, the GC content in the paired regions of the ITS2 secondary structure was significantly higher than that of the unpaired regions, indicating the selection of GC for thermodynamic stability. Furthermore, the RNA substitution models demonstrated that base-pair transformations favored both the elevation and fixation of GC in the paired regions, providing further support for gBGC. CONCLUSIONS: Our findings highlight the significance of secondary structure in GC investigation, which demonstrate that both gBGC and structure-based selection are influential factors driving angiosperm ITS2 GC content.


Magnoliopsida , Humans , Magnoliopsida/genetics , Phylogeny , Gene Conversion , Base Composition , RNA , Evolution, Molecular
8.
Genome Res ; 33(10): 1673-1689, 2023 10.
Article En | MEDLINE | ID: mdl-37884342

Ultraconserved elements (UCEs) are the most conserved regions among the genomes of evolutionarily distant species and are thought to play critical biological functions. However, some UCEs rapidly evolved in specific lineages, and whether they contributed to adaptive evolution is still controversial. Here, using an increased number of sequenced genomes with high taxonomic coverage, we identified 2191 mammalian UCEs and 5938 avian UCEs from 95 mammal and 94 bird genomes, respectively. Our results show that these UCEs are functionally constrained and that their adjacent genes are prone to widespread expression with low expression diversity across tissues. Functional enrichment of mammalian and avian UCEs shows different trends indicating that UCEs may contribute to adaptive evolution of taxa. Focusing on lineage-specific accelerated evolution, we discover that the proportion of fast-evolving UCEs in nine mammalian and 10 avian test lineages range from 0.19% to 13.2%. Notably, up to 62.1% of fast-evolving UCEs in test lineages are much more likely to result from GC-biased gene conversion (gBGC). A single cervid-specific gBGC region embracing the uc.359 allele significantly alters the expression of Nova1 and other neural-related genes in the rat brain. Combined with the altered regulatory activity of ancient gBGC-induced fast-evolving UCEs in eutherians, our results provide evidence that synergy between gBGC and selection shaped lineage-specific substitution patterns, even in the most constrained regulatory elements. In summary, our results show that gBGC played an important role in facilitating lineage-specific accelerated evolution of UCEs, and further support the idea that a combination of multiple evolutionary forces shapes adaptive evolution.


Gene Conversion , Mammals , Animals , Rats , Mammals/genetics , Alleles , Birds/genetics , Evolution, Molecular , Neuro-Oncological Ventral Antigen
9.
Nat Commun ; 14(1): 5692, 2023 09 14.
Article En | MEDLINE | ID: mdl-37709766

In the absence of recombination, the number of transposable elements (TEs) increases due to less efficient selection, but the dynamics of such TE accumulations are not well characterized. Leveraging a dataset of 21 independent events of recombination cessation of different ages in mating-type chromosomes of Microbotryum fungi, we show that TEs rapidly accumulated in regions lacking recombination, but that TE content reached a plateau at ca. 50% of occupied base pairs by 1.5 million years following recombination suppression. The same TE superfamilies have expanded in independently evolved non-recombining regions, in particular rolling-circle replication elements (Helitrons). Long-terminal repeat (LTR) retrotransposons of the Copia and Ty3 superfamilies also expanded, through transposition bursts (distinguished from gene conversion based on LTR divergence), with both non-recombining regions and autosomes affected, suggesting that non-recombining regions constitute TE reservoirs. This study improves our knowledge of genome evolution by showing that TEs can accumulate through bursts, following non-linear decelerating dynamics.


DNA Transposable Elements , Reproduction , DNA Transposable Elements/genetics , Cell Communication , DNA Replication , Gene Conversion
10.
Mol Biol Evol ; 40(9)2023 09 01.
Article En | MEDLINE | ID: mdl-37675606

Following a duplication, the resulting paralogs tend to diverge. While mutation and natural selection can accelerate this process, they can also slow it. Here, we quantify the paralog homogenization that is caused by point mutations and interlocus gene conversion (IGC). Among 164 duplicated teleost genes, the median percentage of postduplication codon substitutions that arise from IGC rather than point mutation is estimated to be between 7% and 8%. By differentiating between the nonsynonymous codon substitutions that homogenize the protein sequences of paralogs and the nonhomogenizing nonsynonymous substitutions, we estimate the homogenizing nonsynonymous rates to be higher for 163 of the 164 teleost data sets as well as for all 14 data sets of duplicated yeast ribosomal protein-coding genes that we consider. For all 14 yeast data sets, the estimated homogenizing nonsynonymous rates exceed the synonymous rates.


Gene Conversion , Magnoliopsida , Saccharomyces cerevisiae , Amino Acid Sequence , Genes, Duplicate , Selection, Genetic
11.
Genome Biol Evol ; 15(8)2023 08 01.
Article En | MEDLINE | ID: mdl-37565492

Coding sequence evolution is influenced by both natural selection and neutral evolutionary forces. In many species, the effects of mutation bias, codon usage, and GC-biased gene conversion (gBGC) on gene sequence evolution have not been detailed. Quantification of how these forces shape substitution patterns is therefore necessary to understand the strength and direction of natural selection. Here, we used comparative genomics to investigate the association between base composition and codon usage bias on gene sequence evolution in butterflies and moths (Lepidoptera), including an in-depth analysis of underlying patterns and processes in one species, Leptidea sinapis. The data revealed significant G/C to A/T substitution bias at third codon position with some variation in the strength among different butterfly lineages. However, the substitution bias was lower than expected from previously estimated mutation rate ratios, partly due to the influence of gBGC. We found that A/T-ending codons were overrepresented in most species, but there was a positive association between the magnitude of codon usage bias and GC-content in third codon positions. In addition, the tRNA-gene population in L. sinapis showed higher GC-content at third codon positions compared to coding sequences in general and less overrepresentation of A/T-ending codons. There was an inverse relationship between synonymous substitutions and codon usage bias indicating selection on synonymous sites. We conclude that the evolutionary rate in Lepidoptera is affected by a complex interaction between underlying G/C -> A/T mutation bias and partly counteracting fixation biases, predominantly conferred by overall purifying selection, gBGC, and selection on codon usage.


Butterflies , Animals , Butterflies/genetics , Codon Usage , Base Composition , Codon , Gene Conversion , Selection, Genetic , Evolution, Molecular
12.
Bioinformatics ; 39(8)2023 08 01.
Article En | MEDLINE | ID: mdl-37535674

MOTIVATION: Meiotic recombination is the main driving force of human genetic diversity, along with mutations. Recombinations split into crossovers, separating large chromosomal regions originating from different homologous chromosomes, and non-crossovers (NCOs), where a small segment from one chromosome is embedded in a region originating from the homologous chromosome. NCOs are much less studied than mutations and crossovers as NCOs are short and can only be detected at markers heterozygous in the transmitting parent, leaving most of them undetectable. RESULTS: The detectable NCOs, known as gene conversions, hide information about NCOs, including their number and length, waiting to be unveiled. We introduce NCOurd, software, and algorithm, based on an expectation-maximization algorithm, to estimate the number of NCOs and their length distribution from gene conversion data. AVAILABILITY AND IMPLEMENTATION: https://github.com/DecodeGenetics/NCOurd.


Crossing Over, Genetic , Gene Conversion , Humans , Heterozygote , Meiosis
13.
Theor Popul Biol ; 153: 69-90, 2023 10.
Article En | MEDLINE | ID: mdl-37451508

Recombination often concentrates in small regions called recombination hotspots where recombination is much higher than the genome's average. In many vertebrates, including humans, gene PRDM9 specifies which DNA motifs will be the target for breaks that initiate recombination, ultimately determining the location of recombination hotspots. Because the sequence that breaks (allowing recombination) is converted into the sequence that does not break (preventing recombination), the latter sequence is over-transmitted to future generations and recombination hotspots are self-destructive. Given their self-destructive nature, recombination hotspots should eventually become extinct in genomes where they are found. While empirical evidence shows that individual hotspots do become inactive over time (die), hotspots are abundant in many vertebrates: a contradiction called the Recombination Hotspot Paradox. What saves recombination hotspots from their foretold extinction? Here we formulate a co-evolutionary model of the interaction among sequence-specific gene conversion, fertility selection, and recurrent mutation. We find that allelic frequencies oscillate leading to stable limit cycles. From a biological perspective this means that when fertility selection is weaker than gene conversion, it cannot stop individual hotspots from dying but can save them from extinction by driving their re-activation (resuscitation). In our model, mutation balances death and resuscitation of hotspots, thus maintaining their number over evolutionary time. Interestingly, we find that multiple alleles result in oscillations that are chaotic and multiple targets in oscillations that are asynchronous between targets thus helping to maintain the average genomic recombination probability constant. Furthermore, we find that the level of expression of PRDM9 should control for the fraction of targets that are hotspots and the overall temperature of the genome. Therefore, our co-evolutionary model improves our understanding of how hotspots may be replaced, thus contributing to solve the Recombination Hotspot Paradox. From a more applied perspective our work provides testable predictions regarding the relation between mutation probability and fertility selection with life expectancy of hotspots.


Gene Conversion , Recombination, Genetic , Humans , Animals , Mutation , Gene Frequency , Models, Genetic , Histone-Lysine N-Methyltransferase/genetics
14.
Nature ; 617(7960): 325-334, 2023 05.
Article En | MEDLINE | ID: mdl-37165237

Single-nucleotide variants (SNVs) in segmental duplications (SDs) have not been systematically assessed because of the limitations of mapping short-read sequencing data1,2. Here we constructed 1:1 unambiguous alignments spanning high-identity SDs across 102 human haplotypes and compared the pattern of SNVs between unique and duplicated regions3,4. We find that human SNVs are elevated 60% in SDs compared to unique regions and estimate that at least 23% of this increase is due to interlocus gene conversion (IGC) with up to 4.3 megabase pairs of SD sequence converted on average per human haplotype. We develop a genome-wide map of IGC donors and acceptors, including 498 acceptor and 454 donor hotspots affecting the exons of about 800 protein-coding genes. These include 171 genes that have 'relocated' on average 1.61 megabase pairs in a subset of human haplotypes. Using a coalescent framework, we show that SD regions are slightly evolutionarily older when compared to unique sequences, probably owing to IGC. SNVs in SDs, however, show a distinct mutational spectrum: a 27.1% increase in transversions that convert cytosine to guanine or the reverse across all triplet contexts and a 7.6% reduction in the frequency of CpG-associated mutations when compared to unique DNA. We reason that these distinct mutational properties help to maintain an overall higher GC content of SD DNA compared to that of unique DNA, probably driven by GC-biased conversion between paralogous sequences5,6.


Gene Conversion , Mutation , Segmental Duplications, Genomic , Humans , Gene Conversion/genetics , Genome, Human/genetics , Polymorphism, Single Nucleotide/genetics , Haplotypes/genetics , Exons/genetics , Cytosine/chemistry , Guanine/chemistry , CpG Islands/genetics
15.
Nature ; 618(7965): 557-565, 2023 Jun.
Article En | MEDLINE | ID: mdl-37198485

Centromeres are critical for cell division, loading CENH3 or CENPA histone variant nucleosomes, directing kinetochore formation and allowing chromosome segregation1,2. Despite their conserved function, centromere size and structure are diverse across species. To understand this centromere paradox3,4, it is necessary to know how centromeric diversity is generated and whether it reflects ancient trans-species variation or, instead, rapid post-speciation divergence. To address these questions, we assembled 346 centromeres from 66 Arabidopsis thaliana and 2 Arabidopsis lyrata accessions, which exhibited a remarkable degree of intra- and inter-species diversity. A. thaliana centromere repeat arrays are embedded in linkage blocks, despite ongoing internal satellite turnover, consistent with roles for unidirectional gene conversion or unequal crossover between sister chromatids in sequence diversification. Additionally, centrophilic ATHILA transposons have recently invaded the satellite arrays. To counter ATHILA invasion, chromosome-specific bursts of satellite homogenization generate higher-order repeats and purge transposons, in line with cycles of repeat evolution. Centromeric sequence changes are even more extreme in comparison between A. thaliana and A. lyrata. Together, our findings identify rapid cycles of transposon invasion and purging through satellite homogenization, which drive centromere evolution and ultimately contribute to speciation.


Arabidopsis , Centromere , DNA Transposable Elements , DNA, Satellite , Evolution, Molecular , Arabidopsis/genetics , Arabidopsis/metabolism , Centromere/genetics , Centromere/metabolism , DNA Transposable Elements/genetics , Histones/genetics , Histones/metabolism , Nucleosomes/genetics , Nucleosomes/metabolism , DNA, Satellite/genetics , Gene Conversion
16.
PLoS Genet ; 19(4): e1010702, 2023 04.
Article En | MEDLINE | ID: mdl-37053290

Heterozygous chromosome inversions suppress meiotic crossover (CO) formation within an inversion, potentially because they lead to gross chromosome rearrangements that produce inviable gametes. Interestingly, COs are also severely reduced in regions nearby but outside of inversion breakpoints even though COs in these regions do not result in rearrangements. Our mechanistic understanding of why COs are suppressed outside of inversion breakpoints is limited by a lack of data on the frequency of noncrossover gene conversions (NCOGCs) in these regions. To address this critical gap, we mapped the location and frequency of rare CO and NCOGC events that occurred outside of the dl-49 chrX inversion in D. melanogaster. We created full-sibling wildtype and inversion stocks and recovered COs and NCOGCs in the syntenic regions of both stocks, allowing us to directly compare rates and distributions of recombination events. We show that COs outside of the proximal inversion breakpoint are distributed in a distance-dependent manner, with strongest suppression near the inversion breakpoint. We find that NCOGCs occur evenly throughout the chromosome and, importantly, are not suppressed near inversion breakpoints. We propose a model in which COs are suppressed by inversion breakpoints in a distance-dependent manner through mechanisms that influence DNA double-strand break repair outcome but not double-strand break formation. We suggest that subtle changes in the synaptonemal complex and chromosome pairing might lead to unstable interhomolog interactions during recombination that permits NCOGC formation but not CO formation.


Drosophila melanogaster , Recombinational DNA Repair , Animals , Drosophila melanogaster/genetics , Chromosome Inversion/genetics , DNA Repair/genetics , Gene Conversion , Crossing Over, Genetic , Meiosis/genetics
17.
mBio ; 14(3): e0329022, 2023 06 27.
Article En | MEDLINE | ID: mdl-37093087

Meiosis is associated with genetic changes in the genome-via recombination, gene conversion, and mutations. The occurrence of gene conversion and mutations during meiosis may further be influenced by the chromatin conformation, similar to the effect of the chromatin conformation on the mitotic mutation rate. To date, however, the exact distribution and type of meiosis-associated changes and the role of the chromatin conformation in this context are largely unexplored. Here, we determine recombination, gene conversion, and de novo mutations using whole-genome sequencing of all meiotic products of 23 individual meioses in Zymoseptoria tritici, an important pathogen of wheat. We confirm a high genome-wide recombination rate of 65 centimorgan (cM)/Mb and see higher recombination rates on the accessory compared to core chromosomes. A substantial fraction of 0.16% of all polymorphic markers was affected by gene conversions, showing a weak GC-bias and occurring at higher frequency in regions of constitutive heterochromatin, indicated by the histone modification H3K9me3. The de novo mutation rate associated with meiosis was approximately three orders of magnitude higher than the corresponding mitotic mutation rate. Importantly, repeat-induced point mutation (RIP), a fungal defense mechanism against duplicated sequences, is active in Z. tritici and responsible for the majority of these de novo meiotic mutations. Our results indicate that the genetic changes associated with meiosis are a major source of variability in the genome of an important plant pathogen and shape its evolutionary trajectory. IMPORTANCE The impact of meiosis on the genome composition via gene conversion and mutations is mostly poorly understood, in particular, for non-model species. Here, we sequenced all four meiotic products for 23 individual meioses and determined the genetic changes caused by meiosis for the important fungal wheat pathogen Zymoseptoria tritici. We found a high rate of gene conversions and an effect of the chromatin conformation on gene conversion rates. Higher conversion rates were found in regions enriched with the H3K9me3-a mark for constitutive heterochromatin. Most importantly, meiosis was associated with a much higher frequency of de novo mutations than mitosis; 78% of the meiotic mutations were caused by repeat-induced point mutations-a fungal defense mechanism against duplicated sequences. In conclusion, the genetic changes associated with meiosis are therefore a major factor shaping the genome of this fungal pathogen.


Ascomycota , Gene Conversion , Point Mutation , Heterochromatin/genetics , Ascomycota/genetics , Mutation , Meiosis/genetics
18.
Nucleic Acids Res ; 51(8): 3869-3887, 2023 05 08.
Article En | MEDLINE | ID: mdl-36951094

A haploid of the fission yeast Schizosaccharomyces pombe expresses either the P or M mating-type, determined by the active, euchromatic, mat1 cassette. Mating-type is switched by Rad51-driven gene conversion of mat1 using a heterochromatic donor cassette, mat2-P or mat3-M. The Swi2-Swi5 complex, a mating-type switching factor, is central to this process by designating a preferred donor in a cell-type-specific manner. Swi2-Swi5 selectively enables one of two cis-acting recombination enhancers, SRE2 adjacent to mat2-P or SRE3 adjacent to mat3-M. Here, we identified two functionally important motifs in Swi2, a Swi6 (HP1 homolog)-binding site and two DNA-binding AT-hooks. Genetic analysis demonstrated that the AT-hooks were required for Swi2 localization at SRE3 to select the mat3-M donor in P cells, while the Swi6-binding site was required for Swi2 localization at SRE2 to select mat2-P in M cells. In addition, the Swi2-Swi5 complex promoted Rad51-driven strand exchange in vitro. Taken together, our results show how the Swi2-Swi5 complex would localize to recombination enhancers through a cell-type specific binding mechanism and stimulate Rad51-driven gene conversion at the localization site.


Schizosaccharomyces pombe Proteins , Schizosaccharomyces , DNA, Fungal/genetics , DNA, Fungal/metabolism , Gene Conversion , Genes, Fungal , Genes, Mating Type, Fungal/genetics , Recombination, Genetic , Schizosaccharomyces/genetics , Schizosaccharomyces/metabolism , Schizosaccharomyces pombe Proteins/genetics , Schizosaccharomyces pombe Proteins/metabolism
19.
PLoS One ; 18(2): e0266234, 2023.
Article En | MEDLINE | ID: mdl-36800354

Ehrlichia ruminantium is a tick-borne intracellular pathogen of ruminants that causes heartwater, a disease present in Sub-saharan Africa, islands in the Indian Ocean and the Caribbean, inducing significant economic losses. At present, three avirulent strains of E. ruminantium (Gardel, Welgevonden and Senegal isolates) have been produced by a process of serial passaging in mammalian cells in vitro, but unfortunately their use as vaccines do not offer a large range of protection against other strains, possibly due to the genetic diversity present within the species. So far no genetic basis for virulence attenuation has been identified in any E. ruminantium strain that could offer targets to facilitate vaccine production. Virulence attenuated Senegal strains have been produced twice independently, and require many fewer passages to attenuate than the other strains. We compared the genomes of a virulent and attenuated Senegal strain and identified a likely attenuator gene, ntrX, a global transcription regulator and member of a two-component system that is linked to environmental sensing. This gene has an inverted partial duplicate close to the parental gene that shows evidence of gene conversion in different E. ruminantium strains. The pseudogenisation of the gene in the avirulent Senegal strain occurred by gene conversion from the duplicate to the parent, transferring a 4 bp deletion which is unique to the Senegal strain partial duplicate amongst the wild isolates. We confirmed that the ntrX gene is not expressed in the avirulent Senegal strain by RT-PCR. The inverted duplicate structure combined with the 4 bp deletion in the Senegal strain can explain both the attenuation and the faster speed of attenuation in the Senegal strain relative to other strains of E. ruminantium. Our results identify nrtX as a promising target for the generation of attenuated strains of E. ruminantium by random or directed mutagenesis that could be used for vaccine production.


Ehrlichia ruminantium , Animals , Ehrlichia ruminantium/genetics , Gene Conversion , Senegal , Virulence/genetics , Segmental Duplications, Genomic , Ruminants/genetics
20.
Hum Mol Genet ; 32(1): 65-78, 2023 01 01.
Article En | MEDLINE | ID: mdl-35921243

The human MSY ampliconic region is mainly composed of large duplicated sequences that are organized in eight palindromes (termed P1-P8), and may undergo arm-to-arm gene conversion. Although the importance of these elements is widely recognized, their evolutionary dynamics are still nuanced. Here, we focused on the P8 palindrome, which shows a complex evolutionary history, being involved in intra- and inter-chromosomal gene conversion. To disclose its evolutionary complexity, we performed a high-depth (50×) targeted next-generation sequencing of this element in 157 subjects belonging to the most divergent lineages of the Y chromosome tree. We found a total of 72 polymorphic paralogous sequence variants that have been exploited to identify 41 Y-Y gene conversion events that occurred during recent human history. Through our analysis, we were able to categorize P8 arms into three portions, whose molecular diversity was modelled by different evolutionary forces. Notably, the outer region of the palindrome is not involved in any gene conversion event and evolves exclusively through the action of mutational pressure. The inner region is affected by Y-Y gene conversion occurring at a rate of 1.52 × 10-5 conversions/base/year, with no bias towards the retention of the ancestral state of the sequence. In this portion, GC-biased gene conversion is counterbalanced by a mutational bias towards AT bases. Finally, the middle region of the arms, in addition to intra-chromosomal gene conversion, is involved in X-to-Y gene conversion (at a rate of 6.013 × 10-8 conversions/base/year) thus being a major force in the evolution of the VCY/VCX gene family.


Chromosomes, Human, Y , Gene Conversion , Humans , Gene Conversion/genetics , Chromosomes, Human, Y/genetics , Mutation , Evolution, Molecular
...