RESUMEN
Colletotrichum destructivum (Cd) is a phytopathogenic fungus causing significant economic losses on forage legume crops (Medicago and Trifolium species) worldwide. To gain insights into the genetic basis of fungal virulence and host specificity, we sequenced the genome of an isolate from Medicago sativa using long-read (PacBio) technology. The resulting genome assembly has a total length of 51.7 Mb and comprises ten core chromosomes and two accessory chromosomes, all of which were sequenced from telomere to telomere. A total of 15,â631 gene models were predicted, including genes encoding potentially pathogenicity-related proteins such as candidate-secreted effectors (484), secondary metabolism key enzymes (110) and carbohydrate-active enzymes (619). Synteny analysis revealed extensive structural rearrangements in the genome of Cd relative to the closely related Brassicaceae pathogen, Colletotrichum higginsianum. In addition, a 1.2 Mb species-specific region was detected within the largest core chromosome of Cd that has all the characteristics of fungal accessory chromosomes (transposon-rich, gene-poor, distinct codon usage), providing evidence for exchange between these two genomic compartments. This region was also unique in having undergone extensive intra-chromosomal segmental duplications. Our findings provide insights into the evolution of accessory regions and possible mechanisms for generating genetic diversity in this asexual fungal pathogen.
Asunto(s)
Cromosomas Fúngicos , Colletotrichum , Genoma Fúngico , Enfermedades de las Plantas , Colletotrichum/genética , Colletotrichum/patogenicidad , Cromosomas Fúngicos/genética , Enfermedades de las Plantas/microbiología , Sintenía , Filogenia , Medicago sativa/microbiologíaRESUMEN
Knowledge of genetic determinism and evolutionary dynamics mediating host-pathogen interactions is essential to manage fungal plant diseases. Studies on the genetic architecture of fungal pathogenicity often focus on large-effect effector genes triggering strong, qualitative resistance. It is not clear how this translates to predominately quantitative interactions. Here, we use the Zymoseptoria tritici-wheat model to elucidate the genetic architecture of quantitative pathogenicity and mechanisms mediating host adaptation. With a multi-host genome-wide association study, we identify 19 high-confidence candidate genes associated with quantitative pathogenicity. Analysis of genetic diversity reveals that sequence polymorphism is the main evolutionary process mediating differences in quantitative pathogenicity, a process that is likely facilitated by genetic recombination and transposable element dynamics. Finally, we use functional approaches to confirm the role of an effector-like gene and a methyltransferase in phenotypic variation. This study highlights the complex genetic architecture of quantitative pathogenicity, extensive diversifying selection and plausible mechanisms facilitating pathogen adaptation.
Asunto(s)
Estudio de Asociación del Genoma Completo , Adaptación al Huésped , Virulencia/genética , Polimorfismo Genético , Interacciones Huésped-Patógeno/genética , Enfermedades de las Plantas/genética , Enfermedades de las Plantas/microbiologíaRESUMEN
Zymoseptoria tritici is the fungal pathogen responsible for Septoria tritici blotch on wheat. Disease outcome in this pathosystem is partly determined by isolate-specific resistance, where wheat resistance genes recognize specific fungal factors triggering an immune response. Despite the large number of known wheat resistance genes, fungal molecular determinants involved in such cultivar-specific resistance remain largely unknown. We identified the avirulence factor AvrStb9 using association mapping and functional validation approaches. Pathotyping AvrStb9 transgenic strains on Stb9 cultivars, near isogenic lines and wheat mapping populations, showed that AvrStb9 interacts with Stb9 resistance gene, triggering an immune response. AvrStb9 encodes an unusually large avirulence gene with a predicted secretion signal and a protease domain. It belongs to a S41 protease family conserved across different filamentous fungi in the Ascomycota class and may constitute a core effector. AvrStb9 is also conserved among a global Z. tritici population and carries multiple amino acid substitutions caused by strong positive diversifying selection. These results demonstrate the contribution of an 'atypical' conserved effector protein to fungal avirulence and the role of sequence diversification in the escape of host recognition, adding to our understanding of host-pathogen interactions and the evolutionary processes underlying pathogen adaptation.
Asunto(s)
Ascomicetos , Triticum , Triticum/genética , Triticum/microbiología , Péptido Hidrolasas/metabolismo , Proteínas Fúngicas/metabolismo , Endopeptidasas/metabolismo , Enfermedades de las Plantas/microbiologíaRESUMEN
Endogenous viruses form an important proportion of eukaryote genomes and a source of novel functions. How large DNA viruses integrated into a genome evolve when they confer a benefit to their host, however, remains unknown. Bracoviruses are essential for the parasitism success of parasitoid wasps, into whose genomes they integrated ~103 million years ago. Here we show, from the assembly of a parasitoid wasp genome at a chromosomal scale, that bracovirus genes colonized all ten chromosomes of Cotesia congregata. Most form clusters of genes involved in particle production or parasitism success. Genomic comparison with another wasp, Microplitis demolitor, revealed that these clusters were already established ~53 mya and thus belong to remarkably stable genomic structures, the architectures of which are evolutionary constrained. Transcriptomic analyses highlight temporal synchronization of viral gene expression without resulting in immune gene induction, suggesting that no conflicts remain between ancient symbiotic partners when benefits to them converge.
Asunto(s)
Evolución Biológica , Cromosomas de Insectos , Genoma de los Insectos , Polydnaviridae/genética , Avispas/genética , Animales , Secuencia de Bases , Secuencia Conservada , Nudiviridae/genética , Receptores Odorantes/genética , Olfato , Simbiosis , Sintenía , Avispas/virologíaRESUMEN
The soilborne fungus Gaeumannomyces tritici (G. tritici) causes the take-all disease on wheat roots. Ambient pH has been shown to be critical in different steps of G. tritici life cycle such as survival in bulk soil, saprophytic growth, and pathogenicity on plants. There are however intra-specific variations and we previously found two types of G. tritici strains that grow preferentially either at acidic pH or at neutral/alkaline pH; gene expression involved in pH-signal transduction pathway and pathogenesis was differentially regulated in two strains representative of these types. To go deeper in the description of the genetic pathways and the understanding of this adaptative mechanism, transcriptome sequencing was achieved on two strains (PG6 and PG38) which displayed opposite growth profiles in two pH conditions (acidic and neutral). PG6, growing better at acidic pH, overexpressed in this condition genes related to cell proliferation. In contrast, PG38, which grew better at neutral pH, overexpressed in this condition genes involved in fatty acids and amino acid metabolisms, and genes potentially related to pathogenesis. This strain also expressed stress resistance mechanisms at both pH, to assert a convenient growth under various ambient pH conditions. These differences in metabolic pathway expression between strains at different pH might buffer the effect of field or soil variation in wheat fields, and explain the success of the pathogen.
Asunto(s)
Ascomicetos/genética , Transcriptoma/genética , Ascomicetos/crecimiento & desarrollo , Ascomicetos/aislamiento & purificación , Perfilación de la Expresión Génica , Regulación Fúngica de la Expresión Génica , Ontología de Genes , Genes Fúngicos , Concentración de Iones de Hidrógeno , Micelio/crecimiento & desarrollo , Especificidad de la Especie , TriticumRESUMEN
Although there are a number of bioinformatic tools to identify plant nucleotide-binding leucine-rich repeat (NLR) disease resistance genes based on conserved protein sequences, only a few of these tools have attempted to identify disease resistance genes that have not been annotated in the genome. The overall goal of the NLGenomeSweeper pipeline is to annotate NLR disease resistance genes, including RPW8, in the genome assembly with high specificity and a focus on complete functional genes. This is based on the identification of the complete NB-ARC domain, the most conserved domain of NLR genes, using the BLAST suite. In this way, the tool has a high specificity for complete genes and relatively intact pseudogenes. The tool returns all candidate NLR gene locations as well as InterProScan ORF and domain annotations for manual curation of the gene structure.
Asunto(s)
Genómica/métodos , Proteínas NLR/genética , Proteínas de Plantas/genética , Análisis de Secuencia de Proteína/métodos , Programas Informáticos/normas , Arabidopsis , Secuencia Conservada , Resistencia a la Enfermedad , Genómica/normas , Helianthus , Proteínas NLR/química , Proteínas de Plantas/química , Unión Proteica , Dominios Proteicos , Análisis de Secuencia de Proteína/normasRESUMEN
As members of the plant microbiota, arbuscular mycorrhizal fungi (AMF, Glomeromycotina) symbiotically colonize plant roots. AMF also possess their own microbiota, hosting some uncultivable endobacteria. Ongoing research has revealed the genetics underlying plant responses to colonization by AMF, but the fungal side of the relationship remains in the dark. Here, we sequenced the genome of Gigaspora margarita, a member of the Gigasporaceae in an early diverging group of the Glomeromycotina. In contrast to other AMF, G. margarita may host distinct endobacterial populations and possesses the largest fungal genome so far annotated (773.104 Mbp), with more than 64% transposable elements. Other unique traits of the G. margarita genome include the expansion of genes for inorganic phosphate metabolism, the presence of genes for production of secondary metabolites and a considerable number of potential horizontal gene transfer events. The sequencing of G. margarita genome reveals the importance of its immune system, shedding light on the evolutionary pathways that allowed early diverging fungi to interact with both plants and bacteria.
Asunto(s)
Fenómenos Fisiológicos Bacterianos , Glomeromycota/fisiología , Micorrizas/fisiología , Raíces de Plantas/microbiología , Plantas/microbiología , Simbiosis/fisiología , Bacterias/clasificación , Bacterias/genética , Secuencia de Bases , Transferencia de Gen Horizontal , Genoma Fúngico/genética , Glomeromycota/genética , Microbiota/genéticaRESUMEN
The Venturia genus comprises fungal species that are pathogens on Rosaceae host plants, including V. inaequalis and V. asperata on apple, V. aucupariae on sorbus and V. pirina on pear. Although the genetic structure of V. inaequalis populations has been investigated in detail, genomic features underlying these subdivisions remain poorly understood. Here, we report whole genome sequencing of 87 Venturia strains that represent each species and each population within V. inaequalis We present a PacBio genome assembly for the V. inaequalis EU-B04 reference isolate. The size of selected genomes was determined by flow cytometry, and varied from 45 to 93 Mb. Genome assemblies of V. inaequalis and V. aucupariae contain a high content of transposable elements (TEs), most of which belong to the Gypsy or Copia LTR superfamilies and have been inactivated by Repeat-Induced Point mutations. The reference assembly of V. inaequalis presents a mosaic structure of GC-equilibrated regions that mainly contain predicted genes and AT-rich regions, mainly composed of TEs. Six pairs of strains were identified as clones. Single-Nucleotide Polymorphism (SNP) analysis between these clones revealed a high number of SNPs that are mostly located in AT-rich regions due to misalignments and allowed determining a false discovery rate. The availability of these genome sequences is expected to stimulate genetics and population genomics research of Venturia pathogens. Especially, it will help understanding the evolutionary history of Venturia species that are pathogenic on different hosts, a history that has probably been substantially influenced by TEs.
Asunto(s)
Ascomicetos/genética , Genoma Fúngico , Genómica , Ascomicetos/clasificación , Biología Computacional/métodos , Genómica/métodos , Anotación de Secuencia Molecular , Filogenia , Enfermedades de las Plantas/microbiología , Polimorfismo de Nucleótido Simple , Secuenciación Completa del GenomaRESUMEN
BACKGROUND: Thanks to their ability to move around and replicate within genomes, transposable elements (TEs) are perhaps the most important contributors to genome plasticity and evolution. Their detection and annotation are considered essential in any genome sequencing project. The number of fully sequenced genomes is rapidly increasing with improvements in high-throughput sequencing technologies. A fully automated de novo annotation process for TEs is therefore required to cope with the deluge of sequence data.However, all automated procedures are error-prone, and an automated procedure for TE identification and classification would be no exception. It is therefore crucial to provide not only the TE reference sequences, but also evidence justifying their classification, at the scale of the whole genome. A few TE databases already exist, but none provides evidence to justify TE classification. Moreover, biological information about the sequences remains globally poor. RESULTS: We present here the RepetDB database developed in the framework of GnpIS, a genetic and genomic information system. RepetDB is designed to store and retrieve detected, classified and annotated TEs in a standardized manner. RepetDB is an implementation with extensions of InterMine, an open-source data warehouse framework used here to store, search, browse, analyze and compare all the data recorded for each TE reference sequence. InterMine can display diverse information for each sequence and allows simple to very complex queries. Finally, TE data are displayed via a worldwide data discovery portal. RepetDB is accessible at urgi.versailles.inra.fr/repetdb. CONCLUSIONS: RepetDB is designed to be a TE knowledge base populated with full de novo TE annotations of complete (or near-complete) genome sequences. Indeed, the description and classification of TEs facilitates the exploration of specific TE families, superfamilies or orders across a large range of species. It also makes possible cross-species searches and comparisons of TE family content between genomes.
RESUMEN
Oaks are an important part of our natural and cultural heritage. Not only are they ubiquitous in our most common landscapes1 but they have also supplied human societies with invaluable services, including food and shelter, since prehistoric times2. With 450 species spread throughout Asia, Europe and America3, oaks constitute a critical global renewable resource. The longevity of oaks (several hundred years) probably underlies their emblematic cultural and historical importance. Such long-lived sessile organisms must persist in the face of a wide range of abiotic and biotic threats over their lifespans. We investigated the genomic features associated with such a long lifespan by sequencing, assembling and annotating the oak genome. We then used the growing number of whole-genome sequences for plants (including tree and herbaceous species) to investigate the parallel evolution of genomic characteristics potentially underpinning tree longevity. A further consequence of the long lifespan of trees is their accumulation of somatic mutations during mitotic divisions of stem cells present in the shoot apical meristems. Empirical4 and modelling5 approaches have shown that intra-organismal genetic heterogeneity can be selected for6 and provides direct fitness benefits in the arms race with short-lived pests and pathogens through a patchwork of intra-organismal phenotypes7. However, there is no clear proof that large-statured trees consist of a genetic mosaic of clonally distinct cell lineages within and between branches. Through this case study of oak, we demonstrate the accumulation and transmission of somatic mutations and the expansion of disease-resistance gene families in trees.
Asunto(s)
Genoma de Planta/genética , Quercus/genética , Evolución Biológica , ADN de Plantas/genética , Variación Genética/genética , Longevidad/genética , Mutación , Filogenia , Análisis de Secuencia de ADNRESUMEN
As a part of the ELIXIR-EXCELERATE efforts in capacity building, we present here 10 steps to facilitate researchers getting started in genome assembly and genome annotation. The guidelines given are broadly applicable, intended to be stable over time, and cover all aspects from start to finish of a general assembly and annotation project. Intrinsic properties of genomes are discussed, as is the importance of using high quality DNA. Different sequencing technologies and generally applicable workflows for genome assembly are also detailed. We cover structural and functional annotation and encourage readers to also annotate transposable elements, something that is often omitted from annotation workflows. The importance of data management is stressed, and we give advice on where to submit data and how to make your results Findable, Accessible, Interoperable, and Reusable (FAIR).
RESUMEN
BACKGROUND: Coniophora olivacea is a basidiomycete fungus belonging to the order Boletales that produces brown-rot decay on dead wood of conifers. The Boletales order comprises a diverse group of species including saprotrophs and ectomycorrhizal fungi that show important differences in genome size. RESULTS: In this study we report the 39.07-megabase (Mb) draft genome assembly and annotation of C. olivacea. A total of 14,928 genes were annotated, including 470 putatively secreted proteins enriched in functions involved in lignocellulose degradation. Using similarity clustering and protein structure prediction we identified a new family of 10 putative lytic polysaccharide monooxygenase genes. This family is conserved in basidiomycota and lacks of previous functional annotation. Further analyses showed that C. olivacea has a low repetitive genome, with 2.91% of repeats and a restrained content of transposable elements (TEs). The annotation of TEs in four related Boletales yielded important differences in repeat content, ranging from 3.94 to 41.17% of the genome size. The distribution of insertion ages of LTR-retrotransposons showed that differential expansions of these repetitive elements have shaped the genome architecture of Boletales over the last 60 million years. CONCLUSIONS: Coniophora olivacea has a small, compact genome that shows macrosynteny with Coniophora puteana. The functional annotation revealed the enzymatic signature of a canonical brown-rot. The annotation and comparative genomics of transposable elements uncovered their particular contraction in the Coniophora genera, highlighting their role in the differential genome expansions found in Boletales species.
Asunto(s)
Basidiomycota/genética , Evolución Molecular , Genoma Fúngico , Basidiomycota/clasificación , Proteínas Fúngicas/genética , Tamaño del Genoma , Genómica , Anotación de Secuencia Molecular , Familia de Multigenes , Filogenia , Proteómica , ADN Polimerasa Dirigida por ARN/genética , Retroelementos , Secuencias Repetidas TerminalesRESUMEN
BACKGROUND: The ascomycete fungus Colletotrichum higginsianum causes anthracnose disease of brassica crops and the model plant Arabidopsis thaliana. Previous versions of the genome sequence were highly fragmented, causing errors in the prediction of protein-coding genes and preventing the analysis of repetitive sequences and genome architecture. RESULTS: Here, we re-sequenced the genome using single-molecule real-time (SMRT) sequencing technology and, in combination with optical map data, this provided a gapless assembly of all twelve chromosomes except for the ribosomal DNA repeat cluster on chromosome 7. The more accurate gene annotation made possible by this new assembly revealed a large repertoire of secondary metabolism (SM) key genes (89) and putative biosynthetic pathways (77 SM gene clusters). The two mini-chromosomes differed from the ten core chromosomes in being repeat- and AT-rich and gene-poor but were significantly enriched with genes encoding putative secreted effector proteins. Transposable elements (TEs) were found to occupy 7% of the genome by length. Certain TE families showed a statistically significant association with effector genes and SM cluster genes and were transcriptionally active at particular stages of fungal development. All 24 subtelomeres were found to contain one of three highly-conserved repeat elements which, by providing sites for homologous recombination, were probably instrumental in four segmental duplications. CONCLUSION: The gapless genome of C. higginsianum provides access to repeat-rich regions that were previously poorly assembled, notably the mini-chromosomes and subtelomeres, and allowed prediction of the complete SM gene repertoire. It also provides insights into the potential role of TEs in gene and genome evolution and host adaptation in this asexual pathogen.
Asunto(s)
Cromosomas Fúngicos/genética , Colletotrichum/genética , Colletotrichum/metabolismo , Elementos Transponibles de ADN/genética , Genómica , Familia de Multigenes/genética , Recombinación Homóloga/genética , Anotación de Secuencia Molecular , Filogenia , Mutación Puntual/genéticaRESUMEN
Zymoseptoria tritici is the causal agent of Septoria tritici blotch, a major pathogen of wheat globally and the most damaging pathogen of wheat in Europe. A gene-for-gene (GFG) interaction between Z. tritici and wheat cultivars carrying the Stb6 resistance gene has been postulated for many years, but the genes have not been identified. We identified AvrStb6 by combining quantitative trait locus mapping in a cross between two Swiss strains with a genome-wide association study using a natural population of c. 100 strains from France. We functionally validated AvrStb6 using ectopic transformations. AvrStb6 encodes a small, cysteine-rich, secreted protein that produces an avirulence phenotype on wheat cultivars carrying the Stb6 resistance gene. We found 16 nonsynonymous single nucleotide polymorphisms among the tested strains, indicating that AvrStb6 is evolving very rapidly. AvrStb6 is located in a highly polymorphic subtelomeric region and is surrounded by transposable elements, which may facilitate its rapid evolution to overcome Stb6 resistance. AvrStb6 is the first avirulence gene to be functionally validated in Z. tritici, contributing to our understanding of avirulence in apoplastic pathogens and the mechanisms underlying GFG interactions between Z. tritici and wheat.
Asunto(s)
Ascomicetos/patogenicidad , Resistencia a la Enfermedad/genética , Proteínas Fúngicas/metabolismo , Genes de Plantas , Triticum/genética , Triticum/microbiología , Secuencia de Aminoácidos , Secuencia de Bases , Mapeo Cromosómico , Proteínas Fúngicas/química , Estudio de Asociación del Genoma Completo , Desequilibrio de Ligamiento/genética , Fenotipo , Enfermedades de las Plantas/genética , Enfermedades de las Plantas/microbiología , Proteínas de Plantas/genética , Proteínas de Plantas/metabolismo , Polimorfismo Genético , Sitios de Carácter Cuantitativo/genética , Virulencia/genéticaRESUMEN
Botrydial (BOT) is a non-host specific phytotoxin produced by the polyphagous phytopathogenic fungus Botrytis cinerea. The genomic region of the BOT biosynthetic gene cluster was investigated and revealed two additional genes named Bcbot6 and Bcbot7. Analysis revealed that the G+C/A+T-equilibrated regions that contain the Bcbot genes alternate with A+T-rich regions made of relics of transposable elements that have undergone repeat-induced point mutations (RIP). Furthermore, BcBot6, a Zn(II)2Cys6 putative transcription factor was identified as a nuclear protein and the major positive regulator of BOT biosynthesis. In addition, the phenotype of the ΔBcbot6 mutant indicated that BcBot6 and therefore BOT are dispensable for the development, pathogenicity and response to abiotic stresses in the B. cinerea strain B05.10. Finally, our data revealed that B. pseudocinerea, that is also polyphagous and lives in sympatry with B. cinerea, lacks the ability to produce BOT. Identification of BcBot6 as the major regulator of BOT synthesis is the first step towards a comprehensive understanding of the complete regulation network of BOT synthesis and of its ecological role in the B. cinerea life cycle.
Asunto(s)
Aldehídos/metabolismo , Botrytis/genética , Compuestos Bicíclicos con Puentes/metabolismo , Proteínas Fúngicas/metabolismo , Regulación Fúngica de la Expresión Génica , Familia de Multigenes , Factores de Transcripción/metabolismo , Secuencia Rica en At , Botrytis/metabolismo , Botrytis/patogenicidad , Elementos Transponibles de ADN , ADN de Hongos , Proteínas Fúngicas/genética , Proteínas Nucleares/genética , Proteínas Nucleares/metabolismo , VirulenciaRESUMEN
The 1.5 Gbp/2C genome of pedunculate oak (Quercus robur) has been sequenced. A strategy was established for dealing with the challenges imposed by the sequencing of such a large, complex and highly heterozygous genome by a whole-genome shotgun (WGS) approach, without the use of costly and time-consuming methods, such as fosmid or BAC clone-based hierarchical sequencing methods. The sequencing strategy combined short and long reads. Over 49 million reads provided by Roche 454 GS-FLX technology were assembled into contigs and combined with shorter Illumina sequence reads from paired-end and mate-pair libraries of different insert sizes, to build scaffolds. Errors were corrected and gaps filled with Illumina paired-end reads and contaminants detected, resulting in a total of 17,910 scaffolds (>2 kb) corresponding to 1.34 Gb. Fifty per cent of the assembly was accounted for by 1468 scaffolds (N50 of 260 kb). Initial comparison with the phylogenetically related Prunus persica gene model indicated that genes for 84.6% of the proteins present in peach (mean protein coverage of 90.5%) were present in our assembly. The second and third steps in this project are genome annotation and the assignment of scaffolds to the oak genetic linkage map. In accordance with the Bermuda and Fort Lauderdale agreements and the more recent Toronto Statement, the oak genome data have been released into public sequence repositories in advance of publication. In this presubmission paper, the oak genome consortium describes its principal lines of work and future directions for analyses of the nature, function and evolution of the oak genome.
Asunto(s)
Genoma de Planta , Quercus/genética , Modelos Genéticos , Anotación de Secuencia Molecular , Filogenia , Quercus/clasificación , Análisis de Secuencia de ADNRESUMEN
BACKGROUND: The Avrk1 and Avra10 avirulence (AVR) genes encode effectors that increase the pathogenicity of the fungus Blumeria graminis f.sp. hordei (Bgh), the powdery mildew pathogen, in susceptible barley plants. In resistant barley, MLK1 and MLA10 resistance proteins recognize the presence of AVRK1 and AVRA10, eliciting the hypersensitive response typical of gene for gene interactions. Avrk1 and Avra10 have more than 1350 homologues in Bgh genome, forming the EKA (Effectors homologous to Avr k 1 and Avr a 10) gene family. RESULTS: We tested the hypothesis that the EKA family originated from degenerate copies of Class I LINE retrotransposons by analysing the EKA family in the genome of Bgh isolate DH14 with bioinformatic tools specially developed for the analysis of Transposable Elements (TE) in genomes. The Class I LINE retrotransposon copies homologous to Avrk1 and Avra10 represent 6.5 % of the Bgh annotated genome and, among them, we identified 293 AVR/effector candidate genes. We also experimentally identified peptides that indicated the translation of several predicted proteins from EKA family members, which had higher relative abundance in haustoria than in hyphae. CONCLUSIONS: Our analyses indicate that Avrk1 and Avra10 have evolved from part of the ORF1 gene of Class I LINE retrotransposons. The co-option of Avra10 and Avrk1 as effectors from truncated copies of retrotransposons explains the huge number of homologues in Bgh genome that could act as dynamic reservoirs from which new effector genes may evolve. These data provide further evidence for recruitment of retrotransposons in the evolution of new biological functions.
Asunto(s)
Ascomicetos/genética , Proteínas Fúngicas/genética , Hordeum/microbiología , Elementos de Nucleótido Esparcido Largo , Familia de Multigenes , Enfermedades de las Plantas/microbiología , Ascomicetos/clasificación , Ascomicetos/metabolismo , Biología Computacional , Secuencia de Consenso , Genoma Fúngico , Sistemas de Lectura Abierta , Filogenia , ProteómicaRESUMEN
Deciphering the genetic bases of pathogen adaptation to its host is a key question in ecology and evolution. To understand how the fungus Magnaporthe oryzae adapts to different plants, we sequenced eight M. oryzae isolates differing in host specificity (rice, foxtail millet, wheat, and goosegrass), and one Magnaporthe grisea isolate specific of crabgrass. Analysis of Magnaporthe genomes revealed small variation in genome sizes (39-43 Mb) and gene content (12,283-14,781 genes) between isolates. The whole set of Magnaporthe genes comprised 14,966 shared families, 63% of which included genes present in all the nine M. oryzae genomes. The evolutionary relationships among Magnaporthe isolates were inferred using 6,878 single-copy orthologs. The resulting genealogy was mostly bifurcating among the different host-specific lineages, but was reticulate inside the rice lineage. We detected traces of introgression from a nonrice genome in the rice reference 70-15 genome. Among M. oryzae isolates and host-specific lineages, the genome composition in terms of frequencies of genes putatively involved in pathogenicity (effectors, secondary metabolism, cazome) was conserved. However, 529 shared families were found only in nonrice lineages, whereas the rice lineage possessed 86 specific families absent from the nonrice genomes. Our results confirmed that the host specificity of M. oryzae isolates was associated with a divergence between lineages without major gene flow and that, despite the strong conservation of gene families between lineages, adaptation to different hosts, especially to rice, was associated with the presence of a small number of specific gene families. All information was gathered in a public database (http://genome.jouy.inra.fr/gemo).