RESUMO
BACKGROUND: The Avrk1 and Avra10 avirulence (AVR) genes encode effectors that increase the pathogenicity of the fungus Blumeria graminis f.sp. hordei (Bgh), the powdery mildew pathogen, in susceptible barley plants. In resistant barley, MLK1 and MLA10 resistance proteins recognize the presence of AVRK1 and AVRA10, eliciting the hypersensitive response typical of gene for gene interactions. Avrk1 and Avra10 have more than 1350 homologues in Bgh genome, forming the EKA (Effectors homologous to Avr k 1 and Avr a 10) gene family. RESULTS: We tested the hypothesis that the EKA family originated from degenerate copies of Class I LINE retrotransposons by analysing the EKA family in the genome of Bgh isolate DH14 with bioinformatic tools specially developed for the analysis of Transposable Elements (TE) in genomes. The Class I LINE retrotransposon copies homologous to Avrk1 and Avra10 represent 6.5 % of the Bgh annotated genome and, among them, we identified 293 AVR/effector candidate genes. We also experimentally identified peptides that indicated the translation of several predicted proteins from EKA family members, which had higher relative abundance in haustoria than in hyphae. CONCLUSIONS: Our analyses indicate that Avrk1 and Avra10 have evolved from part of the ORF1 gene of Class I LINE retrotransposons. The co-option of Avra10 and Avrk1 as effectors from truncated copies of retrotransposons explains the huge number of homologues in Bgh genome that could act as dynamic reservoirs from which new effector genes may evolve. These data provide further evidence for recruitment of retrotransposons in the evolution of new biological functions.
Assuntos
Ascomicetos/genética , Proteínas Fúngicas/genética , Hordeum/microbiologia , Elementos Nucleotídeos Longos e Dispersos , Família Multigênica , Doenças das Plantas/microbiologia , Ascomicetos/classificação , Ascomicetos/metabolismo , Biologia Computacional , Sequência Consenso , Genoma Fúngico , Fases de Leitura Aberta , Filogenia , ProteômicaRESUMO
Grasspea (Lathyrus sativus L.) is an underutilised but promising legume crop with tolerance to a wide range of abiotic and biotic stress factors, and potential for climate-resilient agriculture. Despite a long history and wide geographical distribution of cultivation, only limited breeding resources are available. This paper reports a 5.96 Gbp genome assembly of grasspea genotype LS007, of which 5.03 Gbp is scaffolded into 7 pseudo-chromosomes. The assembly has a BUSCO completeness score of 99.1% and is annotated with 31719 gene models and repeat elements. This represents the most contiguous and accurate assembly of the grasspea genome to date.
Assuntos
Cromossomos de Plantas , Genoma de Planta , Lathyrus , Lathyrus/genética , Cromossomos de Plantas/genéticaRESUMO
The filamentous bacterium Streptomyces coelicolor modulates polar growth and branching by phosphorylating the cytoskeletal protein DivIVA. Previous MALDI-TOF analysis of DivIVA showed that a large 7.2 kDa tryptic peptide was multiply phosphorylated. To aid localization of the phosphorylation sites, we introduced additional tryptic cleavage sites into DivIVA, and the resulting phosphopeptides were analyzed by LC-MS/MS. Phosphopeptide isomers could be separated chromatographically, but because of overlapping elution and spectrum quality, site assignment by standard software tools was ambiguous. Because fragment ions carrying the phosphate group are essential for confident localization, large numbers of spectra were collected using targeted LC-MS/MS, and a special script was developed for plotting the elution of site-determining fragments from those spectra under the XIC of the parent ions. Where multiple phosphopeptide isomers were present, the elution of the site-determining y-ions perfectly coincided with the elution of the corresponding phosphopeptide isomer. This method represents a useful tool for user inspection of spectra derived from phosphopeptide isomers and significantly increases confidence when defining phosphorylation sites. In this way, we show that DivIVA is phosphorylated in vivo on five sites in the C-terminal part of the protein (T304, S309, S338, S344, and S355). The data have been deposited to the ProteomeXchange Consortium with identifier PXD000095.
Assuntos
Proteínas de Bactérias/metabolismo , Proteínas de Ciclo Celular/metabolismo , Proteínas do Citoesqueleto/metabolismo , Processamento de Proteína Pós-Traducional , Streptomyces coelicolor/metabolismo , Sequência de Aminoácidos , Proteínas de Bactérias/química , Proteínas de Bactérias/isolamento & purificação , Proteínas de Ciclo Celular/química , Proteínas de Ciclo Celular/isolamento & purificação , Proteínas do Citoesqueleto/química , Proteínas do Citoesqueleto/isolamento & purificação , Isomerismo , Dados de Sequência Molecular , Fosfopeptídeos/química , Fosforilação , Espectrometria de Massas em TandemRESUMO
Grass pea (Lathyrus sativus L.) is a rich source of protein cultivated as an insurance crop in Ethiopia, Eritrea, India, Bangladesh, and Nepal. Its resilience to both drought and flooding makes it a promising crop for ensuring food security in a changing climate. The lack of genetic resources and the crop's association with the disease neurolathyrism have limited the cultivation of grass pea. Here, we present an annotated, long read-based assembly of the 6.5 Gbp L. sativus genome. Using this genome sequence, we have elucidated the biosynthetic pathway leading to the formation of the neurotoxin, ß-L-oxalyl-2,3-diaminopropionic acid (ß-L-ODAP). The final reaction of the pathway depends on an interaction between L. sativus acyl-activating enzyme 3 (LsAAE3) and a BAHD-acyltransferase (LsBOS) that form a metabolon activated by CoA to produce ß-L-ODAP. This provides valuable insight into the best approaches for developing varieties which produce substantially less toxin.
Assuntos
Diamino Aminoácidos , Lathyrus , Lathyrus/genética , Lathyrus/metabolismo , Diamino Aminoácidos/metabolismo , Neurotoxinas/metabolismo , GenômicaRESUMO
Numerous examples of biosynthetic gene clusters (BGCs), including for compounds of agricultural and medicinal importance, have now been discovered in plant genomes. However, little is known about how these complex traits are assembled and diversified. Here, we examine a large number of variants within and between species for a paradigm BGC (the thalianol cluster), which has evolved recently in a common ancestor of the Arabidopsis genus. Comparisons at the species level reveal differences in BGC organization and involvement of auxiliary genes, resulting in production of species-specific triterpenes. Within species, the thalianol cluster is primarily fixed, showing a low frequency of deleterious haplotypes. We further identify chromosomal inversion as a molecular mechanism that may shuffle more distant genes into the cluster, so enabling cluster compaction. Antagonistic natural selection pressures are likely involved in shaping the occurrence and maintenance of this BGC. Our work sheds light on the birth, life and death of complex genetic and metabolic traits in plants.
Assuntos
Arabidopsis/genética , Genoma de Planta , Família Multigênica , Triterpenos/metabolismo , Proteínas de Arabidopsis/genética , Vias Biossintéticas/genética , Evolução Molecular , Genes de Plantas/genética , Metabolismo SecundárioRESUMO
Scutellaria baicalensis Georgi is important in Chinese traditional medicine where preparations of dried roots, "Huang Qin," are used for liver and lung complaints and as complementary cancer treatments. We report a high-quality reference genome sequence for S. baicalensis where 93% of the 408.14-Mb genome has been assembled into nine pseudochromosomes with a super-N50 of 33.2 Mb. Comparison of this sequence with those of closely related species in the order Lamiales, Sesamum indicum and Salvia splendens, revealed that a specialized metabolic pathway for the synthesis of 4'-deoxyflavone bioactives evolved in the genus Scutellaria. We found that the gene encoding a specific cinnamate coenzyme A ligase likely obtained its new function following recent mutations, and that four genes encoding enzymes in the 4'-deoxyflavone pathway are present as tandem repeats in the genome of S. baicalensis. Further analyses revealed that gene duplications, segmental duplication, gene amplification, and point mutations coupled to gene neo- and subfunctionalizations were involved in the evolution of 4'-deoxyflavone synthesis in the genus Scutellaria. Our study not only provides significant insight into the evolution of specific flavone biosynthetic pathways in the mint family, Lamiaceae, but also will facilitate the development of tools for enhancing bioactive productivity by metabolic engineering in microbes or by molecular breeding in plants. The reference genome of S. baicalensis is also useful for improving the genome assemblies for other members of the mint family and offers an important foundation for decoding the synthetic pathways of bioactive compounds in medicinal plants.
Assuntos
Vias Biossintéticas/genética , Flavanonas , Flavonoides/genética , Scutellaria baicalensis/genética , Flavanonas/genética , Flavanonas/metabolismo , Flavonoides/metabolismo , Genoma de Planta , Medicina Tradicional Chinesa , Extratos Vegetais , Raízes de Plantas/metabolismo , Plantas Medicinais/genética , Plantas Medicinais/metabolismo , Scutellaria baicalensis/metabolismo , Sequenciamento Completo do GenomaRESUMO
Parasites are able to evolve rapidly and overcome host defense mechanisms, but the molecular basis of this adaptation is poorly understood. Powdery mildew fungi (Erysiphales, Ascomycota) are obligate biotrophic parasites infecting nearly 10,000 plant genera. They obtain their nutrients from host plants through specialized feeding structures known as haustoria. We previously identified the AVR(k1) powdery mildew-specific gene family encoding effectors that contribute to the successful establishment of haustoria. Here, we report the extensive proliferation of the AVR(k1) gene family throughout the genome of B. graminis, with sequences diverging in formae speciales adapted to infect different hosts. Also, importantly, we have discovered that the effectors have coevolved with a particular family of LINE-1 retrotransposons, named TE1a. The coevolution of these two entities indicates a mutual benefit to the association, which could ultimately contribute to parasite adaptation and success. We propose that the association would benefit 1) the powdery mildew fungus, by providing a mechanism for amplifying and diversifying effectors and 2) the associated retrotransposons, by providing a basis for their maintenance through selection in the fungal genome.
Assuntos
Ascomicetos/genética , Ascomicetos/patogenicidade , Elementos Nucleotídeos Longos e Dispersos/genética , Plantas/microbiologia , Retroelementos/genética , Ascomicetos/fisiologia , Evolução Molecular , Fungos/genética , Fungos/metabolismo , Biblioteca Gênica , Genes Fúngicos , Genoma Fúngico , Modelos Genéticos , Filogenia , Doenças das Plantas/microbiologia , VirulênciaRESUMO
PepLine is a fully automated software which maps MS/MS fragmentation spectra of trypsic peptides to genomic DNA sequences. The approach is based on Peptide Sequence Tags (PSTs) obtained from partial interpretation of QTOF MS/MS spectra (first module). PSTs are then mapped on the six-frame translations of genomic sequences (second module) giving hits. Hits are then clustered to detect potential coding regions (third module). Our work aimed at optimizing the algorithms of each component to allow the whole pipeline to proceed in a fully automated manner using raw nucleic acid sequences (i.e., genomes that have not been "reduced" to a database of ORFs or putative exons sequences). The whole pipeline was tested on controlled MS/MS spectra sets from standard proteins and from Arabidopsis thaliana envelope chloroplast samples. Our results demonstrate that PepLine competed with protein database searching softwares and was fast enough to potentially tackle large data sets and/or high size genomes. We also illustrate the potential of this approach for the detection of the intron/exon structure of genes.
Assuntos
Proteínas de Arabidopsis/análise , Genoma , Espectrometria de Massas , Peptídeos/análise , Software , Algoritmos , Sequência de Aminoácidos , Animais , Arabidopsis/química , Arabidopsis/citologia , Arabidopsis/genética , Proteínas de Arabidopsis/genética , Sequência de Bases , Cloroplastos/química , Cloroplastos/genética , Espectrometria de Massas/instrumentação , Espectrometria de Massas/métodos , Dados de Sequência Molecular , Peptídeos/genética , Alinhamento de SequênciaRESUMO
With the completion of the sequencing of the Arabidopsis genome and the recent advances in proteomic technology, the identification of proteins from highly complex mixtures is now possible. Rather than using gel electrophoresis and peptide mass fingerprinting, we have used multidimensional protein identification technology (MudPIT) to analyse the "tightly-bound" proteome for purified cell walls from Arabidopsis cell suspension cultures. Using bioinformatics for the prediction of signal peptides for targeting to the secretory pathway and for the absence of ER retention signal, 89 proteins were selected as potential extracellular proteins. Only 33% of these were identified in previous proteomic analyses of Arabidopsis cell walls. A functional classification revealed that a large proportion of the proteins were enzymes, notably carbohydrate active enzymes, peroxidases and proteases. Comparison of all the published proteomic analyses for the Arabidopsis cell wall identified 268 non-redundant genes encoding wall proteins. Sixty of these (22%) were derived from our analysis of tightly-bound wall proteins.
Assuntos
Proteínas de Arabidopsis/química , Arabidopsis/química , Parede Celular/química , Proteoma , Biologia Computacional , Processamento de Proteína Pós-TraducionalRESUMO
We sequenced 2.2 Mb representing triplicated genome segments of Brassica oleracea, which are each paralogous with one another and homologous with a segmentally duplicated region of the Arabidopsis thaliana genome. Sequence annotation identified 177 conserved collinear genes in the B. oleracea genome segments. Analysis of synonymous base substitution rates indicated that the triplicated Brassica genome segments diverged from a common ancestor soon after divergence of the Arabidopsis and Brassica lineages. This conclusion was corroborated by phylogenetic analysis of protein families. Using A. thaliana as an outgroup, 35% of the genes inferred to be present when genome triplication occurred in the Brassica lineage have been lost, most likely via a deletion mechanism, in an interspersed pattern. Genes encoding proteins involved in signal transduction or transcription were not found to be significantly more extensively retained than those encoding proteins classified with other functions, but putative proteins predicted in the A. thaliana genome were underrepresented in B. oleracea. We identified one example of gene loss from the Arabidopsis lineage. We found evidence for the frequent insertion of gene fragments of nuclear genomic origin and identified four apparently intact genes in noncollinear positions in the B. oleracea and A. thaliana genomes.