RESUMO
Here we analyse genetic variation, population structure and diversity among 3,010 diverse Asian cultivated rice (Oryza sativa L.) genomes from the 3,000 Rice Genomes Project. Our results are consistent with the five major groups previously recognized, but also suggest several unreported subpopulations that correlate with geographic location. We identified 29 million single nucleotide polymorphisms, 2.4 million small indels and over 90,000 structural variations that contribute to within- and between-population variation. Using pan-genome analyses, we identified more than 10,000 novel full-length protein-coding genes and a high number of presence-absence variations. The complex patterns of introgression observed in domestication genes are consistent with multiple independent rice domestication events. The public availability of data from the 3,000 Rice Genomes Project provides a resource for rice genomics research and breeding.
Assuntos
Produtos Agrícolas/classificação , Produtos Agrícolas/genética , Variação Genética , Genoma de Planta/genética , Oryza/classificação , Oryza/genética , Ásia , Evolução Molecular , Genes de Plantas/genética , Genética Populacional , Genômica , Haplótipos , Mutação INDEL/genética , Filogenia , Melhoramento Vegetal , Polimorfismo de Nucleotídeo Único/genéticaRESUMO
Pollen allergies have long been a major pandemic health problem for human. However, the evolutionary events and biological function of pollen allergens in plants remain largely unknown. Here, we report the genome-wide prediction of pollen allergens and their biological function in the dicotyledonous model plant Arabidopsis (Arabidopsis thaliana) and the monocotyledonous model plant rice (Oryza sativa). In total, 145 and 107 pollen allergens were predicted from rice and Arabidopsis, respectively. These pollen allergens are putatively involved in stress responses and metabolic processes such as cell wall metabolism during pollen development. Interestingly, these putative pollen allergen genes were derived from large gene families and became diversified during evolution. Sequence analysis across 25 plant species from green alga to angiosperms suggest that about 40% of putative pollen allergenic proteins existed in both lower and higher plants, while other allergens emerged during evolution. Although a high proportion of gene duplication has been observed among allergen-coding genes, our data show that these genes might have undergone purifying selection during evolution. We also observed that epitopes of an allergen might have a biological function, as revealed by comprehensive analysis of two known allergens, expansin and profilin. This implies a crucial role of conserved amino acid residues in both in planta biological function and allergenicity. Finally, a model explaining how pollen allergens were generated and maintained in plants is proposed. Prediction and systematic analysis of pollen allergens in model plants suggest that pollen allergens were evolved by gene duplication and then functional specification. This study provides insight into the phylogenetic and evolutionary scenario of pollen allergens that will be helpful to future characterization and epitope screening of pollen allergens.
Assuntos
Alérgenos/genética , Arabidopsis/genética , Oryza/genética , Proteínas de Plantas/genética , Plantas/genética , Pólen/genética , Alérgenos/classificação , Alérgenos/metabolismo , Sequência de Aminoácidos , Arabidopsis/metabolismo , Análise por Conglomerados , Perfilação da Expressão Gênica/métodos , Regulação da Expressão Gênica de Plantas , Ontologia Genética , Genoma de Planta/genética , Humanos , Oryza/metabolismo , Filogenia , Proteínas de Plantas/classificação , Proteínas de Plantas/metabolismo , Plantas/classificação , Plantas/metabolismo , Pólen/metabolismo , Homologia de Sequência de Aminoácidos , Especificidade da EspécieRESUMO
The birth of new genes in genomes is an important evolutionary event. Several studies reveal that new genes in animals tend to be preferentially expressed in male reproductive tissues such as testis (Betrán et al., 2002; Begun et al., 2007; Dubruille et al., 2012), and thus an "out of testis" hypothesis for the emergence of new genes has been proposed (Vinckenbosch et al., 2006; Kaessmann, 2010). However, such phenomena have not been examined in plant species. Here, by employing a phylostratigraphic method, we dated the origin of protein-coding genes in rice and Arabidopsis thaliana and observed a number of young genes in both species. These young genes tend to encode short extracellular proteins, which may be involved in rapid evolving processes, such as reproductive barriers, species specification, and anti-microbial processes. Further analysis of transcriptome age indexes across different tissues revealed that male reproductive cells express a phylogenetically younger transcriptome than other plant tissues. Compared with sporophytic tissues, the young transcriptomes of the male gametophyte displayed greater complexity and diversity, which included a higher ratio of anti-sense and inter-genic transcripts, reflecting a pervasive transcription state that facilitated the emergence of new genes. Here, we propose that pollen may act as an "innovation incubator" for the birth of de novo genes. With cases of male-biased expression of young genes reported in animals, the "new genes out of the male" model revealed a common evolutionary force that drives reproductive barriers, species specification, and the upgrading of defensive mechanisms against pathogens.