RESUMEN
Morella rubra, red bayberry, is an economically important fruit tree in south China. Here, we assembled the first high-quality genome for both a female and a male individual of red bayberry. The genome size was 313-Mb, and 90% sequences were assembled into eight pseudo chromosome molecules, with 32 493 predicted genes. By whole-genome comparison between the female and male and association analysis with sequences of bulked and individual DNA samples from female and male, a 59-Kb region determining female was identified and located on distal end of pseudochromosome 8, which contains abundant transposable element and seven putative genes, four of them are related to sex floral development. This 59-Kb female-specific region was likely to be derived from duplication and rearrangement of paralogous genes and retained non-recombinant in the female-specific region. Sex-specific molecular markers developed from candidate genes co-segregated with sex in a genetically diverse female and male germplasm. We propose sex determination follow the ZW model of female heterogamety. The genome sequence of red bayberry provides a valuable resource for plant sex chromosome evolution and also provides important insights for molecular biology, genetics and modern breeding in Myricaceae family.
Asunto(s)
Evolución Molecular , Genoma de Planta/genética , Myrica/genética , Mapeo Cromosómico , Flores/genética , Flores/crecimiento & desarrollo , Flores/fisiología , Frutas/genética , Frutas/crecimiento & desarrollo , Frutas/fisiología , Marcadores Genéticos/genética , Anotación de Secuencia Molecular , Myrica/crecimiento & desarrollo , Myrica/fisiología , Especificidad de Órganos , FitomejoramientoRESUMEN
Bamboo occupies an important phylogenetic node in the grass family and plays a significant role in the forest industry. We produced 1.2 Mb of tetraploid moso bamboo (Phyllostachys pubescens E. Mazel ex H. de Leh.) sequences from 13 bacterial artificial chromosome (BAC) clones, and these are the largest genomic sequences available so far from the subfamily Bambusoideae. The content of repetitive elements (36.2%) in bamboo is similar to that in rice. Both rice and sorghum exhibit high genomic synteny with bamboo, which suggests that rice and sorghum may be useful as models for decoding Bambusoideae genomes.
Asunto(s)
Bambusa/genética , Genoma de Planta/genética , Oryza/genética , Sorghum/genética , Sintenía/genética , Secuencia de Bases , Cromosomas Artificiales Bacterianos/genética , Clonación Molecular , Genes de Plantas/genética , Anotación de Secuencia Molecular , Datos de Secuencia Molecular , Secuencias Repetitivas de Ácidos Nucleicos/genética , Análisis de Secuencia de ADN , Zea mays/genéticaRESUMEN
Begomoviruses are single-stranded DNA viruses and cause severe diseases in major crop plants worldwide. Based on current genome sequence analyses, we found that synonymous codon usage variations in the protein-coding genes of begomoviruses are mainly influenced by mutation bias. Base composition analysis suggested that the codon usage bias of AV1 and BV1 genes is significant and their expressions are high. Fourteen codons were determined as translational optimal ones according to the comparison of codon usage patterns between highly and lowly expressed genes. Interestingly the codon usages between begomoviruses from the Old and the New Worlds are apparently different, which supports the idea that the bipartite begomoviruses of the New World might originate from bipartite ones of the Old World, whereas the latter evolve from the Old World monopartite begomoviruses.
Asunto(s)
Begomovirus/genética , Evolución Biológica , Mapeo Cromosómico , Codón/genética , Análisis Mutacional de ADN , ADN Viral/genética , Evolución MolecularRESUMEN
Agrobacterium-mediated transformation has been widely used in producing transgenic plants, and was recently used to generate "transgene-clean" targeted genomic modifications coupled with the clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated (Cas9) system. Although tremendous variation in morphological and agronomic traits, such as plant height, seed fertility, and grain size, was observed in transgenic plants, the underlying mechanisms are not yet well understood, and the types and frequency of genetic variation in transformed plants have not been fully disclosed. To reveal the genome-wide variation in transformed plants, we sequenced the genomes of five independent T0 rice plants using next-generation sequencing (NGS) techniques. Bioinformatics analyses followed by experimental validation revealed the following: (1) in addition to transfer-DNA (T-DNA) insertions, three transformed plants carried heritable plasmid backbone DNA of variable sizes (855-5216 bp) and in different configurations with the T-DNA insertions (linked or apart); (2) each transgenic plant contained an estimated 338-1774 independent genetic variations (single nucleotide variations (SNVs) or small insertion/deletions); and (3) 2-6 new Tos17 insertions were detected in each transformed plant, but no other transposable elements or bacterial genomic DNA.
Asunto(s)
Agrobacterium/genética , Oryza/genética , Biología Computacional , Elementos Transponibles de ADN , Variación Genética , Secuenciación de Nucleótidos de Alto Rendimiento , Plantas Modificadas Genéticamente , Transformación BacterianaRESUMEN
Mutation (substitution, deletion, insertion, etc.) in nucleotide acid causes the maximal sequence lengths of exact match (MALE) between paralogous members from a duplicate event to become shorter during evolution. In this work, MALE changes between members of 26 gene families from four representative species (Arabidopsis thaliana, Oryza sativa, Mus musculus and Homo sapiens) were investigated. Comparative study of paralogous' MALE and amino acid substitution rate (d(A)<0.5) indicated that a close relationship existed between them. The results suggested that MALE could be a sound evolutionary scale for the divergent time for paralogous genes during their early evolution. A reference table between MALE and divergent time for the four species was set up, which would be useful widely, for large-scale genome alignment and comparison. As an example, detection of large-scale duplication events of rice genome based on the table was illustrated.
Asunto(s)
Arabidopsis/genética , Mapeo Cromosómico/métodos , Evolución Molecular , Oryza/genética , Proteoma/genética , Alineación de Secuencia/métodos , Análisis de Secuencia de Proteína/métodos , Secuencia de Aminoácidos , Sustitución de Aminoácidos , Animales , Secuencia Conservada/genética , Humanos , Ratones , Datos de Secuencia Molecular , Proteoma/metabolismo , Homología de Secuencia de Aminoácido , Especificidad de la EspecieRESUMEN
An ancient genome duplication (PPP1) that predates divergence of the cereals has recently been recognized. We report here another potentially older large-scale duplication (PPP2) event that predates monocot-dicot divergence in the genome of rice (Oryza sativa L.), as inferred from the age distribution of pairs of duplicate genes based on recent genome data for rice. Our results suggest that paleopolyploidy was widespread and played an important role in the evolution of rice.
Asunto(s)
Mapeo Cromosómico/métodos , Evolución Molecular , Genoma de Planta , Oryza/genética , Poliploidía , Evolución Biológica , Variación Genética/genéticaRESUMEN
The developmental behavior of gene expression for transparency of rice (TR) was analyzed from two-year experimental data at different filling stages by using both conditional and unconditional analysis methods for quantitative traits of seeds in cereal crops. It was revealed that the gene expression was most active for the performance of TR at the middle and late filling stages (8-21 d after flowering). The differences of expression at different environments for triploid endosperm nuclear genes, cytoplasmic genes and diploid maternal plant nuclear genes were found for the performance of TR. The phenomena of which some genetic effects were spasmodically expressible were detected among filling stages of rice. Because of the higher additive and cytoplasmic effects along with their interaction effects for TR at the filling stages, the better improving effects for this trait could be expected by selection in rice breeding. Some parents such as Zhenan 3 and 1391 were better than others for improving TR.
Asunto(s)
Regulación del Desarrollo de la Expresión Génica , Regulación de la Expresión Génica de las Plantas , Oryza/genética , Carácter Cuantitativo Heredable , Semillas/genética , Ambiente , Oryza/crecimiento & desarrolloRESUMEN
MOTIVATION: It was found that high accuracy splicing-site recognition of rice (Oryza sativa L.) DNA sequence is especially difficult. We described a new method for the splicing-site recognition of rice DNA sequences. METHOD: Based on the intron in eukaryotic organisms conforming to the principle of GT-AG, we used support vector machines (SVM) to predict the splicing sites. By machine learning, we built a model and used it to test the effect of the test data set of true and pseudo splicing sites. RESULTS: The prediction accuracy we obtained was 87.53% at the true 5' end splicing site and 87.37% at the true 3' end splicing sites. The results suggested that the SVM approach could achieve higher accuracy than the previous approaches.
Asunto(s)
ADN/genética , Oryza/genética , Empalme del ARN , Algoritmos , Vectores Genéticos , Intrones , Modelos TeóricosRESUMEN
Diversity arrays technology (DArT) is a microarray-based marker system that achieves high throughput by reducing the complexity of the genome. A DArT chip has recently been developed for tobacco. In this study, we genotyped 267 flue-cured cultivars/landraces, including 121 Chinese accessions over five decades from widespread geographic regions in China, 103 from the Americas, and 43 other foreign cultivars, using the newly developed chip. Three hundred and thirty polymorphic DArT makers were selected and used for a phylogenetic analysis, which suggested that the 267 accessions could be classified into two subgroups, which could each be further divided into 2â4 sections. Eight elite cultivars, which account for 83% of the area of Chinese tobacco production, were all found in one subgroup. Two high-quality cultivars, HHDJY and Cuibi1, were grouped together in one section, while six other high-yield cultivars were grouped into another section. The 330 DArT marker clones were sequenced and close to 95% of them are within non-repetitive regions. Finally, the implications of this study for Chinese flue-cured tobacco breeding and production programs were discussed.
Asunto(s)
Nicotiana/genética , Análisis de Secuencia por Matrices de Oligonucleótidos/métodos , Polimorfismo Genético , China , Mapeo Cromosómico , Análisis por Conglomerados , ADN de Plantas/análisis , Marcadores Genéticos/genética , Genoma de Planta , Genotipo , Geografía , Filogenia , Análisis de Secuencia de ADN , Estados UnidosRESUMEN
DNA methylation plays an important role in the epigenetic regulation of gene expression during plant growth, development, and polyploidization. However, there is still no distinct evidence in tobacco regarding the distribution of the methylation pattern and whether it contributes to qualitative characteristics. We studied the levels and patterns of methylation polymorphism at CCGG sites in 48 accessions of allotetraploid flue-cured tobacco, Nicotiana tabacum, using a methylation-sensitive amplified polymorphism (MSAP) technique. The results showed that methylation existed at a high level among tobacco accessions, among which 49.3% sites were methylated and 69.9% allelic sites were polymorphic. A cluster analysis revealed distinct patterns of geography-specific groups. In addition, three polymorphic sites significantly related to tobacco mosaic virus (TMV) resistance were explored. This suggests that tobacco breeders should pay more attention to epigenetic traits.