RESUMO
Chard (Beta vulgaris ssp. vulgaris var. cicla) is a member of one of four different cultigroups of beets. While the genome of sugar beet, the most prominent beet crop, has been studied extensively, molecular data on other beet cultivars is scant. Here, we present a genome assembly of chard, a vegetable crop grown for its fleshy leaves. We report a de novo genome assembly of 604 Mbp, slightly larger than sugar beet assemblies presented so far. About 57 % of the assembly was annotated as repetitive sequence, of which LTR retrotransposons were the most abundant. Based on the presence of conserved genes, the chard assembly was estimated to be at least 96 % complete regarding its gene space. We predicted 34,521 genes of which 27,582 genes were supported by evidence from transcriptomic sequencing reads, and 5503 of the evidence-supported genes had multiple isoforms. We compared the chard gene set with gene sets from sugar beet and two wild beets (i.e. Beta vulgaris ssp. maritima and Beta patula) to find orthology relationships and identified genome-wide syntenic regions between chard and sugar beet. Lastly, we determined genomic variants that distinguish sugar beet and chard. Assessing the variation distribution along the chard chromosomes, we found extensive haplotype sharing between the two cultivars. In summary, our work provides a foundation for the molecular analysis of Beta vulgaris cultigroups as a basis for chard genomics and to unravel the domestication history of beet crops.
Assuntos
Beta vulgaris , Genoma de Planta , Beta vulgaris/genética , Produtos Agrícolas/genética , Genômica , RetroelementosRESUMO
We present draft genome assemblies of Beta patula, a critically endangered wild beet endemic to the Madeira archipelago, and of the closely related Beta vulgaris ssp. maritima (sea beet). Evidence-based reference gene sets for B. patula and sea beet were generated, consisting of 25 127 and 27 662 genes, respectively. The genomes and gene sets of the two wild beets were compared with their cultivated sister taxon B. vulgaris ssp. vulgaris (sugar beet). Large syntenic regions were identified, and a display tool for automatic genome-wide synteny image generation was developed. Phylogenetic analysis based on 9861 genes showing 1:1:1 orthology supported the close relationship of B. patula to sea beet and sugar beet. A comparative analysis of the Rz2 locus, responsible for rhizomania resistance, suggested that the sequenced B. patula accession was rhizomania susceptible. Reference karyotypes for the two wild beets were established, and genomic rearrangements were detected. We consider our data as highly valuable and comprehensive resources for wild beet studies, B. patula conservation management, and sugar beet breeding research.
Assuntos
Beta vulgaris/genética , Genoma de Planta , Doenças das Plantas/genética , Beta vulgaris/virologia , Cromossomos/genética , Produtos Agrícolas/genética , Variação Genética , Genômica , Sequenciamento de Nucleotídeos em Larga Escala , Hibridização in Situ Fluorescente , Cariótipo , Filogenia , Doenças das Plantas/virologia , Sintenia/genéticaRESUMO
We develop a method to predict and validate gene models using PacBio single-molecule, real-time (SMRT) cDNA reads. Ninety-eight percent of full-insert SMRT reads span complete open reading frames. Gene model validation using SMRT reads is developed as automated process. Optimized training and prediction settings and mRNA-seq noise reduction of assisting Illumina reads results in increased gene prediction sensitivity and precision. Additionally, we present an improved gene set for sugar beet (Beta vulgaris) and the first genome-wide gene set for spinach (Spinacia oleracea). The workflow and guidelines are a valuable resource to obtain comprehensive gene sets for newly sequenced genomes of non-model eukaryotes.
Assuntos
Perfilação da Expressão Gênica/métodos , Análise de Sequência de RNA/métodos , Beta vulgaris/genética , DNA Complementar/química , Genes de Plantas , Dados de Sequência Molecular , Spinacia oleracea/genéticaRESUMO
DNA methylation is an essential epigenetic feature for the regulation and maintenance of heterochromatin. Satellite DNA is a repetitive sequence component that often occurs in large arrays in heterochromatin of subtelomeric, intercalary and centromeric regions. Knowledge about the methylation status of satellite DNA is important for understanding the role of repetitive DNA in heterochromatization. In this study, we investigated the cytosine methylation of the ancient satellite family pEV in the wild beet Beta procumbens. The pEV satellite is widespread in species-specific pEV subfamilies in the genus Beta and most likely originated before the radiation of the Betoideae and Chenopodioideae. In B. procumbens, the pEV subfamily occurs abundantly and spans intercalary and centromeric regions. To uncover its cytosine methylation, we performed chromosome-wide immunostaining and bisulfite sequencing of pEV satellite repeats. We found that CG and CHG sites are highly methylated while CHH sites show only low levels of methylation. As a consequence of the low frequency of CG and CHG sites and the preferential occurrence of most cytosines in the CHH motif in pEV monomers, this satellite family displays only low levels of total cytosine methylation.
Assuntos
Beta vulgaris/genética , Beta vulgaris/metabolismo , Citosina/metabolismo , Metilação de DNA/genética , DNA Satélite/genética , Sequência de Bases , Centrômero/genética , Cromossomos de Plantas/genética , Ilhas de CpG/genética , Dados de Sequência MolecularRESUMO
Methylation of DNA is important for the epigenetic silencing of repetitive DNA in plant genomes. Knowledge about the cytosine methylation status of satellite DNAs, a major class of repetitive DNA, is scarce. One reason for this is that arrays of tandemly arranged sequences are usually collapsed in next-generation sequencing assemblies. We applied strategies to overcome this limitation and quantified the level of cytosine methylation and its pattern in three satellite families of sugar beet (Beta vulgaris) which differ in their abundance, chromosomal localization and monomer size. We visualized methylation levels along pachytene chromosomes with respect to small satellite loci at maximum resolution using chromosome-wide fluorescent in situ hybridization complemented with immunostaining and super-resolution microscopy. Only reduced methylation of many satellite arrays was obtained. To investigate methylation at the nucleotide level we performed bisulfite sequencing of 1569 satellite sequences. We found that the level of methylation of cytosine strongly depends on the sequence context: cytosines in the CHH motif show lower methylation (44-52%), while CG and CHG motifs are more strongly methylated. This affects the overall methylation of satellite sequences because CHH occurs frequently while CG and CHG are rare or even absent in the satellite arrays investigated. Evidently, CHH is the major target for modulation of the cytosine methylation level of adjacent monomers within individual arrays and contributes to their epigenetic function. This strongly indicates that asymmetric cytosine methylation plays a role in the epigenetic modification of satellite repeats in plant genomes.
Assuntos
Beta vulgaris/genética , Citosina/metabolismo , Metilação de DNA , DNA de Plantas/química , Cromossomos de Plantas , Epigênese Genética , Genoma de Planta , Motivos de Nucleotídeos , Sequências Repetitivas de Ácido Nucleico , Análise de Sequência de DNARESUMO
Biennial sugar beet (Beta vulgaris spp. vulgaris) is a Caryophyllidae that has adapted its growth cycle to the seasonal temperature and daylength variation of temperate regions. This is the first time a holistic study of the expression pattern of non-symbiotic hemoglobins (nsHbs) is being carried out in a member of this group and under two essential environmental conditions for flowering, namely vernalization and length of photoperiod. BvHb genes were identified by sequence homology searches against the latest draft of the sugar beet genome. Three nsHb genes (BvHb1.1, BvHb1.2 and BvHb2) and one truncated Hb gene (BvHb3) were found in the genome of sugar beet. Gene expression profiling of the nsHb genes was carried out by quantitative PCR in different organs and developmental stages, as well as during vernalization and under different photoperiods. BvHb1.1 and BvHb2 showed differential expression during vernalization as well as during long and short days. The high expression of BvHb2 indicates that it has an active role in the cell, maybe even taking over some BvHb1.2 functions, except during germination where BvHb1.2 together with BvHb1.1-both Class 1 nsHbs-are highly expressed. The unprecedented finding of a leader peptide at the N-terminus of BvHb1.1, for the first time in an nsHb from higher plants, together with its observed expression indicate that it may have a very specific role due to its suggested location in chloroplasts. Our findings open up new possibilities for research, breeding and engineering since Hbs could be more involved in plant development than previously was anticipated.
Assuntos
Beta vulgaris/genética , Perfilação da Expressão Gênica , Regulação da Expressão Gênica de Plantas , Hemoglobinas/genética , Proteínas de Plantas/genética , Simbiose/genética , Sequência de Aminoácidos , Beta vulgaris/fisiologia , Flores/genética , Flores/fisiologia , Regulação da Expressão Gênica no Desenvolvimento , Genes de Plantas , Hemoglobinas/química , Hemoglobinas/metabolismo , Dados de Sequência Molecular , Fotoperíodo , Proteínas de Plantas/química , Proteínas de Plantas/metabolismo , Transporte Proteico , Alinhamento de Sequência , Frações Subcelulares/metabolismoRESUMO
Sugar beet (Beta vulgaris ssp. vulgaris) is an important crop of temperate climates which provides nearly 30% of the world's annual sugar production and is a source for bioethanol and animal feed. The species belongs to the order of Caryophylalles, is diploid with 2n = 18 chromosomes, has an estimated genome size of 714-758 megabases and shares an ancient genome triplication with other eudicot plants. Leafy beets have been cultivated since Roman times, but sugar beet is one of the most recently domesticated crops. It arose in the late eighteenth century when lines accumulating sugar in the storage root were selected from crosses made with chard and fodder beet. Here we present a reference genome sequence for sugar beet as the first non-rosid, non-asterid eudicot genome, advancing comparative genomics and phylogenetic reconstructions. The genome sequence comprises 567 megabases, of which 85% could be assigned to chromosomes. The assembly covers a large proportion of the repetitive sequence content that was estimated to be 63%. We predicted 27,421 protein-coding genes supported by transcript data and annotated them on the basis of sequence homology. Phylogenetic analyses provided evidence for the separation of Caryophyllales before the split of asterids and rosids, and revealed lineage-specific gene family expansions and losses. We sequenced spinach (Spinacia oleracea), another Caryophyllales species, and validated features that separate this clade from rosids and asterids. Intraspecific genomic variation was analysed based on the genome sequences of sea beet (Beta vulgaris ssp. maritima; progenitor of all beet crops) and four additional sugar beet accessions. We identified seven million variant positions in the reference genome, and also large regions of low variability, indicating artificial selection. The sugar beet genome sequence enables the identification of genes affecting agronomically relevant traits, supports molecular breeding and maximizes the plant's potential in energy biotechnology.
Assuntos
Beta vulgaris/genética , Produtos Agrícolas/genética , Genoma de Planta/genética , Biocombustíveis/provisão & distribuição , Metabolismo dos Carboidratos , Cromossomos de Plantas/genética , Etanol/metabolismo , Genômica , Hibridização in Situ Fluorescente , Dados de Sequência Molecular , Filogenia , Análise de Sequência de DNA , Spinacia oleracea/genéticaRESUMO
It has been argued that the evolution of plant genome size is principally unidirectional and increasing owing to the varied action of whole-genome duplications (WGDs) and mobile element proliferation. However, extreme genome size reductions have been reported in the angiosperm family tree. Here we report the sequence of the 82-megabase genome of the carnivorous bladderwort plant Utricularia gibba. Despite its tiny size, the U. gibba genome accommodates a typical number of genes for a plant, with the main difference from other plant genomes arising from a drastic reduction in non-genic DNA. Unexpectedly, we identified at least three rounds of WGD in U. gibba since common ancestry with tomato (Solanum) and grape (Vitis). The compressed architecture of the U. gibba genome indicates that a small fraction of intergenic DNA, with few or no active retrotransposons, is sufficient to regulate and integrate all the processes required for the development and reproduction of a complex organism.
Assuntos
Evolução Molecular , Genoma de Planta/genética , Magnoliopsida/genética , DNA Intergênico/genética , Duplicação Gênica/genética , Genes de Plantas/genética , Modelos Genéticos , Solanum/genética , Sintenia/genética , Vitis/genéticaRESUMO
LTR retrotransposons and retroviruses are closely related. Although a viral envelope gene is found in some LTR retrotransposons and all retroviruses, only the latter show infectivity. The identification of Ty3-gypsy-like retrotransposons possessing putative envelope-like open reading frames blurred the taxonomical borders and led to the establishment of the Errantivirus, Metavirus and Chromovirus genera within the Metaviridae. Only a few plant Errantiviruses have been described, and their evolutionary history is not well understood. In this study, we investigated 27 retroelements of four abundant Elbe retrotransposon families belonging to the Errantiviruses in Beta vulgaris (sugar beet). Retroelements of the Elbe lineage integrated between 0.02 and 5.59 million years ago, and show family-specific variations in autonomy and degree of rearrangements: while Elbe3 members are highly fragmented, often truncated and present in a high number of solo LTRs, Elbe2 members are mainly autonomous. We observed extensive reshuffling of structural motifs across families, leading to the formation of new retrotransposon families. Elbe retrotransposons harbor a typical envelope-like gene, often encoding transmembrane domains. During the course of Elbe evolution, the additional open reading frames have been strongly modified or independently acquired. Taken together, the Elbe lineage serves as retrotransposon model reflecting the various stages in Errantivirus evolution, and allows a detailed analysis of retrotransposon family formation.
Assuntos
Beta vulgaris/genética , Evolução Molecular , Genoma de Planta , Vírus de Plantas/genética , Recombinação Genética , Retroelementos , Sequência de Aminoácidos , Beta vulgaris/virologia , Cromossomos de Plantas/genética , Biologia Computacional/métodos , Sequência Conservada , Variação Genética , Dados de Sequência Molecular , Motivos de Nucleotídeos , Fases de Leitura Aberta , Mapeamento Físico do Cromossomo , Vírus de Plantas/classificação , Alinhamento de Sequência , Especificidade da EspécieRESUMO
Genome-wide analyses of repetitive DNA suggest a significant impact particularly of transposable elements on genome size and evolution of virtually all eukaryotic organisms. In this study, we analyzed the abundance and diversity of the hAT transposon superfamily of the sugar beet (B. vulgaris) genome, using molecular, bioinformatic and cytogenetic approaches. We identified 81 transposase-coding sequences, three of which are part of structurally intact but nonfunctional hAT transposons (BvhAT), in a B. vulgaris BAC library as well as in whole genome sequencing-derived data sets. Additionally, 116 complete and 497 truncated non-autonomous BvhAT derivatives lacking the transposase gene were in silico-detected. The 116 complete derivatives were subdivided into four BvhATpin groups each characterized by a distinct terminal inverted repeat motif. Both BvhAT and BvhATpin transposons are specific for species of the genus Beta and closely related species, showing a localization on B. vulgaris chromosomes predominantely in euchromatic regions. The lack of any BvhAT transposase function together with the high degree of degeneration observed for the BvhAT and the BvhATpin genomic fraction contrasts with the abundance and activity of autonomous and non-autonomous hAT transposons revealed in other plant species. This indicates a possible genus-specific structural and functional repression of the hAT transposon superfamily during Beta diversification and evolution.
Assuntos
Beta vulgaris/genética , Elementos de DNA Transponíveis , Sequência de Aminoácidos , Sequência de Bases , Cromossomos Artificiais Bacterianos , Cromossomos de Plantas , Biblioteca Gênica , Genoma de Planta , Hibridização in Situ Fluorescente , Dados de Sequência Molecular , Filogenia , Mapeamento Físico do Cromossomo , Transposases/genéticaRESUMO
BACKGROUND: The generation and analysis of high-throughput sequencing data are becoming a major component of many studies in molecular biology and medical research. Illumina's Genome Analyzer (GA) and HiSeq instruments are currently the most widely used sequencing devices. Here, we comprehensively evaluate properties of genomic HiSeq and GAIIx data derived from two plant genomes and one virus, with read lengths of 95 to 150 bases. RESULTS: We provide quantifications and evidence for GC bias, error rates, error sequence context, effects of quality filtering, and the reliability of quality values. By combining different filtering criteria we reduced error rates 7-fold at the expense of discarding 12.5% of alignable bases. While overall error rates are low in HiSeq data we observed regions of accumulated wrong base calls. Only 3% of all error positions accounted for 24.7% of all substitution errors. Analyzing the forward and reverse strands separately revealed error rates of up to 18.7%. Insertions and deletions occurred at very low rates on average but increased to up to 2% in homopolymers. A positive correlation between read coverage and GC content was found depending on the GC content range. CONCLUSIONS: The errors and biases we report have implications for the use and the interpretation of Illumina sequencing data. GAIIx and HiSeq data sets show slightly different error profiles. Quality filtering is essential to minimize downstream analysis artifacts. Supporting previous recommendations, the strand-specificity provides a criterion to distinguish sequencing errors from low abundance polymorphisms.
Assuntos
Arabidopsis/genética , Bacteriófago phi X 174/genética , Beta vulgaris/genética , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Análise de Sequência de DNA/métodos , Artefatos , Automação Laboratorial , Composição de Bases/genética , Sequência de Bases , Genômica , Sequenciamento de Nucleotídeos em Larga Escala/instrumentação , Sequenciamento de Nucleotídeos em Larga Escala/estatística & dados numéricos , Dados de Sequência Molecular , Mutagênese Insercional , Polimorfismo Genético , Reprodutibilidade dos Testes , Sensibilidade e Especificidade , Análise de Sequência de DNA/instrumentação , Análise de Sequência de DNA/estatística & dados numéricos , Deleção de SequênciaRESUMO
Sugar beet (Beta vulgaris) chromosomes consist of large heterochromatic blocks in pericentromeric, centromeric, and intercalary regions comprised of two different highly abundant DNA satellite families. To investigate DNA methylation at single base resolution at heterochromatic regions, we applied a method for strand-specific bisulfite sequencing of more than 1,000 satellite monomers followed by statistical analyses. As a result, we uncovered diversity in the distribution of different methylation patterns in both satellite families. Heavily methylated CG and CHG (H=A, T, or C) sites occur more frequently in intercalary heterochromatin, while CHH sites, with the exception of CAA, are only sparsely methylated, in both intercalary and pericentromeric/centromeric heterochromatin. We show that the difference in DNA methylation intensity is correlated to unequal distribution of heterochromatic histone H3 methylation marks. While clusters of H3K9me2 were absent from pericentromeric heterochromatin and restricted only to intercalary heterochromatic regions, H3K9me1 and H3K27me1 were observed in all types of heterochromatin. By sequencing of a small RNA library consisting of 6.76 million small RNAs, we identified small interfering RNAs (siRNAs) of 24 nucleotides in size which originated from both strands of the satellite DNAs. We hypothesize an involvement of these siRNAs in the regulation of DNA and histone methylation for maintaining heterochromatin.