Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 20
Filtrar
1.
Cell ; 173(4): 839-850.e18, 2018 05 03.
Artigo em Inglês | MEDLINE | ID: mdl-29628142

RESUMO

Maize abnormal chromosome 10 (Ab10) encodes a classic example of true meiotic drive that converts heterochromatic regions called knobs into motile neocentromeres that are preferentially transmitted to egg cells. Here, we identify a cluster of eight genes on Ab10, called the Kinesin driver (Kindr) complex, that are required for both neocentromere motility and preferential transmission. Two meiotic drive mutants that lack neocentromere activity proved to be kindr epimutants with increased DNA methylation across the entire gene cluster. RNAi of Kindr induced a third epimutant and corresponding loss of meiotic drive. Kinesin gliding assays and immunolocalization revealed that KINDR is a functional minus-end-directed kinesin that localizes specifically to knobs containing 180 bp repeats. Sequence comparisons suggest that Kindr diverged from a Kinesin-14A ancestor ∼12 mya and has driven the accumulation of > 500 Mb of knob repeats and affected the segregation of thousands of genes linked to knobs on all 10 chromosomes.


Assuntos
Centrômero/metabolismo , Cinesinas/metabolismo , Meiose , Proteínas de Plantas/metabolismo , Zea mays/metabolismo , Centrômero/genética , Cromossomos de Plantas , Evolução Molecular , Haplótipos , Hibridização in Situ Fluorescente , Cinesinas/antagonistas & inibidores , Cinesinas/classificação , Cinesinas/genética , Modelos Genéticos , Mutagênese , Filogenia , Proteínas de Plantas/antagonistas & inibidores , Proteínas de Plantas/classificação , Proteínas de Plantas/genética , Interferência de RNA , RNA Interferente Pequeno/metabolismo , Sequenciamento Completo do Genoma , Zea mays/genética
2.
PLoS Genet ; 19(12): e1011086, 2023 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-38134220

RESUMO

Structural differences between genomes are a major source of genetic variation that contributes to phenotypic differences. Transposable elements, mobile genetic sequences capable of increasing their copy number and propagating themselves within genomes, can generate structural variation. However, their repetitive nature makes it difficult to characterize fine-scale differences in their presence at specific positions, limiting our understanding of their impact on genome variation. Domesticated maize is a particularly good system for exploring the impact of transposable element proliferation as over 70% of the genome is annotated as transposable elements. High-quality transposable element annotations were recently generated for de novo genome assemblies of 26 diverse inbred maize lines. We generated base-pair resolved pairwise alignments between the B73 maize reference genome and the remaining 25 inbred maize line assemblies. From this data, we classified transposable elements as either shared or polymorphic in a given pairwise comparison. Our analysis uncovered substantial structural variation between lines, representing both simple and complex connections between TEs and structural variants. Putative insertions in SNP depleted regions, which represent recently diverged identity by state blocks, suggest some TE families may still be active. However, our analysis reveals that within these recently diverged genomic regions, deletions of transposable elements likely account for more structural variation events and base pairs than insertions. These deletions are often large structural variants containing multiple transposable elements. Combined, our results highlight how transposable elements contribute to structural variation and demonstrate that deletion events are a major contributor to genomic differences.


Assuntos
Elementos de DNA Transponíveis , Zea mays , Humanos , Elementos de DNA Transponíveis/genética , Zea mays/genética , Genômica
3.
Proc Natl Acad Sci U S A ; 119(1)2022 01 04.
Artigo em Inglês | MEDLINE | ID: mdl-34934012

RESUMO

Millions of species are currently being sequenced, and their genomes are being compared. Many of them have more complex genomes than model systems and raise novel challenges for genome alignment. Widely used local alignment strategies often produce limited or incongruous results when applied to genomes with dispersed repeats, long indels, and highly diverse sequences. Moreover, alignment using many-to-many or reciprocal best hit approaches conflicts with well-studied patterns between species with different rounds of whole-genome duplication. Here, we introduce Anchored Wavefront alignment (AnchorWave), which performs whole-genome duplication-informed collinear anchor identification between genomes and performs base pair-resolved global alignment for collinear blocks using a two-piece affine gap cost strategy. This strategy enables AnchorWave to precisely identify multikilobase indels generated by transposable element (TE) presence/absence variants (PAVs). When aligning two maize genomes, AnchorWave successfully recalled 87% of previously reported TE PAVs. By contrast, other genome alignment tools showed low power for TE PAV recall. AnchorWave precisely aligns up to three times more of the genome as position matches or indels than the closest competitive approach when comparing diverse genomes. Moreover, AnchorWave recalls transcription factor-binding sites at a rate of 1.05- to 74.85-fold higher than other tools with significantly lower false-positive alignments. AnchorWave complements available genome alignment tools by showing obvious improvement when applied to genomes with dispersed repeats, active TEs, high sequence diversity, and whole-genome duplication variation.


Assuntos
Genoma de Planta , Polimorfismo Genético , Alinhamento de Sequência , Software , Zea mays/genética
4.
PLoS Genet ; 17(10): e1009768, 2021 10.
Artigo em Inglês | MEDLINE | ID: mdl-34648488

RESUMO

Transposable elements (TEs) constitute the majority of flowering plant DNA, reflecting their tremendous success in subverting, avoiding, and surviving the defenses of their host genomes to ensure their selfish replication. More than 85% of the sequence of the maize genome can be ascribed to past transposition, providing a major contribution to the structure of the genome. Evidence from individual loci has informed our understanding of how transposition has shaped the genome, and a number of individual TE insertions have been causally linked to dramatic phenotypic changes. Genome-wide analyses in maize and other taxa have frequently represented TEs as a relatively homogeneous class of fragmentary relics of past transposition, obscuring their evolutionary history and interaction with their host genome. Using an updated annotation of structurally intact TEs in the maize reference genome, we investigate the family-level dynamics of TEs in maize. Integrating a variety of data, from descriptors of individual TEs like coding capacity, expression, and methylation, as well as similar features of the sequence they inserted into, we model the relationship between attributes of the genomic environment and the survival of TE copies and families. In contrast to the wholesale relegation of all TEs to a single category of junk DNA, these differences reveal a diversity of survival strategies of TE families. Together these generate a rich ecology of the genome, with each TE family representing the evolution of a distinct ecological niche. We conclude that while the impact of transposition is highly family- and context-dependent, a family-level understanding of the ecology of TEs in the genome can refine our ability to predict the role of TEs in generating genetic and phenotypic diversity.


Assuntos
Elementos de DNA Transponíveis/genética , Genoma de Planta/genética , Zea mays/genética , Ecossistema , Evolução Molecular , Estudo de Associação Genômica Ampla/métodos , Genômica/métodos , Anotação de Sequência Molecular/métodos , Análise de Sequência de DNA/métodos
5.
Nature ; 546(7659): 524-527, 2017 06 22.
Artigo em Inglês | MEDLINE | ID: mdl-28605751

RESUMO

Complete and accurate reference genomes and annotations provide fundamental tools for characterization of genetic and functional variation. These resources facilitate the determination of biological processes and support translation of research findings into improved and sustainable agricultural technologies. Many reference genomes for crop plants have been generated over the past decade, but these genomes are often fragmented and missing complex repeat regions. Here we report the assembly and annotation of a reference genome of maize, a genetic and agricultural model species, using single-molecule real-time sequencing and high-resolution optical mapping. Relative to the previous reference genome, our assembly features a 52-fold increase in contig length and notable improvements in the assembly of intergenic spaces and centromeres. Characterization of the repetitive portion of the genome revealed more than 130,000 intact transposable elements, allowing us to identify transposable element lineage expansions that are unique to maize. Gene annotations were updated using 111,000 full-length transcripts obtained by single-molecule real-time sequencing. In addition, comparative optical mapping of two other inbred maize lines revealed a prevalence of deletions in regions of low gene density and maize lineage-specific genes.


Assuntos
Genoma de Planta/genética , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Imagem Individual de Molécula/métodos , Zea mays/genética , Centrômero/genética , Cromossomos de Plantas/genética , Mapeamento de Sequências Contíguas , Produtos Agrícolas/genética , Elementos de DNA Transponíveis/genética , DNA Intergênico/genética , Genes de Plantas/genética , Anotação de Sequência Molecular , Óptica e Fotônica , Filogenia , RNA Mensageiro/análise , RNA Mensageiro/genética , Padrões de Referência , Sorghum/genética
6.
Proc Natl Acad Sci U S A ; 117(5): 2526-2534, 2020 02 04.
Artigo em Inglês | MEDLINE | ID: mdl-31964817

RESUMO

The seasonal timing of seed germination determines a plant's realized environmental niche, and is important for adaptation to climate. The timing of seasonal germination depends on patterns of seed dormancy release or induction by cold and interacts with flowering-time variation to construct different seasonal life histories. To characterize the genetic basis and climatic associations of natural variation in seed chilling responses and associated life-history syndromes, we selected 559 fully sequenced accessions of the model annual species Arabidopsis thaliana from across a wide climate range and scored each for seed germination across a range of 13 cold stratification treatments, as well as the timing of flowering and senescence. Germination strategies varied continuously along 2 major axes: 1) Overall germination fraction and 2) induction vs. release of dormancy by cold. Natural variation in seed responses to chilling was correlated with flowering time and senescence to create a range of seasonal life-history syndromes. Genome-wide association identified several loci associated with natural variation in seed chilling responses, including a known functional polymorphism in the self-binding domain of the candidate gene DOG1. A phylogeny of DOG1 haplotypes revealed ancient divergence of these functional variants associated with periods of Pleistocene climate change, and Gradient Forest analysis showed that allele turnover of candidate SNPs was significantly associated with climate gradients. These results provide evidence that A. thaliana's germination niche and correlated life-history syndromes are shaped by past climate cycles, as well as local adaptation to contemporary climate.


Assuntos
Proteínas de Arabidopsis/metabolismo , Arabidopsis/metabolismo , Sementes/química , Alelos , Arabidopsis/genética , Arabidopsis/crescimento & desenvolvimento , Proteínas de Arabidopsis/genética , Temperatura Baixa , Regulação da Expressão Gênica de Plantas , Germinação , Características de História de Vida , Polimorfismo Genético , Estações do Ano , Sementes/genética , Sementes/crescimento & desenvolvimento , Sementes/metabolismo
7.
New Phytol ; 234(2): 719-734, 2022 04.
Artigo em Inglês | MEDLINE | ID: mdl-35090191

RESUMO

The relevance of flowering time variation and plasticity to climate adaptation requires a comprehensive empirical assessment. We investigated natural selection and the genetic architecture of flowering time in Arabidopsis through field experiments in Europe across multiple sites and seasons. We estimated selection for flowering time, plasticity and canalization. Loci associated with flowering time, plasticity and canalization by genome-wide association studies were tested for a geographic signature of climate adaptation. Selection favored early flowering and increased canalization, except at the northernmost site, but was rarely detected for plasticity. Genome-wide association studies revealed significant associations with flowering traits and supported a substantial polygenic inheritance. Alleles associated with late flowering, including functional FRIGIDA variants, were more common in regions experiencing high annual temperature variation. Flowering time plasticity to fall vs spring and summer environments was associated with GIGANTEA SUPPRESSOR 5, which promotes early flowering under decreasing day length and temperature. The finding that late flowering genotypes and alleles are associated with climate is evidence for past adaptation. Real-time phenotypic selection analysis, however, reveals pervasive contemporary selection for rapid flowering in agricultural settings across most of the species range. The response to this selection may involve genetic shifts in environmental cuing compared to the ancestral state.


Assuntos
Proteínas de Arabidopsis , Arabidopsis , Arabidopsis/genética , Proteínas de Arabidopsis/genética , Flores/genética , Variação Genética , Estudo de Associação Genômica Ampla , Fenótipo , Estações do Ano
8.
PLoS Genet ; 15(9): e1008291, 2019 09.
Artigo em Inglês | MEDLINE | ID: mdl-31498837

RESUMO

DNA methylation and epigenetic silencing play important roles in the regulation of transposable elements (TEs) in many eukaryotic genomes. A majority of the maize genome is derived from TEs that can be classified into different orders and families based on their mechanism of transposition and sequence similarity, respectively. TEs themselves are highly methylated and it can be tempting to view them as a single uniform group. However, the analysis of DNA methylation profiles in flanking regions provides evidence for distinct groups of chromatin properties at different TE families. These differences among TE families are reproducible in different tissues and different inbred lines. TE families with varying levels of DNA methylation in flanking regions also show distinct patterns of chromatin accessibility and modifications within the TEs. The differences in the patterns of DNA methylation flanking TE families arise from a combination of non-random insertion preferences of TE families, changes in DNA methylation triggered by the insertion of the TE and subsequent selection pressure. A set of nearly 70,000 TE polymorphisms among four assembled maize genomes were used to monitor the level of DNA methylation at haplotypes with and without the TE insertions. In many cases, TE families with high levels of DNA methylation in flanking sequence are enriched for insertions into highly methylated regions. The majority of the >2,500 TE insertions into unmethylated regions result in changes in DNA methylation in haplotypes with the TE, suggesting the widespread potential for TE insertions to condition altered methylation in conserved regions of the genome. This study highlights the interplay between TEs and the methylome of a major crop species.


Assuntos
Metilação de DNA/genética , Elementos de DNA Transponíveis/genética , Zea mays/genética , Epigênese Genética/genética , Epigenômica/métodos , Evolução Molecular , Regulação da Expressão Gênica de Plantas/genética , Genoma de Planta/genética , Genótipo , Haplótipos/genética , Polimorfismo Genético/genética , Análise de Sequência de DNA/métodos
9.
Plant J ; 100(5): 1052-1065, 2019 12.
Artigo em Inglês | MEDLINE | ID: mdl-31381222

RESUMO

Transposable elements (TEs) are ubiquitous components of eukaryotic genomes and can create variation in genome organization and content. Most maize genomes are composed of TEs. We developed an approach to define shared and variable TE insertions across genome assemblies and applied this method to four maize genomes (B73, W22, Mo17 and PH207) with uniform structural annotations of TEs. Among these genomes we identified approximately 400 000 TEs that are polymorphic, encompassing 1.6 Gb of variable TE sequence. These polymorphic TEs include a combination of recent transposition events as well as deletions of older TEs. There are examples of polymorphic TEs within each of the superfamilies of TEs and they are found distributed across the genome, including in regions of recent shared ancestry among individuals. There are many examples of polymorphic TEs within or near maize genes. In addition, there are 2380 gene annotations in the B73 genome that are located within variable TEs, providing evidence for the role of TEs in contributing to the substantial differences in annotated gene content among these genotypes. TEs are highly variable in our survey of four temperate maize genomes, highlighting the major contribution of TEs in driving variation in genome organization and gene content. OPEN RESEARCH BADGES: This article has earned an Open Data Badge for making publicly available the digitally-shareable data necessary to reproduce the reported results. The data is available at https://github.com/SNAnderson/maizeTE_variation; https://mcstitzer.github.io/maize_TEs.


Assuntos
Elementos de DNA Transponíveis , Genoma de Planta , Zea mays/genética , Evolução Molecular , Regulação da Expressão Gênica de Plantas , Variação Genética , Genômica , Genótipo , Anotação de Sequência Molecular , Análise de Sequência de DNA/métodos
10.
New Phytol ; 220(2): 395-408, 2018 10.
Artigo em Inglês | MEDLINE | ID: mdl-30035321

RESUMO

Contents Summary 395 I. Introduction 395 II. The genetic basis of maize domestication 396 III. The tempo of maize domestication 401 IV. Genetic interactions and selection during maize domestication 401 V. Gene networks of maize domestication alleles 404 VI. Implications of gene interactions on evolution and selection404 VII. Conclusions 405 Acknowledgements 405 References 405 SUMMARY: Domestication is a tractable system for following evolutionary change. Under domestication, wild populations respond to shifting selective pressures, resulting in adaptation to the new ecological niche of cultivation. Owing to the important role of domesticated crops in human nutrition and agriculture, the ancestry and selection pressures transforming a wild plant into a domesticate have been extensively studied. In Zea mays, morphological, genetic and genomic studies have elucidated how a wild plant, the teosinte Z. mays subsp. parviglumis, was transformed into the domesticate Z. mays subsp. mays. Five major morphological differences distinguish these two subspecies, and careful genetic dissection has pinpointed the molecular changes responsible for several of these traits. But maize domestication was a consequence of more than just five genes, and regions throughout the genome contribute. The impacts of these additional regions are contingent on genetic background, both the interactions between alleles of a single gene and among alleles of the multiple genes that modulate phenotypes. Key genetic interactions include dominance relationships, epistatic interactions and pleiotropic constraint, including how these variants are connected in gene networks. Here, we review the role of gene interactions in generating the dramatic phenotypic evolution seen in the transition from teosinte to maize.


Assuntos
Domesticação , Epistasia Genética , Genes de Plantas , Zea mays/genética , Redes Reguladoras de Genes , Seleção Genética
11.
Trends Plant Sci ; 29(3): 355-369, 2024 03.
Artigo em Inglês | MEDLINE | ID: mdl-37749022

RESUMO

Genome alignment is one of the most foundational methods for genome sequence studies. With rapid advances in sequencing and assembly technologies, these newly assembled genomes present challenges for alignment tools to meet the increased complexity and scale. Plant genome alignment is technologically challenging because of frequent whole-genome duplications (WGDs) as well as chromosome rearrangements and fractionation, high nucleotide diversity, widespread structural variation, and high transposable element (TE) activity causing large proportions of repeat elements. We summarize classical pairwise and multiple genome alignment (MGA) methods, and highlight techniques that are widely used or are being developed by the plant research community. We also outline the remaining challenges for precise genome alignment and the interpretation of alignment results in plants.


Assuntos
Genoma de Planta , Plantas , Plantas/genética , Genoma de Planta/genética , Elementos de DNA Transponíveis/genética
12.
bioRxiv ; 2024 Aug 22.
Artigo em Inglês | MEDLINE | ID: mdl-38895432

RESUMO

Interpreting function and fitness effects in diverse plant genomes requires transferable models. Language models (LMs) pre-trained on large-scale biological sequences can learn evolutionary conservation and offer cross-species prediction better than supervised models through fine-tuning limited labeled data. We introduce PlantCaduceus, a plant DNA LM based on the Caduceus and Mamba architectures, pre-trained on a curated dataset of 16 Angiosperm genomes. Fine-tuning PlantCaduceus on limited labeled Arabidopsis data for four tasks, including predicting translation initiation/termination sites and splice donor and acceptor sites, demonstrated high transferability to 160 million year diverged maize, outperforming the best existing DNA LM by 1.45 to 7.23-fold. PlantCaduceus is competitive to state-of-the-art protein LMs in terms of deleterious mutation identification, and is threefold better than PhyloP. Additionally, PlantCaduceus successfully identifies well-known causal variants in both Arabidopsis and maize. Overall, PlantCaduceus is a versatile DNA LM that can accelerate plant genomics and crop breeding applications.

14.
Science ; 382(6674): eadg8940, 2023 12.
Artigo em Inglês | MEDLINE | ID: mdl-38033071

RESUMO

The origins of maize were the topic of vigorous debate for nearly a century, but neither the current genetic model nor earlier archaeological models account for the totality of available data, and recent work has highlighted the potential contribution of a wild relative, Zea mays ssp. mexicana. Our population genetic analysis reveals that the origin of modern maize can be traced to an admixture between ancient maize and Zea mays ssp. mexicana in the highlands of Mexico some 4000 years after domestication began. We show that variation in admixture is a key component of maize diversity, both at individual loci and for additive genetic variation underlying agronomic traits. Our results clarify the origin of modern maize and raise new questions about the anthropogenic mechanisms underlying dispersal throughout the Americas.


Assuntos
Produtos Agrícolas , Domesticação , Hibridização Genética , Zea mays , México , Fenótipo , Zea mays/genética , Variação Genética , Produtos Agrícolas/genética
15.
G3 (Bethesda) ; 9(11): 3673-3682, 2019 11 05.
Artigo em Inglês | MEDLINE | ID: mdl-31506319

RESUMO

Transposable Elements (TEs) are mobile elements that contribute the majority of DNA sequences in the maize genome. Due to their repetitive nature, genomic studies of TEs are complicated by the difficulty of properly attributing multi-mapped short reads to specific genomic loci. Here, we utilize a method to attribute RNA-seq reads to TE families rather than particular loci in order to characterize transcript abundance for TE families in the maize genome. We applied this method to assess per-family expression of transposable elements in >800 published RNA-seq libraries representing a range of maize development, genotypes, and hybrids. While a relatively small proportion of TE families are transcribed, expression is highly dynamic with most families exhibiting tissue-specific expression. A large number of TE families were specifically detected in pollen and endosperm, consistent with reproductive dynamics that maintain silencing of TEs in the germ line. We find that B73 transcript abundance is a poor predictor of TE expression in other genotypes and that transcript levels can differ even for shared TEs. Finally, by assessing recombinant inbred line and hybrid transcriptomes, complex patterns of TE transcript abundance across genotypes emerged. Taken together, this study reveals a dynamic contribution of TEs to maize transcriptomes.


Assuntos
Elementos de DNA Transponíveis , Zea mays/genética , Endosperma/genética , Pólen/genética , RNA-Seq , Transcriptoma
16.
Nat Plants ; 5(9): 980-990, 2019 09.
Artigo em Inglês | MEDLINE | ID: mdl-31477888

RESUMO

Self-fertilization (also known as selfing) is an important reproductive strategy in plants and a widely applied tool for plant genetics and plant breeding. Selfing can lead to inbreeding depression by uncovering recessive deleterious variants, unless these variants are purged by selection. Here we investigated the dynamics of purging in a set of eleven maize lines that were selfed for six generations. We show that heterozygous, putatively deleterious single nucleotide polymorphisms are preferentially lost from the genome during selfing. Deleterious single nucleotide polymorphisms were lost more rapidly in regions of high recombination, presumably because recombination increases the efficacy of selection by uncoupling linked variants. Overall, heterozygosity decreased more slowly than expected, by an estimated 35% to 40% per generation instead of the expected 50%, perhaps reflecting pervasive associative overdominance. Finally, three lines exhibited marked decreases in genome size due to the purging of transposable elements. Genome loss was more likely to occur for lineages that began with larger genomes with more transposable elements and chromosomal knobs. These three lines purged an average of 398 Mb from their genomes, an amount equivalent to three Arabidopsis thaliana genomes per lineage, in only a few generations.


Assuntos
Genoma de Planta , Perda de Heterozigosidade , Polimorfismo de Nucleotídeo Único , Autofertilização , Zea mays/fisiologia , Zea mays/genética
17.
Genetics ; 213(1): 143-160, 2019 09.
Artigo em Inglês | MEDLINE | ID: mdl-31320409

RESUMO

In the course of generating populations of maize with teosinte chromosomal introgressions, an unusual sickly plant phenotype was noted in individuals from crosses with two teosinte accessions collected near Valle de Bravo, Mexico. The plants of these Bravo teosinte accessions appear phenotypically normal themselves and the F1 plants appear similar to typical maize × teosinte F1s. However, upon backcrossing to maize, the BC1 and subsequent generations display a number of detrimental characteristics including shorter stature, reduced seed set, and abnormal floral structures. This phenomenon is observed in all BC individuals and there is no chromosomal segment linked to the sickly plant phenotype in advanced backcross generations. Once the sickly phenotype appears in a lineage, normal plants are never again recovered by continued backcrossing to the normal maize parent. Whole-genome shotgun sequencing reveals a small number of genomic sequences, some with homology to transposable elements, that have increased in copy number in the backcross populations. Transcriptome analysis of seedlings, which do not have striking phenotypic abnormalities, identified segments of 18 maize genes that exhibit increased expression in sickly plants. A de novo assembly of transcripts present in plants exhibiting the sickly phenotype identified a set of 59 upregulated novel transcripts. These transcripts include some examples with sequence similarity to transposable elements and other sequences present in the recurrent maize parent (W22) genome as well as novel sequences not present in the W22 genome. Genome-wide profiles of gene expression, DNA methylation, and small RNAs are similar between sickly plants and normal controls, although a few upregulated transcripts and transposable elements are associated with altered small RNA or methylation profiles. This study documents hybrid incompatibility and genome instability triggered by the backcrossing of Bravo teosinte with maize. We name this phenomenon "hybrid decay" and present ideas on the mechanism that may underlie it.


Assuntos
Epigênese Genética , Vigor Híbrido , Hibridização Genética , Endogamia , Zea mays/genética , Elementos de DNA Transponíveis , Instabilidade Genômica , Polimorfismo Genético , Transcriptoma
18.
PLoS One ; 14(10): e0224086, 2019.
Artigo em Inglês | MEDLINE | ID: mdl-31658277

RESUMO

The sophistication of gene prediction algorithms and the abundance of RNA-based evidence for the maize genome may suggest that manual curation of gene models is no longer necessary. However, quality metrics generated by the MAKER-P gene annotation pipeline identified 17,225 of 130,330 (13%) protein-coding transcripts in the B73 Reference Genome V4 gene set with models of low concordance to available biological evidence. Working with eight graduate students, we used the Apollo annotation editor to curate 86 transcript models flagged by quality metrics and a complimentary method using the Gramene gene tree visualizer. All of the triaged models had significant errors-including missing or extra exons, non-canonical splice sites, and incorrect UTRs. A correct transcript model existed for about 60% of genes (or transcripts) flagged by quality metrics; we attribute this to the convention of elevating the transcript with the longest coding sequence (CDS) to the canonical, or first, position. The remaining 40% of flagged genes resulted in novel annotations and represent a manual curation space of about 10% of the maize genome (~4,000 protein-coding genes). MAKER-P metrics have a specificity of 100%, and a sensitivity of 85%; the gene tree visualizer has a specificity of 100%. Together with the Apollo graphical editor, our double triage provides an infrastructure to support the community curation of eukaryotic genomes by scientists, students, and potentially even citizen scientists.


Assuntos
Curadoria de Dados/métodos , Proteínas de Plantas/genética , Zea mays/genética , Algoritmos , Bases de Dados Genéticas , Educação de Pós-Graduação , Humanos , Modelos Genéticos , Anotação de Sequência Molecular , Estudantes
19.
Evolution ; 72(11): 2565-2566, 2018 11.
Artigo em Inglês | MEDLINE | ID: mdl-30284242

RESUMO

How are alleles that are detrimental to fitness maintained in natural populations? Zuellig and Sweigart (2018a) find that alleles from a two-locus hybrid incompatibility system segregate at considerable frequencies in two species of monkeyflowers, suggesting that despite providing a fitness cost, these alleles remain polymorphic as a consequence of gene flow between the two species. The system provides the potential to understand the evolutionary trajectory of hybrid incompatibilities and their role in speciation.


Assuntos
Mimulus/genética , Alelos , Evolução Biológica , Fluxo Gênico , Hibridização Genética
20.
Nat Genet ; 50(9): 1282-1288, 2018 09.
Artigo em Inglês | MEDLINE | ID: mdl-30061736

RESUMO

The maize W22 inbred has served as a platform for maize genetics since the mid twentieth century. To streamline maize genome analyses, we have sequenced and de novo assembled a W22 reference genome using short-read sequencing technologies. We show that significant structural heterogeneity exists in comparison to the B73 reference genome at multiple scales, from transposon composition and copy number variation to single-nucleotide polymorphisms. The generation of this reference genome enables accurate placement of thousands of Mutator (Mu) and Dissociation (Ds) transposable element insertions for reverse and forward genetics studies. Annotation of the genome has been achieved using RNA-seq analysis, differential nuclease sensitivity profiling and bisulfite sequencing to map open reading frames, open chromatin sites and DNA methylation profiles, respectively. Collectively, the resources developed here integrate W22 as a community reference genome for functional genomics and provide a foundation for the maize pan-genome.


Assuntos
Elementos de DNA Transponíveis/genética , Genes de Plantas/genética , Genoma de Planta/genética , Zea mays/genética , Cromatina/genética , Cromossomos de Plantas/genética , Variações do Número de Cópias de DNA/genética , Metilação de DNA/genética , DNA de Plantas/genética , Genômica/métodos , Fases de Leitura Aberta/genética , Análise de Sequência de DNA/métodos
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA