Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 24
Filter
Add more filters










Publication year range
1.
Bioessays ; : e2300206, 2024 May 20.
Article in English | MEDLINE | ID: mdl-38769697

ABSTRACT

Gene discovery reveals new biology, expands the utility of marker-assisted selection, and enables targeted mutagenesis. Still, such discoveries can take over a decade. We present a general strategy, "Agile Genetics," that uses nested, structured populations to overcome common limits on gene resolution. Extensive simulation work on realistic genetic architectures shows that, at population sizes of >5000 samples, single gene-resolution can be achieved using bulk segregant pools. At this scale, read depth and technical replication become major drivers of resolution. Emerging enrichment methods to address coverage are on the horizon; we describe one possibility - iterative depth sequencing (ID-seq). In addition, graph-based pangenomics in experimental populations will continue to maximize accuracy and improve interpretation. Based on this merger of agronomic scale with molecular and bioinformatic innovation, we predict a new age of rapid gene discovery.

2.
Front Plant Sci ; 14: 1308731, 2023.
Article in English | MEDLINE | ID: mdl-38173927

ABSTRACT

Soybean meal is a major component of livestock feed due to its high content and quality of protein. Understanding the genetic control of protein is essential to develop new cultivars with improved meal protein. Previously, a genomic region on chromosome 20 significantly associated with elevated protein content was identified in the cultivar Danbaekkong. The present research aimed to introgress the Danbaekkong high-protein allele into elite lines with different genetic backgrounds by developing and deploying robust DNA markers. A multiparent population consisting of 10 F5-derived populations with a total of 1,115 recombinant inbred lines (RILs) was developed using "Benning HP" as the donor parent of the Danbaekkong high-protein allele. A new functional marker targeting the 321-bp insertion in the gene Glyma.20g085100 was developed and used to track the Danbaekkong high-protein allele across the different populations and enable assessment of its effect and stability. Across all populations, the high-protein allele consistently increased the content, with an increase of 3.3% in seed protein. A total of 103 RILs were selected from the multiparent population for yield testing in five environments to assess the impact of the high-protein allele on yield and to enable the selection of new breeding lines with high protein and high yield. The results indicated that the high-protein allele impacts yield negatively in general; however, it is possible to select high-yielding lines with high protein content. An analysis of inheritance of the Chr 20 high-protein allele in Danbaekkong indicated that it originated from a Glycine soja line (PI 163453) and is the same as other G. soja lines studied. A survey of the distribution of the allele across 79 G. soja accessions and 35 Glycine max ancestors of North American soybean cultivars showed that the high-protein allele is present in all G. soja lines evaluated but not in any of the 35 North American soybean ancestors. These results demonstrate that G. soja accessions are a valuable source of favorable alleles for improvement of protein composition.

3.
Nat Commun ; 13(1): 7897, 2022 12 22.
Article in English | MEDLINE | ID: mdl-36550124

ABSTRACT

The genomic sequences segregating in experimental populations are often highly divergent from the community reference and from one another. Such divergence is problematic under various short-read-based genotyping strategies. In addition, large structural differences are often invisible despite being strong candidates for causal variation. These issues are exacerbated in specialty crop breeding programs with fewer, lower-quality sequence resources. Here, we examine the benefits of complete genomic information, based on long-read assemblies, in a biparental mapping experiment segregating at numerous disease resistance loci in the non-model crop, melon (Cucumis melo). We find that a graph-based approach, which uses both parental genomes, results in 19% more variants callable across the population and raw allele calls with a 2 to 3-fold error-rate reduction, even relative to single reference approaches using a parent genome. We show that structural variation has played a substantial role in shaping two Fusarium wilt resistance loci with known causal genes. We also report on the genetics of powdery mildew resistance, where copy number variation and local recombination suppression are directly interpretable via parental genome alignments. Benefits observed, even in this low-resolution biparental experiment, will inevitably be amplified in more complex populations.


Subject(s)
Cucumis melo , Cucurbitaceae , Genotype , Cucurbitaceae/genetics , DNA Copy Number Variations , Plant Breeding , Quantitative Trait Loci/genetics , Cucumis melo/genetics , Cucumis melo/microbiology
5.
Microorganisms ; 9(10)2021 Oct 01.
Article in English | MEDLINE | ID: mdl-34683396

ABSTRACT

Adenine and thymine homopolymer strings of at least 8 nucleotides (AT 8+mers) were characterized in Salmonella enterica subspecies I. The motif differed between other taxonomic classes but not between Salmonella enterica serovars. The motif in plasmids was possibly associated with serovar. Approximately 12.3% of the S. enterica motif loci had mutations. Mutability of AT 8+mers suggests that genomes undergo frequent repair to maintain optimal gene content, and that the motif facilitates self-recognition; in addition, serovar diversity is associated with plasmid content. A theory that genome regeneration accounts for both persistence of predominant Salmonella serovars and serovar diversity provides a new framework for investigating root causes of foodborne illness.

6.
PLoS Genet ; 17(3): e1009389, 2021 03.
Article in English | MEDLINE | ID: mdl-33735256

ABSTRACT

The genetic basis of general plant vigor is of major interest to food producers, yet the trait is recalcitrant to genetic mapping because of the number of loci involved, their small effects, and linkage. Observations of heterosis in many crops suggests that recessive, malfunctioning versions of genes are a major cause of poor performance, yet we have little information on the mutational spectrum underlying these disruptions. To address this question, we generated a long-read assembly of a tropical japonica rice (Oryza sativa) variety, Carolina Gold, which allowed us to identify structural mutations (>50 bp) and orient them with respect to their ancestral state using the outgroup, Oryza glaberrima. Supporting prior work, we find substantial genome expansion in the sativa branch. While transposable elements (TEs) account for the largest share of size variation, the majority of events are not directly TE-mediated. Tandem duplications are the most common source of insertions and are highly enriched among 50-200bp mutations. To explore the relative impact of various mutational classes on crop fitness, we then track these structural events over the last century of US rice improvement using 101 resequenced varieties. Within this material, a pattern of temporary hybridization between medium and long-grain varieties was followed by recent divergence. During this long-term selection, structural mutations that impact gene exons have been removed at a greater rate than intronic indels and single-nucleotide mutations. These results support the use of ab initio estimates of mutational burden, based on structural data, as an orthogonal predictor in genomic selection.


Subject(s)
Genes, Plant , Mutation , Oryza/genetics , Plant Breeding , Selection, Genetic , Crops, Agricultural/genetics , DNA Repair , DNA Transposable Elements , Environment , Gene-Environment Interaction , Genome, Plant , Hybridization, Genetic , INDEL Mutation , Seeds/genetics
7.
New Phytol ; 230(2): 683-697, 2021 04.
Article in English | MEDLINE | ID: mdl-33460457

ABSTRACT

Sorgoleone, a hydrophobic compound exuded from root hair cells of Sorghum spp., accounts for much of the allelopathic activity of the genus. The enzymes involved in the biosynthesis of this compound have been identified and functionally characterized. Here, we report the successful assembly of the biosynthetic pathway and the significant impact of in vivo synthesized sorgoleone on the heterologous host Nicotiana benthamiana. A multigene DNA construct was prepared for the expression of genes required for sorgoleone biosynthesis in planta and deployed in N. benthamiana leaf tissues via Agrobacterium-mediated transient expression. RNA-sequencing was conducted to investigate the effects of sorgoleone, via expression of its biosynthesis pathway, on host gene expression. The production of sorgoleone in agroinfiltrated leaves as detected by gas chromatography/mass spectrometry (GC/MS) resulted in the formation of necrotic lesions, indicating that the compound caused severe phytotoxicity to these tissues. RNA-sequencing profiling revealed significant changes in gene expression in the leaf tissues expressing the pathway during the formation of sorgoleone-induced necrotic lesions. Transcriptome analysis suggested that the compound produced in vivo impaired the photosynthetic system as a result of downregulated gene expression for the photosynthesis apparatus and elevated expression of proteasomal genes which may play a major role in the phytotoxicity of sorgoleone.


Subject(s)
Biosynthetic Pathways , Nicotiana , Benzoquinones , Biosynthetic Pathways/genetics , Lipids , Plant Leaves , Plant Roots/genetics , Nicotiana/genetics
8.
Microbiol Resour Announc ; 9(37)2020 Sep 10.
Article in English | MEDLINE | ID: mdl-32912903

ABSTRACT

Aspergillus flavus and Aspergillus parasiticus produce carcinogenic aflatoxins during crop infection, with extensive variations in production among isolates, ranging from atoxigenic to highly toxigenic. Here, we report draft genome sequences of one A. parasiticus isolate and nine A. flavus isolates from field environments for use in comparative, functional, and phylogenetic studies.

9.
G3 (Bethesda) ; 10(10): 3515-3531, 2020 10 05.
Article in English | MEDLINE | ID: mdl-32817124

ABSTRACT

Efforts in genome sequencing in the Aspergillus genus have led to the development of quality reference genomes for several important species including A. nidulans, A. fumigatus, and A. oryzae However, less progress has been made for A. flavus As part of the effort of the USDA-ARS Annual Aflatoxin Workshop Fungal Genome Project, the isolate NRRL3357 was sequenced and resulted in a scaffold-level genome released in 2005. Our goal has been biologically driven, focusing on two areas: isolate variation in aflatoxin production and drought stress exacerbating aflatoxin production by A. flavus Therefore, we developed two reference pseudomolecule genome assemblies derived from chromosome arms for two isolates: AF13, a MAT1-2, highly stress tolerant, and highly aflatoxigenic isolate; and NRRL3357, a MAT1-1, less stress tolerant, and moderate aflatoxin producer in comparison to AF13. Here, we report these two reference-grade assemblies for these isolates through a combination of PacBio long-read sequencing and optical mapping, and coupled them with comparative, functional, and phylogenetic analyses. This analysis resulted in the identification of 153 and 45 unique genes in AF13 and NRRL3357, respectively. We also confirmed the presence of a unique 310 Kb insertion in AF13 containing 60 genes. Analysis of this insertion revealed the presence of a bZIP transcription factor, named atfC, which may contribute to isolate pathogenicity and stress tolerance. Phylogenomic analyses comparing these and other available assemblies also suggest that the species complex of A. flavus is polyphyletic.


Subject(s)
Aflatoxins , Aspergillus flavus , Aspergillus flavus/genetics , Base Sequence , Genome, Fungal , Phylogeny
10.
PLoS One ; 15(7): e0235434, 2020.
Article in English | MEDLINE | ID: mdl-32649700

ABSTRACT

The genetic diversity of North American soybean cultivars has been largely influenced by a small number of ancestors. High yielding breeding lines that possess exotic pedigrees have been developed, but identifying beneficial exotic alleles has been difficult as a result of complex interactions of yield alleles with genetic backgrounds and environments as well as the highly quantitative nature of yield. PI 416937 has been utilized in the development of many high yielding lines that have been entered into the USDA Southern States Uniform Tests over the past ~20 years. The primary goal of this research was to identify genomic regions under breeding selection from PI 416937 and introduce a methodology for identifying and potentially utilizing beneficial diversity from lines prevalent in the ancestry of elite cultivars. Utilizing SoySNP50K Infinium BeadChips, 52 high yielding PI 416937-derived lines as well as their parents were genotyped to identify PI 416937 alleles under breeding selection. Nine genomic regions across three chromosomes and 17 genomic regions across seven chromosomes were identified where PI 416937 alleles were under positive or negative selection. Minimal significant associations between PI 416937 alleles and yield were observed in replicated yield trials of five RIL populations, highlighting the difficulty of consistently detecting yield associations.


Subject(s)
Breeding , Genetic Variation/genetics , Glycine max/genetics , Alleles , Chromosome Mapping , Chromosomes, Plant , Genome, Plant/genetics , Genomics , Genotype , Humans , Quantitative Trait Loci/genetics , Seeds/genetics , Seeds/growth & development , Glycine max/growth & development , United States
11.
G3 (Bethesda) ; 9(7): 2253-2265, 2019 07 09.
Article in English | MEDLINE | ID: mdl-31088906

ABSTRACT

Genomic selection (GS) has become viable for selection of quantitative traits for which marker-assisted selection has often proven less effective. The potential of GS for soybean was characterized using 483 elite breeding lines, genotyped with BARCSoySNP6K iSelect BeadChips. Cross validation was performed using RR-BLUP and predictive abilities (rMP) of 0.81, 0.71, and 0.26 for protein, oil, and yield, were achieved at the largest tested training set size. Minimal differences were observed when comparing different marker densities and there appeared to be inflation in rMP due to population structure. For comparison purposes, two additional methods to predict breeding values for lines of four bi-parental populations within the GS dataset were tested. The first method predicted within each bi-parental population (WP method) and utilized a training set of full-sibs of the validation set. The second method utilized a training set of all remaining breeding lines except for full-sibs of the validation set to predict across populations (AP method). The AP method is more practical as the WP method would likely delay the breeding cycle and leverage smaller training sets. Averaging across populations for protein and oil content, rMP for the AP method (0.55, 0.30) approached rMP for the WP method (0.60, 0.52). Though comparable, rMP for yield was low for both AP and WP methods (0.12, 0.13). Based on increases in rMP as training sets increased and the effectiveness of WP vs. AP method, the AP method could potentially improve with larger training sets and increased relatedness between training and validation sets.


Subject(s)
Genome, Plant , Genomics , Glycine max/physiology , Quantitative Trait, Heritable , Seeds/physiology , Databases, Genetic , Genomics/methods , Genotype , Phenotype , Plant Breeding , Selection, Genetic
12.
Pest Manag Sci ; 75(9): 2490-2504, 2019 Sep.
Article in English | MEDLINE | ID: mdl-30868714

ABSTRACT

BACKGROUND: New modes of action are needed for herbicides. The flavonoid synthesis intermediate t-chalcone causes apoptosis-like symptoms in roots and bleaching of shoots of Arabidospsis, suggesting a unique mode of action as a phytotoxin. RESULTS: Using RNA-Seq, transcriptome changes were monitored in Arabidopsis seedlings during the first 24 h of exposure (at 1, 3, 6, 12 and 24 h) to 21 µm t-chalcone (I50 dose), examining effects on roots and shoots separately. Expression of 892 and 1000 genes was affected in roots and shoots, respectively. According to biological classification, many of the affected genes were transcription factors and genes associated with oxidative stress, heat shock proteins, xenobiotic detoxification, ABA and auxin biosynthesis, and primary metabolic processess. These are secondary effects found with most phytotoxins. Potent phytotoxins usually act by inhibiting enzymes of primary metabolism. KEGG pathway analysis of transcriptome results from the first 3 h of t-chalcone exposure indicated several potential primary metabolism target sites for t-chalcone. Of these, p-hydroxyphenylpyruvate dioxygenase (HPPD) and tyrosine amino transferase were consistent with the bleaching effect of the phytotoxin. Supplementation studies with Lemna paucicostata and Arabidiopsis supported HPPD as the target, although in vitro enzyme inhibition was not found. CONCLUSIONS: t-Chalcone is possibly a protoxin that is converted to a HPPD inhibitor in vivo. © 2019 Society of Chemical Industry.


Subject(s)
Arabidopsis/drug effects , Biological Control Agents/toxicity , Chalcone/toxicity , Herbicides/toxicity , Transcriptome/drug effects , Apoptosis , Arabidopsis/growth & development , Plant Roots/drug effects , Plant Shoots/drug effects , Seedlings/drug effects , Seedlings/growth & development
13.
Sci Rep ; 9(1): 4386, 2019 03 13.
Article in English | MEDLINE | ID: mdl-30867436

ABSTRACT

Quantitative genetic simulations can save time and resources by optimizing the logistics of an experiment. Current tools are difficult to use by those unfamiliar with programming, and these tools rarely address the actual genetic structure of the population under study. Here, we introduce crossword, which utilizes the widely available re-sequencing and genomics data to create more realistic simulations and to reduce user burden. The software was written in R, to simplify installation and implementation. Because crossword is a domain-specific language, it allows complex and unique simulations to be performed, but the language is supported by a graphical interface that guides users through functions and options. We first show crossword's utility in QTL-seq design, where its output accurately reflects empirical data. By introducing the concept of levels to reflect family relatedness, crossword can simulate a broad range of breeding programs and crops. Using levels, we further illustrate crossword's capabilities by examining the effect of family size and number of selfing generations on phenotyping accuracy and genomic selection. Additionally, we explore the ramifications of large phenotypic difference between parents in a QTL mapping cross, a scenario that is common in crop genetics but often difficult to simulate.


Subject(s)
Quantitative Trait Loci/genetics , Animals , Breeding , Genomics , Genotype , Phenotype , Selection, Genetic/genetics , Software
14.
BMC Plant Biol ; 18(1): 30, 2018 02 06.
Article in English | MEDLINE | ID: mdl-29409454

ABSTRACT

BACKGROUND: The chloroplast genomes (plastome) of most plants are highly conserved in structure, gene content, and gene order. Parasitic plants, including those that are fully photosynthetic, often contain plastome rearrangements. These most notably include gene deletions that result in a smaller plastome size. The nature of gene loss and genome structural rearrangement has been investigated in several parasitic plants, but their timing and contributions to the adaptation of these parasites requires further investigation, especially among the under-studied hemi-parasites. RESULTS: De novo sequencing, assembly and annotation of the chloroplast genomes of five photosynthetic parasites from the family Orobanchaceae were employed to investigate plastome dynamics. Four had major structural rearrangements, including gene duplications and gene losses, that differentiated the taxa. The facultative parasite Aureolaria virginica had the most similar genome content to its close non-parasitic relative, Lindenbergia philippensis, with similar genome size and organization, and no differences in gene content. In contrast, the facultative parasite Buchnera americana and three obligate parasites in the genus Striga all had enlargements of their plastomes, primarily caused by expansion within the large inverted repeats (IRs) that are a standard plastome feature. Some of these IR increases were shared by multiple investigated species, but others were unique to particular lineages. Gene deletions and pseudogenization were also both shared and lineage-specific, with particularly frequent and independent loss of the ndh genes involved in electron recycling. CONCLUSIONS: Five new plastid genomes were fully assembled and compared. The results indicate that plastome instability is common in parasitic plants, even those that retain the need to perform essential plastid functions like photosynthesis. Gene losses were slow and not identical across taxa, suggesting that different lineages had different uses or needs for some of their plastome gene content, including genes involved in some aspects of photosynthesis. Recent repeat region extensions, some unique to terminal species branches, were observed after the divergence of the Buchnera/Striga clade, suggesting that this otherwise rare event has some special value in this lineage.


Subject(s)
Chloroplasts/genetics , Gene Deletion , Genes, Plant , Genome, Chloroplast , Genome, Plant , Orobanchaceae/genetics
15.
G3 (Bethesda) ; 6(9): 2693-705, 2016 09 08.
Article in English | MEDLINE | ID: mdl-27402364

ABSTRACT

Crop improvement represents a long-running experiment in artificial selection on a complex trait, namely yield. How such selection relates to natural populations is unclear, but the analysis of domesticated populations could offer insights into the relative role of selection, drift, and recombination in all species facing major shifts in selective regimes. Because of the extreme autogamy exhibited by soybean (Glycine max), many "immortalized" genotypes of elite varieties spanning the last century have been preserved and characterized using ∼50,000 single nucleotide polymorphic (SNP) markers. Also due to autogamy, the history of North American soybean breeding can be roughly divided into pre- and posthybridization eras, allowing for direct interrogation of the role of recombination in improvement and selection. Here, we report on genome-wide characterization of the structure and history of North American soybean populations and the signature of selection in these populations. Supporting previous work, we find that maturity defines population structure. Though the diversity of North American ancestors is comparable to available landraces, prehybridization line selections resulted in a clonal structure that dominated early breeding and explains many of the reductions in diversity found in the initial generations of soybean hybridization. The rate of allele frequency change does not deviate sharply from neutral expectation, yet some regions bare hallmarks of strong selection, suggesting a highly variable range of selection strengths biased toward weak effects. We also discuss the importance of haplotypes as units of analysis when complex traits fall under novel selection regimes.


Subject(s)
Genetic Variation , Genome-Wide Association Study , Glycine max/genetics , Quantitative Trait Loci/genetics , Genome, Plant , Genomics , Genotype , Haplotypes , Linkage Disequilibrium , Polymorphism, Single Nucleotide , United States
16.
BMC Plant Biol ; 15: 26, 2015 Feb 03.
Article in English | MEDLINE | ID: mdl-25644024

ABSTRACT

BACKGROUND: Among abiotic stresses, drought is the most common reducer of crop yields. The slow-wilting soybean genotype PI 416937 is somewhat robust to water deficit and has been used previously to map the trait in a bi-parental population. Since drought stress response is a complex biological process, whole genome transcriptome analysis was performed to obtain a deeper understanding of the drought response in soybean. RESULTS: Contrasting data from PI 416937 and the cultivar 'Benning', we developed a classification system to identify genes that were either responding to water-deficit in both genotypes or that had a genotype x environment (GxE) response. In spite of very different wilting phenotypes, 90% of classifiable genes had either constant expression in both genotypes (33%) or very similar response profiles (E genes, 57%). By further classifying E genes based on expression profiles, we were able to discern the functional specificity of transcriptional responses at particular stages of water-deficit, noting both the well-known reduction in photosynthesis genes as well as the less understood up-regulation of the protein transport pathway. Two percent of classifiable genes had a well-defined GxE response, many of which are located within slow-wilting QTLs. We consider these strong candidates for possible causal genes underlying PI 416937's unique drought avoidance strategy. CONCLUSIONS: There is a general and functionally significant transcriptional response to water deficit that involves not only known pathways, such as down-regulation of photosynthesis, but also up-regulation of protein transport and chromatin remodeling. Genes that show a genotypic difference are more likely to show an environmental response than genes that are constant between genotypes. In this study, at least five genes that clearly exhibited a genotype x environment response fell within known QTL and are very good candidates for further research into slow-wilting.


Subject(s)
Droughts , Glycine max/physiology , Transcriptome , Water , Glycine max/genetics
17.
PLoS One ; 9(10): e108581, 2014.
Article in English | MEDLINE | ID: mdl-25333869

ABSTRACT

Plants from the Zingiberaceae family are a key source of spices and herbal medicines. Species identification within this group is critical in the search for known and possibly novel bioactive compounds. To facilitate precise characterization of this group, we have sequenced chloroplast genomes from species representing five major groups within Zingiberaceae. Generally, the structure of these genomes is similar to the basal angiosperm excepting an expansion of 3 kb associated with the inverted repeat A region. Portions of this expansion appear to be shared across the entire Zingiberales order, which includes gingers and bananas. We used whole plastome alignment information to develop DNA barcodes that would maximize the ability to differentiate species within the Zingiberaceae. Our computation pipeline identified regions of high variability that were flanked by highly conserved regions used for primer design. This approach yielded hitherto unexploited regions of variability. These theoretically optimal barcodes were tested on a range of species throughout the family and were found to amplify and differentiate genera and, in some cases, species. Still, though these barcodes were specifically optimized for the Zingiberaceae, our data support the emerging consensus that whole plastome sequences are needed for robust species identification and phylogenetics within this family.


Subject(s)
Genome, Chloroplast , Zingiber officinale/genetics , Chloroplasts/genetics , DNA, Plant/analysis , Zingiber officinale/classification , High-Throughput Nucleotide Sequencing , Microsatellite Repeats , Molecular Sequence Annotation , Phylogeny , Plant Proteins/chemistry , Plant Proteins/genetics , Sequence Alignment , Sequence Analysis, DNA
18.
G3 (Bethesda) ; 4(11): 2283-94, 2014 Sep 22.
Article in English | MEDLINE | ID: mdl-25246241

ABSTRACT

Soybean oil and meal are major contributors to world-wide food production. Consequently, the genetic basis for soybean seed composition has been intensely studied using family-based mapping. Population-based mapping approaches, in the form of genome-wide association (GWA) scans, have been able to resolve loci controlling moderately complex quantitative traits (QTL) in numerous crop species. Yet, it is still unclear how soybean's unique population history will affect GWA scans. Using one of the populations in this study, we simulated phenotypes resulting from a range of genetic architectures. We found that with a heritability of 0.5, ∼100% and ∼33% of the 4 and 20 simulated QTL can be recovered, respectively, with a false-positive rate of less than ∼6×10(-5) per marker tested. Additionally, we demonstrated that combining information from multi-locus mixed models and compressed linear-mixed models improves QTL identification and interpretation. We applied these insights to exploring seed composition in soybean, refining the linkage group I (chromosome 20) protein QTL and identifying additional oil QTL that may allow some decoupling of highly correlated oil and protein phenotypes. Because the value of protein meal is closely related to its essential amino acid profile, we attempted to identify QTL underlying methionine, threonine, cysteine, and lysine content. Multiple QTL were found that have not been observed in family-based mapping studies, and each trait exhibited associations across multiple populations. Chromosomes 1 and 8 contain strong candidate alleles for essential amino acid increases. Overall, we present these and additional data that will be useful in determining breeding strategies for the continued improvement of soybean's nutrient portfolio.


Subject(s)
Genome, Plant , Glycine max/genetics , Seeds/genetics , Amino Acids/analysis , Genome-Wide Association Study , Quantitative Trait Loci , Seeds/chemistry
19.
Proc Natl Acad Sci U S A ; 111(18): 6684-9, 2014 May 06.
Article in English | MEDLINE | ID: mdl-24760826

ABSTRACT

The insertion of DNA into a genome can result in the duplication and dispersal of functional sequences through the genome. In addition, a deeper understanding of insertion mechanisms will inform methods of genetic engineering and plant transformation. Exploiting structural variations in numerous rice accessions, we have inferred and analyzed intermediate length (10-1,000 bp) insertions in plants. Insertions in this size class were found to be approximately equal in frequency to deletions, and compound insertion-deletions comprised only 0.1% of all events. Our findings indicate that, as observed in humans, tandem or partially tandem duplications are the dominant form of insertion (48%), although short duplications from ectopic donors account for a sizable fraction of insertions in rice (38%). Many nontandem duplications contain insertions from nearby DNA (within 200 bp) and can contain multiple donor sources--some distant--in single events. Although replication slippage is a plausible explanation for tandem duplications, the end homology required in such a model is most often absent and rarely is >5 bp. However, end homology is commonly longer than expected by chance. Such findings lead us to favor a model of patch-mediated double-strand-break creation followed by nonhomologous end-joining. Additionally, a striking bias toward 31-bp partially tandem duplications suggests that errors in nucleotide excision repair may be resolved via a similar, but distinct, pathway. In summary, the analysis of recent insertions in rice suggests multiple underappreciated causes of structural variation in eukaryotes.


Subject(s)
DNA Breaks, Double-Stranded , DNA, Plant/genetics , Mutagenesis, Insertional , Oryza/genetics , Tandem Repeat Sequences , DNA End-Joining Repair/genetics , DNA Repair/genetics , DNA, Plant/metabolism , Evolution, Molecular , Genetic Variation , Models, Genetic , Oryza/metabolism
20.
Plant Sci ; 214: 1-12, 2014 Jan.
Article in English | MEDLINE | ID: mdl-24268158

ABSTRACT

We review the evidence that upstream open reading frames (uORFs) function as RNA sequence elements for post-transcriptional control of gene expression, specifically translation. uORFs are highly abundant in the genomes of angiosperms. Their negative effect on translation is often attenuated by ribosomal translation reinitiation, a process whose molecular biochemistry is still being investigated. Certain uORFs render translation responsive to small molecules, thus offering a path for metabolic control of gene expression in evolution and synthetic biology. In some cases, uORFs form modular logic gates in signal transduction. uORFs thus provide eukaryotes with a functionality analogous to, or comparable to, riboswitches and attenuators in prokaryotes. uORFs exist in many genes regulating development and point toward translational control of development. While many uORFs appear to be poorly conserved, and the number of genes with conserved-peptide uORFs is modest, many mRNAs have a conserved pattern of uORFs. Evolutionarily, the gain and loss of uORFs may be a widespread mechanism that diversifies gene expression patterns. Last but not least, this review includes a dedicated uORF database for Arabidopsis.


Subject(s)
Gene Expression Regulation, Plant , Open Reading Frames/genetics , Plants/genetics , Protein Biosynthesis/genetics , Models, Genetic
SELECTION OF CITATIONS
SEARCH DETAIL
...