Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 16 de 16
Filtrar
Más filtros












Base de datos
Intervalo de año de publicación
1.
Nucleic Acids Res ; 51(19): 10147-10161, 2023 10 27.
Artículo en Inglés | MEDLINE | ID: mdl-37738140

RESUMEN

CRISPR-Cas9 tools have transformed genetic manipulation capabilities in the laboratory. Empirical rules-of-thumb have been developed for only a narrow range of model organisms, and mechanistic underpinnings for sgRNA efficiency remain poorly understood. This work establishes a novel feature set and new public resource, produced with quantum chemical tensors, for interpreting and predicting sgRNA efficiency. Feature engineering for sgRNA efficiency is performed using an explainable-artificial intelligence model: iterative Random Forest (iRF). By encoding quantitative attributes of position-specific sequences for Escherichia coli sgRNAs, we identify important traits for sgRNA design in bacterial species. Additionally, we show that expanding positional encoding to quantum descriptors of base-pair, dimer, trimer, and tetramer sequences captures intricate interactions in local and neighboring nucleotides of the target DNA. These features highlight variation in CRISPR-Cas9 sgRNA dynamics between E. coli and H. sapiens genomes. These novel encodings of sgRNAs enhance our understanding of the elaborate quantum biological processes involved in CRISPR-Cas9 machinery.


Asunto(s)
Sistemas CRISPR-Cas , ARN Guía de Sistemas CRISPR-Cas , Inteligencia Artificial , ADN , Escherichia coli/genética , Edición Génica , Humanos
2.
Plant Cell ; 34(1): 514-534, 2022 01 20.
Artículo en Inglés | MEDLINE | ID: mdl-34735005

RESUMEN

Changes in gene expression are important for responses to abiotic stress. Transcriptome profiling of heat- or cold-stressed maize genotypes identifies many changes in transcript abundance. We used comparisons of expression responses in multiple genotypes to identify alleles with variable responses to heat or cold stress and to distinguish examples of cis- or trans-regulatory variation for stress-responsive expression changes. We used motifs enriched near the transcription start sites (TSSs) for thermal stress-responsive genes to develop predictive models of gene expression responses. Prediction accuracies can be improved by focusing only on motifs within unmethylated regions near the TSS and vary for genes with different dynamic responses to stress. Models trained on expression responses in a single genotype and promoter sequences provided lower performance when applied to other genotypes but this could be improved by using models trained on data from all three genotypes tested. The analysis of genes with cis-regulatory variation provides evidence for structural variants that result in presence/absence of transcription factor binding sites in creating variable responses. This study provides insights into cis-regulatory motifs for heat- and cold-responsive gene expression and defines a framework for developing models to predict expression responses across multiple genotypes.


Asunto(s)
Respuesta al Choque por Frío/genética , Regulación de la Expresión Génica de las Plantas/fisiología , Genes de Plantas , Respuesta al Choque Térmico/genética , Transcriptoma , Zea mays/fisiología , Perfilación de la Expresión Génica , Zea mays/genética
3.
G3 (Bethesda) ; 11(8)2021 08 07.
Artículo en Inglés | MEDLINE | ID: mdl-34849810

RESUMEN

Accessible chromatin and unmethylated DNA are associated with many genes and cis-regulatory elements. Attempts to understand natural variation for accessible chromatin regions (ACRs) and unmethylated regions (UMRs) often rely upon alignments to a single reference genome. This limits the ability to assess regions that are absent in the reference genome assembly and monitor how nearby structural variants influence variation in chromatin state. In this study, de novo genome assemblies for four maize inbreds (B73, Mo17, Oh43, and W22) are utilized to assess chromatin accessibility and DNA methylation patterns in a pan-genome context. A more complete set of UMRs and ACRs can be identified when chromatin data are aligned to the matched genome rather than a single reference genome. While there are UMRs and ACRs present within genomic regions that are not shared between genotypes, these features are 6- to 12-fold enriched within regions between genomes. Characterization of UMRs present within shared genomic regions reveals that most UMRs maintain the unmethylated state in other genotypes with only ∼5% being polymorphic between genotypes. However, the majority (71%) of UMRs that are shared between genotypes only exhibit partial overlaps suggesting that the boundaries between methylated and unmethylated DNA are dynamic. This instability is not solely due to sequence variation as these partially overlapping UMRs are frequently found within genomic regions that lack sequence variation. The ability to compare chromatin properties among individuals with structural variation enables pan-epigenome analyses to study the sources of variation for accessible chromatin and unmethylated DNA.


Asunto(s)
Metilación de ADN , Zea mays , Cromatina/genética , Regulación de la Expresión Génica de las Plantas , Genoma de Planta , Humanos , Zea mays/genética
4.
G3 (Bethesda) ; 11(10)2021 09 27.
Artículo en Inglés | MEDLINE | ID: mdl-34568911

RESUMEN

Intact transposable elements (TEs) account for 65% of the maize genome and can impact gene function and regulation. Although TEs comprise the majority of the maize genome and affect important phenotypes, genome-wide patterns of TE polymorphisms in maize have only been studied in a handful of maize genotypes, due to the challenging nature of assessing highly repetitive sequences. We implemented a method to use short-read sequencing data from 509 diverse inbred lines to classify the presence/absence of 445,418 nonredundant TEs that were previously annotated in four genome assemblies including B73, Mo17, PH207, and W22. Different orders of TEs (i.e., LTRs, Helitrons, and TIRs) had different frequency distributions within the population. LTRs with lower LTR similarity were generally more frequent in the population than LTRs with higher LTR similarity, though high-frequency insertions with very high LTR similarity were observed. LTR similarity and frequency estimates of nested elements and the outer elements in which they insert revealed that most nesting events occurred very near the timing of the outer element insertion. TEs within genes were at higher frequency than those that were outside of genes and this is particularly true for those not inserted into introns. Many TE insertional polymorphisms observed in this population were tagged by SNP markers. However, there were also 19.9% of the TE polymorphisms that were not well tagged by SNPs (R2 < 0.5) that potentially represent information that has not been well captured in previous SNP-based marker-trait association studies. This study provides a population scale genome-wide assessment of TE variation in maize and provides valuable insight on variation in TEs in maize and factors that contribute to this variation.


Asunto(s)
Elementos Transponibles de ADN , Zea mays , Elementos Transponibles de ADN/genética , Genotipo , Intrones , Secuencias Repetidas Terminales , Zea mays/genética
5.
G3 (Bethesda) ; 11(7)2021 07 14.
Artículo en Inglés | MEDLINE | ID: mdl-33905487

RESUMEN

Transposons can create allelic diversity that affects gene expression and phenotypic diversity. The detection of transposon polymorphisms at a genome-wide scale across a large population is difficult. Here, we developed a targeted sequencing approach to monitor transposon polymorphisms of interest. This approach can interrogate the presence or absence of transposons reliably across various genotypes. Using this approach, we genotyped a set of 965 transposon-related presence/absence polymorphisms in a diverse panel of 16 maize (Zea mays L.) inbred lines that are representative of the major maize breeding groups. About 70% of the selected regions can be effectively assayed in each genotype. The consistency between the capture-based assay and PCR-based assay are 98.6% based on analysis of 24 randomly selected transposon polymorphisms. By integrating the transposon polymorphisms data with gene expression data, ∼18% of the assayed transposon polymorphisms were found to be associated with variable gene expression levels. A detailed analysis of 18 polymorphisms in a larger association panel confirmed the effects of 10 polymorphisms, with one of them having a stronger association with expression than nearby SNP markers. The effects of seven polymorphisms were tested using a luciferase-based expression assay, and one was confirmed. Together, this study demonstrates that the targeted sequencing assay is an effective way to explore transposon function in a high-throughput manner.


Asunto(s)
Polimorfismo de Nucleótido Simple , Zea mays , Zea mays/genética , Genotipo , Alelos
6.
Genetics ; 217(1): 1-13, 2021 03 03.
Artículo en Inglés | MEDLINE | ID: mdl-33683350

RESUMEN

Transposable elements (TEs) have the potential to create regulatory variation both through the disruption of existing DNA regulatory elements and through the creation of novel DNA regulatory elements. In a species with a large genome, such as maize, many TEs interspersed with genes create opportunities for significant allelic variation due to TE presence/absence polymorphisms among individuals. We used information on putative regulatory elements in combination with knowledge about TE polymorphisms in maize to identify TE insertions that interrupt existing accessible chromatin regions (ACRs) in B73 as well as examples of polymorphic TEs that contain ACRs among four inbred lines of maize including B73, Mo17, W22, and PH207. The TE insertions in three other assembled maize genomes (Mo17, W22, or PH207) that interrupt ACRs that are present in the B73 genome can trigger changes to the chromatin, suggesting the potential for both genetic and epigenetic influences of these insertions. Nearly 20% of the ACRs located over 2 kb from the nearest gene are located within an annotated TE. These are regions of unmethylated DNA that show evidence for functional importance similar to ACRs that are not present within TEs. Using a large panel of maize genotypes, we tested if there is an association between the presence of TE insertions that interrupt, or carry, an ACR and the expression of nearby genes. While most TE polymorphisms are not associated with expression for nearby genes, the TEs that carry ACRs exhibit enrichment for being associated with higher expression of nearby genes, suggesting that these TEs may contribute novel regulatory elements. These analyses highlight the potential for a subset of TEs to rewire transcriptional responses in eukaryotic genomes.


Asunto(s)
Cromatina/metabolismo , Elementos Transponibles de ADN/genética , Regulación de la Expresión Génica de las Plantas , Zea mays/genética , Cromatina/genética , Epigénesis Genética
7.
Plant Physiol ; 186(1): 420-433, 2021 05 27.
Artículo en Inglés | MEDLINE | ID: mdl-33591319

RESUMEN

Transposable elements (TEs) pervade most eukaryotic genomes. The repetitive nature of TEs complicates the analysis of their expression. Evaluation of the expression of both TE families (using unique and multi-mapping reads) and specific elements (using uniquely mapping reads) in leaf tissue of three maize (Zea mays) inbred lines subjected to heat or cold stress reveals no evidence for genome-wide activation of TEs; however, some specific TE families generate transcripts only in stress conditions. There is substantial variation for which TE families exhibit stress-responsive expression in the different genotypes. In order to understand the factors that drive expression of TEs, we focused on a subset of families in which we could monitor expression of individual elements. The stress-responsive activation of a TE family can often be attributed to a small number of elements in the family that contains regions lacking DNA methylation. Comparisons of the expression of TEs in different genotypes revealed both genetic and epigenetic variation. Many of the specific TEs that are activated in stress in one inbred are not present in the other inbred, explaining the lack of activation. Among the elements that are shared in both genomes but only expressed in one genotype, we found that many exhibit differences in DNA methylation such that the genotype without expression is fully methylated. This study provides insights into the regulation of expression of TEs in normal and stress conditions and highlights the role of chromatin variation between elements in a family or between genotypes for contributing to expression variation. The highly repetitive nature of many TEs complicates the analysis of their expression. Although most TEs are not expressed, some exhibits expression in certain tissues or conditions. We monitored the expression of both TE families (using unique and multi-mapping reads) and specific elements (using uniquely mapping reads) in leaf tissue of three maize (Zea mays) inbred lines subjected to heat or cold stress. While genome-wide activation of TEs did not occur, some TE families generated transcripts only in stress conditions with variation by genotype. To better understand the factors that drive expression of TEs, we focused on a subset of families in which we could monitor expression of individual elements. In most cases, stress-responsive activation of a TE family was attributed to a small number of elements in the family. The elements that contained small regions lacking DNA methylation regions showed enriched expression while fully methylated elements were rarely expressed in control or stress conditions. The cause of varied expression in the different genotypes was due to both genetic and epigenetic variation. Many specific TEs activated by stress in one inbred were not present in the other inbred. Among the elements shared in both genomes, full methylation inhibited expression in one of the genotypes. This study provides insights into the regulation of TE expression in normal and stress conditions and highlights the role of chromatin variation between elements in a family or between genotypes for contributing to expression.


Asunto(s)
Elementos Transponibles de ADN/genética , Epigénesis Genética , Expresión Génica , Variación Genética , Estrés Fisiológico/genética , Zea mays/fisiología , Zea mays/genética
8.
Curr Opin Plant Biol ; 61: 101989, 2021 06.
Artículo en Inglés | MEDLINE | ID: mdl-33445144

RESUMEN

Epigenetic variation has been observed in many plant populations. This variation can influence qualitative and quantitative traits. A key question is whether there is novel information in the epigenome that is not captured by SNP-based genetic markers. The answer likely varies depending on the sources and stability of epigenetic variation as well as the type of population being studied. We consider the epigenetic variation in several plant systems and how this relates to potential for hidden information that could increase our understanding of phenotypic variation.


Asunto(s)
Metilación de ADN , Epigénesis Genética , Plantas/genética , ADN de Plantas , Epigenómica , Variación Genética , Fenotipo , Polimorfismo de Nucleótido Simple
9.
Proc Natl Acad Sci U S A ; 117(38): 23991-24000, 2020 09 22.
Artículo en Inglés | MEDLINE | ID: mdl-32879011

RESUMEN

The genomic sequences of crops continue to be produced at a frenetic pace. It remains challenging to develop complete annotations of functional genes and regulatory elements in these genomes. Chromatin accessibility assays enable discovery of functional elements; however, to uncover the full portfolio of cis-elements would require profiling of many combinations of cell types, tissues, developmental stages, and environments. Here, we explore the potential to use DNA methylation profiles to develop more complete annotations. Using leaf tissue in maize, we define ∼100,000 unmethylated regions (UMRs) that account for 5.8% of the genome; 33,375 UMRs are found greater than 2 kb from genes. UMRs are highly stable in multiple vegetative tissues, and they capture the vast majority of accessible chromatin regions from leaf tissue. However, many UMRs are not accessible in leaf, and these represent regions with potential to become accessible in specific cell types or developmental stages. These UMRs often occur near genes that are expressed in other tissues and are enriched for binding sites of transcription factors. The leaf-inaccessible UMRs exhibit unique chromatin modification patterns and are enriched for chromatin interactions with nearby genes. The total UMR space in four additional monocots ranges from 80 to 120 megabases, which is remarkably similar considering the range in genome size of 271 megabases to 4.8 gigabases. In summary, based on the profile from a single tissue, DNA methylation signatures provide powerful filters to distill large genomes down to the small fraction of putative functional genes and regulatory elements.


Asunto(s)
Metilación de ADN/genética , Regulación de la Expresión Génica de las Plantas/genética , Genoma de Planta/genética , Secuencias Reguladoras de Ácidos Nucleicos/genética , Cromatina/química , Cromatina/genética , Cromatina/metabolismo , ADN de Plantas/genética , Hojas de la Planta/genética , Hojas de la Planta/metabolismo , Zea mays/genética
10.
Plant Cell ; 32(5): 1377-1396, 2020 05.
Artículo en Inglés | MEDLINE | ID: mdl-32184350

RESUMEN

The regulation of gene expression is central to many biological processes. Gene regulatory networks (GRNs) link transcription factors (TFs) to their target genes and represent maps of potential transcriptional regulation. Here, we analyzed a large number of publically available maize (Zea mays) transcriptome data sets including >6000 RNA sequencing samples to generate 45 coexpression-based GRNs that represent potential regulatory relationships between TFs and other genes in different populations of samples (cross-tissue, cross-genotype, and tissue-and-genotype samples). While these networks are all enriched for biologically relevant interactions, different networks capture distinct TF-target associations and biological processes. By examining the power of our coexpression-based GRNs to accurately predict covarying TF-target relationships in natural variation data sets, we found that presence/absence changes rather than quantitative changes in TF gene expression are more likely associated with changes in target gene expression. Integrating information from our TF-target predictions and previous expression quantitative trait loci (eQTL) mapping results provided support for 68 TFs underlying 74 previously identified trans-eQTL hotspots spanning a variety of metabolic pathways. This study highlights the utility of developing multiple GRNs within a species to detect putative regulators of important plant pathways and provides potential targets for breeding or biotechnological applications.


Asunto(s)
Regulación de la Expresión Génica de las Plantas , Redes Reguladoras de Genes , Zea mays/genética , Arabidopsis/genética , Bases de Datos Genéticas , Ontología de Genes , Anotación de Secuencia Molecular , Filogenia , Sitios de Carácter Cuantitativo/genética , Factores de Transcripción/metabolismo
12.
Nat Plants ; 5(12): 1237-1249, 2019 12.
Artículo en Inglés | MEDLINE | ID: mdl-31740773

RESUMEN

Genetic mapping studies on crops suggest that agronomic traits can be controlled by gene-distal intergenic loci. Despite the biological importance and the potential agronomic utility of these loci, they remain virtually uncharacterized in all crop species to date. Here, we provide genetic, epigenomic and functional molecular evidence to support the widespread existence of gene-distal (hereafter, distal) loci that act as long-range transcriptional cis-regulatory elements (CREs) in the maize genome. Such loci are enriched for euchromatic features that suggest their regulatory functions. Chromatin loops link together putative CREs with genes and recapitulate genetic interactions. Putative CREs also display elevated transcriptional enhancer activities, as measured by self-transcribing active regulatory region sequencing. These results provide functional support for the widespread existence of CREs that act over large genomic distances to control gene expression.


Asunto(s)
Genoma de Planta , Elementos Reguladores de la Transcripción , Zea mays/genética , Regulación de la Expresión Génica de las Plantas , Proteínas de Plantas/genética , Regiones Promotoras Genéticas
13.
PLoS Genet ; 15(9): e1008291, 2019 09.
Artículo en Inglés | MEDLINE | ID: mdl-31498837

RESUMEN

DNA methylation and epigenetic silencing play important roles in the regulation of transposable elements (TEs) in many eukaryotic genomes. A majority of the maize genome is derived from TEs that can be classified into different orders and families based on their mechanism of transposition and sequence similarity, respectively. TEs themselves are highly methylated and it can be tempting to view them as a single uniform group. However, the analysis of DNA methylation profiles in flanking regions provides evidence for distinct groups of chromatin properties at different TE families. These differences among TE families are reproducible in different tissues and different inbred lines. TE families with varying levels of DNA methylation in flanking regions also show distinct patterns of chromatin accessibility and modifications within the TEs. The differences in the patterns of DNA methylation flanking TE families arise from a combination of non-random insertion preferences of TE families, changes in DNA methylation triggered by the insertion of the TE and subsequent selection pressure. A set of nearly 70,000 TE polymorphisms among four assembled maize genomes were used to monitor the level of DNA methylation at haplotypes with and without the TE insertions. In many cases, TE families with high levels of DNA methylation in flanking sequence are enriched for insertions into highly methylated regions. The majority of the >2,500 TE insertions into unmethylated regions result in changes in DNA methylation in haplotypes with the TE, suggesting the widespread potential for TE insertions to condition altered methylation in conserved regions of the genome. This study highlights the interplay between TEs and the methylome of a major crop species.


Asunto(s)
Metilación de ADN/genética , Elementos Transponibles de ADN/genética , Zea mays/genética , Epigénesis Genética/genética , Epigenómica/métodos , Evolución Molecular , Regulación de la Expresión Génica de las Plantas/genética , Genoma de Planta/genética , Genotipo , Haplotipos/genética , Polimorfismo Genético/genética , Análisis de Secuencia de ADN/métodos
14.
Plant J ; 100(5): 1052-1065, 2019 12.
Artículo en Inglés | MEDLINE | ID: mdl-31381222

RESUMEN

Transposable elements (TEs) are ubiquitous components of eukaryotic genomes and can create variation in genome organization and content. Most maize genomes are composed of TEs. We developed an approach to define shared and variable TE insertions across genome assemblies and applied this method to four maize genomes (B73, W22, Mo17 and PH207) with uniform structural annotations of TEs. Among these genomes we identified approximately 400 000 TEs that are polymorphic, encompassing 1.6 Gb of variable TE sequence. These polymorphic TEs include a combination of recent transposition events as well as deletions of older TEs. There are examples of polymorphic TEs within each of the superfamilies of TEs and they are found distributed across the genome, including in regions of recent shared ancestry among individuals. There are many examples of polymorphic TEs within or near maize genes. In addition, there are 2380 gene annotations in the B73 genome that are located within variable TEs, providing evidence for the role of TEs in contributing to the substantial differences in annotated gene content among these genotypes. TEs are highly variable in our survey of four temperate maize genomes, highlighting the major contribution of TEs in driving variation in genome organization and gene content. OPEN RESEARCH BADGES: This article has earned an Open Data Badge for making publicly available the digitally-shareable data necessary to reproduce the reported results. The data is available at https://github.com/SNAnderson/maizeTE_variation; https://mcstitzer.github.io/maize_TEs.


Asunto(s)
Elementos Transponibles de ADN , Genoma de Planta , Zea mays/genética , Evolución Molecular , Regulación de la Expresión Génica de las Plantas , Variación Genética , Genómica , Genotipo , Anotación de Secuencia Molecular , Análisis de Secuencia de ADN/métodos
16.
Nat Genet ; 50(9): 1282-1288, 2018 09.
Artículo en Inglés | MEDLINE | ID: mdl-30061736

RESUMEN

The maize W22 inbred has served as a platform for maize genetics since the mid twentieth century. To streamline maize genome analyses, we have sequenced and de novo assembled a W22 reference genome using short-read sequencing technologies. We show that significant structural heterogeneity exists in comparison to the B73 reference genome at multiple scales, from transposon composition and copy number variation to single-nucleotide polymorphisms. The generation of this reference genome enables accurate placement of thousands of Mutator (Mu) and Dissociation (Ds) transposable element insertions for reverse and forward genetics studies. Annotation of the genome has been achieved using RNA-seq analysis, differential nuclease sensitivity profiling and bisulfite sequencing to map open reading frames, open chromatin sites and DNA methylation profiles, respectively. Collectively, the resources developed here integrate W22 as a community reference genome for functional genomics and provide a foundation for the maize pan-genome.


Asunto(s)
Elementos Transponibles de ADN/genética , Genes de Plantas/genética , Genoma de Planta/genética , Zea mays/genética , Cromatina/genética , Cromosomas de las Plantas/genética , Variaciones en el Número de Copia de ADN/genética , Metilación de ADN/genética , ADN de Plantas/genética , Genómica/métodos , Sistemas de Lectura Abierta/genética , Análisis de Secuencia de ADN/métodos
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...