Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 13 de 13
Filtrar
1.
Cell ; 166(2): 492-505, 2016 Jul 14.
Artigo em Inglês | MEDLINE | ID: mdl-27419873

RESUMO

The epigenome orchestrates genome accessibility, functionality, and three-dimensional structure. Because epigenetic variation can impact transcription and thus phenotypes, it may contribute to adaptation. Here, we report 1,107 high-quality single-base resolution methylomes and 1,203 transcriptomes from the 1001 Genomes collection of Arabidopsis thaliana. Although the genetic basis of methylation variation is highly complex, geographic origin is a major predictor of genome-wide DNA methylation levels and of altered gene expression caused by epialleles. Comparison to cistrome and epicistrome datasets identifies associations between transcription factor binding sites, methylation, nucleotide variation, and co-expression modules. Physical maps for nine of the most diverse genomes reveal how transposons and other structural variants shape the epigenome, with dramatic effects on immunity genes. The 1001 Epigenomes Project provides a comprehensive resource for understanding how variation in DNA methylation contributes to molecular and non-molecular phenotypes in natural populations of the most studied model plant.


Assuntos
Arabidopsis/genética , Epigênese Genética , Metilação de DNA , Epigenômica , Regulação da Expressão Gênica de Plantas , Genoma de Planta , Transcriptoma
2.
Genome Res ; 26(8): 1057-68, 2016 08.
Artigo em Inglês | MEDLINE | ID: mdl-27225844

RESUMO

The three-dimensional packing of the genome plays an important role in regulating gene expression. We have used Hi-C, a genome-wide chromatin conformation capture (3C) method, to analyze Arabidopsis thaliana chromosomes dissected into subkilobase segments, which is required for gene-level resolution in this species with a gene-dense genome. We found that the repressive H3K27me3 histone mark is overrepresented in the promoter regions of genes that are in conformational linkage over long distances. In line with the globally dispersed distribution of RNA polymerase II in A. thaliana nuclear space, actively transcribed genes do not show a strong tendency to associate with each other. In general, there are often contacts between 5' and 3' ends of genes, forming local chromatin loops. Such self-loop structures of genes are more likely to occur in more highly expressed genes, although they can also be found in silent genes. Silent genes with local chromatin loops are highly enriched for the histone variant H3.3 at their 5' and 3' ends but depleted of repressive marks such as heterochromatic histone modifications and DNA methylation in flanking regions. Our results suggest that, different from animals, a major theme of genome folding in A. thaliana is the formation of structural units that correspond to gene bodies.


Assuntos
Arabidopsis/genética , Cromatina/genética , Metilação de DNA/genética , Histona-Lisina N-Metiltransferase/genética , Animais , Arabidopsis/crescimento & desenvolvimento , Cromossomos de Plantas , Regulação da Expressão Gênica de Plantas , Genoma de Planta , Histona-Lisina N-Metiltransferase/biossíntese , Proteínas de Plantas/genética
3.
Genome Res ; 25(2): 246-56, 2015 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-25367294

RESUMO

The spatial arrangement of interphase chromosomes in the nucleus is important for gene expression and genome function in animals and in plants. The recently developed Hi-C technology is an efficacious method to investigate genome packing. Here we present a detailed Hi-C map of the three-dimensional genome organization of the plant Arabidopsis thaliana. We find that local chromatin packing differs from the patterns seen in animals, with kilobasepair-sized segments that have much higher intrachromosome interaction rates than neighboring regions, representing a dominant local structural feature of genome conformation in A. thaliana. These regions, which appear as positive strips on two-dimensional representations of chromatin interaction, are enriched in epigenetic marks H3K27me3, H3.1, and H3.3. We also identify more than 400 insulator-like regions. Furthermore, although topologically associating domains (TADs), which are prominent in animals, are not an obvious feature of A. thaliana genome packing, we found more than 1000 regions that have properties of TAD boundaries, and a similar number of regions analogous to the interior of TADs. The insulator-like, TAD-boundary-like, and TAD-interior-like regions are each enriched for distinct epigenetic marks and are each correlated with different gene expression levels. We conclude that epigenetic modifications, gene density, and transcriptional activity combine to shape the local packing of the A. thaliana nuclear genome.


Assuntos
Arabidopsis/genética , Arabidopsis/metabolismo , Montagem e Desmontagem da Cromatina , Cromatina/metabolismo , Genômica , Análise por Conglomerados , Biologia Computacional/métodos , Epigênese Genética , Genoma de Planta , Genômica/métodos , Histonas/metabolismo , Elementos Isolantes
4.
PLoS Genet ; 11(10): e1005588, 2015 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-26492483

RESUMO

Plants integrate seasonal cues such as temperature and day length to optimally adjust their flowering time to the environment. Compared to the control of flowering before and after winter by the vernalization and day length pathways, mechanisms that delay or promote flowering during a transient cool or warm period, especially during spring, are less well understood. Due to global warming, understanding this ambient temperature pathway has gained increasing importance. In Arabidopsis thaliana, FLOWERING LOCUS M (FLM) is a critical flowering regulator of the ambient temperature pathway. FLM is alternatively spliced in a temperature-dependent manner and the two predominant splice variants, FLM-ß and FLM-δ, can repress and activate flowering in the genetic background of the A. thaliana reference accession Columbia-0. The relevance of this regulatory mechanism for the environmental adaptation across the entire range of the species is, however, unknown. Here, we identify insertion polymorphisms in the first intron of FLM as causative for accelerated flowering in many natural A. thaliana accessions, especially in cool (15°C) temperatures. We present evidence for a potential adaptive role of this structural variation and link it specifically to changes in the abundance of FLM-ß. Our results may allow predicting flowering in response to ambient temperatures in the Brassicaceae.


Assuntos
Proteínas de Arabidopsis/genética , Arabidopsis/genética , Flores/genética , Proteínas de Domínio MADS/genética , Mutagênese Insercional/genética , Processamento Alternativo/genética , Arabidopsis/crescimento & desenvolvimento , Proteínas de Arabidopsis/biossíntese , Regulação da Expressão Gênica de Plantas , Aquecimento Global , Proteínas de Domínio MADS/biossíntese , Polimorfismo Genético , Estações do Ano , Temperatura
5.
Plant Cell ; 26(10): 3894-910, 2014 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-25361952

RESUMO

Understanding the mechanisms underlying gene regulation is paramount to comprehend the translation from genotype to phenotype. The two are connected by gene expression, and it is generally thought that variation in transcription factor (TF) function is an important determinant of phenotypic evolution. We analyzed publicly available genome-wide chromatin immunoprecipitation experiments for 27 TFs in Arabidopsis thaliana and constructed an experimental network containing 46,619 regulatory interactions and 15,188 target genes. We identified hub targets and highly occupied target (HOT) regions, which are enriched for genes involved in development, stimulus responses, signaling, and gene regulatory processes in the currently profiled network. We provide several lines of evidence that TF binding at plant HOT regions is functional, in contrast to that in animals, and not merely the result of accessible chromatin. HOT regions harbor specific DNA motifs, are enriched for differentially expressed genes, and are often conserved across crucifers and dicots, even though they are not under higher levels of purifying selection than non-HOT regions. Distal bound regions are under purifying selection as well and are enriched for a chromatin state showing regulation by the Polycomb repressive complex. Gene expression complexity is positively correlated with the total number of bound TFs, revealing insights in the regulatory code for genes with different expression breadths. The integration of noncanonical and canonical DNA motif information yields new hypotheses on cobinding and tethering between specific TFs involved in flowering and light regulation.


Assuntos
Proteínas de Arabidopsis/metabolismo , DNA de Plantas/genética , Redes Reguladoras de Genes , Sequências Reguladoras de Ácido Nucleico/genética , Fatores de Transcrição/metabolismo , Arabidopsis/genética , Arabidopsis/metabolismo , Sequência de Bases , Sítios de Ligação/genética , Imunoprecipitação da Cromatina , DNA de Plantas/metabolismo , Bases de Dados Genéticas , Evolução Molecular , Perfilação da Expressão Gênica/estatística & dados numéricos , Regulação da Expressão Gênica de Plantas , Sequenciamento de Nucleotídeos em Larga Escala , Modelos Genéticos , Motivos de Nucleotídeos/genética , Análise de Sequência com Séries de Oligonucleotídeos , Ligação Proteica
6.
Plant Physiol ; 158(4): 1523-33, 2012 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-22345505

RESUMO

Protein interactions are fundamental to the molecular processes occurring within an organism and can be utilized in network biology to help organize, simplify, and understand biological complexity. Currently, there are more than 10 publicly available Arabidopsis (Arabidopsis thaliana) protein interaction databases. However, there are limitations with these databases, including different types of interaction evidence, a lack of defined standards for protein identifiers, differing levels of information, and, critically, a lack of integration between them. In this paper, we present an interactive bioinformatics Web tool, ANAP (Arabidopsis Network Analysis Pipeline), which serves to effectively integrate the different data sets and maximize access to available data. ANAP has been developed for Arabidopsis protein interaction integration and network-based study to facilitate functional protein network analysis. ANAP integrates 11 Arabidopsis protein interaction databases, comprising 201,699 unique protein interaction pairs, 15,208 identifiers (including 11,931 The Arabidopsis Information Resource Arabidopsis Genome Initiative codes), 89 interaction detection methods, 73 species that interact with Arabidopsis, and 6,161 references. ANAP can be used as a knowledge base for constructing protein interaction networks based on user input and supports both direct and indirect interaction analysis. It has an intuitive graphical interface allowing easy network visualization and provides extensive detailed evidence for each interaction. In addition, ANAP displays the gene and protein annotation in the generated interactive network with links to The Arabidopsis Information Resource, the AtGenExpress Visualization Tool, the Arabidopsis 1,001 Genomes GBrowse, the Protein Knowledgebase, the Kyoto Encyclopedia of Genes and Genomes, and the Ensembl Genome Browser to significantly aid functional network analysis. The tool is available open access at http://gmdd.shgmo.org/Computational-Biology/ANAP.


Assuntos
Proteínas de Arabidopsis/metabolismo , Arabidopsis/metabolismo , Bases de Dados de Proteínas , Bases de Conhecimento , Mapas de Interação de Proteínas
7.
Nucleic Acids Res ; 39(7): e45, 2011 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-21266471

RESUMO

With the advent of DNA sequencing technologies, more and more reference genome sequences are available for many organisms. Analyzing sequence variation and understanding its biological importance are becoming a major research aim. However, how to store and process the huge amount of eukaryotic genome data, such as those of the human, mouse and rice, has become a challenge to biologists. Currently available bioinformatics tools used to compress genome sequence data have some limitations, such as the requirement of the reference single nucleotide polymorphisms (SNPs) map and information on deletions and insertions. Here, we present a novel compression tool for storing and analyzing Genome ReSequencing data, named GRS. GRS is able to process the genome sequence data without the use of the reference SNPs and other sequence variation information and automatically rebuild the individual genome sequence data using the reference genome sequence. When its performance was tested on the first Korean personal genome sequence data set, GRS was able to achieve ∼159-fold compression, reducing the size of the data from 2986.8 to 18.8 MB. While being tested against the sequencing data from rice and Arabidopsis thaliana, GRS compressed the 361.0 MB rice genome data to 4.4 MB, and the A. thaliana genome data from 115.1 MB to 6.5 KB. This de novo compression tool is available at http://gmdd.shgmo.org/Computational-Biology/GRS.


Assuntos
Genômica/métodos , Análise de Sequência de DNA , Software , Arabidopsis/genética , Genoma Humano , Genoma de Planta , Humanos , Oryza/genética
8.
BMC Bioinformatics ; 11: 81, 2010 Feb 09.
Artigo em Inglês | MEDLINE | ID: mdl-20144209

RESUMO

BACKGROUND: ChIP-Seq, which combines chromatin immunoprecipitation (ChIP) with high-throughput massively parallel sequencing, is increasingly being used for identification of protein-DNA interactions in vivo in the genome. However, to maximize the effectiveness of data analysis of such sequences requires the development of new algorithms that are able to accurately predict DNA-protein binding sites. RESULTS: Here, we present SIPeS (Site Identification from Paired-end Sequencing), a novel algorithm for precise identification of binding sites from short reads generated by paired-end solexa ChIP-Seq technology. In this paper we used ChIP-Seq data from the Arabidopsis basic helix-loop-helix transcription factor ABORTED MICROSPORES (AMS), which is expressed within the anther during pollen development, the results show that SIPeS has better resolution for binding site identification compared to two existing ChIP-Seq peak detection algorithms, Cisgenome and MACS. CONCLUSIONS: When compared to Cisgenome and MACS, SIPeS shows better resolution for binding site discovery. Moreover, SIPeS is designed to calculate the mappable genome length accurately with the fragment length based on the paired-end reads. Dynamic baselines are also employed to effectively discriminate closely adjacent binding sites, for effective binding sites discovery, which is of particular value when working with high-density genomes.


Assuntos
Imunoprecipitação da Cromatina , Biologia Computacional/métodos , Proteínas de Ligação a DNA/química , DNA/metabolismo , Sítios de Ligação , DNA/química , Proteínas de Ligação a DNA/metabolismo , Genoma
9.
Sci Rep ; 8(1): 338, 2018 01 10.
Artigo em Inglês | MEDLINE | ID: mdl-29321648

RESUMO

Safety assessment for genetically modified organisms (GMOs) is required before their release. To date, miRNAs that play important roles in eukaryotic gene regulation have not been considered in the current assessment system. In this study, we identified 6 independent Bt and EPSPS GM rice lines using PCR and immune strip. We analyzed the expression levels of Cry1Ac and EPSPS using quantitative real-time PCR and western blot. Further, miRNAs from the developing seeds of the 6 GM rice lines and the wild-type line were investigated using deep sequencing and bioinformatic approaches. Although these GM lines have different types of integration sites, copy numbers, and levels of gene expression, 21 differentially expressed miRNAs have been found compared to wild type. There is no correlation between transgenic protein expression level and the quantity of differentially expressed miRNAs. This study provides useful data about the miRNA composition of GM plants, and it might be helpful for future risk assessments of miRNA-based GM plants.


Assuntos
Perfilação da Expressão Gênica , Regulação da Expressão Gênica de Plantas , MicroRNAs/genética , Oryza/genética , Transcriptoma , Biologia Computacional/métodos , Sequência Conservada , Ontologia Genética , Sequenciamento de Nucleotídeos em Larga Escala , Plantas Geneticamente Modificadas , Locos de Características Quantitativas , Interferência de RNA , RNA Mensageiro/genética , Análise de Sequência de RNA
10.
Plant Methods ; 14: 65, 2018.
Artigo em Inglês | MEDLINE | ID: mdl-30083222

RESUMO

BACKGROUND: Our knowledge of natural genetic variation is increasing at an extremely rapid pace, affording an opportunity to come to a much richer understanding of how effects of specific genes are dependent on the genetic background. To achieve a systematic understanding of such GxG interactions, it is desirable to develop genome editing tools that can be rapidly deployed across many different genetic varieties. RESULTS: We present an efficient CRISPR/Cas9 toolbox of super module (SM) vectors. These vectors are based on a previously described fluorescence protein marker expressed in seeds allowing identification of transgene-free mutants. We have used this vector series to delete genomic regions ranging from 1.7 to 13 kb in different natural accessions of the wild plant Arabidopsis thaliana. Based on results from 53 pairs of sgRNAs targeting individual nucleotide binding site leucine-rich repeat (NLR) genes, we provide a comprehensive overview of obtaining heritable deletions. CONCLUSIONS: The SM series of CRISPR/Cas9 vectors enables the rapid generation of transgene-free, genome edited plants for a diversity of functional studies.

11.
Sci Rep ; 7(1): 482, 2017 03 28.
Artigo em Inglês | MEDLINE | ID: mdl-28352080

RESUMO

Genome editing has emerged as a technology with a potential to revolutionize plant breeding. In this study, we report on generating, in less than ten months, Tomelo, a non-transgenic tomato variety resistant to the powdery mildew fungal pathogen using the CRISPR/Cas9 technology. We used whole-genome sequencing to show that Tomelo does not carry any foreign DNA sequences but only carries a deletion that is indistinguishable from naturally occurring mutations. We also present evidence for CRISPR/Cas9 being a highly precise tool, as we did not detect off-target mutations in Tomelo. Using our pipeline, mutations can be readily introduced into elite or locally adapted tomato varieties in less than a year with relatively minimal effort and investment.


Assuntos
Resistência à Doença/genética , Genoma de Planta , Doenças das Plantas/genética , Doenças das Plantas/microbiologia , Deleção de Sequência , Solanum lycopersicum/genética , Solanum lycopersicum/microbiologia , Transgenes , Sequência de Bases , Sistemas CRISPR-Cas , Edição de Genes , Marcação de Genes , Mutação , Fenótipo , Plantas Geneticamente Modificadas , Análise de Sequência de DNA
12.
GM Crops Food ; 4(2): 98-108, 2013.
Artigo em Inglês | MEDLINE | ID: mdl-23836108

RESUMO

The next generation sequencing technologies have been recently used for transcriptome analysis in many organisms because of the decreased sequencing cost and increased sequence output. In this study, we used digital gene expression (DGE) technique to compare the transcriptomic changes in mature seeds between transgenic maize overexpressing Aspergillus niger phyA2 and its non-transgenic counterpart. Deep sequencing of DGE libraries of the transgenic and its non-transgenic counterpart seeds generated 3,783,500 and 3,790,500 reads of 21-nucleotide, respectively, with frequencies spanning over four orders of magnitude. In transgenic maize, 53.97% of the unambiguous signature tags were mapped to the maize B73 reference genome, and 46.47% of genes were detected by at least two reads; in non-transgenic maize, the corresponding numbers were 51.38% and 47.39%. Compared with non-transgenic counterpart, about 12% of detected genes were differentially expressed in the transcriptome of transgenic maize seeds. Among these differentially expressed genes, there were 23 transcription factors in 14 families and no allergen genes. Pathway enrichment analysis revealed that 21 pathways were significantly affected by the transgenic event, in which the pathway involved in protein processing in endoplasmic reticulum was the most significantly affected. Results from this study indicated that both intended and unintended transcriptomic changes occurred in the transgenic maize, thus emphasizing the importance of transcriptome profiling in risk assessment of transgenic events.


Assuntos
6-Fitase/genética , Aspergillus niger/genética , Produtos Agrícolas/genética , Perfilação da Expressão Gênica/métodos , Regulação da Expressão Gênica de Plantas , Plantas Geneticamente Modificadas , Zea mays/genética , 6-Fitase/metabolismo , Biblioteca Gênica , Genoma de Planta , Sequenciamento de Nucleotídeos em Larga Escala , Sementes/genética , Zea mays/metabolismo
13.
Sci Rep ; 3: 2839, 2013 Oct 03.
Artigo em Inglês | MEDLINE | ID: mdl-24088728

RESUMO

Detection methods and data from molecular characterization of genetically modified (GM) events are needed by stakeholders of public risk assessors and regulators. Generally, the molecular characteristics of GM events are incomprehensively revealed by current approaches and biased towards detecting transformation vector derived sequences. GM events are classified based on available knowledge of the sequences of vectors and inserts (insert knowledge). Herein we present three insert knowledge-adapted approaches for characterization GM events (TT51-1 and T1c-19 rice as examples) based on paired-end re-sequencing with the advantages of comprehensiveness, accuracy, and automation. The comprehensive molecular characteristics of two rice events were revealed with additional unintended insertions comparing with the results from PCR and Southern blotting. Comprehensive transgene characterization of TT51-1 and T1c-19 is shown to be independent of a priori knowledge of the insert and vector sequences employing the developed approaches. This provides an opportunity to identify and characterize also unknown GM events.


Assuntos
Biologia Computacional , DNA de Plantas/genética , Mutagênese Insercional/genética , Oryza/genética , Plantas Geneticamente Modificadas/genética , Análise de Sequência de DNA/métodos , Transgenes/genética , Simulação por Computador , Vetores Genéticos/genética , Reação em Cadeia da Polimerase
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA