Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 22
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
BMC Genomics ; 22(1): 23, 2021 Jan 06.
Artigo em Inglês | MEDLINE | ID: mdl-33407087

RESUMO

BACKGROUND: Three-dimensional chromatin loop structures connect regulatory elements to their target genes in regions known as anchors. In complex plant genomes, such as maize, it has been proposed that loops span heterochromatic regions marked by higher repeat content, but little is known on their spatial organization and genome-wide occurrence in relation to transcriptional activity. RESULTS: Here, ultra-deep Hi-C sequencing of maize B73 leaf tissue was combined with gene expression and open chromatin sequencing for chromatin loop discovery and correlation with hierarchical topologically-associating domains (TADs) and transcriptional activity. A majority of all anchors are shared between multiple loops from previous public maize high-resolution interactome datasets, suggesting a highly dynamic environment, with a conserved set of anchors involved in multiple interaction networks. Chromatin loop interiors are marked by higher repeat contents than the anchors flanking them. A small fraction of high-resolution interaction anchors, fully embedded in larger chromatin loops, co-locate with active genes and putative protein-binding sites. Combinatorial analyses indicate that all anchors studied here co-locate with at least 81.5% of expressed genes and 74% of open chromatin regions. Approximately 38% of all Hi-C chromatin loops are fully embedded within hierarchical TAD-like domains, while the remaining ones share anchors with domain boundaries or with distinct domains. Those various loop types exhibit specific patterns of overlap for open chromatin regions and expressed genes, but no apparent pattern of gene expression. In addition, up to 63% of all unique variants derived from a prior public maize eQTL dataset overlap with Hi-C loop anchors. Anchor annotation suggests that < 7% of all loops detected here are potentially devoid of any genes or regulatory elements. The overall organization of chromatin loop anchors in the maize genome suggest a loop modeling system hypothesized to resemble phase separation of repeat-rich regions. CONCLUSIONS: Sets of conserved chromatin loop anchors mapping to hierarchical domains contains core structural components of the gene expression machinery in maize. The data presented here will be a useful reference to further investigate their function in regard to the formation of transcriptional complexes and the regulation of transcriptional activity in the maize genome.


Assuntos
Cromatina , Zea mays , Cromatina/genética , Montagem e Desmontagem da Cromatina , Expressão Gênica , Genoma de Planta , Zea mays/genética
2.
Front Plant Sci ; 7: 225, 2016.
Artigo em Inglês | MEDLINE | ID: mdl-26941773

RESUMO

The Amaryllidaceae alkaloids are a family of amino acid derived alkaloids with many biological activities; examples include haemanthamine, haemanthidine, galanthamine, lycorine, and maritidine. Central to the biosynthesis of the majority of these alkaloids is a C-C phenol-coupling reaction that can have para-para', para-ortho', or ortho-para' regiospecificity. Through comparative transcriptomics of Narcissus sp. aff. pseudonarcissus, Galanthus sp., and Galanthus elwesii we have identified a para-para' C-C phenol coupling cytochrome P450, CYP96T1, capable of forming the products (10bR,4aS)-noroxomaritidine and (10bS,4aR)-noroxomaritidine from 4'-O-methylnorbelladine. CYP96T1 was also shown to catalyzed formation of the para-ortho' phenol coupled product, N-demethylnarwedine, as less than 1% of the total product. CYP96T1 co-expresses with the previously characterized norbelladine 4'-O-methyltransferase. The discovery of CYP96T1 is of special interest because it catalyzes the first major branch in Amaryllidaceae alkaloid biosynthesis. CYP96T1 is also the first phenol-coupling enzyme characterized from a monocot.

3.
PLoS One ; 10(6): e0130328, 2015.
Artigo em Inglês | MEDLINE | ID: mdl-26091106

RESUMO

The Asian citrus psyllid (ACP) Diaphorina citri Kuwayama (Hemiptera: Psyllidae) is the insect vector of the fastidious bacterium Candidatus Liberibacter asiaticus (CLas), the causal agent of citrus greening disease, or Huanglongbing (HLB). The widespread invasiveness of the psyllid vector and HLB in citrus trees worldwide has underscored the need for non-traditional approaches to manage the disease. One tenable solution is through the deployment of RNA interference technology to silence protein-protein interactions essential for ACP-mediated CLas invasion and transmission. To identify psyllid interactor-bacterial effector combinations associated with psyllid-CLas interactions, cDNA libraries were constructed from CLas-infected and CLas-free ACP adults and nymphs, and analyzed for differential expression. Library assemblies comprised 24,039,255 reads and yielded 45,976 consensus contigs. They were annotated (UniProt), classified using Gene Ontology, and subjected to in silico expression analyses using the Transcriptome Computational Workbench (TCW) (http://www.sohomoptera.org/ACPPoP/). Functional-biological pathway interpretations were carried out using the Kyoto Encyclopedia of Genes and Genomes databases. Differentially expressed contigs in adults and/or nymphs represented genes and/or metabolic/pathogenesis pathways involved in adhesion, biofilm formation, development-related, immunity, nutrition, stress, and virulence. Notably, contigs involved in gene silencing and transposon-related responses were documented in a psyllid for the first time. This is the first comparative transcriptomic analysis of ACP adults and nymphs infected and uninfected with CLas. The results provide key initial insights into host-parasite interactions involving CLas effectors that contribute to invasion-virulence, and to host nutritional exploitation and immune-related responses that appear to be essential for successful ACP-mediated circulative, propagative CLas transmission.


Assuntos
Hemípteros/microbiologia , Insetos Vetores/microbiologia , Doenças das Plantas/microbiologia , Rhizobiaceae/fisiologia , Fenômenos Fisiológicos da Nutrição Animal/imunologia , Animais , Citrus/microbiologia , Citrus/parasitologia , Mapeamento de Sequências Contíguas , Elementos de DNA Transponíveis , Regulação da Expressão Gênica no Desenvolvimento/imunologia , Ontologia Genética , Genes de Insetos , Hemípteros/crescimento & desenvolvimento , Hemípteros/imunologia , Interações Hospedeiro-Patógeno , Imunidade Inata , Insetos Vetores/fisiologia , Anotação de Sequência Molecular , Ninfa/microbiologia , Ninfa/fisiologia , Transdução de Sinais , Transcriptoma
4.
Pathogens ; 3(4): 875-907, 2014 Nov 03.
Artigo em Inglês | MEDLINE | ID: mdl-25436509

RESUMO

The potato psyllid (PoP) Bactericera cockerelli (Sulc) and Asian citrus psyllid (ACP) Diaphorina citri Kuwayama are the insect vectors of the fastidious plant pathogen, Candidatus Liberibacter solanacearum (CLso) and Ca. L. asiaticus (CLas), respectively. CLso causes Zebra chip disease of potato and vein-greening in solanaceous species, whereas, CLas causes citrus greening disease. The reliance on insecticides for vector management to reduce pathogen transmission has increased interest in alternative approaches, including RNA interference to abate expression of genes essential for psyllid-mediated Ca. Liberibacter transmission. To identify genes with significantly altered expression at different life stages and conditions of CLso/CLas infection, cDNA libraries were constructed for CLso-infected and -uninfected PoP adults and nymphal instars. Illumina sequencing produced 199,081,451 reads that were assembled into 82,224 unique transcripts. PoP and the analogous transcripts from ACP adult and nymphs reported elsewhere were annotated, organized into functional gene groups using the Gene Ontology classification system, and analyzed for differential in silico expression. Expression profiles revealed vector life stage differences and differential gene expression associated with Liberibacter infection of the psyllid host, including invasion, immune system modulation, nutrition, and development.

5.
PLoS One ; 9(7): e103223, 2014.
Artigo em Inglês | MEDLINE | ID: mdl-25061748

RESUMO

Galanthamine is an Amaryllidaceae alkaloid used to treat the symptoms of Alzheimer's disease. This compound is primarily isolated from daffodil (Narcissus spp.), snowdrop (Galanthus spp.), and summer snowflake (Leucojum aestivum). Despite its importance as a medicine, no genes involved in the biosynthetic pathway of galanthamine have been identified. This absence of genetic information on biosynthetic pathways is a limiting factor in the development of synthetic biology platforms for many important botanical medicines. The paucity of information is largely due to the limitations of traditional methods for finding biochemical pathway enzymes and genes in non-model organisms. A new bioinformatic approach using several recent technological improvements was applied to search for genes in the proposed galanthamine biosynthetic pathway, first targeting methyltransferases due to strong signature amino acid sequences in the proteins. Using Illumina sequencing, a de novo transcriptome assembly was constructed for daffodil. BLAST was used to identify sequences that contain signatures for plant O-methyltransferases in this transcriptome. The program HAYSTACK was then used to identify methyltransferases that fit a model for galanthamine biosynthesis in leaf, bulb and inflorescence tissues. One candidate gene for the methylation of norbelladine to 4'-O-methylnorbelladine in the proposed galanthamine biosynthetic pathway was identified. This methyltransferase cDNA was expressed in E. coli and the protein purified by affinity chromatography. The resulting protein was found to be a norbelladine 4'-O-methyltransferase (NpN4OMT) of the proposed galanthamine biosynthetic pathway.


Assuntos
Alcaloides/metabolismo , Galantamina/metabolismo , Narcissus/enzimologia , Proteína O-Metiltransferase/genética , Alcaloides/genética , Alcaloides/uso terapêutico , Doença de Alzheimer/tratamento farmacológico , Doença de Alzheimer/patologia , Sequência de Aminoácidos , Clonagem Molecular , DNA Complementar , Escherichia coli , Galantamina/genética , Galantamina/uso terapêutico , Humanos , Narcissus/química , Narcissus/genética , Proteína O-Metiltransferase/isolamento & purificação , Proteína O-Metiltransferase/metabolismo
6.
BMC Plant Biol ; 14: 46, 2014 Feb 12.
Artigo em Inglês | MEDLINE | ID: mdl-24521476

RESUMO

BACKGROUND: The rhizome, the original stem of land plants, enables species to invade new territory and is a critical component of perenniality, especially in grasses. Red rice (Oryza longistaminata) is a perennial wild rice species with many valuable traits that could be used to improve cultivated rice cultivars, including rhizomatousness, disease resistance and drought tolerance. Despite these features, little is known about the molecular mechanisms that contribute to rhizome growth, development and function in this plant. RESULTS: We used an integrated approach to compare the transcriptome, proteome and metabolome of the rhizome to other tissues of red rice. 116 Gb of transcriptome sequence was obtained from various tissues and used to identify rhizome-specific and preferentially expressed genes, including transcription factors and hormone metabolism and stress response-related genes. Proteomics and metabolomics approaches identified 41 proteins and more than 100 primary metabolites and plant hormones with rhizome preferential accumulation. Of particular interest was the identification of a large number of gene transcripts from Magnaportha oryzae, the fungus that causes rice blast disease in cultivated rice, even though the red rice plants showed no sign of disease. CONCLUSIONS: A significant set of genes, proteins and metabolites appear to be specifically or preferentially expressed in the rhizome of O. longistaminata. The presence of M. oryzae gene transcripts at a high level in apparently healthy plants suggests that red rice is resistant to this pathogen, and may be able to provide genes to cultivated rice that will enable resistance to rice blast disease.


Assuntos
Oryza/metabolismo , Rizoma/metabolismo , Resistência à Doença/genética , Perfilação da Expressão Gênica , Regulação da Expressão Gênica de Plantas , Oryza/genética , Oryza/fisiologia , Rizoma/genética , Rizoma/fisiologia , Transcriptoma/genética
7.
Viruses ; 6(2): 448-75, 2014 Jan 27.
Artigo em Inglês | MEDLINE | ID: mdl-24473341

RESUMO

Development of a vaccine against congenital infection with human cytomegalovirus is complicated by the issue of re-infection, with subsequent vertical transmission, in women with pre-conception immunity to the virus. The study of experimental therapeutic prevention of re-infection would ideally be undertaken in a small animal model, such as the guinea pig cytomegalovirus (GPCMV) model, prior to human clinical trials. However, the ability to model re-infection in the GPCMV model has been limited by availability of only one strain of virus, the 22122 strain, isolated in 1957. In this report, we describe the isolation of a new GPCMV strain, the CIDMTR strain. This strain demonstrated morphological characteristics of a typical Herpesvirinae by electron microscopy. Illumina and PacBio sequencing demonstrated a genome of 232,778 nt. Novel open reading frames ORFs not found in reference strain 22122 included an additional MHC Class I homolog near the right genome terminus. The CIDMTR strain was capable of dissemination in immune compromised guinea pigs, and was found to be capable of congenital transmission in GPCMV-immune dams previously infected with salivary gland­adapted strain 22122 virus. The availability of a new GPCMV strain should facilitate study of re-infection in this small animal model.


Assuntos
DNA Viral/química , DNA Viral/genética , Genoma Viral , Infecções por Roseolovirus/veterinária , Roseolovirus/isolamento & purificação , Animais , Feminino , Cobaias , Transmissão Vertical de Doenças Infecciosas , Microscopia Eletrônica de Transmissão , Dados de Sequência Molecular , Gravidez , Roseolovirus/genética , Roseolovirus/fisiologia , Roseolovirus/ultraestrutura , Infecções por Roseolovirus/transmissão , Infecções por Roseolovirus/virologia , Análise de Sequência de DNA , Vírion/ultraestrutura
8.
Genome Announc ; 1(6)2013 Dec 26.
Artigo em Inglês | MEDLINE | ID: mdl-24371200

RESUMO

The sequence of a newly discovered isolate of guinea pig cytomegalovirus (GPCMV), the CIDMTR strain, was determined. The 232,778-nucleotide genome was generally well conserved with that of the 22122 reference strain, although some regions of substantial sequence divergence allowed annotation of strain-specific open reading frames encoding putative immune modulation gene products.

9.
J Biol Chem ; 288(1): 466-79, 2013 Jan 04.
Artigo em Inglês | MEDLINE | ID: mdl-23161544

RESUMO

Podophyllum species are sources of (-)-podophyllotoxin, an aryltetralin lignan used for semi-synthesis of various powerful and extensively employed cancer-treating drugs. Its biosynthetic pathway, however, remains largely unknown, with the last unequivocally demonstrated intermediate being (-)-matairesinol. Herein, massively parallel sequencing of Podophyllum hexandrum and Podophyllum peltatum transcriptomes and subsequent bioinformatics analyses of the corresponding assemblies were carried out. Validation of the assembly process was first achieved through confirmation of assembled sequences with those of various genes previously established as involved in podophyllotoxin biosynthesis as well as other candidate biosynthetic pathway genes. This contribution describes characterization of two of the latter, namely the cytochrome P450s, CYP719A23 from P. hexandrum and CYP719A24 from P. peltatum. Both enzymes were capable of converting (-)-matairesinol into (-)-pluviatolide by catalyzing methylenedioxy bridge formation and did not act on other possible substrates tested. Interestingly, the enzymes described herein were highly similar to methylenedioxy bridge-forming enzymes from alkaloid biosynthesis, whereas candidates more similar to lignan biosynthetic enzymes were catalytically inactive with the substrates employed. This overall strategy has thus enabled facile further identification of enzymes putatively involved in (-)-podophyllotoxin biosynthesis and underscores the deductive power of next generation sequencing and bioinformatics to probe and deduce medicinal plant biosynthetic pathways.


Assuntos
Plantas Medicinais/metabolismo , Podofilotoxina/biossíntese , Podophyllum/metabolismo , Análise de Sequência de DNA/métodos , Sequência de Aminoácidos , Catálise , Biologia Computacional/métodos , Sistema Enzimático do Citocromo P-450/metabolismo , Bases de Dados Factuais , Regulação da Expressão Gênica de Plantas , Lignanas/química , Microssomos/metabolismo , Modelos Biológicos , Modelos Químicos , Dados de Sequência Molecular , Extratos Vegetais/química , Homologia de Sequência de Aminoácidos , Transcriptoma
10.
BMC Genomics ; 13: 568, 2012 Oct 29.
Artigo em Inglês | MEDLINE | ID: mdl-23107476

RESUMO

BACKGROUND: Alfalfa, a perennial, outcrossing species, is a widely planted forage legume producing highly nutritious biomass. Currently, improvement of cultivated alfalfa mainly relies on recurrent phenotypic selection. Marker assisted breeding strategies can enhance alfalfa improvement efforts, particularly if many genome-wide markers are available. Transcriptome sequencing enables efficient high-throughput discovery of single nucleotide polymorphism (SNP) markers for a complex polyploid species. RESULT: The transcriptomes of 27 alfalfa genotypes, including elite breeding genotypes, parents of mapping populations, and unimproved wild genotypes, were sequenced using an Illumina Genome Analyzer IIx. De novo assembly of quality-filtered 72-bp reads generated 25,183 contigs with a total length of 26.8 Mbp and an average length of 1,065 bp, with an average read depth of 55.9-fold for each genotype. Overall, 21,954 (87.2%) of the 25,183 contigs represented 14,878 unique protein accessions. Gene ontology (GO) analysis suggested that a broad diversity of genes was represented in the resulting sequences. The realignment of individual reads to the contigs enabled the detection of 872,384 SNPs and 31,760 InDels. High resolution melting (HRM) analysis was used to validate 91% of 192 putative SNPs identified by sequencing. Both allelic variants at about 95% of SNP sites identified among five wild, unimproved genotypes are still present in cultivated alfalfa, and all four US breeding programs also contain a high proportion of these SNPs. Thus, little evidence exists among this dataset for loss of significant DNA sequence diversity from either domestication or breeding of alfalfa. Structure analysis indicated that individuals from the subspecies falcata, the diploid subspecies caerulea, and the tetraploid subspecies sativa (cultivated tetraploid alfalfa) were clearly separated. CONCLUSION: We used transcriptome sequencing to discover large numbers of SNPs segregating in elite breeding populations of alfalfa. Little loss of SNP diversity was evident between unimproved and elite alfalfa germplasm. The EST and SNP markers generated from this study are publicly available at the Legume Information System ( http://medsa.comparative-legumes.org/) and can contribute to future alfalfa research and breeding applications.


Assuntos
Genes de Plantas , Marcadores Genéticos , Medicago sativa/genética , Polimorfismo de Nucleotídeo Único , Transcriptoma , Alelos , Cruzamento , Genótipo , Mutação INDEL , Medicago sativa/classificação , Desnaturação de Ácido Nucleico , Filogenia , Ploidias , Análise de Componente Principal , Análise de Sequência de DNA
11.
Am J Bot ; 99(2): 232-47, 2012 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-22301892

RESUMO

PREMISE OF THE STUDY: The common reed (Phragmites australis), one of the most widely distributed of all angiosperms, uses its rhizomes (underground stems) to invade new territory, making it one of the most successful weedy species worldwide. Characterization of the rhizome transcriptome and proteome is needed to identify candidate genes and proteins involved in rhizome growth, development, metabolism, and invasiveness. METHODS: We employed next-generation sequencing technologies including 454 and Illumina platforms to characterize the reed rhizome transcriptome and used quantitative proteomics techniques to identify the rhizome proteome. KEY RESULTS: Combining 336514 Roche 454 Titanium reads and 103350802 Illumina paired-end reads in a de novo hybrid assembly yielded 124450 unique transcripts with an average length of 549 bp, of which 54317 were annotated. Rhizome-specific and differentially expressed transcripts were identified between rhizome apical tips (apical meristematic region) and rhizome elongation zones. A total of 1280 nonredundant proteins were identified and quantified using GeLC-MS/MS based label-free proteomics, where 174 and 77 proteins were preferentially expressed in the rhizome elongation zone and apical tip tissues, respectively. Genes involved in allelopathy and in controlling development and potentially invasiveness were identified. CONCLUSIONS: In addition to being a valuable sequence and protein data resource for studying plant rhizome species, our results provide useful insights into identifying specific genes and proteins with potential roles in rhizome differentiation, development, and function.


Assuntos
Perfilação da Expressão Gênica , Genes de Plantas , Poaceae/genética , Proteômica , Rizoma/genética , Sequência de Bases , Cromatografia Líquida , Bases de Dados Genéticas , Regulação da Expressão Gênica de Plantas , Espécies Introduzidas , Espectrometria de Massas , Meristema/genética , Meristema/metabolismo , Anotação de Sequência Molecular , Proteínas de Plantas/análise , Proteínas de Plantas/genética , Proteínas de Plantas/metabolismo , Poaceae/crescimento & desenvolvimento , Poaceae/metabolismo , RNA de Plantas/genética , Rizoma/crescimento & desenvolvimento , Rizoma/metabolismo , Análise de Sequência de RNA , Especificidade da Espécie , Fatores de Transcrição/genética , Transcriptoma
12.
Mol Plant ; 5(5): 1020-8, 2012 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-22241453

RESUMO

A comprehensive transcriptome assembly for pigeonpea has been developed by analyzing 128.9 million short Illumina GA IIx single end reads, 2.19 million single end FLX/454 reads, and 18 353 Sanger expressed sequenced tags from more than 16 genotypes. The resultant transcriptome assembly, referred to as CcTA v2, comprised 21 434 transcript assembly contigs (TACs) with an N50 of 1510 bp, the largest one being ~8 kb. Of the 21 434 TACs, 16 622 (77.5%) could be mapped on to the soybean genome build 1.0.9 under fairly stringent alignment parameters. Based on knowledge of intron junctions, 10 009 primer pairs were designed from 5033 TACs for amplifying intron spanning regions (ISRs). By using in silico mapping of BAC-end-derived SSR loci of pigeonpea on the soybean genome as a reference, putative mapping positions at the chromosome level were predicted for 6284 ISR markers, covering all 11 pigeonpea chromosomes. A subset of 128 ISR markers were analyzed on a set of eight genotypes. While 116 markers were validated, 70 markers showed one to three alleles, with an average of 0.16 polymorphism information content (PIC) value. In summary, the CcTA v2 transcript assembly and ISR markers will serve as a useful resource to accelerate genetic research and breeding applications in pigeonpea.


Assuntos
Cajanus/genética , Análise de Sequência de DNA/métodos , Transcriptoma , Cajanus/classificação , Mapeamento Cromossômico , Fabaceae/classificação , Fabaceae/genética , Genótipo , Filogenia , Polimorfismo de Nucleotídeo Único
13.
Am J Bot ; 98(9): 1504-10, 2011 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-21875969

RESUMO

UNLABELLED: • PREMISE OF THE STUDY: The Fungal Subcellular Ontology used in the Assembling the Fungal Tree of Life project is a taxon-wide ontology (controlled vocabulary for attributes) designed to clarify and integrate the broad range of subcellular characters and character states used in higher-level fungal systematics. As in the algae, cellular characters are important phylogenetic markers in kingdom Fungi. The Fungal Subcellular Ontology has been developed primarily to help researchers, especially systematists, in their search for information on subcellular characters across the Fungi, and it complements existing biological ontologies, including the Gene Ontology. • METHODS: The character and character state data set used in the Assembling the Fungal Tree of Life Structural and Biochemical Database (http://aftol.umn.edu) is the source of terms for generating the ontology. After the terms were accessioned and defined, they were combined in OBO-Edit file format, and the ontology was edited using OBO-Edit, an open source Java tool supported by the Gene Ontology project. • KEY RESULTS: The Fungal Subcellular Ontology covers both model and nonmodel fungi in great detail and is downloadable in OBO-Edit format at website http://aftol.umn.edu/ontology/fungal_subcellular.obo. • CONCLUSIONS: The ontology provides a controlled vocabulary of fungal subcellular terms and functions as an operating framework for the Assembling the Fungal Tree of Life Structural and Biochemical Database. An ontology-based design enhances reuse of data deposited in the Structural and Biochemical Database from other independent biological and genetic databases. Data integration approaches that advance access to data from the diversity of biological databases are imperative as interdisciplinary research gains importance. In this sense, the Fungal Subcellular Ontology becomes highly relevant to mycologists as well as nonmycologists because fungi interact actively as symbionts and parasites or passively with many other life forms.


Assuntos
Fungos/metabolismo , Frações Subcelulares/metabolismo
14.
BMC Plant Biol ; 10: 280, 2010 Dec 17.
Artigo em Inglês | MEDLINE | ID: mdl-21167045

RESUMO

BACKGROUND: Meiosis is a critical process in the reproduction and life cycle of flowering plants in which homologous chromosomes pair, synapse, recombine and segregate. Understanding meiosis will not only advance our knowledge of the mechanisms of genetic recombination, but also has substantial applications in crop improvement. Despite the tremendous progress in the past decade in other model organisms (e.g., Saccharomyces cerevisiae and Drosophila melanogaster), the global identification of meiotic genes in flowering plants has remained a challenge due to the lack of efficient methods to collect pure meiocytes for analyzing the temporal and spatial gene expression patterns during meiosis, and for the sensitive identification and quantitation of novel genes. RESULTS: A high-throughput approach to identify meiosis-specific genes by combining isolated meiocytes, RNA-Seq, bioinformatic and statistical analysis pipelines was developed. By analyzing the studied genes that have a meiosis function, a pipeline for identifying meiosis-specific genes has been defined. More than 1,000 genes that are specifically or preferentially expressed in meiocytes have been identified as candidate meiosis-specific genes. A group of 55 genes that have mitochondrial genome origins and a significant number of transposable element (TE) genes (1,036) were also found to have up-regulated expression levels in meiocytes. CONCLUSION: These findings advance our understanding of meiotic genes, gene expression and regulation, especially the transcript profiles of MGI genes and TE genes, and provide a framework for functional analysis of genes in meiosis.


Assuntos
Arabidopsis/genética , Perfilação da Expressão Gênica/métodos , Meiose/genética , Pólen/genética , Análise de Variância , Proteínas de Arabidopsis/genética , Mapeamento Cromossômico , Cromossomos de Plantas/genética , Análise por Conglomerados , Biologia Computacional/métodos , Elementos de DNA Transponíveis/genética , Regulação da Expressão Gênica de Plantas , Biblioteca Gênica , Pólen/citologia , Análise de Sequência de DNA
15.
J Comput Sci Syst Biol ; 1: 132, 2008 Dec 26.
Artigo em Inglês | MEDLINE | ID: mdl-20151039

RESUMO

High-throughput DNA sequencing has enabled systems biology to begin to address areas in health, agricultural and basic biological research. Concomitant with the opportunities is an absolute necessity to manage significant volumes of high-dimensional and inter-related data and analysis. Alpheus is an analysis pipeline, database and visualization software for use with massively parallel DNA sequencing technologies that feature multi-gigabase throughput characterized by relatively short reads, such as Illumina-Solexa (sequencing-by-synthesis), Roche-454 (pyrosequencing) and Applied Biosystem's SOLiD (sequencing-by-ligation). Alpheus enables alignment to reference sequence(s), detection of variants and enumeration of sequence abundance, including expression levels in transcriptome sequence. Alpheus is able to detect several types of variants, including non-synonymous and synonymous single nucleotide polymorphisms (SNPs), insertions/deletions (indels), premature stop codons, and splice isoforms. Variant detection is aided by the ability to filter variant calls based on consistency, expected allele frequency, sequence quality, coverage, and variant type in order to minimize false positives while maximizing the identification of true positives. Alpheus also enables comparisons of genes with variants between cases and controls or bulk segregant pools. Sequence-based differential expression comparisons can be developed, with data export to SAS JMP Genomics for statistical analysis.

16.
Nucleic Acids Res ; 35(Database issue): D888-94, 2007 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-17130142

RESUMO

ForestTreeDB is intended as a resource that centralizes large-scale expressed sequence tag (EST) sequencing results from several tree species (http://foresttree.org/ftdb). It currently encompasses 344,878 quality sequences from 68 libraries, from diverse organs of conifer and hybrid poplar trees. It utilizes the Nimbus data model to provide a hosting system for multiple projects, and uses object-relational mapping APIs in Java and Perl for data accesses within an Oracle database designed to be scalable, maintainable and extendable. Transcriptome builds or unigene sets occupy the focal point of the system. Several of the five current species-specific unigenes were used to design microarrays and SNP resources. The ForestTreeDB web application provides the means for multiple combination database queries. It presents the user with a list of discrete queries to retrieve and download large EST datasets or sequences from precompiled unigene assemblies. Functional annotation assignment is not trivial in conifers which are distantly related to angiosperm model plants. Optimal annotations are achieved through database queries that integrate results from several procedures based open-source tools. ForestTreeDB aims to facilitate sequence mining of coherent annotations in multiple species to support comparative genomic approaches. We plan to continuously enrich ForestTreeDB with other resources through collaborations with other genomic projects.


Assuntos
Bases de Dados de Ácidos Nucleicos , Etiquetas de Sequências Expressas/química , Populus/genética , Traqueófitas/genética , Perfilação da Expressão Gênica , Genômica , Internet , Polimorfismo de Nucleotídeo Único , Transcrição Gênica , Árvores/genética , Interface Usuário-Computador
17.
Methods Mol Biol ; 406: 261-74, 2007.
Artigo em Inglês | MEDLINE | ID: mdl-18287697

RESUMO

To identify the genes and gene functions that underlie key aspects of legume biology, researchers have selected the cool season legume Medicago truncatula as a model system for legume research. The mission of the M. truncatula Consortium is to promote unrestricted sharing of data and information that are provided by Medicago research groups worldwide. Through integration of a variety of data and tools, the medicago.org site intends to facilitate progress in the fields of structural, comparative, and functional genomics. To this goal, and as a consortium partner, the Center for Computational Genomics and Bioinformatics (CCGB) at the University of Minnesota has developed MtDB2.0, the M. truncatula database version 2.0. The MtDB2.0 database is the first step toward the global integration of M. truncatula genomic, genetic, and biological information. MtDB2.0 is a relational database that integrates M. truncatula transcriptome data and provides a wide range of user-defined data mining options. The database is interrogated through a series of interfaces, with 58 options grouped into two filters. Sequence identifiers from all public M. truncatula sites [e.g., IDs from GenBank, CCGB, The Institute for Genomic Research (TIGR), National Center for Genome Resources (NCGR), and I'Institut National de la Recherche Agronomique (INRA)] are fully cross-referenced to facilitate comparisons between different sites, and hypertext links to the appropriate database records are provided for all queries' results. MtDB's goal is to provide researchers with the means to quickly and independently identify sequences that match specific research interests based on user-defined criteria. MtDB2.0 offers unrestricted access to advanced and powerful querying tools unmatched by any other public databases. Structurad Query Language (SQL)-encoded queries with a Java-based Web user interface, incorporate different filtering that allow sophisticated data mining of the expressed sequence tag sequencing project results, including the CCGB M. truncatula Unigene set generated with the Phrap assembler. The underlying database and query software have been designed for ease of updates and portability to other model organisms. Public access to the database is at http://www.medicago.org/MtDB.


Assuntos
Bases de Dados Genéticas , Armazenamento e Recuperação da Informação/métodos , Medicago/genética , Biologia Computacional/métodos , Fabaceae/genética , Perfilação da Expressão Gênica , Regulação da Expressão Gênica de Plantas , Minnesota , Interface Usuário-Computador
18.
BMC Genomics ; 6: 144, 2005 Oct 19.
Artigo em Inglês | MEDLINE | ID: mdl-16236172

RESUMO

BACKGROUND: The sequencing and analysis of ESTs is for now the only practical approach for large-scale gene discovery and annotation in conifers because their very large genomes are unlikely to be sequenced in the near future. Our objective was to produce extensive collections of ESTs and cDNA clones to support manufacture of cDNA microarrays and gene discovery in white spruce (Picea glauca [Moench] Voss). RESULTS: We produced 16 cDNA libraries from different tissues and a variety of treatments, and partially sequenced 50,000 cDNA clones. High quality 3' and 5' reads were assembled into 16,578 consensus sequences, 45% of which represented full length inserts. Consensus sequences derived from 5' and 3' reads of the same cDNA clone were linked to define 14,471 transcripts. A large proportion (84%) of the spruce sequences matched a pine sequence, but only 68% of the spruce transcripts had homologs in Arabidopsis or rice. Nearly all the sequences that matched the Populus trichocarpa genome (the only sequenced tree genome) also matched rice or Arabidopsis genomes. We used several sequence similarity search approaches for assignment of putative functions, including blast searches against general and specialized databases (transcription factors, cell wall related proteins), Gene Ontology term assignation and Hidden Markov Model searches against PFAM protein families and domains. In total, 70% of the spruce transcripts displayed matches to proteins of known or unknown function in the Uniref100 database (blastx e-value < 1e-10). We identified multigenic families that appeared larger in spruce than in the Arabidopsis or rice genomes. Detailed analysis of translationally controlled tumour proteins and S-adenosylmethionine synthetase families confirmed a twofold size difference. Sequences and annotations were organized in a dedicated database, SpruceDB. Several search tools were developed to mine the data either based on their occurrence in the cDNA libraries or on functional annotations. CONCLUSION: This report illustrates specific approaches for large-scale gene discovery and annotation in an organism that is very distantly related to any of the fully sequenced genomes. The ArboreaSet sequences and cDNA clones represent a valuable resource for investigations ranging from plant comparative genomics to applied conifer genetics.


Assuntos
Etiquetas de Sequências Expressas , Regulação da Expressão Gênica de Plantas , Genes de Plantas , Picea/genética , Arabidopsis/genética , Parede Celular/metabolismo , Análise por Conglomerados , Mapeamento de Sequências Contíguas , Citoesqueleto/metabolismo , DNA Complementar/metabolismo , Bases de Dados como Assunto , Bases de Dados Genéticas , Biblioteca Gênica , Genoma de Planta , Genômica , Família Multigênica , Oryza/genética , RNA Mensageiro/metabolismo , Software
19.
Plant Physiol ; 138(1): 38-46, 2005 May.
Artigo em Inglês | MEDLINE | ID: mdl-15888676

RESUMO

An international consortium is sequencing the euchromatic genespace of Medicago truncatula. Extensive bioinformatic and database resources support the marker-anchored bacterial artificial chromosome (BAC) sequencing strategy. Existing physical and genetic maps and deep BAC-end sequencing help to guide the sequencing effort, while EST databases provide essential resources for genome annotation as well as transcriptome characterization and microarray design. Finished BAC sequences are joined into overlapping sequence assemblies and undergo an automated annotation process that integrates ab initio predictions with EST, protein, and other recognizable features. Because of the sequencing project's international and collaborative nature, data production, storage, and visualization tools are broadly distributed. This paper describes databases and Web resources for the project, which provide support for physical and genetic maps, genome sequence assembly, gene prediction, and integration of EST data. A central project Web site at medicago.org/genome provides access to genome viewers and other resources project-wide, including an Ensembl implementation at medicago.org, physical map and marker resources at mtgenome.ucdavis.edu, and genome viewers at the University of Oklahoma (www.genome.ou.edu), the Institute for Genomic Research (www.tigr.org), and Munich Information for Protein Sequences Center (mips.gsf.de).


Assuntos
Bases de Dados Genéticas , Genoma de Planta , Internet , Medicago truncatula/genética , Transcrição Gênica , Sequência de Bases , Cromossomos Artificiais Bacterianos
20.
Plant Physiol ; 133(2): 510-6, 2003 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-14555780

RESUMO

As structural and functional genomics efforts provide the biological community with ever-broadening sets of interrelated data, the need to explore such complex information for subtle relationships expands. We present wCLUTO, a Web-enabled version of the stand-alone application CLUTO, designed to apply clustering methods to genomic information. Its first application is focused on the clustering transcriptome data from microarrays. Data can be uploaded by the user into the clustering tool, a choice of several clustering methods can be made and configured, and data are presented to the user in a variety of visual formats, including a three-dimensional "mountain" view of the clusters. Parameters can be explored to rapidly examine a variety of clustering results, and the resulting clusters can be downloaded either for manipulation by other programs or to be saved in a format for publication.


Assuntos
Internet , Plantas/genética , Pesquisa/tendências , Algoritmos , Simulação por Computador , Processamento de Imagem Assistida por Computador , Projetos de Pesquisa
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...