Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 9 de 9
Filter
1.
Adv Exp Med Biol ; 919: 281-341, 2016.
Article in English | MEDLINE | ID: mdl-27975225

ABSTRACT

Biological systems function via intricate cellular processes and networks in which RNAs, metabolites, proteins and other cellular compounds have a precise role and are exquisitely regulated (Kumar and Mann, FEBS Lett 583(11):1703-1712, 2009). The development of high-throughput technologies, such as the Next Generation DNA Sequencing (NGS) and DNA microarrays for sequencing genomes or metagenomes, have triggered a dramatic increase in the last few years in the amount of information stored in the GenBank and UniProt Knowledgebase (UniProtKB). GenBank release 210, reported in October 2015, contains 202,237,081,559 nucleotides corresponding to 188,372,017 sequences, whilst there are only 1,222,635,267,498 nucleotides corresponding to 309,198,943 sequences from Whole Genome Shotgun (WGS) projects. In the case of UniProKB/Swiss-Prot, release 2015_12 (December 9, 2015) contains 196,219,159 amino acids that correspond to 550,116 entries. Meanwhile, UniProtKB/TrEMBL (release 2015_12 of December 9 2015) contains 1,838,851,8871 amino acids corresponding to 555,270,679 entries. Proteomics has also improved our knowledge of proteins that are being expressed in cells at a certain time of the cell cycle. It has also allowed the identification of molecules forming part of multiprotein complexes and an increasing number of posttranslational modifications (PTMs) that are present in proteins, as well as the variants of proteins expressed.


Subject(s)
Computational Biology/methods , Data Mining/methods , Databases, Protein , Mass Spectrometry/methods , Proteins/analysis , Proteome , Proteomics/methods , Algorithms , Animals , Biomarkers/analysis , High-Throughput Screening Assays , Humans , Multiprotein Complexes , Protein Interaction Mapping , Protein Interaction Maps , Protein Processing, Post-Translational , Proteins/genetics , Reproducibility of Results , Search Engine , Software , Web Browser
2.
Plant Cell Physiol ; 55(1): e5, 2014 Jan.
Article in English | MEDLINE | ID: mdl-24323624

ABSTRACT

The SABRE (Systematic consolidation of Arabidopsis and other Botanical REsources) database cross-searches plant genetic resources through publicly available Arabidopsis information. In SABRE, plant expressed sequence tag (EST)/cDNA clones are related to TAIR (The Arabidoposis Information Resource) gene models and their annotations through sequence similarity. By entering a keyword, SABRE searches and retrieves TAIR gene models and annotations, together with homologous gene clones from various plant species. SABRE thus facilitates using TAIR annotations of Arabidopsis genes for research on homologous genes from other model plants. To expand the application range of SABRE to crop breeding, we have recently upgraded SABRE to SABRE2 (http://sabre.epd.brc.riken.jp/SABRE2.html), by newly adding six model plants (including the major crops barley, soybean, tomato and wheat), and by improving the retrieval interface. The present version has integrated information on >1.5 million plant EST/cDNA clones from the National BioResource Project (NBRP) of Japan. All clones are actual experimental resources from 14 plant species (Arabidoposis, barley, cassava, Chinese cabbage, lotus, morning glory, poplar, Physcomitrella patens, Striga hermonthica, soybean, Thellungiella halophila, tobacco, tomato and wheat), and are available from the core facilities of the NBRP. SABRE2 is thus a useful tool that can contribute towards the improvement of important crop breeds by connecting basic research and crop breeding.


Subject(s)
Arabidopsis/genetics , DNA, Complementary/genetics , Databases, Genetic , Expressed Sequence Tags/metabolism , Sequence Homology, Amino Acid , User-Computer Interface
3.
Gene X ; 2: 100012, 2019 Jun.
Article in English | MEDLINE | ID: mdl-32550546

ABSTRACT

The extensive research for predicting highly expressed genes in plant genome sequences has been going on for decades. The codon usage pattern of genes in Arabidopsis thaliana genome is a classical topic for plant biologists for its significance in the understanding of molecular plant biology. Here we have used a gene expression profiling methodology based on the score of modified relative codon bias (MRCBS) to elucidate expression pattern of genes in Arabidopsis thaliana. MRCBS relies exclusively on sequence features for identifying the highly expressed genes. In this study, a critical analysis of predicted highly expressed (PHE) genes in Arabidopsis thaliana has been performed using MRCBS as a numerical estimator of gene expression level. Consistent with previous other results, our study indicates that codon composition plays an important role in the regulation of gene expression. We found a systematic strong correlation between MRCBS and CAI (codon adaptation index) or other expression-measures. Additionally, MRCBS correlates well with experimental gene expression data. Our study highlights the relationship between gene expression and compositional signature in relation to codon usage bias and sets the ground for the further investigation of the evolution of the protein-coding genes in the plant genome.

4.
Methods Mol Biol ; 1482: 233-45, 2016.
Article in English | MEDLINE | ID: mdl-27557771

ABSTRACT

Bioinformatics tools can be employed to identify conserved cis-sequences in sets of coregulated plant genes because more and more gene expression and genomic sequence data become available. Knowledge on the specific cis-sequences, their enrichment and arrangement within promoters, facilitates the design of functional synthetic plant promoters that are responsive to specific stresses. The present chapter illustrates an example for the bioinformatic identification of conserved Arabidopsis thaliana cis-sequences enriched in drought stress-responsive genes. This workflow can be applied for the identification of cis-sequences in any sets of coregulated genes. The workflow includes detailed protocols to determine sets of coregulated genes, to extract the corresponding promoter sequences, and how to install and run a software package to identify overrepresented motifs. Further bioinformatic analyses that can be performed with the results are discussed.


Subject(s)
Computational Biology/methods , Conserved Sequence/genetics , Gene Regulatory Networks/genetics , Software , Arabidopsis/genetics , Gene Expression Regulation, Plant , Promoter Regions, Genetic , Regulatory Sequences, Nucleic Acid/genetics
5.
Environ Pollut ; 208(Pt A): 153-160, 2016 Jan.
Article in English | MEDLINE | ID: mdl-26499933

ABSTRACT

This study applies remote sensing technology to assess and examine the spatial and temporal Brightness Temperature (BT) profile in the city of Tel-Aviv, Israel over the last 30 years using Landsat imagery. The location of warmest and coldest zones are constant over the studied period. Distinct diurnal and temporal BT behavior divide the city into four different segments. As an example of future application, we applied mixed regression models with daily random slopes to correlate Landsat BT data with monitored air temperature (Tair) measurements using 14 images for 1989-2014. Our preliminary results show a good model performance with R(2) = 0.81. Furthermore, based on the model's results, we analyzed the spatial profile of Tair within the study domain for representative days.


Subject(s)
Cities , Temperature , Environmental Monitoring/methods , Israel , Models, Theoretical , Regression Analysis , Remote Sensing Technology
6.
Gene ; 531(2): 337-46, 2013 Dec 01.
Article in English | MEDLINE | ID: mdl-23994293

ABSTRACT

The Louisiana Irises serve as an important system for the study of the evolutionary processes of speciation, including reproductive isolation, hybridization, and adaptation. Sequencing methods today allow for the generation of resources key to elucidating the genetic basis of these phenomena. Here we describe the transcriptomes of floral and young leaf tissue from Iris fulva generated by massively parallel pyrosequencing. In order to identify potential candidates for the study of reproductive isolation and adaptation in the Louisiana Irises we phylogenetically analyzed the type II MIKC(C) MADS-box and R2R3 MYB transcription factors expressed in these tissues. A total of 25 Iris MIKC(C) MADS-box genes in 9 clades and 42 Iris R2R3 MYB genes in 19 clades were identified. Through the identification of eudicot and monocot specific clades, these analyses contribute to our understanding of the evolution of these prominent transcription factor families in the angiosperms.


Subject(s)
Flowers/genetics , Genes, myb , Iris Plant/genetics , MADS Domain Proteins/genetics , Plant Leaves/genetics , Transcription Factors/genetics , Evolution, Molecular , Flowers/metabolism , Gene Expression Profiling , Gene Ontology , Genes, Plant , Multigene Family/genetics , Phylogeny , Plant Leaves/metabolism , Sequence Analysis, RNA , Transcriptome
7.
Gene ; 527(2): 477-83, 2013 Sep 25.
Article in English | MEDLINE | ID: mdl-23850578

ABSTRACT

Plant microRNA (miRNA) is a crucial regulator of gene expression. It has been reported that more than one miRNA/miRNA duplex could be produced from a microRNA precursor (pre-miRNA). In this study, we performed a comprehensive search for the novel miRNA candidates on the pre-miRNAs of Arabidopsis. AGO1 enrichment, co-existence of the miRNA-like coordinates, and unique genome-wide match sites were taken into consideration for candidate screening. As a result, 43 miRNA-like candidates derived from 25 pre-miRNAs were identified. Among these candidates, 31 strong candidates from 22 pre-miRNAs passed all the filtering steps. Interestingly, some of these miRNA-like candidates showed organ-specific expression patterns. After target prediction and degradome sequencing data-based validation, five miRNA candidate-target pairs (ath-miR863-5p.2-AT1G76550.1, ath-miR822.2-AT5G03552.1, ath-miR822.3-AT5G02350.1, sRNA4-AT1G66290.1 and sRNA6-AT1G66310.1) were identified, providing a basis for in-depth functional analysis of these miRNA candidates. These results could update the current understanding of the biogenesis and the action of the plant miRNAs.


Subject(s)
Arabidopsis/genetics , MicroRNAs/genetics , RNA Precursors/genetics , Base Sequence , MicroRNAs/chemistry , Nucleic Acid Conformation , RNA Precursors/chemistry
8.
J Proteomics ; 93: 343-55, 2013 Nov 20.
Article in English | MEDLINE | ID: mdl-23994444

ABSTRACT

Most secreted proteins in eukaryotes are modified on the amino acid consensus sequence NxS/T by an N-glycan through the process of N-glycosylation. The N-glycans on glycoproteins are processed in the endoplasmic reticulum (ER) to different mannose-type N-glycans or, when the protein passes through the Golgi apparatus, to different complex glycan forms. Here we describe the capturing of N-glycopeptides from a trypsin digest of total protein extracts of Arabidopsis plants and release of these captured peptides following Peptide N-glycosidase (PNGase) treatment for analysis of N-glycan site-occupancy. The mixture of peptides released as a consequence of the PNGase treatment was analyzed by two dimensional nano-LC-MS. As the PNGase treatment of glycopeptides results in the deamidation of the asparagine (N) in the NxS/T site of the released peptide, this asparagine (N) to aspartic acid (D) conversion is used as a glycosylation 'signature'. The efficiency of PNGase F and PNGase A in peptide release is discussed. The identification of proteins with a single glycopeptide was limited by the used search algorithm but could be improved using a reference database including deamidated peptide sequences. Additional stringency settings were used for filtering results to minimize false discovery. This resulted in identification of 330 glycopeptides on 173 glycoproteins from Arabidopsis, of which 28 putative glycoproteins, that were previously not annotated as secreted protein in The Arabidopsis Information Resource database (TAIR). Furthermore, the identified glycosylation site occupancy helped to determine the correct topology for membrane proteins. A quantitative comparison of peptide signal was made between wild type and complex-glycan-less (cgl) mutant Arabidopsis from three replicate leaf samples using a label-free MS peak comparison. As an example, the identified membrane protein SKU5 (AT4G12420) showed differential glycopeptide intensity ratios between WT and cgl indicating heterogeneous glycan modification on single protein. BIOLOGICAL SIGNIFICANCE: Proteins that enter the secretory pathway are mostly modified by N-glycans. The function of N-glycosylation has been well studied in mammals. However, in plants the function of N-glycosylation is still unclear, because glycosylation mutants in plants often do not have a clear phenotype. Here we analyzed which proteins are modified by N-glycans in plants by developing a glycopeptide enrichment method for plant proteins. Subsequently, label free comparative proteomics was employed using protein fractions from wild type and from a mutant which is blocked in modification of the N-glycan into complex glycans. The results provide new information on N-glycosylation sites on numerous secreted proteins. Results allow for specific mapping of multiple glycosylation site occupancy on proteins, which provides information on which glycosylation sites are protected or non-used from downstream processing and thus presumably are buried into the protein structure. Glycoproteomics can therefore contribute to protein structure analysis. Indeed, mapping the glycosylation sites on membrane proteins gives information on the topology of protein folds over the membrane. We thus were able to correct the topology prediction of three membrane proteins. Besides, these studies also identified limitations in the software that is used to identify single modified peptide per protein. This article is part of a Special Issue entitled: Translational Plant Proteomics.


Subject(s)
Arabidopsis/chemistry , Glycoproteins/chemistry , Amino Acid Sequence , Arabidopsis/genetics , Arabidopsis Proteins/chemistry , Glycopeptides/isolation & purification , Glycosylation , Membrane Glycoproteins/chemistry , Peptide-N4-(N-acetyl-beta-glucosaminyl) Asparagine Amidase/metabolism
9.
Gene ; 527(1): 131-8, 2013 Sep 15.
Article in English | MEDLINE | ID: mdl-23756193

ABSTRACT

Dendrobium officinale Kimura et Migo (Orchidaceae) is a traditional Chinese medicinal plant. The stem contains an alkaloid that is the primary bioactive component. However, the details of alkaloid biosynthesis have not been effectively explored because of the limited number of expressed sequence tags (ESTs) available in GenBank. In this study, we analyzed RNA isolated from the stem of D. officinale using a single half-run on the Roche 454 GS FLX Titanium platform to generate 553,084 ESTs with an average length of 417 bases. The ESTs were assembled into 36,407 unique putative transcripts. A total of 69.97% of the unique sequences were annotated, and a detailed view of alkaloid biosynthesis was obtained. Functional assignment based on Kyoto Encyclopedia of Genes and Genomes (KEGG) terms revealed 69 unique sequences representing 25 genes involved in alkaloid backbone biosynthesis. A series of qRT-PCR experiments confirmed that the expression levels of 5 key enzyme-encoding genes involved in alkaloid biosynthesis are greater in the leaves of D. officinale than in the stems. Cytochrome P450s, aminotransferases, methyltransferases, multidrug resistance protein (MDR) transporters and transcription factors were screened for possible involvement in alkaloid biosynthesis. Furthermore, a total of 1061 simple sequence repeat motifs (SSR) were detected from 36,407 unigenes. Dinucleotide repeats were the most abundant repeat type. Of these, 179 genes were associated with a metabolic pathway in KEGG. This study is the first to produce a large volume of transcriptome data from D. officinale. It extends the foundation to facilitate gene discovery in D. officinale and provides an important resource for the molecular genetic and functional genomic studies in this species.


Subject(s)
Dendrobium/genetics , Plant Proteins/genetics , Transcriptome , Alkaloids/biosynthesis , Biosynthetic Pathways/genetics , Dendrobium/enzymology , Genetic Markers , High-Throughput Nucleotide Sequencing , Medicine, Chinese Traditional , Molecular Sequence Annotation , Plant Proteins/metabolism , Plant Stems , Plants, Medicinal/enzymology , Plants, Medicinal/genetics , Sequence Analysis, DNA , Trinucleotide Repeats
SELECTION OF CITATIONS
SEARCH DETAIL