Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 43
Filter
1.
Genomics ; 107(4): 138-44, 2016 Apr.
Article in English | MEDLINE | ID: mdl-26898347

ABSTRACT

This study determined transcriptome-wide targets of the splicing factor RBM4 using Affymetrix GeneChip(®) Human Exon 1.0 ST Arrays and HeLa cells treated with RBM4-specific siRNA. This revealed 238 transcripts that were targeted for alternative splicing. Cross-linking and immunoprecipitation experiments identified 945 RBM4 targets in mouse HEK293 cells, 39% of which were ascribed to "alternative splicing" by in silico pathway analysis. Mouse embryonic stem cells transfected with Rbm4 siRNA hairpins exhibited reduced colony numbers and size consistent with involvement of RBM4 in cell proliferation. RBM4 cDNA probing of a cancer cDNA array involving 18 different tumor types from 13 different tissues and matching normal tissue found overexpression of RBM4 mRNA (p<0.01) in cervical, breast, lung, colon, ovarian and rectal cancers. Many RBM4 targets we identified have been implicated in these cancers. In conclusion, our findings reveal transcriptome-wide targets of RBM4 and point to potential cancer-related targets and mechanisms that may involve RBM4.


Subject(s)
Alternative Splicing , Neoplasms/genetics , RNA, Small Interfering/metabolism , RNA-Binding Proteins/metabolism , Transcriptome , Animals , Cells, Cultured , Computational Biology , Embryonic Stem Cells , Exons , Gene Expression Regulation, Neoplastic , Gene Knockdown Techniques , HEK293 Cells , HeLa Cells , Humans , Mice , Oligonucleotide Array Sequence Analysis , RNA, Small Interfering/genetics
2.
Bioinformatics ; 32(2): 252-9, 2016 Jan 15.
Article in English | MEDLINE | ID: mdl-26395771

ABSTRACT

MOTIVATION: Protein phosphorylation is a post-translational modification that underlines various aspects of cellular signaling. A key step to reconstructing signaling networks involves identification of the set of all kinases and their substrates. Experimental characterization of kinase substrates is both expensive and time-consuming. To expedite the discovery of novel substrates, computational approaches based on kinase recognition sequence (motifs) from known substrates, protein structure, interaction and co-localization have been proposed. However, rarely do these methods take into account the dynamic responses of signaling cascades measured from in vivo cellular systems. Given that recent advances in mass spectrometry-based technologies make it possible to quantify phosphorylation on a proteome-wide scale, computational approaches that can integrate static features with dynamic phosphoproteome data would greatly facilitate the prediction of biologically relevant kinase-specific substrates. RESULTS: Here, we propose a positive-unlabeled ensemble learning approach that integrates dynamic phosphoproteomics data with static kinase recognition motifs to predict novel substrates for kinases of interest. We extended a positive-unlabeled learning technique for an ensemble model, which significantly improves prediction sensitivity on novel substrates of kinases while retaining high specificity. We evaluated the performance of the proposed model using simulation studies and subsequently applied it to predict novel substrates of key kinases relevant to insulin signaling. Our analyses show that static sequence motifs and dynamic phosphoproteomics data are complementary and that the proposed integrated model performs better than methods relying only on static information for accurate prediction of kinase-specific substrates. AVAILABILITY AND IMPLEMENTATION: Executable GUI tool, source code and documentation are freely available at https://github.com/PengyiYang/KSP-PUEL. CONTACT: pengyi.yang@nih.gov or jothi@mail.nih.gov SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Insulin/metabolism , Mass Spectrometry/methods , Phosphoproteins/metabolism , Protein Kinases/metabolism , Protein Processing, Post-Translational , Proteome/analysis , Proteomics/methods , Databases, Protein , Humans , Phosphorylation , Signal Transduction , Substrate Specificity
3.
Pigment Cell Melanoma Res ; 28(3): 254-66, 2015 May.
Article in English | MEDLINE | ID: mdl-25490969

ABSTRACT

The role of microRNAs (miRNAs) in melanoma is unclear. We examined global miRNA expression profiles in fresh-frozen metastatic melanomas in relation to clinical outcome and BRAF mutation, with validation in independent cohorts of tumours and sera. We integrated miRNA and mRNA information from the same samples and elucidated networks associated with outcome and mutation. Associations with prognosis were replicated for miR-150-5p, miR-142-3p and miR-142-5p. Co-analysis of miRNA and mRNA uncovered a network associated with poor prognosis (PP) that paradoxically favoured expression of miRNAs opposing tumorigenesis. These miRNAs are likely part of an autoregulatory response to oncogenic drivers, rather than drivers themselves. Robust association of miR-150-5p and the miR-142 duplex with good prognosis and earlier stage metastatic melanoma supports their potential as biomarkers. miRNAs overexpressed in association with PP in an autoregulatory fashion will not be suitable therapeutic targets.


Subject(s)
Gene Expression Profiling , Gene Expression Regulation, Neoplastic , Lymphatic Metastasis/genetics , Melanoma/genetics , MicroRNAs/genetics , Mutation/genetics , Proto-Oncogene Proteins B-raf/genetics , Cohort Studies , Humans , Kaplan-Meier Estimate , Lymphatic Metastasis/pathology , Melanoma/pathology , MicroRNAs/metabolism , Neoplasm Staging , Paraffin Embedding , Prognosis , RNA, Messenger/genetics , RNA, Messenger/metabolism , Signal Transduction/genetics , Skin Neoplasms/genetics , Skin Neoplasms/pathology , Treatment Outcome , Melanoma, Cutaneous Malignant
4.
BMC Syst Biol ; 8 Suppl 4: S5, 2014.
Article in English | MEDLINE | ID: mdl-25521200

ABSTRACT

BACKGROUND: Classical approaches to predicting patient clinical outcome via gene expression information are primarily based on differential expression of unrelated genes (single-gene approaches) or genes related by, for example, biologic pathway or function (gene-sets). Recently, network-based approaches utilising interaction information between genes have emerged. An open problem is whether such approaches add value to the more traditional methods of signature modelling. We explored this question via comparison of the most widely employed single-gene, gene-set, and network-based methods, using gene expression microarray data from two different cancers: melanoma and ovarian. We considered two kinds of network approaches. The first of these identifies informative genes using gene expression and network connectivity information combined, the latter drawn from prior knowledge of protein-protein interactions. The second approach focuses on identification of informative sub-networks (small networks of interacting proteins, again from prior knowledge networks). For all methods we performed 100 rounds of 5-fold cross-validation under 3 different classifiers. For network-based approaches, we considered two different protein-protein interaction networks. We quantified resulting patterns of misclassification and discussed the relative value of each relative to ongoing development of prognostic biomarkers. RESULTS: We found that single-gene, gene-set and network methods yielded similar error rates in melanoma and ovarian cancer data. Crucially, however, our novel and detailed patient-level analyses revealed that the different methods were correctly classifying alternate subsets of patients in each cohort. We also found that the network-based NetRank feature selection method was the most stable. CONCLUSIONS: Next-generation methods of gene expression signature modelling harness data from external networks and are foreshadowed as a standard mode of analysis. But what do they add to traditional approaches? Our findings indicate there is value in the way in which different subspaces of the patient sample are captured differently among the various methods, highlighting the possibility of 'combination' classifiers capable of identifying which patients will be more accurately classified by one particular method over another. We have seen this clearly for the first time because of our in-depth analysis at the level of individual patients.


Subject(s)
Computational Biology/methods , Gene Expression Profiling , Gene Regulatory Networks , Protein Interaction Mapping , Biomarkers/metabolism , Female , Humans , Models, Biological , Ovarian Neoplasms/genetics , Ovarian Neoplasms/metabolism , Prognosis
5.
Proteomics ; 13(23-24): 3393-405, 2013 Dec.
Article in English | MEDLINE | ID: mdl-24166987

ABSTRACT

High-throughput '-omics' data can be combined with large-scale molecular interaction networks, for example, protein-protein interaction networks, to provide a unique framework for the investigation of human molecular biology. Interest in these integrative '-omics' methods is growing rapidly because of their potential to understand complexity and association with disease; such approaches have a focus on associations between phenotype and "network-type." The potential of this research is enticing, yet there remain a series of important considerations. Here, we discuss interaction data selection, data quality, the relative merits of using data from large high-throughput studies versus a meta-database of smaller literature-curated studies, and possible issues of sociological or inspection bias in interaction data. Other work underway, especially international consortia to establish data formats, quality standards and address data redundancy, and the improvements these efforts are making to the field, is also evaluated. We present options for researchers intending to use large-scale molecular interaction networks as a functional context for protein or gene expression data, including microRNAs, especially in the context of human disease.


Subject(s)
Neoplasms/metabolism , Protein Interaction Maps , Data Mining , Databases, Protein/standards , Humans , MicroRNAs/genetics , Molecular Sequence Annotation , Protein Interaction Mapping , Proteome/genetics , Proteome/metabolism , RNA Interference
6.
BMC Res Notes ; 6: 430, 2013 Oct 25.
Article in English | MEDLINE | ID: mdl-24156242

ABSTRACT

BACKGROUND: Large-scale molecular interaction networks are dynamic in nature and are of special interest in the analysis of complex diseases, which are characterized by network-level perturbations rather than changes in individual genes/proteins. The methods developed for the identification of differentially expressed genes or gene sets are not suitable for network-level analyses. Consequently, bioinformatics approaches that enable a joint analysis of high-throughput transcriptomics datasets and large-scale molecular interaction networks for identifying perturbed networks are gaining popularity. Typically, these approaches require the sequential application of multiple bioinformatics techniques - ID mapping, network analysis, and network visualization. Here, we present the Variability Analysis in Networks (VAN) software package: a collection of R functions to streamline this bioinformatics analysis. FINDINGS: VAN determines whether there are network-level perturbations across biological states of interest. It first identifies hubs (densely connected proteins/microRNAs) in a network and then uses them to extract network modules (comprising of a hub and all its interaction partners). The function identifySignificantHubs identifies dysregulated modules (i.e. modules with changes in expression correlation between a hub and its interaction partners) using a single expression and network dataset. The function summarizeHubData identifies dysregulated modules based on a meta-analysis of multiple expression and/or network datasets. VAN also converts protein identifiers present in a MITAB-formatted interaction network to gene identifiers (UniProt identifier to Entrez identifier or gene symbol using the function generatePpiMap) and generates microRNA-gene interaction networks using TargetScan and Microcosm databases (generateMicroRnaMap). The function obtainCancerInfo is used to identify hubs (corresponding to significantly perturbed modules) that are already causally associated with cancer(s) in the Cancer Gene Census database. Additionally, VAN supports the visualization of changes to network modules in R and Cytoscape (visualizeNetwork and obtainPairSubset, respectively). We demonstrate the utility of VAN using a gene expression data from metastatic melanoma and a protein-protein interaction network from the Human Protein Reference Database. CONCLUSIONS: Our package provides a comprehensive and user-friendly platform for the integrative analysis of -omics data to identify disease-associated network modules. This bioinformatics approach, which is essentially focused on the question of explaining phenotype with a 'network type' and in particular, how regulation is changing among different states of interest, is relevant to many questions including those related to network perturbations across developmental timelines.


Subject(s)
Computational Biology , Melanoma/metabolism , MicroRNAs/metabolism , Neoplasm Proteins/metabolism , Software , Databases, Protein , Gene Regulatory Networks , Genetic Variation , Humans , Melanoma/genetics , MicroRNAs/genetics , Neoplasm Metastasis , Neoplasm Proteins/genetics , Protein Interaction Mapping
7.
Pigment Cell Melanoma Res ; 26(5): 708-22, 2013 Sep.
Article in English | MEDLINE | ID: mdl-23738911

ABSTRACT

For disseminated melanoma, new prognostic biomarkers and therapeutic targets are urgently needed. The organization of protein-protein interaction networks was assessed via the transcriptomes of four independent studies of metastatic melanoma and related to clinical outcome and MAP-kinase pathway mutations (BRAF/NRAS). We also examined patient outcome-related differences in a predicted network of microRNAs and their targets. The 32 hub genes with the most reproducible survival-related disturbances in co-expression with their protein partner genes included oncogenes and tumor suppressors, previously known correlates of prognosis, and other proteins not previously associated with melanoma outcome. Notably, this network-based gene set could classify patients according to clinical outcomes with 67-80% accuracy among cohorts. Reproducibly disturbed networks were also more likely to have a higher functional mutation burden than would be expected by chance. The disturbed regions of networks are therefore markers of clinically relevant, selectable tumor evolution in melanoma which may carry driver mutations.


Subject(s)
Cost of Illness , Melanoma/metabolism , Melanoma/pathology , Mutation/genetics , Protein Interaction Maps , Skin Neoplasms/metabolism , Skin Neoplasms/pathology , Gene Expression Profiling , Gene Expression Regulation, Neoplastic , Gene Regulatory Networks/genetics , Humans , Melanoma/genetics , MicroRNAs/genetics , MicroRNAs/metabolism , Neoplasm Metastasis , Neoplasm Proteins/genetics , Neoplasm Proteins/metabolism , Prognosis , Protein Binding/genetics , Reproducibility of Results , Skin Neoplasms/genetics , Treatment Outcome
8.
Mol Biol Rep ; 40(9): 5381-95, 2013 Sep.
Article in English | MEDLINE | ID: mdl-23666063

ABSTRACT

Alternative splicing is a major source of protein diversity in humans. The human splicing factor zinc finger, Ran-binding domain containing protein 2 (ZRANB2) is a splicing protein whose specific endogenous targets are unknown. Its upregulation in grade III ovarian serous papillary carcinoma could suggest a role in some cancers. To determine whether ZRANB2 is part of the supraspliceosome, nuclear supernatants from human embryonic kidney 293 cells were prepared and then fractioned on a glycerol gradient, followed by Western blotting. The same was done after treatment with a tyrosine kinase to induce phosphorylation. This showed for the first time that ZRANB2 is part of the supraspliceosome, and that phosphorylation affects its subcellular location. Studies were then performed to understand the splicing targets of ZRANB2 at the whole-transcriptome level. HeLa cells were transfected with a vector containing ZRANB2 or with a vector-only control. RNA was extracted, converted to cDNA and hybridized to Affymetrix GeneChip(®) Human Exon 1.0 ST Arrays. At the FDR ≤1.3 significance level we found that ZRANB2 influenced the alternative splicing of primary transcripts of CENTB1, WDR78, C10orf18, CABP4, SMARCC2, SPATA13, OR4C6, ZNF263, CAPN10, SALL1, ST18 and ZP2. Several of these have been implicated in tumor development. In conclusion ZRANB2 is part of the supraspliceosome and causes differential splicing of numerous primary transcripts, some of which might have a role in cancer.


Subject(s)
Alternative Splicing/genetics , RNA-Binding Proteins/metabolism , Spliceosomes/metabolism , Blotting, Western , Cell Fractionation , HEK293 Cells , HeLa Cells , Humans , Oligonucleotide Array Sequence Analysis , Phosphorylation , RNA-Binding Proteins/genetics , Spliceosomes/genetics
9.
BMC Genomics ; 14 Suppl 1: S9, 2013.
Article in English | MEDLINE | ID: mdl-23368783

ABSTRACT

BACKGROUND: The cost of RNA-Seq has been decreasing over the last few years. Despite this, experiments with four or less biological replicates are still quite common. Estimating the variances of gene expression estimates becomes both a challenging and interesting problem in these situations of low replication. However, with the wealth of microarray and other publicly available gene expression data readily accessible on public repositories, these sources of information can be leveraged to make improvements in variance estimation. RESULTS: We have proposed a novel approach called Tshrink+ for inferring differential gene expression through improved modelling of the gene-wise variances. Existing methods share information between genes of similar average expression by shrinking, or moderating, the gene-wise variances to a fitted common variance. We have been able to achieve improved estimation of the common variance by using gene-wise sample variances from external experiments, as well as gene length. CONCLUSIONS: Using biological data we show that utilising additional external information can improve the modelling of the common variance and hence the calling of differentially expressed genes. These sources of additional information include gene length and gene-wise sample variances from other RNA-Seq and microarray datasets, of both related and seemingly unrelated tissue types. The results of this are promising, with our differential expression test, Tshrink+, performing favourably when compared to existing methods such as DESeq and edgeR when considering both gene ranking and sensitivity. These improved variance models could easily be implemented in both DESeq and edgeR and highlight the need for a database that offers a profile of gene variances over a range of tissue types and organisms.


Subject(s)
Genome , RNA/metabolism , Sequence Analysis, RNA , Algorithms , Animals , Area Under Curve , Databases, Factual , Gene Expression , Mice , Mice, Inbred C57BL , RNA/chemistry , ROC Curve
10.
BMC Bioinformatics ; 14: 31, 2013 Jan 29.
Article in English | MEDLINE | ID: mdl-23360225

ABSTRACT

BACKGROUND: RNA-Seq has the potential to answer many diverse and interesting questions about the inner workings of cells. Estimating changes in the overall transcription of a gene is not straightforward. Changes in overall gene transcription can easily be confounded with changes in exon usage which alter the lengths of transcripts produced by a gene. Measuring the expression of constitutive exons--xons which are consistently conserved after splicing--ffers an unbiased estimation of the overall transcription of a gene. RESULTS: We propose a clustering-based method, exClust, for estimating the exons that are consistently conserved after splicing in a given data set. These are considered as the exons which are "constitutive" in this data. The method utilises information from both annotation and the dataset of interest. The method is implemented in an openly available R function package, sydSeq. CONCLUSION: When used on two real datasets exClust includes more than three times as many reads as the standard UI method, and improves concordance with qRT-PCR data. When compared to other methods, our method is shown to produce robust estimates of overall gene transcription.


Subject(s)
Exons , Gene Expression Profiling/methods , High-Throughput Nucleotide Sequencing/methods , Sequence Analysis, RNA/methods , Algorithms , Alternative Splicing , Cluster Analysis , Humans
11.
J Invest Dermatol ; 133(2): 509-17, 2013 Feb.
Article in English | MEDLINE | ID: mdl-22931913

ABSTRACT

Prediction of outcome for melanoma patients with surgically resected macroscopic nodal metastases is very imprecise. We performed a comprehensive clinico-pathologic assessment of fresh-frozen macroscopic nodal metastases and the preceding primary melanoma, somatic mutation profiling, and gene expression profiling to identify determinants of outcome in 79 melanoma patients. In addition to disease stage 4 years, 90% confidence interval): the presence of a nodular component in the primary melanoma (6.8, 0.6-76.0), and small cell size (11.1, 0.8-100.0) or low pigmentation (3.0, 0.8-100.0) in the nodal metastases. Absence of BRAF mutation (20.0, 1.0-1000.0) or NRAS mutation (16.7, 0.6-1000.0) were both favorable prognostic factors. A 46-gene expression signature with strong overrepresentation of immune response genes was predictive of better survival (10.9, 0.4-325.6); in the full cohort, median survival was >100 months in those with the signature, but 10 months in those without. This relationship was validated in two previously published independent stage III melanoma data sets. We conclude that the presence of BRAF mutation, NRAS mutation, and the absence of an immune-related expressed gene profile predict poor outcome in melanoma patients with macroscopic stage III disease.


Subject(s)
Genes, ras/genetics , Melanoma/mortality , Proto-Oncogene Proteins B-raf/genetics , Skin Neoplasms/mortality , Transcriptome/immunology , Databases, Factual/statistics & numerical data , Female , Genetic Testing/methods , Genetic Testing/standards , Humans , Male , Melanoma/genetics , Melanoma/secondary , Melanoma/surgery , Middle Aged , Neoplasm Staging , Predictive Value of Tests , Prognosis , Reproducibility of Results , Skin Neoplasms/genetics , Skin Neoplasms/pathology , Skin Neoplasms/surgery , Treatment Outcome
12.
Article in English | MEDLINE | ID: mdl-22689082

ABSTRACT

A critical component in mass spectrometry (MS)-based proteomics is an accurate protein identification procedure. Database search algorithms commonly generate a list of peptide-spectrum matches (PSMs). The validity of these PSMs is critical for downstream analysis since proteins that are present in the sample are inferred from those PSMs. A variety of postprocessing algorithms have been proposed to validate and filter PSMs. Among them, the most popular ones include a semi-supervised learning (SSL) approach known as Percolator and an empirical modeling approach known as PeptideProphet. However, they are predominantly designed for commercial database search algorithms, i.e., SEQUEST and MASCOT. Therefore, it is highly desirable to extend and optimize those PSM postprocessing algorithms for open source database search algorithms such as X!Tandem. In this paper, we propose a Self-boosted Percolator for postprocessing X!Tandem search results. We find that the SSL algorithm utilized by Percolator depends heavily on the initial ranking of PSMs. Starting with a poor PSM ranking list may cause Percolator to perform suboptimally. By implementing Percolator in a cascade learning manner, we can progressively improve the performance through multiple boost runs, enabling many more PSM identifications without sacrificing false discovery rate (FDR).


Subject(s)
Algorithms , Peptides/chemistry , Proteomics/methods , Databases, Protein , Tandem Mass Spectrometry
13.
PLoS One ; 7(1): e29612, 2012.
Article in English | MEDLINE | ID: mdl-22253745

ABSTRACT

MicroRNAs are a class of small non-protein coding RNAs that play an important role in the regulation of gene expression. Most studies on the identification of microRNA-mRNA pairs utilize the correlation coefficient as a measure of association. The use of correlation coefficient is appropriate if the expression data are available for several conditions and, for a given condition, both microRNA and mRNA expression profiles are obtained from the same set of individuals. However, there are many instances where one of the requirements is not satisfied. Therefore, there is a need for new measures of association to identify the microRNA-mRNA pairs of interest and we present two such measures. The first measure requires expression data for multiple conditions but, for a given condition, the microRNA and mRNA expression may be obtained from different individuals. The new measure, unlike the correlation coefficient, is suitable for analyzing large data sets which are obtained by combining several independent studies on microRNAs and mRNAs. Our second measure is able to handle expression data that correspond to just two conditions but, for a given condition, the microRNA and mRNA expression must be obtained from the same set of individuals. This measure, unlike the correlation coefficient, is appropriate for analyzing data sets with a small number of conditions. We apply our new measures of association to multiple myeloma data sets, which cannot be analyzed using the correlation coefficient, and identify several microRNA-mRNA pairs involved in apoptosis and cell proliferation.


Subject(s)
Computational Biology/methods , MicroRNAs/metabolism , RNA, Messenger/metabolism , Databases, Genetic , Gene Deletion , Gene Expression Profiling , Gene Expression Regulation, Neoplastic , Gene Regulatory Networks/genetics , Humans , MicroRNAs/genetics , Multiple Myeloma/genetics , RNA, Messenger/genetics , Retinoblastoma Protein/genetics
14.
J Invest Dermatol ; 132(2): 274-83, 2012 Feb.
Article in English | MEDLINE | ID: mdl-21956122

ABSTRACT

In melanoma, there is an urgent need to identify novel biomarkers with prognostic performance superior to traditional clinical and histological parameters. Gene expression-based prognostic signatures offer promise, but studies have been challenged by sample scarcity, cohort heterogeneity, and doubts about the efficacy of such signatures relative to current clinical practices. Motivated by new studies that have begun to address these challenges, we reviewed prognostic signatures derived from gene expression microarray analysis of human melanoma tissue. We used REMARK-based criteria to select the most relevant studies and directly compared their signature gene lists. Through functional ontology enrichment analysis, we observed that these independent data sets converge in part upon immune response processes and the G-protein signaling NRAS-regulation pathway, both important in melanoma development and progression. The signatures correctly predicted patient outcome in independent gene expression data sets with some notably low misclassification rates, particularly among studies involving more advanced-stage tumors. This successful cross-validation indicates that gene expression analysis-based signatures are becoming translationally relevant to care of melanoma patients, as well as improving understanding of the aspects of melanoma biology that determine patient outcome.


Subject(s)
Gene Expression Profiling , Melanoma/genetics , Skin Neoplasms/genetics , Humans , Melanoma/mortality , Melanoma/pathology , Neoplasm Staging , Osteopontin/genetics , Prognosis , Skin Neoplasms/mortality , Skin Neoplasms/pathology
15.
Hypertension ; 58(6): 1093-8, 2011 Dec.
Article in English | MEDLINE | ID: mdl-22042811

ABSTRACT

The kidney has long been invoked in the etiology of essential hypertension. This could involve alterations in expression of specific genes and microRNAs (miRNAs). The aim of the present study was to identify, at the transcriptome-wide level, mRNAs and miRNAs that were differentially expressed between kidneys of 15 untreated hypertensive and 7 normotensive white male subjects of white European ancestry. By microarray technology we found 14 genes and 11 miRNAs that were differentially expressed in the medulla. We then selected and confirmed by real-time quantitative PCR expression differences for NR4A1, NR4A2, NR4A3, PER1, and SIK1 mRNAs and for the miRNAs hsa-miR-638 and hsa-let-7c. Luciferase reporter gene experiments in human kidney (HEK293) cells confirmed the predicted binding of hsa-let-7c to the 3' untranslated region of NR4A2 mRNA. In the renal cortex we found differential expression of 46 genes and 13 miRNAs. We then confirmed expression differences for AIFM1, AMBP, APOE, CD36, EFNB1, NDUFAF1, PRDX5, REN, RENBP, SLC13A1, STX4, and TNNT2 mRNAs and for miRNAs hsa-miR-21, hsa-miR-126, hsa-miR-181a, hsa-miR-196a, hsa-miR-451, hsa-miR-638, and hsa-miR-663. Functional experiments in HEK293 cells demonstrated that hsa-miR-663 can bind to the REN and APOE 3' untranslated regions and can regulate REN and APOE mRNA levels, whereas hsa-miR-181a regulated REN and AIFM1 mRNA. Our data demonstrated for the first time that miRNAs can regulate renin expression. The observed downregulation of 2 miRNAs in hypertension could explain the elevation in intrarenal renin mRNA. Renin, CD36, and other mRNAs, as well as miRNAs and associated pathways identified in the present study, provide novel insights into hypertension etiology.


Subject(s)
Gene Expression Profiling , Hypertension/genetics , MicroRNAs/genetics , MicroRNAs/physiology , RNA, Messenger/biosynthesis , Renin/genetics , 3' Untranslated Regions , Adult , Genes, Reporter , Humans , Hypertension/metabolism , Kidney Cortex/metabolism , Kidney Medulla/metabolism , Male , Renin/biosynthesis
16.
PLoS One ; 6(4): e19203, 2011 Apr 26.
Article in English | MEDLINE | ID: mdl-21541337

ABSTRACT

Essential hypertension is a common multifactorial heritable condition in which increased sympathetic outflow from the central nervous system is involved in the elevation in blood pressure (BP), as well as the exaggerated morning surge in BP that is a risk factor for myocardial infarction and stroke in hypertensive patients. The Schlager BPH/2J mouse is a genetic model of hypertension in which increased sympathetic outflow from the hypothalamus has an important etiological role in the elevation of BP. Schlager hypertensive mice exhibit a large variation in BP between the active and inactive periods of the day, and also show a morning surge in BP. To investigate the genes responsible for the circadian variation in BP in hypertension, hypothalamic tissue was collected from BPH/2J and normotensive BPN/3J mice at the 'peak' (n = 12) and 'trough' (n = 6) of diurnal BP. Using Affymetrix GeneChip® Mouse Gene 1.0 ST Arrays, validation by quantitative real-time PCR and a statistical method that adjusted for clock genes, we identified 212 hypothalamic genes whose expression differed between 'peak' and 'trough' BP in the hypertensive strain. These included genes with known roles in BP regulation, such as vasopressin, oxytocin and thyrotropin releasing hormone, as well as genes not recognized previously as regulators of BP, including chemokine (C-C motif) ligand 19, hypocretin and zinc finger and BTB domain containing 16. Gene ontology analysis showed an enrichment of terms for inflammatory response, mitochondrial proton-transporting ATP synthase complex, structural constituent of ribosome, amongst others. In conclusion, we have identified genes whose expression differs between the peak and trough of 24-hour circadian BP in BPH/2J mice, pointing to mechanisms responsible for diurnal variation in BP. The findings may assist in the elucidation of the mechanism for the morning surge in BP in essential hypertension.


Subject(s)
Blood Pressure/genetics , Circadian Rhythm/genetics , Hypertension/genetics , Hypertension/physiopathology , Animals , Cluster Analysis , Gene Expression Profiling , Gene Expression Regulation , Gene Regulatory Networks/genetics , Hypothalamus/metabolism , Mice , Polymerase Chain Reaction , Reproducibility of Results
17.
BMC Med Genomics ; 4: 27, 2011 Mar 31.
Article in English | MEDLINE | ID: mdl-21453471

ABSTRACT

BACKGROUND: Diagnostic accuracy of lymphoma, a heterogeneous cancer, is essential for patient management. Several ancillary tests including immunophenotyping, and sometimes cytogenetics and PCR are required to aid histological diagnosis. In this proof of principle study, gene expression microarray was evaluated as a single platform test in the differential diagnosis of common lymphoma subtypes and reactive lymphadenopathy (RL) in lymph node biopsies. METHODS: 116 lymph node biopsies diagnosed as RL, classical Hodgkin lymphoma (cHL), diffuse large B cell lymphoma (DLBCL) or follicular lymphoma (FL) were assayed by mRNA microarray. Three supervised classification strategies (global multi-class, local binary-class and global binary-class classifications) using diagonal linear discriminant analysis was performed on training sets of array data and the classification error rates calculated by leave one out cross-validation. The independent error rate was then evaluated by testing the identified gene classifiers on an independent (test) set of array data. RESULTS: The binary classifications provided prediction accuracies, between a subtype of interest and the remaining samples, of 88.5%, 82.8%, 82.8% and 80.0% for FL, cHL, DLBCL, and RL respectively. Identified gene classifiers include LIM domain only-2 (LMO2), Chemokine (C-C motif) ligand 22 (CCL22) and Cyclin-dependent kinase inhibitor-3 (CDK3) specifically for FL, cHL and DLBCL subtypes respectively. CONCLUSIONS: This study highlights the ability of gene expression profiling to distinguish lymphoma from reactive conditions and classify the major subtypes of lymphoma in a diagnostic setting. A cost-effective single platform "mini-chip" assay could, in principle, be developed to aid the quick diagnosis of lymph node biopsies with the potential to incorporate other pathological entities into such an assay.


Subject(s)
Gene Expression Profiling , Lymph Nodes/pathology , Lymphatic Diseases/diagnosis , Lymphatic Diseases/genetics , Lymphoma/diagnosis , Lymphoma/genetics , Hodgkin Disease/genetics , Humans , Lymphoma, Follicular/genetics , Lymphoma, Large B-Cell, Diffuse/genetics , Microarray Analysis
18.
Physiol Genomics ; 43(12): 766-71, 2011 Jun 28.
Article in English | MEDLINE | ID: mdl-21487032

ABSTRACT

The hypothalamus has an important etiological role in the onset and maintenance of hypertension and stress responses in the Schlager high blood pressure (BP) (BPH/2J) mouse, a genetic model of neurogenic hypertension. Using Affymetrix GeneChip Mouse Gene 1.0 ST Arrays we identified 1,019 hypothalamic genes whose expression differed between 6 wk old BPH/2J and normal BP (BPN/3J) strains, and 466 for 26 wk old mice. Of these, 459 were in 21 mouse BP quantitative trait loci. We validated 46 genes by qPCR. Gene changes that would increase sympathetic outflow at both ages were: Dynll1 encoding dynein light chain LC8-type 1, which physically destabilizes neuronal nitric oxide synthase, decreasing neuronal nitric oxide, and Hcrt encoding hypocretin and Npsr1 encoding neuropeptide S receptor 1, each involved in sympathetic response to stress. At both ages we identified genes for inflammation, such as CC-chemokine ligand 19 (Ccl19), and oxidative stress. Via reactive oxygen species generation, these could contribute to oxidative damage. Other genes identified could be responding to such perturbations. Atp2b1, the major gene from genome-wide association studies of BP variation, was underexpressed in the early phase. Comparison of profiles of young and adult BPH/2J mice, after adjusting for maturation genes, pointed to the proopiomelanocortin-α gene (Pomc) and neuropeptide Y gene (Npy), among others, as potentially causative. The present study has identified a diversity of genes and possible mechanisms involved in hypertension etiology and maintenance in the hypothalamus of BPH/2J mice, highlighting both common and divergent processes in each phase of the condition.


Subject(s)
Gene Expression Profiling/methods , Gene Expression Regulation/genetics , Genes/genetics , Hypertension/metabolism , Hypothalamus/metabolism , Quantitative Trait Loci/genetics , Age Factors , Animals , Cytoplasmic Dyneins/metabolism , Intracellular Signaling Peptides and Proteins/metabolism , Mice , Neuropeptide Y/genetics , Neuropeptides/metabolism , Oligonucleotide Array Sequence Analysis , Orexins , Oxidative Stress/genetics , Polymerase Chain Reaction , Pro-Opiomelanocortin/genetics , Receptors, G-Protein-Coupled/metabolism
19.
Retrovirology ; 8: 18, 2011 Mar 16.
Article in English | MEDLINE | ID: mdl-21410942

ABSTRACT

BACKGROUND: HIV preferentially infects CD4+ T cells, and the functional impairment and numerical decline of CD4+ and CD8+ T cells characterize HIV disease. The numerical decline of CD4+ and CD8+ T cells affects the optimal ratio between the two cell types necessary for immune regulation. Therefore, this work aimed to define the genomic basis of HIV interactions with the cellular transcriptome of both CD4+ and CD8+ T cells. RESULTS: Genome-wide transcriptomes of primary CD4+ and CD8+ T cells from HIV+ patients were analyzed at different stages of HIV disease using Illumina microarray. For each cell subset, pairwise comparisons were performed and differentially expressed (DE) genes were identified (fold change >2 and B-statistic >0) followed by quantitative PCR validation. Gene ontology (GO) analysis of DE genes revealed enriched categories of complement activation, actin filament, proteasome core and proton-transporting ATPase complex. By gene set enrichment analysis (GSEA), a network of enriched pathways functionally connected by mitochondria was identified in both T cell subsets as a transcriptional signature of HIV disease progression. These pathways ranged from metabolism and energy production (TCA cycle and OXPHOS) to mitochondria meditated cell apoptosis and cell cycle dysregulation. The most unique and significant feature of our work was that the non-progressing status in HIV+ long-term non-progressors was associated with MAPK, WNT, and AKT pathways contributing to cell survival and anti-viral responses. CONCLUSIONS: These data offer new comparative insights into HIV disease progression from the aspect of HIV-host interactions at the transcriptomic level, which will facilitate the understanding of the genetic basis of transcriptomic interaction of HIV in vivo and how HIV subverts the human gene machinery at the individual cell type level.


Subject(s)
CD4-Positive T-Lymphocytes/metabolism , CD8-Positive T-Lymphocytes/metabolism , Gene Expression Profiling , Genome, Human , HIV Infections/immunology , HIV-1/pathogenicity , Adult , CD4-Positive T-Lymphocytes/immunology , CD4-Positive T-Lymphocytes/virology , CD8-Positive T-Lymphocytes/immunology , CD8-Positive T-Lymphocytes/virology , Disease Progression , HIV Infections/physiopathology , HIV Infections/virology , HIV-1/metabolism , Host-Pathogen Interactions , Humans , Lymphocyte Count , Middle Aged , Oligonucleotide Array Sequence Analysis , Proteins/genetics , Proteins/metabolism , Survivors , Viremia/immunology , Viremia/physiopathology , Viremia/virology
20.
BMC Bioinformatics ; 12 Suppl 1: S10, 2011 Feb 15.
Article in English | MEDLINE | ID: mdl-21342539

ABSTRACT

BACKGROUND: Complex diseases are commonly caused by multiple genes and their interactions with each other. Genome-wide association (GWA) studies provide us the opportunity to capture those disease associated genes and gene-gene interactions through panels of SNP markers. However, a proper filtering procedure is critical to reduce the search space prior to the computationally intensive gene-gene interaction identification step. In this study, we show that two commonly used SNP-SNP interaction filtering algorithms, ReliefF and tuned ReliefF (TuRF), are sensitive to the order of the samples in the dataset, giving rise to unstable and suboptimal results. However, we observe that the 'unstable' results from multiple runs of these algorithms can provide valuable information about the dataset. We therefore hypothesize that aggregating results from multiple runs of the algorithm may improve the filtering performance. RESULTS: We propose a simple and effective ensemble approach in which the results from multiple runs of an unstable filter are aggregated based on the general theory of ensemble learning. The ensemble versions of the ReliefF and TuRF algorithms, referred to as ReliefF-E and TuRF-E, are robust to sample order dependency and enable a more informative investigation of data characteristics. Using simulated and real datasets, we demonstrate that both the ensemble of ReliefF and the ensemble of TuRF can generate a much more stable SNP ranking than the original algorithms. Furthermore, the ensemble of TuRF achieved the highest success rate in comparison to many state-of-the-art algorithms as well as traditional χ2-test and odds ratio methods in terms of retaining gene-gene interactions.


Subject(s)
Algorithms , Computational Biology/methods , Polymorphism, Single Nucleotide , Software , Computer Simulation , Genome-Wide Association Study
SELECTION OF CITATIONS
SEARCH DETAIL
...