Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 29
Filter
1.
Nat Commun ; 14(1): 2461, 2023 04 28.
Article in English | MEDLINE | ID: mdl-37117207

ABSTRACT

Multidimensional measurements using state-of-the-art separations and mass spectrometry provide advantages in untargeted metabolomics analyses for studying biological and environmental bio-chemical processes. However, the lack of rapid analytical methods and robust algorithms for these heterogeneous data has limited its application. Here, we develop and evaluate a sensitive and high-throughput analytical and computational workflow to enable accurate metabolite profiling. Our workflow combines liquid chromatography, ion mobility spectrometry and data-independent acquisition mass spectrometry with PeakDecoder, a machine learning-based algorithm that learns to distinguish true co-elution and co-mobility from raw data and calculates metabolite identification error rates. We apply PeakDecoder for metabolite profiling of various engineered strains of Aspergillus pseudoterreus, Aspergillus niger, Pseudomonas putida and Rhodosporidium toruloides. Results, validated manually and against selected reaction monitoring and gas-chromatography platforms, show that 2683 features could be confidently annotated and quantified across 116 microbial sample runs using a library built from 64 standards.


Subject(s)
Algorithms , Metabolomics , Mass Spectrometry/methods , Metabolomics/methods , Chromatography, Liquid/methods , Ion Mobility Spectrometry
2.
Nat Biotechnol ; 33(9): 985-989, 2015 Sep.
Article in English | MEDLINE | ID: mdl-26121415

ABSTRACT

CRISPR-Cas-mediated genome editing relies on guide RNAs that direct site-specific DNA cleavage facilitated by the Cas endonuclease. Here we report that chemical alterations to synthesized single guide RNAs (sgRNAs) enhance genome editing efficiency in human primary T cells and CD34(+) hematopoietic stem and progenitor cells. Co-delivering chemically modified sgRNAs with Cas9 mRNA or protein is an efficient RNA- or ribonucleoprotein (RNP)-based delivery method for the CRISPR-Cas system, without the toxicity associated with DNA delivery. This approach is a simple and effective way to streamline the development of genome editing with the potential to accelerate a wide array of biotechnological and therapeutic applications of the CRISPR-Cas technology.


Subject(s)
CRISPR-Cas Systems/genetics , Clustered Regularly Interspaced Short Palindromic Repeats/genetics , Genetic Engineering/methods , Genome, Human/genetics , RNA, Guide, Kinetoplastida/chemistry , RNA, Guide, Kinetoplastida/genetics , Humans
3.
Bioinformatics ; 31(10): 1683-5, 2015 May 15.
Article in English | MEDLINE | ID: mdl-25577435

ABSTRACT

ENViz (Enrichment Analysis and Visualization) is a Cytoscape app that performs joint enrichment analysis of two types of sample matched datasets in the context of systematic annotations. Such datasets may be gene expression or any other high-throughput data collected in the same set of samples. The enrichment analysis is done in the context of pathway information, gene ontology or any custom annotation of the data. The results of the analysis consist of significant associations between profiled elements of one of the datasets to the annotation terms (e.g. miR-19 was associated to the cell-cycle process in breast cancer samples). The results of the enrichment analysis are visualized as an interactive Cytoscape network.


Subject(s)
Gene Expression Profiling/methods , Software , Breast Neoplasms/genetics , Cell Cycle , Computer Graphics , Data Interpretation, Statistical , Female , Humans
4.
Am J Hum Genet ; 88(3): 317-32, 2011 Mar 11.
Article in English | MEDLINE | ID: mdl-21397061

ABSTRACT

Copy-number variants (CNVs) can reach appreciable frequencies in the human population, and recent discoveries have shown that several of these copy-number polymorphisms (CNPs) are associated with human diseases, including lupus, psoriasis, Crohn disease, and obesity. Despite new advances, significant biases remain in terms of CNP discovery and genotyping. We developed a method based on single-channel intensity data and benchmarked against copy numbers determined from sequencing read depth to successfully obtain CNP genotypes for 1495 CNPs from 487 human DNA samples of diverse ethnic backgrounds. This microarray contained CNPs in segmental duplication-rich regions and insertions of sequences not represented in the reference genome assembly or on standard SNP microarray platforms. We observe that CNPs in segmental duplications are more likely to be population differentiated than CNPs in unique regions (p = 0.015) and that biallelic CNPs show greater stratification when compared to frequency-matched SNPs (p = 0.0026). Although biallelic CNPs show a strong correlation of copy number with flanking SNP genotypes, the majority of multicopy CNPs do not (40% with r > 0.8). We selected a subset of CNPs for further characterization in 1876 additional samples from 62 populations; this revealed striking population-differentiated structural variants in genes of clinical significance such as OCLN, a tight junction protein involved in hepatitis C viral entry. Our microarray design allows these variants to be rapidly tested for disease association and our results suggest that CNPs (especially those that cannot be imputed from SNP genotypes) might have contributed disproportionately to human diversity and selection.


Subject(s)
DNA Copy Number Variations/genetics , Genetics, Population , Comparative Genomic Hybridization , Genetic Loci/genetics , Genotype , Geography , Humans , Linkage Disequilibrium/genetics , Mutagenesis, Insertional/genetics , Polymorphism, Single Nucleotide/genetics
5.
Science ; 330(6004): 641-6, 2010 Oct 29.
Article in English | MEDLINE | ID: mdl-21030649

ABSTRACT

Copy number variants affect both disease and normal phenotypic variation, but those lying within heavily duplicated, highly identical sequence have been difficult to assay. By analyzing short-read mapping depth for 159 human genomes, we demonstrated accurate estimation of absolute copy number for duplications as small as 1.9 kilobase pairs, ranging from 0 to 48 copies. We identified 4.1 million "singly unique nucleotide" positions informative in distinguishing specific copies and used them to genotype the copy and content of specific paralogs within highly duplicated gene families. These data identify human-specific expansions in genes associated with brain development, reveal extensive population genetic diversity, and detect signatures consistent with gene conversion in the human species. Our approach makes ~1000 genes accessible to genetic studies of disease association.


Subject(s)
DNA Copy Number Variations , Gene Dosage , Gene Duplication , Genetic Variation , Genome, Human , Genomics/methods , Chromosome Mapping , Databases, Nucleic Acid , Evolution, Molecular , Female , Gene Conversion , Gene Frequency , Genes, Duplicate , Genotype , Haplotypes , Humans , Male , Polymorphism, Single Nucleotide , Racial Groups/genetics , Sequence Analysis, DNA
6.
Nat Methods ; 7(5): 365-71, 2010 May.
Article in English | MEDLINE | ID: mdl-20440878

ABSTRACT

The extent of human genomic structural variation suggests that there must be portions of the genome yet to be discovered, annotated and characterized at the sequence level. We present a resource and analysis of 2,363 new insertion sequences corresponding to 720 genomic loci. We found that a substantial fraction of these sequences are either missing, fragmented or misassigned when compared to recent de novo sequence assemblies from short-read next-generation sequence data. We determined that 18-37% of these new insertions are copy-number polymorphic, including loci that show extensive population stratification among Europeans, Asians and Africans. Complete sequencing of 156 of these insertions identified new exons and conserved noncoding sequences not yet represented in the reference genome. We developed a method to accurately genotype these new insertions by mapping next-generation sequencing datasets to the breakpoint, thereby providing a means to characterize copy-number status for regions previously inaccessible to single-nucleotide polymorphism microarrays.


Subject(s)
Contig Mapping/methods , Genome, Human , Sequence Analysis, DNA/methods , DNA Transposable Elements/genetics , Gene Frequency , Genomic Structural Variation/genetics , Genotype , Humans , In Situ Hybridization, Fluorescence , Molecular Sequence Data , Polymorphism, Genetic
7.
PLoS One ; 4(11): e8003, 2009 Nov 25.
Article in English | MEDLINE | ID: mdl-19946373

ABSTRACT

BACKGROUND: MicroRNAs (miRNAs) regulate target genes at the post-transcriptional level and play important roles in cancer pathogenesis and development. Variation amongst individuals is a significant confounding factor in miRNA (or other) expression studies. The true character of biologically or clinically meaningful differential expression can be obscured by inter-patient variation. In this study we aim to identify miRNAs with consistent differential expression in multiple tumor types using a novel data analysis approach. METHODS: Using microarrays we profiled the expression of more than 700 miRNAs in 28 matched tumor/normal samples from 8 different tumor types (breast, colon, liver, lung, lymphoma, ovary, prostate and testis). This set is unique in putting emphasis on minimizing tissue type and patient related variability using normal and tumor samples from the same patient. We develop scores for comparing miRNA expression in the above matched sample data based on a rigorous characterization of the distribution of order statistics over a discrete state set, including exact p-values. Specifically, we compute a Rank Consistency Score (RCoS) for every miRNA measured in our data. Our methods are also applicable in various other contexts. We compare our methods, as applied to matched samples, to paired t-test and to the Wilcoxon Signed Rank test. RESULTS: We identify consistent (across the cancer types measured) differentially expressed miRNAs. 41 miRNAs are under-expressed in cancer compared to normal, at FDR (False Discovery Rate) of 0.05 and 17 are over-expressed at the same FDR level. Differentially expressed miRNAs include known oncomiRs (e.g miR-96) as well as miRNAs that were not previously universally associated with cancer. Specific examples include miR-133b and miR-486-5p, which are consistently down regulated and mir-629* which is consistently up regulated in cancer, in the context of our cohort. Data is available in GEO. Software is available at: http://bioinfo.cs.technion.ac.il/people/zohar/RCoS/


Subject(s)
Gene Expression Profiling , Gene Expression Regulation, Neoplastic , MicroRNAs/metabolism , Algorithms , Cell Line, Tumor , Computational Biology/methods , Humans , Models, Statistical , Oligonucleotide Array Sequence Analysis , RNA Processing, Post-Transcriptional , Software , Tissue Distribution
8.
Int J Cancer ; 123(3): 577-85, 2008 Aug 01.
Article in English | MEDLINE | ID: mdl-18498133

ABSTRACT

Genetic factors are thought to play a role in resistance towards chemotherapeutic agents such as 5-fluorouracil (5-FU). Approximately 30 genes are directly or indirectly involved in 5-FU metabolism, and genetic variation in any of these may contribute to anti-tumor response. Polymorphisms in these genes were analyzed in relation to tumoral mRNA levels of 5-FU metabolizing genes, response to chemotherapy and survival. A total of 21 genetic variants were studied in 35 breast cancer patients treated with 5-FluoroUracil, mitomycin (FUMI) and in a similar cohort of 90 doxorubicin treated breast cancer patients. Genotype distributions were compared using 109 healthy controls. No significant association was found between any polymorphisms and response to chemotherapy as measured by shrinkage of tumor. However, carriers of 3 copies of the TYMS 5'UTR repeat had shorter survival than noncarriers (p = .048) in the FUMI treatment group, but not in the doxorubicin treated group. Carriers of 3 copies of the repeat were also more frequently observed in both patients groups than in healthy controls (p = .034). Several highly significant associations were observed between genotypes and expression levels of 5-FU metabolizing genes. A SNP in codon 72 of TP53 was revealed to be a key regulator of 5-FU metabolizing genes such as DHFR and MTHFR, constituting 50% of all significant associations observed after FUMI therapy. These data suggest that 3 copies of the TYMS 5'UTR repeat may give a treatment specific reduced survival in breast cancer patients, and that TP53 may have a direct, allele specific, role in 5-FU mediated response.


Subject(s)
Antimetabolites, Antineoplastic/metabolism , Antineoplastic Combined Chemotherapy Protocols/therapeutic use , Breast Neoplasms/drug therapy , Breast Neoplasms/genetics , Fluorouracil/metabolism , Polymorphism, Single Nucleotide , Tumor Suppressor Protein p53/genetics , Adult , Aged , Aged, 80 and over , Analysis of Variance , Antibiotics, Antineoplastic/administration & dosage , Antimetabolites, Antineoplastic/therapeutic use , Antineoplastic Combined Chemotherapy Protocols/metabolism , Breast Neoplasms/metabolism , Breast Neoplasms/mortality , Case-Control Studies , Doxorubicin/administration & dosage , Female , Fluorouracil/administration & dosage , Fluorouracil/therapeutic use , Genetic Predisposition to Disease , Haplotypes , Humans , Linkage Disequilibrium , Middle Aged , Mitomycin/administration & dosage , RNA, Messenger/analysis , RNA, Neoplasm/analysis , Survival Analysis , Treatment Outcome
9.
Genes Chromosomes Cancer ; 47(5): 427-36, 2008 May.
Article in English | MEDLINE | ID: mdl-18273836

ABSTRACT

Array comparative genomic hybridization (aCGH) is a key platform to assess cancer genomic profiles. Many structural genomic aberrations cannot be detected by aCGH alone. We have applied molecular cytogenetic analyses including spectral karyotyping, multicolor banding, and fluorescence in situ hybridization with aCGH to comprehensively investigate the genomic aberrations associated with cisplatin resistance in A2780 ovarian cancer cells. A2780 is a well-established model of chemotherapeutic resistance with distinct karyotypic abnormalities in the parental and cisplatin-resistant cells. Cytogenetic analysis revealed that two unbalanced translocations, der(8)t(1;8) and der(X)t(X;1), and loss of chromosome 13 were present only in the resistant line. Our aCGH analyses detected imbalances affecting an additional 10.59% of the genome in the cisplatin-resistant cells compared with the parental. DNA copy number changes included deletions at 1p10-p22.1, 8p23.3, and Xq13.1-pter, and a duplication of 8q11.22-q23. Cryptic genomic aberrations associated with concurrent localized changes of specific gene expression included a homozygous deletion of 0.38 Mb at 1p21.3 adjacent to SNX7, and an insertional transposition of 0.85 Mb from 13q12.12 into chromosome 22. This latter rearrangement led to an overexpression of four contiguous genes that flanked one of the breakpoint regions in chromosome 13. Furthermore, 17 genes showed differential expression correlating with genomic gain or loss between the resistant and parent lines, validated by a second expression array platform. These results highlight the integration of comprehensive profiling to determine relationships of genomic aberrations and genes associated with an in vitro drug resistance model in ovarian cancer. This article contains Supplementary Material available at http://www.interscience.wiley.com/jpages/1045-2257/suppmat.


Subject(s)
Antineoplastic Agents/pharmacology , Cisplatin/pharmacology , Ovarian Neoplasms/genetics , Cell Line , Chromosome Mapping , Drug Resistance, Neoplasm , Female , Gene Expression Profiling , Humans , In Situ Hybridization, Fluorescence , Karyotyping , Nucleic Acid Hybridization
10.
Am J Hum Genet ; 82(3): 685-95, 2008 Mar.
Article in English | MEDLINE | ID: mdl-18304495

ABSTRACT

Despite considerable excitement over the potential functional significance of copy-number variants (CNVs), we still lack knowledge of the fine-scale architecture of the large majority of CNV regions in the human genome. In this study, we used a high-resolution array-based comparative genomic hybridization (aCGH) platform that targeted known CNV regions of the human genome at approximately 1 kb resolution to interrogate the genomic DNAs of 30 individuals from four HapMap populations. Our results revealed that 1020 of 1153 CNV loci (88%) were actually smaller in size than what is recorded in the Database of Genomic Variants based on previously published studies. A reduction in size of more than 50% was observed for 876 CNV regions (76%). We conclude that the total genomic content of currently known common human CNVs is likely smaller than previously thought. In addition, approximately 8% of the CNV regions observed in multiple individuals exhibited genomic architectural complexity in the form of smaller CNVs within larger ones and CNVs with interindividual variation in breakpoints. Future association studies that aim to capture the potential influences of CNVs on disease phenotypes will need to consider how to best ascertain this previously uncharacterized complexity.


Subject(s)
Gene Dosage , Genetic Variation , Genome, Human/genetics , Tandem Repeat Sequences , Humans , Oligonucleotide Array Sequence Analysis/methods
11.
Biomark Insights ; 3: 1-18, 2008 Jan 21.
Article in English | MEDLINE | ID: mdl-19578489

ABSTRACT

Hepatocellular carcinoma (HCC) is the third leading cause of cancer deaths worldwide. Effective treatment of HCC patients is hampered by the lack of sensitive and specific diagnostic markers of HCC. Alpha-fetoprotein (AFP), the currently used HCC marker, misses 30%-50% of HCC patients, who therefore remain undiagnosed and untreated. In order to identify novel diagnostic markers that can be used individually or in combination with AFP, we used an antibody array platform to detect the levels of candidate proteins in the plasma of HCC patients (n = 48) and patients with chronic hepatitis B or C viral infections (n = 19) (both of which are the major risk factors of HCC). We identified 7 proteins that significantly differentiate HCC patients from hepatitis patients (p < 0.05) (AFP, CTNNB, CSF1, SELL, IGFBP6, IL6R, and VCAM1). Importantly, we also identified 8 proteins that significantly differentiate HCC patients with 'normal' levels of AFP (< 20 ng/ml) from hepatitis patients (p < 0.05) (IL1RN, IFNG, CDKN1A, RETN, CXCL14, CTNNB, FGF2, and SELL). These markers are potentially important complementary markers to AFP. Using an independent immunoassay method in an independent group of 23 HCC patients and 22 hepatitis patients, we validated that plasma levels of CTNNB were significantly higher in the HCC group (p = 0.020). In conclusion, we used an antibody array platform to identify potential circulating diagnostic markers of HCC, some of which may be valuable when used in combination with AFP. The clinical utility of these newly identified HCC diagnostic markers needs to be systematically evaluated.

12.
Hum Mol Genet ; 16(23): 2783-94, 2007 Dec 01.
Article in English | MEDLINE | ID: mdl-17666407

ABSTRACT

The discovery of copy number variation in healthy individuals is far from complete, and owing to the resolution of detection systems used, the majority of loci reported so far are relatively large ( approximately 65%>10 kb). Applying a two-stage high-resolution array comparative genomic hybridization approach to analyse 50 healthy Caucasian males from northern France, we discovered 2208 copy number variants (CNVs) detected by more than one consecutive probe. These clustered into 1469 CNV regions (CNVRs), of which 721 are thought to be novel. The majority of these are small (median size 4.4 kb) and most have common boundaries, with a coefficient of variation less than 0.1 for 83% of endpoints in those observed in multiple samples. Only 6% of the CNVRs analysed showed evidence of both copy number losses and gains at the same site. A further 6089 variants were detected by single probes: 48% of these were observed in more than one individual. In total, 2570 genes were seen to intersect variants: 1284 in novel loci. Genes involved in differentiation and development were significantly over-represented and approximately half of the genes identified feature in the Online Mendelian Inheritance in Man database. The biological importance of many genes affected, along with the well-conserved nature of the majority of the CNVs, suggests that they could have important implications for phenotype and, thus, be useful for association studies of complex diseases.


Subject(s)
Gene Dosage , Genetic Variation , Adult , Base Sequence , DNA Primers/genetics , Databases, Nucleic Acid , France , Humans , Male , Middle Aged , Oligonucleotide Array Sequence Analysis/methods , Phenotype
13.
Prostate ; 67(13): 1447-55, 2007 Sep 15.
Article in English | MEDLINE | ID: mdl-17654530

ABSTRACT

BACKGROUND: The current focus on biomarker discovery is a result of an improved understanding of the biological basis for carcinogenesis and advances in technology. Biomarkers can aid in diagnosis, prognosis, treatment selection, and drug development. There is an urgent need for high-resolution tools that perform well using archived tissue for biomarker discovery and tools that can translate into the clinic. METHODS: Oligonucleotide array comparative genomic hybridization (oCGH) was compared to BAC-based aCGH using unamplified total genomic DNA from formalin fixed paraffin-embedded (FFPE) prostate tissue. RESULTS: The copy number aberrations detected with the BAC and oligonucleotide arrays were highly correlated in cases where the arrays contained probes in similar genomic locations. The oligonucleotide array platform provided more precise mapping due to the higher density of oligonucleotide probes. CONCLUSIONS: These results demonstrate the utility of high-resolution oligonucleotide arrays designed to use genomic DNA for CGH measurements using archived tissue samples for discovery and clinic based assays.


Subject(s)
DNA, Neoplasm/genetics , Genomics/methods , Nucleic Acid Hybridization/methods , Prostatic Neoplasms/genetics , Chromosome Aberrations , Chromosome Mapping , Chromosomes, Artificial, Bacterial , Humans , Male , Oligonucleotide Array Sequence Analysis/methods
14.
BMC Genomics ; 8: 148, 2007 Jun 07.
Article in English | MEDLINE | ID: mdl-17553173

ABSTRACT

BACKGROUND: The increasing use of DNA microarrays in biomedical research, toxicogenomics, pharmaceutical development, and diagnostics has focused attention on the reproducibility and reliability of microarray measurements. While the reproducibility of microarray gene expression measurements has been the subject of several recent reports, there is still a need for systematic investigation into what factors most contribute to variability of measured expression levels observed among different laboratories and different experimenters. RESULTS: We report the results of an interlaboratory comparison of gene expression array measurements on the same microarray platform, in which the RNA amplification and labeling, hybridization and wash, and slide scanning were each individually varied. Identical input RNA was used for all experiments. While some sources of variation have measurable influence on individual microarray signals, they showed very low influence on sample-to-reference ratios based on averaged triplicate measurements in the two-color experiments. RNA labeling was the largest contributor to interlaboratory variation. CONCLUSION: Despite this variation, measurement of one particular breast cancer gene expression signature in three different laboratories was found to be highly robust, showing a high intralaboratory and interlaboratory reproducibility when using strictly controlled standard operating procedures.


Subject(s)
Breast Neoplasms/genetics , Oligonucleotide Array Sequence Analysis/methods , RNA/isolation & purification , Analysis of Variance , Humans , Oligonucleotide Array Sequence Analysis/instrumentation , Reproducibility of Results
15.
Bioinformatics ; 23(2): e91-8, 2007 Jan 15.
Article in English | MEDLINE | ID: mdl-17237111

ABSTRACT

MOTIVATION: Large-scale association studies, investigating the genetic determinants of a phenotype of interest, are producing increasing amounts of genomic variation data on human cohorts. A fundamental challenge in these studies is the detection of genotypic patterns that discriminate individuals exhibiting the phenotype under study from individuals that do not possess it. The difficulty stems from the large number of single nucleotide polymorphism (SNP) combinations that have to be tested. The discrimination problem becomes even more involved when additional high-throughput data, such as gene expression data, are available for the same cohort. RESULTS: We have developed a graph theoretic approach for identifying discriminating patterns (DPs) for a given phenotype in a genotyped population. The method is based on representing the SNP data as a bipartite graph of individuals and their SNP states, and identifying fully connected subgraphs of this graph that relate individuals enriched for a given phenotypic group. The method can handle additional data types such as expression profiles of the genotyped population. It is reminiscent of biclustering approaches with the crucial difference that its search process is guided by the phenotype under consideration in a supervised manner. We tested our approach in simulations and on real data. In simulations, our method was able to retrieve planted patterns with high success rate. We then applied our approach to a dataset of 72 breast cancer patients with available gene expression profiles, genotyped over 695 SNPs. We detected several DPs that were highly significant with respect to various clinical phenotypes, and investigated the groups of patients and the groups of genes they defined. We found the patient groups to be highly enriched for other phenotypes and to display expression coherency among their profiles. The gene groups displayed functional coherency and involved genes with known role in cancer, providing additional support to their involvement. AVAILABILITY: The program is available upon request.


Subject(s)
Biomarkers, Tumor/metabolism , Breast Neoplasms/diagnosis , Breast Neoplasms/metabolism , Diagnosis, Computer-Assisted/methods , Gene Expression Profiling/methods , Neoplasm Proteins/metabolism , Oligonucleotide Array Sequence Analysis/methods , Algorithms , Artificial Intelligence , Discriminant Analysis , Genotype , Humans , Pattern Recognition, Automated/methods
16.
Circulation ; 114(24): 2644-54, 2006 Dec 12.
Article in English | MEDLINE | ID: mdl-17145989

ABSTRACT

BACKGROUND: Recent successes in the treatment of in-stent restenosis (ISR) by drug-eluting stents belie the challenges still faced in certain lesions and patient groups. We analyzed human coronary atheroma in de novo and restenotic disease to identify targets of therapy that might avoid these limitations. METHODS AND RESULTS: We recruited 89 patients who underwent coronary atherectomy for de novo atherosclerosis (n=55) or in-stent restenosis (ISR) of a bare metal stent (n=34). Samples were fixed for histology, and gene expression was assessed with a dual-dye 22,000 oligonucleotide microarray. Histological analysis revealed significantly greater cellularity and significantly fewer inflammatory infiltrates and lipid pools in the ISR group. Gene ontology analysis demonstrated the prominence of cell proliferation programs in ISR and inflammation/immune programs in de novo restenosis. Network analysis, which combines semantic mining of the published literature with the expression signature of ISR, revealed gene expression modules suggested as candidates for selective inhibition of restenotic disease. Two modules are presented in more detail, the procollagen type 1 alpha2 gene and the ADAM17/tumor necrosis factor-alpha converting enzyme gene. We tested our contention that this method is capable of identifying successful targets of therapy by comparing mean significance scores for networks generated from subsets of the published literature containing the terms "sirolimus" or "paclitaxel." In addition, we generated 2 large networks with sirolimus and paclitaxel at their centers. Both analyses revealed higher mean values for sirolimus, suggesting that this agent has a broader suppressive action against ISR than paclitaxel. CONCLUSIONS: Comprehensive histological and gene network analysis of human ISR reveals potential targets for directed abrogation of restenotic disease and recapitulates the results of clinical trials of existing agents.


Subject(s)
Coronary Restenosis/therapy , Gene Regulatory Networks , Stents , ADAM Proteins/genetics , ADAM Proteins/physiology , ADAM17 Protein , Adult , Aged , Collagen/antagonists & inhibitors , Collagen/genetics , Collagen Type I , Coronary Artery Disease/pathology , Coronary Restenosis/metabolism , Coronary Restenosis/pathology , Female , Gene Expression Profiling , Humans , Male , Middle Aged
17.
J Bioinform Comput Biol ; 4(2): 259-74, 2006 Apr.
Article in English | MEDLINE | ID: mdl-16819783

ABSTRACT

High throughput expression profiling and genotyping technologies provide the means to study the genetic determinants of population variation in gene expression variation. In this paper we present a general statistical framework for the simultaneous analysis of gene expression data and SNP genotype data measured for the same cohort. The framework consists of methods to associate transcripts with SNPs affecting their expression, algorithms to detect subsets of transcripts that share significantly many associations with a subset of SNPs, and methods to visualize the identified relations. We apply our framework to SNP-expression data collected from 50 breast cancer patients. Our results demonstrate an overabundance of transcript-SNP associations in this data, and pinpoint SNPs that are potential master regulators of transcription. We also identify several statistically significant transcript-subsets with common putative regulators that fall into well-defined functional categories.


Subject(s)
Algorithms , Breast Neoplasms/genetics , Neoplasm Proteins/genetics , Polymorphism, Single Nucleotide/genetics , Sequence Alignment/methods , Sequence Analysis, DNA/methods , Transcription Factors/genetics , Artificial Intelligence , Base Sequence , Expressed Sequence Tags , Gene Expression/genetics , Genetic Predisposition to Disease/genetics , Humans , Molecular Sequence Data , Pattern Recognition, Automated/methods
18.
Proc Natl Acad Sci U S A ; 103(20): 7735-40, 2006 May 16.
Article in English | MEDLINE | ID: mdl-16684880

ABSTRACT

Candidate single-nucleotide polymorphisms (SNPs) were analyzed for associations to an unselected whole genome pool of tumor mRNA transcripts in 50 unrelated patients with breast cancer. SNPs were selected from 203 candidate genes of the reactive oxygen species pathway. We describe a general statistical framework for the simultaneous analysis of gene expression data and SNP genotype data measured for the same cohort, which revealed significant associations between subsets of SNPs and transcripts, shedding light on the underlying biology. We identified SNPs in EGF, IL1A, MAPK8, XPC, SOD2, and ALOX12 that are associated with the expression patterns of a significant number of transcripts, indicating the presence of regulatory SNPs in these genes. SNPs were found to act in trans in a total of 115 genes. SNPs in 43 of these 115 genes were found to act both in cis and in trans. Finally, subsets of SNPs that share significantly many common associations with a set of transcripts (biclusters) were identified. The subsets of transcripts that are significantly associated with the same set of SNPs or to a single SNP were shown to be functionally coherent in Gene Ontology and pathway analyses and coexpressed in other independent data sets, suggesting that many of the observed associations are within the same functional pathways. To our knowledge, this article is the first study to correlate SNP genotype data in the germ line with somatic gene expression data in breast tumors. It provides the statistical framework for further genotype expression correlation studies in cancer data sets.


Subject(s)
Breast Neoplasms/genetics , Gene Expression Regulation, Neoplastic , Genetic Variation , Polymorphism, Single Nucleotide , Regulatory Sequences, Nucleic Acid , Computational Biology , Female , Gene Expression Profiling , Humans , Multigene Family , RNA, Messenger/metabolism , Statistics as Topic
19.
Arterioscler Thromb Vasc Biol ; 26(5): 1058-65, 2006 May.
Article in English | MEDLINE | ID: mdl-16456091

ABSTRACT

OBJECTIVE: Phenotypic differences between vascular smooth muscle cell (VSMC) subtypes lead to diverse pathological processes including atherosclerosis, postangioplasty restenosis and vein graft disease. To better understand the molecular mechanisms underlying functional differences among distinct SMC subtypes, we compared gene expression profiles and functional responses to oxidized low-density lipoprotein (OxLDL) and platelet-derived growth factor (PDGF) between cultured SMCs from human coronary artery (CASM) and saphenous vein (SVSM). METHODS AND RESULTS: OxLDL and PDGF elicited markedly different functional responses and expression profiles between the 2 SMC subtypes. In CASM, OxLDL inhibited cell proliferation and migration and modified gene expression of chemokines (CXCL10, CXCL11 and CXCL12), proinflammatory cytokines (IL-1, IL-6, and IL-18), insulin-like growth factor binding proteins (IGFBPs), and both endothelial and smooth muscle marker genes. In SVSM, OxLDL promoted proliferation partially via IGF1 signaling, activated NF-kappaB and phosphatidylinositol signaling pathways, and upregulated prostaglandin (PG) receptors and synthases. In untreated cells, alpha-chemokines, proinflammatory cytokines, and genes associated with apoptosis, inflammation, and lipid biosynthesis were higher in CASM, whereas some beta-chemokines, metalloproteinase inhibitors, and IGFBPs were higher in SVSM. Interestingly, the basal expression levels of these genes seemed closely related to their responses to OxLDL and PDGF. In summary, our results suggest dramatic differences in gene expression patterns and functional responses to OxLDL and PDGF between venous and arterial SMCs, with venous SMCs having stronger proliferative/migratory responses to stimuli but also higher expression of atheroprotective genes at baseline. CONCLUSIONS: These results reveal molecular signatures that define the distinct phenotypes characteristics of coronary artery and saphenous vein SMC subtypes.


Subject(s)
Coronary Vessels/metabolism , Muscle, Smooth, Vascular/metabolism , Myocytes, Smooth Muscle/metabolism , Saphenous Vein/metabolism , Atherosclerosis/etiology , Cell Cycle Proteins/genetics , Cell Movement/drug effects , Cell Proliferation/drug effects , Cells, Cultured , Collagen/genetics , Coronary Vessels/cytology , Cytokines/genetics , Gene Expression Regulation/drug effects , Guanine Nucleotide Exchange Factors/genetics , Humans , Insulin-Like Growth Factor Binding Proteins/genetics , Insulin-Like Growth Factor I/physiology , Lipoproteins, LDL/pharmacology , Muscle, Smooth, Vascular/cytology , Myocytes, Smooth Muscle/physiology , NF-kappa B/physiology , Nuclear Proteins/genetics , Phenotype , Phosphatidylinositol 3-Kinases/physiology , Platelet-Derived Growth Factor/pharmacology , Saphenous Vein/cytology
20.
Pharmacogenet Genomics ; 16(3): 207-17, 2006 Mar.
Article in English | MEDLINE | ID: mdl-16495780

ABSTRACT

Rapid development in the annotation of human genetic variation has increased the numbers of single nucleotide polymorphisms (SNPs) in candidate genes by several orders of magnitude. The selection of both useful target SNPs for disease-gene association studies and SNPs associated with the treatment response is therefore an increasingly challenging task. We describe a workflow for selecting SNPs based on their putative function and frequency in candidate genes extracted from PubMed resources. The annotation of each SNP and its frequency in a Caucasian population was assessed in several databases. Approximately 4000 SNPs were identified from an initial 233 candidate genes. In a case study, we performed actual genotyping of 1030 of these SNPs in 213 genes and obtained 710 successfully genotyped SNPs. Using the flow-chart outlined here, only 87 SNPs were monomorphic (approximately 12%). This study reports the frequency of SNPs in a Caucasian population, selected in silico, using a candidate gene approach and validated by actually genotyping 193 individuals. The selected genotypes represent a valuable set of verified candidate SNPs for pharmacogenetic studies in Caucasian populations.


Subject(s)
Databases, Genetic , Gene Expression Regulation, Neoplastic , Genetic Predisposition to Disease , Pharmacogenetics/methods , Polymorphism, Single Nucleotide , Breast Neoplasms/genetics , Female , Gene Frequency , Genotype , Humans , Models, Genetic , Mutation , White People
SELECTION OF CITATIONS
SEARCH DETAIL
...