Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 41
Filter
Add more filters

Affiliation country
Publication year range
1.
Cell ; 152(4): 714-26, 2013 Feb 14.
Article in English | MEDLINE | ID: mdl-23415222

ABSTRACT

Clonal evolution is a key feature of cancer progression and relapse. We studied intratumoral heterogeneity in 149 chronic lymphocytic leukemia (CLL) cases by integrating whole-exome sequence and copy number to measure the fraction of cancer cells harboring each somatic mutation. We identified driver mutations as predominantly clonal (e.g., MYD88, trisomy 12, and del(13q)) or subclonal (e.g., SF3B1 and TP53), corresponding to earlier and later events in CLL evolution. We sampled leukemia cells from 18 patients at two time points. Ten of twelve CLL cases treated with chemotherapy (but only one of six without treatment) underwent clonal evolution, predominantly involving subclones with driver mutations (e.g., SF3B1 and TP53) that expanded over time. Furthermore, presence of a subclonal driver mutation was an independent risk factor for rapid disease progression. Our study thus uncovers patterns of clonal evolution in CLL, providing insights into its stepwise transformation, and links the presence of subclones with adverse clinical outcomes.


Subject(s)
Leukemia, Lymphocytic, Chronic, B-Cell/genetics , Mutation , Algorithms , Animals , B-Lymphocytes/metabolism , DNA Copy Number Variations , Genome-Wide Association Study , Humans , Leukemia, Lymphocytic, Chronic, B-Cell/drug therapy , Ploidies
2.
Cell ; 153(3): 666-77, 2013 Apr 25.
Article in English | MEDLINE | ID: mdl-23622249

ABSTRACT

The analysis of exonic DNA from prostate cancers has identified recurrently mutated genes, but the spectrum of genome-wide alterations has not been profiled extensively in this disease. We sequenced the genomes of 57 prostate tumors and matched normal tissues to characterize somatic alterations and to study how they accumulate during oncogenesis and progression. By modeling the genesis of genomic rearrangements, we identified abundant DNA translocations and deletions that arise in a highly interdependent manner. This phenomenon, which we term "chromoplexy," frequently accounts for the dysregulation of prostate cancer genes and appears to disrupt multiple cancer genes coordinately. Our modeling suggests that chromoplexy may induce considerable genomic derangement over relatively few events in prostate cancer and other neoplasms, supporting a model of punctuated cancer evolution. By characterizing the clonal hierarchy of genomic lesions in prostate tumors, we charted a path of oncogenic events along which chromoplexy may drive prostate carcinogenesis.


Subject(s)
Chromosome Aberrations , Gene Expression Regulation, Neoplastic , Genome, Human , Prostatic Neoplasms/genetics , Adenocarcinoma/genetics , Adenocarcinoma/pathology , Cohort Studies , Genome-Wide Association Study , Humans , Male , Neuroendocrine Tumors/genetics , Neuroendocrine Tumors/pathology , Prostatic Neoplasms/pathology
3.
Cell ; 150(6): 1107-20, 2012 Sep 14.
Article in English | MEDLINE | ID: mdl-22980975

ABSTRACT

Lung adenocarcinoma, the most common subtype of non-small cell lung cancer, is responsible for more than 500,000 deaths per year worldwide. Here, we report exome and genome sequences of 183 lung adenocarcinoma tumor/normal DNA pairs. These analyses revealed a mean exonic somatic mutation rate of 12.0 events/megabase and identified the majority of genes previously reported as significantly mutated in lung adenocarcinoma. In addition, we identified statistically recurrent somatic mutations in the splicing factor gene U2AF1 and truncating mutations affecting RBM10 and ARID1A. Analysis of nucleotide context-specific mutation signatures grouped the sample set into distinct clusters that correlated with smoking history and alterations of reported lung adenocarcinoma genes. Whole-genome sequence analysis revealed frequent structural rearrangements, including in-frame exonic alterations within EGFR and SIK2 kinases. The candidate genes identified in this study are attractive targets for biological characterization and therapeutic targeting of lung adenocarcinoma.


Subject(s)
Adenocarcinoma/genetics , Carcinoma, Non-Small-Cell Lung/genetics , Genes, Neoplasm , High-Throughput Nucleotide Sequencing , Lung Neoplasms/genetics , Adenocarcinoma/pathology , Adenocarcinoma of Lung , Adult , Aged , Aged, 80 and over , Carcinoma, Non-Small-Cell Lung/pathology , Cohort Studies , Exome , Female , Genome-Wide Association Study , Humans , Lung Neoplasms/pathology , Male , Middle Aged , Mutation , Mutation Rate
4.
Cell ; 150(2): 251-63, 2012 Jul 20.
Article in English | MEDLINE | ID: mdl-22817889

ABSTRACT

Despite recent insights into melanoma genetics, systematic surveys for driver mutations are challenged by an abundance of passenger mutations caused by carcinogenic UV light exposure. We developed a permutation-based framework to address this challenge, employing mutation data from intronic sequences to control for passenger mutational load on a per gene basis. Analysis of large-scale melanoma exome data by this approach discovered six novel melanoma genes (PPP6C, RAC1, SNX31, TACC1, STK19, and ARID2), three of which-RAC1, PPP6C, and STK19-harbored recurrent and potentially targetable mutations. Integration with chromosomal copy number data contextualized the landscape of driver mutations, providing oncogenic insights in BRAF- and NRAS-driven melanoma as well as those without known NRAS/BRAF mutations. The landscape also clarified a mutational basis for RB and p53 pathway deregulation in this malignancy. Finally, the spectrum of driver mutations provided unequivocal genomic evidence for a direct mutagenic role of UV light in melanoma pathogenesis.


Subject(s)
Genome-Wide Association Study , Melanoma/genetics , Mutagenesis , Ultraviolet Rays , Amino Acid Sequence , Cells, Cultured , Exome , Humans , Melanocytes/metabolism , Models, Molecular , Molecular Sequence Data , Proto-Oncogene Proteins B-raf/genetics , Sequence Alignment , rac1 GTP-Binding Protein/genetics
5.
Nature ; 499(7457): 214-218, 2013 Jul 11.
Article in English | MEDLINE | ID: mdl-23770567

ABSTRACT

Major international projects are underway that are aimed at creating a comprehensive catalogue of all the genes responsible for the initiation and progression of cancer. These studies involve the sequencing of matched tumour-normal samples followed by mathematical analysis to identify those genes in which mutations occur more frequently than expected by random chance. Here we describe a fundamental problem with cancer genome studies: as the sample size increases, the list of putatively significant genes produced by current analytical methods burgeons into the hundreds. The list includes many implausible genes (such as those encoding olfactory receptors and the muscle protein titin), suggesting extensive false-positive findings that overshadow true driver events. We show that this problem stems largely from mutational heterogeneity and provide a novel analytical methodology, MutSigCV, for resolving the problem. We apply MutSigCV to exome sequences from 3,083 tumour-normal pairs and discover extraordinary variation in mutation frequency and spectrum within cancer types, which sheds light on mutational processes and disease aetiology, and in mutation frequency across the genome, which is strongly correlated with DNA replication timing and also with transcriptional activity. By incorporating mutational heterogeneity into the analyses, MutSigCV is able to eliminate most of the apparent artefactual findings and enable the identification of genes truly associated with cancer.


Subject(s)
Genetic Heterogeneity , Mutation/genetics , Neoplasms/genetics , Oncogenes/genetics , Artifacts , DNA Replication Timing , Exome/genetics , False Positive Reactions , Gene Expression , Genome, Human/genetics , Humans , Lung Neoplasms/genetics , Mutation Rate , Neoplasms/classification , Neoplasms/pathology , Neoplasms, Squamous Cell/genetics , Reproducibility of Results , Sample Size
6.
Nature ; 485(7399): 502-6, 2012 May 09.
Article in English | MEDLINE | ID: mdl-22622578

ABSTRACT

Melanoma is notable for its metastatic propensity, lethality in the advanced setting and association with ultraviolet exposure early in life. To obtain a comprehensive genomic view of melanoma in humans, we sequenced the genomes of 25 metastatic melanomas and matched germline DNA. A wide range of point mutation rates was observed: lowest in melanomas whose primaries arose on non-ultraviolet-exposed hairless skin of the extremities (3 and 14 per megabase (Mb) of genome), intermediate in those originating from hair-bearing skin of the trunk (5-55 per Mb), and highest in a patient with a documented history of chronic sun exposure (111 per Mb). Analysis of whole-genome sequence data identified PREX2 (phosphatidylinositol-3,4,5-trisphosphate-dependent Rac exchange factor 2)--a PTEN-interacting protein and negative regulator of PTEN in breast cancer--as a significantly mutated gene with a mutation frequency of approximately 14% in an independent extension cohort of 107 human melanomas. PREX2 mutations are biologically relevant, as ectopic expression of mutant PREX2 accelerated tumour formation of immortalized human melanocytes in vivo. Thus, whole-genome sequencing of human melanoma tumours revealed genomic evidence of ultraviolet pathogenesis and discovered a new recurrently mutated gene in melanoma.


Subject(s)
Genome, Human/genetics , Guanine Nucleotide Exchange Factors/genetics , Melanoma/genetics , Mutation/genetics , Sunlight/adverse effects , Chromosome Breakpoints/radiation effects , DNA Damage , DNA Mutational Analysis , Gene Expression Regulation, Neoplastic , Guanine Nucleotide Exchange Factors/metabolism , Humans , Melanocytes/metabolism , Melanocytes/pathology , Melanoma/pathology , Mutagenesis/radiation effects , Mutation/radiation effects , Oncogenes/genetics , Ultraviolet Rays/adverse effects
7.
Nature ; 488(7409): 106-10, 2012 Aug 02.
Article in English | MEDLINE | ID: mdl-22820256

ABSTRACT

Medulloblastomas are the most common malignant brain tumours in children. Identifying and understanding the genetic events that drive these tumours is critical for the development of more effective diagnostic, prognostic and therapeutic strategies. Recently, our group and others described distinct molecular subtypes of medulloblastoma on the basis of transcriptional and copy number profiles. Here we use whole-exome hybrid capture and deep sequencing to identify somatic mutations across the coding regions of 92 primary medulloblastoma/normal pairs. Overall, medulloblastomas have low mutation rates consistent with other paediatric tumours, with a median of 0.35 non-silent mutations per megabase. We identified twelve genes mutated at statistically significant frequencies, including previously known mutated genes in medulloblastoma such as CTNNB1, PTCH1, MLL2, SMARCA4 and TP53. Recurrent somatic mutations were newly identified in an RNA helicase gene, DDX3X, often concurrent with CTNNB1 mutations, and in the nuclear co-repressor (N-CoR) complex genes GPS2, BCOR and LDB1. We show that mutant DDX3X potentiates transactivation of a TCF promoter and enhances cell viability in combination with mutant, but not wild-type, ß-catenin. Together, our study reveals the alteration of WNT, hedgehog, histone methyltransferase and now N-CoR pathways across medulloblastomas and within specific subtypes of this disease, and nominates the RNA helicase DDX3X as a component of pathogenic ß-catenin signalling in medulloblastoma.


Subject(s)
Cerebellar Neoplasms/genetics , Exome/genetics , Genome, Human/genetics , Medulloblastoma/genetics , Mutation/genetics , Cerebellar Neoplasms/classification , Child , DEAD-box RNA Helicases/chemistry , DEAD-box RNA Helicases/genetics , DEAD-box RNA Helicases/metabolism , DNA Helicases/chemistry , DNA Helicases/genetics , DNA-Binding Proteins/genetics , Hedgehog Proteins/metabolism , Histone Methyltransferases , Histone-Lysine N-Methyltransferase/genetics , Histone-Lysine N-Methyltransferase/metabolism , Humans , Intracellular Signaling Peptides and Proteins/genetics , LIM Domain Proteins/genetics , Medulloblastoma/classification , Models, Molecular , Neoplasm Proteins/genetics , Nuclear Proteins/chemistry , Nuclear Proteins/genetics , Patched Receptors , Patched-1 Receptor , Promoter Regions, Genetic/genetics , Protein Structure, Tertiary/genetics , Proto-Oncogene Proteins/genetics , Receptors, Cell Surface/genetics , Repressor Proteins/genetics , Signal Transduction , TCF Transcription Factors/metabolism , Transcription Factors/chemistry , Transcription Factors/genetics , Tumor Suppressor Protein p53/genetics , Wnt Proteins/metabolism , beta Catenin/genetics , beta Catenin/metabolism
8.
Nature ; 486(7403): 405-9, 2012 Jun 20.
Article in English | MEDLINE | ID: mdl-22722202

ABSTRACT

Breast carcinoma is the leading cause of cancer-related mortality in women worldwide, with an estimated 1.38 million new cases and 458,000 deaths in 2008 alone. This malignancy represents a heterogeneous group of tumours with characteristic molecular features, prognosis and responses to available therapy. Recurrent somatic alterations in breast cancer have been described, including mutations and copy number alterations, notably ERBB2 amplifications, the first successful therapy target defined by a genomic aberration. Previous DNA sequencing studies of breast cancer genomes have revealed additional candidate mutations and gene rearrangements. Here we report the whole-exome sequences of DNA from 103 human breast cancers of diverse subtypes from patients in Mexico and Vietnam compared to matched-normal DNA, together with whole-genome sequences of 22 breast cancer/normal pairs. Beyond confirming recurrent somatic mutations in PIK3CA, TP53, AKT1, GATA3 and MAP3K1, we discovered recurrent mutations in the CBFB transcription factor gene and deletions of its partner RUNX1. Furthermore, we have identified a recurrent MAGI3-AKT3 fusion enriched in triple-negative breast cancer lacking oestrogen and progesterone receptors and ERBB2 expression. The MAGI3-AKT3 fusion leads to constitutive activation of AKT kinase, which is abolished by treatment with an ATP-competitive AKT small-molecule inhibitor.


Subject(s)
Breast Neoplasms/classification , Breast Neoplasms/genetics , Mutation/genetics , Translocation, Genetic/genetics , Algorithms , Breast Neoplasms/pathology , Core Binding Factor Alpha 2 Subunit/genetics , Core Binding Factor beta Subunit/genetics , DNA Mutational Analysis , Exome/genetics , Female , Gene Fusion/genetics , Humans , Membrane Proteins/genetics , Mexico , Proto-Oncogene Proteins c-akt/antagonists & inhibitors , Proto-Oncogene Proteins c-akt/genetics , Proto-Oncogene Proteins c-akt/metabolism , Vietnam
9.
Nature ; 470(7333): 214-20, 2011 Feb 10.
Article in English | MEDLINE | ID: mdl-21307934

ABSTRACT

Prostate cancer is the second most common cause of male cancer deaths in the United States. However, the full range of prostate cancer genomic alterations is incompletely characterized. Here we present the complete sequence of seven primary human prostate cancers and their paired normal counterparts. Several tumours contained complex chains of balanced (that is, 'copy-neutral') rearrangements that occurred within or adjacent to known cancer genes. Rearrangement breakpoints were enriched near open chromatin, androgen receptor and ERG DNA binding sites in the setting of the ETS gene fusion TMPRSS2-ERG, but inversely correlated with these regions in tumours lacking ETS fusions. This observation suggests a link between chromatin or transcriptional regulation and the genesis of genomic aberrations. Three tumours contained rearrangements that disrupted CADM2, and four harboured events disrupting either PTEN (unbalanced events), a prostate tumour suppressor, or MAGI2 (balanced events), a PTEN interacting protein not previously implicated in prostate tumorigenesis. Thus, genomic rearrangements may arise from transcriptional or chromatin aberrancies and engage prostate tumorigenic mechanisms.


Subject(s)
Genome, Human/genetics , Prostatic Neoplasms/genetics , Adaptor Proteins, Signal Transducing , Carrier Proteins/genetics , Case-Control Studies , Cell Adhesion Molecules/genetics , Chromatin/genetics , Chromatin/metabolism , Chromosome Aberrations , Chromosome Breakpoints , Epigenesis, Genetic/genetics , Gene Expression Regulation, Neoplastic , Guanylate Kinases , Humans , Male , PTEN Phosphohydrolase/genetics , PTEN Phosphohydrolase/metabolism , Recombination, Genetic/genetics , Signal Transduction/genetics , Transcription, Genetic
10.
Nature ; 471(7339): 467-72, 2011 Mar 24.
Article in English | MEDLINE | ID: mdl-21430775

ABSTRACT

Multiple myeloma is an incurable malignancy of plasma cells, and its pathogenesis is poorly understood. Here we report the massively parallel sequencing of 38 tumour genomes and their comparison to matched normal DNAs. Several new and unexpected oncogenic mechanisms were suggested by the pattern of somatic mutation across the data set. These include the mutation of genes involved in protein translation (seen in nearly half of the patients), genes involved in histone methylation, and genes involved in blood coagulation. In addition, a broader than anticipated role of NF-κB signalling was indicated by mutations in 11 members of the NF-κB pathway. Of potential immediate clinical relevance, activating mutations of the kinase BRAF were observed in 4% of patients, suggesting the evaluation of BRAF inhibitors in multiple myeloma clinical trials. These results indicate that cancer genome sequencing of large collections of samples will yield new insights into cancer not anticipated by existing knowledge.


Subject(s)
Genome, Human/genetics , Multiple Myeloma/genetics , Mutation/genetics , Amino Acid Sequence , Blood Coagulation/genetics , CpG Islands/genetics , DNA Mutational Analysis , DNA Repair/genetics , Exons/genetics , Exosome Multienzyme Ribonuclease Complex , Genomics , Histones/metabolism , Homeodomain Proteins/genetics , Homeostasis/genetics , Humans , Methylation , Models, Molecular , Molecular Sequence Data , Multiple Myeloma/drug therapy , Multiple Myeloma/enzymology , Multiple Myeloma/metabolism , NF-kappa B/metabolism , Oncogenes/genetics , Open Reading Frames/genetics , Protein Biosynthesis/genetics , Protein Conformation , Proto-Oncogene Proteins B-raf/antagonists & inhibitors , Proto-Oncogene Proteins B-raf/genetics , Proto-Oncogene Proteins B-raf/metabolism , RNA Processing, Post-Transcriptional/genetics , Ribonucleases/chemistry , Ribonucleases/genetics , Signal Transduction/genetics , Transcription, Genetic/genetics
11.
Proc Natl Acad Sci U S A ; 111(51): E5564-73, 2014 Dec 23.
Article in English | MEDLINE | ID: mdl-25512523

ABSTRACT

Osteosarcoma is the most common primary bone tumor, yet there have been no substantial advances in treatment or survival in three decades. We examined 59 tumor/normal pairs by whole-exome, whole-genome, and RNA-sequencing. Only the TP53 gene was mutated at significant frequency across all samples. The mean nonsilent somatic mutation rate was 1.2 mutations per megabase, and there was a median of 230 somatic rearrangements per tumor. Complex chains of rearrangements and localized hypermutation were detected in almost all cases. Given the intertumor heterogeneity, the extent of genomic instability, and the difficulty in acquiring a large sample size in a rare tumor, we used several methods to identify genomic events contributing to osteosarcoma survival. Pathway analysis, a heuristic analytic algorithm, a comparative oncology approach, and an shRNA screen converged on the phosphatidylinositol 3-kinase/mammalian target of rapamycin (PI3K/mTOR) pathway as a central vulnerability for therapeutic exploitation in osteosarcoma. Osteosarcoma cell lines are responsive to pharmacologic and genetic inhibition of the PI3K/mTOR pathway both in vitro and in vivo.


Subject(s)
Bone Neoplasms/metabolism , Genome, Human , Osteosarcoma/metabolism , Phosphatidylinositol 3-Kinases/metabolism , TOR Serine-Threonine Kinases/metabolism , Bone Neoplasms/genetics , Bone Neoplasms/pathology , Cell Line, Tumor , Cell Proliferation , Genetic Heterogeneity , Germ-Line Mutation , Humans , Osteosarcoma/genetics , Osteosarcoma/pathology , Tumor Suppressor Protein p53/genetics
12.
Nat Methods ; 10(7): 623-9, 2013 Jul.
Article in English | MEDLINE | ID: mdl-23685885

ABSTRACT

RNA-seq is an effective method for studying the transcriptome, but it can be difficult to apply to scarce or degraded RNA from fixed clinical samples, rare cell populations or cadavers. Recent studies have proposed several methods for RNA-seq of low-quality and/or low-quantity samples, but the relative merits of these methods have not been systematically analyzed. Here we compare five such methods using metrics relevant to transcriptome annotation, transcript discovery and gene expression. Using a single human RNA sample, we constructed and sequenced ten libraries with these methods and compared them against two control libraries. We found that the RNase H method performed best for chemically fragmented, low-quality RNA, and we confirmed this through analysis of actual degraded samples. RNase H can even effectively replace oligo(dT)-based methods for standard RNA-seq. SMART and NuGEN had distinct strengths for measuring low-quantity RNA. Our analysis allows biologists to select the most suitable methods and provides a benchmark for future method development.


Subject(s)
Algorithms , Artifacts , Gene Expression Profiling/methods , RNA/genetics , Sample Size , Sequence Analysis, RNA/methods , Software , Transcriptome/genetics
13.
Bioinformatics ; 30(15): 2224-6, 2014 Aug 01.
Article in English | MEDLINE | ID: mdl-24695405

ABSTRACT

SUMMARY: Technological advances in high-throughput sequencing necessitate improved computational tools for processing and analyzing large-scale datasets in a systematic automated manner. For that purpose, we have developed PRADA (Pipeline for RNA-Sequencing Data Analysis), a flexible, modular and highly scalable software platform that provides many different types of information available by multifaceted analysis starting from raw paired-end RNA-seq data: gene expression levels, quality metrics, detection of unsupervised and supervised fusion transcripts, detection of intragenic fusion variants, homology scores and fusion frame classification. PRADA uses a dual-mapping strategy that increases sensitivity and refines the analytical endpoints. PRADA has been used extensively and successfully in the glioblastoma and renal clear cell projects of The Cancer Genome Atlas program. AVAILABILITY AND IMPLEMENTATION: http://sourceforge.net/projects/prada/ CONTACT: gadgetz@broadinstitute.org or rverhaak@mdanderson.org SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Sequence Analysis, RNA/methods , Software , Statistics as Topic/methods , Base Sequence , Gene Fusion , Genome, Human/genetics , Humans , Neoplasms/genetics , Oligonucleotide Array Sequence Analysis , RNA, Messenger/genetics , RNA, Messenger/metabolism
14.
N Engl J Med ; 365(26): 2497-506, 2011 Dec 29.
Article in English | MEDLINE | ID: mdl-22150006

ABSTRACT

BACKGROUND: The somatic genetic basis of chronic lymphocytic leukemia, a common and clinically heterogeneous leukemia occurring in adults, remains poorly understood. METHODS: We obtained DNA samples from leukemia cells in 91 patients with chronic lymphocytic leukemia and performed massively parallel sequencing of 88 whole exomes and whole genomes, together with sequencing of matched germline DNA, to characterize the spectrum of somatic mutations in this disease. RESULTS: Nine genes that are mutated at significant frequencies were identified, including four with established roles in chronic lymphocytic leukemia (TP53 in 15% of patients, ATM in 9%, MYD88 in 10%, and NOTCH1 in 4%) and five with unestablished roles (SF3B1, ZMYM3, MAPK1, FBXW7, and DDX3X). SF3B1, which functions at the catalytic core of the spliceosome, was the second most frequently mutated gene (with mutations occurring in 15% of patients). SF3B1 mutations occurred primarily in tumors with deletions in chromosome 11q, which are associated with a poor prognosis in patients with chronic lymphocytic leukemia. We further discovered that tumor samples with mutations in SF3B1 had alterations in pre-messenger RNA (mRNA) splicing. CONCLUSIONS: Our study defines the landscape of somatic mutations in chronic lymphocytic leukemia and highlights pre-mRNA splicing as a critical cellular process contributing to chronic lymphocytic leukemia.


Subject(s)
DNA, Neoplasm/analysis , Leukemia, Lymphocytic, Chronic, B-Cell/genetics , Mutation , Spliceosomes/genetics , Adult , Chromosome Deletion , Chromosomes, Human, Pair 11/genetics , Exome/genetics , Gene Library , High-Throughput Nucleotide Sequencing , Humans , Mutation, Missense , RNA Splicing
15.
Nature ; 454(7205): 766-70, 2008 Aug 07.
Article in English | MEDLINE | ID: mdl-18600261

ABSTRACT

DNA methylation is essential for normal development and has been implicated in many pathologies including cancer. Our knowledge about the genome-wide distribution of DNA methylation, how it changes during cellular differentiation and how it relates to histone methylation and other chromatin modifications in mammals remains limited. Here we report the generation and analysis of genome-scale DNA methylation profiles at nucleotide resolution in mammalian cells. Using high-throughput reduced representation bisulphite sequencing and single-molecule-based sequencing, we generated DNA methylation maps covering most CpG islands, and a representative sampling of conserved non-coding elements, transposons and other genomic features, for mouse embryonic stem cells, embryonic-stem-cell-derived and primary neural cells, and eight other primary tissues. Several key findings emerge from the data. First, DNA methylation patterns are better correlated with histone methylation patterns than with the underlying genome sequence context. Second, methylation of CpGs are dynamic epigenetic marks that undergo extensive changes during cellular differentiation, particularly in regulatory regions outside of core promoters. Third, analysis of embryonic-stem-cell-derived and primary cells reveals that 'weak' CpG islands associated with a specific set of developmentally regulated genes undergo aberrant hypermethylation during extended proliferation in vitro, in a pattern reminiscent of that reported in some primary tumours. More generally, the results establish reduced representation bisulphite sequencing as a powerful technology for epigenetic profiling of cell populations relevant to developmental biology, cancer and regenerative medicine.


Subject(s)
Cell Differentiation , DNA Methylation , Genomics , Pluripotent Stem Cells/cytology , Pluripotent Stem Cells/metabolism , Animals , Cells, Cultured , Conserved Sequence , CpG Islands/genetics , Embryonic Stem Cells/cytology , Embryonic Stem Cells/metabolism , Fibroblasts/cytology , Genome/genetics , Histones/genetics , Histones/metabolism , Male , Mice , Neurons/cytology
16.
J Cyst Fibros ; 23(4): 664-675, 2024 Jul.
Article in English | MEDLINE | ID: mdl-38388235

ABSTRACT

BACKGROUND: In 2017, the US Food and Drug Administration initiated expansion of drug labels for the treatment of cystic fibrosis (CF) to include CF transmembrane conductance regulator (CFTR) gene variants based on in vitro functional studies. This study aims to identify CFTR variants that result in increased chloride (Cl-) transport function by the CFTR protein after treatment with the CFTR modulator combination elexacaftor/tezacaftor/ivacaftor (ELX/TEZ/IVA). These data may benefit people with CF (pwCF) who are not currently eligible for modulator therapies. METHODS: Plasmid DNA encoding 655 CFTR variants and wild-type (WT) CFTR were transfected into Fisher Rat Thyroid cells that do not natively express CFTR. After 24 h of incubation with control or TEZ and ELX, and acute addition of IVA, CFTR function was assessed using the transepithelial current clamp conductance assay. Each variant's forskolin/cAMP-induced baseline Cl- transport activity, responsiveness to IVA alone, and responsiveness to the TEZ/ELX/IVA combination were measured in three different laboratories. Western blots were conducted to evaluate CFTR protein maturation and complement the functional data. RESULTS AND CONCLUSIONS: 253 variants not currently approved for CFTR modulator therapy showed low baseline activity (<10 % of normal CFTR Cl- transport activity). For 152 of these variants, treatment with ELX/TEZ/IVA improved the Cl- transport activity by ≥10 % of normal CFTR function, which is suggestive of clinical benefit. ELX/TEZ/IVA increased CFTR function by ≥10 percentage points for an additional 140 unapproved variants with ≥10 % but <50 % of normal CFTR function at baseline. These findings significantly expand the number of rare CFTR variants for which ELX/TEZ/IVA treatment should result in clinical benefit.


Subject(s)
Aminophenols , Benzodioxoles , Cystic Fibrosis Transmembrane Conductance Regulator , Cystic Fibrosis , Indoles , Pyrazoles , Quinolones , Cystic Fibrosis Transmembrane Conductance Regulator/genetics , Cystic Fibrosis/genetics , Cystic Fibrosis/drug therapy , Quinolones/therapeutic use , Quinolones/pharmacology , Aminophenols/therapeutic use , Aminophenols/pharmacology , Animals , Rats , Humans , Benzodioxoles/therapeutic use , Benzodioxoles/pharmacology , Indoles/therapeutic use , Indoles/pharmacology , Pyrazoles/pharmacology , Pyrazoles/therapeutic use , Pyridines/pharmacology , Pyridines/therapeutic use , Drug Combinations , Chloride Channel Agonists/therapeutic use , Pyrrolidines/pharmacology , Pyrrolidines/therapeutic use
17.
Genome Res ; 20(9): 1297-303, 2010 Sep.
Article in English | MEDLINE | ID: mdl-20644199

ABSTRACT

Next-generation DNA sequencing (NGS) projects, such as the 1000 Genomes Project, are already revolutionizing our understanding of genetic variation among individuals. However, the massive data sets generated by NGS--the 1000 Genome pilot alone includes nearly five terabases--make writing feature-rich, efficient, and robust analysis tools difficult for even computationally sophisticated individuals. Indeed, many professionals are limited in the scope and the ease with which they can answer scientific questions by the complexity of accessing and manipulating the data produced by these machines. Here, we discuss our Genome Analysis Toolkit (GATK), a structured programming framework designed to ease the development of efficient and robust analysis tools for next-generation DNA sequencers using the functional programming philosophy of MapReduce. The GATK provides a small but rich set of data access patterns that encompass the majority of analysis tool needs. Separating specific analysis calculations from common data management infrastructure enables us to optimize the GATK framework for correctness, stability, and CPU and memory efficiency and to enable distributed and shared memory parallelization. We highlight the capabilities of the GATK by describing the implementation and application of robust, scale-tolerant tools like coverage calculators and single nucleotide polymorphism (SNP) calling. We conclude that the GATK programming framework enables developers and analysts to quickly and easily write efficient and robust NGS tools, many of which have already been incorporated into large-scale sequencing projects like the 1000 Genomes Project and The Cancer Genome Atlas.


Subject(s)
Genome , Genomics/methods , Sequence Analysis, DNA/methods , Software , Base Sequence
18.
Genome Res ; 20(4): 413-27, 2010 Apr.
Article in English | MEDLINE | ID: mdl-20179022

ABSTRACT

Global studies of transcript structure and abundance in cancer cells enable the systematic discovery of aberrations that contribute to carcinogenesis, including gene fusions, alternative splice isoforms, and somatic mutations. We developed a systematic approach to characterize the spectrum of cancer-associated mRNA alterations through integration of transcriptomic and structural genomic data, and we applied this approach to generate new insights into melanoma biology. Using paired-end massively parallel sequencing of cDNA (RNA-seq) together with analyses of high-resolution chromosomal copy number data, we identified 11 novel melanoma gene fusions produced by underlying genomic rearrangements, as well as 12 novel readthrough transcripts. We mapped these chimeric transcripts to base-pair resolution and traced them to their genomic origins using matched chromosomal copy number information. We also used these data to discover and validate base-pair mutations that accumulated in these melanomas, revealing a surprisingly high rate of somatic mutation and lending support to the notion that point mutations constitute the major driver of melanoma progression. Taken together, these results may indicate new avenues for target discovery in melanoma, while also providing a template for large-scale transcriptome studies across many tumor types.


Subject(s)
Gene Expression Profiling , Melanoma/genetics , Skin Neoplasms/genetics , Base Sequence , DNA Mutational Analysis , Gene Amplification , Gene Dosage , Gene Expression Regulation, Neoplastic , Gene Fusion , Genomics/methods , Humans , K562 Cells , Matched-Pair Analysis , Melanoma/metabolism , Melanoma/pathology , Polymorphism, Genetic , Protein Isoforms/genetics , Sequence Analysis, DNA , Skin Neoplasms/metabolism , Skin Neoplasms/pathology , Systems Integration , Tumor Cells, Cultured
19.
Bioinformatics ; 28(11): 1530-2, 2012 Jun 01.
Article in English | MEDLINE | ID: mdl-22539670

ABSTRACT

UNLABELLED: RNA-seq, the application of next-generation sequencing to RNA, provides transcriptome-wide characterization of cellular activity. Assessment of sequencing performance and library quality is critical to the interpretation of RNA-seq data, yet few tools exist to address this issue. We introduce RNA-SeQC, a program which provides key measures of data quality. These metrics include yield, alignment and duplication rates; GC bias, rRNA content, regions of alignment (exon, intron and intragenic), continuity of coverage, 3'/5' bias and count of detectable transcripts, among others. The software provides multi-sample evaluation of library construction protocols, input materials and other experimental parameters. The modularity of the software enables pipeline integration and the routine monitoring of key measures of data quality such as the number of alignable reads, duplication rates and rRNA contamination. RNA-SeQC allows investigators to make informed decisions about sample inclusion in downstream analysis. In summary, RNA-SeQC provides quality control measures critical to experiment design, process optimization and downstream computational analysis. AVAILABILITY AND IMPLEMENTATION: See www.genepattern.org to run online, or www.broadinstitute.org/rna-seqc/ for a command line tool.


Subject(s)
High-Throughput Nucleotide Sequencing/methods , Sequence Analysis, RNA/methods , Software , Gene Expression Profiling , Gene Library , Internet , Quality Control , RNA/genetics , RNA, Ribosomal/genetics
20.
J Cyst Fibros ; 18(4): 476-483, 2019 07.
Article in English | MEDLINE | ID: mdl-30563749

ABSTRACT

BACKGROUND: Assessment of approved drugs and developmental drug candidates for rare cystic fibrosis (CF)-causing variants of the Cystic Fibrosis Transmembrane Conductance Regulator (CFTR) requires abundant material from relevant models. METHODS: Isogenic cell lines harboring CFTR variants in the native genomic context were created through the development and utilization of a footprint-less, CRISPR/Cas9 gene editing pipeline in 16HBE14o- immortalized bronchial epithelial cells. RESULTS: Isogenic, homozygous cell lines for three CFTR variants (F508del and the two most common CF-causing nonsense variants, G542X and W1282X) were established and characterized. The F508del model recapitulates the known molecular pathology and pharmacology. The two models of nonsense variants (G542X and W1282X) are sensitive to Nonsense Mediated mRNA Decay (NMD) and responsive to reference compounds that inhibit NMD and promote ribosomal readthrough. CONCLUSIONS: We present a versatile, efficient gene editing pipeline that can be used to create CFTR variants in the native genomic context and the utilization of this pipeline to create homozygous cell models for the CF-causing variants F508del, G542X, and W1282X. The resulting cell lines provide a virtually unlimited source of material with specific pathogenic mutations that can be used in a variety of assays, including functional assays.


Subject(s)
Cystic Fibrosis Transmembrane Conductance Regulator/genetics , Cystic Fibrosis/genetics , Epithelial Cells , Models, Biological , Respiratory Mucosa/cytology , Cell Line , Gene Editing , Genetic Variation , Humans , Lung , Mutation
SELECTION OF CITATIONS
SEARCH DETAIL