Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 14 de 14
Filter
1.
Trends Genet ; 38(2): 194-208, 2022 02.
Article in English | MEDLINE | ID: mdl-34483003

ABSTRACT

The somatic mutations in each cancer genome are caused by multiple mutational processes, each of which leaves a characteristic imprint (or 'signature'), potentially caused by specific etiologies or exposures. Deconvolution of these signatures offers a glimpse into the evolutionary history of individual tumors. Recent work has shown that mutational signatures may also yield therapeutic and prognostic insights, including the identification of cell-intrinsic signatures as biomarkers of drug response and prognosis. For example, mutational signatures indicating homologous recombination deficiency are associated with poly(ADP)-ribose polymerase (PARP) inhibitor sensitivity, whereas APOBEC-associated signatures are associated with ataxia telangiectasia and Rad3-related kinase (ATR) inhibitor sensitivity. Furthermore, therapy-induced mutational signatures implicated in cancer progression have also been uncovered, including the identification of thiopurine-induced TP53 mutations in leukemia. In this review, we explore the various ways mutational signatures can reveal new therapeutic and prognostic insights, thus extending their traditional role in identifying disease etiology.


Subject(s)
Neoplasms , Humans , Mutation , Neoplasms/drug therapy , Neoplasms/genetics , Poly(ADP-ribose) Polymerases , Prognosis
2.
Mol Cancer Res ; 21(4): 301-306, 2023 04 01.
Article in English | MEDLINE | ID: mdl-36637394

ABSTRACT

Myeloid sarcoma is a rare condition consisting of extramedullary myeloid blasts found in association with acute myeloid leukemia or, in the absence of bone marrow involvement. We identified an infant with isolated myeloid sarcoma whose bone marrow was negative for involvement by flow cytometry. Sequencing revealed the fusion oncogene CIC-NUTM2A and identified the sarcoma to be clonally evolved from the bone marrow, which carried the fusion despite the absence of pathology. Murine modeling confirmed the ability of the fusion to transform hematopoietic cells and identified receptor tyrosine kinase (RTK) signaling activation consistent with disruption of the CIC transcriptional repressor. These findings extend the definition of CIC-rearranged malignancies to include hematologic disease, provide insight into the mechanism of oncogenesis, and demonstrate the importance of molecular analysis and tracking of bone marrow involvement over the course of treatment in myeloid sarcoma, including patients that lack flow cytometric evidence of leukemia at diagnosis. IMPLICATIONS: This study illustrates molecular involvement of phenotypically normal bone marrow in myeloid sarcoma, which has significant implications in clinical care. Further, it extends the definition of CIC-rearrangements to include hematologic malignancies and shows evidence of RTK activation that may be exploited therapeutically in cancer(s) driven by these fusions.


Subject(s)
Leukemia, Myeloid, Acute , Sarcoma, Myeloid , Humans , Animals , Mice , Sarcoma, Myeloid/genetics , Sarcoma, Myeloid/diagnosis , Sarcoma, Myeloid/pathology , Bone Marrow/pathology , Transcription Factors , Leukemia, Myeloid, Acute/pathology , Clone Cells/pathology
3.
Nat Commun ; 14(1): 1739, 2023 04 05.
Article in English | MEDLINE | ID: mdl-37019972

ABSTRACT

Oncogenic fusions formed through chromosomal rearrangements are hallmarks of childhood cancer that define cancer subtype, predict outcome, persist through treatment, and can be ideal therapeutic targets. However, mechanistic understanding of the etiology of oncogenic fusions remains elusive. Here we report a comprehensive detection of 272 oncogenic fusion gene pairs by using tumor transcriptome sequencing data from 5190 childhood cancer patients. We identify diverse factors, including translation frame, protein domain, splicing, and gene length, that shape the formation of oncogenic fusions. Our mathematical modeling reveals a strong link between differential selection pressure and clinical outcome in CBFB-MYH11. We discover 4 oncogenic fusions, including RUNX1-RUNX1T1, TCF3-PBX1, CBFA2T3-GLIS2, and KMT2A-AFDN, with promoter-hijacking-like features that may offer alternative strategies for therapeutic targeting. We uncover extensive alternative splicing in oncogenic fusions including KMT2A-MLLT3, KMT2A-MLLT10, C11orf95-RELA, NUP98-NSD1, KMT2A-AFDN and ETV6-RUNX1. We discover neo splice sites in 18 oncogenic fusion gene pairs and demonstrate that such splice sites confer therapeutic vulnerability for etiology-based genome editing. Our study reveals general principles on the etiology of oncogenic fusions in childhood cancer and suggests profound clinical implications including etiology-based risk stratification and genome-editing-based therapeutics.


Subject(s)
Core Binding Factor Alpha 2 Subunit , Precursor Cell Lymphoblastic Leukemia-Lymphoma , Humans , Child , Core Binding Factor Alpha 2 Subunit/genetics , Oncogene Fusion , Precursor Cell Lymphoblastic Leukemia-Lymphoma/genetics , Transcriptome , Causality , Oncogene Proteins, Fusion/genetics
5.
Biochim Biophys Acta Rev Cancer ; 1876(1): 188571, 2021 08.
Article in English | MEDLINE | ID: mdl-34051287

ABSTRACT

Pediatric cancer is a rare disease with a distinct etiology and mutational landscape compared with adult cancer. Multi-omics profiling of retrospective and prospective cohorts coupled with innovative computational analysis have been instrumental in uncovering mechanisms of tumorigenesis and drug resistance that are now informing pediatric cancer clinical therapy. In this review we present the major data resources of pediatric cancer and actionable insights into pediatric cancer etiology stemming from the identification of oncogenic gene fusions, mutational signature analysis, systems biology, cancer predisposition and survivorship studies - that have led to improved clinical diagnosis, discovery of new drug-targets, pharmacological therapy, and screening for genetic predisposition. Ultimately, integration of large-scale omics datasets generated through international collaboration is required to maximize the power of data-driven approaches to advance pediatric cancer research informing clinical therapy.


Subject(s)
Biomarkers, Tumor/genetics , Databases, Genetic , Genomics , Machine Learning , Medical Oncology , Mutation , Neoplasms/genetics , Pediatrics , Age of Onset , Data Mining , Genetic Predisposition to Disease , Humans , Neoplasms/epidemiology , Neoplasms/therapy , Precision Medicine , Transcriptome
6.
Genome Biol ; 22(1): 37, 2021 01 25.
Article in English | MEDLINE | ID: mdl-33487172

ABSTRACT

BACKGROUND: There is currently no method to precisely measure the errors that occur in the sequencing instrument/sequencer, which is critical for next-generation sequencing applications aimed at discovering the genetic makeup of heterogeneous cellular populations. RESULTS: We propose a novel computational method, SequencErr, to address this challenge by measuring the base correspondence between overlapping regions in forward and reverse reads. An analysis of 3777 public datasets from 75 research institutions in 18 countries revealed the sequencer error rate to be ~ 10 per million (pm) and 1.4% of sequencers and 2.7% of flow cells have error rates > 100 pm. At the flow cell level, error rates are elevated in the bottom surfaces and > 90% of HiSeq and NovaSeq flow cells have at least one outlier error-prone tile. By sequencing a common DNA library on different sequencers, we demonstrate that sequencers with high error rates have reduced overall sequencing accuracy, and removal of outlier error-prone tiles improves sequencing accuracy. We demonstrate that SequencErr can reveal novel insights relative to the popular quality control method FastQC and achieve a 10-fold lower error rate than popular error correction methods including Lighter and Musket. CONCLUSIONS: Our study reveals novel insights into the nature of DNA sequencing errors incurred on DNA sequencers. Our method can be used to assess, calibrate, and monitor sequencer accuracy, and to computationally suppress sequencer errors in existing datasets.


Subject(s)
High-Throughput Nucleotide Sequencing/methods , Algorithms , Calibration , Gene Library , Humans , Models, Genetic , SARS-CoV-2 , Sequence Analysis, DNA/methods
7.
Cancer Epidemiol Biomarkers Prev ; 30(11): 2096-2104, 2021 11.
Article in English | MEDLINE | ID: mdl-34465587

ABSTRACT

BACKGROUND: Subsequent thyroid cancer (STC) is one of the most common malignancies in childhood cancer survivors. We aimed to evaluate the polygenic contributions to STC risk and potential utility in improving risk prediction. METHODS: A polygenic risk score (PRS) was calculated from 12 independent SNPs associated with thyroid cancer risk in the general population. Associations between PRS and STC risk were evaluated among survivors from St. Jude Lifetime Cohort (SJLIFE) and were replicated in survivors from Childhood Cancer Survivor Study (CCSS). A risk prediction model integrating the PRS and clinical factors, initially developed in SJLIFE, and its performance were validated in CCSS. RESULTS: Among 2,370 SJLIFE survivors with a median follow-up of 28.8 [interquartile range (IQR) = 21.9-36.1] years, 65 (2.7%) developed STC. Among them, the standardized PRS was associated with an increased rate of STC [relative rate (RR) = 1.57; 95% confidence interval (CI) = 1.24-1.98; P < 0.001]. Similar associations were replicated in 6,416 CCSS survivors, among whom 121 (1.9%) developed STC during median follow-up of 28.9 (IQR = 22.6-34.6) years (RR = 1.52; 95% CI = 1.25-1.83; P < 0.001). A risk prediction model integrating the PRS with clinical factors showed better performance than the model considering only clinical factors in SJLIFE (P = 0.004, AUC = 83.2% vs. 82.1%, at age 40), which was further validated in CCSS (P = 0.010, AUC = 72.9% vs. 70.6%). CONCLUSIONS: Integration of the PRS with clinical factors provided a statistically significant improvement in risk prediction of STC, although the magnitude of improvement was modest. IMPACT: PRS improves risk stratification and prediction of STC, suggesting its potential utility for optimizing screening strategies in survivorship care.


Subject(s)
Cancer Survivors/statistics & numerical data , Thyroid Neoplasms/epidemiology , Adult , Child , Female , Humans , Male , Middle Aged , Neoplasms, Second Primary/epidemiology , Polymorphism, Single Nucleotide , Retrospective Studies , Risk Assessment , Risk Factors
8.
Cancer Discov ; 11(12): 3008-3027, 2021 12 01.
Article in English | MEDLINE | ID: mdl-34301788

ABSTRACT

Genomic studies of pediatric cancer have primarily focused on specific tumor types or high-risk disease. Here, we used a three-platform sequencing approach, including whole-genome sequencing (WGS), whole-exome sequencing (WES), and RNA sequencing (RNA-seq), to examine tumor and germline genomes from 309 prospectively identified children with newly diagnosed (85%) or relapsed/refractory (15%) cancers, unselected for tumor type. Eighty-six percent of patients harbored diagnostic (53%), prognostic (57%), therapeutically relevant (25%), and/or cancer-predisposing (18%) variants. Inclusion of WGS enabled detection of activating gene fusions and enhancer hijacks (36% and 8% of tumors, respectively), small intragenic deletions (15% of tumors), and mutational signatures revealing of pathogenic variant effects. Evaluation of paired tumor-normal data revealed relevance to tumor development for 55% of pathogenic germline variants. This study demonstrates the power of a three-platform approach that incorporates WGS to interrogate and interpret the full range of genomic variants across newly diagnosed as well as relapsed/refractory pediatric cancers. SIGNIFICANCE: Pediatric cancers are driven by diverse genomic lesions, and sequencing has proven useful in evaluating high-risk and relapsed/refractory cases. We show that combined WGS, WES, and RNA-seq of tumor and paired normal tissues enables identification and characterization of genetic drivers across the full spectrum of pediatric cancers. This article is highlighted in the In This Issue feature, p. 2945.


Subject(s)
Neoplasms , Child , DNA , Humans , Mutation , Neoplasms/genetics , Sequence Analysis, RNA , Exome Sequencing
9.
Cancer Discov ; 11(5): 1082-1099, 2021 05.
Article in English | MEDLINE | ID: mdl-33408242

ABSTRACT

Effective data sharing is key to accelerating research to improve diagnostic precision, treatment efficacy, and long-term survival in pediatric cancer and other childhood catastrophic diseases. We present St. Jude Cloud (https://www.stjude.cloud), a cloud-based data-sharing ecosystem for accessing, analyzing, and visualizing genomic data from >10,000 pediatric patients with cancer and long-term survivors, and >800 pediatric sickle cell patients. Harmonized genomic data totaling 1.25 petabytes are freely available, including 12,104 whole genomes, 7,697 whole exomes, and 2,202 transcriptomes. The resource is expanding rapidly, with regular data uploads from St. Jude's prospective clinical genomics programs. Three interconnected apps within the ecosystem-Genomics Platform, Pediatric Cancer Knowledgebase, and Visualization Community-enable simultaneously performing advanced data analysis in the cloud and enhancing the Pediatric Cancer knowledgebase. We demonstrate the value of the ecosystem through use cases that classify 135 pediatric cancer subtypes by gene expression profiling and map mutational signatures across 35 pediatric cancer subtypes. SIGNIFICANCE: To advance research and treatment of pediatric cancer, we developed St. Jude Cloud, a data-sharing ecosystem for accessing >1.2 petabytes of raw genomic data from >10,000 pediatric patients and survivors, innovative analysis workflows, integrative multiomics visualizations, and a knowledgebase of published data contributed by the global pediatric cancer community.This article is highlighted in the In This Issue feature, p. 995.


Subject(s)
Anemia, Sickle Cell/genetics , Cloud Computing , Genomics , Information Dissemination , Neoplasms/genetics , Child , Ecosystem , Hospitals, Pediatric , Humans
10.
Nat Commun ; 11(1): 5183, 2020 10 14.
Article in English | MEDLINE | ID: mdl-33056981

ABSTRACT

Neuroblastoma is a pediatric malignancy with heterogeneous clinical outcomes. To better understand neuroblastoma pathogenesis, here we analyze whole-genome, whole-exome and/or transcriptome data from 702 neuroblastoma samples. Forty percent of samples harbor at least one recurrent driver gene alteration and most aberrations, including MYCN, ATRX, and TERT alterations, differ in frequency by age. MYCN alterations occur at median 2.3 years of age, TERT at 3.8 years, and ATRX at 5.6 years. COSMIC mutational signature 18, previously associated with reactive oxygen species, is the most common cause of driver point mutations in neuroblastoma, including most ALK and Ras-activating variants. Signature 18 appears early and is continuous throughout disease evolution. Signature 18 is enriched in neuroblastomas with MYCN amplification, 17q gain, and increased expression of mitochondrial ribosome and electron transport-associated genes. Recurrent FGFR1 variants in six patients, and ALK N-terminal structural alterations in five samples, identify additional patients potentially amenable to precision therapy.


Subject(s)
Biomarkers, Tumor/genetics , Gene Expression Regulation, Neoplastic , Neuroblastoma/genetics , Adolescent , Adult , Age Factors , Anaplastic Lymphoma Kinase/genetics , Child , Child, Preschool , Cohort Studies , DNA Copy Number Variations , DNA Mutational Analysis , Datasets as Topic , Electron Transport/genetics , Exome/genetics , Female , Humans , Infant , Infant, Newborn , Male , Mitochondrial Ribosomes , Mutation , Neuroblastoma/pathology , Receptor, Fibroblast Growth Factor, Type 1/genetics , Ribosomal Proteins/genetics , Transcriptome/genetics , Whole Genome Sequencing , Young Adult
11.
Sci Rep ; 8(1): 1511, 2018 01 24.
Article in English | MEDLINE | ID: mdl-29367592

ABSTRACT

Atopic asthma is a persistent disease characterized by intermittent wheeze and progressive loss of lung function. The disease is thought to be driven primarily by chronic aeroallergen-induced type 2-associated inflammation. However, the vast majority of atopics do not develop asthma despite ongoing aeroallergen exposure, suggesting additional mechanisms operate in conjunction with type 2 immunity to drive asthma pathogenesis. We employed RNA-Seq profiling of sputum-derived cells to identify gene networks operative at baseline in house dust mite-sensitized (HDMS) subjects with/without wheezing history that are characteristic of the ongoing asthmatic state. The expression of type 2 effectors (IL-5, IL-13) was equivalent in both cohorts of subjects. However, in HDMS-wheezers they were associated with upregulation of two coexpression modules comprising multiple type 2- and epithelial-associated genes. The first module was interlinked by the hubs EGFR, ERBB2, CDH1 and IL-13. The second module was associated with CDHR3 and mucociliary clearance genes. Our findings provide new insight into the molecular mechanisms operative at baseline in the airway mucosa in atopic asthmatics undergoing natural aeroallergen exposure, and suggest that susceptibility to asthma amongst these subjects involves complex interactions between type 2- and epithelial-associated gene networks, which are not operative in equivalently sensitized/exposed atopic non-asthmatics.


Subject(s)
Allergens/metabolism , Asthma/pathology , Epithelial Cells/drug effects , Gene Expression Regulation/drug effects , Gene Regulatory Networks , Sputum/cytology , Female , Gene Expression Profiling , Humans , Male , Sequence Analysis, RNA
12.
Hum Mutat ; 28(7): 654-9, 2007 Jul.
Article in English | MEDLINE | ID: mdl-17370309

ABSTRACT

Autosomal dominant polycystic kidney disease (ADPKD) arises from mutations in the PKD1 and PKD2 genes. The Polycystic Kidney Disease Mutation Database (PKDB) is an internet-accessible relational database containing comprehensive information about germline and somatic disease-causing variants within these two genes, as well as polymorphisms and variants of indeterminate pathogenicity. The PKDB database structure incorporates an interface between these gene variant data and any associated patient clinical data. An initiative of the Polycystic Kidney Disease Foundation, PKDB is a publicly accessible database that aims to streamline the evaluation of PKD1 and PKD2 gene variants detected in samples from those with ADPKD, as well as to assist ongoing clinical and molecular research in the field. As the accurate reporting of nucleotide variants is essential for ensuring the quality of data within PKDB, a mutation checker has been mounted on the PKDB server allowing contributors to assess the accuracy of their PKD1 and PKD2 variant reports. Researchers and clinicians may submit their PKD1/PKD2 gene variants and any associated deidentified clinical data via standardized downloadable data entry forms accessible through the PKDB site. PKDB has been launched with the full details of PKD1 and PKD2 gene variant reports published in 73 peer-reviewed articles. Through a series of user-friendly advanced search facilities, users are able to query the database as required. The PKDB server is accessible at http://pkdb.mayo.edu.


Subject(s)
Databases, Genetic , Genes, Dominant , Polycystic Kidney, Autosomal Dominant/genetics , TRPP Cation Channels/metabolism , Humans , Software , User-Computer Interface
13.
PLoS One ; 7(9): e46160, 2012.
Article in English | MEDLINE | ID: mdl-23049965

ABSTRACT

Host cell infection by apicomplexan parasites plays an essential role in lifecycle progression for these obligate intracellular pathogens. For most species, including the etiological agents of malaria and toxoplasmosis, infection requires active host-cell invasion dependent on formation of a tight junction - the organising interface between parasite and host cell during entry. Formation of this structure is not, however, shared across all Apicomplexa or indeed all parasite lifecycle stages. Here, using an in silico integrative genomic search and endogenous gene-tagging strategy, we sought to characterise proteins that function specifically during junction-dependent invasion, a class of proteins we term invasins to distinguish them from adhesins that function in species specific host-cell recognition. High-definition imaging of tagged Plasmodium falciparum invasins localised proteins to multiple cellular compartments of the blood stage merozoite. This includes several that localise to distinct subcompartments within the rhoptries. While originating from the same organelle, however, each has very different dynamics during invasion. Apical Sushi Protein and Rhoptry Neck protein 2 release early, following the junction, whilst a novel rhoptry protein PFF0645c releases only after invasion is complete. This supports the idea that organisation of proteins within a secretory organelle determines the order and destination of protein secretion and provides a localisation-based classification strategy for predicting invasin function during apicomplexan parasite invasion.


Subject(s)
Erythrocytes/parasitology , Malaria/parasitology , Organelles/metabolism , Plasmodium falciparum/metabolism , Plasmodium falciparum/pathogenicity , Protozoan Proteins/metabolism , Blotting, Western , Fluorescent Antibody Technique , Host-Parasite Interactions , Humans , Microscopy, Immunoelectron , Organelles/ultrastructure
14.
Plant Cell ; 16(1): 241-56, 2004 Jan.
Article in English | MEDLINE | ID: mdl-14671022

ABSTRACT

A novel insight into Arabidopsis mitochondrial function was revealed from a large experimental proteome derived by liquid chromatography-tandem mass spectrometry. Within the experimental set of 416 identified proteins, a significant number of low-abundance proteins involved in DNA synthesis, transcriptional regulation, protein complex assembly, and cellular signaling were discovered. Nearly 20% of the experimentally identified proteins are of unknown function, suggesting a wealth of undiscovered mitochondrial functions in plants. Only approximately half of the experimental set is predicted to be mitochondrial by targeting prediction programs, allowing an assessment of the benefits and limitations of these programs in determining plant mitochondrial proteomes. Maps of putative orthology networks between yeast, human, and Arabidopsis mitochondrial proteomes and the Rickettsia prowazekii proteome provide detailed insights into the divergence of the plant mitochondrial proteome from those of other eukaryotes. These show a clear set of putative cross-species orthologs in the core metabolic functions of mitochondria, whereas considerable diversity exists in many signaling and regulatory functions.


Subject(s)
Arabidopsis/metabolism , Mitochondrial Proteins/metabolism , Proteome/metabolism , Signal Transduction/physiology , Arabidopsis/genetics , Arabidopsis Proteins/analysis , Arabidopsis Proteins/metabolism , Carrier Proteins/metabolism , Chloroplasts/metabolism , Computational Biology , DNA, Plant/genetics , DNA, Plant/metabolism , Databases, Factual , Electron Transport Chain Complex Proteins/metabolism , Humans , Mass Spectrometry , Mitochondrial Proteins/analysis , Peroxisomes/metabolism , Proteome/analysis , RNA, Plant/genetics , RNA, Plant/metabolism
SELECTION OF CITATIONS
SEARCH DETAIL