Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 37
Filtrar
1.
Lancet Oncol ; 24(10): 1147-1156, 2023 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-37797633

RESUMO

BACKGROUND: Carriers of cancer predisposing variants are at an increased risk of developing subsequent malignant neoplasms among those who have survived childhood cancer. We aimed to investigate whether cancer predisposing variants contribute to the risk of subsequent malignant neoplasm-related late mortality (5 years or more after diagnosis). METHODS: In this analysis, data were included from two retrospective cohort studies, St Jude Lifetime Cohort (SJLIFE) and the Childhood Cancer Survivor Study (CCSS), with prospective follow-up of patients who were alive for at least 5 years after diagnosis with childhood cancer (ie, long-term childhood cancer survivors) with corresponding germline whole genome or whole exome sequencing data. Cancer predisposing variants affecting 60 genes associated with well-established autosomal-dominant cancer-predisposition syndromes were characterised. Subsequent malignant neoplasms were graded using the National Cancer Institute Common Terminology Criteria for Adverse Events (CTCAE) version 4.03 with modifications. Cause-specific late mortality was based on linkage with the US National Death Index and systematic cohort follow up. Fine-Gray subdistribution hazard models were used to estimate subsequent malignant neoplasm-related late mortality starting from the first biospecimen collection, treating non-subsequent malignant neoplasm-related deaths as a competing risk, adjusting for genetic ancestry, sex, age at diagnosis, and cancer treatment exposures. SJLIFE (NCT00760656) and CCSS (NCT01120353) are registered with ClinicalTrials.gov. FINDINGS: 12 469 (6172 male and 6297 female) participants were included, 4402 from the SJLIFE cohort (median follow-up time since collection of the first biospecimen 7·4 years [IQR 3·1-9·4]) and 8067 from the CCSS cohort (median follow-up time since collection of the first biospecimen 12·6 years [2·2-16·6]). 641 (5·1%) of 12 469 participants carried cancer predisposing variants (294 [6·7%] in the SJLIFE cohort and 347 [4·3%] in the CCSS cohort), which were significantly associated with an increased severity of subsequent malignant neoplasms (CTCAE grade ≥4 vs grade <4: odds ratio 2·15, 95% CI 1·18-4·19, p=0·0085). 263 (2·1%) subsequent malignant neoplasm-related deaths (44 [1·0%] in the SJLIFE cohort; and 219 [2·7%] in the CCSS cohort) and 426 (3·4%) other-cause deaths (103 [2·3%] in SJLIFE; and 323 [4·0%] in CCSS) occurred. Cumulative subsequent malignant neoplasm-related mortality at 10 years after the first biospecimen collection in carriers of cancer predisposing variants was 3·7% (95% CI 1·2-8·5) in SJLIFE and 6·9% (4·1-10·7) in CCSS versus 1·5% (1·0-2·1) in SJLIFE and 2·1% (1·7-2·5) in CCSS in non-carriers. Carrying a cancer predisposing variant was associated with an increased risk of subsequent malignant neoplasm-related mortality (SJLIFE: subdistribution hazard ratio 3·40 [95% CI 1·37-8·43]; p=0·0082; CCSS: 3·58 [2·27-5·63]; p<0·0001). INTERPRETATION: Identifying participants at increased risk of subsequent malignant neoplasms via genetic counselling and clinical genetic testing for cancer predisposing variants and implementing early personalised cancer surveillance and prevention strategies might reduce the substantial subsequent malignant neoplasm-related mortality burden. FUNDING: American Lebanese Syrian Associated Charities and US National Institutes of Health.


Assuntos
Sobreviventes de Câncer , Neoplasias , Criança , Humanos , Masculino , Feminino , Neoplasias/patologia , Estudos Retrospectivos , Seguimentos , Estudos Prospectivos , Fatores de Risco
2.
Nat Commun ; 14(1): 6008, 2023 09 28.
Artigo em Inglês | MEDLINE | ID: mdl-37770423

RESUMO

Fusion oncoproteins (FOs) arise from chromosomal translocations in ~17% of cancers and are often oncogenic drivers. Although some FOs can promote oncogenesis by undergoing liquid-liquid phase separation (LLPS) to form aberrant biomolecular condensates, the generality of this phenomenon is unknown. We explored this question by testing 166 FOs in HeLa cells and found that 58% formed condensates. The condensate-forming FOs displayed physicochemical features distinct from those of condensate-negative FOs and segregated into distinct feature-based groups that aligned with their sub-cellular localization and biological function. Using Machine Learning, we developed a predictor of FO condensation behavior, and discovered that 67% of ~3000 additional FOs likely form condensates, with 35% of those predicted to function by altering gene expression. 47% of the predicted condensate-negative FOs were associated with cell signaling functions, suggesting a functional dichotomy between condensate-positive and -negative FOs. Our Datasets and reagents are rich resources to interrogate FO condensation in the future.


Assuntos
Condensados Biomoleculares , Proteínas de Fusão Oncogênica , Humanos , Células HeLa , Carcinogênese , Transformação Celular Neoplásica
3.
Bioinformatics ; 38(2): 549-551, 2022 01 03.
Artigo em Inglês | MEDLINE | ID: mdl-34431982

RESUMO

SUMMARY: Small insertions and deletions (indels) in nucleotide sequence may be represented differently between mapping algorithms and variant callers, or in the flanking sequence context. Representational ambiguity is especially profound for complex indels, complicating comparisons between multiple mappings and call sets. Complex indels may additionally suffer from incomplete allele representation, potentially leading to critical misannotation of variant effect. We present indelPost, a Python library that harmonizes these ambiguities for simple and complex indels via realignment and read-based phasing. We demonstrate that indelPost enables accurate analysis of ambiguous data and can derive the correct complex indel alleles from the simple indel predictions provided by standard small variant detectors, with improved performance over a specialized tool for complex indel analysis. AVAILABILITY AND IMPLEMENTATION: indelPost is freely available at: https://github.com/stjude/indelPost. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Algoritmos , Software , Análise de Sequência de DNA , Mutação INDEL , Biblioteca Gênica
4.
BMC Cancer ; 21(1): 1233, 2021 Nov 17.
Artigo em Inglês | MEDLINE | ID: mdl-34789196

RESUMO

BACKGROUND: RNA editing leads to post-transcriptional variation in protein sequences and has important biological implications. We sought to elucidate the landscape of RNA editing events across pediatric cancers. METHODS: Using RNA-Seq data mapped by a pipeline designed to minimize mapping ambiguity, we investigated RNA editing in 711 pediatric cancers from the St. Jude/Washington University Pediatric Cancer Genome Project focusing on coding variants which can potentially increase protein sequence diversity. We combined de novo detection using paired tumor DNA-RNA data with analysis of known RNA editing sites. RESULTS: We identified 722 unique RNA editing sites in coding regions across pediatric cancers, 70% of which were nonsynonymous recoding variants. Nearly all editing sites represented the canonical A-to-I (n = 706) or C-to-U sites (n = 14). RNA editing was enriched in brain tumors compared to other cancers, including editing of glutamate receptors and ion channels involved in neurotransmitter signaling. RNA editing profiles of each pediatric cancer subtype resembled those of the corresponding normal tissue profiled by the Genotype-Tissue Expression (GTEx) project. CONCLUSIONS: In this first comprehensive analysis of RNA editing events in pediatric cancer, we found that the RNA editing profile of each cancer subtype is similar to its normal tissue of origin. Tumor-specific RNA editing events were not identified indicating that successful immunotherapeutic targeting of RNA-edited peptides in pediatric cancer should rely on increased antigen presentation on tumor cells compared to normal but not on tumor-specific RNA editing per se.


Assuntos
Neoplasias/genética , Edição de RNA , Análise de Sequência de RNA/métodos , Neoplasias Encefálicas/genética , Criança , DNA de Neoplasias , Humanos , Imunoterapia , Neoplasias/metabolismo , Neoplasias/terapia , Fases de Leitura Aberta , Especificidade de Órgãos , RNA Neoplásico , Sequenciamento Completo do Genoma
5.
Cancer Discov ; 11(12): 3008-3027, 2021 12 01.
Artigo em Inglês | MEDLINE | ID: mdl-34301788

RESUMO

Genomic studies of pediatric cancer have primarily focused on specific tumor types or high-risk disease. Here, we used a three-platform sequencing approach, including whole-genome sequencing (WGS), whole-exome sequencing (WES), and RNA sequencing (RNA-seq), to examine tumor and germline genomes from 309 prospectively identified children with newly diagnosed (85%) or relapsed/refractory (15%) cancers, unselected for tumor type. Eighty-six percent of patients harbored diagnostic (53%), prognostic (57%), therapeutically relevant (25%), and/or cancer-predisposing (18%) variants. Inclusion of WGS enabled detection of activating gene fusions and enhancer hijacks (36% and 8% of tumors, respectively), small intragenic deletions (15% of tumors), and mutational signatures revealing of pathogenic variant effects. Evaluation of paired tumor-normal data revealed relevance to tumor development for 55% of pathogenic germline variants. This study demonstrates the power of a three-platform approach that incorporates WGS to interrogate and interpret the full range of genomic variants across newly diagnosed as well as relapsed/refractory pediatric cancers. SIGNIFICANCE: Pediatric cancers are driven by diverse genomic lesions, and sequencing has proven useful in evaluating high-risk and relapsed/refractory cases. We show that combined WGS, WES, and RNA-seq of tumor and paired normal tissues enables identification and characterization of genetic drivers across the full spectrum of pediatric cancers. This article is highlighted in the In This Issue feature, p. 2945.


Assuntos
Neoplasias , Criança , DNA , Humanos , Mutação , Neoplasias/genética , Análise de Sequência de RNA , Sequenciamento do Exoma
6.
Cancer Cell ; 39(1): 83-95.e4, 2021 01 11.
Artigo em Inglês | MEDLINE | ID: mdl-33434514

RESUMO

GenomePaint (https://genomepaint.stjude.cloud/) is an interactive visualization platform for whole-genome, whole-exome, transcriptome, and epigenomic data of tumor samples. Its design captures the inter-relatedness between DNA variations and RNA expression, supporting in-depth exploration of both individual cancer genomes and full cohorts. Regulatory non-coding variants can be inspected and analyzed along with coding variants, and their functional impact further explored by examining 3D genome data from cancer cell lines. Further, GenomePaint correlates mutation and expression patterns with patient outcomes, and supports custom data upload. We used GenomePaint to unveil aberrant splicing that disrupts the RING domain of CREBBP, discover cis activation of the MYC oncogene by duplication of the NOTCH1-MYC enhancer in B-lineage acute lymphoblastic leukemia, and explore the inter- and intra-tumor heterogeneity at EGFR in adult glioblastomas. These examples demonstrate that deep multi-omics exploration of individual cancer genomes enabled by GenomePaint can lead to biological insights for follow-up validation.


Assuntos
Biologia Computacional/métodos , Perfilação da Expressão Gênica/métodos , Variação Genética , Neoplasias/genética , Adulto , Linhagem Celular Tumoral , Criança , Bases de Dados Genéticas , Epigênese Genética , Feminino , Regulação Neoplásica da Expressão Gênica , Humanos , Masculino , Interface Usuário-Computador , Sequenciamento do Exoma , Sequenciamento Completo do Genoma
7.
Cancer Discov ; 11(5): 1082-1099, 2021 05.
Artigo em Inglês | MEDLINE | ID: mdl-33408242

RESUMO

Effective data sharing is key to accelerating research to improve diagnostic precision, treatment efficacy, and long-term survival in pediatric cancer and other childhood catastrophic diseases. We present St. Jude Cloud (https://www.stjude.cloud), a cloud-based data-sharing ecosystem for accessing, analyzing, and visualizing genomic data from >10,000 pediatric patients with cancer and long-term survivors, and >800 pediatric sickle cell patients. Harmonized genomic data totaling 1.25 petabytes are freely available, including 12,104 whole genomes, 7,697 whole exomes, and 2,202 transcriptomes. The resource is expanding rapidly, with regular data uploads from St. Jude's prospective clinical genomics programs. Three interconnected apps within the ecosystem-Genomics Platform, Pediatric Cancer Knowledgebase, and Visualization Community-enable simultaneously performing advanced data analysis in the cloud and enhancing the Pediatric Cancer knowledgebase. We demonstrate the value of the ecosystem through use cases that classify 135 pediatric cancer subtypes by gene expression profiling and map mutational signatures across 35 pediatric cancer subtypes. SIGNIFICANCE: To advance research and treatment of pediatric cancer, we developed St. Jude Cloud, a data-sharing ecosystem for accessing >1.2 petabytes of raw genomic data from >10,000 pediatric patients and survivors, innovative analysis workflows, integrative multiomics visualizations, and a knowledgebase of published data contributed by the global pediatric cancer community.This article is highlighted in the In This Issue feature, p. 995.


Assuntos
Anemia Falciforme/genética , Computação em Nuvem , Genômica , Disseminação de Informação , Neoplasias/genética , Criança , Ecossistema , Hospitais Pediátricos , Humanos
8.
Blood Cancer Discov ; 1(1): 96-111, 2020 07.
Artigo em Inglês | MEDLINE | ID: mdl-32793890

RESUMO

Relapse of acute lymphoblastic leukemia (ALL) remains a leading cause of childhood death. Prior studies have shown clonal mutations at relapse often arise from relapse-fated subclones that exist at diagnosis. However, the genomic landscape, evolutionary trajectories and mutational mechanisms driving relapse are incompletely understood. In an analysis of 92 cases of relapsed childhood ALL, incorporating multimodal DNA and RNA sequencing, deep digital mutational tracking and xenografting to formally define clonal structure, we identify 50 significant targets of mutation with distinct patterns of mutational acquisition or enrichment. CREBBP, NOTCH1, and Ras signaling mutations rose from diagnosis subclones, whereas variants in NCOR2, USH2A and NT5C2 were exclusively observed at relapse. Evolutionary modeling and xenografting demonstrated that relapse-fated clones were minor (50%), major (27%) or multiclonal (18%) at diagnosis. Putative second leukemias, including those with lineage shift, were shown to most commonly represent relapse from an ancestral clone rather than a truly independent second primary leukemia. A subset of leukemias prone to repeated relapse exhibited hypermutation driven by at least three distinct mutational processes, resulting in heightened neoepitope burden and potential vulnerability to immunotherapy. Finally, relapse-driving sequence mutations were detected prior to relapse using deep digital PCR at levels comparable to orthogonal approaches to monitor levels of measurable residual disease. These results provide a genomic framework to anticipate and circumvent relapse by earlier detection and targeting of relapse-fated clones.


Assuntos
Evolução Clonal , Leucemia-Linfoma Linfoblástico de Células Precursoras , Criança , Evolução Clonal/genética , Genômica , Humanos , Mutação , Leucemia-Linfoma Linfoblástico de Células Precursoras/genética , Recidiva
9.
Nat Genet ; 52(8): 811-818, 2020 08.
Artigo em Inglês | MEDLINE | ID: mdl-32632335

RESUMO

We developed cis-X, a computational method for discovering regulatory noncoding variants in cancer by integrating whole-genome and transcriptome sequencing data from a single cancer sample. cis-X first finds aberrantly cis-activated genes that exhibit allele-specific expression accompanied by an elevated outlier expression. It then searches for causal noncoding variants that may introduce aberrant transcription factor binding motifs or enhancer hijacking by structural variations. Analysis of 13 T-lineage acute lymphoblastic leukemias identified a recurrent intronic variant predicted to cis-activate the TAL1 oncogene, a finding validated in vivo by chromatin immunoprecipitation sequencing of a patient-derived xenograft. Candidate oncogenes include the prolactin receptor PRLR activated by a focal deletion that removes a CTCF-insulated neighborhood boundary. cis-X may be applied to pediatric and adult solid tumors that are aneuploid and heterogeneous. In contrast to existing approaches, which require large sample cohorts, cis-X enables the discovery of regulatory noncoding variants in individual cancer genomes.


Assuntos
Elementos Facilitadores Genéticos/genética , Variação Genética/genética , Leucemia-Linfoma Linfoblástico de Células Precursoras/genética , RNA não Traduzido/genética , Adolescente , Alelos , Criança , Pré-Escolar , Cromatina/genética , Feminino , Regulação Neoplásica da Expressão Gênica/genética , Humanos , Masculino , Oncogenes/genética , Transcrição Gênica/genética
11.
J Clin Oncol ; 38(24): 2728-2740, 2020 08 20.
Artigo em Inglês | MEDLINE | ID: mdl-32496904

RESUMO

PURPOSE: To investigate cancer treatment plus pathogenic germline mutations (PGMs) in DNA repair genes (DRGs) for identification of childhood cancer survivors at increased risk of subsequent neoplasms (SNs). METHODS: Whole-genome sequencing was performed on blood-derived DNA from survivors in the St Jude Lifetime Cohort. PGMs were evaluated in 127 genes from 6 major DNA repair pathways. Cumulative doses of chemotherapy and body region-specific radiotherapy (RT) were abstracted from medical records. Relative rates (RRs) and 95% CIs of SNs by mutation status were estimated using multivariable piecewise exponential models. RESULTS: Of 4,402 survivors, 495 (11.2%) developed 1,269 SNs. We identified 538 PGMs in 98 DRGs (POLG, MUTYH, ERCC2, and BRCA2, among others) in 508 (11.5%) survivors. Mutations in homologous recombination (HR) genes were significantly associated with an increased rate of subsequent female breast cancer (RR, 3.7; 95% CI, 1.8 to 7.7), especially among survivors with chest RT ≥ 20 Gy (RR, 4.4; 95% CI, 1.6 to 12.4), or with a cumulative dose of anthracyclines in the second or third tertile (RR, 4.4; 95% CI, 1.7 to 11.4). Mutations in HR genes were also associated with an increased rate of subsequent sarcoma among those who received alkylating agent doses in the third tertile (RR, 14.9; 95% CI, 4.0 to 38.0). Mutations in nucleotide excision repair genes were associated with subsequent thyroid cancer for those treated with neck RT ≥ 30 Gy (RR, 12.9; 95% CI, 1.6 to 46.6) with marginal statistical significance. CONCLUSION: Our study provides novel insights regarding the contribution of genetics, in combination with known treatment-related risks, for the development of SNs. These findings have the potential to facilitate identification of high-risk survivors who may benefit from genetic counseling and/or testing of DRGs, which may further inform personalized cancer surveillance and prevention strategies.


Assuntos
Sequenciamento do Exoma/métodos , Mutação em Linhagem Germinativa/genética , Neoplasias/tratamento farmacológico , Neoplasias/genética , Adulto , Sobreviventes de Câncer , Criança , Pré-Escolar , Estudos de Coortes , Reparo do DNA , Feminino , Humanos , Masculino , Neoplasias/mortalidade , Estudos Retrospectivos , Sobreviventes , Adulto Jovem
12.
Genome Biol ; 21(1): 126, 2020 05 28.
Artigo em Inglês | MEDLINE | ID: mdl-32466770

RESUMO

To discover driver fusions beyond canonical exon-to-exon chimeric transcripts, we develop CICERO, a local assembly-based algorithm that integrates RNA-seq read support with extensive annotation for candidate ranking. CICERO outperforms commonly used methods, achieving a 95% detection rate for 184 independently validated driver fusions including internal tandem duplications and other non-canonical events in 170 pediatric cancer transcriptomes. Re-analysis of TCGA glioblastoma RNA-seq unveils previously unreported kinase fusions (KLHL7-BRAF) and a 13% prevalence of EGFR C-terminal truncation. Accessible via standard or cloud-based implementation, CICERO enhances driver fusion detection for research and precision oncology. The CICERO source code is available at https://github.com/stjude/Cicero.


Assuntos
Fusão Gênica , Anotação de Sequência Molecular/métodos , Neoplasias/genética , Software , Algoritmos , Humanos , Análise de Sequência de RNA
13.
Cancer Discov ; 10(4): 568-587, 2020 04.
Artigo em Inglês | MEDLINE | ID: mdl-32086311

RESUMO

Disease recurrence causes significant mortality in B-progenitor acute lymphoblastic leukemia (B-ALL). Genomic analysis of matched diagnosis and relapse samples shows relapse often arising from minor diagnosis subclones. However, why therapy eradicates some subclones while others survive and progress to relapse remains obscure. Elucidation of mechanisms underlying these differing fates requires functional analysis of isolated subclones. Here, large-scale limiting dilution xenografting of diagnosis and relapse samples, combined with targeted sequencing, identified and isolated minor diagnosis subclones that initiate an evolutionary trajectory toward relapse [termed diagnosis Relapse Initiating clones (dRI)]. Compared with other diagnosis subclones, dRIs were drug-tolerant with distinct engraftment and metabolic properties. Transcriptionally, dRIs displayed enrichment for chromatin remodeling, mitochondrial metabolism, proteostasis programs, and an increase in stemness pathways. The isolation and characterization of dRI subclones reveals new avenues for eradicating dRI cells by targeting their distinct metabolic and transcriptional pathways before further evolution renders them fully therapy-resistant. SIGNIFICANCE: Isolation and characterization of subclones from diagnosis samples of patients with B-ALL who relapsed showed that relapse-fated subclones had increased drug tolerance and distinct metabolic and survival transcriptional programs compared with other diagnosis subclones. This study provides strategies to identify and target clinically relevant subclones before further evolution toward relapse.


Assuntos
Leucemia Mieloide Aguda/genética , Células Clonais , Feminino , Humanos , Masculino , Recidiva
14.
Clin Cancer Res ; 26(10): 2362-2371, 2020 05 15.
Artigo em Inglês | MEDLINE | ID: mdl-31969337

RESUMO

PURPOSE: We aimed to analyze and compare leukocyte telomere length (LTL) and age-dependent LTL attrition between childhood cancer survivors and noncancer controls, and to evaluate the associations of LTL with treatment exposures, chronic health conditions (CHC), and health behaviors among survivors. EXPERIMENTAL DESIGN: We included 2,427 survivors and 293 noncancer controls of European ancestry, drawn from the participants in St. Jude Lifetime Cohort Study (SJLIFE), a retrospective hospital-based study with prospective follow-up (2007-2016). Common nonneoplastic CHCs (59 types) and subsequent malignant neoplasms (5 types) were clinically assessed. LTL was measured with whole-genome sequencing data. RESULTS: After adjusting for age at DNA sampling, gender, genetic risk score based on 9 SNPs known to be associated with telomere length, and eigenvectors, LTL among survivors was significantly shorter both overall [adjusted mean (AM) = 6.20 kb; SE = 0.03 kb] and across diagnoses than controls (AM = 6.69 kb; SE = 0.07 kb). Among survivors, specific treatment exposures associated with shorter LTL included chest or abdominal irradiation, glucocorticoid, and vincristine chemotherapies. Significant negative associations of LTL with 14 different CHCs, and a positive association with subsequent thyroid cancer occurring out of irradiation field were identified. Health behaviors were significantly associated with LTL among survivors aged 18 to 35 years (P trend = 0.03). CONCLUSIONS: LTL is significantly shorter among childhood cancer survivors than noncancer controls, and is associated with CHCs and health behaviors, suggesting LTL as an aging biomarker may be a potential mechanistic target for future intervention studies designed to prevent or delay onset of CHCs in childhood cancer survivors.See related commentary by Walsh, p. 2281.


Assuntos
Sobreviventes de Câncer , Neoplasias , Adolescente , Adulto , Criança , Estudos de Coortes , Humanos , Leucócitos , Neoplasias/epidemiologia , Neoplasias/genética , Prevalência , Estudos Prospectivos , Estudos Retrospectivos , Sobreviventes , Telômero/genética , Adulto Jovem
15.
J Natl Cancer Inst ; 112(7): 756-764, 2020 07 01.
Artigo em Inglês | MEDLINE | ID: mdl-31647544

RESUMO

BACKGROUND: We aimed to systematically evaluate telomere dynamics across a spectrum of pediatric cancers, search for underlying molecular mechanisms, and assess potential prognostic value. METHODS: The fraction of telomeric reads was determined from whole-genome sequencing data for paired tumor and normal samples from 653 patients with 23 cancer types from the Pediatric Cancer Genome Project. Telomere dynamics were characterized as the ratio of telomere fractions between tumor and normal samples. Somatic mutations were gathered, RNA sequencing data for 330 patients were analyzed for gene expression, and Cox regression was used to assess the telomere dynamics on patient survival. RESULTS: Telomere lengthening was observed in 28.7% of solid tumors, 10.5% of brain tumors, and 4.3% of hematological cancers. Among 81 samples with telomere lengthening, 26 had somatic mutations in alpha thalassemia/mental retardation syndrome X-linked gene, corroborated by a low level of the gene expression in the subset of tumors with RNA sequencing. Telomerase reverse transcriptase gene amplification and/or activation was observed in 10 tumors with telomere lengthening, including two leukemias of the E2A-PBX1 subtype. Among hematological cancers, pathway analysis for genes with expressions most negatively correlated with telomere fractions suggests the implication of a gene ontology process of antigen presentation by Major histocompatibility complex class II. A higher ratio of telomere fractions was statistically significantly associated with poorer survival for patients with brain tumors (hazard ratio = 2.18, 95% confidence interval = 1.37 to 3.46). CONCLUSION: Because telomerase inhibitors are currently being explored as potential agents to treat pediatric cancer, these data are valuable because they identify a subpopulation of patients with reactivation of telomerase who are most likely to benefit from this novel therapeutic option.


Assuntos
Neoplasias/genética , Homeostase do Telômero , Encurtamento do Telômero , Telômero/genética , Adolescente , Criança , Pré-Escolar , Feminino , Regulação Neoplásica da Expressão Gênica , Humanos , Masculino , Mutação , Telomerase/genética , Sequenciamento Completo do Genoma
16.
Bioinformatics ; 36(5): 1382-1390, 2020 03 01.
Artigo em Inglês | MEDLINE | ID: mdl-31593214

RESUMO

MOTIVATION: Reliable identification of expressed somatic insertions/deletions (indels) is an unmet need due to artifacts generated in PCR-based RNA-Seq library preparation and the lack of normal RNA-Seq data, presenting analytical challenges for discovery of somatic indels in tumor transcriptome. RESULTS: We present RNAIndel, a tool for predicting somatic, germline and artifact indels from tumor RNA-Seq data. RNAIndel leverages features derived from indel sequence context and biological effect in a machine-learning framework. Except for tumor samples with microsatellite instability, RNAIndel robustly predicts 88-100% of somatic indels in five diverse test datasets of pediatric and adult cancers, even recovering subclonal (VAF range 0.01-0.15) driver indels missed by targeted deep-sequencing, outperforming the current best-practice for RNA-Seq variant calling which had 57% sensitivity but with 14 times more false positives. AVAILABILITY AND IMPLEMENTATION: RNAIndel is freely available at https://github.com/stjude/RNAIndel. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Neoplasias/genética , RNA-Seq , Criança , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Mutação INDEL , Software , Sequenciamento do Exoma
17.
Blood ; 135(1): 41-55, 2020 01 02.
Artigo em Inglês | MEDLINE | ID: mdl-31697823

RESUMO

To study the mechanisms of relapse in acute lymphoblastic leukemia (ALL), we performed whole-genome sequencing of 103 diagnosis-relapse-germline trios and ultra-deep sequencing of 208 serial samples in 16 patients. Relapse-specific somatic alterations were enriched in 12 genes (NR3C1, NR3C2, TP53, NT5C2, FPGS, CREBBP, MSH2, MSH6, PMS2, WHSC1, PRPS1, and PRPS2) involved in drug response. Their prevalence was 17% in very early relapse (<9 months from diagnosis), 65% in early relapse (9-36 months), and 32% in late relapse (>36 months) groups. Convergent evolution, in which multiple subclones harbor mutations in the same drug resistance gene, was observed in 6 relapses and confirmed by single-cell sequencing in 1 case. Mathematical modeling and mutational signature analysis indicated that early relapse resistance acquisition was frequently a 2-step process in which a persistent clone survived initial therapy and later acquired bona fide resistance mutations during therapy. In contrast, very early relapses arose from preexisting resistant clone(s). Two novel relapse-specific mutational signatures, one of which was caused by thiopurine treatment based on in vitro drug exposure experiments, were identified in early and late relapses but were absent from 2540 pan-cancer diagnosis samples and 129 non-ALL relapses. The novel signatures were detected in 27% of relapsed ALLs and were responsible for 46% of acquired resistance mutations in NT5C2, PRPS1, NR3C1, and TP53. These results suggest that chemotherapy-induced drug resistance mutations facilitate a subset of pediatric ALL relapses.


Assuntos
Biomarcadores Tumorais/genética , Metotrexato/uso terapêutico , Mutagênese/efeitos dos fármacos , Mutação , Leucemia-Linfoma Linfoblástico de Células Precursoras/genética , Leucemia-Linfoma Linfoblástico de Células Precursoras/patologia , 5'-Nucleotidase/genética , Antimetabólitos Antineoplásicos/uso terapêutico , Criança , Análise Mutacional de DNA , Feminino , Seguimentos , Genômica , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Masculino , Leucemia-Linfoma Linfoblástico de Células Precursoras/tratamento farmacológico , Prognóstico , Receptores de Glucocorticoides/genética , Taxa de Sobrevida , Proteína Supressora de Tumor p53/genética
18.
Artigo em Inglês | MEDLINE | ID: mdl-31604778

RESUMO

Patients harboring germline pathogenic biallelic variants in genes involved in the recognition and repair of DNA damage are known to have a substantially increased cancer risk. Emerging evidence suggests that individuals harboring heterozygous variants in these same genes may also be at heightened, albeit lesser, risk for cancer. Herein, we sought to determine whether heterozygous variants in RECQL4, the gene encoding an essential DNA helicase that is defective in children with the autosomal recessive cancer-predisposing condition Rothmund-Thomson syndrome (RTS), are associated with increased risk for childhood cancer. To address this question, we interrogated germline sequence data from 4435 pediatric cancer patients at St. Jude Children's Research Hospital and 1127 from the National Cancer Institute Therapeutically Applicable Research to Generate Effective Treatment (TARGET) database and identified 24 (0.43%) who harbored loss-of-function (LOF) RECQL4 variants, including five of 249 (2.0%) with osteosarcoma (OS). These RECQL4 variants were significantly overrepresented in children with OS, the cancer most frequently observed in patients with RTS, as compared to 134,187 noncancer controls in the Genome Aggregation Database (gnomAD v2.1; P = 0.00087, odds ratio [OR] = 7.1, 95% CI, 2.9-17). Nine of the 24 (38%) individuals possessed the same c.1573delT (p.Cys525Alafs) variant located in the highly conserved DNA helicase domain, suggesting that disruption of this domain is central to oncogenesis. Altogether these data expand our understanding of the genetic factors predisposing to childhood cancer and reveal a novel association between heterozygous RECQL4 LOF variants and development of pediatric OS.


Assuntos
Osteossarcoma/genética , RecQ Helicases/genética , Adolescente , Criança , Feminino , Células Germinativas , Humanos , Mutação com Perda de Função/genética , Perda de Heterozigosidade/genética , Masculino , Mutação , Osteossarcoma/metabolismo , Linhagem , RecQ Helicases/metabolismo
19.
Genome Res ; 29(9): 1555-1565, 2019 09.
Artigo em Inglês | MEDLINE | ID: mdl-31439692

RESUMO

Variant interpretation in the era of massively parallel sequencing is challenging. Although many resources and guidelines are available to assist with this task, few integrated end-to-end tools exist. Here, we present the Pediatric Cancer Variant Pathogenicity Information Exchange (PeCanPIE), a web- and cloud-based platform for annotation, identification, and classification of variations in known or putative disease genes. Starting from a set of variants in variant call format (VCF), variants are annotated, ranked by putative pathogenicity, and presented for formal classification using a decision-support interface based on published guidelines from the American College of Medical Genetics and Genomics (ACMG). The system can accept files containing millions of variants and handle single-nucleotide variants (SNVs), simple insertions/deletions (indels), multiple-nucleotide variants (MNVs), and complex substitutions. PeCanPIE has been applied to classify variant pathogenicity in cancer predisposition genes in two large-scale investigations involving >4000 pediatric cancer patients and serves as a repository for the expert-reviewed results. PeCanPIE was originally developed for pediatric cancer but can be easily extended for use for nonpediatric cancers and noncancer genetic diseases. Although PeCanPIE's web-based interface was designed to be accessible to non-bioinformaticians, its back-end pipelines may also be run independently on the cloud, facilitating direct integration and broader adoption. PeCanPIE is publicly available and free for research use.


Assuntos
Biologia Computacional/métodos , Mutação em Linhagem Germinativa , Neoplasias/genética , Criança , Computação em Nuvem , Bases de Dados Genéticas , Predisposição Genética para Doença , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Interface Usuário-Computador
20.
Genome Biol ; 20(1): 50, 2019 03 14.
Artigo em Inglês | MEDLINE | ID: mdl-30867008

RESUMO

BACKGROUND: Sequencing errors are key confounding factors for detecting low-frequency genetic variants that are important for cancer molecular diagnosis, treatment, and surveillance using deep next-generation sequencing (NGS). However, there is a lack of comprehensive understanding of errors introduced at various steps of a conventional NGS workflow, such as sample handling, library preparation, PCR enrichment, and sequencing. In this study, we use current NGS technology to systematically investigate these questions. RESULTS: By evaluating read-specific error distributions, we discover that the substitution error rate can be computationally suppressed to 10-5 to 10-4, which is 10- to 100-fold lower than generally considered achievable (10-3) in the current literature. We then quantify substitution errors attributable to sample handling, library preparation, enrichment PCR, and sequencing by using multiple deep sequencing datasets. We find that error rates differ by nucleotide substitution types, ranging from 10-5 for A>C/T>G, C>A/G>T, and C>G/G>C changes to 10-4 for A>G/T>C changes. Furthermore, C>T/G>A errors exhibit strong sequence context dependency, sample-specific effects dominate elevated C>A/G>T errors, and target-enrichment PCR led to ~ 6-fold increase of overall error rate. We also find that more than 70% of hotspot variants can be detected at 0.1 ~ 0.01% frequency with the current NGS technology by applying in silico error suppression. CONCLUSIONS: We present the first comprehensive analysis of sequencing error sources in conventional NGS workflows. The error profiles revealed by our study highlight new directions for further improving NGS analysis accuracy both experimentally and computationally, ultimately enhancing the precision of deep sequencing.


Assuntos
Sequenciamento de Nucleotídeos em Larga Escala/normas , Neoplasias/genética , Reação em Cadeia da Polimerase/normas , Análise de Sequência de DNA/normas , Software , Estudos de Casos e Controles , Humanos , Mutação , Controle de Qualidade
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA