RESUMEN
BACKGROUND: Solving the structure of mRNA transcripts is a major challenge for both research and molecular diagnostic purposes. Current approaches based on short-read RNA sequencing and RT-PCR techniques cannot fully explore the complexity of transcript structure. The emergence of third-generation long-read sequencing addresses this problem by solving this sequence directly. However, genes with low expression levels are difficult to study with the whole transcriptome sequencing approach. To fix this technical limitation, we propose a novel method to capture transcripts of a gene panel using a targeted enrichment approach suitable for Pacific Biosciences and Oxford Nanopore Technologies platforms. RESULTS: We designed a set of probes to capture transcripts of a panel of genes involved in hereditary breast and ovarian cancer syndrome. We present SOSTAR (iSofOrmS annoTAtoR), a versatile pipeline to assemble, quantify and annotate isoforms from long read sequencing using a new tool specially designed for this application. The significant enrichment of transcripts by our capture protocol, together with the SOSTAR annotation, allowed the identification of 1,231 unique transcripts within the gene panel from the eight patients sequenced. The structure of these transcripts was annotated with a resolution of one base relative to a reference transcript. All major alternative splicing events of the BRCA1 and BRCA2 genes described in the literature were found. Complex splicing events such as pseudoexons were correctly annotated. SOSTAR enabled the identification of abnormal transcripts in the positive controls. In addition, a case of unexplained inheritance in a family with a history of breast and ovarian cancer was solved by identifying an SVA retrotransposon in intron 13 of the BRCA1 gene. CONCLUSIONS: We have validated a new protocol for the enrichment of transcripts of interest using probes adapted to the ONT and PacBio platforms. This protocol allows a complete description of the alternative structures of transcripts, the estimation of their expression and the identification of aberrant transcripts in a single experiment. This proof-of-concept opens new possibilities for RNA structure exploration in both research and molecular diagnostics.
Asunto(s)
Biología Computacional , Isoformas de ARN , Análisis de Secuencia de ARN , Humanos , Análisis de Secuencia de ARN/métodos , Biología Computacional/métodos , Isoformas de ARN/genética , Empalme Alternativo , Femenino , Proteína BRCA2/genética , Proteína BRCA1/genética , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Síndrome de Cáncer de Mama y Ovario Hereditario/genéticaRESUMEN
Modeling splicing is essential for tackling the challenge of variant interpretation as each nucleotide variation can be pathogenic by affecting pre-mRNA splicing via disruption/creation of splicing motifs such as 5'/3' splice sites, branch sites, or splicing regulatory elements. Unfortunately, most in silico tools focus on a specific type of splicing motif, which is why we developed the Splicing Prediction Pipeline (SPiP) to perform, in one single bioinformatic analysis based on a machine learning approach, a comprehensive assessment of the variant effect on different splicing motifs. We gathered a curated set of 4616 variants scattered all along the sequence of 227 genes, with their corresponding splicing studies. The Bayesian analysis provided us with the number of control variants, that is, variants without impact on splicing, to mimic the deluge of variants from high-throughput sequencing data. Results show that SPiP can deal with the diversity of splicing alterations, with 83.13% sensitivity and 99% specificity to detect spliceogenic variants. Overall performance as measured by area under the receiving operator curve was 0.986, better than SpliceAI and SQUIRLS (0.965 and 0.766) for the same data set. SPiP lends itself to a unique suite for comprehensive prediction of spliceogenicity in the genomic medicine era. SPiP is available at: https://sourceforge.net/projects/splicing-prediction-pipeline/.
Asunto(s)
Sitios de Empalme de ARN , Empalme del ARN , Humanos , Teorema de Bayes , Empalme del ARN/genética , Exones/genética , Sitios de Empalme de ARN/genética , Aprendizaje Automático , Intrones/genéticaRESUMEN
BACKGROUND: Branch points (BPs) map within short motifs upstream of acceptor splice sites (3'ss) and are essential for splicing of pre-mature mRNA. Several BP-dedicated bioinformatics tools, including HSF, SVM-BPfinder, BPP, Branchpointer, LaBranchoR and RNABPS were developed during the last decade. Here, we evaluated their capability to detect the position of BPs, and also to predict the impact on splicing of variants occurring upstream of 3'ss. RESULTS: We used a large set of constitutive and alternative human 3'ss collected from Ensembl (n = 264,787 3'ss) and from in-house RNAseq experiments (n = 51,986 3'ss). We also gathered an unprecedented collection of functional splicing data for 120 variants (62 unpublished) occurring in BP areas of disease-causing genes. Branchpointer showed the best performance to detect the relevant BPs upstream of constitutive and alternative 3'ss (99.48 and 65.84% accuracies, respectively). For variants occurring in a BP area, BPP emerged as having the best performance to predict effects on mRNA splicing, with an accuracy of 89.17%. CONCLUSIONS: Our investigations revealed that Branchpointer was optimal to detect BPs upstream of 3'ss, and that BPP was most relevant to predict splicing alteration due to variants in the BP area.
Asunto(s)
Intrones , Precursores del ARN , Sitios de Empalme de ARN , Empalme del ARN , Empalme Alternativo , Biología Computacional/métodos , Humanos , Motivos de Nucleótidos , Posición Específica de Matrices de Puntuación , Procesamiento Postranscripcional del ARN , Curva ROC , Reproducibilidad de los ResultadosRESUMEN
PURPOSE: Integration of gene panels in the diagnosis of hereditary breast and ovarian cancer (HBOC) requires a careful evaluation of the risk associated with pathogenic or likely pathogenic variants (PVs) detected in each gene. Here we analyzed 34 genes in 5131 suspected HBOC index cases by next-generation sequencing. METHODS: Using the Exome Aggregation Consortium data sets plus 571 individuals from the French Exome Project, we simulated the probability that an individual from the Exome Aggregation Consortium carries a PV and compared it to the estimated frequency within the HBOC population. RESULTS: Odds ratio conferred by PVs within BRCA1, BRCA2, PALB2, RAD51C, RAD51D, ATM, BRIP1, CHEK2, and MSH6 were estimated at 13.22 [10.01-17.22], 8.61 [6.78-10.82], 8.22 [4.91-13.05], 4.54 [2.55-7.48], 5.23 [1.46-13.17], 3.20 [2.14-4.53], 2.49 [1.42-3.97], 1.67 [1.18-2.27], and 2.50 [1.12-4.67], respectively. PVs within RAD51C, RAD51D, and BRIP1 were associated with ovarian cancer family history (OR = 11.36 [5.78-19.59], 12.44 [2.94-33.30] and 3.82 [1.66-7.11]). PALB2 PVs were associated with bilateral breast cancer (OR = 16.17 [5.48-34.10]) and BARD1 PVs with triple-negative breast cancer (OR = 11.27 [3.37-25.01]). Burden tests performed in both patients and the French Exome Project population confirmed the association of PVs of BRCA1, BRCA2, PALB2, and RAD51C with HBOC. CONCLUSION: Our results validate the integration of PALB2, RAD51C, and RAD51D in the diagnosis of HBOC and suggest that the other genes are involved in an oligogenic determinism.
Asunto(s)
Proteínas de Unión al ADN/genética , Proteína del Grupo de Complementación N de la Anemia de Fanconi/genética , Síndrome de Cáncer de Mama y Ovario Hereditario/genética , Adulto , Proteína BRCA1/genética , Proteína BRCA2/genética , Francia/epidemiología , Predisposición Genética a la Enfermedad , Pruebas Genéticas , Variación Genética/genética , Síndrome de Cáncer de Mama y Ovario Hereditario/diagnóstico , Síndrome de Cáncer de Mama y Ovario Hereditario/epidemiología , Síndrome de Cáncer de Mama y Ovario Hereditario/patología , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Persona de Mediana Edad , Proteínas de Neoplasias/genética , Factores de Riesgo , Secuenciación del ExomaRESUMEN
PURPOSE: The optimal application of maintenance PARP inhibitor therapy for ovarian cancer requires accessible, robust, and rapid testing of homologous recombination deficiency (HRD). However, in many countries, access to HRD testing is problematic and the failure rate is high. We developed an academic HRD test to support treatment decision-making. EXPERIMENTAL DESIGN: Genomic Instability Scar (GIScar) was developed through targeted sequencing of a 127-gene panel to determine HRD status. GIScar was trained from a noninterventional study with 250 prospectively collected ovarian tumor samples. GIScar was validated on 469 DNA tumor samples from the PAOLA-1 trial evaluating maintenance olaparib for newly diagnosed ovarian cancer, and its predictive value was compared with Myriad Genetics MyChoice (MGMC). RESULTS: GIScar showed significant correlation with MGMC HRD classification (kappa statistics: 0.780). From PAOLA-1 samples, more HRD-positive tumors were identified by GIScar (258) than MGMC (242), with a lower proportion of inconclusive results (1% vs. 9%, respectively). The HRs for progression-free survival (PFS) with olaparib versus placebo were 0.45 [95% confidence interval (CI), 0.33-0.62] in GIScar-identified HRD-positive BRCA-mutated tumors, 0.50 (95% CI, 0.31-0.80) in HRD-positive BRCA-wild-type tumors, and 1.02 (95% CI, 0.74-1.40) in HRD-negative tumors. Tumors identified as HRD positive by GIScar but HRD negative by MGMC had better PFS with olaparib (HR, 0.23; 95% CI, 0.07-0.72). CONCLUSIONS: GIScar is a valuable diagnostic tool, reliably detecting HRD and predicting sensitivity to olaparib for ovarian cancer. GIScar showed high analytic concordance with MGMC test and fewer inconclusive results. GIScar is easily implemented into diagnostic laboratories with a rapid turnaround.
Asunto(s)
Neoplasias Ováricas , Inhibidores de Poli(ADP-Ribosa) Polimerasas , Humanos , Femenino , Inhibidores de Poli(ADP-Ribosa) Polimerasas/uso terapéutico , Neoplasias Ováricas/tratamiento farmacológico , Neoplasias Ováricas/genética , Neoplasias Ováricas/patología , Ftalazinas/uso terapéutico , Inestabilidad GenómicaRESUMEN
(1) Background: In literature, approximately 20% of mCRPC present somatic DNA damage repair (DDR) gene mutations, and their relationship with response to standard therapies in mCRPC is not well understood. The objective was to evaluate outcomes of mCRPC patients treated with standard therapies according to somatic DDR status. (2) Methods: Eighty-three patients were recruited at Caen Cancer Center (France). Progression-free survival (PFS) after first-line treatment was analyzed according to somatic DDR mutation as primary endpoint. PFS according to first exposure to taxane chemotherapy and PFS2 (time to second event of disease progression) depending on therapeutic sequences were also analyzed. (3) Results: Median first-line PFS was 9.7 months in 33 mutated patients and 8.4 months in 50 non-mutated patients (p = 0.9). PFS of first exposure to taxanes was 8.1 months in mutated patients and 5.7 months in non-mutated patients (p = 0.32) and significantly longer among patients with ATM/BRCA1/BRCA2 mutations compared to the others (10.6 months vs. 5.5 months, p = 0.04). PFS2 was 16.5 months in mutated patients, whatever the sequence, and 11.7 months in non-mutated patients (p = 0.07). The mutated patients treated with chemotherapy followed by NHT had a long median PFS2 (49.8 months). (4) Conclusions: mCRPC patients with BRCA1/2 and ATM benefit from standard therapies, with a long response to taxanes.
Asunto(s)
Neoplasias de la Próstata Resistentes a la Castración , Proteínas de la Ataxia Telangiectasia Mutada/genética , Reparación del ADN/genética , Genes BRCA2 , Humanos , Masculino , Neoplasias de la Próstata Resistentes a la Castración/tratamiento farmacológico , Neoplasias de la Próstata Resistentes a la Castración/genética , Taxoides/uso terapéuticoRESUMEN
Highlighting tumoral mutations is a key step in oncology for personalizing care. Considering the genetic heterogeneity in a tumor, software used for detecting mutations should clearly distinguish real tumor events of interest that could be predictive markers for personalized medicine from false positives. OutLyzer is a new variant-caller designed for the specific and sensitive detection of mutations for research and diagnostic purposes. It is based on statistic and local evaluation of sequencing background noise to highlight potential true positive variants. 130 previously genotyped patients were sequenced after enrichment by capturing the exons of 22 genes. Sequencing data were analyzed by HaplotypeCaller, LofreqStar, Varscan2 and OutLyzer. OutLyzer had the best sensitivity and specificity with a fixed limit of detection for all tools of 1% for SNVs and 2% for Indels. OutLyzer is a useful tool for detecting mutations of interest in tumors including low allele-frequency mutations, and could be adopted in standard practice for delivering targeted therapies in cancer treatment.