Your browser doesn't support javascript.
loading
Combinatorial and Machine Learning Approaches for Improved Somatic Variant Calling From Formalin-Fixed Paraffin-Embedded Genome Sequence Data.
Dodani, Dollina D; Nguyen, Matthew H; Morin, Ryan D; Marra, Marco A; Corbett, Richard D.
Afiliación
  • Dodani DD; The Bioinformatics Graduate Program, University of British Columbia, Vancouver, BC, Canada.
  • Nguyen MH; The Bioinformatics Graduate Program, University of British Columbia, Vancouver, BC, Canada.
  • Morin RD; Canada's Michael Smith Genome Sciences Centre, BC Cancer Research Institute, Provincial Health Services Authority, Vancouver, BC, Canada.
  • Marra MA; Department of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, BC, Canada.
  • Corbett RD; Canada's Michael Smith Genome Sciences Centre, BC Cancer Research Institute, Provincial Health Services Authority, Vancouver, BC, Canada.
Front Genet ; 13: 834764, 2022.
Article en En | MEDLINE | ID: mdl-35571031
ABSTRACT
Formalin fixation of paraffin-embedded tissue samples is a well-established method for preserving tissue and is routinely used in clinical settings. Although formalin-fixed, paraffin-embedded (FFPE) tissues are deemed crucial for research and clinical applications, the fixation process results in molecular damage to nucleic acids, thus confounding their use in genome sequence analysis. Methods to improve genomic data quality from FFPE tissues have emerged, but there remains significant room for improvement. Here, we use whole-genome sequencing (WGS) data from matched Fresh Frozen (FF) and FFPE tissue samples to optimize a sensitive and precise FFPE single nucleotide variant (SNV) calling approach. We present methods to reduce the prevalence of false-positive SNVs by applying combinatorial techniques to five publicly available variant callers. We also introduce FFPolish, a novel variant classification method that efficiently classifies FFPE-specific false-positive variants. Our combinatorial and statistical techniques improve precision and F1 scores compared to the results of publicly available tools when tested individually.
Palabras clave

Texto completo: 1 Bases de datos: MEDLINE Tipo de estudio: Risk_factors_studies Idioma: En Revista: Front Genet Año: 2022 Tipo del documento: Article País de afiliación: Canadá

Texto completo: 1 Bases de datos: MEDLINE Tipo de estudio: Risk_factors_studies Idioma: En Revista: Front Genet Año: 2022 Tipo del documento: Article País de afiliación: Canadá