Your browser doesn't support javascript.
loading
Combinatorial and Machine Learning Approaches for Improved Somatic Variant Calling From Formalin-Fixed Paraffin-Embedded Genome Sequence Data.
Dodani, Dollina D; Nguyen, Matthew H; Morin, Ryan D; Marra, Marco A; Corbett, Richard D.
Afiliação
  • Dodani DD; The Bioinformatics Graduate Program, University of British Columbia, Vancouver, BC, Canada.
  • Nguyen MH; The Bioinformatics Graduate Program, University of British Columbia, Vancouver, BC, Canada.
  • Morin RD; Canada's Michael Smith Genome Sciences Centre, BC Cancer Research Institute, Provincial Health Services Authority, Vancouver, BC, Canada.
  • Marra MA; Department of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, BC, Canada.
  • Corbett RD; Canada's Michael Smith Genome Sciences Centre, BC Cancer Research Institute, Provincial Health Services Authority, Vancouver, BC, Canada.
Front Genet ; 13: 834764, 2022.
Article em En | MEDLINE | ID: mdl-35571031
Formalin fixation of paraffin-embedded tissue samples is a well-established method for preserving tissue and is routinely used in clinical settings. Although formalin-fixed, paraffin-embedded (FFPE) tissues are deemed crucial for research and clinical applications, the fixation process results in molecular damage to nucleic acids, thus confounding their use in genome sequence analysis. Methods to improve genomic data quality from FFPE tissues have emerged, but there remains significant room for improvement. Here, we use whole-genome sequencing (WGS) data from matched Fresh Frozen (FF) and FFPE tissue samples to optimize a sensitive and precise FFPE single nucleotide variant (SNV) calling approach. We present methods to reduce the prevalence of false-positive SNVs by applying combinatorial techniques to five publicly available variant callers. We also introduce FFPolish, a novel variant classification method that efficiently classifies FFPE-specific false-positive variants. Our combinatorial and statistical techniques improve precision and F1 scores compared to the results of publicly available tools when tested individually.
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Tipo de estudo: Risk_factors_studies Idioma: En Revista: Front Genet Ano de publicação: 2022 Tipo de documento: Article País de afiliação: Canadá

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Tipo de estudo: Risk_factors_studies Idioma: En Revista: Front Genet Ano de publicação: 2022 Tipo de documento: Article País de afiliação: Canadá