Your browser doesn't support javascript.
loading
Inference of chromosome selection parameters and missegregation rate in cancer from DNA-sequencing data.
Xiang, Zijin; Liu, Zhihan; Dinh, Khanh N.
Affiliation
  • Xiang Z; Irving Institute for Cancer Dynamics and Department of Statistics, Columbia University, New York, NY, USA.
  • Liu Z; Irving Institute for Cancer Dynamics and Department of Statistics, Columbia University, New York, NY, USA.
  • Dinh KN; Irving Institute for Cancer Dynamics and Department of Statistics, Columbia University, New York, NY, USA. knd2127@columbia.edu.
Sci Rep ; 14(1): 17699, 2024 07 31.
Article in En | MEDLINE | ID: mdl-39085295
ABSTRACT
Aneuploidy is frequently observed in cancers and has been linked to poor patient outcome. Analysis of aneuploidy in DNA-sequencing (DNA-seq) data necessitates untangling the effects of the Copy Number Aberration (CNA) occurrence rates and the selection coefficients that act upon the resulting karyotypes. We introduce a parameter inference algorithm that takes advantage of both bulk and single-cell DNA-seq cohorts. The method is based on Approximate Bayesian Computation (ABC) and utilizes CINner, our recently introduced simulation algorithm of chromosomal instability in cancer. We examine three groups of statistics to summarize the data in the ABC routine (A) Copy Number-based measures, (B) phylogeny tip statistics, and (C) phylogeny balance indices. Using these statistics, our method can recover both the CNA probabilities and selection parameters from ground truth data, and performs well even for data cohorts of relatively small sizes. We find that only statistics in groups A and C are well-suited for identifying CNA probabilities, and only group A carries the signals for estimating selection parameters. Moreover, the low number of CNA events at large scale compared to cell counts in single-cell samples means that statistics in group B cannot be estimated accurately using phylogeny reconstruction algorithms at the chromosome level. As data from both bulk and single-cell DNA-sequencing techniques becomes increasingly available, our inference framework promises to facilitate the analysis of distinct cancer types, differentiation between selection and neutral drift, and prediction of cancer clonal dynamics.
Subject(s)

Full text: 1 Database: MEDLINE Main subject: Algorithms / Sequence Analysis, DNA / Neoplasms Limits: Humans Language: En Journal: Sci Rep Year: 2024 Type: Article Affiliation country: United States

Full text: 1 Database: MEDLINE Main subject: Algorithms / Sequence Analysis, DNA / Neoplasms Limits: Humans Language: En Journal: Sci Rep Year: 2024 Type: Article Affiliation country: United States