Pesquisa | BVS Educação Profissional em Saúde

Reconstructing tumor clonal lineage trees incorporating single-nucleotide variants, copy number alterations and structural variations.

Fu, Xuecong; Lei, Haoyun; Tao, Yifeng; Schwartz, Russell.

Bioinformatics ; 38(Suppl 1): i125-i133, 2022 06 24.

Artigo em Inglês | MEDLINE | ID: mdl-35758777

RESUMO

MOTIVATION: Cancer develops through a process of clonal evolution in which an initially healthy cell gives rise to progeny gradually differentiating through the accumulation of genetic and epigenetic mutations. These mutations can take various forms, including single-nucleotide variants (SNVs), copy number alterations (CNAs) or structural variations (SVs), with each variant type providing complementary insights into tumor evolution as well as offering distinct challenges to phylogenetic inference. RESULTS: In this work, we develop a tumor phylogeny method, TUSV-ext, which incorporates SNVs, CNAs and SVs into a single inference framework. We demonstrate on simulated data that the method produces accurate tree inferences in the presence of all three variant types. We further demonstrate the method through application to real prostate tumor data, showing how our approach to coordinated phylogeny inference and clonal construction with all three variant types can reveal a more complicated clonal structure than is suggested by prior work, consistent with extensive polyclonal seeding or migration. AVAILABILITY AND IMPLEMENTATION: https://github.com/CMUSchwartzLab/TUSV-ext. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Assuntos

Variações do Número de Cópias de DNA , Neoplasias , Algoritmos , Evolução Clonal , Humanos , Neoplasias/genética , Nucleotídeos , Filogenia , Software

Improving and evaluating deep learning models of cellular organization.

Sun, Huangqingbo; Fu, Xuecong; Abraham, Serena; Jin, Shen; Murphy, Robert F.

Bioinformatics ; 38(23): 5299-5306, 2022 11 30.

Artigo em Inglês | MEDLINE | ID: mdl-36264139

RESUMO

MOTIVATION: Cells contain dozens of major organelles and thousands of other structures, many of which vary extensively in their number, size, shape and spatial distribution. This complexity and variation dramatically complicates the use of both traditional and deep learning methods to build accurate models of cell organization. Most cellular organelles are distinct objects with defined boundaries that do not overlap, while the pixel resolution of most imaging methods is n sufficient to resolve these boundaries. Thus while cell organization is conceptually object-based, most current methods are pixel-based. Using extensive image collections in which particular organelles were fluorescently labeled, deep learning methods can be used to build conditional autoencoder models for particular organelles. A major advance occurred with the use of a U-net approach to make multiple models all conditional upon a common reference, unlabeled image, allowing the relationships between different organelles to be at least partially inferred. RESULTS: We have developed improved Generative Adversarial Networks-based approaches for learning these models and have also developed novel criteria for evaluating how well synthetic cell images reflect the properties of real images. The first set of criteria measure how well models preserve the expected property that organelles do not overlap. We also developed a modified loss function that allows retraining of the models to minimize that overlap. The second set of criteria uses object-based modeling to compare object shape and spatial distribution between synthetic and real images. Our work provides the first demonstration that, at least for some organelles, deep learning models can capture object-level properties of cell images. AVAILABILITY AND IMPLEMENTATION: http://murphylab.cbd.cmu.edu/Software/2022_insilico. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Assuntos

Aprendizado Profundo , Organelas , Processamento de Imagem Assistida por Computador/métodos

Semi-deconvolution of bulk and single-cell RNA-seq data with application to metastatic progression in breast cancer.

Lei, Haoyun; Guo, Xiaoyan A; Tao, Yifeng; Ding, Kai; Fu, Xuecong; Oesterreich, Steffi; Lee, Adrian V; Schwartz, Russell.

Bioinformatics ; 38(Suppl 1): i386-i394, 2022 06 24.

Artigo em Inglês | MEDLINE | ID: mdl-35758822

RESUMO

MOTIVATION: Identifying cell types and their abundances and how these evolve during tumor progression is critical to understanding the mechanisms of metastasis and identifying predictors of metastatic potential that can guide the development of new diagnostics or therapeutics. Single-cell RNA sequencing (scRNA-seq) has been especially promising in resolving heterogeneity of expression programs at the single-cell level, but is not always feasible, e.g. for large cohort studies or longitudinal analysis of archived samples. In such cases, clonal subpopulations may still be inferred via genomic deconvolution, but deconvolution methods have limited ability to resolve fine clonal structure and may require reference cell type profiles that are missing or imprecise. Prior methods can eliminate the need for reference profiles but show unstable performance when few bulk samples are available. RESULTS: In this work, we develop a new method using reference scRNA-seq to interpret sample collections for which only bulk RNA-seq is available for some samples, e.g. clonally resolving archived primary tissues using scRNA-seq from metastases. By integrating such information in a Quadratic Programming framework, our method can recover more accurate cell types and corresponding cell type abundances in bulk samples. Application to a breast tumor bone metastases dataset confirms the power of scRNA-seq data to improve cell type inference and quantification in same-patient bulk samples. AVAILABILITY AND IMPLEMENTATION: Source code is available on Github at https://github.com/CMUSchwartzLab/RADs.

Assuntos

Neoplasias da Mama , Análise de Célula Única , Neoplasias da Mama/genética , Feminino , Perfilação da Expressão Gênica/métodos , Humanos , RNA-Seq , Análise de Sequência de RNA/métodos , Análise de Célula Única/métodos

Tumor heterogeneity assessed by sequencing and fluorescence in situ hybridization (FISH) data.

Lei, Haoyun; Gertz, E Michael; Schäffer, Alejandro A; Fu, Xuecong; Tao, Yifeng; Heselmeyer-Haddad, Kerstin; Torres, Irianna; Li, Guibo; Xu, Liqin; Hou, Yong; Wu, Kui; Shi, Xulian; Dean, Michael; Ried, Thomas; Schwartz, Russell.

Bioinformatics ; 37(24): 4704-4711, 2021 12 11.

Artigo em Inglês | MEDLINE | ID: mdl-34289030

RESUMO

MOTIVATION: Computational reconstruction of clonal evolution in cancers has become a crucial tool for understanding how tumors initiate and progress and how this process varies across patients. The field still struggles, however, with special challenges of applying phylogenetic methods to cancers, such as the prevalence and importance of copy number alteration (CNA) and structural variation events in tumor evolution, which are difficult to profile accurately by prevailing sequencing methods in such a way that subsequent reconstruction by phylogenetic inference algorithms is accurate. RESULTS: In this work, we develop computational methods to combine sequencing with multiplex interphase fluorescence in situ hybridization to exploit the complementary advantages of each technology in inferring accurate models of clonal CNA evolution accounting for both focal changes and aneuploidy at whole-genome scales. By integrating such information in an integer linear programming framework, we demonstrate on simulated data that incorporation of FISH data substantially improves accurate inference of focal CNA and ploidy changes in clonal evolution from deconvolving bulk sequence data. Analysis of real glioblastoma data for which FISH, bulk sequence and single cell sequence are all available confirms the power of FISH to enhance accurate reconstruction of clonal copy number evolution in conjunction with bulk and optionally single-cell sequence data. AVAILABILITY AND IMPLEMENTATION: Source code is available on Github at https://github.com/CMUSchwartzLab/FISH_deconvolution. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Assuntos

Neoplasias , Software , Humanos , Hibridização in Situ Fluorescente , Filogenia , Algoritmos , Neoplasias/patologia

Robust and accurate deconvolution of tumor populations uncovers evolutionary mechanisms of breast cancer metastasis.

Tao, Yifeng; Lei, Haoyun; Fu, Xuecong; Lee, Adrian V; Ma, Jian; Schwartz, Russell.

Bioinformatics ; 36(Suppl_1): i407-i416, 2020 07 01.

Artigo em Inglês | MEDLINE | ID: mdl-32657393

RESUMO

MOTIVATION: Cancer develops and progresses through a clonal evolutionary process. Understanding progression to metastasis is of particular clinical importance, but is not easily analyzed by recent methods because it generally requires studying samples gathered years apart, for which modern single-cell sequencing is rarely an option. Revealing the clonal evolution mechanisms in the metastatic transition thus still depends on unmixing tumor subpopulations from bulk genomic data. METHODS: We develop a novel toolkit called robust and accurate deconvolution (RAD) to deconvolve biologically meaningful tumor populations from multiple transcriptomic samples spanning the two progression states. RAD uses gene module compression to mitigate considerable noise in RNA, and a hybrid optimizer to achieve a robust and accurate solution. Finally, we apply a phylogenetic algorithm to infer how associated cell populations adapt across the metastatic transition via changes in expression programs and cell-type composition. RESULTS: We validated the superior robustness and accuracy of RAD over alternative algorithms on a real dataset, and validated the effectiveness of gene module compression on both simulated and real bulk RNA data. We further applied the methods to a breast cancer metastasis dataset, and discovered common early events that promote tumor progression and migration to different metastatic sites, such as dysregulation of ECM-receptor, focal adhesion and PI3k-Akt pathways. AVAILABILITY AND IMPLEMENTATION: The source code of the RAD package, models, experiments and technical details such as parameters, is available at https://github.com/CMUSchwartzLab/RAD. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Assuntos

Neoplasias da Mama , Algoritmos , Neoplasias da Mama/genética , Humanos , Fosfatidilinositol 3-Quinases , Filogenia , Software

Marker selection strategies for circulating tumor DNA guided by phylogenetic inference.

Fu, Xuecong; Luo, Zhicheng; Deng, Yueqian; LaFramboise, William; Bartlett, David; Schwartz, Russell.

bioRxiv ; 2024 Mar 27.

Artigo em Inglês | MEDLINE | ID: mdl-38586041

RESUMO

Motivation: Blood-based profiling of tumor DNA ("liquid biopsy") has offered great prospects for non-invasive early cancer diagnosis, treatment monitoring, and clinical guidance, but require further advances in computational methods to become a robust quantitative assay of tumor clonal evolution. We propose new methods to better characterize tumor clonal dynamics from circulating tumor DNA (ctDNA), through application to two specific questions: 1) How to apply longitudinal ctDNA data to refine phylogeny models of clonal evolution, and 2) how to quantify changes in clonal frequencies that may be indicative of treatment response or tumor progression. We pose these questions through a probabilistic framework for optimally identifying maximum likelihood markers and applying them to characterizing clonal evolution. Results: We first estimate a distribution over plausible clonal lineage models, using bootstrap samples over pre-treatment tissue-based sequence data. We then refine these lineage models and the clonal frequencies they imply over successive longitudinal samples. We use the resulting framework for modeling and refining tree distributions to pose a set of optimization problems to select ctDNA markers to maximize measures of utility capturing ability to solve the two questions of reducing uncertain in phylogeny models or quantifying clonal frequencies given the models. We tested our methods on synthetic data and showed them to be effective at refining distributions of tree models and clonal frequencies so as to minimize measures of tree distance relative to the ground truth. Application of the tree refinement methods to real tumor data further demonstrated their effectiveness in refining a clonal lineage model and assessing its clonal frequencies. The work shows the power of computational methods to improve marker selection, clonal lineage reconstruction, and clonal dynamics profiling for more precise and quantitative assays of tumor progression. Availability: https://github.com/CMUSchwartzLab/Mase-phi.git. Contact: russells@andrew.cmu.edu.

Sc-TUSV-ext: Single-cell clonal lineage inference from single nucleotide variants (SNV), copy number alterations (CNA) and structural variants (SV).

Bristy, Nishat Anjum; Fu, Xuecong; Schwartz, Russell.

bioRxiv ; 2023 Dec 08.

Artigo em Inglês | MEDLINE | ID: mdl-38106049

RESUMO

Clonal lineage inference ("tumor phylogenetics") has become a crucial tool for making sense of somatic evolution processes that underlie cancer development and are increasingly recognized as part of normal tissue growth and aging. The inference of clonal lineage trees from single cell sequence data offers particular promise for revealing processes of somatic evolution in unprecedented detail. However, most such tools are based on fairly restrictive models of the types of mutation events observed in somatic evolution and of the processes by which they develop. The present work seeks to enhance the power and versatility of tools for single-cell lineage reconstruction by making more comprehensive use of the range of molecular variant types by which tumors evolve. We introduce Sc-TUSV-ext, an integer linear programming (ILP) based tumor phylogeny reconstruction method that, for the first time, integrates single nucleotide variants (SNV), copy number alterations (CNA) and structural variations (SV) into clonal lineage reconstruction from single-cell DNA sequencing data. We show on synthetic data that accounting for these variant types collectively leads to improved accuracy in clonal lineage reconstruction relative to prior methods that consider only subsets of the variant types. We further demonstrate the effectiveness on real data in resolving clonal evolution in the presence of multiple variant types, providing a path towards more comprehensive insight into how various forms of somatic mutability collectively shape tissue development.

Joint Clustering of Single-Cell Sequencing and Fluorescence In Situ Hybridization Data for Reconstructing Clonal Heterogeneity in Cancers.

Fu, Xuecong; Lei, Haoyun; Tao, Yifeng; Heselmeyer-Haddad, Kerstin; Torres, Irianna; Dean, Michael; Ried, Thomas; Schwartz, Russell.

J Comput Biol ; 28(11): 1035-1051, 2021 11.

Artigo em Inglês | MEDLINE | ID: mdl-34612714

RESUMO

Aneuploidy and whole genome duplication (WGD) events are common features of cancers associated with poor outcomes, but the ways they influence trajectories of clonal evolution are poorly understood. Phylogenetic methods for reconstructing clonal evolution from genomic data have proven a powerful tool for understanding how clonal evolution occurs in the process of cancer progression, but extant methods so far have limited the ability to resolve tumor evolution via ploidy changes. This limitation exists in part because single-cell DNA-sequencing (scSeq), which has been crucial to developing detailed profiles of clonal evolution, has difficulty in resolving ploidy changes and WGD. Multiplex interphase fluorescence in situ hybridization (miFISH) provides a more unambiguous signal of single-cell ploidy changes but it is limited to profiling small numbers of single markers. Here, we develop a joint clustering method to combine these two data sources with the goal of better resolving ploidy changes in tumor evolution. We develop a probabilistic framework to maximize the probability of latent variables given the pre-clustered datasets, which we optimize via Markov chain Monte Carlo sampling combined with linear regression. We validate the method by using simulated data derived from a glioblastoma (GBM) case profiled by both scSeq and miFISH. We further apply the method to two GBM cases with scSeq and miFISH data by reconstructing a phylogenetic tree from the joint clustering results, demonstrating their synergistic value in understanding how focal copy number changes and WGD events can collectively contribute to tumor progression.

Assuntos

Neoplasias Encefálicas/genética , Biologia Computacional/métodos , Glioblastoma/genética , Hibridização in Situ Fluorescente/métodos , Análise de Célula Única/métodos , Anáfase , Aneuploidia , Evolução Clonal , Análise por Conglomerados , Evolução Molecular , Humanos , Cadeias de Markov , Método de Monte Carlo , Filogenia , Análise de Sequência de RNA

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA