Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 7 de 7
Filtrar
Mais filtros











Base de dados
Intervalo de ano de publicação
1.
Bull Math Biol ; 86(3): 24, 2024 01 31.
Artigo em Inglês | MEDLINE | ID: mdl-38294587

RESUMO

Phylogenetic trees are a mathematical formalisation of evolutionary histories between organisms, species, genes, cancer cells, etc. For many applications, e.g. when analysing virus transmission trees or cancer evolution, (phylogenetic) time trees are of interest, where branch lengths represent times. Computational methods for reconstructing time trees from (typically molecular) sequence data, for example Bayesian phylogenetic inference using Markov Chain Monte Carlo (MCMC) methods, rely on algorithms that sample the treespace. They employ tree rearrangement operations such as [Formula: see text] (Subtree Prune and Regraft) and [Formula: see text] (Nearest Neighbour Interchange) or, in the case of time tree inference, versions of these that take times of internal nodes into account. While the classic [Formula: see text] tree rearrangement is well-studied, its variants for time trees are less understood, limiting comparative analysis for time tree methods. In this paper we consider a modification of the classical [Formula: see text] rearrangement on the space of ranked phylogenetic trees, which are trees equipped with a ranking of all internal nodes. This modification results in two novel treespaces, which we propose to study. We begin this study by discussing algorithmic properties of these treespaces, focusing on those relating to the complexity of computing distances under the ranked [Formula: see text] operations as well as similarities and differences to known tree rearrangement based treespaces. Surprisingly, we show the counterintuitive result that adding leaves to trees can actually decrease their ranked [Formula: see text] distance, which may have an impact on the results of time tree sampling algorithms given uncertain "rogue taxa".


Assuntos
Conceitos Matemáticos , Modelos Biológicos , Teorema de Bayes , Filogenia , Algoritmos
2.
J Comput Biol ; 30(4): 518-537, 2023 04.
Artigo em Inglês | MEDLINE | ID: mdl-36475926

RESUMO

Phylogenetic methods are emerging as a useful tool to understand cancer evolutionary dynamics, including tumor structure, heterogeneity, and progression. Most currently used approaches utilize either bulk whole genome sequencing or single-cell DNA sequencing and are based on calling copy number alterations and single nucleotide variants (SNVs). Single-cell RNA sequencing (scRNA-seq) is commonly applied to explore differential gene expression of cancer cells throughout tumor progression. The method exacerbates the single-cell sequencing problem of low yield per cell with uneven expression levels. This accounts for low and uneven sequencing coverage and makes SNV detection and phylogenetic analysis challenging. In this article, we demonstrate for the first time that scRNA-seq data contain sufficient evolutionary signal and can also be utilized in phylogenetic analyses. We explore and compare results of such analyses based on both expression levels and SNVs called from scRNA-seq data. Both techniques are shown to be useful for reconstructing phylogenetic relationships between cells, reflecting the clonal composition of a tumor. Both standardized expression values and SNVs appear to be equally capable of reconstructing a similar pattern of phylogenetic relationship. This pattern is stable even when phylogenetic uncertainty is taken in account. Our results open up a new direction of somatic phylogenetics based on scRNA-seq data. Further research is required to refine and improve these approaches to capture the full picture of somatic evolutionary dynamics in cancer.


Assuntos
Neoplasias , Análise da Expressão Gênica de Célula Única , Humanos , Filogenia , Neoplasias/genética , Análise de Célula Única/métodos , Análise de Sequência de RNA/métodos , Perfilação da Expressão Gênica/métodos
3.
PLoS Comput Biol ; 18(12): e1010730, 2022 12.
Artigo em Inglês | MEDLINE | ID: mdl-36580499

RESUMO

Large-scale genotype-phenotype screens provide a wealth of data for identifying molecular alterations associated with a phenotype. Epistatic effects play an important role in such association studies. For example, siRNA perturbation screens can be used to identify combinatorial gene-silencing effects. In bacteria, epistasis has practical consequences in determining antimicrobial resistance as the genetic background of a strain plays an important role in determining resistance. Recently developed tools scale to human exome-wide screens for pairwise interactions, but none to date have included the possibility of three-way interactions. Expanding upon recent state-of-the-art methods, we make a number of improvements to the performance on large-scale data, making consideration of three-way interactions possible. We demonstrate our proposed method, Pint, on both simulated and real data sets, including antibiotic resistance testing and siRNA perturbation screens. Pint outperforms known methods in simulated data, and identifies a number of biologically plausible gene effects in both the antibiotic and siRNA models. For example, we have identified a combination of known tumour suppressor genes that is predicted (using Pint) to cause a significant increase in cell proliferation.


Assuntos
Antibacterianos , Epistasia Genética , Humanos , Fenótipo , Antibacterianos/farmacologia
4.
Mol Biol Evol ; 39(8)2022 08 03.
Artigo em Inglês | MEDLINE | ID: mdl-35733333

RESUMO

Single-cell sequencing provides a new way to explore the evolutionary history of cells. Compared to traditional bulk sequencing, where a population of heterogeneous cells is pooled to form a single observation, single-cell sequencing isolates and amplifies genetic material from individual cells, thereby preserving the information about the origin of the sequences. However, single-cell data are more error-prone than bulk sequencing data due to the limited genomic material available per cell. Here, we present error and mutation models for evolutionary inference of single-cell data within a mature and extensible Bayesian framework, BEAST2. Our framework enables integration with biologically informative models such as relaxed molecular clocks and population dynamic models. Our simulations show that modeling errors increase the accuracy of relative divergence times and substitution parameters. We reconstruct the phylogenetic history of a colorectal cancer patient and a healthy patient from single-cell DNA sequencing data. We find that the estimated times of terminal splitting events are shifted forward in time compared to models which ignore errors. We observed that not accounting for errors can overestimate the phylogenetic diversity in single-cell DNA sequencing data. We estimate that 30-50% of the apparent diversity can be attributed to error. Our work enables a full Bayesian approach capable of accounting for errors in the data within the integrative Bayesian software framework BEAST2.


Assuntos
Neoplasias , Software , Teorema de Bayes , Evolução Molecular , Genômica , Humanos , Modelos Genéticos , Filogenia
5.
Artigo em Inglês | MEDLINE | ID: mdl-35483879

RESUMO

Tuberous sclerosis complex (TSC) is an inheritable disorder characterized by the formation of benign yet disorganized tumors in multiple organ systems. Germline mutations in the TSC1 (hamartin) or more frequently TSC2 (tuberin) genes are causative for TSC. The malignant manifestations of TSC, pulmonary lymphangioleiomyomatosis (LAM) and renal angiomyolipoma (AML), may also occur as independent sporadic perivascular epithelial cell tumor (PEComa) characterized by somatic TSC2 mutations. Thus, discerning TSC from the copresentation of sporadic LAM and sporadic AML may be obscured in TSC patients lacking additional features. In this report, we present a case study on a single patient initially reported to have sporadic LAM and a mucinous duodenal adenocarcinoma deficient in DNA mismatch repair proteins. Moreover, the patient had a history of Wilms' tumor, which was reclassified as AML following the LAM diagnosis. Therefore, we investigated the origins and relatedness of these tumors. Using germline whole-genome sequencing, we identified a premature truncation in one of the patient's TSC2 alleles. Using immunohistochemistry, loss of tuberin expression was observed in AML and LAM tissue. However, no evidence of a somatic loss of heterozygosity or DNA methylation epimutations was observed at the TSC2 locus, suggesting alternate mechanisms may contribute to loss of the tumor suppressor protein. In the mucinous duodenal adenocarcinoma, no causative mutations were found in the DNA mismatch repair genes MLH1, MSH2, MSH6, or PMS2 Rather, clonal deconvolution analyses were used to identify mutations contributing to pathogenesis. This report highlights both the utility of using multiple sequencing techniques and the complexity of interpreting the data in a clinical context.


Assuntos
Adenocarcinoma , Angiomiolipoma , Neoplasias Renais , Leucemia Mieloide Aguda , Esclerose Tuberosa , Angiomiolipoma/genética , Angiomiolipoma/patologia , Feminino , Humanos , Masculino , Esclerose Tuberosa/diagnóstico , Esclerose Tuberosa/genética , Esclerose Tuberosa/metabolismo , Proteína 2 do Complexo Esclerose Tuberosa/genética , Proteínas Supressoras de Tumor/genética
6.
J Math Biol ; 83(5): 60, 2021 11 05.
Artigo em Inglês | MEDLINE | ID: mdl-34739608

RESUMO

In many phylogenetic applications, such as cancer and virus evolution, time trees, evolutionary histories where speciation events are timed, are inferred. Of particular interest are clock-like trees, where all leaves are sampled at the same time and have equal distance to the root. One popular approach to model clock-like trees is coalescent theory, which is used in various tree inference software packages. Methodologically, phylogenetic inference methods require a tree space over which the inference is performed, and the geometry of this space plays an important role in statistical and computational aspects of tree inference algorithms. It has recently been shown that coalescent tree spaces possess a unique geometry, different from that of classical phylogenetic tree spaces. Here we introduce and study a space of discrete coalescent trees. They assume that time is discrete, which is natural in many computational applications. This tree space is a generalisation of the previously studied ranked nearest neighbour interchange space, and is built upon tree-rearrangement operations. We generalise existing results about ranked trees, including an algorithm for computing distances in polynomial time, and in particular provide new results for both the space of discrete coalescent trees and the space of ranked trees. We establish several geometrical properties of these spaces and show how these properties impact various algorithms used in phylogenetic analyses. Our tree space is a discretisation of a previously introduced time tree space, called t-space, and hence our results can be used to approximate solutions to various open problems in t-space.


Assuntos
Algoritmos , Análise por Conglomerados , Filogenia
7.
J Math Biol ; 82(1-2): 8, 2021 01 25.
Artigo em Inglês | MEDLINE | ID: mdl-33492606

RESUMO

Many popular algorithms for searching the space of leaf-labelled (phylogenetic) trees are based on tree rearrangement operations. Under any such operation, the problem is reduced to searching a graph where vertices are trees and (undirected) edges are given by pairs of trees connected by one rearrangement operation (sometimes called a move). Most popular are the classical nearest neighbour interchange, subtree prune and regraft, and tree bisection and reconnection moves. The problem of computing distances, however, is [Formula: see text]-hard in each of these graphs, making tree inference and comparison algorithms challenging to design in practice. Although anked phylogenetic trees are one of the central objects of interest in applications such as cancer research, immunology, and epidemiology, the computational complexity of the shortest path problem for these trees remained unsolved for decades. In this paper, we settle this problem for the ranked nearest neighbour interchange operation by establishing that the complexity depends on the weight difference between the two types of tree rearrangements (rank moves and edge moves), and varies from quadratic, which is the lowest possible complexity for this problem, to [Formula: see text]-hard, which is the highest. In particular, our result provides the first example of a phylogenetic tree rearrangement operation for which shortest paths, and hence the distance, can be computed efficiently. Specifically, our algorithm scales to trees with tens of thousands of leaves (and likely hundreds of thousands if implemented efficiently).


Assuntos
Algoritmos , Modelos Genéticos , Análise por Conglomerados , Biologia Computacional , Filogenia
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA