Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 5 de 5
Filtrar
Mais filtros








Base de dados
Intervalo de ano de publicação
1.
Bioinformatics ; 2024 Aug 20.
Artigo em Inglês | MEDLINE | ID: mdl-39163479

RESUMO

MOTIVATION: Single cell RNA sequencing (scRNA-seq) data is widely used to study cancer cell states and their heterogeneity. However, the tumour microenvironment is usually a mixture of healthy and cancerous cells and it can be difficult to fully separate these two populations based on transcriptomics alone. If available, somatic single nucleotide variants (SNVs) observed in the scRNA-seq data could be used to identify the cancer population and match that information with the single cells' expression profile. However, calling somatic SNVs in scRNA-seq data is a challeng-ing task, as most variants seen in the short read data are not somatic, but can instead be germline variants, RNA edits or transcription, sequencing or processing errors. Additionally, only variants present in actively transcribed regions for each individual cell will be seen in the data. RESULTS: To address these challenges, we develop CCLONE (Cancer Cell Labelling On Noisy Expression), an interpretable tool adapted to handle the uncertainty and sparsity of SNVs called from scRNA-seq data. CCLONE jointly identifies cancer clonal populations, and their associated variants. We apply CCLONE on two acute myeloid leukaemia datasets and one lung adenocarcinoma dataset and show that CCLONE captures both genetic clones and somatic events for multiple patients. These results show how CCLONE can be used to gather insight into the course of the disease and the origin of cancer cells in scRNA-seq data. AVAILABILITY AND IMPLEMENTATION: Source code is available at github.com/HaghverdiLab/CCLONE. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

2.
Life Sci Alliance ; 7(3)2024 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-38110222

RESUMO

Hematopoietic stem and progenitor cells (HSPCs) are known to respond to acute inflammation; however, little is understood about the dynamics and heterogeneity of these stress responses in HSPCs. Here, we performed single-cell sequencing during the sensing, response, and recovery phases of the inflammatory response of HSPCs to treatment (a total of 10,046 cells from four time points spanning the first 72 h of response) with the pro-inflammatory cytokine IFNα to investigate the HSPCs' dynamic changes during acute inflammation. We developed the essential novel computational approaches to process and analyze the resulting single-cell time series dataset. This includes an unbiased cell type annotation and abundance analysis post inflammation, tools for identification of global and cell type-specific responding genes, and a semi-supervised linear regression approach for response pseudotime reconstruction. We discovered a variety of different gene responses of the HSPCs to the treatment. Interestingly, we were able to associate a global reduced myeloid differentiation program and a locally enhanced pyroptosis activity with reduced myeloid progenitor and differentiated cells after IFNα treatment. Altogether, the single-cell time series analyses have allowed us to unbiasedly study the heterogeneous and dynamic impact of IFNα on the HSPCs.


Assuntos
Hematopoese , Células-Tronco Hematopoéticas , Humanos , Fatores de Tempo , Diferenciação Celular/genética , Hematopoese/genética , Inflamação/metabolismo
3.
PLoS Comput Biol ; 18(9): e1010031, 2022 09.
Artigo em Inglês | MEDLINE | ID: mdl-36170235

RESUMO

A few years ago, it was proposed to use the simultaneous quantification of unspliced and spliced messenger RNA (mRNA) to add a temporal dimension to high-throughput snapshots of single cell RNA sequencing data. This concept can yield additional insight into the transcriptional dynamics of the biological systems under study. However, current methods for inferring cell state velocities from such data (known as RNA velocities) are afflicted by several theoretical and computational problems, hindering realistic and reliable velocity estimation. We discuss these issues and propose new solutions for addressing some of the current challenges in consistency of data processing, velocity inference and visualisation. We translate our computational conclusion in two velocity analysis tools: one detailed method κ-velo and one heuristic method eco-velo, each of which uses a different set of assumptions about the data.


Assuntos
RNA Mensageiro , RNA Mensageiro/genética , Análise de Sequência de RNA/métodos
4.
J Mol Evol ; 89(8): 544-553, 2021 10.
Artigo em Inglês | MEDLINE | ID: mdl-34328525

RESUMO

The native subcellular location (also referred to as localization or cellular compartment) of a protein is the one in which it acts most frequently; it is one aspect of protein function. Do ten eukaryotic model organisms differ in their location spectrum, i.e., the fraction of its proteome in each of seven major cellular compartments? As experimental annotations of locations remain biased and incomplete, we need prediction methods to answer this question. After systematic bias corrections, the complete but faulty prediction methods appeared to be more appropriate to compare location spectra between species than the incomplete more accurate experimental data. This work compared the location spectra for ten eukaryotes: Homo sapiens (human), Gorilla gorilla (gorilla), Pan troglodytes (chimpanzee), Mus musculus (mouse), Rattus norvegicus (rat), Drosophila melanogaster (fruit/vinegar fly), Anopheles gambiae (African malaria mosquito), Caenorhabitis elegans (nematode), Saccharomyces cerevisiae (baker's yeast), and Schizosaccharomyces pombe (fission yeast). The two largest classes were predicted to be the nucleus and the cytoplasm together accounting for 47-62% of all proteins, while 7-21% of the proteins were predicted in the plasma membrane and 4-15% to be secreted. Overall, the predicted location spectra were largely similar. However, in detail, the differences sufficed to plot trees (UPGMA) and 2D (PCA) maps relating the ten organisms using a simple Euclidean distance in seven states (location classes). The relations based on the simple predicted location spectra captured aspects of cross-species comparisons usually revealed only by much more detailed evolutionary comparisons. Most interestingly, known phylogenetic relations were reproduced better by paralog-only than by ortholog-only trees.


Assuntos
Drosophila melanogaster , Proteoma , Animais , Drosophila , Drosophila melanogaster/genética , Camundongos , Filogenia , Proteoma/genética , Ratos , Saccharomyces cerevisiae/genética
5.
Bioinformatics ; 34(19): 3385-3386, 2018 10 01.
Artigo em Inglês | MEDLINE | ID: mdl-29762646

RESUMO

Motivation: Many applications monitor predictions of a whole range of features for biological datasets, e.g. the fraction of secreted human proteins in the human proteome. Results and error estimates are typically derived from publications. Results: Here, we present a simple, alternative approximation that uses performance estimates of methods to error-correct the predicted distributions. This approximation uses the confusion matrix (TP true positives, TN true negatives, FP false positives and FN false negatives) describing the performance of the prediction tool for correction. As proof-of-principle, the correction was applied to a two-class (membrane/not) and to a seven-class (localization) prediction. Availability and implementation: Datasets and a simple JavaScript tool available freely for all users at http://www.rostlab.org/services/distributions. Supplementary information: Supplementary data are available at Bioinformatics online.


Assuntos
Software , Humanos , Proteoma/análise
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA