Pesquisa | BVS Violência e Saúde

BnpC: Bayesian non-parametric clustering of single-cell mutation profiles.

Borgsmüller, Nico; Bonet, Jose; Marass, Francesco; Gonzalez-Perez, Abel; Lopez-Bigas, Nuria; Beerenwinkel, Niko.

Bioinformatics ; 36(19): 4854-4859, 2020 12 08.

Artigo em Inglês | MEDLINE | ID: mdl-32592465

RESUMO

MOTIVATION: The high resolution of single-cell DNA sequencing (scDNA-seq) offers great potential to resolve intratumor heterogeneity (ITH) by distinguishing clonal populations based on their mutation profiles. However, the increasing size of scDNA-seq datasets and technical limitations, such as high error rates and a large proportion of missing values, complicate this task and limit the applicability of existing methods. RESULTS: Here, we introduce BnpC, a novel non-parametric method to cluster individual cells into clones and infer their genotypes based on their noisy mutation profiles. We benchmarked our method comprehensively against state-of-the-art methods on simulated data using various data sizes, and applied it to three cancer scDNA-seq datasets. On simulated data, BnpC compared favorably against current methods in terms of accuracy, runtime and scalability. Its inferred genotypes were the most accurate, especially on highly heterogeneous data, and it was the only method able to run and produce results on datasets with 5000 cells. On tumor scDNA-seq data, BnpC was able to identify clonal populations missed by the original cluster analysis but supported by Supplementary Experimental Data. With ever growing scDNA-seq datasets, scalable and accurate methods such as BnpC will become increasingly relevant, not only to resolve ITH but also as a preprocessing step to reduce data size. AVAILABILITY AND IMPLEMENTATION: BnpC is freely available under MIT license at https://github.com/cbg-ethz/BnpC. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Assuntos

Algoritmos , Análise de Célula Única , Teorema de Bayes , Análise por Conglomerados , Mutação , Análise de Sequência de RNA , Software

De novo detection of somatic variants in long-read single-cell RNA sequencing data.

Dondi, Arthur; Borgsmüller, Nico; Ferreira, Pedro F; Haas, Brian J; Jacob, Francis; Heinzelmann-Schwarz, Viola; Beerenwinkel, Niko.

bioRxiv ; 2024 Mar 08.

Artigo em Inglês | MEDLINE | ID: mdl-38496441

RESUMO

In cancer, genetic and transcriptomic variations generate clonal heterogeneity, possibly leading to treatment resistance. Long-read single-cell RNA sequencing (LR scRNA-seq) has the potential to detect genetic and transcriptomic variations simultaneously. Here, we present LongSom, a computational workflow leveraging LR scRNA-seq data to call de novo somatic single-nucleotide variants (SNVs), copy-number alterations (CNAs), and gene fusions to reconstruct the tumor clonal heterogeneity. For SNV calling, LongSom distinguishes somatic SNVs from germline polymorphisms by reannotating marker gene expression-based cell types using called variants and applying strict filters. Applying LongSom to ovarian cancer samples, we detected clinically relevant somatic SNVs that were validated against single-cell and bulk panel DNA-seq data and could not be detected with short-read (SR) scRNA-seq. Leveraging somatic SNVs and fusions, LongSom found subclones with different predicted treatment outcomes. In summary, LongSom enables de novo SNVs, CNAs, and fusions detection, thus enabling the study of cancer evolution, clonal heterogeneity, and treatment resistance.

Single-cell phylogenies reveal changes in the evolutionary rate within cancer and healthy tissues.

Borgsmüller, Nico; Valecha, Monica; Kuipers, Jack; Beerenwinkel, Niko; Posada, David.

Cell Genom ; 3(9): 100380, 2023 Sep 13.

Artigo em Inglês | MEDLINE | ID: mdl-37719146

RESUMO

Cell lineages accumulate somatic mutations during organismal development, potentially leading to pathological states. The rate of somatic evolution within a cell population can vary due to multiple factors, including selection, a change in the mutation rate, or differences in the microenvironment. Here, we developed a statistical test called the Poisson Tree (PT) test to detect varying evolutionary rates among cell lineages, leveraging the phylogenetic signal of single-cell DNA sequencing (scDNA-seq) data. We applied the PT test to 24 healthy and cancer samples, rejecting a constant evolutionary rate in 11 out of 15 cancer and five out of nine healthy scDNA-seq datasets. In six cancer datasets, we identified subclonal mutations in known driver genes that could explain the rate accelerations of particular cancer lineages. Our findings demonstrate the efficacy of scDNA-seq for studying somatic evolution and suggest that cell lineages often evolve at different rates within cancer and healthy tissues.

Detection of isoforms and genomic alterations by high-throughput full-length single-cell RNA sequencing in ovarian cancer.

Dondi, Arthur; Lischetti, Ulrike; Jacob, Francis; Singer, Franziska; Borgsmüller, Nico; Coelho, Ricardo; Heinzelmann-Schwarz, Viola; Beisel, Christian; Beerenwinkel, Niko.

Nat Commun ; 14(1): 7780, 2023 Nov 27.

Artigo em Inglês | MEDLINE | ID: mdl-38012143

RESUMO

Understanding the complex background of cancer requires genotype-phenotype information in single-cell resolution. Here, we perform long-read single-cell RNA sequencing (scRNA-seq) on clinical samples from three ovarian cancer patients presenting with omental metastasis and increase the PacBio sequencing depth to 12,000 reads per cell. Our approach captures 152,000 isoforms, of which over 52,000 were not previously reported. Isoform-level analysis accounting for non-coding isoforms reveals 20% overestimation of protein-coding gene expression on average. We also detect cell type-specific isoform and poly-adenylation site usage in tumor and mesothelial cells, and find that mesothelial cells transition into cancer-associated fibroblasts in the metastasis, partly through the TGF-ß/miR-29/Collagen axis. Furthermore, we identify gene fusions, including an experimentally validated IGF2BP2::TESPA1 fusion, which is misclassified as high TESPA1 expression in matched short-read data, and call mutations confirmed by targeted NGS cancer gene panel results. With these findings, we envision long-read scRNA-seq to become increasingly relevant in oncology and personalized medicine.

Assuntos

Genômica , Neoplasias Ovarianas , Humanos , Feminino , Análise de Sequência de RNA/métodos , Genômica/métodos , Isoformas de Proteínas/genética , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Neoplasias Ovarianas/genética , Transcriptoma/genética , Proteínas de Ligação a RNA

SIEVE: joint inference of single-nucleotide variants and cell phylogeny from single-cell DNA sequencing data.

Kang, Senbai; Borgsmüller, Nico; Valecha, Monica; Kuipers, Jack; Alves, Joao M; Prado-López, Sonia; Chantada, Débora; Beerenwinkel, Niko; Posada, David; Szczurek, Ewa.

Genome Biol ; 23(1): 248, 2022 11 30.

Artigo em Inglês | MEDLINE | ID: mdl-36451239

RESUMO

We present SIEVE, a statistical method for the joint inference of somatic variants and cell phylogeny under the finite-sites assumption from single-cell DNA sequencing. SIEVE leverages raw read counts for all nucleotides and corrects the acquisition bias of branch lengths. In our simulations, SIEVE outperforms other methods in phylogenetic reconstruction and variant calling accuracy, especially in the inference of homozygous variants. Applying SIEVE to three datasets, one for triple-negative breast (TNBC), and two for colorectal cancer (CRC), we find that double mutant genotypes are rare in CRC but unexpectedly frequent in the TNBC samples.

Assuntos

Neoplasias de Mama Triplo Negativas , Humanos , Filogenia , Sequência de Bases , Análise de Sequência de DNA , DNA , Nucleotídeos

WiPP: Workflow for Improved Peak Picking for Gas Chromatography-Mass Spectrometry (GC-MS) Data.

Borgsmüller, Nico; Gloaguen, Yoann; Opialla, Tobias; Blanc, Eric; Sicard, Emilie; Royer, Anne-Lise; Le Bizec, Bruno; Durand, Stéphanie; Migné, Carole; Pétéra, Mélanie; Pujos-Guillot, Estelle; Giacomoni, Franck; Guitton, Yann; Beule, Dieter; Kirwan, Jennifer.

Metabolites ; 9(9)2019 Aug 21.

Artigo em Inglês | MEDLINE | ID: mdl-31438611

RESUMO

Lack of reliable peak detection impedes automated analysis of large-scale gas chromatography-mass spectrometry (GC-MS) metabolomics datasets. Performance and outcome of individual peak-picking algorithms can differ widely depending on both algorithmic approach and parameters, as well as data acquisition method. Therefore, comparing and contrasting between algorithms is difficult. Here we present a workflow for improved peak picking (WiPP), a parameter optimising, multi-algorithm peak detection for GC-MS metabolomics. WiPP evaluates the quality of detected peaks using a machine learning-based classification scheme based on seven peak classes. The quality information returned by the classifier for each individual peak is merged with results from different peak detection algorithms to create one final high-quality peak set for immediate down-stream analysis. Medium- and low-quality peaks are kept for further inspection. By applying WiPP to standard compound mixes and a complex biological dataset, we demonstrate that peak detection is improved through the novel way to assign peak quality, an automated parameter optimisation, and results in integration across different embedded peak picking algorithms. Furthermore, our approach can provide an impartial performance comparison of different peak picking algorithms. WiPP is freely available on GitHub (https://github.com/bihealth/WiPP) under MIT licence.

RESUMO

Assuntos

RESUMO

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA