Búsqueda | Portal Regional de la BVS

Automatic identification of relevant genes from low-dimensional embeddings of single-cell RNA-seq data.

Angerer, Philipp; Fischer, David S; Theis, Fabian J; Scialdone, Antonio; Marr, Carsten.

Bioinformatics ; 36(15): 4291-4295, 2020 08 01.

Artículo en Inglés | MEDLINE | ID: mdl-32207520

RESUMEN

MOTIVATION: Dimensionality reduction is a key step in the analysis of single-cell RNA-sequencing data. It produces a low-dimensional embedding for visualization and as a calculation base for downstream analysis. Nonlinear techniques are most suitable to handle the intrinsic complexity of large, heterogeneous single-cell data. However, with no linear relation between gene and embedding coordinate, there is no way to extract the identity of genes driving any cell's position in the low-dimensional embedding, making it difficult to characterize the underlying biological processes. RESULTS: In this article, we introduce the concepts of local and global gene relevance to compute an equivalent of principal component analysis loadings for non-linear low-dimensional embeddings. Global gene relevance identifies drivers of the overall embedding, while local gene relevance identifies those of a defined sub-region. We apply our method to single-cell RNA-seq datasets from different experimental protocols and to different low-dimensional embedding techniques. This shows our method's versatility to identify key genes for a variety of biological processes. AVAILABILITY AND IMPLEMENTATION: To ensure reproducibility and ease of use, our method is released as part of destiny 3.0, a popular R package for building diffusion maps from single-cell transcriptomic data. It is readily available through Bioconductor. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Asunto(s)

RNA-Seq , ARN , Perfilación de la Expresión Génica , Análisis de Componente Principal , ARN/genética , Reproducibilidad de los Resultados , Análisis de Secuencia de ARN , Análisis de la Célula Individual , Programas Informáticos

BART-Seq: cost-effective massively parallelized targeted sequencing for genomics, transcriptomics, and single-cell analysis.

Uzbas, Fatma; Opperer, Florian; Sönmezer, Can; Shaposhnikov, Dmitry; Sass, Steffen; Krendl, Christian; Angerer, Philipp; Theis, Fabian J; Mueller, Nikola S; Drukker, Micha.

Genome Biol ; 20(1): 155, 2019 08 06.

Artículo en Inglés | MEDLINE | ID: mdl-31387612

RESUMEN

We describe a highly sensitive, quantitative, and inexpensive technique for targeted sequencing of transcript cohorts or genomic regions from thousands of bulk samples or single cells in parallel. Multiplexing is based on a simple method that produces extensive matrices of diverse DNA barcodes attached to invariant primer sets, which are all pre-selected and optimized in silico. By applying the matrices in a novel workflow named Barcode Assembly foR Targeted Sequencing (BART-Seq), we analyze developmental states of thousands of single human pluripotent stem cells, either in different maintenance media or upon Wnt/ß-catenin pathway activation, which identifies the mechanisms of differentiation induction. Moreover, we apply BART-Seq to the genetic screening of breast cancer patients and identify BRCA mutations with very high precision. The processing of thousands of samples and dynamic range measurements that outperform global transcriptomics techniques makes BART-Seq first targeted sequencing technique suitable for numerous research applications.

Asunto(s)

Perfilación de la Expresión Génica/métodos , Genómica/métodos , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Análisis de Secuencia de ARN/métodos , Neoplasias de la Mama/genética , Análisis Costo-Beneficio , Células Madre Embrionarias/metabolismo , Femenino , Perfilación de la Expresión Génica/economía , Genómica/economía , Secuenciación de Nucleótidos de Alto Rendimiento/economía , Humanos , Células Madre Pluripotentes/metabolismo , Análisis de Secuencia de ARN/economía , Análisis de la Célula Individual/economía , Análisis de la Célula Individual/métodos , Vía de Señalización Wnt , Flujo de Trabajo

SCANPY: large-scale single-cell gene expression data analysis.

Wolf, F Alexander; Angerer, Philipp; Theis, Fabian J.

Genome Biol ; 19(1): 15, 2018 02 06.

Artículo en Inglés | MEDLINE | ID: mdl-29409532

RESUMEN

SCANPY is a scalable toolkit for analyzing single-cell gene expression data. It includes methods for preprocessing, visualization, clustering, pseudotime and trajectory inference, differential expression testing, and simulation of gene regulatory networks. Its Python-based implementation efficiently deals with data sets of more than one million cells ( https://github.com/theislab/Scanpy ). Along with SCANPY, we present ANNDATA, a generic class for handling annotated data matrices ( https://github.com/theislab/anndata ).

Asunto(s)

Perfilación de la Expresión Génica/métodos , Programas Informáticos , Redes Reguladoras de Genes , Análisis de la Célula Individual

destiny: diffusion maps for large-scale single-cell data in R.

Angerer, Philipp; Haghverdi, Laleh; Büttner, Maren; Theis, Fabian J; Marr, Carsten; Buettner, Florian.

Bioinformatics ; 32(8): 1241-3, 2016 04 15.

Artículo en Inglés | MEDLINE | ID: mdl-26668002

RESUMEN

UNLABELLED: : Diffusion maps are a spectral method for non-linear dimension reduction and have recently been adapted for the visualization of single-cell expression data. Here we present destiny, an efficient R implementation of the diffusion map algorithm. Our package includes a single-cell specific noise model allowing for missing and censored values. In contrast to previous implementations, we further present an efficient nearest-neighbour approximation that allows for the processing of hundreds of thousands of cells and a functionality for projecting new data on existing diffusion maps. We exemplarily apply destiny to a recent time-resolved mass cytometry dataset of cellular reprogramming. AVAILABILITY AND IMPLEMENTATION: destiny is an open-source R/Bioconductor package "bioconductor.org/packages/destiny" also available at www.helmholtz-muenchen.de/icb/destiny A detailed vignette describing functions and workflows is provided with the package. CONTACT: carsten.marr@helmholtz-muenchen.de or f.buettner@helmholtz-muenchen.de SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Asunto(s)

Algoritmos , Análisis de la Célula Individual/métodos , Análisis por Conglomerados , Difusión , Programas Informáticos

LocTree3 prediction of localization.

Goldberg, Tatyana; Hecht, Maximilian; Hamp, Tobias; Karl, Timothy; Yachdav, Guy; Ahmed, Nadeem; Altermann, Uwe; Angerer, Philipp; Ansorge, Sonja; Balasz, Kinga; Bernhofer, Michael; Betz, Alexander; Cizmadija, Laura; Do, Kieu Trinh; Gerke, Julia; Greil, Robert; Joerdens, Vadim; Hastreiter, Maximilian; Hembach, Katharina; Herzog, Max; Kalemanov, Maria; Kluge, Michael; Meier, Alice; Nasir, Hassan; Neumaier, Ulrich; Prade, Verena; Reeb, Jonas; Sorokoumov, Aleksandr; Troshani, Ilira; Vorberg, Susann; Waldraff, Sonja; Zierer, Jonas; Nielsen, Henrik; Rost, Burkhard.

Nucleic Acids Res ; 42(Web Server issue): W350-5, 2014 Jul.

Artículo en Inglés | MEDLINE | ID: mdl-24848019

RESUMEN

The prediction of protein sub-cellular localization is an important step toward elucidating protein function. For each query protein sequence, LocTree2 applies machine learning (profile kernel SVM) to predict the native sub-cellular localization in 18 classes for eukaryotes, in six for bacteria and in three for archaea. The method outputs a score that reflects the reliability of each prediction. LocTree2 has performed on par with or better than any other state-of-the-art method. Here, we report the availability of LocTree3 as a public web server. The server includes the machine learning-based LocTree2 and improves over it through the addition of homology-based inference. Assessed on sequence-unique data, LocTree3 reached an 18-state accuracy Q18=80±3% for eukaryotes and a six-state accuracy Q6=89±4% for bacteria. The server accepts submissions ranging from single protein sequences to entire proteomes. Response time of the unloaded server is about 90 s for a 300-residue eukaryotic protein and a few hours for an entire eukaryotic proteome not considering the generation of the alignments. For over 1000 entirely sequenced organisms, the predictions are directly available as downloads. The web server is available at http://www.rostlab.org/services/loctree3.

Asunto(s)

Proteínas/análisis , Programas Informáticos , Proteínas Arqueales/análisis , Inteligencia Artificial , Proteínas Bacterianas/análisis , Internet , Homología de Secuencia de Aminoácido

Graduate journal: PhD limitations.

Angerer, Philipp.

Nature ; 431(7012): 1128, 2004 Oct 28.

Artículo en Inglés | MEDLINE | ID: mdl-15510156

Asunto(s)

Educación de Postgrado , Empleo , Investigadores/educación , Industria Farmacéutica , Mercadotecnía , Investigadores/economía , Recursos Humanos

Graduate journal: the lab environment.

Angerer, Philipp.

Nature ; 428(6979): 238, 2004 Mar 11.

Artículo en Inglés | MEDLINE | ID: mdl-15014521

Asunto(s)

Educación de Postgrado , Satisfacción en el Trabajo , Laboratorios , Investigadores/psicología , Educación de Postgrado/normas , Ambiente

Graduate journal: seeking perspective.

Angerer, Philipp.

Nature ; 427(6975): 660, 2004 Feb 12.

Artículo en Inglés | MEDLINE | ID: mdl-14961126

Asunto(s)

Mentores , Investigadores/educación , Orientación Vocacional , Selección de Profesión , Educación de Postgrado , Europa (Continente) , Estudiantes , Universidades

Graduate journal: at the crossroads.

Angerer, Philipp.

Nature ; 427(6971): 270, 2004 Jan 15.

Artículo en Inglés | MEDLINE | ID: mdl-14724646

Asunto(s)

Educación de Postgrado , Emprendimiento , Investigación , Comunicación Interdisciplinaria , Suiza

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

Asunto(s)

Asunto(s)

Asunto(s)

Asunto(s)

ENVIAR RESULTADO:

SELECCIÓN DE REFERENCIAS

DETALLE DE LA BÚSQUEDA