MoSBi: Automated signature mining for molecular stratification and subtyping.

Rose, Tim Daniel; Bechtler, Thibault; Ciora, Octavia-Andreea; Anh Lilian Le, Kim; Molnar, Florian; Köhler, Nikolai; Baumbach, Jan; Röttger, Richard; Pauling, Josch Konstantin

Rose, Tim Daniel; Bechtler, Thibault; Ciora, Octavia-Andreea; Anh Lilian Le, Kim; Molnar, Florian; Köhler, Nikolai; Baumbach, Jan; Röttger, Richard; Pauling, Josch Konstantin.

Rose TD; LipiTUM, TUM School of Life Sciences, Technical University of Munich (TUM), 65354 Freising, Germany.
Bechtler T; LipiTUM, TUM School of Life Sciences, Technical University of Munich (TUM), 65354 Freising, Germany.
Ciora OA; LipiTUM, TUM School of Life Sciences, Technical University of Munich (TUM), 65354 Freising, Germany.
Anh Lilian Le K; LipiTUM, TUM School of Life Sciences, Technical University of Munich (TUM), 65354 Freising, Germany.
Molnar F; LipiTUM, TUM School of Life Sciences, Technical University of Munich (TUM), 65354 Freising, Germany.
Köhler N; LipiTUM, TUM School of Life Sciences, Technical University of Munich (TUM), 65354 Freising, Germany.
Baumbach J; Department for Mathematics and Computer Science, University of Southern Denmark, 5230 Odense, Denmark.
Röttger R; Institute for Computational Systems Biology, University of Hamburg, 22607 Hamburg, Germany.
Pauling JK; LipiTUM, TUM School of Life Sciences, Technical University of Munich (TUM), 65354 Freising, Germany.

Proc Natl Acad Sci U S A ; 119(16): e2118210119, 2022 04 19.

Article en En | MEDLINE | ID: mdl-35412913

RESUMEN

The improving access to increasing amounts of biomedical data provides completely new chances for advanced patient stratification and disease subtyping strategies. This requires computational tools that produce uniformly robust results across highly heterogeneous molecular data. Unsupervised machine learning methodologies are able to discover de novo patterns in such data. Biclustering is especially suited by simultaneously identifying sample groups and corresponding feature sets across heterogeneous omics data. The performance of available biclustering algorithms heavily depends on individual parameterization and varies with their application. Here, we developed MoSBi (molecular signature identification using biclustering), an automated multialgorithm ensemble approach that integrates results utilizing an error model-supported similarity network. We systematically evaluated the performance of 11 available and established biclustering algorithms together with MoSBi. For this, we used transcriptomics, proteomics, and metabolomics data, as well as synthetic datasets covering various data properties. Profiting from multialgorithm integration, MoSBi identified robust group and disease-specific signatures across all scenarios, overcoming single algorithm specificities. Furthermore, we developed a scalable network-based visualization of bicluster communities that supports biological hypothesis generation. MoSBi is available as an R package and web service to make automated biclustering analysis accessible for application in molecular sample stratification.

Asunto(s)

Enfermedad; Perfilación de la Expresión Génica; Metabolómica; Pacientes; Proteómica; Programas Informáticos; Algoritmos; Análisis por Conglomerados; Enfermedad/clasificación; Humanos; Pacientes/clasificación

Palabras clave

biclustering; multiomics; pathomechanism; stratification; subtyping

Texto completo

Imprimir

XML

PubMed Links

Search on Google

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Pacientes / Programas Informáticos / Enfermedad / Perfilación de la Expresión Génica / Proteómica / Metabolómica Tipo de estudio: Prognostic_studies Límite: Humans Idioma: En Año: 2022 Tipo del documento: Article

Texto completo

Imprimir

XML

PubMed Links

Search on Google