Your browser doesn't support javascript.
loading
Linear: a framework to enable existing software to resolve structural variants in long reads with flexible and efficient alignment-free statistical models.
Pan, Chenxu; Rahn, René; Heller, David; Reinert, Knut.
Afiliación
  • Pan C; Department of Mathematics and Computer Science, Freie Universität Berlin, Takustr. 9, Berlin 14195, Germany.
  • Rahn R; Department of Mathematics and Computer Science, Freie Universität Berlin, Takustr. 9, Berlin 14195, Germany.
  • Heller D; Department of Computational Molecular Biology, Max Planck Institute for Molecular Genetics, Berlin 14195, Germany.
  • Reinert K; Department of Mathematics and Computer Science, Freie Universität Berlin, Takustr. 9, Berlin 14195, Germany.
Brief Bioinform ; 24(2)2023 03 19.
Article en En | MEDLINE | ID: mdl-36869850
ABSTRACT
Alignment is the cornerstone of many long-read pipelines and plays an essential role in resolving structural variants (SVs). However, forced alignments of SVs embedded in long reads, inflexibility of integrating novel SVs models and computational inefficiency remain problems. Here, we investigate the feasibility of resolving long-read SVs with alignment-free algorithms. We ask (1) Is it possible to resolve long-read SVs with alignment-free approaches? and (2) Does it provide an advantage over existing approaches? To this end, we implemented the framework named Linear, which can flexibly integrate alignment-free algorithms such as the generative model for long-read SV detection. Furthermore, Linear addresses the problem of compatibility of alignment-free approaches with existing software. It takes as input long reads and outputs standardized results existing software can directly process. We conducted large-scale assessments in this work and the results show that the sensitivity, and flexibility of Linear outperform alignment-based pipelines. Moreover, the computational efficiency is orders of magnitude faster.
Asunto(s)
Palabras clave

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Programas Informáticos / Genoma Humano Tipo de estudio: Risk_factors_studies Límite: Humans Idioma: En Revista: Brief Bioinform Asunto de la revista: BIOLOGIA / INFORMATICA MEDICA Año: 2023 Tipo del documento: Article País de afiliación: Alemania

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Programas Informáticos / Genoma Humano Tipo de estudio: Risk_factors_studies Límite: Humans Idioma: En Revista: Brief Bioinform Asunto de la revista: BIOLOGIA / INFORMATICA MEDICA Año: 2023 Tipo del documento: Article País de afiliación: Alemania