pathMap: a path-based mapping tool for long noisy reads with high sensitivity.
Wei, Ze-Gang; Zhang, Xiao-Dan; Fan, Xing-Guo; Qian, Yu; Liu, Fei; Wu, Fang-Xiang.
Affiliation
  • Wei ZG; School of Physics and Opto-Electronics Technology, Baoji University of Arts and Sciences, Baoji, 721016, China.
  • Zhang XD; Division of Biomedical Engineering, Department of Computer Science and Department of Mechanical Engineering, University of Saskatchewan, Saskatoon, SK S7N 5A9, Canada.
  • Fan XG; School of Physics and Opto-Electronics Technology, Baoji University of Arts and Sciences, Baoji, 721016, China.
  • Qian Y; School of Physics and Opto-Electronics Technology, Baoji University of Arts and Sciences, Baoji, 721016, China.
  • Liu F; School of Physics and Opto-Electronics Technology, Baoji University of Arts and Sciences, Baoji, 721016, China.
  • Wu FX; School of Physics and Opto-Electronics Technology, Baoji University of Arts and Sciences, Baoji, 721016, China.
Brief Bioinform; 25(2), 2024 Jan 22.
Article in En | MEDLINE | ID: mdl-38517696
ABSTRACT
With the rapid development of single-molecule sequencing (SMS) technologies, output read lengths are continuously increasing. Mapping such reads onto a reference genome is one of the most fundamental tasks in sequence analysis. Mapping sensitivity is becoming a major concern, since a more sensitive mapper detects more aligned regions on the reference and recovers more aligned bases, both of which are useful for downstream analysis. In this study, we present pathMap, a novel k-mer graph-based mapper specifically designed for mapping SMS reads with high sensitivity. By viewing an alignment chain as a path that contains as many anchors as possible in the matched k-mer graph, pathMap treats chaining as a path selection problem in a directed graph. pathMap iteratively searches for the longest path among the remaining nodes, so that more high-quality candidate chains can be detected and aligned. Experimental results on simulated and real-life datasets show that, compared with other state-of-the-art mappers such as minimap2 and Winnowmap2, pathMap maps at least 11.50% more chains than its closest competitor and increases mapping sensitivity, measured in bases, by 17.28% and 13.84% over the next-best mapper for Pacific Biosciences and Oxford Nanopore sequencing data, respectively. In addition, pathMap is more robust to sequencing errors and more sensitive in species- and strain-specific identification of pathogens using MinION reads.
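The chaining idea described in the abstract (repeatedly extract the longest path from a directed graph of anchors, then remove its nodes) can be illustrated with a minimal sketch. This is not pathMap's implementation: the anchor representation, the edge rule (both coordinates strictly increasing, gaps bounded by `max_gap`), the path score (number of anchors), and the names `longest_path`, `iterative_chains`, `max_gap`, and `min_anchors` are all assumptions made for illustration.

```python
from typing import List, Tuple

# Hypothetical anchor type: (read position, reference position) of a matched k-mer.
Anchor = Tuple[int, int]


def longest_path(anchors: List[Anchor], max_gap: int) -> List[Anchor]:
    """Return a longest path (by anchor count) in the anchor DAG.

    Assumed edge rule: a -> b if b lies after a on both the read and the
    reference, and both gaps are at most max_gap.
    """
    nodes = sorted(anchors)  # sorting by read position gives a topological order
    if not nodes:
        return []
    best = [1] * len(nodes)   # best[j]: length of the longest path ending at j
    prev = [-1] * len(nodes)  # prev[j]: predecessor of j on that path
    for j, (rj, gj) in enumerate(nodes):
        for i in range(j):
            ri, gi = nodes[i]
            colinear = ri < rj and gi < gj
            close = rj - ri <= max_gap and gj - gi <= max_gap
            if colinear and close and best[i] + 1 > best[j]:
                best[j] = best[i] + 1
                prev[j] = i
    # Backtrack from the node where the longest path ends.
    end = max(range(len(nodes)), key=best.__getitem__)
    path = []
    while end != -1:
        path.append(nodes[end])
        end = prev[end]
    return path[::-1]


def iterative_chains(anchors: List[Anchor], max_gap: int = 50,
                     min_anchors: int = 2) -> List[List[Anchor]]:
    """Repeatedly take the longest path among the remaining anchors."""
    remaining = set(anchors)
    chains = []
    while remaining:
        path = longest_path(list(remaining), max_gap)
        if len(path) < min_anchors:
            break  # only short, low-support paths are left
        chains.append(path)
        remaining -= set(path)  # chained anchors are not reused
    return chains


anchors = [(0, 100), (10, 110), (20, 120), (5, 500), (15, 510), (40, 900)]
chains = iterative_chains(anchors)
```

With these toy anchors, the first pass extracts the three-anchor chain near reference position 100, the second pass finds the two-anchor chain near position 500, and the isolated anchor at 900 is discarded. A single-best-chain strategy would report only the first of these, which is the intuition behind the sensitivity gain claimed for iterative path selection.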

Full text: 1 Collections: 01-international Database: MEDLINE Main subject: High-Throughput Nucleotide Sequencing / Nanopore Sequencing Language: En Journal: Brief Bioinform Journal subject: BIOLOGY / MEDICAL INFORMATICS Year of publication: 2024 Document type: Article Country of affiliation: China
