Your browser doesn't support javascript.
loading
T-S2Inet: Transformer-based sequence-to-image network for accurate nanopore sequence recognition.
Guan, Xiaoyu; Shao, Wei; Zhang, Daoqiang.
Affiliation
  • Guan X; College of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, MIIT Key Laboratory of Pattern Analysis and Machine Intelligence, Nanjing 211106, China.
  • Shao W; Key Laboratory of Brain-Machine Intelligence Technology, Ministry of Education, Nanjing University of Aeronautics and Astronautics, Nanjing 211106, China.
  • Zhang D; College of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, MIIT Key Laboratory of Pattern Analysis and Machine Intelligence, Nanjing 211106, China.
Bioinformatics ; 40(2)2024 02 01.
Article in En | MEDLINE | ID: mdl-38366607
ABSTRACT
MOTIVATION Nanopore sequencing is a new macromolecular recognition and perception technology that enables high-throughput sequencing of DNA, RNA, even protein molecules. The sequences generated by nanopore sequencing span a large time frame, and the labor and time costs incurred by traditional analysis methods are substantial. Recently, research on nanopore data analysis using machine learning algorithms has gained unceasing momentum, but there is often a significant gap between traditional and deep learning methods in terms of classification results. To analyze nanopore data using deep learning technologies, measures such as sequence completion and sequence transformation can be employed. However, these technologies do not preserve the local features of the sequences. To address this issue, we propose a sequence-to-image (S2I) module that transforms sequences of unequal length into images. Additionally, we propose the Transformer-based T-S2Inet model to capture the important information and improve the classification accuracy.

RESULTS:

Quantitative and qualitative analysis shows that the experimental results have an improvement of around 2% in accuracy compared to previous methods. The proposed method is adaptable to other nanopore platforms, such as the Oxford nanopore. It is worth noting that the proposed method not only aims to achieve the most advanced performance, but also provides a general idea for the analysis of nanopore sequences of unequal length. AVAILABILITY AND IMPLEMENTATION The main program is available at https//github.com/guanxiaoyu11/S2Inet.
Subject(s)

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Software / Nanopores Language: En Journal: Bioinformatics / Bioinformatics (Oxford. Online) Journal subject: INFORMATICA MEDICA Year: 2024 Type: Article Affiliation country: China

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Software / Nanopores Language: En Journal: Bioinformatics / Bioinformatics (Oxford. Online) Journal subject: INFORMATICA MEDICA Year: 2024 Type: Article Affiliation country: China