Your browser doesn't support javascript.
loading
DeF-GPU: Efficient and effective deletions finding in hepatitis B viral genomic DNA using a GPU architecture.
Cheng, Chun-Pei; Lan, Kuo-Lun; Liu, Wen-Chun; Chang, Ting-Tsung; Tseng, Vincent S.
Affiliation
  • Cheng CP; Department of Computer Science and Information Engineering, National Cheng Kung University, Tainan 701, Taiwan.
  • Lan KL; Department of Computer Science and Information Engineering, National Cheng Kung University, Tainan 701, Taiwan.
  • Liu WC; Department of Internal Medicine, National Cheng Kung University Medical College and Hospital, Tainan 701, Taiwan.
  • Chang TT; Department of Internal Medicine, National Cheng Kung University Medical College and Hospital, Tainan 701, Taiwan.
  • Tseng VS; Department of Computer Science, National Chiao Tung University, Hsinchu 308, Taiwan. Electronic address: vtseng@cs.nctu.edu.tw.
Methods ; 111: 56-63, 2016 12 01.
Article in En | MEDLINE | ID: mdl-27480381
ABSTRACT
Hepatitis B viral (HBV) infection is strongly associated with an increased risk of liver diseases like cirrhosis or hepatocellular carcinoma (HCC). Many lines of evidence suggest that deletions occurring in HBV genomic DNA are highly associated with the activity of HBV via the interplay between aberrant viral proteins release and human immune system. Deletions finding on the HBV whole genome sequences is thus a very important issue though there exist underlying the challenges in mining such big and complex biological data. Although some next generation sequencing (NGS) tools are recently designed for identifying structural variations such as insertions or deletions, their validity is generally committed to human sequences study. This design may not be suitable for viruses due to different species. We propose a graphics processing unit (GPU)-based data mining method called DeF-GPU to efficiently and precisely identify HBV deletions from large NGS data, which generally contain millions of reads. To fit the single instruction multiple data instructions, sequencing reads are referred to as multiple data and the deletion finding procedure is referred to as a single instruction. We use Compute Unified Device Architecture (CUDA) to parallelize the procedures, and further validate DeF-GPU on 5 synthetic and 1 real datasets. Our results suggest that DeF-GPU outperforms the existing commonly-used method Pindel and is able to exactly identify the deletions of our ground truth in few seconds. The source code and other related materials are available at https//sourceforge.net/projects/defgpu/.
Subject(s)
Key words

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Hepatitis B virus / Genome, Viral / Computational Biology / Hepatitis B Type of study: Diagnostic_studies / Prognostic_studies Limits: Humans Language: En Journal: Methods Journal subject: BIOQUIMICA Year: 2016 Document type: Article Affiliation country: Taiwán

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Hepatitis B virus / Genome, Viral / Computational Biology / Hepatitis B Type of study: Diagnostic_studies / Prognostic_studies Limits: Humans Language: En Journal: Methods Journal subject: BIOQUIMICA Year: 2016 Document type: Article Affiliation country: Taiwán