Your browser doesn't support javascript.
loading
Cloud-BS: A MapReduce-based bisulfite sequencing aligner on cloud.
Choi, Joungmin; Park, Yoonjae; Kim, Sun; Chae, Heejoon.
Afiliação
  • Choi J; * Division of Computer Science, Sookmyung Women's University, 100 Cheongpa-ro 47-gil, 04310 Seoul, Republic of Korea.
  • Park Y; † Department of Electrical and Computer Engineering, Seoul National University, 1 Gwanak-ro, Gwanak-gu, 08826 Seoul, Republic of Korea.
  • Kim S; ‡ Department of Computer Science and Engineering, Seoul National University, 1 Gwanak-ro, Gwanak-gu, 08826 Seoul, Republic of Korea.
  • Chae H; * Division of Computer Science, Sookmyung Women's University, 100 Cheongpa-ro 47-gil, 04310 Seoul, Republic of Korea.
J Bioinform Comput Biol ; 16(6): 1840028, 2018 12.
Article em En | MEDLINE | ID: mdl-30567473
ABSTRACT
In recent years, there have been many studies utilizing DNA methylome data to answer fundamental biological questions. Bisulfite sequencing (BS-seq) has enabled measurement of a genome-wide absolute level of DNA methylation at single-nucleotide resolution. However, due to the ambiguity introduced by bisulfite-treatment, the aligning process especially in large-scale epigenetic research is still considered a huge burden. We present Cloud-BS, an efficient BS-seq aligner designed for parallel execution on a distributed environment. Utilizing Apache Hadoop framework, Cloud-BS splits sequencing reads into multiple blocks and transfers them to distributed nodes. By designing each aligning procedure into separate map and reducing tasks while an internal key-value structure is optimized based on the MapReduce programming model, the algorithm significantly improves alignment performance without sacrificing mapping accuracy. In addition, Cloud-BS minimizes the innate burden of configuring a distributed environment by providing a pre-configured cloud image. Cloud-BS shows significantly improved bisulfite alignment performance compared to other existing BS-seq aligners. We believe our algorithm facilitates large-scale methylome data analysis. The algorithm is freely available at https//paryoja.github.io/Cloud-BS/ .
Assuntos
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Sulfitos / Algoritmos / Análise de Sequência de DNA / Metilação de DNA / Computação em Nuvem Idioma: En Ano de publicação: 2018 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Sulfitos / Algoritmos / Análise de Sequência de DNA / Metilação de DNA / Computação em Nuvem Idioma: En Ano de publicação: 2018 Tipo de documento: Article