Your browser doesn't support javascript.
loading
Integrated genome sizing (IGS) approach for the parallelization of whole genome analysis.
Sona, Peter; Hong, Jong Hui; Lee, Sunho; Kim, Byong Joon; Hong, Woon-Young; Jung, Jongcheol; Kim, Han-Na; Kim, Hyung-Lae; Christopher, David; Herviou, Laurent; Im, Young Hwan; Lee, Kwee-Yum; Kim, Tae Soon; Jung, Jongsun.
Afiliación
  • Sona P; Genome Data Integration Center, Syntekabio Incorporated, Techno-2ro B-512, Yuseong-gu, Daejeon, Republic of Korea, 34025.
  • Hong JH; Genome Data Integration Center, Syntekabio Incorporated, Techno-2ro B-512, Yuseong-gu, Daejeon, Republic of Korea, 34025.
  • Lee S; Genome Data Integration Center, Syntekabio Incorporated, Techno-2ro B-512, Yuseong-gu, Daejeon, Republic of Korea, 34025.
  • Kim BJ; Genome Data Integration Center, Syntekabio Incorporated, Techno-2ro B-512, Yuseong-gu, Daejeon, Republic of Korea, 34025.
  • Hong WY; Genome Data Integration Center, Syntekabio Incorporated, Techno-2ro B-512, Yuseong-gu, Daejeon, Republic of Korea, 34025.
  • Jung J; Genome Data Integration Center, Syntekabio Incorporated, Techno-2ro B-512, Yuseong-gu, Daejeon, Republic of Korea, 34025.
  • Kim HN; PGM21 (Personalized Genomic Medicine 21), Ewha Womans University Medical Center, 1071, Anyang Cheon-ro, Yangcheon-gu, Seoul, 158-710, Korea.
  • Kim HL; PGM21 (Personalized Genomic Medicine 21), Ewha Womans University Medical Center, 1071, Anyang Cheon-ro, Yangcheon-gu, Seoul, 158-710, Korea.
  • Christopher D; Bioinformatics Solutions, 900 N McCarthy Blvd., Milpitas, CA, 95035, USA.
  • Herviou L; Bioinformatics Solutions, 900 N McCarthy Blvd., Milpitas, CA, 95035, USA.
  • Im YH; Bioinformatics Solutions, 900 N McCarthy Blvd., Milpitas, CA, 95035, USA.
  • Lee KY; Genome Data Integration Center, Syntekabio Incorporated, Techno-2ro B-512, Yuseong-gu, Daejeon, Republic of Korea, 34025.
  • Kim TS; Faculty of Medicine, University of Queensland, QLD, Brisbane, 4072, Australia.
  • Jung J; Genome Data Integration Center, Syntekabio Incorporated, Techno-2ro B-512, Yuseong-gu, Daejeon, Republic of Korea, 34025.
BMC Bioinformatics ; 19(1): 462, 2018 Dec 03.
Article en En | MEDLINE | ID: mdl-30509173
ABSTRACT

BACKGROUND:

The use of whole genome sequence has increased recently with rapid progression of next-generation sequencing (NGS) technologies. However, storing raw sequence reads to perform large-scale genome analysis pose hardware challenges. Despite advancement in genome analytic platforms, efficient approaches remain relevant especially as applied to the human genome. In this study, an Integrated Genome Sizing (IGS) approach is adopted to speed up multiple whole genome analysis in high-performance computing (HPC) environment. The approach splits a genome (GRCh37) into 630 chunks (fragments) wherein multiple chunks can simultaneously be parallelized for sequence analyses across cohorts.

RESULTS:

IGS was integrated on Maha-Fs (HPC) system, to provide the parallelization required to analyze 2504 whole genomes. Using a single reference pilot genome, NA12878, we compared the NGS process time between Maha-Fs (NFS SATA hard disk drive) and SGI-UV300 (solid state drive memory). It was observed that SGI-UV300 was faster, having 32.5 mins of process time, while that of the Maha-Fs was 55.2 mins.

CONCLUSIONS:

The implementation of IGS can leverage the ability of HPC systems to analyze multiple genomes simultaneously. We believe this approach will accelerate research advancement in personalized genomic medicine. Our method is comparable to the fastest methods for sequence alignment.
Asunto(s)
Palabras clave

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Análisis de Secuencia de ADN / Genómica / Secuenciación de Nucleótidos de Alto Rendimiento / Tamaño del Genoma Límite: Humans Idioma: En Revista: BMC Bioinformatics Asunto de la revista: INFORMATICA MEDICA Año: 2018 Tipo del documento: Article

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Análisis de Secuencia de ADN / Genómica / Secuenciación de Nucleótidos de Alto Rendimiento / Tamaño del Genoma Límite: Humans Idioma: En Revista: BMC Bioinformatics Asunto de la revista: INFORMATICA MEDICA Año: 2018 Tipo del documento: Article
...