Your browser doesn't support javascript.
loading
Advancing HIV Vaccine Research With Low-Cost High-Performance Computing Infrastructure: An Alternative Approach for Resource-Limited Settings.
Mabvakure, Batsirai M; Rott, Raymond; Dobrowsky, Leslie; Van Heusden, Peter; Morris, Lynn; Scheepers, Cathrine; Moore, Penny L.
Affiliation
  • Mabvakure BM; Center for HIV and STIs, National Institute for Communicable Diseases, National Health Laboratory Service (NHLS), Johannesburg, South Africa.
  • Rott R; Antibody Immunity Research Unit, Faculty of Health Sciences, University of the Witwatersrand, Johannesburg, South Africa.
  • Dobrowsky L; Division of Transfusion Medicine, Department of Pathology, Johns Hopkins University School of Medicine, Baltimore, MD, USA.
  • Van Heusden P; Bridge-the-Gap, Johannesburg, South Africa.
  • Morris L; Bridge-the-Gap, Johannesburg, South Africa.
  • Scheepers C; South African National Bioinformatics Institute, University of the Western Cape, Cape Town, South Africa.
  • Moore PL; Center for HIV and STIs, National Institute for Communicable Diseases, National Health Laboratory Service (NHLS), Johannesburg, South Africa.
Bioinform Biol Insights ; 13: 1177932219882347, 2019.
Article in En | MEDLINE | ID: mdl-35173421
ABSTRACT
Next-generation sequencing (NGS) technologies have revolutionized biological research by generating genomic data that were once unaffordable by traditional first-generation sequencing technologies. These sequencing methodologies provide an opportunity for in-depth analyses of host and pathogen genomes as they are able to sequence millions of templates at a time. However, these large datasets can only be efficiently explored using bioinformatics analyses requiring huge data storage and computational resources adapted for high-performance processing. High-performance computing allows for efficient handling of large data and tasks that may require multi-threading and prolonged computational times, which is not feasible with ordinary computers. However, high-performance computing resources are costly and therefore not always readily available in low-income settings. We describe the establishment of an affordable high-performance computing bioinformatics cluster consisting of 3 nodes, constructed using ordinary desktop computers and open-source software including Linux Fedora, SLURM Workload Manager, and the Conda package manager. For the analysis of large antibody sequence datasets and for complex viral phylodynamic analyses, the cluster out-performed desktop computers. This has demonstrated that it is possible to construct high-performance computing capacity capable of analyzing large NGS data from relatively low-cost hardware and entirely free (open-source) software, even in resource-limited settings. Such a cluster design has broad utility beyond bioinformatics to other studies that require high-performance computing.
Key words

Full text: 1 Collection: 01-internacional Database: MEDLINE Type of study: Health_economic_evaluation Language: En Journal: Bioinform Biol Insights Year: 2019 Document type: Article Affiliation country: South Africa

Full text: 1 Collection: 01-internacional Database: MEDLINE Type of study: Health_economic_evaluation Language: En Journal: Bioinform Biol Insights Year: 2019 Document type: Article Affiliation country: South Africa