Your browser doesn't support javascript.
loading
Scalable deep text comprehension for Cancer surveillance on high-performance computing.
Qiu, John X; Yoon, Hong-Jun; Srivastava, Kshitij; Watson, Thomas P; Blair Christian, J; Ramanathan, Arvind; Wu, Xiao C; Fearn, Paul A; Tourassi, Georgia D.
Afiliação
  • Qiu JX; Biomedical Sciences, Engineering, and Computing Group, Health Data Science Institute, Oak Ridge National Laboratory, Oak Ridge, TN, USA.
  • Yoon HJ; Biomedical Sciences, Engineering, and Computing Group, Health Data Science Institute, Oak Ridge National Laboratory, Oak Ridge, TN, USA. yoonh@ornl.gov.
  • Srivastava K; Biomedical Sciences, Engineering, and Computing Group, Health Data Science Institute, Oak Ridge National Laboratory, Oak Ridge, TN, USA.
  • Watson TP; Herff College of Engineering, University of Memphis, Memphis, TN, USA.
  • Blair Christian J; Biomedical Sciences, Engineering, and Computing Group, Health Data Science Institute, Oak Ridge National Laboratory, Oak Ridge, TN, USA.
  • Ramanathan A; Biomedical Sciences, Engineering, and Computing Group, Health Data Science Institute, Oak Ridge National Laboratory, Oak Ridge, TN, USA.
  • Wu XC; Louisiana Tumor Registry, Louisiana State University Health Sciences Center, New Orleans, LA, USA.
  • Fearn PA; Surveillance Research Program, National Cancer Institute, Bethesda, MD, USA.
  • Tourassi GD; Biomedical Sciences, Engineering, and Computing Group, Health Data Science Institute, Oak Ridge National Laboratory, Oak Ridge, TN, USA.
BMC Bioinformatics ; 19(Suppl 18): 488, 2018 Dec 21.
Article em En | MEDLINE | ID: mdl-30577743
ABSTRACT

BACKGROUND:

Deep Learning (DL) has advanced the state-of-the-art capabilities in bioinformatics applications which has resulted in trends of increasingly sophisticated and computationally demanding models trained by larger and larger data sets. This vastly increased computational demand challenges the feasibility of conducting cutting-edge research. One solution is to distribute the vast computational workload across multiple computing cluster nodes with data parallelism algorithms. In this study, we used a High-Performance Computing environment and implemented the Downpour Stochastic Gradient Descent algorithm for data parallelism to train a Convolutional Neural Network (CNN) for the natural language processing task of information extraction from a massive dataset of cancer pathology reports. We evaluated the scalability improvements using data parallelism training and the Titan supercomputer at Oak Ridge Leadership Computing Facility. To evaluate scalability, we used different numbers of worker nodes and performed a set of experiments comparing the effects of different training batch sizes and optimizer functions.

RESULTS:

We found that Adadelta would consistently converge at a lower validation loss, though requiring over twice as many training epochs as the fastest converging optimizer, RMSProp. The Adam optimizer consistently achieved a close 2nd place minimum validation loss significantly faster; using a batch size of 16 and 32 allowed the network to converge in only 4.5 training epochs.

CONCLUSIONS:

We demonstrated that the networked training process is scalable across multiple compute nodes communicating with message passing interface while achieving higher classification accuracy compared to a traditional machine learning algorithm.
Assuntos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Metodologias Computacionais / Aprendizado Profundo / Neoplasias Tipo de estudo: Prognostic_studies / Screening_studies Limite: Humans Idioma: En Ano de publicação: 2018 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Metodologias Computacionais / Aprendizado Profundo / Neoplasias Tipo de estudo: Prognostic_studies / Screening_studies Limite: Humans Idioma: En Ano de publicação: 2018 Tipo de documento: Article