Your browser doesn't support javascript.
loading
SteroidXtract: Deep Learning-Based Pattern Recognition Enables Comprehensive and Rapid Extraction of Steroid-Like Metabolic Features for Automated Biology-Driven Metabolomics.
Xing, Shipei; Jiao, Yibo; Salehzadeh, Melody; Soma, Kiran K; Huan, Tao.
Afiliación
  • Xing S; Department of Chemistry, Faculty of Science, University of British Columbia, Vancouver Campus, 2036 Main Mall, Vancouver, BC V6T 1Z1, Canada.
  • Jiao Y; Department of Chemistry, Faculty of Science, University of British Columbia, Vancouver Campus, 2036 Main Mall, Vancouver, BC V6T 1Z1, Canada.
  • Salehzadeh M; Department of Zoology, Faculty of Science, University of British Columbia, Vancouver Campus, 6270 University Boulevard, Vancouver, BC V6T 1Z4, Canada.
  • Soma KK; Department of Zoology, Faculty of Science, University of British Columbia, Vancouver Campus, 6270 University Boulevard, Vancouver, BC V6T 1Z4, Canada.
  • Huan T; Department of Psychology, Faculty of Arts, University of British Columbia, Vancouver Campus, 2136 West Mall, Vancouver, BC V6T 1Z4, Canada.
Anal Chem ; 93(14): 5735-5743, 2021 04 13.
Article en En | MEDLINE | ID: mdl-33784068
ABSTRACT
Despite the vast amount of metabolic information that can be captured in untargeted metabolomics, many biological applications are looking for a biology-driven metabolomics platform that targets a set of metabolites that are relevant to the given biological question. Steroids are a class of important molecules that play critical roles in many physiological systems and diseases. Besides known steroids, there are a large number of unknown steroids that have not been reported in the literature. The ability to rapidly detect and quantify both known and unknown steroid molecules in a biological sample can greatly accelerate a broad range of steroid-focused life science research. This work describes the development and application of SteroidXtract, a convolutional neural network (CNN)-based bioinformatics tool that can recognize steroid molecules in mass spectrometry (MS)-based untargeted metabolomics using their unique tandem MS (MS2) spectral patterns. SteroidXtract was trained using a comprehensive set of standard MS2 spectra from MassBank of North America (MoNA) and an in-house steroid library. Data augmentation strategies, including intensity thresholding and Gaussian noise addition, were created and applied to minimize data overfitting caused by the limited number of standard steroid MS2 spectra. The CNN model embedded in SteroidXtract was further compared with random forest and XGBoost using nested cross-validations to demonstrate its performance. Finally, SteroidXtract was applied in several metabolomics studies to demonstrate its sensitivity, specificity, and robustness. Compared to conventional statistics-driven metabolomics data interpretation, our work offers a novel automated biology-driven approach to interpreting untargeted metabolomics data, prioritizing biologically important molecules with high throughput and sensitivity.
Asunto(s)

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Aprendizaje Profundo Idioma: En Revista: Anal Chem Año: 2021 Tipo del documento: Article País de afiliación: Canadá

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Aprendizaje Profundo Idioma: En Revista: Anal Chem Año: 2021 Tipo del documento: Article País de afiliación: Canadá