Your browser doesn't support javascript.
loading
Prediction of Vacuum Ultraviolet/Ultraviolet Gas-Phase Absorption Spectra Using Molecular Feature Representations and Machine Learning.
Ho Manh, Linh; Chen, Victoria C P; Rosenberger, Jay; Wang, Shouyi; Yang, Yujing; Schug, Kevin A.
Affiliation
  • Ho Manh L; Department of Industrial, Manufacturing, and Systems Engineering, The University of Texas at Arlington, Arlington, Texas 76019, United States.
  • Chen VCP; Department of Industrial, Manufacturing, and Systems Engineering, The University of Texas at Arlington, Arlington, Texas 76019, United States.
  • Rosenberger J; Department of Industrial, Manufacturing, and Systems Engineering, The University of Texas at Arlington, Arlington, Texas 76019, United States.
  • Wang S; Department of Industrial, Manufacturing, and Systems Engineering, The University of Texas at Arlington, Arlington, Texas 76019, United States.
  • Yang Y; Department of Industrial, Manufacturing, and Systems Engineering, The University of Texas at Arlington, Arlington, Texas 76019, United States.
  • Schug KA; Department of Chemistry and Biochemistry, The University of Texas at Arlington, Arlington, Texas 76019, United States.
J Chem Inf Model ; 64(14): 5547-5556, 2024 Jul 22.
Article in En | MEDLINE | ID: mdl-38938209
ABSTRACT
Ultraviolet (UV) absorption spectroscopy is a widely used tool for quantitative and qualitative analyses of chemical compounds. In the gas phase, vacuum UV (VUV) and UV absorption spectra are specific and diagnostic for many small molecules. An accurate prediction of VUV/UV absorption spectra can aid the characterization of new or unknown molecules in areas such as fuels, forensics, and pharmaceutical research. An alternative to quantum chemical spectral prediction is the use of artificial intelligence. Here, different molecular feature representation techniques were used and developed to encode chemical structures for testing three machine learning models to predict gas-phase VUV/UV absorption spectra. Structure data files (.sdf) and VUV/UV absorption spectra for 1397 volatile and semivolatile chemical compounds were used to train and test the models. New molecular features (termed ABOCH) were introduced to better capture pi-bonding, aromaticity, and halogenation. The incorporation of these new features benefited spectral prediction and demonstrated superior performance compared to computationally intensive molecular-based deep learning methods. Of the machine learning methods, the use of a Random Forest regressor returned the best accuracy score with the shortest training time. The developed machine learning prediction model also outperformed spectral predictions based on the time-dependent density functional theory.
Subject(s)

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Spectrophotometry, Ultraviolet / Machine Learning / Gases Language: En Journal: J Chem Inf Model / J. chem. inf. model / Journal of chemical information and modeling Journal subject: INFORMATICA MEDICA / QUIMICA Year: 2024 Type: Article Affiliation country: United States

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Spectrophotometry, Ultraviolet / Machine Learning / Gases Language: En Journal: J Chem Inf Model / J. chem. inf. model / Journal of chemical information and modeling Journal subject: INFORMATICA MEDICA / QUIMICA Year: 2024 Type: Article Affiliation country: United States