Your browser doesn't support javascript.
loading
Saagar-A New, Extensible Set of Molecular Substructures for QSAR/QSPR and Read-Across Predictions.
Sedykh, Alexander Y; Shah, Ruchir R; Kleinstreuer, Nicole C; Auerbach, Scott S; Gombar, Vijay K.
Affiliation
  • Sedykh AY; Sciome LLC, Research Triangle Park, North Carolina 27709, United States.
  • Shah RR; Sciome LLC, Research Triangle Park, North Carolina 27709, United States.
  • Kleinstreuer NC; National Institute of Environmental Health Sciences (NIEHS), National Toxicology Program (NTP), Research Triangle Park, North Carolina 27709, United States.
  • Auerbach SS; National Institute of Environmental Health Sciences (NIEHS), National Toxicology Program (NTP), Research Triangle Park, North Carolina 27709, United States.
  • Gombar VK; Sciome LLC, Research Triangle Park, North Carolina 27709, United States.
Chem Res Toxicol ; 34(2): 634-640, 2021 02 15.
Article in En | MEDLINE | ID: mdl-33356152
ABSTRACT
Molecular structure-based predictive models provide a proven alternative to costly and inefficient animal testing. However, due to a lack of interpretability of predictive models built with abstract molecular descriptors they have earned the notoriety of being black boxes. Interpretable models require interpretable descriptors to provide chemistry-backed predictive reasoning and facilitate intelligent molecular design. We developed a novel set of extensible chemistry-aware substructures, Saagar, to support interpretable predictive models and read-across protocols. Performance of Saagar in chemical characterization and search for structurally similar actives for read-across applications was compared with four publicly available fingerprint sets (MACCS (166), PubChem (881), ECFP4 (1024), ToxPrint (729)) in three benchmark sets (MUV, ULS, and Tox21) spanning ∼145 000 compounds and 78 molecular targets at 1%, 2%, 5%, and 10% false discovery rates. In 18 of the 20 comparisons, interpretable Saagar features performed better than the publicly available, but less interpretable and fixed-bit length, fingerprints. Examples are provided to show the enhanced capability of Saagar in extracting compounds with higher scaffold similarity. Saagar features are interpretable and efficiently characterize diverse chemical collections, thus making them a better choice for building interpretable predictive in silico models and read-across protocols.
Subject(s)

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Anthraquinones / Quantitative Structure-Activity Relationship Type of study: Prognostic_studies / Risk_factors_studies Limits: Animals Language: En Journal: Chem Res Toxicol Journal subject: TOXICOLOGIA Year: 2021 Type: Article Affiliation country: United States

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Anthraquinones / Quantitative Structure-Activity Relationship Type of study: Prognostic_studies / Risk_factors_studies Limits: Animals Language: En Journal: Chem Res Toxicol Journal subject: TOXICOLOGIA Year: 2021 Type: Article Affiliation country: United States