Dataset for discovering new hypertension small molecules using machine learning-aided computational fragment-based design.
Lehasa, Odifentse Mapula-E; Chude-Okonkwo, Uche A K.
Affiliation
  • Lehasa OM; Institute for Intelligent Systems, University of Johannesburg, 69 Kingsway Avenue, Auckland Park, Johannesburg 2092, Gauteng Province, South Africa.
  • Chude-Okonkwo UAK; Institute for Intelligent Systems, University of Johannesburg, 69 Kingsway Avenue, Auckland Park, Johannesburg 2092, Gauteng Province, South Africa.
Data Brief ; 55: 110677, 2024 Aug.
Article in En | MEDLINE | ID: mdl-39071972
ABSTRACT
This dataset demonstrates the use of computational fragmentation-based and machine learning-aided drug discovery to generate new lead molecules for the treatment of hypertension. Specifically, the focus is on agents targeting the renin-angiotensin-aldosterone system (RAAS), commonly classified as Angiotensin-Converting Enzyme Inhibitors (ACEIs) and Angiotensin II Receptor Blockers (ARBs). The preliminary dataset was a target-specific, user-generated fragment library of 63 molecular fragments derived from the 26 approved ACEI and ARB molecules obtained from the ChEMBL and DrugBank molecular databases. This fragment library provided the primary input dataset to generate the new lead molecules presented in the dataset. The newly generated molecules were screened to check whether they met the criteria for oral drugs and contained the core functional group of an ACEI or an ARB. Using unsupervised machine learning, the molecules that met these criteria were divided into clusters of drug classes based on their functional group allocation. This process led to three final output datasets: one containing the new ACEI molecules, another containing the new ARB molecules, and the last containing the new unassigned class molecules. These data can aid in the timely and efficient design of novel antihypertensive drugs. They can also be used in precision hypertension medicine for patients with treatment resistance, non-response or co-morbidities. Although this dataset is specific to antihypertensive agents, the model can be reused with minimal changes to produce new lead molecules for other health conditions.
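The screening step described in the abstract can be illustrated with a minimal sketch. The abstract does not state which oral-drug criteria were applied, so Lipinski's rule of five is used here purely as an assumed stand-in. The `Descriptors` class, the `has_core_group` flag, the `max_violations` threshold, and all descriptor values are hypothetical placeholders, not data from the dataset. In practice the descriptors would be computed with a cheminformatics toolkit.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Descriptors:
    """Precomputed descriptors for one candidate molecule (hypothetical layout)."""
    name: str
    mol_weight: float     # molecular weight in Da
    logp: float           # octanol-water partition coefficient
    h_donors: int         # hydrogen-bond donor count
    h_acceptors: int      # hydrogen-bond acceptor count
    has_core_group: bool  # assumed flag: ACEI or ARB core functional group present


def rule_of_five_violations(d: Descriptors) -> int:
    """Count Lipinski rule-of-five violations (assumed oral-drug criteria)."""
    return sum([
        d.mol_weight > 500,
        d.logp > 5,
        d.h_donors > 5,
        d.h_acceptors > 10,
    ])


def passes_screen(d: Descriptors, max_violations: int = 1) -> bool:
    """Keep molecules that carry a core group and stay within the violation limit."""
    return d.has_core_group and rule_of_five_violations(d) <= max_violations


# Placeholder candidates with made-up descriptor values.
candidates = [
    Descriptors("candidate_a", 350.4, 2.1, 2, 6, True),
    Descriptors("candidate_b", 720.9, 6.3, 6, 12, True),
    Descriptors("candidate_c", 410.5, 3.0, 1, 5, False),
]

kept = [d.name for d in candidates if passes_screen(d)]
print(kept)
```

Molecules that pass such a filter would then be handed to the clustering step, which the abstract describes only as unsupervised machine learning based on functional group allocation.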
Full text: 1 Database: MEDLINE Language: En Publication year: 2024 Document type: Article
