RESUMO
Molecular Property Diagnostic Suite (MPDS) was conceived and developed as an open-source disease-specific web portal based on Galaxy. MPDSCOVID-19 was developed for COVID-19 as a one-stop solution for drug discovery research. Galaxy platforms enable the creation of customized workflows connecting various modules in the web server. The architecture of MPDSCOVID-19 effectively employs Galaxy v22.04 features, which are ported on CentOS 7.8 and Python 3.7. MPDSCOVID-19 provides significant updates and the addition of several new tools updated after six years. Tools developed by our group in Perl/Python and open-source tools are collated and integrated into MPDSCOVID-19 using XML scripts. Our MPDS suite aims to facilitate transparent and open innovation. This approach significantly helps bring inclusiveness in the community while promoting free access and participation in software development. Availability & Implementation: The MPDSCOVID-19 portal can be accessed at https://mpds.neist.res.in:8085/.
RESUMO
The cation-aromatic database (CAD) is a comprehensive repository of cation-aromatic motifs found in experimentally determined protein structures, first reported in 2007 [Proteins, 2007, 67, 1179]. The present article is an update of CAD that contains information of approximately 27.26 million cation-aromatic motifs. CAD uses three distance parameters (r, d1, and d2) to determine the position of the cation relative to the centroid of the aromatic residue and classifies the motifs as cation-π or cation-σ interactions. As of June 2023, about 193 936 protein structures were retrieved from Protein Data Bank, and this resulted in the identification of an impressive number of 27 255 817 cation-aromatic motifs. Among these motifs, spherical motifs constituted 94.09%, while cylindrical motifs made up the remaining 5.91%. When considering the interaction of metal ions with aromatic residues, 965 564 motifs are identified. Remarkably, 82.08% of these motifs involved the binding of metal ions to the amino acid HIS. Moreover, the analysis of binding preferences between cations and aromatic residues revealed that the HIS-HIS, PHE-ARG, and TRP-ARG pairs exhibited a preferential geometry. The motif pair HIS-HIS was the most prevalent, accounting for 19.87% of the total, closely followed by TYR-LYS at 10.17%. Conversely, the motif pair TRP-HIS had the lowest occurrence, representing only 4.20% of the total. The data generated help in revealing the characteristics and biological functions of cation-aromatic interactions in biological molecules. The updated version of CAD (Cation-Aromatic Database V2.0) can be accessed at https://acds.neist.res.in/cadv2.
Assuntos
Aminoácidos , Proteínas , Aminoácidos/química , Cátions/química , MetaisRESUMO
Molecular Property Diagnostic Suite Compound Library (MPDS-CL) is an open-source Galaxy-based cheminformatics web portal which presents a structure-based classification of the molecules. A structure-based classification of nearly 150 million unique compounds, obtained from 42 publicly available databases and curated for redundancy removal through 97 hierarchically well-defined atom composition-based portions, has been done. These are further subjected to 56-bit fingerprint-based classification algorithm which led to the formation of 56 structurally well-defined classes. The classes thus obtained were further divided into clusters based on their molecular weight. Thus, the entire set of molecules was put into 56 different classes and 625 clusters. This led to the assignment of a unique ID, named as MPDS-AadharID, for each of these 149,169,443 molecules. MPDS-AadharID is akin to the unique number given to citizens in India (similar to SSN in the US and NINO in the UK). The unique features of MPDS-CL are (a) several search options, such as exact structure search, substructure search, property-based search, fingerprint-based search, using SMILES, InChIKey and key-in; (b) automatic generation of information for the processing for MPDS and other galaxy tools; (c) providing the class and cluster of a molecule which makes it easier and fast to search for similar molecules and (d) information related to the presence of the molecules in multiple databases. The MPDS-CL can be accessed at https://mpds.neist.res.in:8086/ .
RESUMO
The Aromatic-Aromatic Interactions Database (A2ID) is a comprehensive repository dedicated to documenting aromatic-aromatic (π-π) networks observed in experimentally determined protein structures. The first version of A2ID was reported in 2011 [Int J Biol Macromol, 2011, 48, 540]. It has undergone a series of significant updates, leading to its current version, which focuses on the identification and analysis of 3,444,619 π-π networks from proteins. The geometrical parameters such as centroid-centroid distances (r) and interplanar angles (Ï) were used to identify and characterize π-π networks. It was observed that among the 84,500 proteins with at least one aromatic π-π network, about 92.50 % of the instances are found to be either 2π (77.34 %) or 3π (15.23 %) networks. The analysis of interacting amino acid pairs in 2π networks indicated a dominance of PHE residues followed by TYR. The updated version of A2ID incorporates analysis of π-π networks based on SCOP2 and ECOD classifiers, in addition to the existing SCOP, CATH, and EC classifications. This expanded scope allows researchers to explore the characteristics and functional implications of π-π networks in protein structures from multiple perspectives. The current version of A2ID along with its extensive dataset and detailed geometric information is publicly accessible using https://acds.neist.res.in/a2idv2.