Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 10 de 10
Filtrar
Mais filtros











Base de dados
Intervalo de ano de publicação
1.
J Chem Inf Model ; 63(7): 1882-1893, 2023 04 10.
Artigo em Inglês | MEDLINE | ID: mdl-36971750

RESUMO

Drug-induced gene expression profiling provides a lot of useful information covering various aspects of drug discovery and development. Most importantly, this knowledge can be used to discover drugs' mechanisms of action. Recently, deep learning-based drug design methods are in the spotlight due to their ability to explore huge chemical space and design property-optimized target-specific drug molecules. Recent advances in accessibility of open-source drug-induced transcriptomic data along with the ability of deep learning algorithms to understand hidden patterns have opened opportunities for designing drug molecules based on desired gene expression signatures. In this study, we propose a deep learning model, Gex2SGen (Gene Expression 2 SMILES Generation), to generate novel drug-like molecules based on desired gene expression profiles. The model accepts desired gene expression profiles in a cell-specific manner as input and designs drug-like molecules which can elicit the required transcriptomic profile. The model was first tested against individual gene-knocked-out transcriptomic profiles, where the newly designed molecules showed high similarity with known inhibitors of the knocked-out target genes. The model was next applied on a triple negative breast cancer signature profile, where it could generate novel molecules, highly similar to known anti-breast cancer drugs. Overall, this work provides a generalized method, where the method first learned the molecular signature of a given cell due to a specific condition, and designs new small molecules with drug-like properties.


Assuntos
Descoberta de Drogas , Transcriptoma , Perfilação da Expressão Gênica , Algoritmos
2.
Molecules ; 27(2)2022 Jan 10.
Artigo em Inglês | MEDLINE | ID: mdl-35056738

RESUMO

Ankyrin is one of the most abundant protein repeat families found across all forms of life. It is found in a variety of multi-domain and single domain proteins in humans with diverse number of repeating units. They are observed to occur in several functionally diverse proteins, such as transcriptional initiators, cell cycle regulators, cytoskeletal organizers, ion transporters, signal transducers, developmental regulators, and toxins, and, consequently, defects in ankyrin repeat proteins have been associated with a number of human diseases. In this study, we have classified the human ankyrin proteins into clusters based on the sequence similarity in their ankyrin repeat domains. We analyzed the amino acid compositional bias and consensus ankyrin motif sequence of the clusters to understand the diversity of the human ankyrin proteins. We carried out network-based structural analysis of human ankyrin proteins across different clusters and showed the association of conserved residues with topologically important residues identified by network centrality measures. The analysis of conserved and structurally important residues helps in understanding their role in structural stability and function of these proteins. In this paper, we also discuss the significance of these conserved residues in disease association across the human ankyrin protein clusters.


Assuntos
Repetição de Anquirina , Anquirinas/química , Bases de Dados de Proteínas , Humanos
3.
Protein Sci ; 31(1): 23-36, 2022 01.
Artigo em Inglês | MEDLINE | ID: mdl-33641184

RESUMO

Recent interest in repeat proteins has arisen due to stable structural folds, high evolutionary conservation and repertoire of functions provided by these proteins. However, repeat proteins are poorly characterized because of high sequence variation between repeating units and structure-based identification and classification of repeats is desirable. Using a robust network-based pipeline, manual curation and Kajava's structure-based classification schema, we have developed a database of tandem structural repeats, Database of Structural Repeats in Proteins (DbStRiPs). A unique feature of this database is that available knowledge on sequence repeat families is incorporated by mapping Pfam classification scheme onto structural classification. Integration of sequence and structure-based classifications help in identifying different functional groups within the same structural subclass, leading to refinement in the annotation of repeat proteins. Analysis of complete Protein Data Bank revealed 16,472 repeat annotations in 15,141 protein chains, one previously uncharacterized novel protein repeat family (PRF), named left-handed beta helix, and 33 protein repeat clusters (PRCs). Based on their unique structural motif, ~79% of these repeat proteins are classified in one of the 14 PRFs or 33 PRCs, and the remaining are grouped as unclassified repeat proteins. Each repeat protein is provided with a detailed annotation in DbStRiPs that includes start and end boundaries of repeating units, copy number, secondary and tertiary structure view, repeat class/subclass, disease association, MSA of repeating units and cross-references to various protein pattern databases, human protein atlas and interaction resources. DbStRiPs provides easy search and download options to high-quality annotations of structural repeat proteins (URL: http://bioinf.iiit.ac.in/dbstrips/).


Assuntos
Algoritmos , Bases de Dados de Proteínas , Modelos Moleculares , Sequências Repetitivas de Aminoácidos , Software
4.
IEEE/ACM Trans Comput Biol Bioinform ; 18(4): 1271-1280, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-33891554

RESUMO

COVID-19 is a highly contagious disease caused by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). The case-fatality rate is significantly higher in older patients and those with diabetes, cancer or cardiovascular disorders. The human proteins, angiotensin-converting enzyme 2 (ACE2), transmembrane protease serine 2 (TMPRSS2) and basigin (BSG), are involved in high-confidence host-pathogen interactions with SARS-CoV-2 proteins. We considered these three proteins as seed nodes and applied the random walk with restart method on the human interactome to construct a protein-protein interaction sub-network, which captures the effects of viral invasion. We found that 'Insulin resistance', 'AGE-RAGE signaling in diabetic complications' and 'adipocytokine signaling' were the common pathways associated with diabetes, cancer and cardiovascular disorders. The association of these critical pathways with aging and its related diseases explains the molecular basis of COVID-19 fatality. We further identified drugs that have effects on these proteins/pathways based on gene expression studies. We particularly focused on drugs that significantly downregulate ACE2 along with other critical proteins identified by the network-based approach. Among them, COL-3 had earlier shown activity against acute lung injury and acute respiratory distress, while entinostat and mocetinostat have been investigated for non-small-cell lung cancer. We propose that these drugs can be repurposed for COVID-19.


Assuntos
COVID-19/mortalidade , SARS-CoV-2 , Enzima de Conversão de Angiotensina 2/antagonistas & inibidores , Enzima de Conversão de Angiotensina 2/genética , Antivirais/uso terapêutico , COVID-19/epidemiologia , COVID-19/terapia , Doenças Cardiovasculares/epidemiologia , Comorbidade , Biologia Computacional , Reposicionamento de Medicamentos , Gastroenteropatias/epidemiologia , Perfilação da Expressão Gênica/estatística & dados numéricos , Interações entre Hospedeiro e Microrganismos/efeitos dos fármacos , Interações entre Hospedeiro e Microrganismos/genética , Interações entre Hospedeiro e Microrganismos/fisiologia , Humanos , Pandemias , Mapas de Interação de Proteínas/efeitos dos fármacos , Doenças Respiratórias/epidemiologia , SARS-CoV-2/efeitos dos fármacos , SARS-CoV-2/patogenicidade , SARS-CoV-2/fisiologia , Tratamento Farmacológico da COVID-19
5.
J Mol Graph Model ; 99: 107641, 2020 09.
Artigo em Inglês | MEDLINE | ID: mdl-32619952

RESUMO

Hydroxymethylbilane synthase (HMBS) is one of the key enzymes of the heme biosynthetic pathway that catalyzes porphobilinogen to form the linear tetrapyrrole 1-hydroxymethylbilane through four intermediate steps. Mutations in the human HMBS (hHMBS) can lead to acute intermittent porphyria (AIP), a lethal metabolic disorder. The molecular basis of importance of the amino acid residues at the catalytic site of hHMBS has been well studied. However, the role of non-active site residues toward the activity of the enzyme and hence the association of their mutations with AIP is not known. Network-based analyses of protein structures provide a systems approach to understand the correlations of the residues through a series of inter-residue interactions. We analyzed the dynamic network representation of HMBS protein derived from five molecular dynamics trajectories corresponding to the five steps of pyrrole polymerization. We analyzed the network clusters for each stage and identified the amino acid residues and interactions responsible for the structural stability and catalytic function of the protein. The analysis of high betweenness nodes and interaction paths from the active site help in understanding the molecular basis of the effect of non-active site AIP-causing mutations on the catalytic activity.


Assuntos
Hidroximetilbilano Sintase , Porfiria Aguda Intermitente , Humanos , Hidroximetilbilano Sintase/genética , Hidroximetilbilano Sintase/metabolismo , Simulação de Dinâmica Molecular , Mutação , Pirróis
6.
J Biosci ; 452020.
Artigo em Inglês | MEDLINE | ID: mdl-32713858

RESUMO

Tandemly repeated structural motifs in proteins form highly stable structural folds and provide multiple binding sites associated with diverse functional roles. The tertiary structure and function of these proteins are determined by the type and copy number of the repeating units. Each repeat type exhibits a unique pattern of intra- and inter-repeat unit interactions that is well-captured by the topological features in the network representation of protein structures. Here we present an improved version of our graph based algorithm, PRIGSA, with structure-based validation and filtering steps incorporated for accurate detection of tandem structural repeats. The algorithm integrates available knowledge on repeat families with de novo prediction to detect repeats in single monomer chains as well as in multimeric protein complexes. Three levels of performance evaluation are presented: comparison with state-of-the-art algorithms on benchmark dataset of repeat and nonrepeat proteins, accuracy in the detection of members of 13 known repeat families reported in UniProt and execution on the complete Protein Data Bank to show its ability to identify previously uncharacterized proteins. A ~3-fold increase in the coverage of the members of 13 known families and 3408 novel uncharacterized structural repeat proteins are identified on executing it on PDB. PRIGSA2 is available at http:// bioinf.iiit.ac.in/PRIGSA2/.


Assuntos
Proteínas/isolamento & purificação , Sequências Repetitivas de Aminoácidos/genética , Software , Algoritmos , Sítios de Ligação/genética , Simulação por Computador , Bases de Dados de Proteínas , Humanos , Modelos Moleculares , Proteínas/genética
7.
Nucleic Acids Res ; 47(W1): W462-W470, 2019 07 02.
Artigo em Inglês | MEDLINE | ID: mdl-31106363

RESUMO

Network theory is now a method of choice to gain insights in understanding protein structure, folding and function. In combination with molecular dynamics (MD) simulations, it is an invaluable tool with widespread applications such as analyzing subtle conformational changes and flexibility regions in proteins, dynamic correlation analysis across distant regions for allosteric communications, in drug design to reveal alternative binding pockets for drugs, etc. Updated version of NAPS now facilitates network analysis of the complete repertoire of these biomolecules, i.e., proteins, protein-protein/nucleic acid complexes, MD trajectories, and RNA. Various options provided for analysis of MD trajectories include individual network construction and analysis of intermediate time-steps, comparative analysis of these networks, construction and analysis of average network of the ensemble of trajectories and dynamic cross-correlations. For protein-nucleic acid complexes, networks of the whole complex as well as that of the interface can be constructed and analyzed. For analysis of proteins, protein-protein complexes and MD trajectories, network construction based on inter-residue interaction energies with realistic edge-weights obtained from standard force fields is provided to capture the atomistic details. Updated version of NAPS also provides improved visualization features, interactive plots and bulk execution. URL: http://bioinf.iiit.ac.in/NAPS/.


Assuntos
Proteínas de Ligação a DNA/química , DNA/química , Simulação de Dinâmica Molecular , Proteínas de Ligação a RNA/química , RNA/química , Software , Complexos Multiproteicos/química , Conformação de Ácido Nucleico , Conformação Proteica
8.
Nucleic Acids Res ; 44(W1): W375-82, 2016 Jul 08.
Artigo em Inglês | MEDLINE | ID: mdl-27151201

RESUMO

Traditionally, protein structures have been analysed by the secondary structure architecture and fold arrangement. An alternative approach that has shown promise is modelling proteins as a network of non-covalent interactions between amino acid residues. The network representation of proteins provide a systems approach to topological analysis of complex three-dimensional structures irrespective of secondary structure and fold type and provide insights into structure-function relationship. We have developed a web server for network based analysis of protein structures, NAPS, that facilitates quantitative and qualitative (visual) analysis of residue-residue interactions in: single chains, protein complex, modelled protein structures and trajectories (e.g. from molecular dynamics simulations). The user can specify atom type for network construction, distance range (in Å) and minimal amino acid separation along the sequence. NAPS provides users selection of node(s) and its neighbourhood based on centrality measures, physicochemical properties of amino acids or cluster of well-connected residues (k-cliques) for further analysis. Visual analysis of interacting domains and protein chains, and shortest path lengths between pair of residues are additional features that aid in functional analysis. NAPS support various analyses and visualization views for identifying functional residues, provide insight into mechanisms of protein folding, domain-domain and protein-protein interactions for understanding communication within and between proteins. URL:http://bioinf.iiit.ac.in/NAPS/.


Assuntos
Aminoácidos/química , Internet , Modelos Moleculares , Domínios Proteicos , Proteínas/química , Software , Simulação de Dinâmica Molecular , Dobramento de Proteína , Mapeamento de Interação de Proteínas
9.
BMC Bioinformatics ; 15: 6599, 2014 Dec 30.
Artigo em Inglês | MEDLINE | ID: mdl-25547411

RESUMO

BACKGROUND: Tandem repetition of structural motifs in proteins is frequently observed across all forms of life. Topology of repeating unit and its frequency of occurrence are associated to a wide range of structural and functional roles in diverse proteins, and defects in repeat proteins have been associated with a number of diseases. It is thus desirable to accurately identify specific repeat type and its copy number. Weak evolutionary constraints on repeat units and insertions/deletions between them make their identification difficult at the sequence level and structure based approaches are desired. The proposed graph spectral approach is based on protein structure represented as a graph for detecting one of the most frequently observed structural repeats, Ankyrin repeat. RESULTS: It has been shown in a large number of studies that 3-dimensional topology of a protein structure is well captured by a graph, making it possible to analyze a complex protein structure as a mathematical entity. In this study we show that eigen spectra profile of a protein structure graph exhibits a unique repetitive profile for contiguous repeating units enabling the detection of the repeat region and the repeat type. The proposed approach uses a non-redundant set of 58 Ankyrin proteins to define rules for the detection of Ankyrin repeat motifs. It is evaluated on a set of 370 proteins comprising 125 known Ankyrin proteins and remaining non-solenoid proteins and the prediction compared with UniProt annotation, sequence-based approach, RADAR, and structure-based approach, ConSole. To show the efficacy of the approach, we analyzed the complete PDB structural database and identified 641 previously unrecognized Ankyrin repeat proteins. We observe a unique eigen spectra profile for different repeat types and show that the method can be easily extended to detect other repeat types. It is implemented as a web server, AnkPred. It is freely available at 'bioinf.iiit.ac.in/AnkPred'. CONCLUSIONS: AnkPred provides an elegant and computationally efficient graph-based approach for detecting Ankyrin structural repeats in proteins. By analyzing the eigen spectra of the protein structure graph and secondary structure information, characteristic features of a known repeat family are identified. This method is especially useful in correctly identifying new members of a repeat family.


Assuntos
Algoritmos , Repetição de Anquirina , Anquirinas/química , Conformação Proteica , Sequência de Aminoácidos , Simulação por Computador , Humanos , Modelos Moleculares , Dados de Sequência Molecular , Homologia de Sequência de Aminoácidos
10.
J Bioinform Comput Biol ; 12(6): 1442009, 2014 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-25385083

RESUMO

Repetition of a structural motif within protein is associated with a wide range of structural and functional roles. In most cases the repeating units are well conserved at the structural level while at the sequence level, they are mostly undetectable suggesting the need for structure-based methods. Since most known methods require a training dataset, de novo approach is desirable. Here, we propose an efficient graph-based approach for detecting structural repeats in proteins. In a protein structure represented as a graph, interactions between inter- and intra-repeat units are well captured by the eigen spectra of adjacency matrix of the graph. These conserved interactions give rise to similar connections and a unique profile of the principal eigen spectra for each repeating unit. The efficacy of the approach is shown on eight repeat families annotated in UniProt, comprising of both solenoid and nonsolenoid repeats with varied secondary structure architecture and repeat lengths. The performance of the approach is also tested on other known benchmark datasets and the performance compared with two repeat identification methods. For a known repeat type, the algorithm also identifies the type of repeat present in the protein. A web tool implementing the algorithm is available at the URL http://bioinf.iiit.ac.in/PRIGSA/.


Assuntos
Algoritmos , Modelos Químicos , Modelos Moleculares , Proteínas/química , Proteínas/ultraestrutura , Análise de Sequência de Proteína/métodos , Simulação por Computador , Dados de Sequência Molecular , Dobramento de Proteína , Estrutura Secundária de Proteína , Sequências Repetitivas de Aminoácidos , Software
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA