Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 14 de 14
Filtrar
1.
Nucleic Acids Res ; 46(12): e72, 2018 07 06.
Artigo em Inglês | MEDLINE | ID: mdl-29617876

RESUMO

Identifying transcription factor (TF) binding sites (TFBSs) is important in the computational inference of gene regulation. Widely used computational methods of TFBS prediction based on position weight matrices (PWMs) usually have high false positive rates. Moreover, computational studies of transcription regulation in eukaryotes frequently require numerous PWM models of TFBSs due to a large number of TFs involved. To overcome these problems we developed DRAF, a novel method for TFBS prediction that requires only 14 prediction models for 232 human TFs, while at the same time significantly improves prediction accuracy. DRAF models use more features than PWM models, as they combine information from TFBS sequences and physicochemical properties of TF DNA-binding domains into machine learning models. Evaluation of DRAF on 98 human ChIP-seq datasets shows on average 1.54-, 1.96- and 5.19-fold reduction of false positives at the same sensitivities compared to models from HOCOMOCO, TRANSFAC and DeepBind, respectively. This observation suggests that one can efficiently replace the PWM models for TFBS prediction by a small number of DRAF models that significantly improve prediction accuracy. The DRAF method is implemented in a web tool and in a stand-alone software freely available at http://cbrc.kaust.edu.sa/DRAF.


Assuntos
Análise de Sequência de DNA/métodos , Fatores de Transcrição/metabolismo , Sítios de Ligação , Imunoprecipitação da Cromatina , DNA/química , DNA/metabolismo , Humanos , Aprendizado de Máquina , Matrizes de Pontuação de Posição Específica
2.
Nucleic Acids Res ; 44(D1): D624-33, 2016 Jan 04.
Artigo em Inglês | MEDLINE | ID: mdl-26546514

RESUMO

Microorganisms produce an enormous variety of chemical compounds. It is of general interest for microbiology and biotechnology researchers to have means to explore information about molecular and genetic basis of functioning of different microorganisms and their ability for bioproduction. To enable such exploration, we compiled 45 topic-specific knowledgebases (KBs) accessible through DESM portal (www.cbrc.kaust.edu.sa/desm). The KBs contain information derived through text-mining of PubMed information and complemented by information data-mined from various other resources (e.g. ChEBI, Entrez Gene, GO, KOBAS, KEGG, UniPathways, BioGrid). All PubMed records were indexed using 4,538,278 concepts from 29 dictionaries, with 1 638 986 records utilized in KBs. Concepts used are normalized whenever possible. Most of the KBs focus on a particular type of microbial activity, such as production of biocatalysts or nutraceuticals. Others are focused on specific categories of microorganisms, e.g. streptomyces or cyanobacteria. KBs are all structured in a uniform manner and have a standardized user interface. Information exploration is enabled through various searches. Users can explore statistically most significant concepts or pairs of concepts, generate hypotheses, create interactive networks of associated concepts and export results. We believe DESM will be a useful complement to the existing resources to benefit microbiology and biotechnology research.


Assuntos
Bases de Dados Factuais , Microbiologia Industrial , Antituberculosos/farmacologia , Archaea/genética , Archaea/metabolismo , Bactérias/genética , Bactérias/metabolismo , Mineração de Dados , Dicionários como Assunto , Reposicionamento de Medicamentos , Fungos/genética , Fungos/metabolismo , Humanos , Internet , Bases de Conhecimento , Vírus/genética , Vírus/metabolismo , Vocabulário Controlado
3.
BMC Genomics ; 18(1): 33, 2017 01 05.
Artigo em Inglês | MEDLINE | ID: mdl-28056772

RESUMO

BACKGROUND: Finding a source from which high-energy-density biofuels can be derived at an industrial scale has become an urgent challenge for renewable energy production. Some microorganisms can produce free fatty acids (FFA) as precursors towards such high-energy-density biofuels. In particular, photosynthetic cyanobacteria are capable of directly converting carbon dioxide into FFA. However, current engineered strains need several rounds of engineering to reach the level of production of FFA to be commercially viable; thus new chassis strains that require less engineering are needed. Although more than 120 cyanobacterial genomes are sequenced, the natural potential of these strains for FFA production and excretion has not been systematically estimated. RESULTS: Here we present the FFA SC (FFASC), an in silico screening method that evaluates the potential for FFA production and excretion of cyanobacterial strains based on their proteomes. A literature search allowed for the compilation of 64 proteins, most of which influence FFA production and a few of which affect FFA excretion. The proteins are classified into 49 orthologous groups (OGs) that helped create rules used in the scoring/ranking of algorithms developed to estimate the potential for FFA production and excretion of an organism. Among 125 cyanobacterial strains, FFASC identified 20 candidate chassis strains that rank in their FFA producing and excreting potential above the specifically engineered reference strain, Synechococcus sp. PCC 7002. We further show that the top ranked cyanobacterial strains are unicellular and primarily include Prochlorococcus (order Prochlorales) and marine Synechococcus (order Chroococcales) that cluster phylogenetically. Moreover, two principal categories of enzymes were shown to influence FFA production the most: those ensuring precursor availability for the biosynthesis of lipids, and those involved in handling the oxidative stress associated to FFA synthesis. CONCLUSION: To our knowledge FFASC is the first in silico method to screen cyanobacteria proteomes for their potential to produce and excrete FFA, as well as the first attempt to parameterize the criteria derived from genetic characteristics that are favorable/non-favorable for this purpose. Thus, FFASC helps focus experimental evaluation only on the most promising cyanobacteria.


Assuntos
Biologia Computacional/métodos , Cianobactérias/genética , Cianobactérias/metabolismo , Ácidos Graxos não Esterificados/biossíntese , Algoritmos , Análise por Conglomerados , Simulação por Computador , Cianobactérias/classificação , Redes e Vias Metabólicas , Fotossíntese , Filogenia , Proteoma , Proteômica/métodos
4.
Artigo em Inglês | MEDLINE | ID: mdl-35239491

RESUMO

BACKGROUND & OBJECTIVE: Genomic medicine stands to be revolutionized by understanding single nucleotide variants (SNVs) and their expression in single-gene disorders (Mendelian diseases). Computational tools can play a vital role in the exploration of such variations and their pathogenicity. Consequently, we developed the ensemble prediction tool AllelePred to identify deleterious SNVs and disease causative genes. RESULTS: The model utilizes different population genetics backgrounds and restricted criteria for features selection to help generate high accuracy results. In comparison to other tools, such as Eigen, PROVEAN, and fathmm-MKL our classifier achieves higher accuracy (98%), precision (96%), F1 score (93%), and coverage (100%) for different types of coding variants. The new method was also compared against a bioinformatics analytical workflow, which uses gnomAD overall AFs (less than 1%) and CADD (scaled C-score of at least 15). Furthermore, this research highlights the stature of genetic variant sharing and curation. We accumulated a list of highly probable deleterious variants and recommended further experimental validation before medical diagnostic usage. CONCLUSIONS: The ensemble prediction tool AllelePred enables increased accuracy in recognizing deleterious SNVs and the genetic determinants in real clinical data.


Assuntos
Biologia Computacional , Nucleotídeos , Frequência do Gene , Biologia Computacional/métodos , Polimorfismo de Nucleotídeo Único/genética
5.
Sci Rep ; 13(1): 21114, 2023 11 30.
Artigo em Inglês | MEDLINE | ID: mdl-38036622

RESUMO

Circulating tumor cells (CTCs) are cancer cells that detach from the primary tumor and intravasate into the bloodstream. Thus, non-invasive liquid biopsies are being used to analyze CTC-expressed genes to identify potential cancer biomarkers. In this regard, several studies have used gene expression changes in blood to predict the presence of CTC and, consequently, cancer. However, the CTC mRNA data has not been used to develop a generic approach that indicates the presence of multiple cancer types. In this study, we developed such a generic approach. Briefly, we designed two computational workflows, one using the raw mRNA data and deep learning (DL) and the other exploiting five hub gene ranking algorithms (Degree, Maximum Neighborhood Component, Betweenness Centrality, Closeness Centrality, and Stress Centrality) with machine learning (ML). Both workflows aim to determine the top genes that best distinguish cancer types based on the CTC mRNA data. We demonstrate that our automated, robust DL framework (DNNraw) more accurately indicates the presence of multiple cancer types using the CTC gene expression data than multiple ML approaches. The DL approach achieved average precision of 0.9652, recall of 0.9640, f1-score of 0.9638 and overall accuracy of 0.9640. Furthermore, since we designed multiple approaches, we also provide a bioinformatics analysis of the gene commonly identified as top-ranked by the different methods. To our knowledge, this is the first study wherein a generic approach has been developed to predict the presence of multiple cancer types using raw CTC mRNA data, as opposed to other models that require a feature selection step.


Assuntos
Aprendizado Profundo , Células Neoplásicas Circulantes , Humanos , Células Neoplásicas Circulantes/patologia , Prognóstico , Biomarcadores Tumorais/genética , Biomarcadores Tumorais/metabolismo , RNA Mensageiro/genética
6.
Artigo em Inglês | MEDLINE | ID: mdl-33079651

RESUMO

Pseudomonas genus is among the top nosocomial pathogens known to date. Being highly opportunistic, members of pseudomonas genus are most commonly connected with nosocomial infections of urinary tract and ventilator-associated pneumonia. Nevertheless, vaccine development for this pathogenic genus is slow because of no information regarding immunity correlated functional mechanism. In this present work, an immunoinformatics pipeline is used for vaccine development based on epitope-based peptide design, which can result in crucial immune response against outer membrane proteins of pseudomonas genus. A total of 127 outer membrane proteins were analysed, studied and out of them three sequences were obtained to be the producer of non-allergic, highly antigenic T-cell and B-cell epitopes which show good binding affinity towards class II HLA molecules. After performing rigorous screening utilizing docking, simulation, modelling techniques, we had one nonameric peptide (WLLATGIFL)as a good vaccine candidate. The predicted epitopes needs to be further validated for its apt use as vaccine. This work paves a new way with extensive therapeutic application against Pseudomonas genus and their associated diseases.


Assuntos
Epitopos de Linfócito T , Proteínas de Membrana , Epitopos de Linfócito B/química , Epitopos de Linfócito T/química , Epitopos de Linfócito T/metabolismo , Humanos , Simulação de Acoplamento Molecular , Peptídeos , Pseudomonas/metabolismo , Desenvolvimento de Vacinas , Vacinas de Subunidades Antigênicas/química
7.
PLoS One ; 17(7): e0271737, 2022.
Artigo em Inglês | MEDLINE | ID: mdl-35877764

RESUMO

More than 30 types of amyloids are linked to close to 50 diseases in humans, the most prominent being Alzheimer's disease (AD). AD is brain-related local amyloidosis, while another amyloidosis, such as AA amyloidosis, tends to be more systemic. Therefore, we need to know more about the biological entities' influencing these amyloidosis processes. However, there is currently no support system developed specifically to handle this extraordinarily complex and demanding task. To acquire a systematic view of amyloidosis and how this may be relevant to the brain and other organs, we needed a means to explore "amyloid network systems" that may underly processes that leads to an amyloid-related disease. In this regard, we developed the DES-Amyloidoses knowledgebase (KB) to obtain fast and relevant information regarding the biological network related to amyloid proteins/peptides and amyloid-related diseases. This KB contains information obtained through text and data mining of available scientific literature and other public repositories. The information compiled into the DES-Amyloidoses system based on 19 topic-specific dictionaries resulted in 796,409 associations between terms from these dictionaries. Users can explore this information through various options, including enriched concepts, enriched pairs, and semantic similarity. We show the usefulness of the KB using an example focused on inflammasome-amyloid associations. To our knowledge, this is the only KB dedicated to human amyloid-related diseases derived primarily through literature text mining and complemented by data mining that provides a novel way of exploring information relevant to amyloidoses.


Assuntos
Doença de Alzheimer , Amiloidose , Amiloide , Humanos , Bases de Conhecimento , Proteína Amiloide A Sérica
8.
J Biomol Struct Dyn ; 39(17): 6761-6771, 2021 10.
Artigo em Inglês | MEDLINE | ID: mdl-32762537

RESUMO

The ongoing pandemic COVID-19 (COrona Virus Immuno Deficiency-2019) which is caused by SARS-CoV-2 (Severe Acute Respiratory Syndrome-CoronaVirus-2) has emerged as a pandemic with 400,000 plus deaths till date. We do not have any drug or vaccine available for the inhibition of this deadly virus. The expedition for searching a potential drug or vaccine against COVID-19 will be of massive potential and favor. This study is focused on finding an effective natural origin compound which can put a check on the activity of this virus. We chose important proteins from the SARS-CoV-2 genome such as NSP4, NSP15 and RdRp along-with the human ACE2 receptor which is the first point of contact with the virus. Virtual screening and followed up molecular docking resulted in Baicalin and Limonin as the final lead molecules. 200 ns of MD simulation for each protein-ligand complex provides the insights that Baicalin has a potential to target NSP4, NSP15 and RdRp proteins. Limonin which is largely used in traditional Indian medicine system is found to inhibit the human ACE2 receptor (making it inefficient in binding to the receptor binding domain of SARS-CoV-2). Our studies propose Baicalin and Limonin in combination to be studied in vitro and in vivo against COVID-19.Communicated by Ramaswamy H. Sarma.


Assuntos
COVID-19 , Simulação de Dinâmica Molecular , Vacinas contra COVID-19 , Humanos , Simulação de Acoplamento Molecular , SARS-CoV-2
9.
J Mol Model ; 27(6): 160, 2021 May 08.
Artigo em Inglês | MEDLINE | ID: mdl-33963942

RESUMO

Coronavirus infectious disease 2019 (COVID-19), a viral infection caused by a novel coronavirus (nCoV), continues to emerge as a serious threat to public health. This pandemic caused by SARS-CoV-2 (severe acute respiratory syndrome-coronavirus-2) has infected globally with 1,550,000 plus deaths to date, representing a high risk to public health. No effective drug or vaccine is available to curb down this deadly virus. The expedition for searching for a potential drug or vaccine against COVID-19 is of massive potential and favour to the community. This study is focused on finding an effective natural compound that can be processed further into a potential inhibitor to check the activity of SARS-CoV-2 with minimal side effects targeting NSP15 protein, which belongs to the EndoU enzyme family. The natural screening suggested two efficient compounds (PubChem ID: 95372568 and 1776037) with dihydroxyphenyl region of the compound, found to be important in the interaction with the viral protein showing promising activity which may act as a potent lead inhibitory molecule against the virus. In combination with virtual screening, modelling, drug likeliness, molecular docking, and 500 ns cumulative molecular dynamics simulations (100 ns for each complex) along with the decomposition analysis to calculate and confirm the stability and fold, we propose 95372568 and 1776037 as novel compounds of natural origin capable of getting developed into potent lead molecules against SARS-CoV-2 target protein NSP15.


Assuntos
Antivirais/química , Produtos Biológicos/química , Tratamento Farmacológico da COVID-19 , Biologia Computacional , Endorribonucleases , Simulação de Acoplamento Molecular , Simulação de Dinâmica Molecular , SARS-CoV-2/química , Proteínas não Estruturais Virais , Antivirais/uso terapêutico , Produtos Biológicos/uso terapêutico , Endorribonucleases/química , Humanos , Proteínas não Estruturais Virais/química
10.
J Mol Model ; 26(12): 338, 2020 Nov 11.
Artigo em Inglês | MEDLINE | ID: mdl-33175236

RESUMO

A novel coronavirus (SARS-CoV-2) identified in Wuhan state of China in 2019 is the causative agent of deadly disease COVID-19. It has spread across the globe (more than 210 countries) within a short period. Coronaviruses pose serious health threats to both humans and animals. A recent publication reported an experimental 3D complex structure of the S protein of SARS-CoV-2 showed that the ectodomain of the SARS-CoV-2 S protein binds to the peptidase domain (PD) of human ACE2 with a dissociation constant (Kd) of ~ 15 nM. In this study, we focused on inhibitors for ACE2: S protein complex using virtual screening and inhibition studies through molecular docking for over 200,000 natural compounds. Toxicity analysis was also performed for the best hits, and the final complex structures for four complexes were subjected to 400 ns molecular dynamics simulations for stability testing. We found two natural origin inhibitors for the S protein: human ACE2 complex (Andrographolide and Pterostilbene) which displayed better inhibition potential for ACE2 receptor and its binding with the S protein of SARS-CoV-2. Comparative studies were also performed to test and verify that these two drug candidates are also better than hydroxychloroquine which is known to inhibit this complex. However, we needed better potential drug candidates to overcome the side effects of hydroxychloroquine. Supplementary experimental studies need to be carried forward to corroborate the viability of these two new inhibitors for ACE2: S protein complex so as to curb down COVID-19.


Assuntos
Betacoronavirus/fisiologia , Infecções por Coronavirus/epidemiologia , Peptídeo Hidrolases/metabolismo , Peptidil Dipeptidase A/metabolismo , Pneumonia Viral/epidemiologia , Glicoproteína da Espícula de Coronavírus/metabolismo , Enzima de Conversão de Angiotensina 2 , Betacoronavirus/genética , COVID-19 , Infecções por Coronavirus/virologia , Reposicionamento de Medicamentos , Humanos , Simulação de Acoplamento Molecular , Simulação de Dinâmica Molecular , Pandemias , Peptídeo Hidrolases/genética , Peptidil Dipeptidase A/genética , Pneumonia Viral/virologia , Domínios Proteicos , SARS-CoV-2 , Glicoproteína da Espícula de Coronavírus/genética
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA