Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 7 de 7
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
Nucleic Acids Res ; 2024 May 06.
Artigo em Inglês | MEDLINE | ID: mdl-38709873

RESUMO

Small ubiquitin-like modifiers (SUMOs) are tiny but important protein regulators involved in orchestrating a broad spectrum of biological processes, either by covalently modifying protein substrates or by noncovalently interacting with other proteins. Here, we report an updated server, GPS-SUMO 2.0, for the prediction of SUMOylation sites and SUMO-interacting motifs (SIMs). For predictor training, we adopted three machine learning algorithms, penalized logistic regression (PLR), a deep neural network (DNN), and a transformer, and used 52 404 nonredundant SUMOylation sites in 8262 proteins and 163 SIMs in 102 proteins. To further increase the accuracy of predicting SUMOylation sites, a pretraining model was first constructed using 145 545 protein lysine modification sites, followed by transfer learning to fine-tune the model. GPS-SUMO 2.0 exhibited greater accuracy in predicting SUMOylation sites than did other existing tools. For users, one or multiple protein sequences or identifiers can be input, and the prediction results are shown in a tabular list. In addition to the basic statistics, we integrated knowledge from 35 public resources to annotate SUMOylation sites or SIMs. The GPS-SUMO 2.0 server is freely available at https://sumo.biocuckoo.cn/. We believe that GPS-SUMO 2.0 can serve as a useful tool for further analysis of SUMOylation and SUMO interactions.

2.
Nat Commun ; 15(1): 3685, 2024 May 01.
Artigo em Inglês | MEDLINE | ID: mdl-38693116

RESUMO

Sleep, locomotor and social activities are essential animal behaviors, but their reciprocal relationships and underlying mechanisms remain poorly understood. Here, we elicit information from a cutting-edge large-language model (LLM), generative pre-trained transformer (GPT) 3.5, which interprets 10.2-13.8% of Drosophila genes known to regulate the 3 behaviors. We develop an instrument for simultaneous video tracking of multiple moving objects, and conduct a genome-wide screen. We have identified 758 fly genes that regulate sleep and activities, including mre11 which regulates sleep only in the presence of conspecifics, and NELF-B which regulates sleep regardless of whether conspecifics are present. Based on LLM-reasoning, an educated signal web is modeled for understanding of potential relationships between its components, presenting comprehensive molecular signatures that control sleep, locomotor and social activities. This LLM-aided strategy may also be helpful for addressing other complex scientific questions.


Assuntos
Comportamento Animal , Drosophila melanogaster , Locomoção , Sono , Animais , Sono/fisiologia , Sono/genética , Drosophila melanogaster/genética , Drosophila melanogaster/fisiologia , Locomoção/fisiologia , Locomoção/genética , Comportamento Animal/fisiologia , Proteínas de Drosophila/genética , Proteínas de Drosophila/metabolismo , Comportamento Social , Masculino
3.
Nucleic Acids Res ; 51(W1): W243-W250, 2023 07 05.
Artigo em Inglês | MEDLINE | ID: mdl-37158278

RESUMO

Protein phosphorylation, catalyzed by protein kinases (PKs), is one of the most important post-translational modifications (PTMs), and involved in regulating almost all of biological processes. Here, we report an updated server, Group-based Prediction System (GPS) 6.0, for prediction of PK-specific phosphorylation sites (p-sites) in eukaryotes. First, we pre-trained a general model using penalized logistic regression (PLR), deep neural network (DNN), and Light Gradient Boosting Machine (LightGMB) on 490 762 non-redundant p-sites in 71 407 proteins. Then, transfer learning was conducted to obtain 577 PK-specific predictors at the group, family and single PK levels, using a well-curated data set of 30 043 known site-specific kinase-substrate relations in 7041 proteins. Together with the evolutionary information, GPS 6.0 could hierarchically predict PK-specific p-sites for 44046 PKs in 185 species. Besides the basic statistics, we also offered the knowledge from 22 public resources to annotate the prediction results, including the experimental evidence, physical interactions, sequence logos, and p-sites in sequences and 3D structures. The GPS 6.0 server is freely available at https://gps.biocuckoo.cn. We believe that GPS 6.0 could be a highly useful service for further analysis of phosphorylation.


Assuntos
Biologia Computacional , Proteínas , Software , Fosforilação , Proteínas Quinases/química , Proteínas Quinases/metabolismo , Processamento de Proteína Pós-Traducional , Proteínas/química , Proteínas/metabolismo , Biologia Computacional/instrumentação , Biologia Computacional/métodos , Internet
4.
Nucleic Acids Res ; 50(W1): W405-W411, 2022 07 05.
Artigo em Inglês | MEDLINE | ID: mdl-35670661

RESUMO

Recent high-throughput omics techniques have produced a large amount of biological data. Visualization of big omics data is essential to answer a wide range of biological problems. As a concise but comprehensive strategy, a heatmap can analyze and visualize high-dimensional and heterogeneous biomolecular expression data in an attractive artwork. In 2014, we developed a stand-alone software package, Heat map Illustrator (HemI 1.0), which implemented three clustering methods and seven distance metrics for heatmap illustration. Here, we significantly improved 1.0 and released the online service of HemI 2.0, in which 7 clustering methods and 22 types of distance metrics were implemented. In HemI 2.0, the clustering results and publication-quality heatmaps can be exported directly. For an in-depth analysis of the data, we further added an option of enrichment analysis for 12 model organisms, with 15 types of functional annotations. The enrichment results can be visualized in five idioms, including bubble chart, bar graph, coxcomb chart, pie chart and word cloud. We anticipate that HemI 2.0 can be a helpful web server for visualization of biomolecular expression data, as well as the additional enrichment analysis. HemI 2.0 is freely available for all users at: https://hemi.biocuckoo.org/.


Assuntos
Análise por Conglomerados , Análise de Dados , Visualização de Dados , Internet , Software , Big Data , Animais , Modelos Animais , Perfilação da Expressão Gênica/métodos
5.
Brief Bioinform ; 23(2)2022 03 10.
Artigo em Inglês | MEDLINE | ID: mdl-35037020

RESUMO

As an important post-translational modification, lysine ubiquitination participates in numerous biological processes and is involved in human diseases, whereas the site specificity of ubiquitination is mainly decided by ubiquitin-protein ligases (E3s). Although numerous ubiquitination predictors have been developed, computational prediction of E3-specific ubiquitination sites is still a great challenge. Here, we carefully reviewed the existing tools for the prediction of general ubiquitination sites. Also, we developed a tool named GPS-Uber for the prediction of general and E3-specific ubiquitination sites. From the literature, we manually collected 1311 experimentally identified site-specific E3-substrate relations, which were classified into different clusters based on corresponding E3s at different levels. To predict general ubiquitination sites, we integrated 10 types of sequence and structure features, as well as three types of algorithms including penalized logistic regression, deep neural network and convolutional neural network. Compared with other existing tools, the general model in GPS-Uber exhibited a highly competitive accuracy, with an area under curve values of 0.7649. Then, transfer learning was adopted for each E3 cluster to construct E3-specific models, and in total 112 individual E3-specific predictors were implemented. Using GPS-Uber, we conducted a systematic prediction of human cancer-associated ubiquitination events, which could be helpful for further experimental consideration. GPS-Uber will be regularly updated, and its online service is free for academic research at http://gpsuber.biocuckoo.cn/.


Assuntos
Lisina , Ubiquitina-Proteína Ligases , Algoritmos , Humanos , Lisina/metabolismo , Processamento de Proteína Pós-Traducional , Ubiquitina-Proteína Ligases/química , Ubiquitina-Proteína Ligases/genética , Ubiquitina-Proteína Ligases/metabolismo , Ubiquitinação
6.
Nucleic Acids Res ; 50(D1): D451-D459, 2022 01 07.
Artigo em Inglês | MEDLINE | ID: mdl-34581824

RESUMO

Here, we reported the compendium of protein lysine modifications (CPLM 4.0, http://cplm.biocuckoo.cn/), a data resource for various post-translational modifications (PTMs) specifically occurred at the side-chain amino group of lysine residues in proteins. From the literature and public databases, we collected 450 378 protein lysine modification (PLM) events, and combined them with the existing data of our previously developed protein lysine modification database (PLMD 3.0). In total, CPLM 4.0 contained 592 606 experimentally identified modification events on 463 156 unique lysine residues of 105 673 proteins for up to 29 types of PLMs across 219 species. Furthermore, we carefully annotated the data using the knowledge from 102 additional resources that covered 13 aspects, including variation and mutation, disease-associated information, protein-protein interaction, protein functional annotation, DNA & RNA element, protein structure, chemical-target relation, mRNA expression, protein expression/proteomics, subcellular localization, biological pathway annotation, functional domain annotation, and physicochemical property. Compared to PLMD 3.0 and other existing resources, CPLM 4.0 achieved a >2-fold increase in collection of PLM events, with a data volume of ∼45GB. We anticipate that CPLM 4.0 can serve as a more useful database for further study of PLMs.


Assuntos
Bases de Dados de Proteínas , Lisina/metabolismo , Processamento de Proteína Pós-Traducional , Proteínas/metabolismo , Software , Acetilação , Animais , Bactérias/genética , Bactérias/metabolismo , Biotinilação , Humanos , Hidroxilação , Internet , Lisina/química , Metilação , Modelos Moleculares , Anotação de Sequência Molecular , Mutação , Fosforilação , Plantas/genética , Plantas/metabolismo , Ligação Proteica , Conformação Proteica , Mapeamento de Interação de Proteínas , Proteínas/química , Proteínas/genética , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , Ubiquitinação
7.
Theranostics ; 11(16): 8008-8026, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-34335977

RESUMO

Rationale: Children usually develop less severe symptoms responding to Coronavirus Disease 2019 (COVID-19) than adults. However, little is known about the molecular alterations and pathogenesis of COVID-19 in children. Methods: We conducted plasma proteomic and metabolomic profilings of the blood samples of a cohort containing 18 COVID-19-children with mild symptoms and 12 healthy children, which were enrolled from hospital admissions and outpatients, respectively. Statistical analyses were performed to identify molecules specifically altered in COVID-19-children. We also developed a machine learning-based pipeline named inference of biomolecular combinations with minimal bias (iBM) to prioritize proteins and metabolites strongly altered in COVID-19-children, and experimentally validated the predictions. Results: By comparing to the multi-omic data in adults, we identified 44 proteins and 249 metabolites differentially altered in COVID-19-children against healthy children or COVID-19-adults. Further analyses demonstrated that both deteriorative immune response/inflammation processes and protective antioxidant or anti-inflammatory processes were markedly induced in COVID-19-children. Using iBM, we prioritized two combinations that contained 5 proteins and 5 metabolites, respectively, each exhibiting a total area under curve (AUC) value of 100% to accurately distinguish COVID-19-children from healthy children or COVID-19-adults. Further experiments validated that all the 5 proteins were up-regulated upon coronavirus infection. Interestingly, we found that the prioritized metabolites inhibited the expression of pro-inflammatory factors, and two of them, methylmalonic acid (MMA) and mannitol, also suppressed coronaviral replication, implying a protective role of these metabolites in COVID-19-children. Conclusion: The finding of a strong antagonism of deteriorative and protective effects provided new insights on the mechanism and pathogenesis of COVID-19 in children that mostly underwent mild symptoms. The identified metabolites strongly altered in COVID-19-children could serve as potential therapeutic agents of COVID-19.


Assuntos
COVID-19/sangue , COVID-19/virologia , Adulto , COVID-19/epidemiologia , COVID-19/imunologia , Criança , Pré-Escolar , China/epidemiologia , Feminino , Hospitalização , Humanos , Masculino , Metabolômica/métodos , Pessoa de Meia-Idade , Proteômica/métodos , SARS-CoV-2/isolamento & purificação
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...