Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 30
Filtrar
1.
Cancers (Basel) ; 16(18)2024 Sep 20.
Artículo en Inglés | MEDLINE | ID: mdl-39335178

RESUMEN

Background: The development of tumors is a highly complex process that entails numerous interactions and intricate relationships between the host immune system and cancer cells. It has been demonstrated in studies that the treatment response of patients can be correlated with the tumor microenvironment (TME). Consequently, the examination of diverse immune profiles within the TME can facilitate the elucidation of tumor development and the development of advantageous models for diagnoses and prognoses. Methods: In this study, we utilized a single-cell decomposition method to analyze the relationships between cell proportions and immune signatures in lung adenocarcinoma (LUAD) patients. Results: Our findings indicate that specific immune cell populations and immune signatures are significantly associated with patient prognosis. By identifying poor prognosis signatures (PPS), we reveal the critical role of immune profiles and cellular composition in disease outcomes, emphasizing their diagnostic potential for predicting patient prognosis. Conclusions: This study highlights the importance of immune signatures and cellular composition, which may serve as valuable biomarkers for disease prognosis in LUAD patients.

2.
iScience ; 27(9): 110718, 2024 Sep 20.
Artículo en Inglés | MEDLINE | ID: mdl-39262770

RESUMEN

The rise of antibiotic resistance necessitates effective alternative therapies. Antimicrobial peptides (AMPs) are promising due to their broad inhibitory effects. This study focuses on predicting the minimum inhibitory concentration (MIC) of AMPs against whom-priority pathogens: Staphylococcus aureus ATCC 25923, Escherichia coli ATCC 25922, and Pseudomonas aeruginosa ATCC 27853. We developed a comprehensive regression model integrating AMP sequence-based and genomic features. Using eight AI-based architectures, including deep learning with protein language model embeddings, we created an ensemble model combining bi-directional long short-term memory (BiLSTM), convolutional neural network (CNN), and multi-branch model (MBM). The ensemble model showed superior performance with Pearson correlation coefficients of 0.756, 0.781, and 0.802 for the bacterial strains, demonstrating its accuracy in predicting MIC values. This work sets a foundation for future studies to enhance model performance and advance AMP applications in combating antibiotic resistance.

3.
Life (Basel) ; 13(12)2023 Nov 30.
Artículo en Inglés | MEDLINE | ID: mdl-38137893

RESUMEN

BACKGROUND: Mobile phones, laptops, and computers have become an indispensable part of our lives in recent years. Workers may have an incorrect posture when using a computer for a prolonged period of time. Using these products with an incorrect posture can lead to neck pain. However, there are limited data on postures in real-life situations. METHODS: In this study, we used a common camera to record images of subjects carrying out three different tasks (a typing task, a gaming task, and a video-watching task) on a computer. Different artificial intelligence (AI)-based pose estimation approaches were applied to analyze the head's yaw, pitch, and roll and coordinate information of the eyes, nose, neck, and shoulders in the images. We used machine learning models such as random forest, XGBoost, logistic regression, and ensemble learning to build a model to predict whether a subject had neck pain by analyzing their posture when using the computer. RESULTS: After feature selection and adjustment of the predictive models, nested cross-validation was applied to evaluate the models and fine-tune the hyperparameters. Finally, the ensemble learning approach was utilized to construct a model via bagging, which achieved a performance with 87% accuracy, 92% precision, 80.3% recall, 95.5% specificity, and an AUROC of 0.878. CONCLUSIONS: We developed a predictive model for the identification of non-specific neck pain using 2D video images without the need for costly devices, advanced environment settings, or extra sensors. This method could provide an effective way for clinically evaluating poor posture during real-world computer usage scenarios.

4.
J Clin Endocrinol Metab ; 108(9): 2389-2399, 2023 08 18.
Artículo en Inglés | MEDLINE | ID: mdl-36810613

RESUMEN

CONTEXT: Extremely early age at menarche, also called precocious puberty, has been associated with various cardiometabolic traits, but their shared heritability remains unclear. OBJECTIVES: This work aimed to identify new shared genetic variants and their pathways for age at menarche and cardiometabolic traits and to investigate the influence of central precocious puberty on childhood cardiometabolic traits. METHODS: Using the conjunction false discovery rate method, this study analyzed genome-wide association study data from the menarche-cardiometabolic traits among 59 655 females of Taiwanese ancestry and systemically investigated pleiotropy between age at menarche and cardiometabolic traits. To support the novel hypertension link, we used the Taiwan Puberty Longitudinal Study (TPLS) to investigate the influence of precocious puberty on childhood cardiometabolic traits. RESULTS: We discovered 27 novel loci, with an overlap between age at menarche and cardiometabolic traits, including body fat and blood pressure. Among the novel genes discovered, SEC16B, CSK, CYP1A1, FTO, and USB1 are within a protein interaction network with known cardiometabolic genes, including traits for obesity and hypertension. These loci were confirmed through demonstration of significant changes in the methylation or expression levels of neighboring genes. Moreover, the TPLS provided evidence regarding a 2-fold higher risk of early-onset hypertension that occurred in girls with central precocious puberty. CONCLUSION: Our study highlights the usefulness of cross-trait analyses for identifying shared etiology between age at menarche and cardiometabolic traits, especially early-onset hypertension. The menarche-related loci may contribute to early-onset hypertension through endocrinological pathways.


Asunto(s)
Hipertensión , Pubertad Precoz , Femenino , Humanos , Niño , Menarquia/genética , Pubertad Precoz/epidemiología , Pubertad Precoz/genética , Estudios Longitudinales , Estudio de Asociación del Genoma Completo , Hipertensión/epidemiología , Hipertensión/genética , Dioxigenasa FTO Dependiente de Alfa-Cetoglutarato/genética , Hidrolasas Diéster Fosfóricas
5.
Front Oncol ; 12: 862326, 2022.
Artículo en Inglés | MEDLINE | ID: mdl-35795066

RESUMEN

Background and Purpose: Benzimidazoles have attracted much attention over the last few decades due to their broad-spectrum pharmacological properties. Increasing evidence is showing the potential use of benzimidazoles as anti-angiogenic agents, although the mechanisms that impact angiogenesis remain to be fully defined. In this study, we aim to investigate the anti-angiogenic mechanisms of MFB, a novel 2-aminobenzimidazole derivative, to develop a novel angiogenesis inhibitor. Experimental Approach: MTT, BrdU, migration and invasion assays, and immunoblotting were employed to examine MFB's effects on vascular endothelial growth factor (VEGF)-induced endothelial cell proliferation, migration, invasion, as well as signaling molecules activation. The anti-angiogenic effects of MFB were analyzed by tube formation, aorta ring sprouting, and matrigel plug assays. We also used a mouse model of lung metastasis to determine the MFB's anti-metastatic effects. Key Results: MFB suppressed cell proliferation, migration, invasion, and endothelial tube formation of VEGF-A-stimulated human umbilical vascular endothelial cells (HUVECs) or VEGF-C-stimulated lymphatic endothelial cells (LECs). MFB suppressed VEGF-A and VEGF-C signaling in HUVECs or LECs. In addition, MFB reduced VEGF-A- or tumor cells-induced neovascularization in vivo. MFB also diminished B16F10 melanoma lung metastasis. The molecular docking results further showed that MFB may bind to VEGFR-2 rather than VEGF-A with high affinity. Conclusions and Implications: These observations indicated that MFB may target VEGF/VEGFR signaling to suppress angiogenesis and lymphangiogenesis. It also supports the role of MFB as a potential lead in developing novel agents for the treatment of angiogenesis- or lymphangiogenesis-associated diseases and cancer.

6.
Front Genet ; 13: 1064980, 2022.
Artículo en Inglés | MEDLINE | ID: mdl-36712865

RESUMEN

Background: Left ventricular mass is a highly heritable disease. Previous studies have suggested common genetic variants to be associated with left ventricular mass; however, the roles of rare variants are still unknown. We performed targeted next-generation sequencing using the TruSight Cardio panel, which provides comprehensive coverage of 175 genes with known associations to 17 inherited cardiac conditions. Methods: We conducted next-generation sequencing using the Illumina TruSight Cardiomyopathy Target Genes platform using the 5% and 95% extreme values of left ventricular mass from community-based participants. After removing poor-quality next-generation sequencing subjects, including call rate <98% and Mendelian errors, 144 participants were used for the analysis. We performed downstream analysis, including quality control, alignment, coverage length, and annotation; after setting filtering criteria for depths more than 60, we found a total of 144 samples and 165 target genes for further analysis. Results: Of the 12,287 autosomal variants, most had minor allele frequencies of <1% (rare frequency), and variants had minor allele frequencies ranging from 1% to 5%. In the multi-allele variant analyses, 16 loci in 15 genes were significant using the false discovery rate of less than .1. In addition, gene-based analyses using continuous and binary outcomes showed that three genes (CASQ2, COL5A1, and FXN) remained to be associated with left ventricular mass status. One single-nucleotide polymorphism (rs7538337) was enriched for the CASQ2 gene expressed in aorta artery (p = 4.6 × 10-18), as was another single-nucleotide polymorphism (rs11103536) for the COL5A1 gene expressed in aorta artery (p = 2.0 × 10-9). Among the novel genes discovered, CASQ2, COL5A1, and FXN are within a protein-protein interaction network with known cardiovascular genes. Conclusion: We clearly demonstrated candidate genes to be associated with left ventricular mass. Further studies to characterize the target genes and variants for their functional mechanisms are warranted.

7.
Front Cardiovasc Med ; 9: 1023355, 2022.
Artículo en Inglés | MEDLINE | ID: mdl-36698922

RESUMEN

Background: Menarche timing may not be directly associated with the risk of coronary artery disease (CAD). Therefore, we investigated the roles of metabolic factors in explaining the effect of age at menarche on CAD risk. Methods: We identified women with age at menarche and CAD by using three analytical methods: Mendelian randomization (MR), logistic regression analysis, and Cox proportional hazard regression. The first two analyses were performed in the Taiwan Biobank (N = 71,923) study, and the last analysis was performed in the Chin-Shan Community Cardiovascular Cohort study (N = 1,598). We further investigated the role of metabolic factors in mediating the effect of age at menarche on CAD risk by using three complementary methods with mediation analyses. Results: One standard deviation of earlier age at menarche was associated with a 2% higher CAD risk [odds ratio = 1.02, 95% confidence interval (CI) = 1.001-1.03] in the MR analysis, an 11% higher risk (odds ratio = 1.11, 95% CI = 1.02-1.21) in the logistic regression analysis, and a 57% higher risk (hazard ratio = 1.57, 95% CI = 1.12-2.19) in the Cox proportional hazard regression. All the analyses consistently supported the role of systolic blood pressure in mediating this effect. The MR results indicated that 29% (95% CI = 26%-32%) of the effect of genetically predicted earlier age at menarche on CAD risk was mediated by genetically predicted systolic blood pressure. Conclusion: The results obtained using different analytical methods suggest that interventions aimed at lowering systolic blood pressure can reduce the cases of CAD attributable to earlier age at menarche.

8.
Nucleic Acids Res ; 50(D1): D471-D479, 2022 01 07.
Artículo en Inglés | MEDLINE | ID: mdl-34788852

RESUMEN

Protein post-translational modifications (PTMs) play an important role in different cellular processes. In view of the importance of PTMs in cellular functions and the massive data accumulated by the rapid development of mass spectrometry (MS)-based proteomics, this paper presents an update of dbPTM with over 2 777 000 PTM substrate sites obtained from existing databases and manual curation of literature, of which more than 2 235 000 entries are experimentally verified. This update has manually curated over 42 new modification types that were not included in the previous version. Due to the increasing number of studies on the mechanism of PTMs in the past few years, a great deal of upstream regulatory proteins of PTM substrate sites have been revealed. The updated dbPTM thus collates regulatory information from databases and literature, and merges them into a protein-protein interaction network. To enhance the understanding of the association between PTMs and molecular functions/cellular processes, the functional annotations of PTMs are curated and integrated into the database. In addition, the existing PTM-related resources, including annotation databases and prediction tools are also renewed. Overall, in this update, we would like to provide users with the most abundant data and comprehensive annotations on PTMs of proteins. The updated dbPTM is now freely accessible at https://awi.cuhk.edu.cn/dbPTM/.


Asunto(s)
Bases de Datos de Proteínas , Redes Reguladoras de Genes , Procesamiento Proteico-Postraduccional , Proteínas/metabolismo , Programas Informáticos , Animales , Arabidopsis/genética , Arabidopsis/metabolismo , Bacterias/genética , Bacterias/metabolismo , Humanos , Internet , Ratones , Modelos Moleculares , Anotación de Secuencia Molecular , Unión Proteica , Conformación Proteica , Mapeo de Interacción de Proteínas , Proteínas/química , Proteínas/genética , Ratas , Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/metabolismo
9.
J Pers Med ; 11(11)2021 Oct 20.
Artículo en Inglés | MEDLINE | ID: mdl-34834399

RESUMEN

The molecular heterogeneity of gene expression profiles of glioblastoma multiforme (GBM) are the most important prognostic factors for tumor recurrence and drug resistance. Thus, the aim of this study was to identify potential target genes related to temozolomide (TMZ) resistance and GBM recurrence. The genomic data of patients with GBM from The Cancer Genome Atlas (TCGA; 154 primary and 13 recurrent tumors) and a local cohort (29 primary and 4 recurrent tumors), samples from different tumor regions from a local cohort (29 tumor and 25 peritumoral regions), and Gene Expression Omnibus data (GSE84465, single-cell RNA sequencing; 3589 cells) were included in this study. Critical gene signatures were identified based an analysis of differentially expressed genes (DEGs). DEGs were further used to evaluate gene enrichment levels among primary and recurrent GBMs and different tumor regions through gene set enrichment analysis. Protein-protein interactions (PPIs) were incorporated into gene regulatory networks to identify the affected metabolic pathways. The enrichment levels of 135 genes were identified in the peritumoral regions as being risk signatures for tumor recurrence. Fourteen genes (DVL1, PRKACB, ARRB1, APC, MAPK9, CAMK2A, PRKCB, CACNA1A, ERBB4, RASGRF1, NF1, RPS6KA2, MAPK8IP2, and PPM1A) derived from the PPI network of 135 genes were upregulated and involved in the regulation of cancer stem cell (CSC) development and relevant signaling pathways (Notch, Hedgehog, Wnt, and MAPK). The single-cell data analysis results indicated that 14 key genes were mainly expressed in oligodendrocyte progenitor cells, which could produce a CSC niche in the peritumoral region. The enrichment levels of 336 genes were identified as biomarkers for evaluating TMZ resistance in the solid tumor region. Eleven genes (ARID5A, CDC42EP3, CDKN1A, FLT3, JUNB, MAP2K3, MYBPC2, RGS14, RNASEK, TBC1D30, and TXNDC11) derived from the PPI network of 336 genes were upregulated and may be associated with a high risk of TMZ resistance; these genes were identified in both the TCGA and local cohorts. Furthermore, the expression patterns of ARID5A, CDKN1A, and MAP2K3 were identical to the gene signatures of TMZ-resistant cell lines. The identified enrichment levels of the two gene sets expressed in tumor and peritumoral regions are potentially helpful for evaluating TMZ resistance in GBM. Moreover, these key genes could be used as biomarkers, potentially providing new molecular strategies for GBM treatment.

10.
Int J Nanomedicine ; 16: 5233-5246, 2021.
Artículo en Inglés | MEDLINE | ID: mdl-34366665

RESUMEN

PURPOSE: Targeted superparamagnetic iron oxide (SPIO) nanoparticles are a promising tool for molecular magnetic resonance imaging (MRI) diagnosis. Lipid-coated SPIO nanoparticles have a nonfouling property that can reduce nonspecific binding to off-target cells and prevent agglomeration, making them suitable contrast agents for molecular MRI diagnosis. PD-L1 is a poor prognostic factor for patients with glioblastoma. Most recurrent glioblastomas are temozolomide resistant. Diagnostic probes targeting PD-L1 could facilitate early diagnosis and be used to predict responses to targeted PD-L1 immunotherapy in patients with primary or recurrent glioblastoma. We conjugated lipid-coated SPIO nanoparticles with PD-L1 antibodies to identify PD-L1 expression in glioblastoma or temozolomide-resistant glioblastoma by using MRI. METHODS: The synthesized PD-L1 antibody-conjugated SPIO (PDL1-SPIO) nanoparticles were characterized using dynamic light scattering, zeta potential assays, transmission electron microscopy images, Prussian blue assay, in vitro cell affinity assay, and animal MRI analysis. RESULTS: PDL1-SPIO exhibited a specific binding capacity to PD-L1 of the mouse glioblastoma cell line (GL261). The presence and quantity of PDL1-SPIO in temozolomide-resistant glioblastoma cells and tumor tissue were confirmed through Prussian blue staining and in vivo T2* map MRI, respectively. CONCLUSION: This is the first study to demonstrate that PDL1-SPIO can specifically target temozolomide-resistant glioblastoma with PD-L1 expression in the brain and can be quantified through MRI analysis, thus making it suitable for the diagnosis of PD-L1 expression in temozolomide-resistant glioblastoma in vivo.


Asunto(s)
Glioblastoma , Animales , Antígeno B7-H1 , Línea Celular Tumoral , Medios de Contraste , Compuestos Férricos , Glioblastoma/diagnóstico por imagen , Glioblastoma/tratamiento farmacológico , Humanos , Lípidos , Nanopartículas Magnéticas de Óxido de Hierro , Imagen por Resonancia Magnética , Nanopartículas de Magnetita , Ratones , Temozolomida/farmacología
11.
Nutrients ; 13(5)2021 May 05.
Artículo en Inglés | MEDLINE | ID: mdl-34063157

RESUMEN

Maternal nutrition intake during pregnancy may affect the mother-to-child transmission of bacteria, resulting in gut microflora changes in the offspring, with long-term health consequences in later life. Longitudinal human studies are lacking, as only a small amount of studies showing the effect of nutrition intake during pregnancy on the gut microbiome of infants have been performed, and these studies have been mainly conducted on animals. This pilot study explores the effects of high or low fruit and vegetable gestational intake on the infant microbiome. We enrolled pregnant women with a complete 3-day dietary record and received postpartum follow-up. The 16S rRNA gene sequence was used to characterize the infant gut microbiome at 2 months (n = 39). Principal coordinate analysis ordination revealed that the infant gut microbiome clustered differently for high and low maternal fruit and vegetable consumption (p < 0.001). The linear discriminant analysis effect size and feature selection identified 6 and 17 taxa from both the high and low fruit and vegetable consumption groups. Among the 23 abundant taxa, we observed that six maternal intake nutrients were associated with nine taxa (e.g., Erysipelatoclostridium, Isobaculum, Lachnospiraceae, Betaproteobacteria, Burkholderiaceae, Sutterella, Clostridia, Clostridiales, and Lachnoclostridium). The amount of gestational fruit and vegetable consumption is associated with distinct changes in the infant gut microbiome at 2 months of age. Therefore, strategies involving increased fruit and vegetable consumption during pregnancy should be employed for modifying the gut microbiome early in life.


Asunto(s)
Dieta/estadística & datos numéricos , Frutas , Microbioma Gastrointestinal/genética , Fenómenos Fisiologicos Nutricionales Maternos/fisiología , Verduras , Adulto , Estudios de Cohortes , Encuestas sobre Dietas , Heces/microbiología , Femenino , Humanos , Lactante , Modelos Lineales , Masculino , Proyectos Piloto , Embarazo , ARN Ribosómico 16S/análisis
12.
Cancers (Basel) ; 13(11)2021 May 21.
Artículo en Inglés | MEDLINE | ID: mdl-34064004

RESUMEN

This study is to identify potential multiomics biomarkers for the early detection of the prognostic recurrence of PC patients. A total of 494 prostate adenocarcinoma (PRAD) patients (60-recurrent included) from the Cancer Genome Atlas (TCGA) portal were analyzed using the autoencoder model and similarity network fusion. Then, multiomics panels were constructed according to the intersected omics biomarkers identified from the two models. Six intersected omics biomarkers, TELO2, ZMYND19, miR-143, miR-378a, cg00687383 (MED4), and cg02318866 (JMJD6; METTL23), were collected for multiomics panel construction. The difference between the Kaplan-Meier curves of high and low recurrence-risk groups generated from the multiomics panel achieved p-value = 5.33 × 10-9, which is better than the former study (p-value = 5 × 10-7). Additionally, when evaluating the selected multiomics biomarkers with clinical information (Gleason score, age, and cancer stage), a high-performance prediction model was generated with C-index = 0.713, p-value = 2.97 × 10-15, and AUC = 0.789. The risk score generated from the selected multiomics biomarkers worked as an effective indicator for the prediction of PRAD recurrence. This study helps us to understand the etiology and pathways of PRAD and further benefits both patients and physicians with potential prognostic biomarkers when making clinical decisions after surgical treatment.

13.
Cancers (Basel) ; 12(10)2020 Oct 19.
Artículo en Inglés | MEDLINE | ID: mdl-33086550

RESUMEN

Characterization of immunophenotypes in glioblastoma (GBM) is important for therapeutic stratification and helps predict treatment response and prognosis. Radiomics can be used to predict molecular subtypes and gene expression levels. However, whether radiomics aids immunophenotyping prediction is still unknown. In this study, to classify immunophenotypes in patients with GBM, we developed machine learning-based magnetic resonance (MR) radiomic models to evaluate the enrichment levels of four immune subsets: Cytotoxic T lymphocytes (CTLs), activated dendritic cells, regulatory T cells (Tregs), and myeloid-derived suppressor cells (MDSCs). Independent testing data and the leave-one-out cross-validation method were used to evaluate model effectiveness and model performance, respectively. We identified five immunophenotypes (G1 to G5) based on the enrichment level for the four immune subsets. G2 had the worst prognosis and comprised highly enriched MDSCs and lowly enriched CTLs. G3 had the best prognosis and comprised lowly enriched MDSCs and Tregs and highly enriched CTLs. The average accuracy of T1-weighted contrasted MR radiomics models of the enrichment level for the four immune subsets reached 79% and predicted G2, G3, and the "immune-cold" phenotype (G1) according to our radiomics models. Our radiomic immunophenotyping models feasibly characterize the immunophenotypes of GBM and can predict patient prognosis.

14.
PLoS One ; 15(4): e0231594, 2020.
Artículo en Inglés | MEDLINE | ID: mdl-32315343

RESUMEN

Recurrence and poorly differentiated (grade 3 and above) and atypical cell type endometrial cancer (EC) have poor prognosis outcome. The mechanisms and characteristics of recurrence and distal metastasis of EC remain unclear. The extracellular matrix (ECM) of the reproductive tract in women undergoes extensive structural remodelling changes every month. Altered ECMs surrounding cells were believed to play crucial roles in a cancer progression. To decipher the associations between ECM and EC development, we generated a PAN-ECM Data list of 1516 genes including ECM molecules (ECMs), synthetic and degradation enzymes for ECMs, ECM receptors, and soluble molecules that regulate ECM and used RNA-Seq data from The Cancer Genome Atlas (TCGA) for the studies. The alterations of PAN-ECM genes by comparing the RNA-Seq expressions profiles of EC samples which have been grouped as tumorigenesis and metastasis group based on their pathological grading were identified. Differential analyses including functional enrichment, co-expression network, and molecular network analysis were carried out to identify the specific PAN-ECM genes that may involve in the progression of EC. Eight hundred and thirty-one and 241 PAN-ECM genes were significantly involved in tumorigenesis (p-value <1.571e-15) and metastasis (p-value <2.2e-16), respectively, whereas 140 genes were in the intersection of tumorigenesis and metastasis. Interestingly, 92 of the 140 intersecting PAN-ECM genes showed contrasting fold changes between the tumorigenesis and metastasis datasets. Enrichment analysis for the contrast PAN-ECM genes indicated pathways such as GP6 signaling, ILK signaling, and interleukin (IL)-8 signaling pathways were activated in metastasis but inhibited in tumorigenesis. The significantly activated ECM and ECM associated genes in GP6 signaling, ILK signaling, and interleukin (IL)-8 signaling pathways may play crucial roles in metastasis of EC. Our study provides a better understanding of the etiology and the progression of EC.


Asunto(s)
Carcinogénesis/genética , Neoplasias Endometriales/genética , Matriz Extracelular/genética , Proteínas de Neoplasias/genética , Biología Computacional , Progresión de la Enfermedad , Neoplasias Endometriales/patología , Endometrio/metabolismo , Endometrio/patología , Femenino , Regulación Neoplásica de la Expresión Génica/genética , Humanos , Metástasis de la Neoplasia , RNA-Seq , Receptores de Superficie Celular/genética , Transducción de Señal/genética
15.
Sci Rep ; 9(1): 16175, 2019 11 07.
Artículo en Inglés | MEDLINE | ID: mdl-31700141

RESUMEN

Succinylation is a type of protein post-translational modification (PTM), which can play important roles in a variety of cellular processes. Due to an increasing number of site-specific succinylated peptides obtained from high-throughput mass spectrometry (MS), various tools have been developed for computationally identifying succinylated sites on proteins. However, most of these tools predict succinylation sites based on traditional machine learning methods. Hence, this work aimed to carry out the succinylation site prediction based on a deep learning model. The abundance of MS-verified succinylated peptides enabled the investigation of substrate site specificity of succinylation sites through sequence-based attributes, such as position-specific amino acid composition, the composition of k-spaced amino acid pairs (CKSAAP), and position-specific scoring matrix (PSSM). Additionally, the maximal dependence decomposition (MDD) was adopted to detect the substrate signatures of lysine succinylation sites by dividing all succinylated sequences into several groups with conserved substrate motifs. According to the results of ten-fold cross-validation, the deep learning model trained using PSSM and informative CKSAAP attributes can reach the best predictive performance and also perform better than traditional machine-learning methods. Moreover, an independent testing dataset that truly did not exist in the training dataset was used to compare the proposed method with six existing prediction tools. The testing dataset comprised of 218 positive and 2621 negative instances, and the proposed model could yield a promising performance with 84.40% sensitivity, 86.99% specificity, 86.79% accuracy, and an MCC value of 0.489. Finally, the proposed method has been implemented as a web-based prediction tool (CNN-SuccSite), which is now freely accessible at http://csb.cse.yzu.edu.tw/CNN-SuccSite/ .


Asunto(s)
Bases de Datos de Proteínas , Aprendizaje Profundo , Procesamiento Proteico-Postraduccional , Proteínas , Análisis de Secuencia de Proteína , Ácido Succínico/metabolismo , Lisina/genética , Lisina/metabolismo , Proteínas/genética , Proteínas/metabolismo
16.
BMC Med Genomics ; 11(Suppl 7): 34, 2019 Mar 20.
Artículo en Inglés | MEDLINE | ID: mdl-30894197

RESUMEN

BACKGROUND: Recent studies have proposed several gene signatures as biomarkers for different grades of gliomas from various perspectives. However, most of these genes can only be used appropriately for patients with specific grades of gliomas. METHODS: In this study, we aimed to identify survival-relevant genes shared between glioblastoma multiforme (GBM) and lower-grade glioma (LGG), which could be used as potential biomarkers to classify patients into different risk groups. Cox proportional hazard regression model (Cox model) was used to extract relative genes, and effectiveness of genes was estimated against random forest regression. Finally, risk models were constructed with logistic regression. RESULTS: We identified 104 key genes that were shared between GBM and LGG, which could be significantly correlated with patients' survival based on next-generation sequencing data obtained from The Cancer Genome Atlas for gene expression analysis. The effectiveness of these genes in the survival prediction of GBM and LGG was evaluated, and the average receiver operating characteristic curve (ROC) area under the curve values ranged from 0.7 to 0.8. Gene set enrichment analysis revealed that these genes were involved in eight significant pathways and 23 molecular functions. Moreover, the expressions of ten (CTSZ, EFEMP2, ITGA5, KDELR2, MDK, MICALL2, MAP 2 K3, PLAUR, SERPINE1, and SOCS3) of these genes were significantly higher in GBM than in LGG, and comparing their expression levels to those of the proposed control genes (TBP, IPO8, and SDHA) could have the potential capability to classify patients into high- and low- risk groups, which differ significantly in the overall survival. Signatures of candidate genes were validated, by multiple microarray datasets from Gene Expression Omnibus, to increase the robustness of using these potential prognostic factors. In both the GBM and LGG cohort study, most of the patients in the high-risk group had the IDH1 wild-type gene, and those in the low-risk group had IDH1 mutations. Moreover, most of the high-risk patients with LGG possessed a 1p/19q-noncodeletion. CONCLUSION: In this study, we identified survival relevant genes which were shared between GBM and LGG, and those enabled to classify patients into high- and low-risk groups based on expression level analysis. Both the risk groups could be correlated with the well-known genetic variants, thus suggesting their potential prognostic value in clinical application.


Asunto(s)
Biomarcadores de Tumor/genética , Neoplasias Encefálicas/genética , Glioblastoma/genética , Glioma/genética , Transcriptoma , Adulto , Anciano , Neoplasias Encefálicas/fisiopatología , Estudios de Cohortes , Femenino , Glioblastoma/fisiopatología , Glioma/fisiopatología , Humanos , Masculino , Persona de Mediana Edad , Pronóstico , Modelos de Riesgos Proporcionales , Factores de Riesgo , Análisis de Supervivencia
17.
BMC Bioinformatics ; 19(Suppl 13): 384, 2019 Feb 04.
Artículo en Inglés | MEDLINE | ID: mdl-30717647

RESUMEN

BACKGROUND: Glutarylation, the addition of a glutaryl group (five carbons) to a lysine residue of a protein molecule, is an important post-translational modification and plays a regulatory role in a variety of physiological and biological processes. As the number of experimentally identified glutarylated peptides increases, it becomes imperative to investigate substrate motifs to enhance the study of protein glutarylation. We carried out a bioinformatics investigation of glutarylation sites based on amino acid composition using a public database containing information on 430 non-homologous glutarylation sites. RESULTS: The TwoSampleLogo analysis indicates that positively charged and polar amino acids surrounding glutarylated sites may be associated with the specificity in substrate site of protein glutarylation. Additionally, the chi-squared test was utilized to explore the intrinsic interdependence between two positions around glutarylation sites. Further, maximal dependence decomposition (MDD), which consists of partitioning a large-scale dataset into subgroups with statistically significant amino acid conservation, was used to capture motif signatures of glutarylation sites. We considered single features, such as amino acid composition (AAC), amino acid pair composition (AAPC), and composition of k-spaced amino acid pairs (CKSAAP), as well as the effectiveness of incorporating MDD-identified substrate motifs into an integrated prediction model. Evaluation by five-fold cross-validation showed that AAC was most effective in discriminating between glutarylation and non-glutarylation sites, according to support vector machine (SVM). CONCLUSIONS: The SVM model integrating MDD-identified substrate motifs performed well, with a sensitivity of 0.677, a specificity of 0.619, an accuracy of 0.638, and a Matthews Correlation Coefficient (MCC) value of 0.28. Using an independent testing dataset (46 glutarylated and 92 non-glutarylated sites) obtained from the literature, we demonstrated that the integrated SVM model could improve the predictive performance effectively, yielding a balanced sensitivity and specificity of 0.652 and 0.739, respectively. This integrated SVM model has been implemented as a web-based system (MDDGlutar), which is now freely available at http://csb.cse.yzu.edu.tw/MDDGlutar/ .


Asunto(s)
Biología Computacional/métodos , Glutaratos/metabolismo , Lisina/metabolismo , Secuencias de Aminoácidos , Secuencia de Aminoácidos , Animales , Bases de Datos de Proteínas , Lisina/química , Ratones , Proteínas/química , Curva ROC , Reproducibilidad de los Resultados , Especificidad por Sustrato , Máquina de Vectores de Soporte , Interfaz Usuario-Computador
18.
Clin Cancer Res ; 24(18): 4429-4436, 2018 09 15.
Artículo en Inglés | MEDLINE | ID: mdl-29789422

RESUMEN

Purpose: The new classification announced by the World Health Organization in 2016 recognized five molecular subtypes of diffuse gliomas based on isocitrate dehydrogenase (IDH) and 1p/19q genotypes in addition to histologic phenotypes. We aim to determine whether clinical MRI can stratify these molecular subtypes to benefit the diagnosis and monitoring of gliomas.Experimental Design: The data from 456 subjects with gliomas were obtained from The Cancer Imaging Archive. Overall, 214 subjects, including 106 cases of glioblastomas and 108 cases of lower grade gliomas with preoperative MRI, survival data, histology, IDH, and 1p/19q status were included. We proposed a three-level machine-learning model based on multimodal MR radiomics to classify glioma subtypes. An independent dataset with 70 glioma subjects was further collected to verify the model performance.Results: The IDH and 1p/19q status of gliomas can be classified by radiomics and machine-learning approaches, with areas under ROC curves between 0.922 and 0.975 and accuracies between 87.7% and 96.1% estimated on the training dataset. The test on the validation dataset showed a comparable model performance with that on the training dataset, suggesting the efficacy of the trained classifiers. The classification of 5 molecular subtypes solely based on the MR phenotypes achieved an 81.8% accuracy, and a higher accuracy of 89.2% could be achieved if the histology diagnosis is available.Conclusions: The MR radiomics-based method provides a reliable alternative to determine the histology and molecular subtypes of gliomas. Clin Cancer Res; 24(18); 4429-36. ©2018 AACR.


Asunto(s)
Glioma/diagnóstico por imagen , Glioma/genética , Isocitrato Deshidrogenasa/genética , Imagen por Resonancia Magnética , Deleción Cromosómica , Cromosomas Humanos Par 1/genética , Cromosomas Humanos Par 19/genética , Femenino , Genotipo , Glioma/clasificación , Glioma/patología , Humanos , Aprendizaje Automático , Masculino , Persona de Mediana Edad , Mutación
19.
BMC Syst Biol ; 11(Suppl 7): 132, 2017 12 21.
Artículo en Inglés | MEDLINE | ID: mdl-29322920

RESUMEN

BACKGROUND: Protein post-translational modification (PTM) plays an essential role in various cellular processes that modulates the physical and chemical properties, folding, conformation, stability and activity of proteins, thereby modifying the functions of proteins. The improved throughput of mass spectrometry (MS) or MS/MS technology has not only brought about a surge in proteome-scale studies, but also contributed to a fruitful list of identified PTMs. However, with the increase in the number of identified PTMs, perhaps the more crucial question is what kind of biological mechanisms these PTMs are involved in. This is particularly important in light of the fact that most protein-based pharmaceuticals deliver their therapeutic effects through some form of PTM. Yet, our understanding is still limited with respect to the local effects and frequency of PTM sites near pharmaceutical binding sites and the interfaces of protein-protein interaction (PPI). Understanding PTM's function is critical to our ability to manipulate the biological mechanisms of protein. RESULTS: In this study, to understand the regulation of protein functions by PTMs, we mapped 25,835 PTM sites to proteins with available three-dimensional (3D) structural information in the Protein Data Bank (PDB), including 1785 modified PTM sites on the 3D structure. Based on the acquired structural PTM sites, we proposed to use five properties for the structural characterization of PTM substrate sites: the spatial composition of amino acids, residues and side-chain orientations surrounding the PTM substrate sites, as well as the secondary structure, division of acidity and alkaline residues, and solvent-accessible surface area. We further mapped the structural PTM sites to the structures of drug binding and PPI sites, identifying a total of 1917 PTM sites that may affect PPI and 3951 PTM sites associated with drug-target binding. An integrated analytical platform (CruxPTM), with a variety of methods and online molecular docking tools for exploring the structural characteristics of PTMs, is presented. In addition, all tertiary structures of PTM sites on proteins can be visualized using the JSmol program. CONCLUSION: Resolving the function of PTM sites is important for understanding the role that proteins play in biological mechanisms. Our work attempted to delineate the structural correlation between PTM sites and PPI or drug-target binding. CurxPTM could help scientists narrow the scope of their PTM research and enhance the efficiency of PTM identification in the face of big proteome data. CruxPTM is now available at http://csb.cse.yzu.edu.tw/CruxPTM/ .


Asunto(s)
Preparaciones Farmacéuticas/metabolismo , Mapeo de Interacción de Proteínas , Procesamiento Proteico-Postraduccional , Proteínas/metabolismo , Humanos , Modelos Moleculares , Unión Proteica , Estructura Terciaria de Proteína , Proteínas/química
20.
BMC Syst Biol ; 11(Suppl 7): 137, 2017 12 21.
Artículo en Inglés | MEDLINE | ID: mdl-29322938

RESUMEN

BACKGROUND: Carbonylation, which takes place through oxidation of reactive oxygen species (ROS) on specific residues, is an irreversibly oxidative modification of proteins. It has been reported that the carbonylation is related to a number of metabolic or aging diseases including diabetes, chronic lung disease, Parkinson's disease, and Alzheimer's disease. Due to the lack of computational methods dedicated to exploring motif signatures of protein carbonylation sites, we were motivated to exploit an iterative statistical method to characterize and identify carbonylated sites with motif signatures. RESULTS: By manually curating experimental data from research articles, we obtained 332, 144, 135, and 140 verified substrate sites for K (lysine), R (arginine), T (threonine), and P (proline) residues, respectively, from 241 carbonylated proteins. In order to examine the informative attributes for classifying between carbonylated and non-carbonylated sites, multifarious features including composition of twenty amino acids (AAC), composition of amino acid pairs (AAPC), position-specific scoring matrix (PSSM), and positional weighted matrix (PWM) were investigated in this study. Additionally, in an attempt to explore the motif signatures of carbonylation sites, an iterative statistical method was adopted to detect statistically significant dependencies of amino acid compositions between specific positions around substrate sites. Profile hidden Markov model (HMM) was then utilized to train a predictive model from each motif signature. Moreover, based on the method of support vector machine (SVM), we adopted it to construct an integrative model by combining the values of bit scores obtained from profile HMMs. The combinatorial model could provide an enhanced performance with evenly predictive sensitivity and specificity in the evaluation of cross-validation and independent testing. CONCLUSION: This study provides a new scheme for exploring potential motif signatures at substrate sites of protein carbonylation. The usefulness of the revealed motifs in the identification of carbonylated sites is demonstrated by their effective performance in cross-validation and independent testing. Finally, these substrate motifs were adopted to build an available online resource (MDD-Carb, http://csb.cse.yzu.edu.tw/MDDCarb/ ) and are also anticipated to facilitate the study of large-scale carbonylated proteomes.


Asunto(s)
Modelos Moleculares , Carbonilación Proteica , Proteínas/química , Proteínas/metabolismo , Secuencias de Aminoácidos , Secuencia de Aminoácidos , Sitios de Unión , Internet
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...