Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 64
Filtrar
Más filtros

Banco de datos
País/Región como asunto
Tipo del documento
País de afiliación
Intervalo de año de publicación
1.
Brief Bioinform ; 23(3)2022 05 13.
Artículo en Inglés | MEDLINE | ID: mdl-35419584

RESUMEN

Gene Ontology (GO) is widely used in the biological domain. It is the most comprehensive ontology providing formal representation of gene functions (GO concepts) and relations between them. However, unintentional quality defects (e.g. missing or erroneous relations) in GO may exist due to the large size of GO concepts and complexity of GO structures. Such quality defects would impact the results of GO-based analyses and applications. In this work, we introduce a novel evidence-based lexical pattern approach for quality assurance of GO relations. We leverage two layers of evidence to suggest potentially missing relations in GO as follows. We first utilize related concept pairs (i.e. existing relations) in GO to extract relationship-specific lexical patterns, which serve as the first layer evidence to automatically suggest potentially missing relations between unrelated concept pairs. For each suggested missing relation, we further identify two other existing relations as the second layer of evidence that resemble the difference between the missing relation and the existing relation based on which the missing relation is suggested. Applied to the 15 December 2021 release of GO, this approach suggested a total of 866 potentially missing relations. Local domain experts evaluated the entire set of potentially missing relations, and identified 821 as missing relations and 45 indicate erroneous existing relations. We submitted these findings to the GO consortium for further validation and received encouraging feedback. These indicate that our evidence-based approach can be utilized to uncover missing relations and erroneous existing relations in GO.


Asunto(s)
Ontología de Genes
2.
Bioinformatics ; 38(8): 2096-2101, 2022 04 12.
Artículo en Inglés | MEDLINE | ID: mdl-35176131

RESUMEN

MOTIVATION: Cross-sectional analyses of primary cancer genomes have identified regions of recurrent somatic copy-number alteration, many of which result from positive selection during cancer formation and contain driver genes. However, no effective approach exists for identifying genomic loci under significantly different degrees of selection in cancers of different subtypes, anatomic sites or disease stages. RESULTS: CNGPLD is a new tool for performing case-control somatic copy-number analysis that facilitates the discovery of differentially amplified or deleted copy-number aberrations in a case group of cancer compared with a control group of cancer. This tool uses a Gaussian process statistical framework in order to account for the covariance structure of copy-number data along genomic coordinates and to control the false discovery rate at the region level. AVAILABILITY AND IMPLEMENTATION: CNGPLD is freely available at https://bitbucket.org/djhshih/cngpld as an R package. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Genoma , Neoplasias , Humanos , Estudios Transversales , Genómica , Variaciones en el Número de Copia de ADN , Neoplasias/genética , Estudios de Casos y Controles , Programas Informáticos
3.
Am J Respir Cell Mol Biol ; 66(1): 53-63, 2022 01.
Artículo en Inglés | MEDLINE | ID: mdl-34370624

RESUMEN

Idiopathic pulmonary fibrosis (IPF), a devastating, fibroproliferative, chronic lung disorder, is associated with expansion of fibroblasts/myofibroblasts, which leads to excessive production and deposition of extracellular matrix. IPF is typically clinically identified as end-stage lung disease, after fibrotic processes are well-established and advanced. Fibroblasts have been shown to be critically important in the development and progression of IPF. We hypothesize that differential chromatin access can drive genetic differences in IPF fibroblasts relative to healthy fibroblasts. To this end, we performed assay of transposase-accessible chromatin sequencing to identify differentially accessible regions within the genomes of fibroblasts from healthy and IPF lungs. Multiple motifs were identified to be enriched in IPF fibroblasts compared with healthy fibroblasts, including binding motifs for TWIST1 and FOXA1. RNA sequencing identified 93 genes that could be annotated to differentially accessible regions. Pathway analysis of the annotated genes identified cellular adhesion, cytoskeletal anchoring, and cell differentiation as important biological processes. In addition, single nucleotide polymorphism analysis showed that linkage disequilibrium blocks of IPF risk single nucleotide polymorphisms with IPF-accessible regions that have been identified to be located in genes that are important in IPF, including MUC5B, TERT, and TOLLIP. Validation studies in isolated lung tissue confirmed increased expression for TWIST1 and FOXA1 in addition to revealing SHANK2 and CSPR2 as novel targets. Thus, modulation of differential chromatin access may be an important mechanism in the pathogenesis of lung fibrosis.


Asunto(s)
Epigénesis Genética , Fibroblastos/metabolismo , Fibrosis Pulmonar Idiopática/genética , Fibrosis Pulmonar Idiopática/patología , Transcriptoma/genética , Secuencia de Bases , Cromatina/metabolismo , Perfilación de la Expresión Génica , Redes Reguladoras de Genes , Humanos , Anotación de Secuencia Molecular , Polimorfismo de Nucleótido Simple/genética , Factores de Transcripción/metabolismo , Transposasas/metabolismo
4.
Ann Rheum Dis ; 81(6): 854-860, 2022 06.
Artículo en Inglés | MEDLINE | ID: mdl-35190386

RESUMEN

OBJECTIVES: To characterise the peripheral blood cell (PBC) gene expression changes ensuing from mycophenolate mofetil (MMF) or cyclophosphamide (CYC) treatment and to determine the predictive significance of baseline PBC transcript scores for response to immunosuppression in systemic sclerosis (SSc)-related interstitial lung disease (ILD). METHODS: PBC RNA samples from baseline and 12-month visits, corresponding to the active treatment period of both arms in Scleroderma Lung Study II, were investigated by global RNA sequencing. Joint models were created to examine the predictive significance of baseline composite modular scores for the course of forced vital capacity (FVC) per cent predicted measurements from 3 to 12 months. RESULTS: 134 patients with SSc-ILD (CYC=69 and MMF=65) were investigated. CYC led to an upregulation of erythropoiesis, inflammation and myeloid lineage-related modules and a downregulation of lymphoid lineage-related modules. The modular changes resulting from MMF treatment were more modest and included a downregulation of plasmablast module. In the longitudinal analysis, none of the baseline transcript module scores showed predictive significance for FVC% course in the CYC arm. In contrast, in the MMF arm, higher baseline lymphoid lineage modules predicted better subsequent FVC% course, while higher baseline myeloid lineage and inflammation modules predicted worse subsequent FVC% course. CONCLUSION: Consistent with the primary mechanism of action of MMF on lymphocytes, patients with SSc-ILD with higher baseline lymphoid module scores had better FVC% course, while those with higher myeloid cell lineage activation score had poorer FVC% course on MMF.


Asunto(s)
Enfermedades Pulmonares Intersticiales , Esclerodermia Sistémica , Ciclofosfamida/uso terapéutico , Perfilación de la Expresión Génica , Humanos , Inmunosupresores/uso terapéutico , Inflamación , Pulmón , Enfermedades Pulmonares Intersticiales/etiología , Enfermedades Pulmonares Intersticiales/genética , Ácido Micofenólico/uso terapéutico , Esclerodermia Sistémica/complicaciones , Esclerodermia Sistémica/tratamiento farmacológico , Esclerodermia Sistémica/genética , Capacidad Vital
5.
Hum Genet ; 139(10): 1261-1272, 2020 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-32318854

RESUMEN

Nonsyndromic cleft lip with or without cleft palate (NSCLP) is a common birth defect for which only ~ 20% of the underlying genetic variation has been identified. Variants in noncoding regions have been increasingly suggested to contribute to the missing heritability. In this study, we investigated whether variation in craniofacial enhancers contributes to NSCLP. Candidate enhancers were identified using VISTA Enhancer Browser and previous publications. Prioritization was based on patterning defects in knockout mice, deletion/duplication of craniofacial genes in animal models and results of whole exome/whole genome sequencing studies. This resulted in 20 craniofacial enhancers to be investigated. Custom amplicon-based sequencing probes were designed and used for sequencing 380 NSCLP probands (from multiplex and simplex families of non-Hispanic white (NHW) and Hispanic ethnicities) using Illumina MiSeq. The frequencies of identified variants were compared to ethnically matched European (CEU) and Los Angeles Mexican (MXL) control genomes and used for association analyses. Variants in mm427/MSX1 and hs1582/SPRY1 showed genome-wide significant association with NSCLP (p ≤ 6.4 × 10-11). In silico analysis showed that these enhancer variants may disrupt important transcription factor binding sites. Haplotypes involving these enhancers and also mm435/ABCA4 were significantly associated with NSCLP, especially in NHW (p ≤ 6.3 × 10-7). Importantly, groupwise burden analysis showed several enhancer combinations significantly over-represented in NSCLP individuals, revealing novel NSCLP pathways and supporting a polygenic inheritance model. Our findings support the role of craniofacial enhancer sequence variation in the etiology of NSCLP.


Asunto(s)
Labio Leporino/genética , Fisura del Paladar/genética , Elementos de Facilitación Genéticos , Predisposición Genética a la Enfermedad , Variación Genética , Herencia Multifactorial , Transportadoras de Casetes de Unión a ATP/genética , Animales , Enfermedades Asintomáticas , Labio Leporino/etnología , Labio Leporino/patología , Fisura del Paladar/etnología , Fisura del Paladar/patología , Embrión de Mamíferos , Femenino , Estudios de Asociación Genética , Secuenciación de Nucleótidos de Alto Rendimiento , Hispánicos o Latinos , Humanos , Factor de Transcripción MSX1/genética , Masculino , Proteínas de la Membrana/genética , Ratones , Linaje , Fosfoproteínas/genética , Estados Unidos , Población Blanca
6.
Ann Rheum Dis ; 79(3): 379-386, 2020 03.
Artículo en Inglés | MEDLINE | ID: mdl-31767698

RESUMEN

OBJECTIVES: Determine global skin transcriptome patterns of early diffuse systemic sclerosis (SSc) and how they differ from later disease. METHODS: Skin biopsy RNA from 48 patients in the Prospective Registry for Early Systemic Sclerosis (PRESS) cohort (mean disease duration 1.3 years) and 33 matched healthy controls was examined by next-generation RNA sequencing. Data were analysed for cell type-specific signatures and compared with similarly obtained data from 55 previously biopsied patients in Genetics versus Environment in Scleroderma Outcomes Study cohort with longer disease duration (mean 7.4 years) and their matched controls. Correlations with histological features and clinical course were also evaluated. RESULTS: SSc patients in PRESS had a high prevalence of M2 (96%) and M1 (94%) macrophage and CD8 T cell (65%), CD4 T cell (60%) and B cell (69%) signatures. Immunohistochemical staining of immune cell markers correlated with the gene expression-based immune cell signatures. The prevalence of immune cell signatures in early diffuse SSc patients was higher than in patients with longer disease duration. In the multivariable model, adaptive immune cell signatures were significantly associated with shorter disease duration, while fibroblast and macrophage cell type signatures were associated with higher modified Rodnan Skin Score (mRSS). Immune cell signatures also correlated with skin thickness progression rate prior to biopsy, but did not predict subsequent mRSS progression. CONCLUSIONS: Skin in early diffuse SSc has prominent innate and adaptive immune cell signatures. As a prominently affected end organ, these signatures reflect the preceding rate of disease progression. These findings could have implications in understanding SSc pathogenesis and clinical trial design.


Asunto(s)
Inmunidad Adaptativa/genética , Inmunidad Innata/genética , Esclerodermia Difusa/genética , Esclerodermia Difusa/inmunología , Adulto , Biomarcadores/análisis , Biopsia , Femenino , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Masculino , Persona de Mediana Edad , Análisis Multivariante , Estudios Prospectivos , Sistema de Registros , Análisis de Regresión , Esclerodermia Difusa/patología , Análisis de Secuencia de ARN , Índice de Severidad de la Enfermedad , Piel/inmunología , Piel/patología , Transcriptoma
7.
J Biomed Inform ; 104: 103399, 2020 04.
Artículo en Inglés | MEDLINE | ID: mdl-32151769

RESUMEN

OBJECTIVE: The centrality of data to biomedical research is difficult to understate, and the same is true for the importance of the biomedical literature in disseminating empirical findings to scientific questions made on such data. But the connections between the literature and related datasets are often weak, hampering the ability of scientists to easily move between existing datasets and existing findings to derive new scientific hypotheses. This work aims to recommend relevant literature articles for datasets with the ultimate goal of increasing the productivity of researchers. Our approach to literature recommendation for datasets is a part of the dataset reusability platform developed at the University Texas Health Science Center at Houston for datasets related to gene expression. This platform incorporates datasets from Gene Expression Omnibus (GEO). An average of 34 datasets were added to GEO daily in the last five years (i.e. 2014 to 2018), demonstrating the need for automatic methods to connect these datasets with relevant literature. The relevant literature for a given dataset may describe that dataset, provide a scientific finding based on that dataset, or even describe prior and related work to the dataset's topic that is of interest to users of the dataset. MATERIALS AND METHODS: We adopt an information retrieval paradigm for literature recommendation. In our experiments, distributional semantic features are created from the title and abstract of MEDLINE articles. Then, related articles are identified for datasets in GEO. We evaluate multiple distributional methods such as TF-IDF, BM25, Latent Semantic Analysis, Latent Dirichlet Allocation, word2vec, and doc2vec. Top similar papers are recommended for each dataset using cosine similarity between the dataset's vector representation and every paper's vector representation. We also propose several novel re-ranking and normalization methods over embeddings to improve the recommendations. RESULTS: The top-performing literature recommendation technique achieved a strict precision at 10 of 0.8333 and a partial precision at 10 of 0.9000 using BM25 based on a manual evaluation of 36 datasets. Evaluation on a larger, automatically-collected benchmark shows small but consistent gains by emphasizing the similarity of dataset and article titles. CONCLUSION: This work is the first step toward developing a literature recommendation tool by recommending relevant literature for datasets. This will hopefully lead to better data reuse experience.


Asunto(s)
Investigación Biomédica , Almacenamiento y Recuperación de la Información , Expresión Génica , Humanos , Publicaciones , Semántica
8.
J Med Internet Res ; 22(7): e16981, 2020 07 31.
Artículo en Inglés | MEDLINE | ID: mdl-32735224

RESUMEN

BACKGROUND: Asthma exacerbation is an acute or subacute episode of progressive worsening of asthma symptoms and can have a significant impact on patients' quality of life. However, efficient methods that can help identify personalized risk factors and make early predictions are lacking. OBJECTIVE: This study aims to use advanced deep learning models to better predict the risk of asthma exacerbations and to explore potential risk factors involved in progressive asthma. METHODS: We proposed a novel time-sensitive, attentive neural network to predict asthma exacerbation using clinical variables from large electronic health records. The clinical variables were collected from the Cerner Health Facts database between 1992 and 2015, including 31,433 adult patients with asthma. Interpretations on both patient and cohort levels were investigated based on the model parameters. RESULTS: The proposed model obtained an area under the curve value of 0.7003 through a five-fold cross-validation, which outperformed the baseline methods. The results also demonstrated that the addition of elapsed time embeddings considerably improved the prediction performance. Further analysis observed diverse distributions of contributing factors across patients as well as some possible cohort-level risk factors, which could be found supporting evidence from peer-reviewed literature such as respiratory diseases and esophageal reflux. CONCLUSIONS: The proposed neural network model performed better than previous methods for the prediction of asthma exacerbation. We believe that personalized risk scores and analyses of contributing factors can help clinicians better assess the individual's level of disease progression and afford the opportunity to adjust treatment, prevent exacerbation, and improve outcomes.


Asunto(s)
Asma/fisiopatología , Aprendizaje Profundo/normas , Redes Neurales de la Computación , Calidad de Vida/psicología , Progresión de la Enfermedad , Femenino , Humanos , Masculino , Estudios Retrospectivos , Medición de Riesgo , Factores de Riesgo
9.
Ann Rheum Dis ; 78(10): 1371-1378, 2019 10.
Artículo en Inglés | MEDLINE | ID: mdl-31391177

RESUMEN

OBJECTIVE: In the randomised scleroderma: Cyclophosphamide Or Transplantation (SCOT trial) (NCT00114530), myeloablation, followed by haematopoietic stem cell transplantation (HSCT), led to improved clinical outcomes compared with monthly cyclophosphamide (CYC) treatment in systemic sclerosis (SSc). Herein, the study aimed to determine global molecular changes at the whole blood transcript and serum protein levels ensuing from HSCT in comparison to intravenous monthly CYC in 62 participants enrolled in the SCOT study. METHODS: Global transcript studies were performed at pretreatment baseline, 8 months and 26 months postrandomisation using Illumina HT-12 arrays. Levels of 102 proteins were measured in the concomitantly collected serum samples. RESULTS: At the baseline visit, interferon (IFN) and neutrophil transcript modules were upregulated and the cytotoxic/NK module was downregulated in SSc compared with unaffected controls. A paired comparison of the 26 months to the baseline samples revealed a significant decrease of the IFN and neutrophil modules and an increase in the cytotoxic/NK module in the HSCT arm while there was no significant change in the CYC control arm. Also, a composite score of correlating serum proteins with IFN and neutrophil transcript modules, as well as a multilevel analysis showed significant changes in SSc molecular signatures after HSCT while similar changes were not observed in the CYC arm. Lastly, a decline in the IFN and neutrophil modules was associated with an improvement in pulmonary forced vital capacity and an increase in the cytotoxic/NK module correlated with improvement in skin score. CONCLUSION: HSCT contrary to conventional treatment leads to a significant 'correction' in disease-related molecular signatures.


Asunto(s)
Interferones/sangre , Neutrófilos/metabolismo , Esclerodermia Sistémica/genética , Transcriptoma , Acondicionamiento Pretrasplante/métodos , Adulto , Ciclofosfamida/uso terapéutico , Regulación hacia Abajo , Femenino , Trasplante de Células Madre Hematopoyéticas/métodos , Humanos , Masculino , Persona de Mediana Edad , Análisis Multinivel , Agonistas Mieloablativos/uso terapéutico , Ensayos Clínicos Controlados Aleatorios como Asunto , Esclerodermia Sistémica/sangre , Esclerodermia Sistémica/terapia , Trasplante Autólogo , Resultado del Tratamiento , Regulación hacia Arriba
10.
J Biomed Inform ; 85: 149-154, 2018 09.
Artículo en Inglés | MEDLINE | ID: mdl-30081101

RESUMEN

The synergistic effect of drug combination is one of the most desirable properties for treating cancer. However, systematically predicting effective drug combination is a significant challenge. We report here a novel method based on deep belief network to predict drug synergy from gene expression, pathway and the Ontology Fingerprints-a literature derived ontological profile of genes. Using data sets provided by 2015 DREAM competition, our analysis shows that this integrative method outperforms published results from the DREAM website for 4999 drug pairs, demonstrating the feasibility of predicting drug synergy from literature and the -omics data using advanced artificial intelligence approach.


Asunto(s)
Aprendizaje Profundo , Combinación de Medicamentos , Sinergismo Farmacológico , Protocolos de Quimioterapia Combinada Antineoplásica , Línea Celular Tumoral , Biología Computacional , Bases de Datos Farmacéuticas , Perfilación de la Expresión Génica , Ontología de Genes , Redes Reguladoras de Genes , Humanos , Neoplasias/tratamiento farmacológico , Neoplasias/genética
11.
J Biomed Inform ; 84: 11-16, 2018 08.
Artículo en Inglés | MEDLINE | ID: mdl-29908902

RESUMEN

Recently, recurrent neural networks (RNNs) have been applied in predicting disease onset risks with Electronic Health Record (EHR) data. While these models demonstrated promising results on relatively small data sets, the generalizability and transferability of those models and its applicability to different patient populations across hospitals have not been evaluated. In this study, we evaluated an RNN model, RETAIN, over Cerner Health Facts® EMR data, for heart failure onset risk prediction. Our data set included over 150,000 heart failure patients and over 1,000,000 controls from nearly 400 hospitals. Convincingly, RETAIN achieved an AUC of 82% in comparison to an AUC of 79% for logistic regression, demonstrating the power of more expressive deep learning models for EHR predictive modeling. The prediction performance fluctuated across different patient groups and varied from hospital to hospital. Also, we trained RETAIN models on individual hospitals and found that the model can be applied to other hospitals with only about 3.6% of reduction of AUC. Our results demonstrated the capability of RNN for predictive modeling with large and heterogeneous EHR data, and pave the road for future improvements.


Asunto(s)
Aprendizaje Profundo , Registros Electrónicos de Salud , Insuficiencia Cardíaca/diagnóstico , Redes Neurales de la Computación , Anciano , Anciano de 80 o más Años , Algoritmos , Área Bajo la Curva , Estudios de Casos y Controles , Simulación por Computador , Bases de Datos Factuales , Femenino , Humanos , Modelos Logísticos , Masculino , Informática Médica/métodos , Persona de Mediana Edad , Reproducibilidad de los Resultados
12.
BMC Bioinformatics ; 18(Suppl 11): 405, 2017 Oct 03.
Artículo en Inglés | MEDLINE | ID: mdl-28984189

RESUMEN

The 2016 International Conference on Intelligent Biology and Medicine (ICIBM 2016) was held on December 8-10, 2016 in Houston, Texas, USA. ICIBM included eight scientific sessions, four tutorials, one poster session, four highlighted talks and four keynotes that covered topics on 3D genomics structural analysis, next generation sequencing (NGS) analysis, computational drug discovery, medical informatics, cancer genomics, and systems biology. Here, we present a summary of the nine research articles selected from ICIBM 2016 program for publishing in BMC Bioinformatics.


Asunto(s)
Biología , Congresos como Asunto , Internacionalidad , Medicina , Estadística como Asunto , Algoritmos , Variaciones en el Número de Copia de ADN/genética , Humanos , Aprendizaje Automático , Redes Neurales de la Computación , Empalme del ARN/genética , Análisis de Secuencia de ARN
13.
Stat Med ; 36(22): 3461-3474, 2017 Sep 30.
Artículo en Inglés | MEDLINE | ID: mdl-28675924

RESUMEN

In systems biology, it is of great interest to identify new genes that were not previously reported to be associated with biological pathways related to various functions and diseases. Identification of these new pathway-modulating genes does not only promote understanding of pathway regulation mechanisms but also allow identification of novel targets for therapeutics. Recently, biomedical literature has been considered as a valuable resource to investigate pathway-modulating genes. While the majority of currently available approaches are based on the co-occurrence of genes within an abstract, it has been reported that these approaches show only sub-optimal performances because 70% of abstracts contain information only for a single gene. To overcome such limitation, we propose a novel statistical framework based on the concept of ontology fingerprint that uses gene ontology to extract information from large biomedical literature data. The proposed framework simultaneously identifies pathway-modulating genes and facilitates interpreting functions of these new genes. We also propose a computationally efficient posterior inference procedure based on Metropolis-Hastings within Gibbs sampler for parameter updates and the poor man's reversible jump Markov chain Monte Carlo approach for model selection. We evaluate the proposed statistical framework with simulation studies, experimental validation, and an application to studies of pathway-modulating genes in yeast. The R implementation of the proposed model is currently available at https://dongjunchung.github.io/bayesGO/. Copyright © 2017 John Wiley & Sons, Ltd.


Asunto(s)
Teorema de Bayes , Minería de Datos/métodos , Ontología de Genes , Biología de Sistemas/métodos , Algoritmos , Biometría , Simulación por Computador , Dermatoglifia del ADN , Genes , Humanos , Cadenas de Markov , Método de Montecarlo , Reproducibilidad de los Resultados , Saccharomyces cerevisiae/genética , Esfingolípidos/genética
14.
Nucleic Acids Res ; 42(18): e138, 2014 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-25063300

RESUMEN

To enhance our knowledge regarding biological pathway regulation, we took an integrated approach, using the biomedical literature, ontologies, network analyses and experimental investigation to infer novel genes that could modulate biological pathways. We first constructed a novel gene network via a pairwise comparison of all yeast genes' Ontology Fingerprints--a set of Gene Ontology terms overrepresented in the PubMed abstracts linked to a gene along with those terms' corresponding enrichment P-values. The network was further refined using a Bayesian hierarchical model to identify novel genes that could potentially influence the pathway activities. We applied this method to the sphingolipid pathway in yeast and found that many top-ranked genes indeed displayed altered sphingolipid pathway functions, initially measured by their sensitivity to myriocin, an inhibitor of de novo sphingolipid biosynthesis. Further experiments confirmed the modulation of the sphingolipid pathway by one of these genes, PFA4, encoding a palmitoyl transferase. Comparative analysis showed that few of these novel genes could be discovered by other existing methods. Our novel gene network provides a unique and comprehensive resource to study pathway modulations and systems biology in general.


Asunto(s)
Ontología de Genes , Redes Reguladoras de Genes , Teorema de Bayes , Genes Fúngicos , Redes y Vías Metabólicas/genética , PubMed , Esfingolípidos/metabolismo , Levaduras/genética , Levaduras/metabolismo
15.
Res Sq ; 2024 Apr 30.
Artículo en Inglés | MEDLINE | ID: mdl-38746131

RESUMEN

Background: The potential benefits of drug combination synergy in cancer medicine are significant, yet the risks must be carefully managed due to the possibility of increased toxicity. Although artificial intelligence applications have demonstrated notable success in predicting drug combination synergy, several key challenges persist: (1) Existing models often predict average synergy values across a restricted range of testing dosages, neglecting crucial dose amounts and the mechanisms of action of the drugs involved. (2) Many graph-based models rely on static protein-protein interactions, failing to adapt to dynamic and context-dependent networks. This limitation constrains the applicability of current methods. Results: We introduced SAFER, a Sub-hypergraph Attention-based graph model, addressing these issues by incorporating complex relationships among biological knowledge networks and considering dosing effects on subject-specific networks. SAFER outperformed previous models on the benchmark and the independent test set. The analysis of subgraph attention weight for the lung cancer cell line highlighted JAK-STAT signaling pathway, PRDM12, ZNF781, and CDC5L that have been implicated in lung fibrosis. Conclusions: SAFER presents an interpretable framework designed to identify drug-responsive signals. Tailored for comprehending dose effects on subject-specific molecular contexts, our model uniquely captures dose-level drug combination responses. This capability unlocks previously inaccessible avenues of investigation compared to earlier models. Finally, the SAFER framework can be leveraged by future inquiries to investigate molecular networks that uniquely characterize individual patients.

16.
bioRxiv ; 2024 May 22.
Artículo en Inglés | MEDLINE | ID: mdl-38826482

RESUMEN

Background: The cardinal feature of systemic sclerosis (SSc) is skin thickening and tightening. Targetable mechanisms for skin features remain elusive. Drugs successful in treating internal organ manifestations have failed efficacy in skin. Dermal white adipose tissue (DWAT) is amongst the understudied contributors to skin manifestations. This study proposes the role of sine oculis homeobox homolog 1 (SIX1), a gene previously unrecognized as a contributor to dermal lipoatrophy characteristic of early skin fibrosis in SSc. Methods: Skin gene expression of SIX1 was analyzed in the GENISOS and PRESS SSc cohorts. Correlation analysis was performed with Spearman rank analysis. Novel mouse models were developed using the Cre-loxp system to knock out Six1 in all cells and mature adipocytes. Subcutaneous bleomycin was used to model early DWAT atrophy and dermal fibrosis characteristic of SSc. Findings: SIX1 was upregulated in SSc skin, the expression of which correlates with adipose-associated genes and molecular pathways. Genetic deletion of Six1 in all cells in mice challenged with bleomycin abrogated end-stage fibrotic gene expression and dermal adipocyte shrinkage. Adipocyte specific Six1 deletion was able to attenuate the early increase in skin thickness, a hallmark of experimental skin fibrosis. Further studies revealed a link between elevated SIX1 and increased expression of SERPINE1 and its protein PAI-1 which are known pro-fibrotic mediators. Interpretation: This work identifies SIX1 as an early marker of skin fibrosis in SSc. We also demonstrate a causative role of Six1 in skin fibrosis by promoting adipocyte loss and show that deletion of Six1 in adipocytes has the potential of impacting early disease progression.

17.
Artículo en Inglés | MEDLINE | ID: mdl-38520725

RESUMEN

OBJECTIVES: The rapid expansion of biomedical literature necessitates automated techniques to discern relationships between biomedical concepts from extensive free text. Such techniques facilitate the development of detailed knowledge bases and highlight research deficiencies. The LitCoin Natural Language Processing (NLP) challenge, organized by the National Center for Advancing Translational Science, aims to evaluate such potential and provides a manually annotated corpus for methodology development and benchmarking. MATERIALS AND METHODS: For the named entity recognition (NER) task, we utilized ensemble learning to merge predictions from three domain-specific models, namely BioBERT, PubMedBERT, and BioM-ELECTRA, devised a rule-driven detection method for cell line and taxonomy names and annotated 70 more abstracts as additional corpus. We further finetuned the T0pp model, with 11 billion parameters, to boost the performance on relation extraction and leveraged entites' location information (eg, title, background) to enhance novelty prediction performance in relation extraction (RE). RESULTS: Our pioneering NLP system designed for this challenge secured first place in Phase I-NER and second place in Phase II-relation extraction and novelty prediction, outpacing over 200 teams. We tested OpenAI ChatGPT 3.5 and ChatGPT 4 in a Zero-Shot setting using the same test set, revealing that our finetuned model considerably surpasses these broad-spectrum large language models. DISCUSSION AND CONCLUSION: Our outcomes depict a robust NLP system excelling in NER and RE across various biomedical entities, emphasizing that task-specific models remain superior to generic large ones. Such insights are valuable for endeavors like knowledge graph development and hypothesis formulation in biomedical research.

18.
J Mol Med (Berl) ; 2024 Jun 28.
Artículo en Inglés | MEDLINE | ID: mdl-38940937

RESUMEN

The rapidly aging population is consuming more alcohol, leading to increased alcohol-associated acute pancreatitis (AAP) with high mortality. However, the mechanisms remain undefined, and currently there are no effective therapies available. This study aims to elucidate aging- and alcohol-associated spatial transcriptomic signature by establishing an aging AAP mouse model and applying Visium spatial transcriptomics for understanding of the mechanisms in the context of the pancreatic tissue. Upon alcohol diet feeding and caerulein treatment, aging mice (18 months) developed significantly more severe AAP with 5.0-fold increase of injury score and 2.4-fold increase of amylase compared to young mice (3 months). Via Visium spatial transcriptomics, eight distinct tissue clusters were revealed from aggregated transcriptomes of aging and young AAP mice: five acinar, two stromal, and one islet, which were then merged into three clusters: acinar, stromal, and islet for the comparative analysis. Compared to young AAP mice, > 1300 differentially expressed genes (DEGs) and approximately 3000 differentially regulated pathways were identified in aging AAP mice. The top five DEGs upregulated in aging AAP mice include Mmp8, Ppbp, Serpina3m, Cxcl13, and Hamp with heterogeneous distributions among the clusters. Taken together, this study demonstrates spatial heterogeneity of inflammatory processes in aging AAP mice, offering novel insights into the mechanisms and potential drivers for AAP development. KEY MESSAGES: Mechanisms regarding high mortality of AAP in aging remain undefined. An aging AAP mouse model was developed recapturing clinical exhibition in humans. Spatial transcriptomics identified contrasted DEGs in aging vs. young AAP mice. Top five DEGs were Mmp8, Ppbp, Serpina3m, Cxcl13, and Hamp in aging vs. young AAP mice. Our findings shed insights for identification of molecular drivers in aging AAP.

19.
NPJ Vaccines ; 9(1): 70, 2024 Apr 01.
Artículo en Inglés | MEDLINE | ID: mdl-38561339

RESUMEN

Human cytomegalovirus (HCMV) is a leading infectious cause of birth defects and the most common opportunistic infection that causes life-threatening diseases post-transplantation; however, an effective vaccine remains elusive. V160 is a live-attenuated replication defective HCMV vaccine that showed a 42.4% efficacy against primary HCMV infection among seronegative women in a phase 2b clinical trial. Here, we integrated the multicolor flow cytometry, longitudinal T cell receptor (TCR) sequencing, and single-cell RNA/TCR sequencing approaches to characterize the magnitude, phenotype, and functional quality of human T cell responses to V160. We demonstrated that V160 de novo induces IE-1 and pp65 specific durable polyfunctional effector CD8 T cells that are comparable to those induced by natural HCMV infection. We identified a variety of V160-responsive T cell clones which exhibit distinctive "transient" and "durable" expansion kinetics, and revealed a transcriptional signature that marks durable CD8 T cells post-vaccination. Our study enhances the understanding of human T-cell immune responses to V160 vaccination.

20.
bioRxiv ; 2024 Feb 19.
Artículo en Inglés | MEDLINE | ID: mdl-38464054

RESUMEN

Alternative splicing is an important cellular process in eukaryotes, altering pre-mRNA to yield multiple protein isoforms from a single gene. However, our understanding of the impact of alternative splicing events on protein structures is currently constrained by a lack of sufficient protein structural data. To address this limitation, we employed AlphaFold 2, a cutting-edge protein structure prediction tool, to conduct a comprehensive analysis of alternative splicing for approximately 3,000 human genes, providing valuable insights into its impact on the protein structural. Our investigation employed state of the art high-performance computing infrastructure to systematically characterize structural features in alternatively spliced regions and identified changes in protein structure following alternative splicing events. Notably, we found that alternative splicing tends to alter the structure of residues primarily located in coils and beta-sheets. Our research highlighted a significant enrichment of loops and highly exposed residues within human alternatively spliced regions. Specifically, our examination of the Septin-9 protein revealed potential associations between loops and alternative splicing, providing insights into its evolutionary role. Furthermore, our analysis uncovered two missense mutations in the Tau protein that could influence alternative splicing, potentially contributing to the pathogenesis of Alzheimer's disease. In summary, our work, through a thorough statistical analysis of extensive protein structural data, sheds new light on the intricate relationship between alternative splicing, evolution, and human disease.

SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA