Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 37
Filtrar
1.
Bioinformatics ; 40(4)2024 Mar 29.
Artículo en Inglés | MEDLINE | ID: mdl-38514400

RESUMEN

MOTIVATION: Large Language Models (LLMs) have the potential to revolutionize the field of Natural Language Processing, excelling not only in text generation and reasoning tasks but also in their ability for zero/few-shot learning, swiftly adapting to new tasks with minimal fine-tuning. LLMs have also demonstrated great promise in biomedical and healthcare applications. However, when it comes to Named Entity Recognition (NER), particularly within the biomedical domain, LLMs fall short of the effectiveness exhibited by fine-tuned domain-specific models. One key reason is that NER is typically conceptualized as a sequence labeling task, whereas LLMs are optimized for text generation and reasoning tasks. RESULTS: We developed an instruction-based learning paradigm that transforms biomedical NER from a sequence labeling task into a generation task. This paradigm is end-to-end and streamlines the training and evaluation process by automatically repurposing pre-existing biomedical NER datasets. We further developed BioNER-LLaMA using the proposed paradigm with LLaMA-7B as the foundational LLM. We conducted extensive testing on BioNER-LLaMA across three widely recognized biomedical NER datasets, consisting of entities related to diseases, chemicals, and genes. The results revealed that BioNER-LLaMA consistently achieved higher F1-scores ranging from 5% to 30% compared to the few-shot learning capabilities of GPT-4 on datasets with different biomedical entities. We show that a general-domain LLM can match the performance of rigorously fine-tuned PubMedBERT models and PMC-LLaMA, biomedical-specific language model. Our findings underscore the potential of our proposed paradigm in developing general-domain LLMs that can rival SOTA performances in multi-task, multi-domain scenarios in biomedical and health applications. AVAILABILITY AND IMPLEMENTATION: Datasets and other resources are available at https://github.com/BIDS-Xu-Lab/BioNER-LLaMA.


Asunto(s)
Camélidos del Nuevo Mundo , Aprendizaje Profundo , Animales , Lenguaje , Procesamiento de Lenguaje Natural
2.
Mol Biol Evol ; 41(3)2024 Mar 01.
Artículo en Inglés | MEDLINE | ID: mdl-38376487

RESUMEN

The blue whale, Balaenoptera musculus, is the largest animal known to have ever existed, making it an important case study in longevity and resistance to cancer. To further this and other blue whale-related research, we report a reference-quality, long-read-based genome assembly of this fascinating species. We assembled the genome from PacBio long reads and utilized Illumina/10×, optical maps, and Hi-C data for scaffolding, polishing, and manual curation. We also provided long read RNA-seq data to facilitate the annotation of the assembly by NCBI and Ensembl. Additionally, we annotated both haplotypes using TOGA and measured the genome size by flow cytometry. We then compared the blue whale genome with other cetaceans and artiodactyls, including vaquita (Phocoena sinus), the world's smallest cetacean, to investigate blue whale's unique biological traits. We found a dramatic amplification of several genes in the blue whale genome resulting from a recent burst in segmental duplications, though the possible connection between this amplification and giant body size requires further study. We also discovered sites in the insulin-like growth factor-1 gene correlated with body size in cetaceans. Finally, using our assembly to examine the heterozygosity and historical demography of Pacific and Atlantic blue whale populations, we found that the genomes of both populations are highly heterozygous and that their genetic isolation dates to the last interglacial period. Taken together, these results indicate how a high-quality, annotated blue whale genome will serve as an important resource for biology, evolution, and conservation research.


Asunto(s)
Balaenoptera , Neoplasias , Animales , Balaenoptera/genética , Duplicaciones Segmentarias en el Genoma , Genoma , Demografía , Neoplasias/genética
3.
BMC Bioinformatics ; 24(1): 412, 2023 Nov 01.
Artículo en Inglés | MEDLINE | ID: mdl-37915001

RESUMEN

BACKGROUND: The PubMed archive contains more than 34 million articles; consequently, it is becoming increasingly difficult for a biomedical researcher to keep up-to-date with different knowledge domains. Computationally efficient and interpretable tools are needed to help researchers find and understand associations between biomedical concepts. The goal of literature-based discovery (LBD) is to connect concepts in isolated literature domains that would normally go undiscovered. This usually takes the form of an A-B-C relationship, where A and C terms are linked through a B term intermediate. Here we describe Serial KinderMiner (SKiM), an LBD algorithm for finding statistically significant links between an A term and one or more C terms through some B term intermediate(s). The development of SKiM is motivated by the observation that there are only a few LBD tools that provide a functional web interface, and that the available tools are limited in one or more of the following ways: (1) they identify a relationship but not the type of relationship, (2) they do not allow the user to provide their own lists of B or C terms, hindering flexibility, (3) they do not allow for querying thousands of C terms (which is crucial if, for instance, the user wants to query connections between a disease and the thousands of available drugs), or (4) they are specific for a particular biomedical domain (such as cancer). We provide an open-source tool and web interface that improves on all of these issues. RESULTS: We demonstrate SKiM's ability to discover useful A-B-C linkages in three control experiments: classic LBD discoveries, drug repurposing, and finding associations related to cancer. Furthermore, we supplement SKiM with a knowledge graph built with transformer machine-learning models to aid in interpreting the relationships between terms found by SKiM. Finally, we provide a simple and intuitive open-source web interface ( https://skim.morgridge.org ) with comprehensive lists of drugs, diseases, phenotypes, and symptoms so that anyone can easily perform SKiM searches. CONCLUSIONS: SKiM is a simple algorithm that can perform LBD searches to discover relationships between arbitrary user-defined concepts. SKiM is generalized for any domain, can perform searches with many thousands of C term concepts, and moves beyond the simple identification of an existence of a relationship; many relationships are given relationship type labels from our knowledge graph.


Asunto(s)
Algoritmos , Neoplasias , Humanos , PubMed , Conocimiento , Descubrimiento del Conocimiento
4.
Bioinformatics ; 2023 Sep 05.
Artículo en Inglés | MEDLINE | ID: mdl-37669123

RESUMEN

MOTIVATION: Automated extraction of participants, intervention, comparison/control, and outcome (PICO) from the randomized controlled trial (RCT) abstracts is important for evidence synthesis. Previous studies have demonstrated the feasibility of applying natural language processing (NLP) for PICO extraction. However, the performance is not optimal due to the complexity of PICO information in RCT abstracts and the challenges involved in their annotation. RESULTS: We propose a two-step NLP pipeline to extract PICO elements from RCT abstracts: (i) sentence classification using a prompt-based learning model and (ii) PICO extraction using a named entity recognition (NER) model. First, the sentences in abstracts were categorized into four sections namely background, methods, results, and conclusions. Next, the NER model was applied to extract the PICO elements from the sentences within the title and methods sections that include >96% of PICO information. We evaluated our proposed NLP pipeline on three datasets, the EBM-NLPmoddataset, a randomly selected and reannotated dataset of 500 RCT abstracts from the EBM-NLP corpus, a dataset of 150 COVID-19 RCT abstracts, and a dataset of 150 Alzheimer's disease (AD) RCT abstracts. The end-to-end evaluation reveals that our proposed approach achieved an overall micro F1 score of 0.833 on the EBM-NLPmod dataset, 0.928 on the COVID-19 dataset, and 0.899 on the AD dataset when measured at the token-level and an overall micro F1 score of 0.712 on EBM-NLPmod dataset, 0.850 on the COVID-19 dataset, and 0.805 on the AD dataset when measured at the entity-level. AVAILABILITY: Our codes and datasets are publicly available at https://github.com/BIDS-Xu-Lab/section_specific_annotation_of_PICO. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

5.
bioRxiv ; 2023 Jun 01.
Artículo en Inglés | MEDLINE | ID: mdl-37397987

RESUMEN

Background: The PubMed database contains more than 34 million articles; consequently, it is becoming increasingly difficult for a biomedical researcher to keep up-to-date with different knowledge domains. Computationally efficient and interpretable tools are needed to help researchers find and understand associations between biomedical concepts. The goal of literature-based discovery (LBD) is to connect concepts in isolated literature domains that would normally go undiscovered. This usually takes the form of an A-B-C relationship, where A and C terms are linked through a B term intermediate. Here we describe Serial KinderMiner (SKiM), an LBD algorithm for finding statistically significant links between an A term and one or more C terms through some B term intermediate(s). The development of SKiM is motivated by the the observation that there are only a few LBD tools that provide a functional web interface, and that the available tools are limited in one or more of the following ways: 1) they identify a relationship but not the type of relationship, 2) they do not allow the user to provide their own lists of B or C terms, hindering flexibility, 3) they do not allow for querying thousands of C terms (which is crucial if, for instance, the user wants to query connections between a disease and the thousands of available drugs), or 4) they are specific for a particular biomedical domain (such as cancer). We provide an open-source tool and web interface that improves on all of these issues. Results: We demonstrate SKiM's ability to discover useful A-B-C linkages in three control experiments: classic LBD discoveries, drug repurposing, and finding associations related to cancer. Furthermore, we supplement SKiM with a knowledge graph built with transformer machine-learning models to aid in interpreting the relationships between terms found by SKiM. Finally, we provide a simple and intuitive open-source web interface ( https://skim.morgridge.org ) with comprehensive lists of drugs, diseases, phenotypes, and symptoms so that anyone can easily perform SKiM searches. Conclusions: SKiM is a simple algorithm that can perform LBD searches to discover relationships between arbitrary user-defined concepts. SKiM is generalized for any domain, can perform searches with many thousands of C term concepts, and moves beyond the simple identification of an existence of a relationship; many relationships are given relationship type labels from our knowledge graph.

6.
J Biomed Inform ; 142: 104343, 2023 06.
Artículo en Inglés | MEDLINE | ID: mdl-36935011

RESUMEN

Clinical documentation in electronic health records contains crucial narratives and details about patients and their care. Natural language processing (NLP) can unlock the information conveyed in clinical notes and reports, and thus plays a critical role in real-world studies. The NLP Working Group at the Observational Health Data Sciences and Informatics (OHDSI) consortium was established to develop methods and tools to promote the use of textual data and NLP in real-world observational studies. In this paper, we describe a framework for representing and utilizing textual data in real-world evidence generation, including representations of information from clinical text in the Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM), the workflow and tools that were developed to extract, transform and load (ETL) data from clinical notes into tables in OMOP CDM, as well as current applications and specific use cases of the proposed OHDSI NLP solution at large consortia and individual institutions with English textual data. Challenges faced and lessons learned during the process are also discussed to provide valuable insights for researchers who are planning to implement NLP solutions in real-world studies.


Asunto(s)
Ciencia de los Datos , Informática Médica , Humanos , Registros Electrónicos de Salud , Procesamiento de Lenguaje Natural , Narración
7.
BMC Biol ; 20(1): 245, 2022 11 08.
Artículo en Inglés | MEDLINE | ID: mdl-36344967

RESUMEN

BACKGROUND: The Nile rat (Avicanthis niloticus) is an important animal model because of its robust diurnal rhythm, a cone-rich retina, and a propensity to develop diet-induced diabetes without chemical or genetic modifications. A closer similarity to humans in these aspects, compared to the widely used Mus musculus and Rattus norvegicus models, holds the promise of better translation of research findings to the clinic. RESULTS: We report a 2.5 Gb, chromosome-level reference genome assembly with fully resolved parental haplotypes, generated with the Vertebrate Genomes Project (VGP). The assembly is highly contiguous, with contig N50 of 11.1 Mb, scaffold N50 of 83 Mb, and 95.2% of the sequence assigned to chromosomes. We used a novel workflow to identify 3613 segmental duplications and quantify duplicated genes. Comparative analyses revealed unique genomic features of the Nile rat, including some that affect genes associated with type 2 diabetes and metabolic dysfunctions. We discuss 14 genes that are heterozygous in the Nile rat or highly diverged from the house mouse. CONCLUSIONS: Our findings reflect the exceptional level of genomic resolution present in this assembly, which will greatly expand the potential of the Nile rat as a model organism.


Asunto(s)
Diabetes Mellitus Tipo 2 , Humanos , Animales , Haplotipos , Diabetes Mellitus Tipo 2/genética , Murinae , Genoma , Genómica
8.
Diagnostics (Basel) ; 12(8)2022 Aug 08.
Artículo en Inglés | MEDLINE | ID: mdl-36010264

RESUMEN

Multiple sclerosis (MS), a chronic autoimmune disorder, affects the central nervous system of many young adults. More than half of MS patients develop cognition problems. Although several genomic and transcriptomic studies are currently reported in MS cognitive impairment, a comprehensive repository dealing with all the experimental data is still underdeveloped. In this study, we combined text mining, gene regulation, pathway analysis, and genome-wide association studies (GWAS) to identify miRNA biomarkers to explore the cognitive dysfunction in MS, and to understand the genomic etiology of the disease. We first identified the dysregulated miRNAs associated with MS and cognitive dysfunction using PubTator (text mining), HMDD (experimental associations), miR2Disease, and PhenomiR database (differentially expressed miRNAs). Our results suggest that miRNAs such as hsa-mir-148b-3p, hsa-mir-7b-5p, and hsa-mir-7a-5p are commonly associated with MS and cognitive dysfunction. Next, we retrieved GWAS signals from GWAS Catalog, and analyzed the enrichment analysis of association signals in genes/miRNAs and their association networks. Then, we identified susceptible genetic loci, rs17119 (chromosome 6; p = 1 × 10-10), rs1843938 (chromosome 7; p = 1 × 10-10), and rs11637611 (chromosome 15; p = 1.00 × 10-15), associated with significant genetic risk. Lastly, we conducted a pathway analysis for the susceptible genetic variants and identified novel risk pathways. The ECM receptor signaling pathway (p = 3.98 × 10-8) and PI3K/Akt signaling pathway (p = 5.98 × 10-5) were found to be associated with differentially expressed miRNA biomarkers.

9.
Methods Mol Biol ; 2496: 1-16, 2022.
Artículo en Inglés | MEDLINE | ID: mdl-35713856

RESUMEN

The published biomedical articles are the best source of knowledge to understand the importance of biomedical entities such as disease, drugs, and their role in different patient population groups. The number of biomedical literature available and being published is increasing at an exponential rate with the use of large scale experimental techniques. Manual extraction of such information is becoming extremely difficult because of the huge number of biomedical literature available. Alternatively, text mining approaches receive much interest within biomedicine by providing automatic extraction of such information in more structured format from the unstructured biomedical text. Here, a text mining protocol to extract the patient population information, to identify the disease and drug mentions in PubMed titles and abstracts, and a simple information retrieval approach to retrieve a list of relevant documents for a user query are presented. The text mining protocol presented in this chapter is useful for retrieving information on drugs for patients with a specific disease. The protocol covers three major text mining tasks, namely, information retrieval, information extraction, and knowledge discovery.


Asunto(s)
Minería de Datos , Publicaciones , Minería de Datos/métodos , Humanos , PubMed
10.
Methods Mol Biol ; 2496: 111-121, 2022.
Artículo en Inglés | MEDLINE | ID: mdl-35713861

RESUMEN

Multiple sclerosis, a disease of central nervous system leads to potential disability. In the USA, one million cases are diagnosed with multiple sclerosis in 2019. Multiple sclerosis is identified as one of the diseases causing global burden. Cognitive disorder is highly prevalent among 43-70% of multiple sclerosis patients. However, treating cognitive disorder in multiple sclerosis patients is mostly ignored and this leads to several complications. We utilized various expert curated resources to identify potential drugs for multiple sclerosis and cognitive disorder, with specific focus on identifying drugs that are capable of treating both the conditions. We used simple text mining techniques to compile two databases, disease-drug association database and gene-drug interaction database from various existing standard resources. Our study suggests four drugs, Baclofen, Levodopa, Minocycline, and Vitamin B12, for treating both multiple sclerosis and cognitive disorder. In addition, our approach suggests six drugs for multiple sclerosis and 10 drugs for cognitive disorder. We obtained pharmacologist opinion on the drugs suggested for each condition and provided literature evidence for our claim. Here, we present our computational approach as a protocol such that it can be applied to other comorbid diseases that did not gain much attention so far.


Asunto(s)
Trastornos del Conocimiento , Esclerosis Múltiple , Cognición , Trastornos del Conocimiento/etiología , Minería de Datos/métodos , Bases de Datos Factuales , Humanos , Esclerosis Múltiple/complicaciones , Esclerosis Múltiple/tratamiento farmacológico
11.
Methods Mol Biol ; 2496: 237-258, 2022.
Artículo en Inglés | MEDLINE | ID: mdl-35713868

RESUMEN

Drug-drug interactions (DDIs) and adverse drug reactions (ADRs) occur during the pharmacotherapy of multiple comorbidities and in susceptible individuals. DDIs and ADRs limit the therapeutic outcomes in pharmacotherapy. DDIs and ADRs have significant impact on patients' life and health care cost. Hence, knowledge of DDI and ADRs is required for providing better clinical outcomes to patients. Various approaches are developed by the scientific community to document and report the occurrences of DDIs and ADRs through scientific publications. Due to the enormously increasing number of publications and the requirement of updated information on DDIs and ADRs, manual retrieval of data is time consuming and laborious. Various automated techniques are developed to get information on DDIs and ADRs. One such technique is text mining of DDIs and ADRs from published biomedical literature in PubMed. Here, we present a recently developed text mining protocol for predicting DDIs and ADRs from PubMed abstracts.


Asunto(s)
Efectos Colaterales y Reacciones Adversas Relacionados con Medicamentos , Comorbilidad , Minería de Datos/métodos , Interacciones Farmacológicas , Efectos Colaterales y Reacciones Adversas Relacionados con Medicamentos/diagnóstico , Humanos , PubMed
12.
Methods Mol Biol ; 2496: 259-282, 2022.
Artículo en Inglés | MEDLINE | ID: mdl-35713869

RESUMEN

Drug-drug interactions (DDIs) and adverse drug reactions (ADR) are experienced by many patients, especially by elderly population due to their multiple comorbidities and polypharmacy. Databases such as PubMed contain hundreds of abstracts with DDI and ADR information. PubMed is being updated every day with thousands of abstracts. Therefore, manually retrieving the data and extracting the relevant information is tedious task. Hence, automated text mining approaches are required to retrieve DDI and ADR information from PubMed. Recently we developed a hybrid approach for predicting DDI and ADR information from PubMed. There are many other existing approaches for retrieving DDI and ADR information from PubMed. However, none of the approaches are meant for retrieving DDI and ADR specific to patient population, gender, pharmacokinetics, and pharmacodynamics. Here, we present a text mining protocol which is based on our recent work for retrieving DDI and ADR information specific to patient population, gender, pharmacokinetics, and pharmacodynamics from PubMed.


Asunto(s)
Efectos Colaterales y Reacciones Adversas Relacionados con Medicamentos , Anciano , Minería de Datos/métodos , Bases de Datos Factuales , Interacciones Farmacológicas , Humanos , PubMed
13.
Methods Mol Biol ; 2496: 283-299, 2022.
Artículo en Inglés | MEDLINE | ID: mdl-35713870

RESUMEN

Text mining is an important research area to be explored in terms of understanding disease associations and have an insight in disease comorbidities. The reason for comorbid occurrence in any patient may be genetic or molecular interference from any other processes. Comorbidity and multimorbidity may be technically different, yet still are inseparable in studies. They have overlapping nature of associations and hence can be integrated for a more rational approach. The association rule generally used to determine comorbidity may also be helpful in novel knowledge prediction or may even serve as an important tool of assessment in surgical cases. Another approach of interest may be to utilize biological vocabulary resources like UMLS/MeSH across a patient health information and analyze the interrelationship between different health conditions. The protocol presented here can be utilized for understanding the disease associations and analyze at an extensive level.


Asunto(s)
Indización y Redacción de Resúmenes , Medical Subject Headings , Minería de Datos , Humanos , Procesamiento de Lenguaje Natural , PubMed
14.
J Am Med Inform Assoc ; 28(6): 1159-1167, 2021 06 12.
Artículo en Inglés | MEDLINE | ID: mdl-33544847

RESUMEN

OBJECTIVE: Drug-drug interactions (DDIs) can result in adverse and potentially life-threatening health consequences; however, it is challenging to predict potential DDIs in advance. We introduce a new computational approach to comprehensively assess the drug pairs which may be involved in specific DDI types by combining information from large-scale gene expression (984 transcriptomic datasets), molecular structure (2159 drugs), and medical claims (150 million patients). MATERIALS AND METHODS: Features were integrated using ensemble machine learning techniques, and we evaluated the DDIs predicted with a large hospital-based medical records dataset. Our pipeline integrates information from >30 different resources, including >10 000 drugs and >1.7 million drug-gene pairs. We applied our technique to predict interactions between 37 611 drug pairs used to treat psoriasis and its comorbidities. RESULTS: Our approach achieves >0.9 area under the receiver operator curve (AUROC) for differentiating 11 861 known DDIs from 25 750 non-DDI drug pairs. Significantly, we demonstrate that the novel DDIs we predict can be confirmed through independent data sources and supported using clinical medical records. CONCLUSIONS: By applying machine learning and taking advantage of molecular, genomic, and health record data, we are able to accurately predict potential new DDIs that can have an impact on public health.


Asunto(s)
Preparaciones Farmacéuticas , Psoriasis , Interacciones Farmacológicas , Humanos , Psoriasis/tratamiento farmacológico , Psoriasis/genética
15.
J Pharmacol Toxicol Methods ; 106: 106932, 2020.
Artículo en Inglés | MEDLINE | ID: mdl-33091537

RESUMEN

Alzheimer's disease (AD) is regarded as one of the significant health burdens, as the prevalence is raising worldwide and gradually reaching to epidemic proportions. Consequently, a number of scientific investigations have been initiated to derive therapeutics to combat AD with a concurrent advancement in pharmacological methods and experimental models. Whilst, the available experimental pharmacological approaches both in vivo and in vitro led to the development of AD therapeutics, the precise manner by which experimental models mimic either one or more biomarkers of human pathology of AD is gaining scientific attentions. Caenorhabditis elegans (C. elegans) has been regarded as an emerging model for various reasons, including its high similarities with the biomarkers of human AD. Our review supports the versatile nature of C. elegans and collates that it is a well-suited model to elucidate various molecular mechanisms by which AD therapeutics elicit their pharmacological effects. It is apparent that C. elegans is capable of establishing the pathological processes that links the endoplasmic reticulum and mitochondria dysfunctions in AD, exploring novel molecular cascades of AD pathogenesis and underpinning causal and consequential changes in the associated proteins and genes. In summary, C. elegans is a unique and feasible model for the screening of anti-Alzheimer's therapeutics and has the potential for further scientific exploration.


Asunto(s)
Enfermedad de Alzheimer/tratamiento farmacológico , Caenorhabditis elegans/genética , Enfermedad de Alzheimer/genética , Animales , Animales Modificados Genéticamente , Proteínas de Caenorhabditis elegans/genética , Modelos Animales de Enfermedad , Evaluación Preclínica de Medicamentos/métodos , Estudios de Factibilidad , Humanos
16.
Methods Mol Biol ; 2074: 13-34, 2020.
Artículo en Inglés | MEDLINE | ID: mdl-31583627

RESUMEN

Proteins perform their functions by interacting with other proteins. Protein-protein interaction (PPI) is critical for understanding the functions of individual proteins, the mechanisms of biological processes, and the disease mechanisms. High-throughput experiments accumulated a huge number of PPIs in PubMed articles, and their extraction is possible only through automated approaches. The standard text-mining protocol includes four major tasks, namely, recognizing protein mentions, normalizing protein names and aliases to unique identifiers such as gene symbol, extracting PPIs, and visualizing the PPI network using Cytoscape or other visualization tools. Each task is challenging and has been revised over several years to improve the performance. We present a protocol based on our hybrid approaches and show the possibility of presenting each task as an independent web-based tool, NAGGNER for protein name recognition, ProNormz for protein name normalization, PPInterFinder for PPI extraction, and HPIminer for PPI network visualization. The protocol is specific to human but can be generalized to other organisms. We include KinderMiner, our most recent text-mining tool that predicts PPIs by retrieving significant co-occurring protein pairs. The algorithm is simple, easy to implement, and generalizable to other biological challenges.


Asunto(s)
Minería de Datos , Algoritmos , Biología Computacional/métodos , Bases de Datos de Proteínas , Mapeo de Interacción de Proteínas , Mapas de Interacción de Proteínas , Programas Informáticos
17.
Front Genet ; 10: 304, 2019.
Artículo en Inglés | MEDLINE | ID: mdl-31031798

RESUMEN

We recently conducted a large association analysis to compare the genetic profiles between patients with psoriatic arthritis (PsA) and cutaneous-only psoriasis (PsC). Despite including over 7,000 genotyped patients, only the MHC achieved genome-wide significance. In this study, we hypothesized that appropriate epigenomic elements (H3K27ac marks for active enhancers) can guide us to reveal valuable information about the loci with suggestive evidence of association. Our aim is to investigate these loci and explore how they may lead to the development of PsA. We evaluated this potential by investigating the genes connected with these loci from the perspective of pharmacogenomics and gene expression. We illustrated that markers with suggestive evidence of association outside the MHC region are enriched in H3K27ac marks for osteoblast and chondrogenic differentiated cells; using pharmacogenomics resources, we showed that genes near these markers are targeted by existing drugs used to treat psoriatic arthritis. Significantly, six of the ten suggestive significant loci overlapping the regulatory elements encompass genes differentially expressed (FDR < 5%) in differentiated osteoblasts, including genes participating in the Wnt signaling such as RUNX1, FUT8, and CTNNAL1. Our approach shows that epigenomic information can be used as cost-effective approach to enhance the inferences for GWAS results, especially in situations when few genome-wide significant loci are available. Our results also point the way to more directed investigations comparing the genetics of PsA and PsC.

18.
J Immunol ; 202(7): 2121-2130, 2019 04 01.
Artículo en Inglés | MEDLINE | ID: mdl-30745462

RESUMEN

Systemic lupus erythematosus (SLE) is a complex autoimmune disease in which 70% of patients experience disfiguring skin inflammation (grouped under the rubric of cutaneous lupus erythematosus [CLE]). There are limited treatment options for SLE and no Food and Drug Administration-approved therapies for CLE. Studies have revealed that IFNs are important mediators for SLE and CLE, but the mechanisms by which IFNs lead to disease are still poorly understood. We aimed to investigate how IFN responses in SLE keratinocytes contribute to development of CLE. A cohort of 72 RNA sequencing samples from 14 individuals (seven SLE and seven healthy controls) were analyzed to study the transcriptomic effects of type I and type II IFNs on SLE versus control keratinocytes. In-depth analysis of the IFN responses was conducted. Bioinformatics and functional assays were conducted to provide implications for the change of IFN response. A significant hypersensitive response to IFNs was identified in lupus keratinocytes, including genes (IFIH1, STAT1, and IRF7) encompassed in SLE susceptibility loci. Binding sites for the transcription factor PITX1 were enriched in genes that exhibit IFN-sensitive responses. PITX1 expression was increased in CLE lesions based on immunohistochemistry, and by using small interfering RNA knockdown, we illustrated that PITX1 was required for upregulation of IFN-regulated genes in vitro. SLE patients exhibit increased IFN signatures in their skin secondary to increased production and a robust, skewed IFN response that is regulated by PITX1. Targeting these exaggerated pathways may prove to be beneficial to prevent and treat hyperinflammatory responses in SLE skin.


Asunto(s)
Regulación de la Expresión Génica/inmunología , Interferones/inmunología , Queratinocitos/inmunología , Lupus Eritematoso Cutáneo/inmunología , Factores de Transcripción Paired Box/inmunología , Adulto , Femenino , Humanos , Masculino
19.
J Invest Dermatol ; 139(7): 1480-1489, 2019 07.
Artículo en Inglés | MEDLINE | ID: mdl-30641038

RESUMEN

Atopic dermatitis (AD) affects up to 20% of children and adults worldwide. To gain a deeper understanding of the pathophysiology of AD, we conducted a large-scale transcriptomic study of AD with deeply sequenced RNA-sequencing samples using long (126-bp) paired-end reads. In addition to the comparisons against previous transcriptomic studies, we conducted in-depth analysis to obtain a high-resolution view of the global architecture of the AD transcriptome and contrasted it with that of psoriasis from the same cohort. By using 147 RNA samples in total, we found striking correlation between dysregulated genes in lesional psoriasis and lesional AD skin with 81% of AD dysregulated genes being shared with psoriasis. However, we described disease-specific molecular and cellular features, with AD skin showing dominance of IL-13 pathways, but with near undetectable IL-4 expression. We also demonstrated greater disease heterogeneity and larger proportion of dysregulated long noncoding RNAs in AD, and illustrated the translational impact, including skin-type classification and drug-target prediction. This study is by far the largest study comparing the AD and psoriasis transcriptomes using RNA sequencing and demonstrating the shared inflammatory components, as well as specific discordant cytokine signatures of these two skin diseases.


Asunto(s)
Dermatitis Atópica/inmunología , Interleucina-13/metabolismo , Especificidad de Órganos/genética , Psoriasis/inmunología , ARN/genética , Piel/metabolismo , Células Th2/inmunología , Estudios de Cohortes , Dermatitis Atópica/genética , Perfilación de la Expresión Génica , Humanos , Interleucina-13/genética , Interleucina-4/metabolismo , Psoriasis/genética , ARN Largo no Codificante/genética , Análisis de Secuencia de ARN , Transducción de Señal , Piel/patología , Transcriptoma
20.
Infect Genet Evol ; 67: 101-111, 2019 01.
Artículo en Inglés | MEDLINE | ID: mdl-30396000

RESUMEN

Ornithine decarboxylase (ODC) is an immediate precursor of polyamine biosynthesis in Serratia marcescens and a potential target for inhibition of its growth. We predicted the 3D structural conformation of ODC enzyme and validated it using MDS in our previous study. In this current study, the potential inhibitors of ODC were obtained by virtual screening of potential inhibitors from ZINC database and studied in depth for their different binding pose. Among the ten virtually screened inhibitors, Conessine exhibited the best binding with ODC and its inhibition property was studied further by MDS studies. The natural compound conessine is isolated from plant Holarrhena antidysenterica and it is studied against ODC of Serratia marcenses for its inhibitory potentials. This revealed unforeseen twisted position in root mean square fluctuation (RMSF) and ODC modelled conformation that influenced ligand binding. Both predicted model and ligand bound model were compared and found to be stable with Root Mean Square Deviation (RMSD) of approximately 7 nm and 0.25 nm to that of crystallographic structure over simulation time of 55 ns and 70 ns respectively. This work paves the way for future development of new drugs against nosocomial diseases caused by Serratia marcescens.


Asunto(s)
Alcaloides/química , Alcaloides/farmacología , Antibacterianos/química , Antibacterianos/farmacología , Farmacorresistencia Bacteriana Múltiple/efectos de los fármacos , Simulación de Dinámica Molecular , Serratia marcescens/efectos de los fármacos , Proteínas Bacterianas/antagonistas & inhibidores , Proteínas Bacterianas/química , Sitios de Unión , Dominio Catalítico , Evaluación Preclínica de Medicamentos , Enlace de Hidrógeno , Ligandos , Conformación Molecular , Unión Proteica , Mapeo de Interacción de Proteínas , Mapas de Interacción de Proteínas , Relación Estructura-Actividad Cuantitativa
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...