RESUMO
Stroke is the leading cause of death and disability worldwide. Novel and effective therapies for ischemic stroke are urgently needed. Here, we report that melatonin receptor 1A (MT1) agonist ramelteon is a neuroprotective drug candidate as demonstrated by comprehensive experimental models of ischemic stroke, including a middle cerebral artery occlusion (MCAO) mouse model of cerebral ischemia in vivo, organotypic hippocampal slice cultures ex vivo, and cultured neurons in vitro; the neuroprotective effects of ramelteon are diminished in MT1-knockout (KO) mice and MT1-KO cultured neurons. For the first time, we report that the MT1 receptor is significantly depleted in the brain of MCAO mice, and ramelteon treatment significantly recovers the brain MT1 losses in MCAO mice, which is further explained by the Connectivity Map L1000 bioinformatic analysis that shows gene-expression signatures of MCAO mice are negatively connected to melatonin receptor agonist like Ramelteon. We demonstrate that ramelteon improves the cerebral blood flow signals in ischemic stroke that is potentially mediated, at least, partly by mechanisms of activating endothelial nitric oxide synthase. Our results also show that the neuroprotection of ramelteon counteracts reactive oxygen species-induced oxidative stress and activates the nuclear factor erythroid 2-related factor 2/heme oxygenase-1 pathway. Ramelteon inhibits the mitochondrial and autophagic death pathways in MCAO mice and cultured neurons, consistent with gene set enrichment analysis from a bioinformatics perspective angle. Our data suggest that Ramelteon is a potential neuroprotective drug candidate, and MT1 is the neuroprotective target for ischemic stroke, which provides new insights into stroke therapy. MT1-KO mice and cultured neurons may provide animal and cellular models of accelerated ischemic damage and neuronal cell death.
Assuntos
Isquemia Encefálica , Indenos , AVC Isquêmico , Melatonina , Fármacos Neuroprotetores , Acidente Vascular Cerebral , Animais , Camundongos , AVC Isquêmico/tratamento farmacológico , Receptor MT1 de Melatonina/agonistas , Neuroproteção , Fármacos Neuroprotetores/farmacologia , Fármacos Neuroprotetores/uso terapêutico , Transdução de Sinais , Melatonina/farmacologia , Isquemia Encefálica/tratamento farmacológico , Acidente Vascular Cerebral/tratamento farmacológico , Acidente Vascular Cerebral/genética , Camundongos Knockout , Infarto da Artéria Cerebral Média/tratamento farmacológico , Infarto da Artéria Cerebral Média/metabolismoRESUMO
MOTIVATION: Single-cell RNA sequencing allows us to study cell heterogeneity at an unprecedented cell-level resolution and identify known and new cell populations. Current cell labeling pipeline uses unsupervised clustering and assigns labels to clusters by manual inspection. However, this pipeline does not utilize available gold-standard labels because there are usually too few of them to be useful to most computational methods. This article aims to facilitate cell labeling with a semi-supervised method in an alternative pipeline, in which a few gold-standard labels are first identified and then extended to the rest of the cells computationally. RESULTS: We built a semi-supervised dimensionality reduction method, a network-enhanced autoencoder (netAE). Tested on three public datasets, netAE outperforms various dimensionality reduction baselines and achieves satisfactory classification accuracy even when the labeled set is very small, without disrupting the similarity structure of the original space. AVAILABILITY AND IMPLEMENTATION: The code of netAE is available on GitHub: https://github.com/LeoZDong/netAE. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Assuntos
Análise de Célula Única , Sequência de Bases , Análise por Conglomerados , Análise de Sequência de RNA , Sequenciamento do ExomaRESUMO
BACKGROUND: COVID-19 has been observed to be associated with venous and arterial thrombosis. The inflammatory disease prolongs hospitalization, and preexisting comorbidities can intensity the thrombotic burden in patients with COVID-19. However, venous thromboembolism, arterial thrombosis, and other vascular complications may go unnoticed in critical care settings. Early risk stratification is paramount in the COVID-19 patient population for proactive monitoring of thrombotic complications. OBJECTIVE: The aim of this exploratory research was to characterize thrombotic complication risk factors associated with COVID-19 using information from electronic health record (EHR) and insurance claims databases. The goal is to develop an approach for analysis using real-world data evidence that can be generalized to characterize thrombotic complications and additional conditions in other clinical settings as well, such as pneumonia or acute respiratory distress syndrome in COVID-19 patients or in the intensive care unit. METHODS: We extracted deidentified patient data from the insurance claims database IBM MarketScan, and formulated hypotheses on thrombotic complications in patients with COVID-19 with respect to patient demographic and clinical factors using logistic regression. The hypotheses were then verified with analysis of deidentified patient data from the Research Patient Data Registry (RPDR) Mass General Brigham (MGB) patient EHR database. Data were analyzed according to odds ratios, 95% CIs, and P values. RESULTS: The analysis identified significant predictors (P<.001) for thrombotic complications in 184,831 COVID-19 patients out of the millions of records from IBM MarketScan and the MGB RPDR. With respect to age groups, patients 60 years and older had higher odds (4.866 in MarketScan and 6.357 in RPDR) to have thrombotic complications than those under 60 years old. In terms of gender, men were more likely (odds ratio of 1.245 in MarketScan and 1.693 in RPDR) to have thrombotic complications than women. Among the preexisting comorbidities, patients with heart disease, cerebrovascular diseases, hypertension, and personal history of thrombosis all had significantly higher odds of developing a thrombotic complication. Cancer and obesity were also associated with odds>1. The results from RPDR validated the IBM MarketScan findings, as they were largely consistent and afford mutual enrichment. CONCLUSIONS: The analysis approach adopted in this study can work across heterogeneous databases from diverse organizations and thus facilitates collaboration. Searching through millions of patient records, the analysis helped to identify factors influencing a phenotype. Use of thrombotic complications in COVID-19 patients represents only a case study; however, the same design can be used across other disease areas by extracting corresponding disease-specific patient data from available databases.
Assuntos
COVID-19 , Trombose , Humanos , Feminino , COVID-19/complicações , COVID-19/epidemiologia , Trombose/epidemiologia , Trombose/etiologia , Fatores de Risco , Estudos Retrospectivos , Razão de ChancesRESUMO
BACKGROUND: COVID-19 caused by SARS-CoV-2 has infected 219 million individuals at the time of writing of this paper. A large volume of research findings from observational studies about disease interactions with COVID-19 is being produced almost daily, making it difficult for physicians to keep track of the latest information on COVID-19's effect on patients with certain pre-existing conditions. OBJECTIVE: In this paper, we describe the creation of a clinical decision support tool, the SMART COVID Navigator, a web application to assist clinicians in treating patients with COVID-19. Our application allows clinicians to access a patient's electronic health records and identify disease interactions from a large set of observational research studies that affect the severity and fatality due to COVID-19. METHODS: The SMART COVID Navigator takes a 2-pronged approach to clinical decision support. The first part is a connection to electronic health record servers, allowing the application to access a patient's medical conditions. The second is accessing data sets with information from various observational studies to determine the latest research findings about COVID-19 outcomes for patients with certain medical conditions. By connecting these 2 data sources, users can see how a patient's medical history will affect their COVID-19 outcomes. RESULTS: The SMART COVID Navigator aggregates patient health information from multiple Fast Healthcare Interoperability Resources-enabled electronic health record systems. This allows physicians to see a comprehensive view of patient health records. The application accesses 2 data sets of over 1100 research studies to provide information on the fatality and severity of COVID-19 for several pre-existing conditions. We also analyzed the results of the collected studies to determine which medical conditions result in an increased chance of severity and fatality of COVID-19 progression. We found that certain conditions result in a higher likelihood of severity and fatality probabilities. We also analyze various cancer tissues and find that the probabilities for fatality vary greatly depending on the tissue being examined. CONCLUSIONS: The SMART COVID Navigator allows physicians to predict the fatality and severity of COVID-19 progression given a particular patient's medical conditions. This can allow physicians to determine how aggressively to treat patients infected with COVID-19 and to prioritize different patients for treatment considering their prior medical conditions.
Assuntos
COVID-19 , Sistemas de Apoio a Decisões Clínicas , Registros Eletrônicos de Saúde , Humanos , SARS-CoV-2 , SoftwareRESUMO
BACKGROUND: VCF formatted files are the lingua franca of next-generation sequencing, whereas HL7 FHIR is emerging as a standard language for electronic health record interoperability. A growing number of FHIR-based clinical genomics applications are emerging. Here, we describe an open source utility for converting variants from VCF format into HL7 FHIR format. RESULTS: vcf2fhir converts VCF variants into a FHIR Genomics Diagnostic Report. Conversion translates each VCF row into a corresponding FHIR-formatted variant in the generated report. In scope are simple variants (SNVs, MNVs, Indels), along with zygosity and phase relationships, for autosomes, sex chromosomes, and mitochondrial DNA. Input parameters include VCF file and genome build ('GRCh37' or 'GRCh38'); and optionally a conversion region that indicates the region(s) to convert, a studied region that lists genomic regions studied by the lab, and a non-callable region that lists studied regions deemed uncallable by the lab. Conversion can be limited to a subset of VCF by supplying genomic coordinates of the conversion region(s). If studied and non-callable regions are also supplied, the output FHIR report will include 'region-studied' observations that detail which portions of the conversion region were studied, and of those studied regions, which portions were deemed uncallable. We illustrate the vcf2fhir utility via two case studies. The first, 'SMART Cancer Navigator', is a web application that offers clinical decision support by linking patient EHR information to cancerous gene variants. The second, 'Precision Genomics Integration Platform', intersects a patient's FHIR-formatted clinical and genomic data with knowledge bases in order to provide on-demand delivery of contextually relevant genomic findings and recommendations to the EHR. CONCLUSIONS: Experience to date shows that the vcf2fhir utility can be effectively woven into clinically useful genomic-EHR integration pipelines. Additional testing will be a critical step towards the clinical validation of this utility, enabling it to be integrated in a variety of real world data flow scenarios. For now, we propose the use of this utility primarily to accelerate FHIR Genomics understanding and to facilitate experimentation with further integration of genomics data into the EHR.
Assuntos
Sistemas de Apoio a Decisões Clínicas , Genômica , Registros Eletrônicos de Saúde , Humanos , Bases de Conhecimento , OncogenesRESUMO
A personalized approach based on a patient's or pathogen's unique genomic sequence is the foundation of precision medicine. Genomic findings must be robust and reproducible, and experimental data capture should adhere to findable, accessible, interoperable, and reusable (FAIR) guiding principles. Moreover, effective precision medicine requires standardized reporting that extends beyond wet-lab procedures to computational methods. The BioCompute framework (https://w3id.org/biocompute/1.3.0) enables standardized reporting of genomic sequence data provenance, including provenance domain, usability domain, execution domain, verification kit, and error domain. This framework facilitates communication and promotes interoperability. Bioinformatics computation instances that employ the BioCompute framework are easily relayed, repeated if needed, and compared by scientists, regulators, test developers, and clinicians. Easing the burden of performing the aforementioned tasks greatly extends the range of practical application. Large clinical trials, precision medicine, and regulatory submissions require a set of agreed upon standards that ensures efficient communication and documentation of genomic analyses. The BioCompute paradigm and the resulting BioCompute Objects (BCOs) offer that standard and are freely accessible as a GitHub organization (https://github.com/biocompute-objects) following the "Open-Stand.org principles for collaborative open standards development." With high-throughput sequencing (HTS) studies communicated using a BCO, regulatory agencies (e.g., Food and Drug Administration [FDA]), diagnostic test developers, researchers, and clinicians can expand collaboration to drive innovation in precision medicine, potentially decreasing the time and cost associated with next-generation sequencing workflow exchange, reporting, and regulatory reviews.
Assuntos
Biologia Computacional/métodos , Análise de Sequência de DNA/métodos , Animais , Comunicação , Biologia Computacional/normas , Genoma , Genômica/métodos , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Medicina de Precisão/tendências , Reprodutibilidade dos Testes , Análise de Sequência de DNA/normas , Software , Fluxo de TrabalhoRESUMO
BACKGROUND: Interdisciplinary collaborations bring lots of benefits to researchers in multiple areas, including precision medicine. OBJECTIVE: This viewpoint aims at studying how cross-institution team science would affect the development of precision medicine. METHODS: Publications of organizations on the eHealth Catalogue of Activities were collected in 2015 and 2017. The significance of the correlation between coleadership and coauthorship among different organizations was calculated using the Pearson chi-square test of independence. Other nonparametric tests examined whether organizations with coleaders publish more and better papers than organizations without coleaders. RESULTS: A total of 374 publications from 69 organizations were analyzed in 2015, and 7064 papers from 87 organizations were analyzed in 2017. Organizations with coleadership published more papers (P<.001, 2015 and 2017), which received higher citations (Z=-13.547, P<.001, 2017), compared to those without coleadership. Organizations with coleaders tended to publish papers together (P<.001, 2015 and 2017). CONCLUSIONS: Our findings suggest that organizations in the field of precision medicine could greatly benefit from institutional-level team science. As a result, stronger collaboration is recommended.
Assuntos
Pesquisa Interdisciplinar , Telemedicina , Humanos , Medicina de Precisão , PublicaçõesRESUMO
Researchers used to focus on analyzing single gene or protein expression of the microbes. But recently, genome, transcriptome, proteome and metabolome have gained more and more attention. Based on technologies of omics, including genomics, transcriptomics and metabolomics, a large quantity of information about cells, microbes and human, such as the information about phylogeny, virulence, antibiotic resistance and other aspects, has been revealed. Genus Streptococcus is one of the most invasive groups of bacteria that cause both human and animal diseases, threatening public health. In this review, we summarize the application of omics to analyze this genus-Streptococcus.
Assuntos
Proteínas de Bactérias/genética , Biologia Computacional/métodos , Farmacorresistência Bacteriana Múltipla/genética , Regulação Bacteriana da Expressão Gênica , Genoma Bacteriano , Streptococcus/genética , Antibacterianos/farmacologia , Proteínas de Bactérias/metabolismo , Sistemas CRISPR-Cas , Mapeamento Cromossômico , DNA Bacteriano/genética , DNA Bacteriano/metabolismo , Transferência Genética Horizontal , Sequenciamento de Nucleotídeos em Larga Escala , Interações Hospedeiro-Patógeno/genética , Humanos , Filogenia , Infecções Estreptocócicas/tratamento farmacológico , Infecções Estreptocócicas/microbiologia , Infecções Estreptocócicas/patologia , Streptococcus/classificação , Streptococcus/efeitos dos fármacos , Streptococcus/patogenicidade , VirulênciaRESUMO
The increase of fungal infectious diseases and lack of safe and efficacious antifungal drugs result in the urgent need of new therapeutic strategies. Here, we repurposed the lovastatin (LOV) as a synergistic antifungal potentiator to itraconazole (ITZ) against Candida albicans planktonic cells and biofilms in vitro for the first time. Mutants from ergosterol biosynthesis pathway were employed and key gene expression profiles of ergosterol pathway were also measured. LOV single treatment was unable to inhibit C. albicans strains except the ERG3 and ERG11 double mutant. LOV and ITZ combination was capable of inhibiting the C. albicans planktonic cells and biofilms synergistically including the ITZ resistant mutants. The synergistic antifungal ability was stronger in either ERG11 or ERG3 dysfunctional mutants compared to wild type. The combination lost the synergistic activities in the ERG11 and ERG3 double mutant, while it was sensitive to LOV single treatment. The expression of HMG1, encoding HMG-CoA the target of LOV, was significantly upregulated in ERG11 and ERG3 double mutant strain by the treatment of the combination at 1.5 and 3 h. The combination also significantly increased the HMG1 expression in mutants from ergosterol pathway compared with wild type. The ERG11 and ERG3 gene expressions were upregulated by ITZ and its combination with LOV, but seemingly not by LOV single treatment after 1.5 and 3 h. The combination of LOV and ITZ on C. albicans planktonic cells and biofilms highlights its potential clinical practice especially against the azole drug-resistant mutants.
Assuntos
Biofilmes/efeitos dos fármacos , Candida albicans/efeitos dos fármacos , Itraconazol/química , Itraconazol/farmacologia , Lovastatina/química , Lovastatina/farmacologia , Antifúngicos/química , Antifúngicos/farmacologia , Candida albicans/genética , Ergosterol/biossíntese , Ergosterol/genética , Proteínas Fúngicas/genética , Regulação Fúngica da Expressão Gênica , Testes de Sensibilidade Microbiana , MutaçãoRESUMO
BACKGROUND: Visualizing data by dimensionality reduction is an important strategy in Bioinformatics, which could help to discover hidden data properties and detect data quality issues, e.g. data noise, inappropriately labeled data, etc. As crowdsourcing-based synthetic biology databases face similar data quality issues, we propose to visualize biobricks to tackle them. However, existing dimensionality reduction methods could not be directly applied on biobricks datasets. Hereby, we use normalized edit distance to enhance dimensionality reduction methods, including Isomap and Laplacian Eigenmaps. RESULTS: By extracting biobricks from synthetic biology database Registry of Standard Biological Parts, six combinations of various types of biobricks are tested. The visualization graphs illustrate discriminated biobricks and inappropriately labeled biobricks. Clustering algorithm K-means is adopted to quantify the reduction results. The average clustering accuracy for Isomap and Laplacian Eigenmaps are 0.857 and 0.844, respectively. Besides, Laplacian Eigenmaps is 5 times faster than Isomap, and its visualization graph is more concentrated to discriminate biobricks. CONCLUSIONS: By combining normalized edit distance with Isomap and Laplacian Eigenmaps, synthetic biology biobircks are successfully visualized in two dimensional space. Various types of biobricks could be discriminated and inappropriately labeled biobricks could be determined, which could help to assess crowdsourcing-based synthetic biology databases' quality, and make biobricks selection.
Assuntos
Biologia Computacional/métodos , Modelos Teóricos , Dinâmica não Linear , Análise de Sequência de DNA , Biologia Sintética , Algoritmos , Análise por Conglomerados , Bases de Dados Factuais , Reprodutibilidade dos TestesRESUMO
BACKGROUND: An exponential growth of high-throughput biological information and data has occurred in the past decade, supported by technologies, such as microarrays and RNA-Seq. Most data generated using such methods are used to encode large amounts of rich information, and determine diagnostic and prognostic biomarkers. Although data storage costs have reduced, process of capturing data using aforementioned technologies is still expensive. Moreover, the time required for the assay, from sample preparation to raw value measurement is excessive (in the order of days). There is an opportunity to reduce both the cost and time for generating such expression datasets. RESULTS: We propose a framework in which complete gene expression values can be reliably predicted in-silico from partial measurements. This is achieved by modelling expression data as a low-rank matrix and then applying recently discovered techniques of matrix completion by using nonlinear convex optimisation. We evaluated prediction of gene expression data based on 133 studies, sourced from a combined total of 10,921 samples. It is shown that such datasets can be constructed with a low relative error even at high missing value rates (>50 %), and that such predicted datasets can be reliably used as surrogates for further analysis. CONCLUSION: This method has potentially far-reaching applications including how bio-medical data is sourced and generated, and transcriptomic prediction by optimisation. We show that gene expression data can be computationally constructed, thereby potentially reducing the costs of gene expression profiling. In conclusion, this method shows great promise of opening new avenues in research on low-rank matrix completion in biological sciences.
Assuntos
Simulação por Computador , Aprendizado de Máquina , Modelos Genéticos , Transcriptoma , HumanosRESUMO
Hospital acquired complications (HACs) are serious problems affecting modern day healthcare institutions. It is estimated that HACs result in an approximately 10% increase in total inpatient hospital costs across US hospitals. With US hospital spending totaling nearly $900 billion per annum, the damages caused by HACs are no small matter. Early detection and prevention of HACs could greatly reduce strains on the US healthcare system and improve patient morbidity & mortality rates. Here, we describe a machine-learning model for predicting the occurrence of HACs within five distinct categories using temporal clinical data. Using our approach, we find that at least $10 billion of excessive hospital costs could be saved in the US alone, with the institution of effective preventive measures. In addition, we also identify several keystone features that demonstrate high predictive power for HACs over different time periods following patient admission. The classifiers and features analyzed in this study show high promise of being able to be used for accurate prediction of HACs in clinical settings, and furthermore provide novel insights into the contribution of various clinical factors to the risk of developing HACs as a function of healthcare system exposure.
Assuntos
Infecção Hospitalar/classificação , Registros Eletrônicos de Saúde/classificação , Hospitalização/estatística & dados numéricos , Informática Médica/métodos , Complicações Pós-Operatórias/classificação , Codificação Clínica , HumanosRESUMO
OBJECTIVE: To combine mathematical modeling of salivary gene expression microarray data and systems biology annotation with reverse-transcription quantitative polymerase chain reaction amplification to identify (phase I) and validate (phase II) salivary biomarker analysis for the prediction of oral feeding readiness in preterm infants. STUDY DESIGN: Comparative whole-transcriptome microarray analysis from 12 preterm newborns pre- and postoral feeding success was used for computational modeling and systems biology analysis to identify potential salivary transcripts associated with oral feeding success (phase I). Selected gene expression biomarkers (15 from computational modeling; 6 evidence-based; and 3 reference) were evaluated by reverse-transcription quantitative polymerase chain reaction amplification on 400 salivary samples from successful (n = 200) and unsuccessful (n = 200) oral feeders (phase II). Genes, alone and in combination, were evaluated by a multivariate analysis controlling for sex and postconceptional age (PCA) to determine the probability that newborns achieved successful oral feeding. RESULTS: Advancing PCA (P < .001) and female sex (P = .05) positively predicted an infant's ability to feed orally. A combination of 5 genes, neuropeptide Y2 receptor (hunger signaling), adneosine-monophosphate-activated protein kinase (energy homeostasis), plexin A1 (olfactory neurogenesis), nephronophthisis 4 (visual behavior), and wingless-type MMTV integration site family, member 3 (facial development), in addition to PCA and sex, demonstrated good accuracy for determining feeding success (area under the receiver operator characteristic curve = 0.78). CONCLUSIONS: We have identified objective and biologically relevant salivary biomarkers that noninvasively assess a newborn's developing brain, sensory, and facial development as they relate to oral feeding success. Understanding the mechanisms that underlie the development of oral feeding readiness through translational and computational methods may improve clinical decision making while decreasing morbidities and health care costs.
Assuntos
Simulação por Computador , Comportamento Alimentar , Regulação da Expressão Gênica , Análise em Microsséries , Modelos Genéticos , Saliva/química , Comportamento de Sucção , Biomarcadores/análise , Feminino , Humanos , Recém-Nascido , Masculino , Valor Preditivo dos TestesRESUMO
Recent advances in high-throughput biotechnologies have led to the rapid growing research interest in reverse engineering of biomolecular systems (REBMS). 'Data-driven' approaches, i.e. data mining, can be used to extract patterns from large volumes of biochemical data at molecular-level resolution while 'design-driven' approaches, i.e. systems modeling, can be used to simulate emergent system properties. Consequently, both data- and design-driven approaches applied to -omic data may lead to novel insights in reverse engineering biological systems that could not be expected before using low-throughput platforms. However, there exist several challenges in this fast growing field of reverse engineering biomolecular systems: (i) to integrate heterogeneous biochemical data for data mining, (ii) to combine top-down and bottom-up approaches for systems modeling and (iii) to validate system models experimentally. In addition to reviewing progress made by the community and opportunities encountered in addressing these challenges, we explore the emerging field of synthetic biology, which is an exciting approach to validate and analyze theoretical system models directly through experimental synthesis, i.e. analysis-by-synthesis. The ultimate goal is to address the present and future challenges in reverse engineering biomolecular systems (REBMS) using integrated workflow of data mining, systems modeling and synthetic biology.
Assuntos
Mineração de Dados/métodos , Biologia de Sistemas , Bioengenharia/métodos , BiotecnologiaRESUMO
A constant false alarm rate (CFAR) target detector in non-homogenous backgrounds is proposed. Based on K-sample Anderson-Darling (AD) tests, the method re-arranges the reference cells by merging homogenous sub-blocks surrounding the cell under test (CUT) into a new reference window to estimate the background statistics. Double partition test, clutter edge refinement and outlier elimination are used as an anti-clutter processor in the proposed Modified AD (MAD) detector. Simulation results show that the proposed MAD test based detector outperforms cell-averaging (CA) CFAR, greatest of (GO) CFAR, smallest of (SO) CFAR, order-statistic (OS) CFAR, variability index (VI) CFAR, and CUT inclusive (CI) CFAR in most non-homogenous situations.
RESUMO
Clinical trials are critical to many medical advances; however, recruiting patients remains a persistent obstacle. Automated clinical trial matching could expedite recruitment across all trial phases. We detail our initial efforts towards automating the matching process by linking realistic synthetic electronic health records to clinical trial eligibility criteria using natural language processing methods. We also demonstrate how the Sørensen-Dice Index can be adapted to quantify match quality between a patient and a clinical trial.
RESUMO
Alzheimer's disease (AD) is an age-dependent neurodegenerative disease characterized by extracellular Amyloid Aß peptide (Aß) deposition and intracellular Tau protein aggregation. Glia, especially microglia and astrocytes are core participants during the progression of AD and these cells are the mediators of Aß clearance and degradation. The microbiota-gut-brain axis (MGBA) is a complex interactive network between the gut and brain involved in neurodegeneration. MGBA affects the function of glia in the central nervous system (CNS), and microbial metabolites regulate the communication between astrocytes and microglia; however, whether such communication is part of AD pathophysiology remains unknown. One of the potential links in bilateral gut-brain communication is tryptophan (Trp) metabolism. The microbiota-originated Trp and its metabolites enter the CNS to control microglial activation, and the activated microglia subsequently affect astrocyte functions. The present review highlights the role of MGBA in AD pathology, especially the roles of Trp per se and its metabolism as a part of the gut microbiota and brain communications. We (i) discuss the roles of Trp derivatives in microglia-astrocyte crosstalk from a bioinformatics perspective, (ii) describe the role of glia polarization in the microglia-astrocyte crosstalk and AD pathology, and (iii) summarize the potential of Trp metabolism as a therapeutic target. Finally, we review the role of Trp in AD from the perspective of the gut-brain axis and microglia, as well as astrocyte crosstalk, to inspire the discovery of novel AD therapeutics.
Assuntos
Doença de Alzheimer , Astrócitos , Eixo Encéfalo-Intestino , Microbioma Gastrointestinal , Microglia , Triptofano , Doença de Alzheimer/metabolismo , Doença de Alzheimer/patologia , Humanos , Astrócitos/metabolismo , Microglia/metabolismo , Triptofano/metabolismo , Eixo Encéfalo-Intestino/fisiologia , Microbioma Gastrointestinal/fisiologia , Encéfalo/metabolismo , Encéfalo/patologia , AnimaisRESUMO
High-resolution whole slide image scans of histopathology slides have been widely used in recent years for prediction in cancer. However, in some cases, clinical informatics practitioners may only have access to low-resolution snapshots of histopathology slides, not high-resolution scans. We evaluated strategies for training neural network prognostic models in non-small cell lung cancer (NSCLC) based on low-resolution snapshots, using data from the Veterans Affairs Precision Oncology Data Repository. We compared strategies without transfer learning, with transfer learning from general domain images, and with transfer learning from publicly available high-resolution histopathology scans. We found transfer learning from high-resolution scans achieved significantly better performance than other strategies. Our contribution provides a foundation for future development of prognostic models in NSCLC that incorporate data from low-resolution pathology slide snapshots alongside known clinical predictors.
Assuntos
Carcinoma Pulmonar de Células não Pequenas , Neoplasias Pulmonares , Informática Médica , Humanos , Carcinoma Pulmonar de Células não Pequenas/diagnóstico por imagem , Neoplasias Pulmonares/diagnóstico por imagem , Medicina de Precisão , Aprendizado de MáquinaRESUMO
BACKGROUND AND OBJECTIVE: To develop the COVid Veteran (COVet) score for clinical deterioration in Veterans hospitalized with COVID-19 and further validate this model in both Veteran and non-Veteran samples. No such score has been derived and validated while incorporating a Veteran sample. DERIVATION COHORT: Adults (age ≥ 18 yr) hospitalized outside the ICU with a diagnosis of COVID-19 for model development to the Veterans Health Administration (VHA) (n = 80 hospitals). VALIDATION COHORT: External validation occurred in a VHA cohort of 34 hospitals, as well as six non-Veteran health systems for further external validation (n = 21 hospitals) between 2020 and 2023. PREDICTION MODEL: eXtreme Gradient Boosting machine learning methods were used, and performance was assessed using the area under the receiver operating characteristic curve and compared with the National Early Warning Score (NEWS). The primary outcome was transfer to the ICU or death within 24 hours of each new variable observation. Model predictor variables included demographics, vital signs, structured flowsheet data, and laboratory values. RESULTS: A total of 96,908 admissions occurred during the study period, of which 59,897 were in the Veteran sample and 37,011 were in the non-Veteran sample. During external validation in the Veteran sample, the model demonstrated excellent discrimination, with an area under the receiver operating characteristic curve of 0.88. This was significantly higher than NEWS (0.79; p < 0.01). In the non-Veteran sample, the model also demonstrated excellent discrimination (0.86 vs. 0.79 for NEWS; p < 0.01). The top three variables of importance were eosinophil percentage, mean oxygen saturation in the prior 24-hour period, and worst mental status in the prior 24-hour period. CONCLUSIONS: We used machine learning methods to develop and validate a highly accurate early warning score in both Veterans and non-Veterans hospitalized with COVID-19. The model could lead to earlier identification and therapy, which may improve outcomes.
Assuntos
COVID-19 , Aprendizado de Máquina , Veteranos , Humanos , COVID-19/diagnóstico , COVID-19/epidemiologia , Masculino , Feminino , Pessoa de Meia-Idade , Veteranos/estatística & dados numéricos , Idoso , Medição de Risco/métodos , Estados Unidos/epidemiologia , Hospitalização/estatística & dados numéricos , Adulto , Unidades de Terapia Intensiva , Curva ROC , Estudos de CoortesRESUMO
The Wilms' tumor suppressor 1 (WT1) gene encodes a DNA- and RNA-binding protein that plays an essential role in nephron progenitor differentiation during renal development. To identify WT1 target genes that might regulate nephron progenitor differentiation in vivo, we performed chromatin immunoprecipitation (ChIP) coupled to mouse promoter microarray (ChIP-chip) using chromatin prepared from embryonic mouse kidney tissue. We identified 1663 genes bound by WT1, 86% of which contain a previously identified, conserved, high-affinity WT1 binding site. To investigate functional interactions between WT1 and candidate target genes in nephron progenitors, we used a novel, modified WT1 morpholino loss-of-function model in embryonic mouse kidney explants to knock down WT1 expression in nephron progenitors ex vivo. Low doses of WT1 morpholino resulted in reduced WT1 target gene expression specifically in nephron progenitors, whereas high doses of WT1 morpholino arrested kidney explant development and were associated with increased nephron progenitor cell apoptosis, reminiscent of the phenotype observed in Wt1(-/-) embryos. Collectively, our results provide a comprehensive description of endogenous WT1 target genes in nephron progenitor cells in vivo, as well as insights into the transcriptional signaling networks controlled by WT1 that might direct nephron progenitor fate during renal development.