RESUMEN
Computational data-centric research techniques play a prevalent and multi-disciplinary role in life science research. In the past, scientists in wet labs generated the data, and computational researchers focused on creating tools for the analysis of those data. Computational researchers are now becoming more independent and taking leadership roles within biomedical projects, leveraging the increased availability of public data. We are now able to generate vast amounts of data, and the challenge has shifted from data generation to data analysis. Here we discuss the pitfalls, challenges, and opportunities facing the field of data-centric research in biology. We discuss the evolving perception of computational data-driven research and its rise as an independent domain in biomedical research while also addressing the significant collaborative opportunities that arise from integrating computational research with experimental and translational biology. Additionally, we discuss the future of data-centric research and its applications across various areas of the biomedical field.
Asunto(s)
Investigación Biomédica , Biología Computacional , Biología Computacional/métodos , HumanosRESUMEN
Tissue fibrosis is a major cause of mortality that results from the deposition of matrix proteins by an activated mesenchyme. Macrophages accumulate in fibrosis, but the role of specific subgroups in supporting fibrogenesis has not been investigated in vivo. Here, we used single-cell RNA sequencing (scRNA-seq) to characterize the heterogeneity of macrophages in bleomycin-induced lung fibrosis in mice. A novel computational framework for the annotation of scRNA-seq by reference to bulk transcriptomes (SingleR) enabled the subclustering of macrophages and revealed a disease-associated subgroup with a transitional gene expression profile intermediate between monocyte-derived and alveolar macrophages. These CX3CR1+SiglecF+ transitional macrophages localized to the fibrotic niche and had a profibrotic effect in vivo. Human orthologs of genes expressed by the transitional macrophages were upregulated in samples from patients with idiopathic pulmonary fibrosis. Thus, we have identified a pathological subgroup of transitional macrophages that are required for the fibrotic response to injury.
Asunto(s)
Fibrosis Pulmonar Idiopática/inmunología , Pulmón/patología , Activación de Macrófagos , Macrófagos Alveolares/inmunología , Animales , Antígenos de Diferenciación Mielomonocítica/genética , Antígenos de Diferenciación Mielomonocítica/inmunología , Antígenos de Diferenciación Mielomonocítica/metabolismo , Bleomicina/inmunología , Receptor 1 de Quimiocinas CX3C/genética , Receptor 1 de Quimiocinas CX3C/inmunología , Receptor 1 de Quimiocinas CX3C/metabolismo , Células Cultivadas , Modelos Animales de Enfermedad , Femenino , Perfilación de la Expresión Génica/métodos , Humanos , Fibrosis Pulmonar Idiopática/patología , Pulmón/citología , Pulmón/inmunología , Macrófagos Alveolares/metabolismo , Masculino , Ratones , Análisis de Secuencia de ARN/métodos , Lectinas Similares a la Inmunoglobulina de Unión a Ácido Siálico , Análisis de la Célula Individual/métodos , Regulación hacia ArribaRESUMEN
There is considerable heterogeneity in immunological parameters between individuals, but its sources are largely unknown. To assess the relative contribution of heritable versus non-heritable factors, we have performed a systems-level analysis of 210 healthy twins between 8 and 82 years of age. We measured 204 different parameters, including cell population frequencies, cytokine responses, and serum proteins, and found that 77% of these are dominated (>50% of variance) and 58% almost completely determined (>80% of variance) by non-heritable influences. In addition, some of these parameters become more variable with age, suggesting the cumulative influence of environmental exposure. Similarly, the serological responses to seasonal influenza vaccination are also determined largely by non-heritable factors, likely due to repeated exposure to different strains. Lastly, in MZ twins discordant for cytomegalovirus infection, more than half of all parameters are affected. These results highlight the largely reactive and adaptive nature of the immune system in healthy individuals.
Asunto(s)
Inmunidad , Gemelos Dicigóticos , Gemelos Monocigóticos , Adolescente , Adulto , Anciano , Anciano de 80 o más Años , Proteínas Sanguíneas/análisis , Proteínas Sanguíneas/inmunología , Niño , Citocinas/inmunología , Infecciones por Citomegalovirus/inmunología , Humanos , Vacunas contra la Influenza/inmunología , Persona de Mediana Edad , Adulto JovenRESUMEN
Sympathetic activation during cold exposure increases adipocyte thermogenesis via the expression of mitochondrial protein uncoupling protein 1 (UCP1)1. The propensity of adipocytes to express UCP1 is under a critical influence of the adipose microenvironment and varies between sexes and among various fat depots2-7. Here we report that mammary gland ductal epithelial cells in the adipose niche regulate cold-induced adipocyte UCP1 expression in female mouse subcutaneous white adipose tissue (scWAT). Single-cell RNA sequencing shows that glandular luminal epithelium subtypes express transcripts that encode secretory factors controlling adipocyte UCP1 expression under cold conditions. We term these luminal epithelium secretory factors 'mammokines'. Using 3D visualization of whole-tissue immunofluorescence, we reveal sympathetic nerve-ductal contact points. We show that mammary ducts activated by sympathetic nerves limit adipocyte UCP1 expression via the mammokine lipocalin 2. In vivo and ex vivo ablation of mammary duct epithelium enhance the cold-induced adipocyte thermogenic gene programme in scWAT. Since the mammary duct network extends throughout most of the scWAT in female mice, females show markedly less scWAT UCP1 expression, fat oxidation, energy expenditure and subcutaneous fat mass loss compared with male mice, implicating sex-specific roles of mammokines in adipose thermogenesis. These results reveal a role of sympathetic nerve-activated glandular epithelium in adipocyte UCP1 expression and suggest that mammary duct luminal epithelium has an important role in controlling glandular adiposity.
Asunto(s)
Adipocitos , Tejido Adiposo Blanco , Epitelio , Glándulas Mamarias Animales , Termogénesis , Animales , Femenino , Masculino , Ratones , Adipocitos/metabolismo , Tejido Adiposo Blanco/citología , Tejido Adiposo Blanco/metabolismo , Epitelio/inervación , Epitelio/metabolismo , Proteína Desacopladora 1/genética , Proteína Desacopladora 1/metabolismo , Glándulas Mamarias Animales/citología , Glándulas Mamarias Animales/inervación , Glándulas Mamarias Animales/fisiología , Frío , Sistema Nervioso Simpático/fisiología , Metabolismo Energético , Oxidación-Reducción , Caracteres SexualesRESUMEN
Personalized medicine is expected to benefit from combining genomic information with regular monitoring of physiological states by multiple high-throughput methods. Here, we present an integrative personal omics profile (iPOP), an analysis that combines genomic, transcriptomic, proteomic, metabolomic, and autoantibody profiles from a single individual over a 14 month period. Our iPOP analysis revealed various medical risks, including type 2 diabetes. It also uncovered extensive, dynamic changes in diverse molecular components and biological pathways across healthy and diseased conditions. Extremely high-coverage genomic and transcriptomic data, which provide the basis of our iPOP, revealed extensive heteroallelic changes during healthy and diseased states and an unexpected RNA editing mechanism. This study demonstrates that longitudinal iPOP can be used to interpret healthy and diseased states by connecting genomic information with additional dynamic omics activity.
Asunto(s)
Genoma Humano , Genómica , Medicina de Precisión , Diabetes Mellitus Tipo 2/genética , Femenino , Perfilación de la Expresión Génica , Humanos , Masculino , Metabolómica , Persona de Mediana Edad , Mutación , Proteómica , Virus Sincitiales Respiratorios/aislamiento & purificación , Rhinovirus/aislamiento & purificaciónRESUMEN
BACKGROUND: Exosome therapy shows potential for cardiac repair after injury. However, intrinsic challenges such as short half-life and lack of clear targets hinder the clinical feasibility. Here, we report a noninvasive and repeatable method for exosome delivery through inhalation after myocardial infarction (MI), which we called stem cell-derived exosome nebulization therapy (SCENT). METHODS: Stem cell-derived exosomes were characterized for size distribution and surface markers. C57BL/6 mice with MI model received exosome inhalation treatment through a nebulizer for 7 consecutive days. Echocardiographies were performed to monitor cardiac function after SCENT, and histological analysis helped with the investigation of myocardial repair. Single-cell RNA sequencing of the whole heart was performed to explore the mechanism of action by SCENT. Last, the feasibility, efficacy, and general safety of SCENT were demonstrated in a swine model of MI, facilitated by 3-dimensional cardiac magnetic resonance imaging. RESULTS: Recruitment of exosomes to the ischemic heart after SCENT was detected by ex vivo IVIS imaging and fluorescence microscopy. In a mouse model of MI, SCENT ameliorated cardiac repair by improving left ventricular function, reducing fibrotic tissue, and promoting cardiomyocyte proliferation. Mechanistic studies using single-cell RNA sequencing of mouse heart after SCENT revealed a downregulation of Cd36 in endothelial cells (ECs). In an EC-Cd36fl/- conditional knockout mouse model, the inhibition of CD36, a fatty acid transporter in ECs, led to a compensatory increase in glucose utilization in the heart and higher ATP generation, which enhanced cardiac contractility. In pigs, cardiac magnetic resonance imaging showed an enhanced ejection fraction (Δ=11.66±5.12%) and fractional shortening (Δ=5.72±2.29%) at day 28 after MI by SCENT treatment compared with controls, along with reduced infarct size and thickened ventricular wall. CONCLUSIONS: In both rodent and swine models, our data proved the feasibility, efficacy, and general safety of SCENT treatment against acute MI injury, laying the groundwork for clinical investigation. Moreover, the EC-Cd36fl/- mouse model provides the first in vivo evidence showing that conditional EC-CD36 knockout can ameliorate cardiac injury. Our study introduces a noninvasive treatment option for heart disease and identifies new potential therapeutic targets.
Asunto(s)
Exosomas , Ratones Endogámicos C57BL , Infarto del Miocardio , Animales , Infarto del Miocardio/metabolismo , Infarto del Miocardio/patología , Infarto del Miocardio/terapia , Infarto del Miocardio/fisiopatología , Exosomas/metabolismo , Ratones , Administración por Inhalación , Modelos Animales de Enfermedad , Porcinos , Miocitos Cardíacos/metabolismo , Miocitos Cardíacos/patología , Masculino , Función Ventricular Izquierda , Humanos , Miocardio/metabolismo , Miocardio/patología , Células Madre/metabolismo , Antígenos CD36/metabolismo , Antígenos CD36/genéticaRESUMEN
Gene-expression profiling has become a mainstay in immunology, but subtle changes in gene networks related to biological processes are hard to discern when comparing various datasets. For instance, conservation of the transcriptional response to sepsis in mouse models and human disease remains controversial. To improve transcriptional analysis in immunology, we created ImmuneSigDB: a manually annotated compendium of â¼5,000 gene-sets from diverse cell states, experimental manipulations, and genetic perturbations in immunology. Analysis using ImmuneSigDB identified signatures induced in activated myeloid cells and differentiating lymphocytes that were highly conserved between humans and mice. Sepsis triggered conserved patterns of gene expression in humans and mouse models. However, we also identified species-specific biological processes in the sepsis transcriptional response: although both species upregulated phagocytosis-related genes, a mitosis signature was specific to humans. ImmuneSigDB enables granular analysis of transcriptomic data to improve biological understanding of immune processes of the human and mouse immune systems.
Asunto(s)
Bases de Datos Genéticas , Inflamación/inmunología , Transcriptoma , Animales , Humanos , Ratones , Especificidad de la EspecieRESUMEN
OBJECTIVES: Vedolizumab (VDZ) and ustekinumab (UST) are second-line treatments in pediatric patients with ulcerative colitis (UC) refractory to antitumor necrosis factor (anti-TNF) therapy. Pediatric studies comparing the effectiveness of these medications are lacking. Using a registry from ImproveCareNow (ICN), a global research network in pediatric inflammatory bowel disease, we compared the effectiveness of UST and VDZ in anti-TNF refractory UC. METHODS: We performed a propensity-score weighted regression analysis to compare corticosteroid-free clinical remission (CFCR) at 6 months from starting second-line therapy. Sensitivity analyses tested the robustness of our findings to different ways of handling missing outcome data. Secondary analyses evaluated alternative proxies of response and infection risk. RESULTS: Our cohort included 262 patients on VDZ and 74 patients on UST. At baseline, the two groups differed on their mean pediatric UC activity index (PUCAI) (p = 0.03) but were otherwise similar. At Month 6, 28.3% of patients on VDZ and 25.8% of those on UST achieved CFCR (p = 0.76). Our primary model showed no difference in CFCR (odds ratio: 0.81; 95% confidence interval [CI]: 0.41-1.59) (p = 0.54). The time to biologic discontinuation was similar in both groups (hazard ratio: 1.26; 95% CI: 0.76-2.08) (p = 0.36), with the reference group being VDZ, and we found no differences in clinical response, growth parameters, hospitalizations, surgeries, infections, or malignancy risk. Sensitivity analyses supported these findings of similar effectiveness. CONCLUSIONS: UST and VDZ are similarly effective for inducing clinical remission in anti-TNF refractory UC in pediatric patients. Providers should consider safety, tolerability, cost, and comorbidities when deciding between these therapies.
Asunto(s)
Anticuerpos Monoclonales Humanizados , Colitis Ulcerosa , Fármacos Gastrointestinales , Ustekinumab , Humanos , Colitis Ulcerosa/tratamiento farmacológico , Ustekinumab/uso terapéutico , Femenino , Masculino , Niño , Anticuerpos Monoclonales Humanizados/uso terapéutico , Adolescente , Fármacos Gastrointestinales/uso terapéutico , Resultado del Tratamiento , Factor de Necrosis Tumoral alfa/antagonistas & inhibidores , Inducción de Remisión/métodos , Puntaje de Propensión , Sistema de RegistrosRESUMEN
BACKGROUND: Diabetes mellitus (DM) is a major health concern among children with the widespread adoption of advanced technologies. However, concerns are growing about the transparency, replicability, biasedness, and overall validity of artificial intelligence studies in medicine. OBJECTIVE: We aimed to systematically review the reporting quality of machine learning (ML) studies of pediatric DM using the Minimum Information About Clinical Artificial Intelligence Modelling (MI-CLAIM) checklist, a general reporting guideline for medical artificial intelligence studies. METHODS: We searched the PubMed and Web of Science databases from 2016 to 2020. Studies were included if the use of ML was reported in children with DM aged 2 to 18 years, including studies on complications, screening studies, and in silico samples. In studies following the ML workflow of training, validation, and testing of results, reporting quality was assessed via MI-CLAIM by consensus judgments of independent reviewer pairs. Positive answers to the 17 binary items regarding sufficient reporting were qualitatively summarized and counted as a proxy measure of reporting quality. The synthesis of results included testing the association of reporting quality with publication and data type, participants (human or in silico), research goals, level of code sharing, and the scientific field of publication (medical or engineering), as well as with expert judgments of clinical impact and reproducibility. RESULTS: After screening 1043 records, 28 studies were included. The sample size of the training cohort ranged from 5 to 561. Six studies featured only in silico patients. The reporting quality was low, with great variation among the 21 studies assessed using MI-CLAIM. The number of items with sufficient reporting ranged from 4 to 12 (mean 7.43, SD 2.62). The items on research questions and data characterization were reported adequately most often, whereas items on patient characteristics and model examination were reported adequately least often. The representativeness of the training and test cohorts to real-world settings and the adequacy of model performance evaluation were the most difficult to judge. Reporting quality improved over time (r=0.50; P=.02); it was higher than average in prognostic biomarker and risk factor studies (P=.04) and lower in noninvasive hypoglycemia detection studies (P=.006), higher in studies published in medical versus engineering journals (P=.004), and higher in studies sharing any code of the ML pipeline versus not sharing (P=.003). The association between expert judgments and MI-CLAIM ratings was not significant. CONCLUSIONS: The reporting quality of ML studies in the pediatric population with DM was generally low. Important details for clinicians, such as patient characteristics; comparison with the state-of-the-art solution; and model examination for valid, unbiased, and robust results, were often the weak points of reporting. To assess their clinical utility, the reporting standards of ML studies must evolve, and algorithms for this challenging population must become more transparent and replicable.
Asunto(s)
Inteligencia Artificial , Diabetes Mellitus , Humanos , Niño , Reproducibilidad de los Resultados , Aprendizaje Automático , Diabetes Mellitus/diagnóstico , Lista de VerificaciónRESUMEN
Importance: The ways in which we access, acquire, and use data in clinical trials have evolved very little over time, resulting in a fragmented and inefficient system that limits the amount and quality of evidence that can be generated. Observations: Clinical trial design has advanced steadily over several decades. Yet the infrastructure for clinical trial data collection remains expensive and labor intensive and limits the amount of evidence that can be collected to inform whether and how interventions work for different patient populations. Meanwhile, there is increasing demand for evidence from randomized clinical trials to inform regulatory decisions, payment decisions, and clinical care. Although substantial public and industry investment in advancing electronic health record interoperability, data standardization, and the technology systems used for data capture have resulted in significant progress on various aspects of data generation, there is now a need to combine the results of these efforts and apply them more directly to the clinical trial data infrastructure. Conclusions and Relevance: We describe a vision for a modernized infrastructure that is centered around 2 related concepts. First, allowing the collection and rigorous evaluation of multiple data sources and types and, second, enabling the possibility to reuse health data for multiple purposes. We address the need for multidisciplinary collaboration and suggest ways to measure progress toward this goal.
RESUMEN
BACKGROUND: Pulmonary hypertension (PH) is life-threatening, and often diagnosed late in its course. We aimed to evaluate if a deep learning approach using electrocardiogram (ECG) data alone can detect PH and clinically important subtypes. We asked: does an automated deep learning approach to ECG interpretation detect PH and its clinically important subtypes? METHODS AND RESULTS: Adults with right heart catheterization or an echocardiogram within 90 days of an ECG at the University of California, San Francisco (2012-2019) were retrospectively identified as PH or non-PH. A deep convolutional neural network was trained on patients' 12-lead ECG voltage data. Patients were divided into training, development, and test sets in a ratio of 7:1:2. Overall, 5016 PH and 19,454 patients without PH were used in the study. The mean age at the time of ECG was 62.29 ± 17.58 years and 49.88% were female. The mean interval between ECG and right heart catheterization or echocardiogram was 3.66 and 2.23 days for patients with PH and patients without PH, respectively. In the test dataset, the model achieved an area under the receiver operating characteristic curve, sensitivity, and specificity, respectively of 0.89, 0.79, and 0.84 to detect PH; 0.91, 0.83, and 0.84 to detect precapillary PH; 0.88, 0.81, and 0.81 to detect pulmonary arterial hypertension, and 0.80, 0.73, and 0.76 to detect group 3 PH. We additionally applied the trained model on ECGs from participants in the test dataset that were obtained from up to 2 years before diagnosis of PH; the area under the receiver operating characteristic curve was 0.79 or greater. CONCLUSIONS: A deep learning ECG algorithm can detect PH and PH subtypes around the time of diagnosis and can detect PH using ECGs that were done up to 2 years before right heart catheterization/echocardiogram diagnosis. This approach has the potential to decrease diagnostic delays in PH.
Asunto(s)
Aprendizaje Profundo , Insuficiencia Cardíaca , Hipertensión Pulmonar , Adulto , Humanos , Femenino , Masculino , Hipertensión Pulmonar/diagnóstico , Estudios Retrospectivos , Electrocardiografía/métodosRESUMEN
BACKGROUND: Transfusion-associated circulatory overload (TACO) is a severe adverse reaction (AR) contributing to the leading cause of mortality associated with transfusions. As strategies to mitigate TACO have been increasingly adopted, an update of prevalence rates and risk factors associated with TACO using the growing sources of electronic health record (EHR) data can help understand transfusion safety. STUDY DESIGN AND METHODS: This retrospective study aimed to provide a timely and reproducible assessment of prevalence rates and risk factors associated with TACO. Novel natural language processing methods, now made publicly available on GitHub, were developed to extract ARs from 3178 transfusion reaction reports. Other patient-level data were extracted computationally from UCSF EHR between 2012 and 2022. The odds ratio estimates of risk factors were calculated using a multivariate logistic regression analysis with case-to-control matched on sex and age at a ratio of 1:5. RESULTS: A total of 56,208 patients received transfusions (total 573,533 units) at UCSF during the study period and 102 patients developed TACO. The prevalence of TACO was estimated to be 0.2% per patient (102/total 56,208). Patients with a history of coagulopathy (OR, 1.36; 95% CI, 1.04-1.79) and transplant (OR, 1.99; 95% CI, 1.48-2.68) were associated with increased odds of TACO. DISCUSSION: While TACO is a serious AR, events remained rare, even in populations enriched with high-risk patients. Novel computational methods can be used to find and continually surveil for transfusion ARs. Results suggest that patients with history or presence of coagulopathy and organ transplant should be carefully monitored to mitigate potential risks of TACO.
Asunto(s)
Registros Electrónicos de Salud , Reacción a la Transfusión , Humanos , Estudios Retrospectivos , Reacción a la Transfusión/epidemiología , Transfusión Sanguínea/métodos , Factores de RiesgoRESUMEN
BACKGROUND: The advent of clinical trial data sharing platforms has created opportunities for making new discoveries and answering important questions using already collected data. However, existing methods for meta-analyzing these data require the presence of shared control groups across studies, significantly limiting the number of questions that can be confidently addressed. We sought to develop a method for meta-analyzing potentially heterogeneous clinical trials even in the absence of a common control group. METHODS: This work was conducted within the context of a broader effort to study comparative efficacy in Crohn's disease. Following a search of clnicaltrials.gov we obtained access to the individual participant data from nine trials of FDA-approved treatments in Crohn's Disease (N = 3392). We developed a method involving sequences of regression and simulation to separately model the placebo- and drug-attributable effects, and to simulate head-to-head trials against an appropriately normalized background. We validated this method by comparing the outcome of a simulated trial comparing the efficacies of adalimumab and ustekinumab against the recently published results of SEAVUE, an actual head-to-head trial of these drugs. This study was pre-registered on PROSPERO (#157,827) prior to the completion of SEAVUE. RESULTS: Using our method of sequential regression and simulation, we compared the week eight outcomes of two virtual cohorts subject to the same patient selection criteria as SEAVUE and treated with adalimumab or ustekinumab. Our primary analysis replicated the corresponding published results from SEAVUE (p = 0.9). This finding proved stable under multiple sensitivity analyses. CONCLUSIONS: This new method may help reduce the bias of individual participant data meta-analyses, expand the scope of what can be learned from these already-collected data, and reduce the costs of obtaining high-quality evidence to guide patient care.
Asunto(s)
Enfermedad de Crohn , Ustekinumab , Humanos , Adalimumab/uso terapéutico , Grupos Control , Enfermedad de Crohn/tratamiento farmacológico , Inducción de Remisión , Ustekinumab/uso terapéutico , Ensayos Clínicos como AsuntoRESUMEN
Cytometry technologies are essential tools for immunology research, providing high-throughput measurements of the immune cells at the single-cell level. Existing approaches in interpreting and using cytometry measurements include manual or automated gating to identify cell subsets from the cytometry data, providing highly intuitive results but may lead to significant information loss, in that additional details in measured or correlated cell signals might be missed. In this study, we propose and test a deep convolutional neural network for analyzing cytometry data in an end-to-end fashion, allowing a direct association between raw cytometry data and the clinical outcome of interest. Using nine large cytometry by time-of-flight mass spectrometry or mass cytometry (CyTOF) studies from the open-access ImmPort database, we demonstrated that the deep convolutional neural network model can accurately diagnose the latent cytomegalovirus (CMV) in healthy individuals, even when using highly heterogeneous data from different studies. In addition, we developed a permutation-based method for interpreting the deep convolutional neural network model. We were able to identify a CD27- CD94+ CD8+ T cell population significantly associated with latent CMV infection, confirming the findings in previous studies. Finally, we provide a tutorial for creating, training, and interpreting the tailored deep learning model for cytometry data using Keras and TensorFlow (https://github.com/hzc363/DeepLearningCyTOF).
Asunto(s)
Aprendizaje Profundo , Citometría de Flujo , Infecciones por Citomegalovirus/diagnóstico , Humanos , Linfocitos T/citologíaRESUMEN
It is unclear what mechanisms govern latent HIV infection in vivo or in primary cell models. To investigate these questions, we compared the HIV and cellular transcription profile in three primary cell models and peripheral CD4+ T cells from HIV-infected ART-suppressed individuals using RT-ddPCR and RNA-seq. All primary cell models recapitulated the block to HIV multiple splicing seen in cells from ART-suppressed individuals, suggesting that this may be a key feature of HIV latency in primary CD4+ T cells. Blocks to HIV transcriptional initiation and elongation were observed more variably among models. A common set of 234 cellular genes, including members of the minor spliceosome pathway, was differentially expressed between unstimulated and activated cells from primary cell models and ART-suppressed individuals, suggesting these genes may play a role in the blocks to HIV transcription and splicing underlying latent infection. These genes may represent new targets for therapies designed to reactivate or silence latently-infected cells.
Asunto(s)
Linfocitos T CD4-Positivos/virología , Infecciones por VIH/virología , VIH-1/genética , Transcriptoma , Latencia del Virus/genética , Antirretrovirales/uso terapéutico , VIH-1/fisiología , Humanos , ARN Viral/genéticaRESUMEN
BACKGROUND: Accurate, pragmatic risk stratification for postoperative delirium (POD) is necessary to target preventative resources toward high-risk patients. Machine learning (ML) offers a novel approach to leveraging electronic health record (EHR) data for POD prediction. We sought to develop and internally validate a ML-derived POD risk prediction model using preoperative risk features, and to compare its performance to models developed with traditional logistic regression. METHODS: This was a retrospective analysis of preoperative EHR data from 24,885 adults undergoing a procedure requiring anesthesia care, recovering in the main post-anesthesia care unit, and staying in the hospital at least overnight between December 2016 and December 2019 at either of two hospitals in a tertiary care health system. One hundred fifteen preoperative risk features including demographics, comorbidities, nursing assessments, surgery type, and other preoperative EHR data were used to predict postoperative delirium (POD), defined as any instance of Nursing Delirium Screening Scale ≥2 or positive Confusion Assessment Method for the Intensive Care Unit within the first 7 postoperative days. Two ML models (Neural Network and XGBoost), two traditional logistic regression models ("clinician-guided" and "ML hybrid"), and a previously described delirium risk stratification tool (AWOL-S) were evaluated using the area under the receiver operating characteristic curve (AUC-ROC), sensitivity, specificity, positive likelihood ratio, and positive predictive value. Model calibration was assessed with a calibration curve. Patients with no POD assessments charted or at least 20% of input variables missing were excluded. RESULTS: POD incidence was 5.3%. The AUC-ROC for Neural Net was 0.841 [95% CI 0. 816-0.863] and for XGBoost was 0.851 [95% CI 0.827-0.874], which was significantly better than the clinician-guided (AUC-ROC 0.763 [0.734-0.793], p < 0.001) and ML hybrid (AUC-ROC 0.824 [0.800-0.849], p < 0.001) regression models and AWOL-S (AUC-ROC 0.762 [95% CI 0.713-0.812], p < 0.001). Neural Net, XGBoost, and ML hybrid models demonstrated excellent calibration, while calibration of the clinician-guided and AWOL-S models was moderate; they tended to overestimate delirium risk in those already at highest risk. CONCLUSION: Using pragmatically collected EHR data, two ML models predicted POD in a broad perioperative population with high discrimination. Optimal application of the models would provide automated, real-time delirium risk stratification to improve perioperative management of surgical patients at risk for POD.
Asunto(s)
Delirio/diagnóstico , Registros Electrónicos de Salud/estadística & datos numéricos , Aprendizaje Automático , Complicaciones Posoperatorias/diagnóstico , Anciano , Estudios de Cohortes , Femenino , Humanos , Masculino , Persona de Mediana Edad , Valor Predictivo de las Pruebas , Periodo Preoperatorio , Reproducibilidad de los Resultados , Estudios RetrospectivosRESUMEN
Recent remarkable advances in genome sequencing have enabled detailed maps of identified and interpreted genomic variation, dubbed "mutanomes." The availability of thousands of exome/genome sequencing data has prompted the emergence of new challenges in the identification of novel druggable targets and therapeutic strategies. Typically, mutanomes are viewed as one- or two-dimensional. The three-dimensional protein structural view of personal mutanomes sheds light on the functional consequences of clinically actionable mutations revealed in tumor diagnosis and followed up in personalized treatments, in a mutanome-informed manner. In this review, we describe the protein structural landscape of personal mutanomes and provide expert opinions on rational strategies for more streamlined oncological drug discovery and molecularly targeted therapies for each individual and each tumor. We provide the structural mechanism of orthosteric versus allosteric drugs at the atom-level via targeting specific somatic alterations for combating drug resistance and the "undruggable" challenges in solid and hematologic neoplasias. We discuss computational biophysics strategies for innovative mutanome-informed cancer immunotherapies and combination immunotherapies. Finally, we highlight a personal mutanome infrastructure for the emerging development of personalized cancer medicine using a breast cancer case study.
Asunto(s)
Descubrimiento de Drogas , Mutación , Neoplasias/genética , Genómica , Humanos , Inmunoterapia , Terapia Molecular Dirigida , Neoplasias/terapia , Medicina de PrecisiónRESUMEN
There is an urgent need to identify therapies that prevent SARS-CoV-2 infection and improve the outcome of COVID-19 patients. Although repurposed drugs with favorable safety profiles could have significant benefit, widely available prevention or treatment options for COVID-19 have yet to be identified. Efforts to identify approved drugs with in vitro activity against SARS-CoV-2 resulted in identification of antiviral sigma-1 receptor ligands, including antihistamines in the histamine-1 receptor binding class. We identified antihistamine candidates for repurposing by mining electronic health records of usage in population of more than 219,000 subjects tested for SARS-CoV-2. Usage of diphenhydramine, hydroxyzine and azelastine was associated with reduced incidence of SARS-CoV-2 positivity in subjects greater than age 61. We found diphenhydramine, hydroxyzine and azelastine to exhibit direct antiviral activity against SARS-CoV-2 in vitro. Although mechanisms by which specific antihistamines exert antiviral effects is not clear, hydroxyzine, and possibly azelastine, bind Angiotensin Converting Enzyme-2 (ACE2) and the sigma-1 receptor as off-targets. Clinical studies are needed to measure the effectiveness of diphenhydramine, hydroxyzine and azelastine for disease prevention, for early intervention, or as adjuvant therapy for severe COVID-19.
Asunto(s)
Enzima Convertidora de Angiotensina 2/química , Antivirales/química , Tratamiento Farmacológico de COVID-19 , Reposicionamiento de Medicamentos , Antagonistas de los Receptores Histamínicos/química , SARS-CoV-2/efectos de los fármacos , Enzima Convertidora de Angiotensina 2/genética , Enzima Convertidora de Angiotensina 2/metabolismo , Animales , Antivirales/farmacología , Antivirales/uso terapéutico , Dominio Catalítico , Chlorocebus aethiops , Células HEK293 , Antagonistas de los Receptores Histamínicos/farmacología , Antagonistas de los Receptores Histamínicos/uso terapéutico , Humanos , Ligandos , Unión Proteica , Receptores Histamínicos/química , Receptores sigma/química , Células Vero , Receptor Sigma-1RESUMEN
Over a decade ago, the Atacama humanoid skeleton (Ata) was discovered in the Atacama region of Chile. The Ata specimen carried a strange phenotype-6-in stature, fewer than expected ribs, elongated cranium, and accelerated bone age-leading to speculation that this was a preserved nonhuman primate, human fetus harboring genetic mutations, or even an extraterrestrial. We previously reported that it was human by DNA analysis with an estimated bone age of about 6-8 yr at the time of demise. To determine the possible genetic drivers of the observed morphology, DNA from the specimen was subjected to whole-genome sequencing using the Illumina HiSeq platform with an average 11.5× coverage of 101-bp, paired-end reads. In total, 3,356,569 single nucleotide variations (SNVs) were found as compared to the human reference genome, 518,365 insertions and deletions (indels), and 1047 structural variations (SVs) were detected. Here, we present the detailed whole-genome analysis showing that Ata is a female of human origin, likely of Chilean descent, and its genome harbors mutations in genes (COL1A1, COL2A1, KMT2D, FLNB, ATR, TRIP11, PCNT) previously linked with diseases of small stature, rib anomalies, cranial malformations, premature joint fusion, and osteochondrodysplasia (also known as skeletal dysplasia). Together, these findings provide a molecular characterization of Ata's peculiar phenotype, which likely results from multiple known and novel putative gene mutations affecting bone development and ossification.
Asunto(s)
ADN Antiguo/análisis , Genoma Humano/genética , Osteocondrodisplasias/genética , Secuenciación Completa del Genoma , Animales , Femenino , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Mutación INDEL , Anotación de Secuencia Molecular , Mutación/genética , Osteocondrodisplasias/fisiopatología , Fenotipo , Polimorfismo de Nucleótido Simple/genéticaRESUMEN
MOTIVATION: Flow cytometry and mass cytometry are widely used to diagnose diseases and to predict clinical outcomes. When associating clinical features with cytometry data, traditional analysis methods require cell gating as an intermediate step, leading to information loss and susceptibility to batch effects. Here, we wish to explore an alternative approach that predicts clinical features from cytometry data without the cell-gating step. We also wish to test if such a gating-free approach increases the accuracy and robustness of the prediction. RESULTS: We propose a novel strategy (CytoDx) to predict clinical outcomes using cytometry data without cell gating. Applying CytoDx on real-world datasets allow us to predict multiple types of clinical features. In particular, CytoDx is able to predict the response to influenza vaccine using highly heterogeneous datasets, demonstrating that it is not only accurate but also robust to batch effects and cytometry platforms. AVAILABILITY AND IMPLEMENTATION: CytoDx is available as an R package on Bioconductor (bioconductor.org/packages/CytoDx). Data and scripts for reproducing the results are available on bitbucket.org/zichenghu_ucsf/cytodx_study_code/downloads. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.