RESUMEN
Advanced imaging methods now allow cell-type-specific recording of neural activity across the mammalian brain, potentially enabling the exploration of how brain-wide dynamical patterns give rise to complex behavioural states1-12. Dissociation is an altered behavioural state in which the integrity of experience is disrupted, resulting in reproducible cognitive phenomena including the dissociation of stimulus detection from stimulus-related affective responses. Dissociation can occur as a result of trauma, epilepsy or dissociative drug use13,14, but despite its substantial basic and clinical importance, the underlying neurophysiology of this state is unknown. Here we establish such a dissociation-like state in mice, induced by precisely-dosed administration of ketamine or phencyclidine. Large-scale imaging of neural activity revealed that these dissociative agents elicited a 1-3-Hz rhythm in layer 5 neurons of the retrosplenial cortex. Electrophysiological recording with four simultaneously deployed high-density probes revealed rhythmic coupling of the retrosplenial cortex with anatomically connected components of thalamus circuitry, but uncoupling from most other brain regions was observed-including a notable inverse correlation with frontally projecting thalamic nuclei. In testing for causal significance, we found that rhythmic optogenetic activation of retrosplenial cortex layer 5 neurons recapitulated dissociation-like behavioural effects. Local retrosplenial hyperpolarization-activated cyclic-nucleotide-gated potassium channel 1 (HCN1) pacemakers were required for systemic ketamine to induce this rhythm and to elicit dissociation-like behavioural effects. In a patient with focal epilepsy, simultaneous intracranial stereoencephalography recordings from across the brain revealed a similarly localized rhythm in the homologous deep posteromedial cortex that was temporally correlated with pre-seizure self-reported dissociation, and local brief electrical stimulation of this region elicited dissociative experiences. These results identify the molecular, cellular and physiological properties of a conserved deep posteromedial cortical rhythm that underlies states of dissociation.
Asunto(s)
Ondas Encefálicas/fisiología , Corteza Cerebral/fisiología , Trastornos Disociativos/fisiopatología , Potenciales de Acción/efectos de los fármacos , Animales , Conducta/efectos de los fármacos , Ondas Encefálicas/efectos de los fármacos , Corteza Cerebral/citología , Corteza Cerebral/diagnóstico por imagen , Corteza Cerebral/efectos de los fármacos , Trastornos Disociativos/diagnóstico por imagen , Electrofisiología , Femenino , Humanos , Canales Regulados por Nucleótidos Cíclicos Activados por Hiperpolarización/metabolismo , Ketamina/farmacología , Masculino , Ratones , Ratones Endogámicos C57BL , Neuronas/efectos de los fármacos , Optogenética , Autoinforme , Tálamo/citología , Tálamo/diagnóstico por imagen , Tálamo/efectos de los fármacos , Tálamo/fisiologíaRESUMEN
Endometriosis is a common estrogen-dependent disorder wherein uterine lining tissue (endometrium) is found mainly in the pelvis where it causes inflammation, chronic pelvic pain, pain with intercourse and menses, and infertility. Recent evidence also supports a systemic inflammatory component that underlies associated co-morbidities, e.g., migraines and cardiovascular and autoimmune diseases. Genetics and environment contribute significantly to disease risk, and with the explosion of omics technologies, underlying mechanisms of symptoms are increasingly being elucidated, although novel and effective therapeutics for pain and infertility have lagged behind these advances. Moreover, there are stark disparities in diagnosis, access to care, and treatment among persons of color and transgender/nonbinary identity, socioeconomically disadvantaged populations, and adolescents, and a disturbing low awareness among health care providers, policymakers, and the lay public about endometriosis, which, if left undiagnosed and under-treated can lead to significant fibrosis, infertility, depression, and markedly diminished quality of life. This review summarizes endometriosis epidemiology, compelling evidence for its pathogenesis, mechanisms underlying its pathophysiology in the age of precision medicine, recent biomarker discovery, novel therapeutic approaches, and issues around reproductive justice for marginalized populations with this disorder spanning the past 100 years. As we enter the next revolution in health care and biomedical research, with rich molecular and clinical datasets, single-cell omics, and population-level data, endometriosis is well positioned to benefit from data-driven research leveraging computational and artificial intelligence approaches integrating data and predicting disease risk, diagnosis, response to medical and surgical therapies, and prognosis for recurrence.
Asunto(s)
Dolor Crónico , Endometriosis , Adolescente , Humanos , Femenino , Anciano de 80 o más Años , Medicina de Precisión , Endometriosis/epidemiología , Endometriosis/terapia , Longevidad , Inteligencia Artificial , Calidad de Vida , Salud ReproductivaRESUMEN
Drug repurposing requires distinguishing established drug class targets from novel molecule-specific mechanisms and rapidly derisking their therapeutic potential in a time-critical manner, particularly in a pandemic scenario. In response to the challenge to rapidly identify treatment options for COVID-19, several studies reported that statins, as a drug class, reduce mortality in these patients. However, it is unknown if different statins exhibit consistent function or may have varying therapeutic benefit. A Bayesian network tool was used to predict drugs that shift the host transcriptomic response to SARS-CoV-2 infection towards a healthy state. Drugs were predicted using 14 RNA-sequencing datasets from 72 autopsy tissues and 465 COVID-19 patient samples or from cultured human cells and organoids infected with SARS-CoV-2. Top drug predictions included statins, which were then assessed using electronic medical records containing over 4,000 COVID-19 patients on statins to determine mortality risk in patients prescribed specific statins versus untreated matched controls. The same drugs were tested in Vero E6 cells infected with SARS-CoV-2 and human endothelial cells infected with a related OC43 coronavirus. Simvastatin was among the most highly predicted compounds (14/14 datasets) and five other statins, including atorvastatin, were predicted to be active in > 50% of analyses. Analysis of the clinical database revealed that reduced mortality risk was only observed in COVID-19 patients prescribed a subset of statins, including simvastatin and atorvastatin. In vitro testing of SARS-CoV-2 infected cells revealed simvastatin to be a potent direct inhibitor whereas most other statins were less effective. Simvastatin also inhibited OC43 infection and reduced cytokine production in endothelial cells. Statins may differ in their ability to sustain the lives of COVID-19 patients despite having a shared drug target and lipid-modifying mechanism of action. These findings highlight the value of target-agnostic drug prediction coupled with patient databases to identify and clinically evaluate non-obvious mechanisms and derisk and accelerate drug repurposing opportunities.
Asunto(s)
COVID-19 , Inhibidores de Hidroximetilglutaril-CoA Reductasas , Humanos , Inhibidores de Hidroximetilglutaril-CoA Reductasas/farmacología , Inhibidores de Hidroximetilglutaril-CoA Reductasas/uso terapéutico , SARS-CoV-2 , Atorvastatina/farmacología , Teorema de Bayes , Células Endoteliales , Simvastatina/farmacología , Simvastatina/uso terapéutico , Reposicionamiento de Medicamentos , Registros MédicosRESUMEN
Artificial Intelligence (AI) has the power to improve our lives through a wide variety of applications, many of which fall into the healthcare space; however, a lack of diversity is contributing to limitations in how broadly AI can help people. The UCSF AI4ALL program was established in 2019 to address this issue by targeting high school students from underrepresented backgrounds in AI, giving them a chance to learn about AI with a focus on biomedicine, and promoting diversity and inclusion. In 2020, the UCSF AI4ALL three-week program was held entirely online due to the COVID-19 pandemic. Thus, students participated virtually to gain experience with AI, interact with diverse role models in AI, and learn about advancing health through AI. Specifically, they attended lectures in coding and AI, received an in-depth research experience through hands-on projects exploring COVID-19, and engaged in mentoring and personal development sessions with faculty, researchers, industry professionals, and undergraduate and graduate students, many of whom were women and from underrepresented racial and ethnic backgrounds. At the conclusion of the program, the students presented the results of their research projects at the final symposium. Comparison of pre- and post-program survey responses from students demonstrated that after the program, significantly more students were familiar with how to work with data and to evaluate and apply machine learning algorithms. There were also nominally significant increases in the students' knowing people in AI from historically underrepresented groups, feeling confident in discussing AI, and being aware of careers in AI. We found that we were able to engage young students in AI via our online training program and nurture greater diversity in AI. This work can guide AI training programs aspiring to engage and educate students entirely online, and motivate people in AI to strive towards increasing diversity and inclusion in this field.
Asunto(s)
Inteligencia Artificial , Investigación Biomédica , Biología Computacional , Diversidad Cultural , Tutoría , Adolescente , Investigación Biomédica/educación , Investigación Biomédica/organización & administración , Biología Computacional/educación , Biología Computacional/organización & administración , Femenino , Humanos , Masculino , Grupos Minoritarios , EstudiantesRESUMEN
Human neuroscience research has been significantly advanced by neuroelectrophysiological studies from people with refractory epilepsy-the only routine clinical intervention that acquires multi-day, multi-electrode human intracranial electroencephalography (iEEG). While a sampling rate below 2 kHz is sufficient for manual iEEG review by epileptologists, computational methods and research studies may benefit from higher resolution, which requires significant technical development. At adult and pediatric Stanford hospitals, research ports of commercial clinical acquisition systems were configured to collect 10 kHz iEEG of up to 256 electrodes simultaneously with the clinical data. The research digital stream was designed to be acquired post-digitization, resulting in no loss in clinical signal quality. This novel framework implements a near-invisible research platform to facilitate the secure, routine collection of high-resolution iEEG that minimizes research hardware footprint and clinical workflow interference. The addition of a pocket-sized router in the patient room enabled an encrypted tunnel to securely transmit research-quality iEEG across hospital networks to a research computer within the hospital server room, where data was coded, de-identified, and uploaded to cloud storage. Every eligible patient undergoing iEEG clinical evaluation at both hospitals since September 2017 has been recruited; participant recruitment is ongoing. Over 350+ terabytes (representing 1000+ days) of neuroelectrophysiology were recorded across 200+ participants of diverse demographics. To our knowledge, this is the first report of such a research integration within a hospital setting. It is a promising approach to promoting equitable participant enrollment and building comprehensive data repositories with consistent, high-fidelity specifications towards new discoveries in human neuroscience.
Asunto(s)
Electrocorticografía , Humanos , Adulto , Masculino , Femenino , Electrocorticografía/métodos , Electrocorticografía/instrumentación , Niño , Adolescente , Electroencefalografía/métodos , Electroencefalografía/instrumentación , Persona de Mediana Edad , Adulto Joven , Procesamiento de Señales Asistido por Computador , Epilepsia Refractaria/fisiopatologíaRESUMEN
This perspective explores the transformative potential of data-driven insights to understand and address women's reproductive health conditions. Historically, clinical studies often excluded women, hindering comprehensive research into conditions such as adverse pregnancy outcomes and endometriosis. Recent advances in technology (e.g., next-generation sequencing techniques, electronic medical records (EMRs), computational power) provide unprecedented opportunities for research in women's reproductive health. Studies of molecular data, including large-scale meta-analyses, provide valuable insights into conditions like preterm birth and preeclampsia. Moreover, EMRs and other clinical data sources enable researchers to study populations of individuals, uncovering trends and associations in women's reproductive health conditions. Despite these advancements, challenges such as data completeness, accuracy, and representation persist. We emphasize the importance of holistic approaches, greater inclusion, and refining and expanding on how we leverage data and computational integrative approaches for discoveries so that we can benefit not only women's reproductive health but overall human health.
RESUMEN
With the increasing availability of rich, longitudinal, real-world clinical data recorded in electronic health records (EHRs) for millions of patients, there is a growing interest in leveraging these records to improve the understanding of human health and disease and translate these insights into clinical applications. However, there is also a need to consider the limitations of these data due to various biases and to understand the impact of missing information. Recognizing and addressing these limitations can inform the design and interpretation of EHR-based informatics studies that avoid confusing or incorrect conclusions, particularly when applied to population or precision medicine. Here we discuss key considerations in the design, implementation and interpretation of EHR-based informatics studies, drawing from examples in the literature across hypothesis generation, hypothesis testing and machine learning applications. We outline the growing opportunities for EHR-based informatics studies, including association studies and predictive modeling, enabled by evolving AI capabilities-while addressing limitations and potential pitfalls to avoid.
Asunto(s)
Registros Electrónicos de Salud , Humanos , Aprendizaje Automático , Investigación Biomédica , Medicina de Precisión/métodos , Informática Médica/métodosRESUMEN
Delirium is a detrimental mental condition often seen in older, hospitalized patients and is currently hard to predict. In this study, we leverage electronic health records (EHR) to identify 7,492 UCSF patients and 19,417 UC health system patients with an inpatient delirium diagnosis and the same number of control patients without delirium. We found significant associations between comorbidities or laboratory values and an inpatient delirium diagnosis, including metabolic abnormalities and psychiatric diagnoses. Some associations were sex-specific, including dementia subtypes and infections. We further explored the associations with anemia and bipolar disorder by conducting longitudinal analyses from the time of first diagnosis to development of delirium, demonstrating a significant relationship across time. Finally, we show that an inpatient delirium diagnosis leads to increased risk of mortality. These results demonstrate the powerful application of the EHR to shed insights into prior diagnoses and laboratory values that could help predict development of inpatient delirium and the importance of sex when making these assessments.
RESUMEN
Molecular studies of Alzheimer's disease (AD) implicate potential links between autoimmunity and AD, but the underlying clinical relationships between these conditions remain poorly understood. Electronic health records (EHRs) provide an opportunity to determine the clinical risk relationship between autoimmune disorders and AD and understand whether specific disorders and disorder subtypes affect AD risk at the phenotypic level in human populations. We evaluated relationships between 26 autoimmune disorders and AD across retrospective observational case-control and cohort study designs in the EHR systems at UCSF and Stanford. We quantified overall and sex-specific AD risk effects that these autoimmune disorders confer. We identified significantly increased AD risk in autoimmune disorder patients in both study designs at UCSF and at Stanford. This pattern was driven by specific autoimmunity subtypes including endocrine, gastrointestinal, dermatologic, and musculoskeletal disorders. We also observed increased AD risk from autoimmunity in both women and men, but women with autoimmune disorders continued to have a higher AD prevalence than men, indicating persistent sex-specificity. This study identifies autoimmune disorders as strong risk factors for AD that validate across several study designs and EHR databases. It sets the foundation for exploring how underlying autoimmune mechanisms increase AD risk and contribute to AD pathogenesis.
RESUMEN
Existing medical treatments for endometriosis-related pain are often ineffective, underscoring the need for new therapeutic strategies. In this study, we applied a computational drug repurposing pipeline to stratified and unstratified disease signatures based on endometrial gene expression data to identify potential therapeutics from existing drugs, based on expression reversal. Of 3,131 unique genes differentially expressed by at least one of six endometriosis signatures, only 308 (9.8%) were in common; however, 221 out of 299 drugs identified, (73.9%) were shared. We selected fenoprofen, an uncommonly prescribed NSAID that was the top therapeutic candidate for further investigation. When testing fenoprofen in an established rat model of endometriosis, fenoprofen successfully alleviated endometriosis-associated vaginal hyperalgesia, a surrogate marker for endometriosis-related pain. These findings validate fenoprofen as a therapeutic that could be utilized more frequently for endometriosis and suggest the utility of the aforementioned computational drug repurposing approach for endometriosis.
RESUMEN
Identification of Alzheimer's disease (AD) onset risk can facilitate interventions before irreversible disease progression. We demonstrate that electronic health records from the University of California, San Francisco, followed by knowledge networks (for example, SPOKE) allow for (1) prediction of AD onset and (2) prioritization of biological hypotheses, and (3) contextualization of sex dimorphism. We trained random forest models and predicted AD onset on a cohort of 749 individuals with AD and 250,545 controls with a mean area under the receiver operating characteristic of 0.72 (7 years prior) to 0.81 (1 day prior). We further harnessed matched cohort models to identify conditions with predictive power before AD onset. Knowledge networks highlight shared genes between multiple top predictors and AD (for example, APOE, ACTB, IL6 and INS). Genetic colocalization analysis supports AD association with hyperlipidemia at the APOE locus, as well as a stronger female AD association with osteoporosis at a locus near MS4A6A. We therefore show how clinical data can be utilized for early AD prediction and identification of personalized biological hypotheses.
Asunto(s)
Enfermedad de Alzheimer , Masculino , Humanos , Femenino , Enfermedad de Alzheimer/diagnóstico , Registros Electrónicos de Salud , Apolipoproteínas E/genética , San FranciscoRESUMEN
Adoption of high-content omic technologies in clinical studies, coupled with computational methods, has yielded an abundance of candidate biomarkers. However, translating such findings into bona fide clinical biomarkers remains challenging. To facilitate this process, we introduce Stabl, a general machine learning method that identifies a sparse, reliable set of biomarkers by integrating noise injection and a data-driven signal-to-noise threshold into multivariable predictive modeling. Evaluation of Stabl on synthetic datasets and five independent clinical studies demonstrates improved biomarker sparsity and reliability compared to commonly used sparsity-promoting regularization methods while maintaining predictive performance; it distills datasets containing 1,400-35,000 features down to 4-34 candidate biomarkers. Stabl extends to multi-omic integration tasks, enabling biological interpretation of complex predictive models, as it hones in on a shortlist of proteomic, metabolomic and cytometric events predicting labor onset, microbial biomarkers of pre-term birth and a pre-operative immune signature of post-surgical infections. Stabl is available at https://github.com/gregbellan/Stabl .
Asunto(s)
Biomarcadores , Aprendizaje Automático , Biomarcadores/metabolismo , Humanos , Proteómica/métodos , Biología Computacional/métodos , Metabolómica/métodos , Reproducibilidad de los ResultadosRESUMEN
Objectives: To enable interactive visualization of the vaginal microbiome across the pregnancy and facilitate discovery of novel insights and generation of new hypotheses. Material and Methods: Vaginal Microbiome Atlas during Pregnancy (VMAP) was created with R shiny to generate visualizations of structured vaginal microbiome data from multiple studies. Results: VMAP (http://vmapapp.org) visualizes 3880 vaginal microbiome samples of 1402 pregnant individuals from 11 studies, aggregated via open-source tool MaLiAmPi. Visualized features include diversity measures, VALENCIA community state types, and composition (phylotypes, taxonomy) that can be filtered by various categories. Discussion: This work represents one of the largest and most geographically diverse aggregations of the vaginal microbiome in pregnancy to date and serves as a user-friendly resource to further analyze vaginal microbiome data and better understand pregnancies and associated outcomes. Conclusion: VMAP can be obtained from https://github.com/msirota/vmap.git and is currently deployed as an online app for non-R users.
RESUMEN
Every year, 11% of infants are born preterm with significant health consequences, with the vaginal microbiome a risk factor for preterm birth. We crowdsource models to predict (1) preterm birth (PTB; <37 weeks) or (2) early preterm birth (ePTB; <32 weeks) from 9 vaginal microbiome studies representing 3,578 samples from 1,268 pregnant individuals, aggregated from public raw data via phylogenetic harmonization. The predictive models are validated on two independent unpublished datasets representing 331 samples from 148 pregnant individuals. The top-performing models (among 148 and 121 submissions from 318 teams) achieve area under the receiver operator characteristic (AUROC) curve scores of 0.69 and 0.87 predicting PTB and ePTB, respectively. Alpha diversity, VALENCIA community state types, and composition are important features in the top-performing models, most of which are tree-based methods. This work is a model for translation of microbiome data into clinically relevant predictive models and to better understand preterm birth.
Asunto(s)
Colaboración de las Masas , Microbiota , Nacimiento Prematuro , Embarazo , Femenino , Recién Nacido , Humanos , Filogenia , Vagina , Microbiota/genéticaRESUMEN
BACKGROUND: Alzheimer's dementia (AD) is a neurodegenerative disease that is disproportionately prevalent in racially marginalized individuals. However, due to research underrepresentation, the spectrum of AD-associated comorbidities that increase AD risk or suggest AD treatment disparities in these individuals is not completely understood. We leveraged electronic medical records (EMR) to explore AD-associated comorbidities and disease networks in racialized individuals identified as Asian, Non-Latine Black, Latine, or Non-Latine White. METHODS: We performed low-dimensional embedding, differential analysis, and disease network-based analyses of 5664 patients with AD and 11,328 demographically matched controls across two EMR systems and five medical centers, with equal representation of Asian-, Non-Latine Black-, Latine-, and Non-Latine White-identified individuals. For low-dimensional embedding and disease network comparisons, Mann-Whitney U tests or Kruskal-Wallis tests followed by Dunn's tests were used to compare categories. Fisher's exact or chi-squared tests were used for differential analysis. Spearman's rank correlation coefficients were used to compare results between the two EMR systems. RESULTS: Here we show that primarily established AD-associated comorbidities, such as essential hypertension and major depressive disorder, are generally similar across racialized populations. However, a few comorbidities, including respiratory diseases, may be significantly associated with AD in Black- and Latine- identified individuals. CONCLUSIONS: Our study revealed similarities and differences in AD-associated comorbidities and disease networks between racialized populations. Our approach could be a starting point for hypothesis-driven studies that can further explore the relationship between these comorbidities and AD in racialized populations, potentially identifying interventions that can reduce AD health disparities.
Black- and Latine- identified individuals in the United States are more likely to have Alzheimer's dementia (AD) relative to Asian- and White- identified individuals. Despite this, Black- and Latine- identified individuals are less likely to be included in studies that attempt to understand and treat AD. Patients' medical information, electronically recorded by healthcare providers, was used to explore whether patients with AD were more likely to have different conditions relative to patients who do not have AD. We did this analysis separately for Asian-, Non-Latine Black-, Latine- and Non-Latine White- identified individuals for a total of four analyses. While we found many conditions that were shared by all individuals, a few, such as lung-related diseases, may be more common in specific identified race and ethnicity categories.
RESUMEN
Delirium is a heterogeneous and detrimental mental condition often seen in older, hospitalized patients and is currently hard to predict. In this study, we leverage large-scale, real- world data using the electronic health records (EHR) to identify two cohorts comprised of 7,492 UCSF patients and 19,417 UC health system patients (excluding UCSF patients) with an inpatient delirium diagnosis and the same number of propensity score-matched control patients without delirium. We found significant associations between comorbidities or laboratory test values and an inpatient delirium diagnosis which were validated independently. Most of these associations were those previously-identified as risk factors for delirium, including metabolic abnormalities, mental health diagnoses, and infections. Some of the associations were sex- specific, including those related to dementia subtypes and infections. We further explored the diagnostic associations with anemia and bipolar disorder by conducting longitudinal analyses from the time of first diagnosis of the risk factor to development of delirium demonstrating a significant relationship across time. Finally, we show that an inpatient delirium diagnosis leads to dramatic increases in mortality outcome across both cohorts. These results demonstrate the powerful application of leveraging EHR data to shed insights into prior diagnoses and laboratory test values that could help predict development of inpatient delirium and emphasize the importance of considering patient demographic characteristics including documented sex when making these assessments. One Sentence Summary: Longitudinal analysis of electronic health record data reveals associations between inpatient delirium, comorbidities, and mortality.
RESUMEN
Background: Preterm birth (PTB) is the leading cause of infant mortality and follows multiple biological pathways, many of which are poorly understood. Some PTBs result from medically indicated labor following complications from hypertension and/or diabetes, while many others are spontaneous with unknown causes. Previously, investigation of potential risk factors has been limited by lack of data on maternal medical history and the difficulty of classifying PTBs as indicated or spontaneous. Here, we leverage electronic health record (EHR) data (patient health information including demographics, diagnoses, and medications) and a supplemental curated pregnancy database to overcome these limitations. Novel associations may provide new insight into the pathophysiology of PTB as well as help identify individuals who would be at risk of PTB. Methods: We quantified associations between maternal diagnoses and preterm birth using logistic regression controlling for maternal age and socioeconomic factors within a University of California, San Francisco (UCSF), EHR cohort with 10,643 births ( nterm = 9692, nspontaneous_preterm = 449, nindicated_preterm = 418) and maternal pre-conception diagnosis phenotypes derived from International Classification of Diseases (ICD) 9 and 10 codes. Results: Eighteen conditions significantly and robustly (False Discovery Rate (FDR)<0.05) associated with PTBs compared to term. We discovered known (hypertension, diabetes, and chronic kidney disease) and less established (blood, cardiac, gynecological, and liver conditions) associations. Type 1 diabetes was the most significant overall association (adjusted p = 1.6×10 -14 , adjusted OR = 7 (95% CI 5, 12)), and the odds ratios for the significant phenotypes ranged from 3 to 13. We further carried out analysis stratified by spontaneous vs. indicated PTB. No phenotypes significantly associated with spontaneous PTB; however, the results for indicated PTB largely recapitulated the phenotype associations with all PTBs. Conclusions: Our study underscores the limitations of approaches that combine indicated and spontaneous births together. When combined, significant associations were almost entirely driven by indicated PTBs, although our spontaneous and indicated groups were of a similar size. Investigating the spontaneous population has the potential to reveal new pathways and understanding of the heterogeneity of PTB.
RESUMEN
For studies using microbiome data, the ability to robustly combine data from technically and biologically distinct microbiome studies is a crucial means of supporting more robust and clinically relevant inferences. Formidable technical challenges arise when attempting to combine data from technically diverse 16S rRNA gene variable region amplicon sequencing (16S) studies. Closed operational taxonomic units and taxonomy are criticized as being heavily dependent upon reference sets and with limited precision relative to the underlying biology. Phylogenetic placement has been demonstrated to be a promising taxonomy-free manner of harmonizing microbiome data, but it has lacked a validated count-based feature suitable for use in machine learning and association studies. Here we introduce a phylogenetic-placement-based, taxonomy-independent, compositional feature of microbiota: phylotypes. Phylotypes were predictive of clinical outcomes such as obesity or pre-term birth on technically diverse independent validation sets harmonized post hoc. Thus, phylotypes enable the rigorous cross-validation of 16S-based clinical prognostic models and associative microbiome studies.
Asunto(s)
Microbiota , Filogenia , ARN Ribosómico 16S/genética , Microbiota/genética , Aprendizaje AutomáticoRESUMEN
Recurrent pregnancy loss (RPL), defined as 2 or more pregnancy losses, affects 5-6% of ever-pregnant individuals. Approximately half of these cases have no identifiable explanation. To generate hypotheses about RPL etiologies, we implemented a case-control study comparing the history of over 1,600 diagnoses between RPL and live-birth patients, leveraging the University of California San Francisco (UCSF) and Stanford University electronic health record databases. In total, our study included 8,496 RPL (UCSF: 3,840, Stanford: 4,656) and 53,278 Control (UCSF: 17,259, Stanford: 36,019) patients. Menstrual abnormalities and infertility-associated diagnoses were significantly positively associated with RPL in both medical centers. Age-stratified analysis revealed that the majority of RPL-associated diagnoses had higher odds ratios for patients <35 compared with 35+ patients. While Stanford results were sensitive to control for healthcare utilization, UCSF results were stable across analyses with and without utilization. Intersecting significant results between medical centers was an effective filter to identify associations that are robust across center-specific utilization patterns.
RESUMEN
The vaginal microbiome has been shown to be associated with pregnancy outcomes including preterm birth (PTB) risk. Here we present VMAP: Vaginal Microbiome Atlas during Pregnancy (http://vmapapp.org), an application to visualize features of 3,909 vaginal microbiome samples of 1,416 pregnant individuals from 11 studies, aggregated from raw public and newly generated sequences via an open-source tool, MaLiAmPi. Our visualization tool (http://vmapapp.org) includes microbial features such as various measures of diversity, VALENCIA community state types (CST), and composition (via phylotypes and taxonomy). This work serves as a resource for the research community to further analyze and visualize vaginal microbiome data in order to better understand both healthy term pregnancies and those associated with adverse outcomes.