RESUMO
The human major histocompatibility complex (MHC) is a â¼4 Mb genomic segment on Chromosome 6 that plays a pivotal role in the immune response. Despite its importance in various traits and diseases, its complex nature makes it challenging to accurately characterize on a routine basis. We present a novel approach allowing targeted sequencing and de novo haplotypic assembly of the MHC region in heterozygous samples, using long-read sequencing technologies. Our approach is validated using two reference samples, two family trios, and an African-American sample. We achieved excellent coverage (96.6%-99.9% with at least 30× depth) and high accuracy (99.89%-99.99%) for the different haplotypes. This methodology offers a reliable and cost-effective method for sequencing and fully characterizing the MHC without the need for whole-genome sequencing, facilitating broader studies on this important genomic segment and having significant implications in immunology, genetics, and medicine.
Assuntos
Haplótipos , Heterozigoto , Complexo Principal de Histocompatibilidade , Humanos , Complexo Principal de Histocompatibilidade/genética , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Sequenciamento Completo do Genoma/métodos , Análise de Sequência de DNA/métodos , Genoma HumanoRESUMO
Next-generation sequencing (NGS) technologies have transformed medical genetics. However, short-read lengths pose a limitation on identification of structural variants, sequencing repetitive regions, phasing of distant nucleotide changes, and distinguishing highly homologous genomic regions. Long-read sequencing technologies may offer improvements in the characterization of genes that are currently difficult to assess. We used a combination of targeted DNA capture, long-read sequencing, and a customized bioinformatics pipeline to fully assemble the RH region, which harbors variation relevant to red cell donor-recipient mismatch, particularly among patients with sickle cell disease. RHD and RHCE are a pair of duplicated genes located within an â¼175 kb region on human chromosome 1 that have high sequence similarity and frequent structural variations. To achieve the assembly, we utilized palindrome repeats in PacBio SMRT reads to obtain consensus sequences of 2.1 to 2.9 kb average length with over 99% accuracy. We used these long consensus sequences to identify 771 assembly markers and to phase the RHD-RHCE region with high confidence. The dataset enabled direct linkage between coding and intronic variants, phasing of distant SNPs to determine RHD-RHCE haplotypes, and identification of known and novel structural variations along with the breakpoints. A limiting factor in phasing is the frequency of heterozygous assembly markers and therefore was most successful in samples from African Black individuals with increased heterogeneity at the RH locus. Overall, this approach allows RH genotyping and de novo assembly in an unbiased and comprehensive manner that is necessary to expand application of NGS technology to high-resolution RH typing.
Assuntos
Transfusão de Sangue , Duplicação Gênica , Variação Genética , Sistema do Grupo Sanguíneo Rh-Hr/genética , Alelos , Anemia Falciforme/genética , Anemia Falciforme/terapia , Quebra Cromossômica , Biologia Computacional/métodos , Frequência do Gene , Heterogeneidade Genética , Ligação Genética , Genômica/métodos , Haplótipos , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Polimorfismo Genético , Polimorfismo de Nucleotídeo Único , Análise de Sequência de DNA/métodosRESUMO
BACKGROUND: Substantial effort has been directed toward demonstrating uses of predictive models in health care. However, implementation of these models into clinical practice may influence patient outcomes, which in turn are captured in electronic health record data. As a result, deployed models may affect the predictive ability of current and future models. OBJECTIVE: To estimate changes in predictive model performance with use through 3 common scenarios: model retraining, sequentially implementing 1 model after another, and intervening in response to a model when 2 are simultaneously implemented. DESIGN: Simulation of model implementation and use in critical care settings at various levels of intervention effectiveness and clinician adherence. Models were either trained or retrained after simulated implementation. SETTING: Admissions to the intensive care unit (ICU) at Mount Sinai Health System (New York, New York) and Beth Israel Deaconess Medical Center (Boston, Massachusetts). PATIENTS: 130 000 critical care admissions across both health systems. INTERVENTION: Across 3 scenarios, interventions were simulated at varying levels of clinician adherence and effectiveness. MEASUREMENTS: Statistical measures of performance, including threshold-independent (area under the curve) and threshold-dependent measures. RESULTS: At fixed 90% sensitivity, in scenario 1 a mortality prediction model lost 9% to 39% specificity after retraining once and in scenario 2 a mortality prediction model lost 8% to 15% specificity when created after the implementation of an acute kidney injury (AKI) prediction model; in scenario 3, models for AKI and mortality prediction implemented simultaneously, each led to reduced effective accuracy of the other by 1% to 28%. LIMITATIONS: In real-world practice, the effectiveness of and adherence to model-based recommendations are rarely known in advance. Only binary classifiers for tabular ICU admissions data were simulated. CONCLUSION: In simulated ICU settings, a universally effective model-updating approach for maintaining model performance does not seem to exist. Model use may have to be recorded to maintain viability of predictive modeling. PRIMARY FUNDING SOURCE: National Center for Advancing Translational Sciences.
Assuntos
Injúria Renal Aguda , Inteligência Artificial , Humanos , Unidades de Terapia Intensiva , Cuidados Críticos , Atenção à SaúdeRESUMO
SUMMARY: A number of methods have been devised to address the need for targeted genomic resequencing. One of these methods, region-specific extraction (RSE) is characterized by the capture of long DNA fragments (15-20 kb) by magnetic beads, after enzymatic extension of oligonucleotides hybridized to selected genomic regions. Facilitating the selection of the most appropriate capture oligos for targeting a region of interest, satisfying the properties of temperature (Tm) and entropy (ΔG), while minimizing the formation of primer-dimers in a pooled experiment, is therefore necessary. Manual design and selection of oligos becomes very challenging, complicated by factors such as length of the target region and number of targeted regions. Here we describe, AnthOligo, a web-based application developed to optimally automate the process of generation of oligo sequences used to target and capture the continuum of large and complex genomic regions. Apart from generating oligos for RSE, this program may have wider applications in the design of customizable internal oligos to be used as baits for gene panel analysis or even probes for large-scale comparative genomic hybridization array processes. AnthOligo was tested by capturing the Major Histocompatibility Complex (MHC) of a random sample.The application provides users with a simple interface to upload an input file in BED format and customize parameters for each task. The task of probe design in AnthOligo commences when a user uploads an input file and concludes with the generation of a result-set containing an optimal set of region-specific oligos. AnthOligo is currently available as a public web application with URL: http://antholigo.chop.edu. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Assuntos
Genoma , Genômica , Hibridização Genômica Comparativa , Complexo Principal de Histocompatibilidade , Oligonucleotídeos/genéticaRESUMO
PURPOSE: Hereditary hearing loss is highly heterogeneous. To keep up with rapidly emerging disease-causing genes, we developed the AUDIOME test for nonsyndromic hearing loss (NSHL) using an exome sequencing (ES) platform and targeted analysis for the curated genes. METHODS: A tiered strategy was implemented for this test. Tier 1 includes combined Sanger and targeted deletion analyses of the two most common NSHL genes and two mitochondrial genes. Nondiagnostic tier 1 cases are subjected to ES and array followed by targeted analysis of the remaining AUDIOME genes. RESULTS: ES resulted in good coverage of the selected genes with 98.24% of targeted bases at >15 ×. A fill-in strategy was developed for the poorly covered regions, which generally fell within GC-rich or highly homologous regions. Prospective testing of 33 patients with NSHL revealed a diagnosis in 11 (33%) and a possible diagnosis in 8 cases (24.2%). Among those, 10 individuals had variants in tier 1 genes. The ES data in the remaining nondiagnostic cases are readily available for further analysis. CONCLUSION: The tiered and ES-based test provides an efficient and cost-effective diagnostic strategy for NSHL, with the potential to reflex to full exome to identify causal changes outside of the AUDIOME test.
Assuntos
Predisposição Genética para Doença , Perda Auditiva Neurossensorial/diagnóstico , Perda Auditiva Neurossensorial/genética , Patologia Molecular , Exoma/genética , Feminino , Perda Auditiva Neurossensorial/fisiopatologia , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Masculino , Mutação , Polimorfismo de Nucleotídeo Único/genética , Análise de Sequência de DNA , Sequenciamento do ExomaRESUMO
MOTIVATION: RNA-Seq (also called whole-transcriptome sequencing) is an emerging technology that uses the capabilities of next-generation sequencing to detect and quantify entire transcripts. One of its important applications is the improvement of existing genome annotations. RNA-Seq provides rapid, comprehensive and cost-effective tools for the discovery of novel genes and transcripts compared with expressed sequence tag (EST), which is instrumental in gene discovery and gene sequence determination. The rat is widely used as a laboratory disease model, but has a less well-annotated genome as compared with humans and mice. In this study, we incorporated deep RNA-Seq data from three rat tissues-bone marrow, brain and kidney-with EST data to improve the annotation of the rat genome. RESULTS: Our analysis identified 32 197 transcripts, including 13 461 known transcripts, 13 934 novel isoforms and 4802 new genes, which almost doubled the numbers of transcripts in the current public rat genome database (rn5). Comparisons of our predicted protein-coding gene sets with those in public datasets suggest that RNA-Seq significantly improves genome annotation and identifies novel genes and isoforms in the rat. Importantly, the large majority of novel genes and isoforms are supported by direct evidence of RNA-Seq experiments. These predicted genes were integrated into the Rat Genome Database (RGD) and can serve as an important resource for functional studies in the research community. AVAILABILITY AND IMPLEMENTATION: The predicted genes are available at http://rgd.mcw.edu.
Assuntos
Genoma , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Anotação de Sequência Molecular , RNA/genética , Transcriptoma , Animais , Etiquetas de Sequências Expressas , Variação Genética , Camundongos , RatosRESUMO
The RGD Pathway Portal provides pathway annotations for rat, human and mouse genes and pathway diagrams and suites, all interconnected via the pathway ontology. Diagram pages present the diagram and description, with diagram objects linked to additional resources. A newly-developed dual-functionality web application composes the diagram page. Curators input the description, diagram, references and additional pathway objects. The application combines these with tables of rat, human and mouse pathway genes, including genetic information, analysis tool and reference links, and disease, phenotype and other pathway annotations to pathway genes. The application increases the information content of diagram pages while expediting publication.
Assuntos
Biologia Computacional/métodos , Genoma Humano , Software , Animais , Bases de Dados Genéticas , Redes Reguladoras de Genes , Humanos , Internet , Redes e Vias Metabólicas , Camundongos , Anotação de Sequência Molecular , Locos de Características Quantitativas , Ratos , Reprodutibilidade dos Testes , Ferramenta de Busca , Transdução de SinaisRESUMO
Computational subphenotyping, a data-driven approach to understanding disease subtypes, is a prominent topic in medical research. Numerous ongoing studies are dedicated to developing advanced computational subphenotyping methods for cross-sectional data. However, the potential of time-series data has been underexplored until now. Here, we propose a Multivariate Levenshtein Distance (MLD) that can account for address correlation in multiple discrete features over time-series data. Our algorithm has two distinct components: it integrates an optimal threshold score to enhance the sensitivity in discriminating between pairs of instances, and the MLD itself. We have applied the proposed distance metrics on the k-means clustering algorithm to derive temporal subphenotypes from time-series data of biomarkers and treatment administrations from 1039 critically ill patients with COVID-19 and compare its effectiveness to standard methods. In conclusion, the Multivariate Levenshtein Distance metric is a novel method to quantify the distance from multiple discrete features over time-series data and demonstrates superior clustering performance among competing time-series distance metrics.
Assuntos
COVID-19 , Estado Terminal , Humanos , Fatores de Tempo , Estudos Transversais , AlgoritmosRESUMO
Congenital long QT syndrome (LQTS) diagnosis is complicated by limited genetic testing at scale, low prevalence, and normal QT corrected interval in patients with high-risk genotypes. We developed a deep learning approach combining electrocardiogram (ECG) waveform and electronic health record data to assess whether patients had pathogenic variants causing LQTS. We defined patients with high-risk genotypes as having ≥1 pathogenic variant in one of the LQTS-susceptibility genes. We trained the model using data from United Kingdom Biobank (UKBB) and then fine-tuned in a racially/ethnically diverse cohort using Mount Sinai BioMe Biobank. Following group-stratified 5-fold splitting, the fine-tuned model achieved area under the precision-recall curve of 0.29 (95% confidence interval [CI] 0.28-0.29) and area under the receiver operating curve of 0.83 (0.82-0.83) on independent testing data from BioMe. Multimodal fusion learning has promise to identify individuals with pathogenic genetic mutations to enable patient prioritization for further work up.
RESUMO
Increased intracranial pressure (ICP) ≥15 mmHg is associated with adverse neurological outcomes, but needs invasive intracranial monitoring. Using the publicly available MIMIC-III Waveform Database (2000-2013) from Boston, we developed an artificial intelligence-derived biomarker for elevated ICP (aICP) for adult patients. aICP uses routinely collected extracranial waveform data as input, reducing the need for invasive monitoring. We externally validated aICP with an independent dataset from the Mount Sinai Hospital (2020-2022) in New York City. The AUROC, accuracy, sensitivity, and specificity on the external validation dataset were 0.80 (95% CI, 0.80-0.80), 73.8% (95% CI, 72.0-75.6%), 73.5% (95% CI 72.5-74.5%), and 73.0% (95% CI, 72.0-74.0%), respectively. We also present an exploratory analysis showing aICP predictions are associated with clinical phenotypes. A ten-percentile increment was associated with brain malignancy (OR = 1.68; 95% CI, 1.09-2.60), intracerebral hemorrhage (OR = 1.18; 95% CI, 1.07-1.32), and craniotomy (OR = 1.43; 95% CI, 1.12-1.84; P < 0.05 for all).
RESUMO
Importance: Increased intracranial pressure (ICP) is associated with adverse neurological outcomes, but needs invasive monitoring. Objective: Development and validation of an AI approach for detecting increased ICP (aICP) using only non-invasive extracranial physiological waveform data. Design: Retrospective diagnostic study of AI-assisted detection of increased ICP. We developed an AI model using exclusively extracranial waveforms, externally validated it and assessed associations with clinical outcomes. Setting: MIMIC-III Waveform Database (2000-2013), a database derived from patients admitted to an ICU in an academic Boston hospital, was used for development of the aICP model, and to report association with neurologic outcomes. Data from Mount Sinai Hospital (2020-2022) in New York City was used for external validation. Participants: Patients were included if they were older than 18 years, and were monitored with electrocardiograms, arterial blood pressure, respiratory impedance plethysmography and pulse oximetry. Patients who additionally had intracranial pressure monitoring were used for development (N=157) and external validation (N=56). Patients without intracranial monitors were used for association with outcomes (N=1694). Exposures: Extracranial waveforms including electrocardiogram, arterial blood pressure, plethysmography and SpO2. Main Outcomes and Measures: Intracranial pressure > 15 mmHg. Measures were Area under receiver operating characteristic curves (AUROCs), sensitivity, specificity, and accuracy at threshold of 0.5. We calculated odds ratios and p-values for phenotype association. Results: The AUROC was 0.91 (95% CI, 0.90-0.91) on testing and 0.80 (95% CI, 0.80-0.80) on external validation. aICP had accuracy, sensitivity, and specificity of 73.8% (95% CI, 72.0%-75.6%), 99.5% (95% CI 99.3%-99.6%), and 76.9% (95% CI, 74.0-79.8%) on external validation. A ten-percentile increment was associated with stroke (OR=2.12; 95% CI, 1.27-3.13), brain malignancy (OR=1.68; 95% CI, 1.09-2.60), subdural hemorrhage (OR=1.66; 95% CI, 1.07-2.57), intracerebral hemorrhage (OR=1.18; 95% CI, 1.07-1.32), and procedures like percutaneous brain biopsy (OR=1.58; 95% CI, 1.15-2.18) and craniotomy (OR = 1.43; 95% CI, 1.12-1.84; P < 0.05 for all). Conclusions and Relevance: aICP provides accurate, non-invasive estimation of increased ICP, and is associated with neurological outcomes and neurosurgical procedures in patients without intracranial monitoring.
RESUMO
COVID-19 has been a significant public health concern for the last four years; however, little is known about the mechanisms that lead to severe COVID-associated kidney injury. In this multicenter study, we combined quantitative deep urinary proteomics and machine learning to predict severe acute outcomes in hospitalized COVID-19 patients. Using a 10-fold cross-validated random forest algorithm, we identified a set of urinary proteins that demonstrated predictive power for both discovery and validation set with 87% and 79% accuracy, respectively. These predictive urinary biomarkers were recapitulated in non-COVID acute kidney injury revealing overlapping injury mechanisms. We further combined orthogonal multiomics datasets to understand the mechanisms that drive severe COVID-associated kidney injury. Functional overlap and network analysis of urinary proteomics, plasma proteomics and urine sediment single-cell RNA sequencing showed that extracellular matrix and autophagy-associated pathways were uniquely impacted in severe COVID-19. Differentially abundant proteins associated with these pathways exhibited high expression in cells in the juxtamedullary nephron, endothelial cells, and podocytes, indicating that these kidney cell types could be potential targets. Further, single-cell transcriptomic analysis of kidney organoids infected with SARS-CoV-2 revealed dysregulation of extracellular matrix organization in multiple nephron segments, recapitulating the clinically observed fibrotic response across multiomics datasets. Ligand-receptor interaction analysis of the podocyte and tubule organoid clusters showed significant reduction and loss of interaction between integrins and basement membrane receptors in the infected kidney organoids. Collectively, these data suggest that extracellular matrix degradation and adhesion-associated mechanisms could be a main driver of COVID-associated kidney injury and severe outcomes.
RESUMO
The rat has been widely used as a disease model in a laboratory setting, resulting in an abundance of genetic and phenotype data from a wide variety of studies. These data can be found at the Rat Genome Database (RGD, http://rgd.mcw.edu/), which provides a platform for researchers interested in linking genomic variations to phenotypes. Quantitative trait loci (QTLs) form one of the earliest and core datasets, allowing researchers to identify loci harboring genes associated with disease. These QTLs are not only important for those using the rat to identify genes and regions associated with disease, but also for cross-organism analyses of syntenic regions on the mouse and the human genomes to identify potential regions for study in these organisms. Currently, RGD has data on >1,900 rat QTLs that include details about the methods and animals used to determine the respective QTL along with the genomic positions and markers that define the region. RGD also curates human QTLs (>1,900) and houses>4,000 mouse QTLs (imported from Mouse Genome Informatics). Multiple ontologies are used to standardize traits, phenotypes, diseases, and experimental methods to facilitate queries, analyses, and cross-organism comparisons. QTLs are visualized in tools such as GBrowse and GViewer, with additional tools for analysis of gene sets within QTL regions. The QTL data at RGD provide valuable information for the study of mapped phenotypes and identification of candidate genes for disease associations.
Assuntos
Bases de Dados Genéticas , Genoma , Locos de Características Quantitativas , Acesso à Informação , Animais , Marcadores Genéticos , Humanos , Internet , Camundongos , Fenótipo , RatosRESUMO
This year marks the 63rd anniversary of the International Society of Nephrology, which signaled nephrology's emergence as a modern medical discipline. In this article, we briefly trace the course of nephrology's history to show a clear arc in its evolution-of increasing resolution in nephrological data-an arc that is converging with computational capabilities to enable precision nephrology. In general, precision medicine refers to tailoring treatment to the individual characteristics of patients. For an operational definition, this tailoring takes the form of an optimization, in which treatments are selected to maximize a patient's expected health with respect to all available data. Because modern health data are large and high resolution, this optimization process requires computational intervention, and it must be tuned to the contours of specific medical disciplines. An advantage of this operational definition for precision medicine is that it allows us to better understand what precision medicine means in the context of a specific medical discipline. The goal of this article was to demonstrate how to instantiate this definition of precision medicine for the field of nephrology. Correspondingly, the goal of precision nephrology was to answer two related questions: ( 1 ) How do we optimize kidney health with respect to all available data? and ( 2 ) How do we optimize general health with respect to kidney data?
Assuntos
Medicina Geral , Nefrologia , Humanos , Rim , Medicina de Precisão , Cuidados PaliativosRESUMO
Background Acute kidney injury (AKI) is a known complication of COVID-19 and is associated with an increased risk of in-hospital mortality. Unbiased proteomics using biological specimens can lead to improved risk stratification and discover pathophysiological mechanisms. Methods Using measurements of ~4000 plasma proteins in two cohorts of patients hospitalized with COVID-19, we discovered and validated markers of COVID-associated AKI (stage 2 or 3) and long-term kidney dysfunction. In the discovery cohort (N= 437), we identified 413 higher plasma abundances of protein targets and 40 lower plasma abundances of protein targets associated with COVID-AKI (adjusted p <0.05). Of these, 62 proteins were validated in an external cohort (p <0.05, N =261). Results We demonstrate that COVID-AKI is associated with increased markers of tubular injury ( NGAL ) and myocardial injury. Using estimated glomerular filtration (eGFR) measurements taken after discharge, we also find that 25 of the 62 AKI-associated proteins are significantly associated with decreased post-discharge eGFR (adjusted p <0.05). Proteins most strongly associated with decreased post-discharge eGFR included desmocollin-2 , trefoil factor 3 , transmembrane emp24 domain-containing protein 10 , and cystatin-C indicating tubular dysfunction and injury. Conclusions Using clinical and proteomic data, our results suggest that while both acute and long-term COVID-associated kidney dysfunction are associated with markers of tubular dysfunction, AKI is driven by a largely multifactorial process involving hemodynamic instability and myocardial damage.
RESUMO
Background: Acute kidney injury (AKI) is common in hospitalized patients with SARS-CoV2 infection despite vaccination and leads to long-term kidney dysfunction. However, peripheral blood molecular signatures in AKI from COVID-19 and their association with long-term kidney dysfunction are yet unexplored. Methods: In patients hospitalized with SARS-CoV2, we performed bulk RNA sequencing using peripheral blood mononuclear cells(PBMCs). We applied linear models accounting for technical and biological variability on RNA-Seq data accounting for false discovery rate (FDR) and compared functional enrichment and pathway results to a historical sepsis-AKI cohort. Finally, we evaluated the association of these signatures with long-term trends in kidney function. Results: Of 283 patients, 106 had AKI. After adjustment for sex, age, mechanical ventilation, and chronic kidney disease (CKD), we identified 2635 significant differential gene expressions at FDR<0.05. Top canonical pathways were EIF2 signaling, oxidative phosphorylation, mTOR signaling, and Th17 signaling, indicating mitochondrial dysfunction and endoplasmic reticulum (ER) stress. Comparison with sepsis associated AKI showed considerable overlap of key pathways (48.14%). Using follow-up estimated glomerular filtration rate (eGFR) measurements from 115 patients, we identified 164/2635 (6.2%) of the significantly differentiated genes associated with overall decrease in long-term kidney function. The strongest associations were 'autophagy', 'renal impairment via fibrosis', and 'cardiac structure and function'. Conclusions: We show that AKI in SARS-CoV2 is a multifactorial process with mitochondrial dysfunction driven by ER stress whereas long-term kidney function decline is associated with cardiac structure and function and immune dysregulation. Functional overlap with sepsis-AKI also highlights common signatures, indicating generalizability in therapeutic approaches. SIGNIFICANCE STATEMENT: Peripheral transcriptomic findings in acute and long-term kidney dysfunction after hospitalization for SARS-CoV2 infection are unclear. We evaluated peripheral blood molecular signatures in AKI from COVID-19 (COVID-AKI) and their association with long-term kidney dysfunction using the largest hospitalized cohort with transcriptomic data. Analysis of 283 hospitalized patients of whom 37% had AKI, highlighted the contribution of mitochondrial dysfunction driven by endoplasmic reticulum stress in the acute stages. Subsequently, long-term kidney function decline exhibits significant associations with markers of cardiac structure and function and immune mediated dysregulation. There were similar biomolecular signatures in other inflammatory states, such as sepsis. This enhances the potential for repurposing and generalizability in therapeutic approaches.
RESUMO
BACKGROUND: Acute kidney injury (AKI) is a known complication of COVID-19 and is associated with an increased risk of in-hospital mortality. Unbiased proteomics using biological specimens can lead to improved risk stratification and discover pathophysiological mechanisms. METHODS: Using measurements of ~4000 plasma proteins in two cohorts of patients hospitalized with COVID-19, we discovered and validated markers of COVID-associated AKI (stage 2 or 3) and long-term kidney dysfunction. In the discovery cohort (N = 437), we identified 413 higher plasma abundances of protein targets and 30 lower plasma abundances of protein targets associated with COVID-AKI (adjusted p < 0.05). Of these, 62 proteins were validated in an external cohort (p < 0.05, N = 261). RESULTS: We demonstrate that COVID-AKI is associated with increased markers of tubular injury (NGAL) and myocardial injury. Using estimated glomerular filtration (eGFR) measurements taken after discharge, we also find that 25 of the 62 AKI-associated proteins are significantly associated with decreased post-discharge eGFR (adjusted p < 0.05). Proteins most strongly associated with decreased post-discharge eGFR included desmocollin-2, trefoil factor 3, transmembrane emp24 domain-containing protein 10, and cystatin-C indicating tubular dysfunction and injury. CONCLUSIONS: Using clinical and proteomic data, our results suggest that while both acute and long-term COVID-associated kidney dysfunction are associated with markers of tubular dysfunction, AKI is driven by a largely multifactorial process involving hemodynamic instability and myocardial damage.
Acute kidney injury (AKI) is a sudden, sometimes fatal, episode of kidney failure or damage. It is a known complication of COVID-19, albeit through unclear mechanisms. COVID-19 is also associated with kidney dysfunction in the long term, or chronic kidney disease (CKD). There is a need to better understand which patients with COVID-19 are at risk of AKI or CKD. We measure levels of several thousand proteins in the blood of hospitalized COVID-19 patients. We discover and validate sets of proteins associated with severe AKI and CKD in these patients. The markers identified suggest that kidney injury in COVID-19 patients involves damage to kidney cells that reabsorb fluid from urine and reduced blood flow to the heart, causing damage to heart muscles. Our findings might help clinicians to predict kidney injury in patients with COVID-19, and to understand its mechanisms.
RESUMO
OBJECTIVE: The novel coronavirus disease 2019 (COVID-19) has heterogenous clinical courses, indicating that there might be distinct subphenotypes in critically ill patients. Although prior research has identified these subphenotypes, the temporal pattern of multiple clinical features has not been considered in cluster models. We aimed to identify temporal subphenotypes in critically ill patients with COVID-19 using a novel sequence cluster analysis and associate them with clinically relevant outcomes. MATERIALS AND METHODS: We analyzed 1036 confirmed critically ill patients with laboratory-confirmed SARS-COV-2 infection admitted to the Mount Sinai Health System in New York city. The agglomerative hierarchical clustering method was used with Levenshtein distance and Ward's minimum variance linkage. RESULTS: We identified four subphenotypes. Subphenotype I (N = 233 [22.5%]) included patients with rapid respirations and a rapid heartbeat but less need for invasive interventions within the first 24 hours, along with a relatively good prognosis. Subphenotype II (N = 418 [40.3%]) represented patients with the least degree of ailments, relatively low mortality, and the highest probability of discharge from the hospital. Subphenotype III (N = 259 [25.0%]) represented patients who experienced clinical deterioration during the first 24 hours of intensive care unit admission, leading to poor outcomes. Subphenotype IV (N = 126 [12.2%]) represented an acute respiratory distress syndrome trajectory with an almost universal need for mechanical ventilation. CONCLUSION: We utilized the sequence cluster analysis to identify clinical subphenotypes in critically ill COVID-19 patients who had distinct temporal patterns and different clinical outcomes. This study points toward the utility of including temporal information in subphenotyping approaches.
Assuntos
COVID-19 , Síndrome do Desconforto Respiratório , Análise por Conglomerados , Humanos , Unidades de Terapia Intensiva , SARS-CoV-2RESUMO
Clinical exome sequencing (CES) aids in the diagnosis of rare genetic disorders. Herein, we report the molecular diagnostic yield and spectrum of genetic alterations contributing to disease in 700 pediatric cases analyzed at the Children's Hospital of Philadelphia. The overall diagnostic yield was 23%, with three cases having more than one molecular diagnosis and 2.6% having secondary/additional findings. A candidate gene finding was reported in another 8.4% of cases. The clinical indications with the highest diagnostic yield were neurodevelopmental disorders (including seizures), whereas immune- and oncology-related indications were negatively associated with molecular diagnosis. The rapid expansion of knowledge regarding the genome's role in human disease necessitates reanalysis of CES samples. To capture these new discoveries, a subset of cases (n = 240) underwent reanalysis, with an increase in diagnostic yield. We describe our experience reporting CES results in a pediatric setting, including reporting of secondary findings, reporting newly discovered genetic conditions, and revisiting negative test results. Finally, we highlight the challenges associated with implementing critical updates to the CES workflow. Although these updates are necessary, they demand an investment of time and resources from the laboratory. In summary, these data demonstrate the clinical utility of exome sequencing and reanalysis, while highlighting the critical considerations for continuous improvement of a CES test in a clinical laboratory.
Assuntos
Exoma , Patologia Molecular , Criança , Exoma/genética , Humanos , Mutação , Doenças Raras/genética , Estudos Retrospectivos , Sequenciamento do Exoma/métodosRESUMO
Acute kidney injury (AKI) is a known complication of COVID-19 and is associated with an increased risk of in-hospital mortality. Unbiased proteomics using biological specimens can lead to improved risk stratification and discover pathophysiological mechanisms. Using measurements of â¼4000 plasma proteins in two cohorts of patients hospitalized with COVID-19, we discovered and validated markers of COVID-associated AKI (stage 2 or 3) and long-term kidney dysfunction. In the discovery cohort (N= 437), we identified 413 higher plasma abundances of protein targets and 40 lower plasma abundances of protein targets associated with COVID-AKI (adjusted p <0.05). Of these, 62 proteins were validated in an external cohort (p <0.05, N =261). We demonstrate that COVID-AKI is associated with increased markers of tubular injury (NGAL) and myocardial injury. Using estimated glomerular filtration (eGFR) measurements taken after discharge, we also find that 25 of the 62 AKI-associated proteins are significantly associated with decreased post-discharge eGFR (adjusted p <0.05). Proteins most strongly associated with decreased post-discharge eGFR included desmocollin-2, trefoil factor 3, transmembrane emp24 domain-containing protein 10, and cystatin-C indicating tubular dysfunction and injury. Using clinical and proteomic data, our results suggest that while both acute and long-term COVID-associated kidney dysfunction are associated with markers of tubular dysfunction, AKI is driven by a largely multifactorial process involving hemodynamic instability and myocardial damage.