RESUMO
BACKGROUND: Whole blood host transcript signatures show great potential for diagnosis of infectious and inflammatory illness, with most published signatures performing binary classification tasks. Barriers to clinical implementation include validation studies, and development of strategies that enable simultaneous, multiclass diagnosis of febrile illness based on gene expression. METHODS: We validated five distinct diagnostic signatures for paediatric infectious diseases in parallel using a single NanoString nCounter® experiment. We included a novel 3-transcript signature for childhood tuberculosis, and four published signatures which differentiate bacterial infection, viral infection, or Kawasaki disease from other febrile illnesses. Signature performance was assessed using receiver operating characteristic curve statistics. We also explored conceptual frameworks for multiclass diagnostic signatures, including additional transcripts found to be significantly differentially expressed in previous studies. Relaxed, regularised logistic regression models were used to derive two novel multiclass signatures: a mixed One-vs-All model (MOVA), running multiple binomial models in parallel, and a full-multiclass model. In-sample performance of these models was compared using radar-plots and confusion matrix statistics. RESULTS: Samples from 91 children were included in the study: 23 bacterial infections (DB), 20 viral infections (DV), 14 Kawasaki disease (KD), 18 tuberculosis disease (TB), and 16 healthy controls. The five signatures tested demonstrated cross-platform performance similar to their primary discovery-validation cohorts. The signatures could differentiate: KD from other diseases with area under ROC curve (AUC) of 0.897 [95% confidence interval: 0.822-0.972]; DB from DV with AUC of 0.825 [0.691-0.959] (signature-1) and 0.867 [0.753-0.982] (signature-2); TB from other diseases with AUC of 0.882 [0.787-0.977] (novel signature); TB from healthy children with AUC of 0.910 [0.808-1.000]. Application of signatures outside of their designed context reduced performance. In-sample error rates for the multiclass models were 13.3% for the MOVA model and 0.0% for the full-multiclass model. The MOVA model misclassified DB cases most frequently (18.7%) and TB cases least (2.7%). CONCLUSIONS: Our study demonstrates the feasibility of NanoString technology for cross-platform validation of multiple transcriptomic signatures in parallel. This external cohort validated performance of all five signatures, including a novel sparse TB signature. Two exploratory multi-class models showed high potential accuracy across four distinct diagnostic groups.
Assuntos
Febre , Tuberculose , Humanos , Tuberculose/diagnóstico , Tuberculose/genética , Criança , Febre/diagnóstico , Febre/microbiologia , Pré-Escolar , Feminino , Masculino , Curva ROC , Perfilação da Expressão Gênica , Reprodutibilidade dos Testes , Lactente , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , RNA Mensageiro/sangue , Transcriptoma/genéticaRESUMO
BACKGROUND: Kawasaki disease (KD) is a systemic vasculitis that mainly affects children under 5 years of age. Up to 30% of patients develop coronary artery abnormalities, which are reduced with early treatment. Timely diagnosis of KD is challenging but may become more straightforward with the recent discovery of a whole-blood host response classifier that discriminates KD patients from patients with other febrile conditions. Here, we bridged this microarray-based classifier to a clinically applicable quantitative reverse transcription-polymerase chain reaction (qRT-PCR) assay: the Kawasaki Disease Gene Expression Profiling (KiDs-GEP) classifier. METHODS: We designed and optimized a qRT-PCR assay and applied it to a subset of samples previously used for the classifier discovery to reweight the original classifier. RESULTS: The performance of the KiDs-GEP classifier was comparable to the original classifier with a cross-validated area under the ROC curve of 0.964 [95% CI: 0.924-1.00] vs 0.992 [95% CI: 0.978-1.00], respectively. Both classifiers demonstrated similar trends over various disease conditions, with the clearest distinction between individuals diagnosed with KD vs viral infections. CONCLUSION: We successfully bridged the microarray-based classifier into the KiDs-GEP classifier, a more rapid and more cost-efficient qRT-PCR assay, bringing a diagnostic test for KD closer to the hospital clinical laboratory. IMPACT: A diagnostic test is needed for Kawasaki disease and is currently not available. We describe the development of a One-Step multiplex qRT-PCR assay and the subsequent modification (i.e., bridging) of the microarray-based host response classifier previously described by Wright et al. The bridged KiDs-GEP classifier performs well in discriminating Kawasaki disease patients from febrile controls. This host response clinical test for Kawasaki disease can be adapted to the hospital clinical laboratory.
Assuntos
Síndrome de Linfonodos Mucocutâneos , Criança , Humanos , Pré-Escolar , Síndrome de Linfonodos Mucocutâneos/diagnóstico , Síndrome de Linfonodos Mucocutâneos/genética , Reação em Cadeia da Polimerase Via Transcriptase Reversa , Perfilação da Expressão Gênica , Febre , Curva ROCRESUMO
The emergence of next-generation sequencing technologies and computational advances have expanded our understanding of gene expression regulation (i.e., the transcriptome). This has also led to an increased interest in using transcriptomic biomarkers to improve disease diagnosis and stratification, to assess prognosis and predict the response to treatment. Significant progress in identifying transcriptomic signatures for various clinical needs has been made, with large discovery studies accounting for challenges such as patient variability, unwanted batch effects, and data complexities; however, obstacles related to the technical aspects of cross-platform implementation still hinder the successful integration of transcriptomic technologies into standard diagnostic workflows. In this article, we discuss the challenges associated with integrating transcriptomic signatures derived using high-throughput technologies (such as RNA-sequencing) into clinical diagnostic tools using nucleic acid amplification (NAA) techniques. The novelty of the proposed approach lies in our aim to embed constraints related to cross-platform implementation in the process of signature discovery. These constraints could include technical limitations of amplification platform and chemistry, the maximal number of targets imposed by the chosen multiplexing strategy, and the genomic context of identified RNA biomarkers. Finally, we propose to build a computational framework that would integrate these constraints in combination with existing statistical and machine learning models used for signature identification. We envision that this could accelerate the integration of RNA signatures discovered by high-throughput technologies into NAA-based approaches suitable for clinical applications.
Assuntos
Biologia Computacional , Perfilação da Expressão Gênica , Sequenciamento de Nucleotídeos em Larga Escala , Transcriptoma , Humanos , Biologia Computacional/métodos , Perfilação da Expressão Gênica/métodos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , BiomarcadoresRESUMO
Severe febrile illnesses in children encompass life-threatening organ dysfunction caused by diverse pathogens and other severe inflammatory syndromes. A comparative approach to these illnesses may identify shared and distinct features of host immune dysfunction amenable to immunomodulation. Here, using immunophenotyping with mass cytometry and cell stimulation experiments, we illustrate trajectories of immune dysfunction in 74 children with multi-system inflammatory syndrome in children (MIS-C) associated with SARS-CoV-2, 30 with bacterial infection, 16 with viral infection, 8 with Kawasaki disease, and 42 controls. We explore these findings in a secondary cohort of 500 children with these illnesses and 134 controls. We show that neutrophil activation and apoptosis are prominent in multi-system inflammatory syndrome, and that this is partially shared with bacterial infection. We show that memory T cells from patients with multi-system inflammatory syndrome and bacterial infection are exhausted. In contrast, we show viral infection to be characterized by a distinct signature of decreased interferon signaling and lower interferon receptor gene expression. Improved understanding of immune dysfunction may improve approaches to immunomodulator therapy in severe febrile illnesses in children.
Assuntos
COVID-19 , Neutrófilos , Síndrome de Resposta Inflamatória Sistêmica , Humanos , Criança , Neutrófilos/imunologia , COVID-19/imunologia , COVID-19/virologia , COVID-19/complicações , Masculino , Feminino , Pré-Escolar , Síndrome de Resposta Inflamatória Sistêmica/imunologia , Febre/imunologia , SARS-CoV-2/imunologia , Lactente , Interferons/metabolismo , Infecções Bacterianas/imunologia , Linfócitos T/imunologia , Síndrome de Linfonodos Mucocutâneos/imunologia , Adolescente , Apoptose , Ativação de NeutrófiloRESUMO
OBJECTIVES: The amount of SARS-CoV-2 detected in the upper respiratory tract (URT viral load) is a key driver of transmission of infection. Current evidence suggests that mechanisms constraining URT viral load are different from those controlling lower respiratory tract viral load and disease severity. Understanding such mechanisms may help to develop treatments and vaccine strategies to reduce transmission. Combining mathematical modelling of URT viral load dynamics with transcriptome analyses we aimed to identify mechanisms controlling URT viral load. METHODS: COVID-19 patients were recruited in Spain during the first wave of the pandemic. RNA sequencing of peripheral blood and targeted NanoString nCounter transcriptome analysis of nasal epithelium were performed and gene expression analysed in relation to paired URT viral load samples collected within 15 days of symptom onset. Proportions of major immune cells in blood were estimated from transcriptional data using computational differential estimation. Weighted correlation network analysis (adjusted for cell proportions) and fixed transcriptional repertoire analysis were used to identify associations with URT viral load, quantified as standard deviations (z-scores) from an expected trajectory over time. RESULTS: Eighty-two subjects (50% female, median age 54 years (range 3-73)) with COVID-19 were recruited. Paired URT viral load samples were available for 16 blood transcriptome samples, and 17 respiratory epithelial transcriptome samples. Natural Killer (NK) cells were the only blood cell type significantly correlated with URT viral load z-scores (r = -0.62, P = 0.010). Twenty-four blood gene expression modules were significantly correlated with URT viral load z-score, the most significant being a module of genes connected around IFNA14 (Interferon Alpha-14) expression (r = -0.60, P = 1e-10). In fixed repertoire analysis, prostanoid-related gene expression was significantly associated with higher viral load. In nasal epithelium, only GNLY (granulysin) gene expression showed significant negative correlation with viral load. CONCLUSIONS: Correlations between the transcriptional host response and inter-individual variations in SARS-CoV-2 URT viral load, revealed many molecular mechanisms plausibly favouring or constraining viral replication. Existing evidence corroborates many of these mechanisms, including likely roles for NK cells, granulysin, prostanoids and interferon alpha-14. Inhibition of prostanoid production and administration of interferon alpha-14 may be attractive transmission-blocking interventions.
Assuntos
COVID-19 , SARS-CoV-2 , Humanos , Feminino , Pré-Escolar , Criança , Adolescente , Adulto Jovem , Adulto , Pessoa de Meia-Idade , Idoso , Masculino , SARS-CoV-2/genética , Carga Viral , Transcriptoma , Mucosa Nasal , Prostaglandinas , Interferon-alfaRESUMO
BACKGROUND: Appropriate treatment and management of children presenting with fever depend on accurate and timely diagnosis, but current diagnostic tests lack sensitivity and specificity and are frequently too slow to inform initial treatment. As an alternative to pathogen detection, host gene expression signatures in blood have shown promise in discriminating several infectious and inflammatory diseases in a dichotomous manner. However, differential diagnosis requires simultaneous consideration of multiple diseases. Here, we show that diverse infectious and inflammatory diseases can be discriminated by the expression levels of a single panel of genes in blood. METHODS: A multi-class supervised machine-learning approach, incorporating clinical consequence of misdiagnosis as a "cost" weighting, was applied to a whole-blood transcriptomic microarray dataset, incorporating 12 publicly available datasets, including 1,212 children with 18 infectious or inflammatory diseases. The transcriptional panel identified was further validated in a new RNA sequencing dataset comprising 411 febrile children. FINDINGS: We identified 161 transcripts that classified patients into 18 disease categories, reflecting individual causative pathogen and specific disease, as well as reliable prediction of broad classes comprising bacterial infection, viral infection, malaria, tuberculosis, or inflammatory disease. The transcriptional panel was validated in an independent cohort and benchmarked against existing dichotomous RNA signatures. CONCLUSIONS: Our data suggest that classification of febrile illness can be achieved with a single blood sample and opens the way for a new approach for clinical diagnosis. FUNDING: European Union's Seventh Framework no. 279185; Horizon2020 no. 668303 PERFORM; Wellcome Trust (206508/Z/17/Z); Medical Research Foundation (MRF-160-0008-ELP-KAFO-C0801); NIHR Imperial BRC.
Assuntos
Benchmarking , Pesquisa Biomédica , Criança , Humanos , Diagnóstico Diferencial , Motivos de Nucleotídeos , Febre/diagnóstico , Febre/genética , RNARESUMO
BACKGROUND: To identify a diagnostic blood transcriptomic signature that distinguishes multisystem inflammatory syndrome in children (MIS-C) from Kawasaki disease (KD), bacterial infections, and viral infections. METHODS: Children presenting with MIS-C to participating hospitals in the United Kingdom and the European Union between April 2020 and April 2021 were prospectively recruited. Whole-blood RNA Sequencing was performed, contrasting the transcriptomes of children with MIS-C (n = 38) to those from children with KD (n = 136), definite bacterial (DB; n = 188) and viral infections (DV; n = 138). Genes significantly differentially expressed (SDE) between MIS-C and comparator groups were identified. Feature selection was used to identify genes that optimally distinguish MIS-C from other diseases, which were subsequently translated into RT-qPCR assays and evaluated in an independent validation set comprising MIS-C (n = 37), KD (n = 19), DB (n = 56), DV (n = 43), and COVID-19 (n = 39). RESULTS: In the discovery set, 5696 genes were SDE between MIS-C and combined comparator disease groups. Five genes were identified as potential MIS-C diagnostic biomarkers (HSPBAP1, VPS37C, TGFB1, MX2, and TRBV11-2), achieving an AUC of 96.8% (95% CI: 94.6%-98.9%) in the discovery set, and were translated into RT-qPCR assays. The RT-qPCR 5-gene signature achieved an AUC of 93.2% (95% CI: 88.3%-97.7%) in the independent validation set when distinguishing MIS-C from KD, DB, and DV. CONCLUSIONS: MIS-C can be distinguished from KD, DB, and DV groups using a 5-gene blood RNA expression signature. The small number of genes in the signature and good performance in both discovery and validation sets should enable the development of a diagnostic test for MIS-C.
Assuntos
COVID-19 , Síndrome de Linfonodos Mucocutâneos , Criança , Humanos , COVID-19/diagnóstico , COVID-19/genética , Síndrome de Resposta Inflamatória Sistêmica/diagnóstico , Síndrome de Resposta Inflamatória Sistêmica/genética , Hospitais , Síndrome de Linfonodos Mucocutâneos/diagnóstico , Síndrome de Linfonodos Mucocutâneos/genética , Teste para COVID-19RESUMO
The unmet clinical need for accurate point-of-care (POC) diagnostic tests able to discriminate bacterial from viral infection demands a solution that can be used both within healthcare settings and in the field, and that can also stem the tide of antimicrobial resistance. Our approach to solve this problem combine the use of host gene signatures with our Lab-on-a-Chip (LoC) technology enabling low-cost POC expression analysis to detect Infectious Disease. Transcriptomics have been extensively investigated as a potential tool to be implemented in the diagnosis of infectious disease. On the other hand, LoC technologies using ion-sensitive field-effect transistor (ISFET), in conjunction with isothermal chemistries, are offering a promising alternative to conventional amplification instruments, owing to their portable and affordable nature. Currently, the data analysis of ISFET arrays are restricted to established methods by averaging the output of every sensor to give a single time-series. This simple approach makes unrealistic assumptions, leading to insufficient performance for applications that require accurate quantification such as Host-Transcriptomics. In order to reliably quantify transcripts on our LoC platform enabling the classification of infectious disease on-chip, we propose a novel data-driven algorithm for extracting time-to-positive values from ISFET arrays. The algorithm proposed correctly outputs a time-to-positive for all the reactions, with a high correlation to RT-qLAMP (0.85, R2 = 0.98, p < 0.01), resulting in a classification accuracy of 100% (CI, 95-100%). This work aims to bridge the gap between translating assays from microarray analysis to ISFET arrays providing benefits on tackling infectious disease and diagnostic testing in hard-to-reach areas of the world.
Assuntos
Anti-Infecciosos , Técnicas Biossensoriais , Doenças Transmissíveis , Viroses , Bactérias/genética , Humanos , Dispositivos Lab-On-A-Chip , Técnicas de Amplificação de Ácido Nucleico/métodos , Sistemas Automatizados de Assistência Junto ao Leito , RNARESUMO
Infection with SARS-CoV-2 has highly variable clinical manifestations, ranging from asymptomatic infection through to life-threatening disease. Host whole blood transcriptomics can offer unique insights into the biological processes underpinning infection and disease, as well as severity. We performed whole blood RNA Sequencing of individuals with varying degrees of COVID-19 severity. We used differential expression analysis and pathway enrichment analysis to explore how the blood transcriptome differs between individuals with mild, moderate, and severe COVID-19, performing pairwise comparisons between groups. Increasing COVID-19 severity was characterised by an abundance of inflammatory immune response genes and pathways, including many related to neutrophils and macrophages, in addition to an upregulation of immunoglobulin genes. In this study, for the first time, we show how immunomodulatory treatments commonly administered to COVID-19 patients greatly alter the transcriptome. Our insights into COVID-19 severity reveal the role of immune dysregulation in the progression to severe disease and highlight the need for further research exploring the interplay between SARS-CoV-2 and the inflammatory immune response.
Assuntos
COVID-19 , Humanos , Imunidade , RNA , SARS-CoV-2 , TranscriptomaRESUMO
Recently, host whole blood gene expression signatures have been identified for diagnosis of tuberculosis (TB). Absolute quantification of the concentrations of signature transcripts in blood have not been reported, but would facilitate diagnostic test development. To identify minimal transcript signatures, we applied a transcript selection procedure to microarray data from African adults comprising 536 patients with TB, other diseases (OD) and latent TB (LTBI), divided into training and test sets. Signatures were further investigated using reverse transcriptase (RT)-digital PCR (dPCR). A four-transcript signature (GBP6, TMCC1, PRDM1, and ARG1) measured using RT-dPCR distinguished TB patients from those with OD (area under the curve (AUC) 93.8% (CI95% 82.2-100%). A three-transcript signature (FCGR1A, ZNF296, and C1QB) differentiated TB from LTBI (AUC 97.3%, CI95%: 93.3-100%), regardless of HIV. These signatures have been validated across platforms and across samples offering strong, quantitative support for their use as diagnostic biomarkers for TB.
Assuntos
Proteínas de Transporte/sangue , Tuberculose Latente/diagnóstico , Proteínas Mitocondriais/sangue , Receptores de IgG/sangue , Transcriptoma/genética , Tuberculose Pulmonar/diagnóstico , Dedos de Zinco/fisiologia , Adulto , Idoso , Biomarcadores/sangue , Proteínas de Transporte/genética , Testes Diagnósticos de Rotina , Feminino , Perfilação da Expressão Gênica , Humanos , Masculino , Pessoa de Meia-Idade , Proteínas Mitocondriais/genética , Mycobacterium tuberculosis/genética , Análise Serial de Proteínas , RNA Mensageiro/sangue , RNA Mensageiro/genética , Receptores de IgG/genética , Reação em Cadeia da Polimerase Via Transcriptase Reversa , África do Sul , Adulto JovemRESUMO
BACKGROUND: Emergency admissions for infection often lack initial diagnostic certainty. COVID-19 has highlighted a need for novel diagnostic approaches to indicate likelihood of viral infection in a pandemic setting. We aimed to derive and validate a blood transcriptional signature to detect viral infections, including COVID-19, among adults with suspected infection who presented to the emergency department. METHODS: Individuals (aged ≥18 years) presenting with suspected infection to an emergency department at a major teaching hospital in the UK were prospectively recruited as part of the Bioresource in Adult Infectious Diseases (BioAID) discovery cohort. Whole-blood RNA sequencing was done on samples from participants with subsequently confirmed viral, bacterial, or no infection diagnoses. Differentially expressed host genes that met additional filtering criteria were subjected to feature selection to derive the most parsimonious discriminating signature. We validated the signature via RT-qPCR in a prospective validation cohort of participants who presented to an emergency department with undifferentiated fever, and a second case-control validation cohort of emergency department participants with PCR-positive COVID-19 or bacterial infection. We assessed signature performance by calculating the area under receiver operating characteristic curves (AUROCs), sensitivities, and specificities. FINDINGS: A three-gene transcript signature, comprising HERC6, IGF1R, and NAGK, was derived from the discovery cohort of 56 participants with bacterial infections and 27 with viral infections. In the validation cohort of 200 participants, the signature differentiated bacterial from viral infections with an AUROC of 0·976 (95% CI 0·919-1·000), sensitivity of 97·3% (85·8-99·9), and specificity of 100% (63·1-100). The AUROC for C-reactive protein (CRP) was 0·833 (0·694-0·944) and for leukocyte count was 0·938 (0·840-0·986). The signature achieved higher net benefit in decision curve analysis than either CRP or leukocyte count for discriminating viral infections from all other infections. In the second validation analysis, which included SARS-CoV-2-positive participants, the signature discriminated 35 bacterial infections from 34 SARS-CoV-2-positive COVID-19 infections with AUROC of 0·953 (0·893-0·992), sensitivity 88·6%, and specificity of 94·1%. INTERPRETATION: This novel three-gene signature discriminates viral infections, including COVID-19, from other emergency infection presentations in adults, outperforming both leukocyte count and CRP, thus potentially providing substantial clinical utility in managing acute presentations with infection. FUNDING: National Institute for Health Research, Medical Research Council, Wellcome Trust, and EU-FP7.
Assuntos
Infecções Bacterianas , COVID-19 , Doenças Transmissíveis , Viroses , Adolescente , Adulto , Bactérias , Infecções Bacterianas/diagnóstico , Proteína C-Reativa/análise , COVID-19/diagnóstico , Estudos de Casos e Controles , Estudos de Coortes , Humanos , SARS-CoV-2/genética , Viroses/diagnósticoRESUMO
Background: Rotavirus (RV) is an enteric pathogen that has devastating impact on childhood morbidity and mortality worldwide. The immunologic mechanism underlying the protection achieved after RV vaccination is not yet fully understood. Methods: We compared the transcriptome of children affected by community-acquired RV infection and children immunized with a live attenuated RV vaccine (RotaTeq®). Results: RV vaccination mimics the wild type infection causing similar changes in children's transcriptome, including transcripts associated with cell cycle, diarrhea, nausea, vomiting, intussusception, and abnormal morphology of midgut. A machine learning approach allowed to detect a combination of nine-transcripts that differentiates vaccinated from convalescent-naturally infected children (AUC: 90%; 95%CI: 70-100) and distinguishes between acute-infected and healthy control children (in both cases, AUC: 100%; 95%CI: 100-100). We identified a miRNA hsa-mir-149 that seems to play a role in the host defense against viral pathogens and may have an antiviral role. Discussion: Our findings might shed further light in the understanding of RV infection, its functional link to intussusception causes, as well as guide development of antiviral treatments and safer and more effective vaccines. The nine-transcript signature may constitute a marker of vaccine protection and helps to differentiate vaccinated from naturally infected or susceptible children.