RESUMEN
Genetic variants of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) have repeatedly altered the course of the coronavirus disease 2019 (COVID-19) pandemic. Delta variants are now the focus of intense international attention because they are causing widespread COVID-19 globally and are associated with vaccine breakthrough cases. We sequenced 16,965 SARS-CoV-2 genomes from samples acquired March 15, 2021, through September 20, 2021, in the Houston Methodist hospital system. This sample represents 91% of all Methodist system COVID-19 patients during the study period. Delta variants increased rapidly from late April onward to cause 99.9% of all COVID-19 cases and spread throughout the Houston metroplex. Compared with all other variants combined, Delta caused a significantly higher rate of vaccine breakthrough cases (23.7% for Delta compared with 6.6% for all other variants combined). Importantly, significantly fewer fully vaccinated individuals required hospitalization. Vaccine breakthrough cases caused by Delta had a low median PCR cycle threshold value (a proxy for high virus load). This value was similar to the median cycle threshold value for unvaccinated patients with COVID-19 caused by Delta variants, suggesting that fully vaccinated individuals can transmit SARS-CoV-2 to others. Patients infected with Alpha and Delta variants had several significant differences. The integrated analysis indicates that vaccines used in the United States are highly effective in decreasing severe COVID-19, hospitalizations, and deaths.
Asunto(s)
COVID-19/virología , SARS-CoV-2 , Adulto , Vacunas contra la COVID-19 , Femenino , Humanos , Masculino , Persona de Mediana Edad , TexasRESUMEN
Since the beginning of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) pandemic, there has been international concern about the emergence of virus variants with mutations that increase transmissibility, enhance escape from the human immune response, or otherwise alter biologically important phenotypes. In late 2020, several variants of concern emerged globally, including the UK variant (B.1.1.7), the South Africa variant (B.1.351), Brazil variants (P.1 and P.2), and two related California variants of interest (B.1.429 and B.1.427). These variants are believed to have enhanced transmissibility. For the South Africa and Brazil variants, there is evidence that mutations in spike protein permit it to escape from some vaccines and therapeutic monoclonal antibodies. On the basis of our extensive genome sequencing program involving 20,453 coronavirus disease 2019 patient samples collected from March 2020 to February 2021, we report identification of all six of these SARS-CoV-2 variants among Houston Methodist Hospital (Houston, TX) patients residing in the greater metropolitan area. Although these variants are currently at relatively low frequency (aggregate of 1.1%) in the population, they are geographically widespread. Houston is the first city in the United States in which active circulation of all six current variants of concern has been documented by genome sequencing. As vaccine deployment accelerates, increased genomic surveillance of SARS-CoV-2 is essential to understanding the presence, frequency, and medical impact of consequential variants and their patterns and trajectory of dissemination.
Asunto(s)
COVID-19 , Mutación , Pandemias , SARS-CoV-2/genética , COVID-19/epidemiología , COVID-19/genética , COVID-19/transmisión , Femenino , Humanos , Masculino , SARS-CoV-2/aislamiento & purificación , Texas/epidemiologíaRESUMEN
Certain genetic variants of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) are of substantial concern because they may be more transmissible or detrimentally alter the pandemic course and disease features in individual patients. SARS-CoV-2 genome sequences from 12,476 patients in the Houston Methodist health care system diagnosed from January 1 through May 31, 2021 are reported here. Prevalence of the B.1.1.7 (Alpha) variant increased rapidly and caused 63% to 90% of new cases in the latter half of May. Eleven B.1.1.7 genomes had an E484K replacement in spike protein, a change also identified in other SARS-CoV-2 lineages. Compared with non-B.1.1.7-infected patients, individuals with B.1.1.7 had a significantly lower cycle threshold (a proxy for higher virus load) and significantly higher hospitalization rate. Other variants [eg, B.1.429 and B.1.427 (Epsilon), P.1 (Gamma), P.2 (Zeta), and R.1] also increased rapidly, although the magnitude was less than that in B.1.1.7. Twenty-two patients infected with B.1.617.1 (Kappa) or B.1.617.2 (Delta) variants had a high rate of hospitalization. Breakthrough cases (n = 207) in fully vaccinated patients were caused by a heterogeneous array of virus genotypes, including many not currently designated variants of interest or concern. In the aggregate, this study delineates the trajectory of SARS-CoV-2 variants circulating in a major metropolitan area, documents B.1.1.7 as the major cause of new cases in Houston, TX, and heralds the arrival of B.1.617 variants in the metroplex.
Asunto(s)
COVID-19/epidemiología , Genoma Viral , Mutación , SARS-CoV-2/genética , COVID-19/genética , COVID-19/transmisión , COVID-19/virología , Femenino , Humanos , Masculino , Persona de Mediana Edad , SARS-CoV-2/aislamiento & purificación , Texas/epidemiologíaRESUMEN
Coronavirus disease 2019 (COVID-19), caused by severe acute respiratory syndrome coronavirus 2, has spread globally, and no proven treatments are available. Convalescent plasma therapy has been used with varying degrees of success to treat severe microbial infections for >100 years. Patients (n = 25) with severe and/or life-threatening COVID-19 disease were enrolled at the Houston Methodist hospitals from March 28, 2020, to April 14, 2020. Patients were transfused with convalescent plasma, obtained from donors with confirmed severe acute respiratory syndrome coronavirus 2 infection who had recovered. The primary study outcome was safety, and the secondary outcome was clinical status at day 14 after transfusion. Clinical improvement was assessed on the basis of a modified World Health Organization six-point ordinal scale and laboratory parameters. Viral genome sequencing was performed on donor and recipient strains. At day 7 after transfusion with convalescent plasma, nine patients had at least a one-point improvement in clinical scale, and seven of those were discharged. By day 14 after transfusion, 19 (76%) patients had at least a one-point improvement in clinical status, and 11 were discharged. No adverse events as a result of plasma transfusion were observed. Whole genome sequencing data did not identify a strain genotype-disease severity correlation. The data indicate that administration of convalescent plasma is a safe treatment option for those with severe COVID-19 disease.
Asunto(s)
Infecciones por Coronavirus/terapia , Neumonía Viral/terapia , Adulto , Anciano , Betacoronavirus/genética , COVID-19 , Femenino , Humanos , Inmunización Pasiva , Aplicación de Nuevas Drogas en Investigación , Masculino , Persona de Mediana Edad , Pandemias , SARS-CoV-2 , Texas , Secuenciación Completa del Genoma , Adulto Joven , Sueroterapia para COVID-19RESUMEN
Probabilistic topic modelling has become essential in many types of single-cell data analysis. Based on probabilistic topic assignments in each cell, we identify the latent representation of cellular states. A dictionary matrix, consisting of topic-specific gene frequency vectors, provides interpretable bases to be compared with known cell type-specific marker genes and other pathway annotations. However, fitting a topic model on a large number of cells would require heavy computational resources-specialized computing units, computing time and memory. Here, we present a scalable approximation method customized for single-cell RNA-seq data analysis, termed ASAP, short for Annotating a Single-cell data matrix by Approximate Pseudobulk estimation. Our approach is more accurate than existing methods but requires orders of magnitude less computing time, leaving much lower memory consumption. We also show that our approach is widely applicable for atlas-scale data analysis; our method seamlessly integrates single-cell and bulk data in joint analysis, not requiring additional preprocessing or feature selection steps.
Asunto(s)
Biología Computacional , Análisis de la Célula Individual , Análisis de la Célula Individual/métodos , Humanos , Biología Computacional/métodos , Algoritmos , Modelos Estadísticos , Análisis de Secuencia de ARN/métodos , RNA-Seq/métodos , Perfilación de la Expresión Génica/métodosRESUMEN
In multicellular organisms, cell identity and functions are primed and refined through interactions with other surrounding cells. Here, we propose a scalable machine learning method, termed SPRUCE, which is designed to systematically ascertain common cell-cell communication patterns embedded in single-cell RNA-seq data. We applied our approach to investigate tumor microenvironments consolidating multiple breast cancer datasets and found seven frequently observed interaction signatures and underlying gene-gene interaction networks. Our results implicate that a part of tumor heterogeneity, especially within the same subtype, is better understood by differential interaction patterns rather than the static expression of known marker genes.
RESUMEN
The COVID-19 pandemic has resulted in extensive surveillance of the genomic diversity of SARS-CoV-2. Sequencing data generated as part of these efforts can also capture the diversity of the SARS-CoV-2 virus populations replicating within infected individuals. To assess this within-host diversity of SARS-CoV-2 we quantified low frequency (minor) variants from deep sequence data of thousands of clinical samples collected by a large urban hospital system over the course of a year. Using a robust analytical pipeline to control for technical artefacts, we observe that at comparable viral loads, specimens from patients hospitalized due to COVID-19 had a greater number of minor variants than samples from outpatients. Since individuals with highly diverse viral populations could be disproportionate drivers of new viral lineages in the patient population, these results suggest that transmission control should pay special attention to patients with severe or protracted disease to prevent the spread of novel variants.
RESUMEN
INTRODUCTION: Manual gating of flow cytometry (FCM) data for marrow cell analysis is a standard approach in current practice, although it is time- and labor-consuming. Recent advances in cytometry technology have led to significant efforts in developing partially or fully automated analysis methods. Although multiple supervised and unsupervised FCM data analysis algorithms have been developed, they have not been widely adopted by the clinical and research laboratories. In this study, we evaluated flowDensity, an open source freely available algorithm, as an automated analysis tool for classification of lymphocyte subsets in the bone marrow biopsy specimens. MATERIALS AND METHODS: FlowDensity-based gating was applied to 102 normal bone marrow samples and compared with the manual analysis. Independent expression of each cell marker was assessed for comprehensive expression analysis and visualization. RESULTS: Our findings showed a correlation between the manual and flowDensity-based gating in the lymphocyte subsets. However, flowDensity-based gating in the populations with a small number of cells in each cluster showed a low degree of correlation. Comprehensive expression analysis successfully identified and visualized the lymphocyte subsets. DISCUSSION: Our study found that although flowDensity might be a promising method for FCM data analysis, more optimization is required before implementing this algorithm into day-to-day workflow.
RESUMEN
Measures intended to limit the spread of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) virus at the start of the coronavirus disease 2019 (COVID-19) pandemic resulted in a rapid decrease in other respiratory pathogens. Herein, we describe the trends of respiratory pathogens in a major metropolitan health care system central microbiology reference laboratory before and during the COVID-19 pandemic, with attention to when COVID-19 mitigation measures were implemented and relaxed. During the initial lockdown period, COVID-19 was the primary respiratory pathogen detected by multiplex respiratory panels. As COVID-19 containment measures were relaxed, the first non-COVID respiratory viruses to return to prepandemic levels were members of the rhinovirus/enterovirus family. After the complete removal of COVID-19 precautions at the state level, including an end to mask mandates, we observed the robust return of seasonal coronaviruses, parainfluenza virus, and respiratory syncytial virus. Inasmuch as COVID-19 has dominated the landscape of respiratory infections since early 2020, it is important for clinicians to recognize that the return of non-COVID respiratory pathogens may be rapid and significant when COVID-19 containment measures are removed. IMPORTANCE We describe the return of non-COVID respiratory viruses after the removal of COVID-19 mitigation measures. It is important for the public and physicians to recognize that, after months of COVID-19 being the primary driver of respiratory infection, more typical seasonal respiratory illnesses have returned, and this return is out of the normal season for some of these pathogens. Thus, clinicians and the public must now consider both COVID-19 and other respiratory illnesses when a patient presents with symptomatic respiratory illness.
Asunto(s)
COVID-19/epidemiología , COVID-19/prevención & control , Control de Enfermedades Transmisibles , Infecciones del Sistema Respiratorio/epidemiología , Infecciones del Sistema Respiratorio/prevención & control , Infecciones por Coxsackievirus/epidemiología , Infecciones por Coxsackievirus/prevención & control , Enterovirus/aislamiento & purificación , Humanos , Programas Obligatorios/estadística & datos numéricos , Orthomyxoviridae/aislamiento & purificación , Infecciones por Orthomyxoviridae/epidemiología , Infecciones por Orthomyxoviridae/prevención & control , Infecciones por Picornaviridae/epidemiología , Infecciones por Picornaviridae/prevención & control , Rhinovirus/aislamiento & purificación , SARS-CoV-2/crecimiento & desarrollo , Texas/epidemiologíaRESUMEN
The ARTIC Network provides a common resource of PCR primer sequences and recommendations for amplifying SARS-CoV-2 genomes. The initial tiling strategy was developed with the reference genome Wuhan-01, and subsequent iterations have addressed areas of low amplification and sequence drop out. Recently, a new version (V4) was released, based on new variant genome sequences, in response to the realization that some V3 primers were located in regions with key mutations. Herein, we compare the performance of the ARTIC V3 and V4 primer sets with a matched set of 663 SARS-CoV-2 clinical samples sequenced with an Illumina NovaSeq 6000 instrument. We observe general improvements in sequencing depth and quality, and improved resolution of the SNP causing the D950N variation in the spike protein. Importantly, we also find nearly universal presence of spike protein substitution G142D in Delta-lineage samples. Due to the prior release and widespread use of the ARTIC V3 primers during the initial surge of the Delta variant, it is likely that the G142D amino acid substitution is substantially underrepresented among early Delta variant genomes deposited in public repositories. In addition to the improved performance of the ARTIC V4 primer set, this study also illustrates the importance of the primer scheme in downstream analyses. IMPORTANCE ARTIC Network primers are commonly used by laboratories worldwide to amplify and sequence SARS-CoV-2 present in clinical samples. As new variants have evolved and spread, it was found that the V3 primer set poorly amplified several key mutations. In this report, we compare the results of sequencing a matched set of samples with the V3 and V4 primer sets. We find that adoption of the ARTIC V4 primer set is critical for accurate sequencing of the SARS-CoV-2 spike region. The absence of metadata describing the primer scheme used will negatively impact the downstream use of publicly available SARS-Cov-2 sequencing reads and assembled genomes.
Asunto(s)
Sustitución de Aminoácidos , COVID-19/virología , SARS-CoV-2/clasificación , SARS-CoV-2/genética , SARS-CoV-2/aislamiento & purificación , Glicoproteína de la Espiga del Coronavirus/genética , Secuencia de Bases , Genoma Viral , Humanos , Mutación , Secuenciación Completa del GenomaRESUMEN
Although the major events in prokaryotic cell cycle progression are likely to be coordinated with transcriptional and metabolic changes, these processes remain poorly characterized. Unlike many rapidly growing bacteria, DNA replication and cell division are temporally resolved in mycobacteria, making these slow-growing organisms a potentially useful system to investigate the prokaryotic cell cycle. To determine whether cell-cycle-dependent gene regulation occurs in mycobacteria, we characterized the temporal changes in the transcriptome of synchronously replicating populations of Mycobacterium tuberculosis (Mtb). By enriching for genes that display a sinusoidal expression pattern, we discover 485 genes that oscillate with a period consistent with the cell cycle. During cytokinesis, the timing of gene induction could be used to predict the timing of gene function, as mRNA abundance was found to correlate with the order in which proteins were recruited to the developing septum. Similarly, the expression pattern of primary metabolic genes could be used to predict the relative importance of these pathways for different cell cycle processes. Pyrimidine synthetic genes peaked during DNA replication, and their depletion caused a filamentation phenotype that phenocopied defects in this process. In contrast, the inosine monophasphate dehydrogenase dedicated to guanosine synthesis, GuaB2, displayed the opposite expression pattern and its depletion perturbed septation. Together, these data imply obligate coordination between primary metabolism and cell division and identify periodically regulated genes that can be related to specific cell biological functions.
Asunto(s)
Ciclo Celular/genética , División Celular/genética , Replicación del ADN/genética , Regulación Bacteriana de la Expresión Génica/genética , Mycobacterium tuberculosis/genética , Citocinesis/genética , Expresión Génica , Guanosina/biosíntesis , IMP Deshidrogenasa/genética , Mycobacterium tuberculosis/crecimiento & desarrolloRESUMEN
OBJECTIVES: Informatics tools that support next-generation sequencing workflows are essential to deliver timely interpretation of somatic variants in cancer. Here, we describe significant updates to our laboratory developed bioinformatics pipelines and data management application termed Houston Methodist Variant Viewer (HMVV). MATERIALS AND METHODS: We collected feature requests and workflow improvement suggestions from the end-users of HMVV version 1. Over 1.5 years, we iteratively implemented these features in five sequential updates to HMVV version 3. RESULTS: We improved the performance and data throughput of the application while reducing the opportunity for manual data entry errors. We enabled end-user workflows for pipeline monitoring, variant interpretation and annotation, and integration with our laboratory information system. System maintenance was improved through enhanced defect reporting, heightened data security, and improved modularity in the code and system environments. DISCUSSION AND CONCLUSION: Validation of each HMVV update was performed according to expert guidelines. We enabled an 8× reduction in the bioinformatics pipeline computation time for our longest running assay. Our molecular pathologists can interpret the assay results at least 2 days sooner than was previously possible. The application and pipeline code are publicly available at https://github.com/hmvv.
RESUMEN
We sequenced the genomes of 5,085 severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) strains causing two coronavirus disease 2019 (COVID-19) disease waves in metropolitan Houston, TX, an ethnically diverse region with 7 million residents. The genomes were from viruses recovered in the earliest recognized phase of the pandemic in Houston and from viruses recovered in an ongoing massive second wave of infections. The virus was originally introduced into Houston many times independently. Virtually all strains in the second wave have a Gly614 amino acid replacement in the spike protein, a polymorphism that has been linked to increased transmission and infectivity. Patients infected with the Gly614 variant strains had significantly higher virus loads in the nasopharynx on initial diagnosis. We found little evidence of a significant relationship between virus genotype and altered virulence, stressing the linkage between disease severity, underlying medical conditions, and host genetics. Some regions of the spike protein-the primary target of global vaccine efforts-are replete with amino acid replacements, perhaps indicating the action of selection. We exploited the genomic data to generate defined single amino acid replacements in the receptor binding domain of spike protein that, importantly, produced decreased recognition by the neutralizing monoclonal antibody CR3022. Our report represents the first analysis of the molecular architecture of SARS-CoV-2 in two infection waves in a major metropolitan region. The findings will help us to understand the origin, composition, and trajectory of future infection waves and the potential effect of the host immune response and therapeutic maneuvers on SARS-CoV-2 evolution.IMPORTANCE There is concern about second and subsequent waves of COVID-19 caused by the SARS-CoV-2 coronavirus occurring in communities globally that had an initial disease wave. Metropolitan Houston, TX, with a population of 7 million, is experiencing a massive second disease wave that began in late May 2020. To understand SARS-CoV-2 molecular population genomic architecture and evolution and the relationship between virus genotypes and patient features, we sequenced the genomes of 5,085 SARS-CoV-2 strains from these two waves. Our report provides the first molecular characterization of SARS-CoV-2 strains causing two distinct COVID-19 disease waves.
Asunto(s)
Betacoronavirus/genética , Infecciones por Coronavirus/virología , Neumonía Viral/virología , Glicoproteína de la Espiga del Coronavirus/química , Glicoproteína de la Espiga del Coronavirus/genética , Secuencia de Aminoácidos , Sustitución de Aminoácidos , Anticuerpos Neutralizantes/inmunología , Secuencia de Bases , Betacoronavirus/inmunología , COVID-19 , Prueba de COVID-19 , Técnicas de Laboratorio Clínico , Infecciones por Coronavirus/diagnóstico , Infecciones por Coronavirus/epidemiología , Infecciones por Coronavirus/inmunología , ARN Polimerasa Dependiente de ARN de Coronavirus , Genoma Viral , Genotipo , Humanos , Aprendizaje Automático , Modelos Moleculares , Técnicas de Diagnóstico Molecular , Pandemias , Filogenia , Neumonía Viral/epidemiología , Neumonía Viral/inmunología , ARN Polimerasa Dependiente del ARN/química , ARN Polimerasa Dependiente del ARN/genética , SARS-CoV-2 , Análisis de Secuencia de Proteína , Glicoproteína de la Espiga del Coronavirus/inmunología , Texas/epidemiología , Proteínas no Estructurales Virales/química , Proteínas no Estructurales Virales/genéticaRESUMEN
We sequenced the genomes of 5,085 SARS-CoV-2 strains causing two COVID-19 disease waves in metropolitan Houston, Texas, an ethnically diverse region with seven million residents. The genomes were from viruses recovered in the earliest recognized phase of the pandemic in Houston, and an ongoing massive second wave of infections. The virus was originally introduced into Houston many times independently. Virtually all strains in the second wave have a Gly614 amino acid replacement in the spike protein, a polymorphism that has been linked to increased transmission and infectivity. Patients infected with the Gly614 variant strains had significantly higher virus loads in the nasopharynx on initial diagnosis. We found little evidence of a significant relationship between virus genotypes and altered virulence, stressing the linkage between disease severity, underlying medical conditions, and host genetics. Some regions of the spike protein - the primary target of global vaccine efforts - are replete with amino acid replacements, perhaps indicating the action of selection. We exploited the genomic data to generate defined single amino acid replacements in the receptor binding domain of spike protein that, importantly, produced decreased recognition by the neutralizing monoclonal antibody CR30022. Our study is the first analysis of the molecular architecture of SARS-CoV-2 in two infection waves in a major metropolitan region. The findings will help us to understand the origin, composition, and trajectory of future infection waves, and the potential effect of the host immune response and therapeutic maneuvers on SARS-CoV-2 evolution.
RESUMEN
BACKGROUND: COVID-19 disease, caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), has spread globally, and no proven treatments are available. Convalescent plasma therapy has been used with varying degrees of success to treat severe microbial infections for more than 100 years. METHODS: Patients (n=25) with severe and/or life-threatening COVID-19 disease were enrolled at the Houston Methodist hospitals from March 28 to April 14, 2020. Patients were transfused with convalescent plasma obtained from donors with confirmed SARS-CoV-2 infection and had been symptom free for 14 days. The primary study outcome was safety, and the secondary outcome was clinical status at day 14 post-transfusion. Clinical improvement was assessed based on a modified World Health Organization 6-point ordinal scale and laboratory parameters. Viral genome sequencing was performed on donor and recipient strains. RESULTS: At baseline, all patients were receiving supportive care, including anti-inflammatory and anti-viral treatments, and all patients were on oxygen support. At day 7 post-transfusion with convalescent plasma, nine patients had at least a 1-point improvement in clinical scale, and seven of those were discharged. By day 14 post-transfusion, 19 (76%) patients had at least a 1-point improvement in clinical status and 11 were discharged. No adverse events as a result of plasma transfusion were observed. The whole genome sequencing data did not identify a strain genotype-disease severity correlation. CONCLUSIONS: The data indicate that administration of convalescent plasma is a safe treatment option for those with severe COVID-19 disease. Randomized, controlled trials are needed to determine its efficacy.