RESUMEN
Compared with other SARS-related coronaviruses (SARSr-CoVs), SARS-CoV-2 possesses a unique furin cleavage site (FCS) in its spike. This has stimulated discussion pertaining to the origin of SARS-CoV-2 because the FCS has been observed to be under strong selective pressure in humans and confers the enhanced ability to infect some cell types and induce cell-cell fusion. Furthermore, scientists have demonstrated interest in studying novel cleavage sites by introducing them into SARSr-CoVs. We review what is known about the SARS-CoV-2 FCS in the context of its pathogenesis, origin, and how future wildlife coronavirus sampling may alter the interpretation of existing data.
Asunto(s)
Furina , Glicoproteína de la Espiga del Coronavirus , Evolución Molecular , Furina/genética , SARS-CoV-2 , Glicoproteína de la Espiga del Coronavirus/genéticaRESUMEN
COVID-19 CG (covidcg.org) is an open resource for tracking SARS-CoV-2 single-nucleotide variations (SNVs), lineages, and clades using the virus genomes on the GISAID database while filtering by location, date, gene, and mutation of interest. COVID-19 CG provides significant time, labor, and cost-saving utility to projects on SARS-CoV-2 transmission, evolution, diagnostics, therapeutics, vaccines, and intervention tracking. Here, we describe case studies in which users can interrogate (1) SNVs in the SARS-CoV-2 spike receptor binding domain (RBD) across different geographical regions to inform the design and testing of therapeutics, (2) SNVs that may impact the sensitivity of commonly used diagnostic primers, and (3) the emergence of a dominant lineage harboring an S477N RBD mutation in Australia in 2020. To accelerate COVID-19 efforts, COVID-19 CG will be upgraded with new features for users to rapidly pinpoint mutations as the virus evolves throughout the pandemic and in response to therapeutic and public health interventions.
The discovery of faster spreading variants of the virus that causes coronavirus disease 2019 (COVID-19) has raised alarm. These new variants are the result of changes (called mutations) in the virus' genetic code. Random mutations can occur each time a virus multiplies. Although most mutations do not introduce any meaningful changes, some can alter the characteristics of the virus, for instance, helping the virus to spread more easily, reinfecting people who have had COVID-19 before, or reducing the sensitivity to treatments or vaccines. Scientists need to know about mutations in the virus that make treatments or vaccines less effective as soon as possible, so they can adjust their pandemic response. As a result, tracking these genetic changes is essential. But individual scientists or public health agencies may not have the staff, time or computer resources to extract usable information from the growing amount of genetic data available. A free online tool created by Chen et al. may help scientists and public health officials to track changes to the virus more easily. The COVID-19 CoV Genetics tool (COVID-19 CG) can quickly provide information on which virus mutations are present in an area during a specific period. It does this by processing data on mutations found in viral genetic material collected worldwide from hundreds of thousands of people with COVID-19, which are hosted in an existing online database. The COVID-19 CG tool presents customizable, interactive visualizations of the data. Thousands of scientists, public health agencies, and COVID-19 vaccine and treatment developers in over 100 countries are already using the COVID-19 CG tool to find the most common mutations in their area and use it for research. They can use this information to develop more effective vaccines or treatments. Chen et al. plan to update and improve the tool as more information becomes available to help advance global efforts to end the COVID-19 pandemic.
Asunto(s)
COVID-19/prevención & control , Biología Computacional/métodos , Genoma Viral/genética , Mutación , SARS-CoV-2/genética , Secuencia de Aminoácidos , Sitios de Unión/genética , COVID-19/epidemiología , COVID-19/virología , Geografía , Salud Global , Humanos , Internet , Pandemias , Sistemas de Identificación de Pacientes/métodos , Filogenia , SARS-CoV-2/clasificación , SARS-CoV-2/fisiología , Homología de Secuencia de Aminoácido , Programas Informáticos , Glicoproteína de la Espiga del Coronavirus/genética , Glicoproteína de la Espiga del Coronavirus/metabolismoRESUMEN
Biases in data used to train machine learning (ML) models can inflate their prediction performance and confound our understanding of how and what they learn. Although biases are common in biological data, systematic auditing of ML models to identify and eliminate these biases is not a common practice when applying ML in the life sciences. Here we devise a systematic, principled, and general approach to audit ML models in the life sciences. We use this auditing framework to examine biases in three ML applications of therapeutic interest and identify unrecognized biases that hinder the ML process and result in substantially reduced model performance on new datasets. Ultimately, we show that ML models tend to learn primarily from data biases when there is insufficient signal in the data to learn from. We provide detailed protocols, guidelines, and examples of code to enable tailoring of the auditing framework to other biomedical applications.
Asunto(s)
Minería de Datos , Aprendizaje Automático , Proteínas/metabolismo , Proteoma , Proteómica , Animales , Sesgo , Bases de Datos de Proteínas , Antígenos de Histocompatibilidad/metabolismo , Humanos , Preparaciones Farmacéuticas/química , Preparaciones Farmacéuticas/metabolismo , Unión Proteica , Mapas de Interacción de Proteínas , Proteínas/química , Reproducibilidad de los ResultadosRESUMEN
COVID-19 CG is an open resource for tracking SARS-CoV-2 single-nucleotide variations (SNVs) and lineages while filtering by location, date, gene, and mutation of interest. COVID-19 CG provides significant time, labor, and cost-saving utility to diverse projects on SARS-CoV-2 transmission, evolution, emergence, immune interactions, diagnostics, therapeutics, vaccines, and intervention tracking. Here, we describe case studies in which users can interrogate (1) SNVs in the SARS-CoV-2 Spike receptor binding domain (RBD) across different geographic regions to inform the design and testing of therapeutics, (2) SNVs that may impact the sensitivity of commonly used diagnostic primers, and (3) the recent emergence of a dominant lineage harboring an S477N RBD mutation in Australia. To accelerate COVID-19 research and public health efforts, COVID-19 CG will be continually upgraded with new features for users to quickly and reliably pinpoint mutations as the virus evolves throughout the pandemic and in response to therapeutic and public health interventions.
RESUMEN
The engineered AAV-PHP.B family of adeno-associated virus efficiently delivers genes throughout the mouse central nervous system. To guide their application across disease models, and to inspire the development of translational gene therapy vectors for targeting neurological diseases in humans, we sought to elucidate the host factors responsible for the CNS tropism of the AAV-PHP.B vectors. Leveraging CNS tropism differences across 13 mouse strains, we systematically determined a set of genetic variants that segregate with the permissivity phenotype, and rapidly identified LY6A as an essential receptor for the AAV-PHP.B vectors. Interfering with LY6A by CRISPR/Cas9-mediated Ly6a disruption or with blocking antibodies reduced transduction of mouse brain endothelial cells by AAV-PHP.eB, while ectopic expression of Ly6a increased AAV-PHP.eB transduction of HEK293T and CHO cells by 30-fold or more. Importantly, we demonstrate that this newly discovered mode of AAV binding and transduction can occur independently of other known AAV receptors. These findings illuminate the previously reported species- and strain-specific tropism characteristics of the AAV-PHP.B vectors and inform ongoing efforts to develop next-generation AAV vehicles for human CNS gene therapy.