RESUMO
Significant variations have been observed in viral copies generated during SARS-CoV-2 infections. However, the factors that impact viral copies and infection dynamics are not fully understood, and may be inherently dependent upon different viral and host factors. Here, we conducted virus whole genome sequencing and measured viral copies using RT-qPCR from 9,902 SARS-CoV-2 infections over a 2-year period to examine the impact of virus genetic variation on changes in viral copies adjusted for host age and vaccination status. Using a genome-wide association study (GWAS) approach, we identified multiple single-nucleotide polymorphisms (SNPs) corresponding to amino acid changes in the SARS-CoV-2 genome associated with variations in viral copies. We further applied a marginal epistasis test to detect interactions among SNPs and identified multiple pairs of substitutions located in the spike gene that have non-linear effects on viral copies. We also analyzed the temporal patterns and found that SNPs associated with increased viral copies were predominantly observed in Delta and Omicron BA.2/BA.4/BA.5/XBB infections, whereas those associated with decreased viral copies were only observed in infections with Omicron BA.1 variants. Our work showcases how GWAS can be a useful tool for probing phenotypes related to SNPs in viral genomes that are worth further exploration. We argue that this approach can be used more broadly across pathogens to characterize emerging variants and monitor therapeutic interventions.
Assuntos
COVID-19 , Genoma Viral , Estudo de Associação Genômica Ampla , Polimorfismo de Nucleotídeo Único , SARS-CoV-2 , Polimorfismo de Nucleotídeo Único/genética , Humanos , SARS-CoV-2/genética , Estudo de Associação Genômica Ampla/métodos , COVID-19/genética , COVID-19/virologia , Genoma Viral/genética , Glicoproteína da Espícula de Coronavírus/genética , Pessoa de Meia-Idade , Adulto , Masculino , Feminino , Carga Viral/genética , Idoso , Sequenciamento Completo do Genoma/métodosRESUMO
LD score regression (LDSC) is a method to estimate narrow-sense heritability from genome-wide association study (GWAS) summary statistics alone, making it a fast and popular approach. In this work, we present interaction-LD score (i-LDSC) regression: an extension of the original LDSC framework that accounts for interactions between genetic variants. By studying a wide range of generative models in simulations, and by re-analyzing 25 well-studied quantitative phenotypes from 349,468 individuals in the UK Biobank and up to 159,095 individuals in BioBank Japan, we show that the inclusion of a cis-interaction score (i.e. interactions between a focal variant and proximal variants) recovers genetic variance that is not captured by LDSC. For each of the 25 traits analyzed in the UK Biobank and BioBank Japan, i-LDSC detects additional variation contributed by genetic interactions. The i-LDSC software and its application to these biobanks represent a step towards resolving further genetic contributions of sources of non-additive genetic effects to complex trait variation.
Assuntos
Estudo de Associação Genômica Ampla , Estudo de Associação Genômica Ampla/métodos , Humanos , Japão , Reino Unido , Polimorfismo de Nucleotídeo Único/genética , Modelos Genéticos , Fenótipo , Variação Genética , Herança Multifatorial/genética , Bancos de Espécimes BiológicosRESUMO
Epistasis, commonly defined as the interaction between genetic loci, is known to play an important role in the phenotypic variation of complex traits. As a result, many statistical methods have been developed to identify genetic variants that are involved in epistasis, and nearly all of these approaches carry out this task by focusing on analyzing one trait at a time. Previous studies have shown that jointly modeling multiple phenotypes can often dramatically increase statistical power for association mapping. In this study, we present the "multivariate MArginal ePIstasis Test" (mvMAPIT)-a multioutcome generalization of a recently proposed epistatic detection method which seeks to detect marginal epistasis or the combined pairwise interaction effects between a given variant and all other variants. By searching for marginal epistatic effects, one can identify genetic variants that are involved in epistasis without the need to identify the exact partners with which the variants interact-thus, potentially alleviating much of the statistical and computational burden associated with conventional explicit search-based methods. Our proposed mvMAPIT builds upon this strategy by taking advantage of correlation structure between traits to improve the identification of variants involved in epistasis. We formulate mvMAPIT as a multivariate linear mixed model and develop a multitrait variance component estimation algorithm for efficient parameter inference and P-value computation. Together with reasonable model approximations, our proposed approach is scalable to moderately sized genome-wide association studies. With simulations, we illustrate the benefits of mvMAPIT over univariate (or single-trait) epistatic mapping strategies. We also apply mvMAPIT framework to protein sequence data from two broadly neutralizing anti-influenza antibodies and approximately 2,000 heterogeneous stock of mice from the Wellcome Trust Centre for Human Genetics. The mvMAPIT R package can be downloaded at https://github.com/lcrawlab/mvMAPIT.
Assuntos
Epistasia Genética , Estudo de Associação Genômica Ampla , Humanos , Animais , Camundongos , Fenótipo , Locos de Características Quantitativas , AlgoritmosRESUMO
Continuous enzyme-free replication of oligonucleotides is central for open-ended evolution experiments that mimic the origin of life. Here, we studied a reaction system, whereby two 24mer DNA templates cross-catalyzed each other's synthesis from four 12mer DNA fragments, two of which were in situ activated with the condensing agent 1-ethyl-3-(3-dimethylamino-propyl)carbodiimide (EDC). We circumvented the problem of product inhibition by melting the stable product duplexes for their reuse as templates in the following ligation step. The system reproduced itself through ligation/melting cycles and survived exponential dilution. We quantified EDC-induced side reactions in a detailed kinetic model. The model allowed us to analyze the effects of various reaction rates on the system's kinetics and confirmed maximal replication under the chosen conditions. The presented system enables us to study nonenzymatic open-ended evolution experiments starting from diverse sequence pools.