Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 22
Filter
1.
Genet Epidemiol ; 47(2): 198-212, 2023 03.
Article in English | MEDLINE | ID: mdl-36701426

ABSTRACT

Genetic variants in drug targets can be used to predict the long-term, on-target effect of drugs. Here, we extend this principle to assess how sex and body mass index may modify the effect of genetically predicted lower CETP levels on biomarkers and cardiovascular outcomes. We found sex and body mass index (BMI) to be modifiers of the association between genetically predicted lower CETP and lipid biomarkers in UK Biobank participants. Female sex and lower BMI were associated with higher high-density lipoprotein cholesterol and lower low-density lipoprotein cholesterol for the same genetically predicted reduction in CETP concentration. We found that sex also modulated the effect of genetically lower CETP on cholesterol efflux capacity in samples from the Montreal Heart Institute Biobank. However, these modifying effects did not extend to sex differences in cardiovascular outcomes in our data. Our results provide insight into the clinical effects of CETP inhibitors in the presence of effect modification based on genetic data. The approach can support precision medicine applications and help assess the external validity of clinical trials.


Subject(s)
Cholesterol Ester Transfer Proteins , Humans , Male , Female , Cholesterol Ester Transfer Proteins/genetics , Cholesterol, HDL , Cholesterol, LDL , Biomarkers
2.
Nature ; 530(7589): 171-176, 2016 Feb 11.
Article in English | MEDLINE | ID: mdl-26840484

ABSTRACT

The DNA-binding protein PRDM9 directs positioning of the double-strand breaks (DSBs) that initiate meiotic recombination in mice and humans. Prdm9 is the only mammalian speciation gene yet identified and is responsible for sterility phenotypes in male hybrids of certain mouse subspecies. To investigate PRDM9 binding and its role in fertility and meiotic recombination, we humanized the DNA-binding domain of PRDM9 in C57BL/6 mice. This change repositions DSB hotspots and completely restores fertility in male hybrids. Here we show that alteration of one Prdm9 allele impacts the behaviour of DSBs controlled by the other allele at chromosome-wide scales. These effects correlate strongly with the degree to which each PRDM9 variant binds both homologues at the DSB sites it controls. Furthermore, higher genome-wide levels of such 'symmetric' PRDM9 binding associate with increasing fertility measures, and comparisons of individual hotspots suggest binding symmetry plays a downstream role in the recombination process. These findings reveal that subspecies-specific degradation of PRDM9 binding sites by meiotic drive, which steadily increases asymmetric PRDM9 binding, has impacts beyond simply changing hotspot positions, and strongly support a direct involvement in hybrid infertility. Because such meiotic drive occurs across mammals, PRDM9 may play a wider, yet transient, role in the early stages of speciation.


Subject(s)
Genetic Speciation , Histone-Lysine N-Methyltransferase/chemistry , Histone-Lysine N-Methyltransferase/metabolism , Hybridization, Genetic/genetics , Infertility/genetics , Protein Engineering , Zinc Fingers/genetics , Alleles , Animals , Binding Sites , Chromosome Pairing/genetics , Chromosomes, Mammalian/genetics , Chromosomes, Mammalian/metabolism , DNA Breaks, Double-Stranded , Female , Histone-Lysine N-Methyltransferase/genetics , Humans , Male , Meiosis/genetics , Mice , Mice, Inbred C57BL , Protein Binding , Protein Structure, Tertiary/genetics , Recombination, Genetic/genetics
3.
Diabetologia ; 64(9): 2012-2025, 2021 09.
Article in English | MEDLINE | ID: mdl-34226943

ABSTRACT

AIMS/HYPOTHESIS: Type 2 diabetes increases the risk of cardiovascular and renal complications, but early risk prediction could lead to timely intervention and better outcomes. Genetic information can be used to enable early detection of risk. METHODS: We developed a multi-polygenic risk score (multiPRS) that combines ten weighted PRSs (10 wPRS) composed of 598 SNPs associated with main risk factors and outcomes of type 2 diabetes, derived from summary statistics data of genome-wide association studies. The 10 wPRS, first principal component of ethnicity, sex, age at onset and diabetes duration were included into one logistic regression model to predict micro- and macrovascular outcomes in 4098 participants in the ADVANCE study and 17,604 individuals with type 2 diabetes in the UK Biobank study. RESULTS: The model showed a similar predictive performance for cardiovascular and renal complications in different cohorts. It identified the top 30% of ADVANCE participants with a mean of 3.1-fold increased risk of major micro- and macrovascular events (p = 6.3 × 10-21 and p = 9.6 × 10-31, respectively) and a 4.4-fold (p = 6.8 × 10-33) higher risk of cardiovascular death. While in ADVANCE overall, combined intensive blood pressure and glucose control decreased cardiovascular death by 24%, the model identified a high-risk group in whom it decreased the mortality rate by 47%, and a low-risk group in whom it had no discernible effect. High-risk individuals had the greatest absolute risk reduction with a number needed to treat of 12 to prevent one cardiovascular death over 5 years. CONCLUSIONS/INTERPRETATION: This novel multiPRS model stratified individuals with type 2 diabetes according to risk of complications and helped to target earlier those who would receive greater benefit from intensive therapy.


Subject(s)
Diabetes Complications , Diabetes Mellitus, Type 2 , Multifactorial Inheritance , Blood Glucose , Blood Pressure/genetics , Diabetes Complications/complications , Diabetes Mellitus, Type 2/complications , Diabetes Mellitus, Type 2/genetics , Genome-Wide Association Study , Humans , Risk Factors
5.
Can J Cardiol ; 40(10): 1907-1921, 2024 Oct.
Article in English | MEDLINE | ID: mdl-38735528

ABSTRACT

In the dynamic field of medical artificial intelligence (AI), cardiology stands out as a key area for its technological advancements and clinical application. In this review we explore the complex issue of data bias, specifically addressing those encountered during the development and implementation of AI tools in cardiology. We dissect the origins and effects of these biases, which challenge their reliability and widespread applicability in health care. Using a case study, we highlight the complexities involved in addressing these biases from a clinical viewpoint. The goal of this review is to equip researchers and clinicians with the practical knowledge needed to identify, understand, and mitigate these biases, advocating for the creation of AI solutions that are not just technologically sound, but also fair and effective for all patients.


Subject(s)
Artificial Intelligence , Cardiology , Humans , Bias , Reproducibility of Results
6.
Genome Biol Evol ; 16(1)2024 01 05.
Article in English | MEDLINE | ID: mdl-38207129

ABSTRACT

Cytochromes P450 (CYP450) are hemoproteins generally involved in the detoxification of the body of xenobiotic molecules. They participate in the metabolism of many drugs and genetic polymorphisms in humans have been found to impact drug responses and metabolic functions. In this study, we investigate the genetic diversity of CYP450 genes. We found that two clusters, CYP3A and CYP4F, are notably differentiated across human populations with evidence for selective pressures acting on both clusters: we found signals of recent positive selection in CYP3A and CYP4F genes and signals of balancing selection in CYP4F genes. Furthermore, an extensive amount of unusual linkage disequilibrium is detected in this latter cluster, indicating co-evolution signatures among CYP4F genes. Several of the selective signals uncovered co-localize with expression quantitative trait loci (eQTL), which could suggest epistasis acting on co-regulation in these gene families. In particular, we detected a potential co-regulation event between CYP3A5 and CYP3A43, a gene whose function remains poorly characterized. We further identified a causal relationship between CYP3A5 expression and reticulocyte count through Mendelian randomization analyses, potentially involving a regulatory region displaying a selective signal specific to African populations. Our findings linking natural selection and gene expression in CYP3A and CYP4F subfamilies are of importance in understanding population differences in metabolism of nutrients and drugs.


Subject(s)
Cytochrome P-450 CYP3A , Hominidae , Animals , Humans , Cytochrome P-450 CYP3A/genetics , Cytochrome P-450 CYP3A/metabolism , Hominidae/metabolism , Cytochrome P-450 Enzyme System/genetics , Polymorphism, Genetic , Selection, Genetic
7.
Can J Cardiol ; 40(10): 1774-1787, 2024 Oct.
Article in English | MEDLINE | ID: mdl-38825181

ABSTRACT

Large language models (LLMs) have emerged as powerful tools in artificial intelligence, demonstrating remarkable capabilities in natural language processing and generation. In this article, we explore the potential applications of LLMs in enhancing cardiovascular care and research. We discuss how LLMs can be used to simplify complex medical information, improve patient-physician communication, and automate tasks such as summarising medical articles and extracting key information. In addition, we highlight the role of LLMs in categorising and analysing unstructured data, such as medical notes and test results, which could revolutionise data handling and interpretation in cardiovascular research. However, we also emphasise the limitations and challenges associated with LLMs, including potential biases, reasoning opacity, and the need for rigourous validation in medical contexts. This review provides a practical guide for cardiovascular professionals to understand and harness the power of LLMs while navigating their limitations. We conclude by discussing the future directions and implications of LLMs in transforming cardiovascular care and research.


Subject(s)
Cardiovascular Diseases , Humans , Cardiovascular Diseases/therapy , Natural Language Processing , Artificial Intelligence , Cardiology
8.
Can J Cardiol ; 40(10): 1828-1840, 2024 Oct.
Article in English | MEDLINE | ID: mdl-38885787

ABSTRACT

The potential of artificial intelligence (AI) in medicine lies in its ability to enhance clinicians' capacity to analyse medical images, thereby improving diagnostic precision and accuracy and thus enhancing current tests. However, the integration of AI within health care is fraught with difficulties. Heterogeneity among health care system applications, reliance on proprietary closed-source software, and rising cybersecurity threats pose significant challenges. Moreover, before their deployment in clinical settings, AI models must demonstrate their effectiveness across a wide range of scenarios and must be validated by prospective studies, but doing so requires testing in an environment mirroring the clinical workflow, which is difficult to achieve without dedicated software. Finally, the use of AI techniques in health care raises significant legal and ethical issues, such as the protection of patient privacy, the prevention of bias, and the monitoring of the device's safety and effectiveness for regulatory compliance. This review describes challenges to AI integration in health care and provides guidelines on how to move forward. We describe an open-source solution that we developed that integrates AI models into the Picture Archives Communication System (PACS), called PACS-AI. This approach aims to increase the evaluation of AI models by facilitating their integration and validation with existing medical imaging databases. PACS-AI may overcome many current barriers to AI deployment and offer a pathway toward responsible, fair, and effective deployment of AI models in health care. In addition, we propose a list of criteria and guidelines that AI researchers should adopt when publishing a medical AI model to enhance standardisation and reproducibility.


Subject(s)
Artificial Intelligence , Radiology Information Systems , Humans , Artificial Intelligence/ethics , Point-of-Care Systems
9.
Viruses ; 16(3)2024 02 23.
Article in English | MEDLINE | ID: mdl-38543708

ABSTRACT

Throughout the SARS-CoV-2 pandemic, several variants of concern (VOCs) have been identified, many of which share recurrent mutations in the spike glycoprotein's receptor-binding domain (RBD). This region coincides with known epitopes and can therefore have an impact on immune escape. Protracted infections in immunosuppressed patients have been hypothesized to lead to an enrichment of such mutations and therefore drive evolution towards VOCs. Here, we present the case of an immunosuppressed patient that developed distinct populations with immune escape mutations throughout the course of their infection. Notably, by investigating the co-occurrence of substitutions on individual sequencing reads in the RBD, we found quasispecies harboring mutations that confer resistance to known monoclonal antibodies (mAbs) such as S:E484K and S:E484A. These mutations were acquired without the patient being treated with mAbs nor convalescent sera and without them developing a detectable immune response to the virus. We also provide additional evidence for a viral reservoir based on intra-host phylogenetics, which led to a viral substrain that evolved elsewhere in the patient's body, colonizing their upper respiratory tract (URT). The presence of SARS-CoV-2 viral reservoirs can shed light on protracted infections interspersed with periods where the virus is undetectable, and potential explanations for long-COVID cases.


Subject(s)
COVID-19 , SARS-CoV-2 , Humans , SARS-CoV-2/genetics , Post-Acute COVID-19 Syndrome , COVID-19 Serotherapy , Immunocompromised Host , Antibodies, Monoclonal , Mutation , Spike Glycoprotein, Coronavirus/genetics , Antibodies, Viral , Antibodies, Neutralizing
10.
Bioinform Adv ; 3(1): vbad097, 2023.
Article in English | MEDLINE | ID: mdl-37720006

ABSTRACT

Summary: We describe the problem of computing local feature attributions for dimensionality reduction methods. We use one such method that is well established within the context of supervised classification-using the gradients of target outputs with respect to the inputs-on the popular dimensionality reduction technique t-SNE, widely used in analyses of biological data. We provide an efficient implementation for the gradient computation for this dimensionality reduction technique. We show that our explanations identify significant features using novel validation methodology; using synthetic datasets and the popular MNIST benchmark dataset. We then demonstrate the practical utility of our algorithm by showing that it can produce explanations that agree with domain knowledge on a SARS-CoV-2 sequence dataset. Throughout, we provide a road map so that similar explanation methods could be applied to other dimensionality reduction techniques to rigorously analyze biological datasets. Availability and implementation: We have created a Python package that can be installed using the following command: pip install interpretable_tsne. All code used can be found at github.com/MattScicluna/interpretable_tsne.

11.
iScience ; 26(8): 107394, 2023 Aug 18.
Article in English | MEDLINE | ID: mdl-37599818

ABSTRACT

Here, we exploit a deep serological profiling strategy coupled with an integrated, computational framework for the analysis of SARS-CoV-2 humoral immune responses. Applying a high-density peptide array (HDPA) spanning the entire proteomes of SARS-CoV-2 and endemic human coronaviruses allowed identification of B cell epitopes and relate them to their evolutionary and structural properties. We identify hotspots of pre-existing immunity and identify cross-reactive epitopes that contribute to increasing the overall humoral immune response to SARS-CoV-2. Using a public dataset of over 38,000 viral genomes from the early phase of the pandemic, capturing both inter- and within-host genetic viral diversity, we determined the evolutionary profile of epitopes and the differences across proteins, waves, and SARS-CoV-2 variants. Lastly, we show that mutations in spike and nucleocapsid epitopes are under stronger selection between than within patients, suggesting that most of the selective pressure for immune evasion occurs upon transmission between hosts.

12.
iScience ; 26(12): 108473, 2023 Dec 15.
Article in English | MEDLINE | ID: mdl-38077122

ABSTRACT

Metabolite genome-wide association studies (mGWAS) have advanced our understanding of the genetic control of metabolite levels. However, interpreting these associations remains challenging due to a lack of tools to annotate gene-metabolite pairs beyond the use of conservative statistical significance threshold. Here, we introduce the shortest reactional distance (SRD) metric, drawing from the comprehensive KEGG database, to enhance the biological interpretation of mGWAS results. We applied this approach to three independent mGWAS, including a case study on sickle cell disease patients. Our analysis reveals an enrichment of small SRD values in reported mGWAS pairs, with SRD values significantly correlating with mGWAS p values, even beyond the standard conservative thresholds. We demonstrate the utility of SRD annotation in identifying potential false negatives and inaccuracies within current metabolic pathway databases. Our findings highlight the SRD metric as an objective, quantitative and easy-to-compute annotation for gene-metabolite pairs, suitable to integrate statistical evidence to biological networks.

13.
bioRxiv ; 2023 Mar 24.
Article in English | MEDLINE | ID: mdl-36993181

ABSTRACT

Studies combining metabolomics and genetics, known as metabolite genome-wide association studies (mGWAS), have provided valuable insights into our understanding of the genetic control of metabolite levels. However, the biological interpretation of these associations remains challenging due to a lack of existing tools to annotate mGWAS gene-metabolite pairs beyond the use of conservative statistical significance threshold. Here, we computed the shortest reactional distance (SRD) based on the curated knowledge of the KEGG database to explore its utility in enhancing the biological interpretation of results from three independent mGWAS, including a case study on sickle cell disease patients. Results show that, in reported mGWAS pairs, there is an excess of small SRD values and that SRD values and p-values significantly correlate, even beyond the standard conservative thresholds. The added-value of SRD annotation is shown for identification of potential false negative hits, exemplified by the finding of gene-metabolite associations with SRD ≤1 that did not reach standard genome-wide significance cut-off. The wider use of this statistic as an mGWAS annotation would prevent the exclusion of biologically relevant associations and can also identify errors or gaps in current metabolic pathway databases. Our findings highlight the SRD metric as an objective, quantitative and easy-to-compute annotation for gene-metabolite pairs that can be used to integrate statistical evidence to biological networks.

14.
Genome Biol Evol ; 14(5)2022 05 03.
Article in English | MEDLINE | ID: mdl-35482036

ABSTRACT

The molecular mechanisms of aging and life expectancy have been studied in model organisms with short lifespans. However, long-lived species may provide insights into successful strategies for healthy aging, potentially opening the door for novel therapeutic interventions in age-related diseases. Notably, naked mole-rats, the longest-lived rodent, present attenuated aging phenotypes compared with mice. Their resistance toward oxidative stress has been proposed as one hallmark of their healthy aging, suggesting their ability to maintain cell homeostasis, specifically their protein homeostasis. To identify the general principles behind their protein homeostasis robustness, we compared the aggregation propensity and mutation tolerance of naked mole-rat and mouse orthologous proteins. Our analysis showed no proteome-wide differential effects in aggregation propensity and mutation tolerance between these species, but several subsets of proteins with a significant difference in aggregation propensity. We found an enrichment of proteins with higher aggregation propensity in naked mole-rat, and these are functionally involved in the inflammasome complex and nucleic acid binding. On the other hand, proteins with lower aggregation propensity in naked mole-rat have a significantly higher mutation tolerance compared with the rest of the proteins. Among them, we identified proteins known to be associated with neurodegenerative and age-related diseases. These findings highlight the intriguing hypothesis about the capacity of the naked mole-rat proteome to delay aging through its proteomic intrinsic architecture.


Subject(s)
Protein Aggregates , Proteomics , Animals , Longevity/genetics , Mice , Mole Rats/genetics , Mutation , Proteome/genetics
15.
Front Med (Lausanne) ; 9: 826746, 2022.
Article in English | MEDLINE | ID: mdl-35265640

ABSTRACT

The genome of the Severe Acute Respiratory Syndrome coronavirus 2 (SARS-CoV-2), the pathogen that causes coronavirus disease 2019 (COVID-19), has been sequenced at an unprecedented scale leading to a tremendous amount of viral genome sequencing data. To assist in tracing infection pathways and design preventive strategies, a deep understanding of the viral genetic diversity landscape is needed. We present here a set of genomic surveillance tools from population genetics which can be used to better understand the evolution of this virus in humans. To illustrate the utility of this toolbox, we detail an in depth analysis of the genetic diversity of SARS-CoV-2 in first year of the COVID-19 pandemic. We analyzed 329,854 high-quality consensus sequences published in the GISAID database during the pre-vaccination phase. We demonstrate that, compared to standard phylogenetic approaches, haplotype networks can be computed efficiently on much larger datasets. This approach enables real-time lineage identification, a clear description of the relationship between variants of concern, and efficient detection of recurrent mutations. Furthermore, time series change of Tajima's D by haplotype provides a powerful metric of lineage expansion. Finally, principal component analysis (PCA) highlights key steps in variant emergence and facilitates the visualization of genomic variation in the context of SARS-CoV-2 diversity. The computational framework presented here is simple to implement and insightful for real-time genomic surveillance of SARS-CoV-2 and could be applied to any pathogen that threatens the health of populations of humans and other organisms.

16.
Nat Biotechnol ; 40(5): 681-691, 2022 05.
Article in English | MEDLINE | ID: mdl-35228707

ABSTRACT

As the biomedical community produces datasets that are increasingly complex and high dimensional, there is a need for more sophisticated computational tools to extract biological insights. We present Multiscale PHATE, a method that sweeps through all levels of data granularity to learn abstracted biological features directly predictive of disease outcome. Built on a coarse-graining process called diffusion condensation, Multiscale PHATE learns a data topology that can be analyzed at coarse resolutions for high-level summarizations of data and at fine resolutions for detailed representations of subsets. We apply Multiscale PHATE to a coronavirus disease 2019 (COVID-19) dataset with 54 million cells from 168 hospitalized patients and find that patients who die show CD16hiCD66blo neutrophil and IFN-γ+ granzyme B+ Th17 cell responses. We also show that population groupings from Multiscale PHATE directly fed into a classifier predict disease outcome more accurately than naive featurizations of the data. Multiscale PHATE is broadly generalizable to different data types, including flow cytometry, single-cell RNA sequencing (scRNA-seq), single-cell sequencing assay for transposase-accessible chromatin (scATAC-seq), and clinical variables.


Subject(s)
COVID-19 , Single-Cell Analysis , Chromatin , Humans , Single-Cell Analysis/methods , Transposases , Exome Sequencing
17.
Front Cardiovasc Med ; 8: 711401, 2021.
Article in English | MEDLINE | ID: mdl-34957230

ABSTRACT

Driven by recent innovations and technological progress, the increasing quality and amount of biomedical data coupled with the advances in computing power allowed for much progress in artificial intelligence (AI) approaches for health and biomedical research. In interventional cardiology, the hope is for AI to provide automated analysis and deeper interpretation of data from electrocardiography, computed tomography, magnetic resonance imaging, and electronic health records, among others. Furthermore, high-performance predictive models supporting decision-making hold the potential to improve safety, diagnostic and prognostic prediction in patients undergoing interventional cardiology procedures. These applications include robotic-assisted percutaneous coronary intervention procedures and automatic assessment of coronary stenosis during diagnostic coronary angiograms. Machine learning (ML) has been used in these innovations that have improved the field of interventional cardiology, and more recently, deep Learning (DL) has emerged as one of the most successful branches of ML in many applications. It remains to be seen if DL approaches will have a major impact on current and future practice. DL-based predictive systems also have several limitations, including lack of interpretability and lack of generalizability due to cohort heterogeneity and low sample sizes. There are also challenges for the clinical implementation of these systems, such as ethical limits and data privacy. This review is intended to bring the attention of health practitioners and interventional cardiologists to the broad and helpful applications of ML and DL algorithms to date in the field. Their implementation challenges in daily practice and future applications in the field of interventional cardiology are also discussed.

18.
Cell Rep Med ; 2(6): 100299, 2021 06 15.
Article in English | MEDLINE | ID: mdl-34195679

ABSTRACT

Untargeted metabolomics is used to refine the development of biomarkers for the diagnosis of cardiovascular disease. Myocardial infarction (MI) has major individual and societal consequences for patients, who remain at high risk of secondary events, despite advances in pharmacological therapy. To monitor their differential response to treatment, we performed untargeted plasma metabolomics on 175 patients from the platelet inhibition and patient outcomes (PLATO) trial treated with ticagrelor and clopidogrel, two common P2Y12 inhibitors. We identified a signature that discriminates patients, which involves polyunsaturated fatty acids (PUFAs) and particularly the omega-3 fatty acids docosahexaenoate and eicosapentaenoate. The known cardiovascular benefits of PUFAs could contribute to the efficacy of ticagrelor. Our work, beyond pointing out the high relevance of untargeted metabolomics in evaluating response to treatment, establishes PUFA metabolism as a pathway of clinical interest in the recovery path from MI.


Subject(s)
Acute Coronary Syndrome/drug therapy , Clopidogrel/therapeutic use , Fatty Acids, Unsaturated/metabolism , Myocardial Infarction/drug therapy , Platelet Aggregation Inhibitors/therapeutic use , Purinergic P2Y Receptor Antagonists/therapeutic use , Ticagrelor/therapeutic use , Acute Coronary Syndrome/metabolism , Acute Coronary Syndrome/pathology , Aged , Blood Platelets/drug effects , Blood Platelets/metabolism , Fatty Acids, Unsaturated/agonists , Female , Humans , Lipid Metabolism/drug effects , Male , Metabolomics/methods , Middle Aged , Myocardial Infarction/metabolism , Myocardial Infarction/pathology , Treatment Outcome
19.
PLoS One ; 16(12): e0260714, 2021.
Article in English | MEDLINE | ID: mdl-34855869

ABSTRACT

The first confirmed case of COVID-19 in Quebec, Canada, occurred at Verdun Hospital on February 25, 2020. A month later, a localized outbreak was observed at this hospital. We performed tiled amplicon whole genome nanopore sequencing on nasopharyngeal swabs from all SARS-CoV-2 positive samples from 31 March to 17 April 2020 in 2 local hospitals to assess viral diversity (unknown at the time in Quebec) and potential associations with clinical outcomes. We report 264 viral genomes from 242 individuals-both staff and patients-with associated clinical features and outcomes, as well as longitudinal samples and technical replicates. Viral lineage assessment identified multiple subclades in both hospitals, with a predominant subclade in the Verdun outbreak, indicative of hospital-acquired transmission. Dimensionality reduction identified two subclades with mutations of clinical interest, namely in the Spike protein, that evaded supervised lineage assignment methods-including Pangolin and NextClade supervised lineage assignment tools. We also report that certain symptoms (headache, myalgia and sore throat) are significantly associated with favorable patient outcomes. Our findings demonstrate the strength of unsupervised, data-driven analyses whilst suggesting that caution should be used when employing supervised genomic workflows, particularly during the early stages of a pandemic.


Subject(s)
COVID-19/virology , Cross Infection/virology , Disease Outbreaks , Genome, Viral/genetics , SARS-CoV-2/genetics , Adolescent , Adult , Aged , Aged, 80 and over , COVID-19/epidemiology , COVID-19/mortality , Child , Child, Preschool , Cross Infection/epidemiology , Disease Outbreaks/statistics & numerical data , Female , Haplotypes/genetics , Humans , Male , Middle Aged , Phylogeny , Quebec/epidemiology , SARS-CoV-2/pathogenicity , Sequence Analysis, RNA , Treatment Outcome , Young Adult
20.
Philos Trans R Soc Lond B Biol Sci ; 372(1736)2017 Dec 19.
Article in English | MEDLINE | ID: mdl-29109227

ABSTRACT

Recombination promotes genomic integrity among cells and tissues through double-strand break repair, and is critical for gamete formation and fertility through a strict regulation of the molecular mechanisms associated with proper chromosomal disjunction. In humans, congenital defects and recurrent structural abnormalities can be attributed to aberrant meiotic recombination. Moreover, mutations affecting genes involved in recombination pathways are directly linked to pathologies including infertility and cancer. Recombination is among the most prominent mechanism shaping genome variation, and is associated with not only the structuring of genomic variability, but is also tightly linked with the purging of deleterious mutations from populations. Together, these observations highlight the multiple roles of recombination in human genetics: its ability to act as a major force of evolution, its molecular potential to maintain genome repair and integrity in cell division and its mutagenic cost impacting disease evolution.This article is part of the themed issue 'Evolutionary causes and consequences of recombination rate variation in sexual organisms'.


Subject(s)
Communicable Diseases/genetics , Evolution, Molecular , Histone-Lysine N-Methyltransferase/genetics , Mutation , Recombination, Genetic , Genetic Linkage , Humans
SELECTION OF CITATIONS
SEARCH DETAIL