Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 31
Filter
Add more filters

Country/Region as subject
Publication year range
1.
Cell ; 184(10): 2587-2594.e7, 2021 05 13.
Article in English | MEDLINE | ID: mdl-33861950

ABSTRACT

The highly transmissible B.1.1.7 variant of SARS-CoV-2, first identified in the United Kingdom, has gained a foothold across the world. Using S gene target failure (SGTF) and SARS-CoV-2 genomic sequencing, we investigated the prevalence and dynamics of this variant in the United States (US), tracking it back to its early emergence. We found that, while the fraction of B.1.1.7 varied by state, the variant increased at a logistic rate with a roughly weekly doubling rate and an increased transmission of 40%-50%. We revealed several independent introductions of B.1.1.7 into the US as early as late November 2020, with community transmission spreading it to most states within months. We show that the US is on a similar trajectory as other countries where B.1.1.7 became dominant, requiring immediate and decisive action to minimize COVID-19 morbidity and mortality.


Subject(s)
COVID-19 , Models, Biological , SARS-CoV-2 , COVID-19/genetics , COVID-19/mortality , COVID-19/transmission , Female , Humans , Male , SARS-CoV-2/genetics , SARS-CoV-2/metabolism , SARS-CoV-2/pathogenicity , United States/epidemiology
2.
J Infect Dis ; 2024 Jul 19.
Article in English | MEDLINE | ID: mdl-39028664

ABSTRACT

Within a multi-state viral genomic surveillance program, we evaluated whether proportions of SARS-CoV-2 infections attributed to the JN.1 variant and to XBB-lineage variants (including HV.1 and EG.5) differed between inpatient and outpatient care settings during periods of cocirculation. Both JN.1 and HV.1 were less likely than EG.5 to account for infections among inpatients versus outpatients (aOR=0.60 [95% CI: 0.43-0.84; p=0.003] and aOR=0.35 [95% CI: 0.21-0.58; p<0.001], respectively). JN.1 and HV.1 variants may be associated with a lower risk of severe illness. The severity of COVID-19 may have attenuated as predominant circulating SARS-CoV-2 lineages shifted from EG.5 to HV.1 to JN.1.

3.
Clin Infect Dis ; 78(6): 1531-1535, 2024 Jun 14.
Article in English | MEDLINE | ID: mdl-38170452

ABSTRACT

Within a multistate clinical cohort, SARS-CoV-2 antiviral prescribing patterns were evaluated from April 2022-June 2023 among nonhospitalized patients with SARS-CoV-2 with risk factors for severe COVID-19. Among 3247 adults, only 31.9% were prescribed an antiviral agent (87.6% nirmatrelvir/ritonavir, 11.9% molnupiravir, 0.5% remdesivir), highlighting the need to identify and address treatment barriers.


Subject(s)
Antiviral Agents , COVID-19 Drug Treatment , SARS-CoV-2 , Humans , Antiviral Agents/therapeutic use , Male , Middle Aged , Female , Adult , Aged , Risk Factors , Ritonavir/therapeutic use , COVID-19/epidemiology , Adenosine Monophosphate/analogs & derivatives , Adenosine Monophosphate/therapeutic use , Alanine/therapeutic use , Alanine/analogs & derivatives , Practice Patterns, Physicians'/statistics & numerical data , Cytidine/analogs & derivatives , Hydroxylamines
4.
Genet Med ; 25(4): 100012, 2023 04.
Article in English | MEDLINE | ID: mdl-36637017

ABSTRACT

PURPOSE: TTN truncating variants (TTNtvs) represent the largest known genetic cause of dilated cardiomyopathies (DCMs), however their penetrance for DCM in general populations is low. More broadly, patients with cardiomyopathies (CMs) often exhibit other cardiac conditions, such as atrial fibrillation (Afib), which has also been linked to TTNtvs. This retrospective analysis aims to characterize the relationship between different cardiac conditions in those with TTNtvs and identify individuals with the highest risk of DCM. METHODS: In this work we leverage longitudinal electronic health record and exome sequencing data from approximately 450,000 individuals in 2 health systems to statistically confirm and pinpoint the genetic footprint of TTNtv-related diagnoses aside from CM, such as Afib, and determine whether vetting additional significantly associated phenotypes better stratifies CM risk across those with TTNtvs. We focused on TTNtvs in exons with a percentage spliced in >90% (hiPSI TTNtvs), a representation of constitutive cardiac expression. RESULTS: When controlling for CM and Afib, other cardiac conditions retained only nominal association with TTNtvs. A sliding window analysis of TTNtvs across the locus confirms that the association is specific to hiPSI exons for both CM and Afib, with no meaningful associations in percent spliced in ≤90% exons (loPSI TTNtvs). The combination of hiPSI TTNtv status and early Afib diagnosis (before age 60) found a subset of TTNtv individuals at high risk for CM. The prevalence of CM in this subset was 33%, a rate that was 3.5 fold higher than that in individuals with hiPSI TTNtvs (9% prevalence), 5-fold higher than that in individuals without TTNtvs with early Afib (6% prevalence), and 80-fold higher than that in the general population. CONCLUSION: Our retrospective analyses revealed that those with hiPSI TTNtvs and early Afib (∼1/2900) have a high prevalence of CM (33%), far exceeding that in other individuals with TTNtvs and in those without TTNtvs with an early Afib diagnosis. These results show that combining phenotypic information along with genomic population screening can identify patients at higher risk for progressing to symptomatic heart failure.


Subject(s)
Atrial Fibrillation , Cardiomyopathies , Cardiomyopathy, Dilated , Heart Diseases , Humans , Atrial Fibrillation/epidemiology , Atrial Fibrillation/genetics , Retrospective Studies , Prevalence , Cardiomyopathies/epidemiology , Cardiomyopathies/genetics , Connectin/genetics , Connectin/metabolism , Cardiomyopathy, Dilated/epidemiology , Cardiomyopathy, Dilated/genetics
5.
Genet Med ; 23(12): 2300-2308, 2021 12.
Article in English | MEDLINE | ID: mdl-34385667

ABSTRACT

PURPOSE: To identify conditions that are candidates for population genetic screening based on population prevalence, penetrance of rare variants, and actionability. METHODS: We analyzed exome and medical record data from >220,000 participants across two large population health cohorts with different demographics. We performed a gene-based collapsing analysis of rare variants to identify genes significantly associated with disease status. RESULTS: We identify 74 statistically significant gene-disease associations across 27 genes. Seven of these conditions have a positive predictive value (PPV) of at least 30% in both cohorts. Three are already used in population screening programs (BRCA1, BRCA2, LDLR), and we also identify four new candidates for population screening: GCK with diabetes mellitus, HBB with ß-thalassemia minor and intermedia, PKD1 with cystic kidney disease, and MIP with cataracts. Importantly, the associations are actionable in that early genetic screening of each of these conditions is expected to improve outcomes. CONCLUSION: We identify seven genetic conditions where rare variation appears appropriate to assess in population screening, four of which are not yet used in screening programs. The addition of GCK, HBB, PKD1, and MIP rare variants into genetic screening programs would reach an additional 0.21% of participants with actionable disease risk, depending on the population.


Subject(s)
Genes, BRCA2 , Genetic Testing , Exome , Genetic Predisposition to Disease , Humans , Predictive Value of Tests , Exome Sequencing
6.
Am J Hum Genet ; 99(3): 595-606, 2016 09 01.
Article in English | MEDLINE | ID: mdl-27569544

ABSTRACT

The interpretation of non-coding variants still constitutes a major challenge in the application of whole-genome sequencing in Mendelian disease, especially for single-nucleotide and other small non-coding variants. Here we present Genomiser, an analysis framework that is able not only to score the relevance of variation in the non-coding genome, but also to associate regulatory variants to specific Mendelian diseases. Genomiser scores variants through either existing methods such as CADD or a bespoke machine learning method and combines these with allele frequency, regulatory sequences, chromosomal topological domains, and phenotypic relevance to discover variants associated to specific Mendelian disorders. Overall, Genomiser is able to identify causal regulatory variants as the top candidate in 77% of simulated whole genomes, allowing effective detection and discovery of regulatory variants in Mendelian disease.


Subject(s)
Algorithms , Genetic Diseases, Inborn/genetics , Genome, Human/genetics , Mutation/genetics , Gene Frequency , Genome-Wide Association Study , Humans , Machine Learning , Open Reading Frames/genetics , Phenotype , Point Mutation/genetics
7.
J Genet Couns ; 28(2): 456-465, 2019 04.
Article in English | MEDLINE | ID: mdl-30964579

ABSTRACT

The practice of genetic counseling is going to be impacted by the public's expectation that goods, services, information, and experiences happen on demand, wherever and whenever people want them. Building from trends that are currently taking shape, this article looks just over a decade into the future-to 2030-to provide a description of how the field of genetics and genetic counseling will be changed, as well as advice for genetic counselors for how to prepare. We build from the prediction that a large portion of the general public will have access to their digitized whole genome sequence anytime, any place, on any device. We focus on five topics downstream of this prediction: public health, personal autonomy, polygenic scores (PGS), evolving clinical practices, and genetic privacy.


Subject(s)
Genetic Counseling/trends , Public Health/trends , Female , Genetic Counseling/ethics , Humans , Public Health/ethics
8.
Am J Hum Genet ; 97(1): 111-24, 2015 Jul 02.
Article in English | MEDLINE | ID: mdl-26119816

ABSTRACT

The Human Phenotype Ontology (HPO) is widely used in the rare disease community for differential diagnostics, phenotype-driven analysis of next-generation sequence-variation data, and translational research, but a comparable resource has not been available for common disease. Here, we have developed a concept-recognition procedure that analyzes the frequencies of HPO disease annotations as identified in over five million PubMed abstracts by employing an iterative procedure to optimize precision and recall of the identified terms. We derived disease models for 3,145 common human diseases comprising a total of 132,006 HPO annotations. The HPO now comprises over 250,000 phenotypic annotations for over 10,000 rare and common diseases and can be used for examining the phenotypic overlap among common diseases that share risk alleles, as well as between Mendelian diseases and common diseases linked by genomic location. The annotations, as well as the HPO itself, are freely available.


Subject(s)
Gene Ontology/trends , Genetic Diseases, Inborn/classification , Genetic Diseases, Inborn/genetics , Phenotype , Terminology as Topic , Genetic Diseases, Inborn/pathology , Humans , MEDLINE , Models, Biological
9.
Genet Med ; 18(6): 608-17, 2016 06.
Article in English | MEDLINE | ID: mdl-26562225

ABSTRACT

PURPOSE: Medical diagnosis and molecular or biochemical confirmation typically rely on the knowledge of the clinician. Although this is very difficult in extremely rare diseases, we hypothesized that the recording of patient phenotypes in Human Phenotype Ontology (HPO) terms and computationally ranking putative disease-associated sequence variants improves diagnosis, particularly for patients with atypical clinical profiles. METHODS: Using simulated exomes and the National Institutes of Health Undiagnosed Diseases Program (UDP) patient cohort and associated exome sequence, we tested our hypothesis using Exomiser. Exomiser ranks candidate variants based on patient phenotype similarity to (i) known disease-gene phenotypes, (ii) model organism phenotypes of candidate orthologs, and (iii) phenotypes of protein-protein association neighbors. RESULTS: Benchmarking showed Exomiser ranked the causal variant as the top hit in 97% of known disease-gene associations and ranked the correct seeded variant in up to 87% when detectable disease-gene associations were unavailable. Using UDP data, Exomiser ranked the causative variant(s) within the top 10 variants for 11 previously diagnosed variants and achieved a diagnosis for 4 of 23 cases undiagnosed by clinical evaluation. CONCLUSION: Structured phenotyping of patients and computational analysis are effective adjuncts for diagnosing patients with genetic disorders.Genet Med 18 6, 608-617.


Subject(s)
Exome Sequencing/methods , Exome/genetics , Rare Diseases/genetics , Rare Diseases/physiopathology , Animals , Computational Biology , Databases, Genetic , Disease Models, Animal , Genetic Association Studies , Genetic Variation , Humans , Mice , National Institutes of Health (U.S.) , Patients , Phenotype , Rare Diseases/diagnosis , Rare Diseases/epidemiology , United States , Zebrafish
10.
Hum Mutat ; 36(10): 979-84, 2015 Oct.
Article in English | MEDLINE | ID: mdl-26269093

ABSTRACT

The Matchmaker Exchange application programming interface (API) allows searching a patient's genotypic or phenotypic profiles across clinical sites, for the purposes of cohort discovery and variant disease causal validation. This API can be used not only to search for matching patients, but also to match against public disease and model organism data. This public disease data enable matching known diseases and variant-phenotype associations using phenotype semantic similarity algorithms developed by the Monarch Initiative. The model data can provide additional evidence to aid diagnosis, suggest relevant models for disease mechanism and treatment exploration, and identify collaborators across the translational divide. The Monarch Initiative provides an implementation of this API for searching multiple integrated sources of data that contextualize the knowledge about any given patient or patient family into the greater biomedical knowledge landscape. While this corpus of data can aid diagnosis, it is also the beginning of research to improve understanding of rare human diseases.


Subject(s)
Databases, Genetic , Disease/genetics , Genetic Predisposition to Disease/genetics , Animals , Disease Models, Animal , Genetic Variation , Humans , Information Dissemination , Phenotype , User-Computer Interface
11.
Hum Mutat ; 36(10): 931-40, 2015 Oct.
Article in English | MEDLINE | ID: mdl-26251998

ABSTRACT

The discovery of disease-causing mutations typically requires confirmation of the variant or gene in multiple unrelated individuals, and a large number of rare genetic diseases remain unsolved due to difficulty identifying second families. To enable the secure sharing of case records by clinicians and rare disease scientists, we have developed the PhenomeCentral portal (https://phenomecentral.org). Each record includes a phenotypic description and relevant genetic information (exome or candidate genes). PhenomeCentral identifies similar patients in the database based on semantic similarity between clinical features, automatically prioritized genes from whole-exome data, and candidate genes entered by the users, enabling both hypothesis-free and hypothesis-driven matchmaking. Users can then contact other submitters to follow up on promising matches. PhenomeCentral incorporates data for over 1,000 patients with rare genetic diseases, contributed by the FORGE and Care4Rare Canada projects, the US NIH Undiagnosed Diseases Program, the EU Neuromics and ANDDIrare projects, as well as numerous independent clinicians and scientists. Though the majority of these records have associated exome data, most lack a molecular diagnosis. PhenomeCentral has already been used to identify causative mutations for several patients, and its ability to find matching patients and diagnose these diseases will grow with each additional patient that is entered.


Subject(s)
Genetic Predisposition to Disease/genetics , Information Dissemination/methods , Rare Diseases/genetics , Databases, Genetic , Genetic Variation , Genotype , Humans , Phenotype , Software , User-Computer Interface , Web Browser
12.
Hum Mutat ; 36(10): 915-21, 2015 Oct.
Article in English | MEDLINE | ID: mdl-26295439

ABSTRACT

There are few better examples of the need for data sharing than in the rare disease community, where patients, physicians, and researchers must search for "the needle in a haystack" to uncover rare, novel causes of disease within the genome. Impeding the pace of discovery has been the existence of many small siloed datasets within individual research or clinical laboratory databases and/or disease-specific organizations, hoping for serendipitous occasions when two distant investigators happen to learn they have a rare phenotype in common and can "match" these cases to build evidence for causality. However, serendipity has never proven to be a reliable or scalable approach in science. As such, the Matchmaker Exchange (MME) was launched to provide a robust and systematic approach to rare disease gene discovery through the creation of a federated network connecting databases of genotypes and rare phenotypes using a common application programming interface (API). The core building blocks of the MME have been defined and assembled. Three MME services have now been connected through the API and are available for community use. Additional databases that support internal matching are anticipated to join the MME network as it continues to grow.


Subject(s)
Genetic Predisposition to Disease/genetics , Information Dissemination/methods , Rare Diseases/genetics , Database Management Systems , Databases, Genetic , Genetic Association Studies , Humans , Software
13.
Nucleic Acids Res ; 40(Database issue): D1082-8, 2012 Jan.
Article in English | MEDLINE | ID: mdl-22080565

ABSTRACT

In an effort to comprehensively characterize the functional elements within the genomes of the important model organisms Drosophila melanogaster and Caenorhabditis elegans, the NHGRI model organism Encyclopaedia of DNA Elements (modENCODE) consortium has generated an enormous library of genomic data along with detailed, structured information on all aspects of the experiments. The modMine database (http://intermine.modencode.org) described here has been built by the modENCODE Data Coordination Center to allow the broader research community to (i) search for and download data sets of interest among the thousands generated by modENCODE; (ii) access the data in an integrated form together with non-modENCODE data sets; and (iii) facilitate fine-grained analysis of the above data. The sophisticated search features are possible because of the collection of extensive experimental metadata by the consortium. Interfaces are provided to allow both biologists and bioinformaticians to exploit these rich modENCODE data sets now available via modMine.


Subject(s)
Caenorhabditis elegans/genetics , Databases, Genetic , Drosophila melanogaster/genetics , Animals , Gene Expression , Genome, Helminth , Genome, Insect , Genomics , Internet , User-Computer Interface
14.
HGG Adv ; 5(3): 100284, 2024 Jul 18.
Article in English | MEDLINE | ID: mdl-38509709

ABSTRACT

Systematic determination of novel variant pathogenicity remains a major challenge, even when there is an established association between a gene and phenotype. Here we present Power Window (PW), a sliding window technique that identifies the impactful regions of a gene using population-scale clinico-genomic datasets. By sizing analysis windows on the number of variant carriers, rather than the number of variants or nucleotides, statistical power is held constant, enabling the localization of clinical phenotypes and removal of unassociated gene regions. The windows can be built by sliding across either the nucleotide sequence of the gene (through 1D space) or the positions of the amino acids in the folded protein (through 3D space). Using a training set of 350k exomes from the UK Biobank (UKB), we developed PW models for well-established gene-disease associations and tested their accuracy in two independent cohorts (117k UKB exomes and 65k exomes sequenced at Helix in the Healthy Nevada Project, myGenetics, or In Our DNA SC studies). The significant models retained a median of 49% of the qualifying variant carriers in each gene (range 2%-98%), with quantitative traits showing a median effect size improvement of 66% compared with aggregating variants across the entire gene, and binary traits' odds ratios improving by a median of 2.2-fold. PW showcases that electronic health record-based statistical analyses can accurately distinguish between novel coding variants in established genes that will have high phenotypic penetrance and those that will not, unlocking new potential for human genomics research, drug development, variant interpretation, and precision medicine.


Subject(s)
Genetic Variation , Humans , Genetic Variation/genetics , Protein Folding , Phenotype , Base Sequence/genetics , Genetic Predisposition to Disease/genetics , Exome/genetics
15.
PLoS Biol ; 7(11): e1000247, 2009 Nov.
Article in English | MEDLINE | ID: mdl-19956802

ABSTRACT

Scientists and clinicians who study genetic alterations and disease have traditionally described phenotypes in natural language. The considerable variation in these free-text descriptions has posed a hindrance to the important task of identifying candidate genes and models for human diseases and indicates the need for a computationally tractable method to mine data resources for mutant phenotypes. In this study, we tested the hypothesis that ontological annotation of disease phenotypes will facilitate the discovery of new genotype-phenotype relationships within and across species. To describe phenotypes using ontologies, we used an Entity-Quality (EQ) methodology, wherein the affected entity (E) and how it is affected (Q) are recorded using terms from a variety of ontologies. Using this EQ method, we annotated the phenotypes of 11 gene-linked human diseases described in Online Mendelian Inheritance in Man (OMIM). These human annotations were loaded into our Ontology-Based Database (OBD) along with other ontology-based phenotype descriptions of mutants from various model organism databases. Phenotypes recorded with this EQ method can be computationally compared based on the hierarchy of terms in the ontologies and the frequency of annotation. We utilized four similarity metrics to compare phenotypes and developed an ontology of homologous and analogous anatomical structures to compare phenotypes between species. Using these tools, we demonstrate that we can identify, through the similarity of the recorded phenotypes, other alleles of the same gene, other members of a signaling pathway, and orthologous genes and pathway members across species. We conclude that EQ-based annotation of phenotypes, in conjunction with a cross-species ontology, and a variety of similarity metrics can identify biologically meaningful similarities between genes by comparing phenotypes alone. This annotation and search method provides a novel and efficient means to identify gene candidates and animal models of human disease, which may shorten the lengthy path to identification and understanding of the genetic basis of human disease.


Subject(s)
Disease Models, Animal , Genetic Association Studies , Phenotype , Alleles , Animals , Hedgehog Proteins/genetics , Humans , Signal Transduction/genetics , Zebrafish , Zebrafish Proteins/genetics
16.
HGG Adv ; 3(2): 100084, 2022 Apr 14.
Article in English | MEDLINE | ID: mdl-35005651

ABSTRACT

COVID-19 vaccines are safe and highly effective, but some individuals experience unpleasant reactions to vaccination. As the majority of adults in the United States have received a COVID-19 vaccine this year, there is an unprecedented opportunity to study the genetics of reactions to vaccination via surveys of individuals who are already part of genetic research studies. Here, we have queried 17,440 participants in the Helix DNA Discovery Project and Healthy Nevada Project about their reactions to COVID-19 vaccination. Our genome-wide association study identifies an association between severe difficulties with daily routine after vaccination and HLA-A∗03:01. This association was statistically significant only for those who received the Pfizer-BioNTech vaccine (BNT162b2; n = 3,694; p = 4.70E-11; OR = 2.07 [95% CI 1.67-2.56]), and showed a smaller effect size in those who received the Moderna vaccine (mRNA-1273; n = 3,610; p = 0.005; OR = 1.32 [95% CI 1.09-1.59]). In Pfizer-BioNTech recipients, HLA-A∗03:01 was associated with a 2-fold increase in risk of self-reported severe difficulties with daily routine following vaccination. The effect was consistent across ages, sexes, and whether the person had previously had a COVID-19 infection. The reactions experienced by HLA-A∗03:01 carriers were driven by associations with chills, fever, fatigue, and generally feeling unwell.

17.
Cell Rep Med ; 3(3): 100564, 2022 03 15.
Article in English | MEDLINE | ID: mdl-35474739

ABSTRACT

We report on the sequencing of 74,348 SARS-CoV-2 positive samples collected across the United States and show that the Delta variant, first detected in the United States in March 2021, made up the majority of SARS-CoV-2 infections by July 1, 2021 and accounted for >99.9% of the infections by September 2021. Not only did Delta displace variant Alpha, which was the dominant variant at the time, it also displaced the Gamma, Iota, and Mu variants. Through an analysis of quantification cycle (Cq) values, we demonstrate that Delta infections tend to have a 1.7× higher viral load compared to Alpha infections (a decrease of 0.8 Cq) on average. Our results are consistent with the hypothesis that the increased transmissibility of the Delta variant could be due to the ability of the Delta variant to establish a higher viral load earlier in the infection as compared to the Alpha variant.


Subject(s)
COVID-19 , SARS-CoV-2 , COVID-19/epidemiology , Humans , SARS-CoV-2/genetics , United States/epidemiology , Viral Load/genetics
18.
Med ; 3(12): 848-859.e4, 2022 Dec 09.
Article in English | MEDLINE | ID: mdl-36332633

ABSTRACT

BACKGROUND: Between November 2021 and February 2022, severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) Delta and Omicron variants co-circulated in the United States, allowing for co-infections and possible recombination events. METHODS: We sequenced 29,719 positive samples during this period and analyzed the presence and fraction of reads supporting mutations specific to either the Delta or Omicron variant. FINDINGS: We identified 18 co-infections, one of which displayed evidence of a low Delta-Omicron recombinant viral population. We also identified two independent cases of infection by a Delta-Omicron recombinant virus, where 100% of the viral RNA came from one clonal recombinant. In the three cases, the 5' end of the viral genome was from the Delta genome and the 3' end from Omicron, including the majority of the spike protein gene, though the breakpoints were different. CONCLUSIONS: Delta-Omicron recombinant viruses were rare, and there is currently no evidence that Delta-Omicron recombinant viruses are more transmissible between hosts compared with the circulating Omicron lineages. FUNDING: This research was supported by the NIH RADx initiative and by the Centers for Disease Control Contract 75D30121C12730 (Helix).


Subject(s)
COVID-19 , Coinfection , Orthopoxvirus , Humans , SARS-CoV-2/genetics , Genome, Viral/genetics
19.
Clin Pharmacol Ther ; 110(3): 759-767, 2021 09.
Article in English | MEDLINE | ID: mdl-33930192

ABSTRACT

Genomic-guided pharmaceutical prescribing is increasingly recognized as an important clinical application of genetics. Accurate genotyping of pharmacogenomic (PGx) genes can be difficult, owing to their complex genetic architecture involving combinations of single-nucleotide polymorphisms and structural variation. Here, we introduce the Helix PGx database, an open-source star allele, genotype, and resulting metabolic phenotype frequency database for CYP2C9, CYP2C19, CYP2D6, and CYP4F2, based on short-read sequencing of >86,000 unrelated individuals enrolled in the Helix DNA Discovery Project. The database is annotated using a pipeline that is clinically validated against a broad range of alleles and designed to call CYP2D6 structural variants with high (98%) accuracy. We find that CYP2D6 has greater allelic diversity than the other genes, manifest in both a long tail of low-frequency star alleles, as well as a disproportionate fraction (36%) of all novel predicted loss-of-function variants identified. Across genes, we observe that many rare alleles (<0.1% frequency) in the overall cohort have 10 times higher frequency in one or more subgroups with non-European genetic ancestry. Extending these PGx genotypes to predicted metabolic phenotypes, we demonstrate that >90% of the cohort harbors a high-risk variant in one of the four pharmacogenes. Based on the recorded prescriptions for >30,000 individuals in the Healthy Nevada Project, combined with predicted PGx metabolic phenotypes, we anticipate that standard-of-care screening of these 4 pharmacogenes could impact nearly half of the general population.


Subject(s)
Cytochrome P-450 Enzyme System/genetics , DNA/genetics , Gene Frequency/genetics , Alleles , Databases, Nucleic Acid , Genomics/methods , Genotype , Humans , Pharmacogenetics/methods , Phenotype , Polymorphism, Single Nucleotide/genetics
20.
Front Genet ; 12: 639418, 2021.
Article in English | MEDLINE | ID: mdl-33763119

ABSTRACT

Clinical conditions correlated with elevated triglyceride levels are well-known: coronary heart disease, hypertension, and diabetes. Underlying genetic and phenotypic mechanisms are not fully understood, partially due to lack of coordinated genotypic-phenotypic data. Here we use a subset of the Healthy Nevada Project, a population of 9,183 sequenced participants with longitudinal electronic health records to examine consequences of altered triglyceride levels. Specifically, Healthy Nevada Project participants sequenced by the Helix Exome+ platform were cross-referenced to their electronic medical records to identify: (1) rare and common single-variant genome-wide associations; (2) gene-based associations using a Sequence Kernel Association Test; (3) phenome-wide associations with triglyceride levels; and (4) pleiotropic variants linked to triglyceride levels. The study identified 549 significant single-variant associations (p < 8.75 × 10-9), many in chromosome 11's triglyceride hotspot: ZPR1, BUD13, APOC3, APOA5. A well-known protective loss-of-function variant in APOC3 (R19X) was associated with a 51% decrease in triglyceride levels in the cohort. Sixteen gene-based triglyceride associations were identified; six of these genes surprisingly did not include a single variant with significant associations. Results at the variant and gene level were validated with the UK Biobank. The combination of a single-variant genome-wide association, a gene-based association method, and phenome wide-association studies identified rare and common variants, genes, and phenotypes associated with elevated triglyceride levels, some of which may have been overlooked with standard approaches.

SELECTION OF CITATIONS
SEARCH DETAIL