Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 20
Filter
1.
Hum Mol Genet ; 31(18): 3120-3132, 2022 09 10.
Article in English | MEDLINE | ID: mdl-35552711

ABSTRACT

Plasma levels of fibrinogen, coagulation factors VII and VIII and von Willebrand factor (vWF) are four intermediate phenotypes that are heritable and have been associated with the risk of clinical thrombotic events. To identify rare and low-frequency variants associated with these hemostatic factors, we conducted whole-exome sequencing in 10 860 individuals of European ancestry (EA) and 3529 African Americans (AAs) from the Cohorts for Heart and Aging Research in Genomic Epidemiology Consortium and the National Heart, Lung and Blood Institute's Exome Sequencing Project. Gene-based tests demonstrated significant associations with rare variation (minor allele frequency < 5%) in fibrinogen gamma chain (FGG) (with fibrinogen, P = 9.1 × 10-13), coagulation factor VII (F7) (with factor VII, P = 1.3 × 10-72; seven novel variants) and VWF (with factor VIII and vWF; P = 3.2 × 10-14; one novel variant). These eight novel rare variant associations were independent of the known common variants at these loci and tended to have much larger effect sizes. In addition, one of the rare novel variants in F7 was significantly associated with an increased risk of venous thromboembolism in AAs (Ile200Ser; rs141219108; P = 4.2 × 10-5). After restricting gene-based analyses to only loss-of-function variants, a novel significant association was detected and replicated between factor VIII levels and a stop-gain mutation exclusive to AAs (rs3211938) in CD36 molecule (CD36). This variant has previously been linked to dyslipidemia but not with the levels of a hemostatic factor. These efforts represent the largest integration of whole-exome sequence data from two national projects to identify genetic variation associated with plasma hemostatic factors.


Subject(s)
Factor VIII , Hemostatics , Factor VII/genetics , Factor VIII/genetics , Fibrinogen/genetics , Humans , Polymorphism, Single Nucleotide/genetics , Exome Sequencing , von Willebrand Factor/analysis , von Willebrand Factor/genetics
3.
Genet Med ; 24(5): 1062-1072, 2022 05.
Article in English | MEDLINE | ID: mdl-35331649

ABSTRACT

PURPOSE: The Mayo-Baylor RIGHT 10K Study enabled preemptive, sequence-based pharmacogenomics (PGx)-driven drug prescribing practices in routine clinical care within a large cohort. We also generated the tools and resources necessary for clinical PGx implementation and identified challenges that need to be overcome. Furthermore, we measured the frequency of both common genetic variation for which clinical guidelines already exist and rare variation that could be detected by DNA sequencing, rather than genotyping. METHODS: Targeted oligonucleotide-capture sequencing of 77 pharmacogenes was performed using DNA from 10,077 consented Mayo Clinic Biobank volunteers. The resulting predicted drug response-related phenotypes for 13 genes, including CYP2D6 and HLA, affecting 21 drug-gene pairs, were deposited preemptively in the Mayo electronic health record. RESULTS: For the 13 pharmacogenes of interest, the genomes of 79% of participants carried clinically actionable variants in 3 or more genes, and DNA sequencing identified an average of 3.3 additional conservatively predicted deleterious variants that would not have been evident using genotyping. CONCLUSION: Implementation of preemptive rather than reactive and sequence-based rather than genotype-based PGx prescribing revealed nearly universal patient applicability and required integrated institution-wide resources to fully realize individualized drug therapy and to show more efficient use of health care resources.


Subject(s)
Cytochrome P-450 CYP2D6 , Pharmacogenetics , Academic Medical Centers , Base Sequence , Cytochrome P-450 CYP2D6/genetics , Genotype , Humans , Pharmacogenetics/methods
4.
Genet Med ; 23(12): 2404-2414, 2021 12.
Article in English | MEDLINE | ID: mdl-34363016

ABSTRACT

PURPOSE: Cardiovascular disease (CVD) is the leading cause of death in adults in the United States, yet the benefits of genetic testing are not universally accepted. METHODS: We developed the "HeartCare" panel of genes associated with CVD, evaluating high-penetrance Mendelian conditions, coronary artery disease (CAD) polygenic risk, LPA gene polymorphisms, and specific pharmacogenetic (PGx) variants. We enrolled 709 individuals from cardiology clinics at Baylor College of Medicine, and samples were analyzed in a CAP/CLIA-certified laboratory. Results were returned to the ordering physician and uploaded to the electronic medical record. RESULTS: Notably, 32% of patients had a genetic finding with clinical management implications, even after excluding PGx results, including 9% who were molecularly diagnosed with a Mendelian condition. Among surveyed physicians, 84% reported medical management changes based on these results, including specialist referrals, cardiac tests, and medication changes. LPA polymorphisms and high polygenic risk of CAD were found in 20% and 9% of patients, respectively, leading to diet, lifestyle, and other changes. Warfarin and simvastatin pharmacogenetic variants were present in roughly half of the cohort. CONCLUSION: Our results support the use of genetic information in routine cardiovascular health management and provide a roadmap for accompanying research.


Subject(s)
Cardiology , Cardiovascular Diseases , Adult , Cardiovascular Diseases/diagnosis , Cardiovascular Diseases/genetics , Cardiovascular Diseases/therapy , Genetic Testing , Humans , Pharmacogenetics/methods , Pharmacogenomic Testing , United States
5.
Nature ; 527(7579): 459-65, 2015 Nov 26.
Article in English | MEDLINE | ID: mdl-26580012

ABSTRACT

Acorn worms, also known as enteropneust (literally, 'gut-breathing') hemichordates, are marine invertebrates that share features with echinoderms and chordates. Together, these three phyla comprise the deuterostomes. Here we report the draft genome sequences of two acorn worms, Saccoglossus kowalevskii and Ptychodera flava. By comparing them with diverse bilaterian genomes, we identify shared traits that were probably inherited from the last common deuterostome ancestor, and then explore evolutionary trajectories leading from this ancestor to hemichordates, echinoderms and chordates. The hemichordate genomes exhibit extensive conserved synteny with amphioxus and other bilaterians, and deeply conserved non-coding sequences that are candidates for conserved gene-regulatory elements. Notably, hemichordates possess a deuterostome-specific genomic cluster of four ordered transcription factor genes, the expression of which is associated with the development of pharyngeal 'gill' slits, the foremost morphological innovation of early deuterostomes, and is probably central to their filter-feeding lifestyle. Comparative analysis reveals numerous deuterostome-specific gene novelties, including genes found in deuterostomes and marine microbes, but not other animals. The putative functions of these genes can be linked to physiological, metabolic and developmental specializations of the filter-feeding ancestor.


Subject(s)
Chordata, Nonvertebrate/genetics , Evolution, Molecular , Genome/genetics , Animals , Chordata, Nonvertebrate/classification , Conserved Sequence/genetics , Echinodermata/classification , Echinodermata/genetics , Multigene Family/genetics , Phylogeny , Signal Transduction , Synteny/genetics , Transforming Growth Factor beta
6.
Genome Res ; 24(7): 1209-23, 2014 Jul.
Article in English | MEDLINE | ID: mdl-24985915

ABSTRACT

Accurate gene model annotation of reference genomes is critical for making them useful. The modENCODE project has improved the D. melanogaster genome annotation by using deep and diverse high-throughput data. Since transcriptional activity that has been evolutionarily conserved is likely to have an advantageous function, we have performed large-scale interspecific comparisons to increase confidence in predicted annotations. To support comparative genomics, we filled in divergence gaps in the Drosophila phylogeny by generating draft genomes for eight new species. For comparative transcriptome analysis, we generated mRNA expression profiles on 81 samples from multiple tissues and developmental stages of 15 Drosophila species, and we performed cap analysis of gene expression in D. melanogaster and D. pseudoobscura. We also describe conservation of four distinct core promoter structures composed of combinations of elements at three positions. Overall, each type of genomic feature shows a characteristic divergence rate relative to neutral models, highlighting the value of multispecies alignment in annotating a target genome that should prove useful in the annotation of other high priority genomes, especially human and other mammalian genomes that are rich in noncoding sequences. We report that the vast majority of elements in the annotation are evolutionarily conserved, indicating that the annotation will be an important springboard for functional genetic testing by the Drosophila community.


Subject(s)
Computational Biology/methods , Drosophila melanogaster/genetics , Gene Expression Profiling , Molecular Sequence Annotation , Transcriptome , Animals , Cluster Analysis , Drosophila melanogaster/classification , Evolution, Molecular , Exons , Female , Genome, Insect , Humans , Male , Nucleotide Motifs , Phylogeny , Position-Specific Scoring Matrices , Promoter Regions, Genetic , RNA Editing , RNA Splice Sites , RNA Splicing , Reproducibility of Results , Transcription Initiation Site
7.
Nature ; 469(7331): 529-33, 2011 Jan 27.
Article in English | MEDLINE | ID: mdl-21270892

ABSTRACT

'Orang-utan' is derived from a Malay term meaning 'man of the forest' and aptly describes the southeast Asian great apes native to Sumatra and Borneo. The orang-utan species, Pongo abelii (Sumatran) and Pongo pygmaeus (Bornean), are the most phylogenetically distant great apes from humans, thereby providing an informative perspective on hominid evolution. Here we present a Sumatran orang-utan draft genome assembly and short read sequence data from five Sumatran and five Bornean orang-utan genomes. Our analyses reveal that, compared to other primates, the orang-utan genome has many unique features. Structural evolution of the orang-utan genome has proceeded much more slowly than other great apes, evidenced by fewer rearrangements, less segmental duplication, a lower rate of gene family turnover and surprisingly quiescent Alu repeats, which have played a major role in restructuring other primate genomes. We also describe a primate polymorphic neocentromere, found in both Pongo species, emphasizing the gradual evolution of orang-utan genome structure. Orang-utans have extremely low energy usage for a eutherian mammal, far lower than their hominid relatives. Adding their genome to the repertoire of sequenced primates illuminates new signals of positive selection in several pathways including glycolipid metabolism. From the population perspective, both Pongo species are deeply diverse; however, Sumatran individuals possess greater diversity than their Bornean counterparts, and more species-specific variation. Our estimate of Bornean/Sumatran speciation time, 400,000 years ago, is more recent than most previous studies and underscores the complexity of the orang-utan speciation process. Despite a smaller modern census population size, the Sumatran effective population size (N(e)) expanded exponentially relative to the ancestral N(e) after the split, while Bornean N(e) declined over the same period. Overall, the resources and analyses presented here offer new opportunities in evolutionary genomics, insights into hominid biology, and an extensive database of variation for conservation efforts.


Subject(s)
Genetic Variation , Genome/genetics , Pongo abelii/genetics , Pongo pygmaeus/genetics , Animals , Centromere/genetics , Cerebrosides/metabolism , Chromosomes , Evolution, Molecular , Female , Gene Rearrangement/genetics , Genetic Speciation , Genetics, Population , Humans , Male , Phylogeny , Population Density , Population Dynamics , Species Specificity
8.
BMC Genomics ; 15: 86, 2014 Jan 30.
Article in English | MEDLINE | ID: mdl-24479613

ABSTRACT

BACKGROUND: The first generation of genome sequence assemblies and annotations have had a significant impact upon our understanding of the biology of the sequenced species, the phylogenetic relationships among species, the study of populations within and across species, and have informed the biology of humans. As only a few Metazoan genomes are approaching finished quality (human, mouse, fly and worm), there is room for improvement of most genome assemblies. The honey bee (Apis mellifera) genome, published in 2006, was noted for its bimodal GC content distribution that affected the quality of the assembly in some regions and for fewer genes in the initial gene set (OGSv1.0) compared to what would be expected based on other sequenced insect genomes. RESULTS: Here, we report an improved honey bee genome assembly (Amel_4.5) with a new gene annotation set (OGSv3.2), and show that the honey bee genome contains a number of genes similar to that of other insect genomes, contrary to what was suggested in OGSv1.0. The new genome assembly is more contiguous and complete and the new gene set includes ~5000 more protein-coding genes, 50% more than previously reported. About 1/6 of the additional genes were due to improvements to the assembly, and the remaining were inferred based on new RNAseq and protein data. CONCLUSIONS: Lessons learned from this genome upgrade have important implications for future genome sequencing projects. Furthermore, the improvements significantly enhance genomic resources for the honey bee, a key model for social behavior and essential to global ecology through pollination.


Subject(s)
Bees/genetics , Genes, Insect , Animals , Base Composition , Databases, Genetic , Interspersed Repetitive Sequences/genetics , Molecular Sequence Annotation , Open Reading Frames/genetics , Peptides/analysis , Sequence Analysis, RNA , Sequence Homology, Amino Acid
9.
J Bacteriol ; 191(21): 6643-53, 2009 Nov.
Article in English | MEDLINE | ID: mdl-19717590

ABSTRACT

Members of the Streptococcus bovis group are important causes of endocarditis. However, factors associated with their pathogenicity, such as adhesins, remain uncharacterized. We recently demonstrated that endocarditis-derived Streptococcus gallolyticus subsp. gallolyticus isolates frequently adhere to extracellular matrix (ECM) proteins. Here, we generated a draft genome sequence of an ECM protein-adherent S. gallolyticus subsp. gallolyticus strain and found, by genome-wide analyses, 11 predicted LPXTG-type cell wall-anchored proteins with characteristics of MSCRAMMs, including a modular architecture of domains predicted to adopt immunoglobulin (Ig)-like folding. A recombinant segment of one of these, Acb, showed high-affinity binding to immobilized collagen, and cell surface expression of Acb correlated with the presence of acb and collagen adherence of isolates. Three of the 11 proteins have similarities to major pilus subunits and are organized in separate clusters, each including a second Ig-fold-containing MSCRAMM and a class C sortase, suggesting that the sequenced strain encodes three distinct types of pili. Reverse transcription-PCR demonstrated that all three genes of one cluster, acb-sbs7-srtC1, are cotranscribed, consistent with pilus operons of other gram-positive bacteria. Further analysis detected expression of all 11 genes in cells grown to mid to late exponential growth phases. Wide distribution of 9 of the 11 genes was observed among S. gallolyticus subsp. gallolyticus isolates with fewer genes present in other S. bovis group species/subspecies. The high prevalence of genes encoding putative MSCRAMMs and pili, including a collagen-binding MSCRAMM, among S. gallolyticus subsp. gallolyticus isolates may play an important role in the predominance of this subspecies in S. bovis endocarditis.


Subject(s)
Adhesins, Bacterial/metabolism , Fimbriae, Bacterial/metabolism , Gene Expression Regulation, Bacterial/physiology , Streptococcus/metabolism , Gene Expression Profiling , Genome, Bacterial , Multigene Family , Streptococcus/classification , Streptococcus/genetics
10.
BMC Microbiol ; 7: 99, 2007 Nov 06.
Article in English | MEDLINE | ID: mdl-17986343

ABSTRACT

BACKGROUND: Community acquired (CA) methicillin-resistant Staphylococcus aureus (MRSA) increasingly causes disease worldwide. USA300 has emerged as the predominant clone causing superficial and invasive infections in children and adults in the USA. Epidemiological studies suggest that USA300 is more virulent than other CA-MRSA. The genetic determinants that render virulence and dominance to USA300 remain unclear. RESULTS: We sequenced the genomes of two pediatric USA300 isolates: one CA-MRSA and one CA-methicillin susceptible (MSSA), isolated at Texas Children's Hospital in Houston. DNA sequencing was performed by Sanger dideoxy whole genome shotgun (WGS) and 454 Life Sciences pyrosequencing strategies. The sequence of the USA300 MRSA strain was rigorously annotated. In USA300-MRSA 2658 chromosomal open reading frames were predicted and 3.1 and 27 kilobase (kb) plasmids were identified. USA300-MSSA contained a 20 kb plasmid with some homology to the 27 kb plasmid found in USA300-MRSA. Two regions found in US300-MRSA were absent in USA300-MSSA. One of these carried the arginine deiminase operon that appears to have been acquired from S. epidermidis. The USA300 sequence was aligned with other sequenced S. aureus genomes and regions unique to USA300 MRSA were identified. CONCLUSION: USA300-MRSA is highly similar to other MRSA strains based on whole genome alignments and gene content, indicating that the differences in pathogenesis are due to subtle changes rather than to large-scale acquisition of virulence factor genes. The USA300 Houston isolate differs from another sequenced USA300 strain isolate, derived from a patient in San Francisco, in plasmid content and a number of sequence polymorphisms. Such differences will provide new insights into the evolution of pathogens.


Subject(s)
Staphylococcal Infections/epidemiology , Staphylococcus aureus/genetics , Adolescent , Anti-Bacterial Agents/pharmacology , Base Sequence , Genomic Islands/genetics , Humans , Hydrolases/genetics , Methicillin Resistance , Molecular Epidemiology , Molecular Sequence Data , Open Reading Frames/genetics , Plasmids/genetics , Polymorphism, Genetic , Staphylococcus aureus/drug effects , United States/epidemiology
11.
Sci Data ; 3: 160010, 2016 Feb 16.
Article in English | MEDLINE | ID: mdl-26882539

ABSTRACT

Genomic data sharing in cancer has been restricted to aggregate or controlled-access initiatives to protect the privacy of research participants. By limiting access to these data, it has been argued that the autonomy of individuals who decide to participate in data sharing efforts has been superseded and the utility of the data as research and educational tools reduced. In a pilot Open Access (OA) project from the CPRIT-funded Texas Cancer Research Biobank, many Texas cancer patients were willing to openly share genomic data from tumor and normal matched pair specimens. For the first time, genetic data from 7 human cancer cases with matched normal are freely available without requirement for data use agreements nor any major restriction except that end users cannot attempt to re-identify the participants (http://txcrb.org/open.html).


Subject(s)
DNA, Neoplasm , Databases, Genetic , Genome, Human , Pancreatic Neoplasms/genetics , Access to Information , Biological Specimen Banks , Humans , Information Dissemination , Texas
12.
Curr Biol ; 25(12): 1661-5, 2015 Jun 15.
Article in English | MEDLINE | ID: mdl-26051890

ABSTRACT

Cooperative systems are susceptible to invasion by selfish individuals that profit from receiving the social benefits but fail to contribute. These so-called "cheaters" can have a fitness advantage in the laboratory, but it is unclear whether cheating provides an important selective advantage in nature. We used a population genomic approach to examine the history of genes involved in cheating behaviors in the social amoeba Dictyostelium discoideum, testing whether these genes experience rapid evolutionary change as a result of conflict over spore-stalk fate. Candidate genes and surrounding regions showed elevated polymorphism, unusual patterns of linkage disequilibrium, and lower levels of population differentiation, but they did not show greater between-species divergence. The signatures were most consistent with frequency-dependent selection acting to maintain multiple alleles, suggesting that conflict may lead to stalemate rather than an escalating arms race. Our results reveal the evolutionary dynamics of cooperation and cheating and underscore how sequence-based approaches can be used to elucidate the history of conflicts that are difficult to observe directly.


Subject(s)
Dictyostelium/genetics , Genome, Protozoan , Evolution, Molecular , Genomics , Polymorphism, Genetic , Selection, Genetic
14.
Curr Biol ; 25(5): 613-20, 2015 Mar 02.
Article in English | MEDLINE | ID: mdl-25660540

ABSTRACT

Gall-forming arthropods are highly specialized herbivores that, in combination with their hosts, produce extended phenotypes with unique morphologies [1]. Many are economically important, and others have improved our understanding of ecology and adaptive radiation [2]. However, the mechanisms that these arthropods use to induce plant galls are poorly understood. We sequenced the genome of the Hessian fly (Mayetiola destructor; Diptera: Cecidomyiidae), a plant parasitic gall midge and a pest of wheat (Triticum spp.), with the aim of identifying genic modifications that contribute to its plant-parasitic lifestyle. Among several adaptive modifications, we discovered an expansive reservoir of potential effector proteins. Nearly 5% of the 20,163 predicted gene models matched putative effector gene transcripts present in the M. destructor larval salivary gland. Another 466 putative effectors were discovered among the genes that have no sequence similarities in other organisms. The largest known arthropod gene family (family SSGP-71) was also discovered within the effector reservoir. SSGP-71 proteins lack sequence homologies to other proteins, but their structures resemble both ubiquitin E3 ligases in plants and E3-ligase-mimicking effectors in plant pathogenic bacteria. SSGP-71 proteins and wheat Skp proteins interact in vivo. Mutations in different SSGP-71 genes avoid the effector-triggered immunity that is directed by the wheat resistance genes H6 and H9. Results point to effectors as the agents responsible for arthropod-induced plant gall formation.


Subject(s)
Chromosomes/genetics , Diptera/genetics , Multigene Family/genetics , Phylogeny , Plant Tumors/genetics , Triticum/parasitology , Adaptation, Biological/genetics , Amino Acid Sequence , Animals , Base Sequence , Diptera/metabolism , Larva/metabolism , Models, Genetic , Molecular Sequence Data , Sequence Analysis, DNA , Sequence Homology , Sexual Behavior, Animal/physiology , Two-Hybrid System Techniques , Ubiquitin-Protein Ligases/genetics
15.
Circ Cardiovasc Genet ; 7(3): 350-8, 2014 Jun.
Article in English | MEDLINE | ID: mdl-24951661

ABSTRACT

BACKGROUND: The pulmonary function measures of forced expiratory volume in 1 second (FEV1) and its ratio to forced vital capacity (FVC) are used in the diagnosis and monitoring of lung diseases and predict cardiovascular mortality in the general population. Genome-wide association studies (GWASs) have identified numerous loci associated with FEV1 and FEV1/FVC, but the causal variants remain uncertain. We hypothesized that novel or rare variants poorly tagged by GWASs may explain the significant associations between FEV1/FVC and 2 genes: ADAM19 and HTR4. METHODS AND RESULTS: We sequenced ADAM19 and its promoter region along with the ≈21-kb portion of HTR4 harboring GWAS single-nucleotide polymorphisms for pulmonary function and analyzed associations with FEV1/FVC among 3983 participants of European ancestry from Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) Consortium. Meta-analysis of common variants in each region identified statistically significant associations (316 tests; P<1.58×10(-4)) with FEV1/FVC for 14 ADAM19 single-nucleotide polymorphisms and 24 HTR4 single-nucleotide polymorphisms. After conditioning on the sentinel GWASs hit in each gene (ADAM19 rs1422795, minor allele frequency=0.33 and HTR4 rs11168048, minor allele frequency=0.40], 1 single-nucleotide polymorphism remained statistically significant (ADAM19 rs13155908, minor allele frequency=0.12; P=1.56×10(-4)). Analysis of rare variants (minor allele frequency <1%) using sequence kernel association test did not identify associations with either region. CONCLUSIONS: Sequencing identified 1 common variant associated with FEV1/FVC independent of the sentinel ADAM19 GWAS hit and supports the original HTR4 GWAS findings. Rare variants do not seem to underlie GWAS associations with pulmonary function for common variants in ADAM19 and HTR4.


Subject(s)
ADAM Proteins/genetics , Aging/genetics , Genetic Variation , Heart Diseases/genetics , Lung/physiopathology , Aged , Aged, 80 and over , Cohort Studies , Female , Genome-Wide Association Study , Genomics , Heart Diseases/epidemiology , Heart Diseases/physiopathology , Humans , Male , Middle Aged , Polymorphism, Single Nucleotide , Sequence Analysis, DNA
16.
Circ Cardiovasc Genet ; 7(3): 335-43, 2014 Jun.
Article in English | MEDLINE | ID: mdl-24951659

ABSTRACT

BACKGROUND: Genome-wide association studies have identified thousands of genetic variants that influence a variety of diseases and health-related quantitative traits. However, the causal variants underlying the majority of genetic associations remain unknown. Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) Consortium Targeted Sequencing Study aims to follow up genome-wide association study signals and identify novel associations of the allelic spectrum of identified variants with cardiovascular-related traits. METHODS AND RESULTS: The study included 4231 participants from 3 CHARGE cohorts: the Atherosclerosis Risk in Communities Study, the Cardiovascular Health Study, and the Framingham Heart Study. We used a case-cohort design in which we selected both a random sample of participants and participants with extreme phenotypes for each of 14 traits. We sequenced and analyzed 77 genomic loci, which had previously been associated with ≥1 of 14 phenotypes. A total of 52 736 variants were characterized by sequencing and passed our stringent quality control criteria. For common variants (minor allele frequency ≥1%), we performed unweighted regression analyses to obtain P values for associations and weighted regression analyses to obtain effect estimates that accounted for the sampling design. For rare variants, we applied 2 approaches: collapsed aggregate statistics and joint analysis of variants using the sequence kernel association test. CONCLUSIONS: We sequenced 77 genomic loci in participants from 3 cohorts. We established a set of filters to identify high-quality variants and implemented statistical and bioinformatics strategies to analyze the sequence data and identify potentially functional variants within genome-wide association study loci.


Subject(s)
Aging/genetics , Genome-Wide Association Study , Heart Diseases/genetics , Adult , Aged , Aged, 80 and over , Cohort Studies , Female , Genetic Variation , Genomics , Heart Diseases/epidemiology , Humans , Male , Middle Aged , Polymorphism, Single Nucleotide , Research Design , Sequence Analysis, DNA
17.
PLoS One ; 9(6): e99798, 2014.
Article in English | MEDLINE | ID: mdl-24959832

ABSTRACT

BACKGROUND: Stroke, the leading neurologic cause of death and disability, has a substantial genetic component. We previously conducted a genome-wide association study (GWAS) in four prospective studies from the Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) consortium and demonstrated that sequence variants near the NINJ2 gene are associated with incident ischemic stroke. Here, we sought to fine-map functional variants in the region and evaluate the contribution of rare variants to ischemic stroke risk. METHODS AND RESULTS: We sequenced 196 kb around NINJ2 on chromosome 12p13 among 3,986 European ancestry participants, including 475 ischemic stroke cases, from the Atherosclerosis Risk in Communities Study, Cardiovascular Health Study, and Framingham Heart Study. Meta-analyses of single-variant tests for 425 common variants (minor allele frequency [MAF] ≥ 1%) confirmed the original GWAS results and identified an independent intronic variant, rs34166160 (MAF = 0.012), most significantly associated with incident ischemic stroke (HR = 1.80, p = 0.0003). Aggregating 278 putatively-functional variants with MAF≤ 1% using count statistics, we observed a nominally statistically significant association, with the burden of rare NINJ2 variants contributing to decreased ischemic stroke incidence (HR = 0.81; p = 0.026). CONCLUSION: Common and rare variants in the NINJ2 region were nominally associated with incident ischemic stroke among a subset of CHARGE participants. Allelic heterogeneity at this locus, caused by multiple rare, low frequency, and common variants with disparate effects on risk, may explain the difficulties in replicating the original GWAS results. Additional studies that take into account the complex allelic architecture at this locus are needed to confirm these findings.


Subject(s)
Cell Adhesion Molecules, Neuronal/genetics , Genetic Association Studies/methods , Ischemia/genetics , Myocardial Infarction/genetics , White People/genetics , Female , Genetic Heterogeneity , Humans , Introns , Male , Myocardial Infarction/etiology , Polymorphism, Single Nucleotide , Prospective Studies , Sequence Analysis, DNA
18.
Heart Rhythm ; 11(3): 452-7, 2014 Mar.
Article in English | MEDLINE | ID: mdl-24239840

ABSTRACT

BACKGROUND: Genome-wide association studies (GWAS) have identified common genetic variants that predispose to atrial fibrillation (AF). It is unclear whether rare and low-frequency variants in genes implicated by such GWAS confer additional risk of AF. OBJECTIVE: To study the association of genetic variants with AF at GWAS top loci. METHODS: In the Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) Targeted Sequencing Study, we selected and sequenced 77 target gene regions from GWAS loci of complex diseases or traits, including 4 genes hypothesized to be related to AF (PRRX1, CAV1, CAV2, and ZFHX3). Sequencing was performed in participants with (n = 948) and without (n = 3330) AF from the Atherosclerosis Risk in Communities Study, the Cardiovascular Health Study, the Framingham Heart Study, and the Massachusetts General Hospital. RESULTS: One common variant (rs11265611; P = 1.70 × 10(-6)) intronic to IL6R (interleukin-6 receptor gene) was significantly associated with AF after Bonferroni correction (odds ratio 0.70; 95% confidence interval 0.58-0.85). The variant was not genotyped or imputed by prior GWAS, but it is in linkage disequilibrium (r(2) = .69) with the single-nucleotide polymorphism, with the strongest association with AF so far at this locus (rs4845625). In the rare variant joint analysis, damaging variants within the PRRX1 region showed significant association with AF after Bonferroni correction (P = .01). CONCLUSIONS: We identified 1 common single-nucleotide polymorphism and 1 gene region that were significantly associated with AF. Future sequencing efforts with larger sample sizes and more comprehensive genome coverage are anticipated to identify additional AF-related variants.


Subject(s)
Atrial Fibrillation/genetics , Homeodomain Proteins/genetics , Polymorphism, Single Nucleotide , Receptors, Interleukin-6/genetics , Aged , Female , Genetic Predisposition to Disease , Genetic Variation , Genome-Wide Association Study , Humans , Linkage Disequilibrium , Male , Middle Aged
19.
Science ; 344(6188): 1168-1173, 2014 Jun 06.
Article in English | MEDLINE | ID: mdl-24904168

ABSTRACT

Sheep (Ovis aries) are a major source of meat, milk, and fiber in the form of wool and represent a distinct class of animals that have a specialized digestive organ, the rumen, that carries out the initial digestion of plant material. We have developed and analyzed a high-quality reference sheep genome and transcriptomes from 40 different tissues. We identified highly expressed genes encoding keratin cross-linking proteins associated with rumen evolution. We also identified genes involved in lipid metabolism that had been amplified and/or had altered tissue expression patterns. This may be in response to changes in the barrier lipids of the skin, an interaction between lipid metabolism and wool synthesis, and an increased role of volatile fatty acids in ruminants compared with nonruminant animals.


Subject(s)
Lipid Metabolism/physiology , Rumen/physiology , Sheep, Domestic/genetics , Sheep, Domestic/metabolism , Amino Acid Sequence , Animals , Fatty Acids, Volatile/metabolism , Fatty Acids, Volatile/physiology , Gene Expression Regulation , Genome , Keratins, Hair-Specific/genetics , Lipid Metabolism/genetics , Molecular Sequence Data , Phylogeny , Rumen/metabolism , Sheep, Domestic/classification , Transcriptome , Wool/growth & development
20.
Genome Med ; 5(6): 57, 2013.
Article in English | MEDLINE | ID: mdl-23806086

ABSTRACT

BACKGROUND: The debate regarding the relative merits of whole genome sequencing (WGS) versus exome sequencing (ES) centers around comparative cost, average depth of coverage for each interrogated base, and their relative efficiency in the identification of medically actionable variants from the myriad of variants identified by each approach. Nevertheless, few genomes have been subjected to both WGS and ES, using multiple next generation sequencing platforms. In addition, no personal genome has been so extensively analyzed using DNA derived from peripheral blood as opposed to DNA from transformed cell lines that may either accumulate mutations during propagation or clonally expand mosaic variants during cell transformation and propagation. METHODS: We investigated a genome that was studied previously by SOLiD chemistry using both ES and WGS, and now perform six independent ES assays (Illumina GAII (x2), Illumina HiSeq (x2), Life Technologies' Personal Genome Machine (PGM) and Proton), and one additional WGS (Illumina HiSeq). RESULTS: We compared the variants identified by the different methods and provide insights into the differences among variants identified between ES runs in the same technology platform and among different sequencing technologies. We resolved the true genotypes of medically actionable variants identified in the proband through orthogonal experimental approaches. Furthermore, ES identified an additional SH3TC2 variant (p.M1?) that likely contributes to the phenotype in the proband. CONCLUSIONS: ES identified additional medically actionable variant calls and helped resolve ambiguous single nucleotide variants (SNV) documenting the power of increased depth of coverage of the captured targeted regions. Comparative analyses of WGS and ES reveal that pseudogenes and segmental duplications may explain some instances of apparent disease mutations in unaffected individuals.

SELECTION OF CITATIONS
SEARCH DETAIL