Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 82
Filter
Add more filters

Publication year range
1.
Genes Dev ; 33(19-20): 1381-1396, 2019 10 01.
Article in English | MEDLINE | ID: mdl-31488579

ABSTRACT

Short telomere syndromes manifest as familial idiopathic pulmonary fibrosis; they are the most common premature aging disorders. We used genome-wide linkage to identify heterozygous loss of function of ZCCHC8, a zinc-knuckle containing protein, as a cause of autosomal dominant pulmonary fibrosis. ZCCHC8 associated with TR and was required for telomerase function. In ZCCHC8 knockout cells and in mutation carriers, genomically extended telomerase RNA (TR) accumulated at the expense of mature TR, consistent with a role for ZCCHC8 in mediating TR 3' end targeting to the nuclear RNA exosome. We generated Zcchc8-null mice and found that heterozygotes, similar to human mutation carriers, had TR insufficiency but an otherwise preserved transcriptome. In contrast, Zcchc8-/- mice developed progressive and fatal neurodevelopmental pathology with features of a ciliopathy. The Zcchc8-/- brain transcriptome was highly dysregulated, showing accumulation and 3' end misprocessing of other low-abundance RNAs, including those encoding cilia components as well as the intronless replication-dependent histones. Our data identify a novel cause of human short telomere syndromes-familial pulmonary fibrosis and uncover nuclear exosome targeting as an essential 3' end maturation mechanism that vertebrate TR shares with replication-dependent histones.


Subject(s)
Carrier Proteins/genetics , Idiopathic Pulmonary Fibrosis/genetics , Loss of Function Mutation , Nuclear Proteins/genetics , RNA/metabolism , Telomerase/metabolism , Animals , Brain/enzymology , Brain/physiopathology , Cell Line , Cilia/genetics , Female , Genetic Linkage , HCT116 Cells , Humans , Idiopathic Pulmonary Fibrosis/enzymology , Idiopathic Pulmonary Fibrosis/physiopathology , Male , Mice , Mice, Knockout , Neurodevelopmental Disorders/genetics , Pedigree , RNA Processing, Post-Transcriptional/genetics , Telomere Shortening/genetics
2.
Am J Hum Genet ; 109(10): 1894-1908, 2022 10 06.
Article in English | MEDLINE | ID: mdl-36206743

ABSTRACT

Individuals with cystic fibrosis (CF) develop complications of the gastrointestinal tract influenced by genetic variants outside of CFTR. Cystic fibrosis-related diabetes (CFRD) is a distinct form of diabetes with a variable age of onset that occurs frequently in individuals with CF, while meconium ileus (MI) is a severe neonatal intestinal obstruction affecting ∼20% of newborns with CF. CFRD and MI are slightly correlated traits with previous evidence of overlap in their genetic architectures. To better understand the genetic commonality between CFRD and MI, we used whole-genome-sequencing data from the CF Genome Project to perform genome-wide association. These analyses revealed variants at 11 loci (6 not previously identified) that associated with MI and at 12 loci (5 not previously identified) that associated with CFRD. Of these, variants at SLC26A9, CEBPB, and PRSS1 associated with both traits; variants at SLC26A9 and CEBPB increased risk for both traits, while variants at PRSS1, the higher-risk alleles for CFRD, conferred lower risk for MI. Furthermore, common and rare variants within the SLC26A9 locus associated with MI only or CFRD only. As expected, different loci modify risk of CFRD and MI; however, a subset exhibit pleiotropic effects indicating etiologic and mechanistic overlap between these two otherwise distinct complications of CF.


Subject(s)
Cystic Fibrosis , Diabetes Mellitus , Infant, Newborn, Diseases , Intestinal Obstruction , Cystic Fibrosis/complications , Cystic Fibrosis/genetics , Cystic Fibrosis Transmembrane Conductance Regulator/genetics , Diabetes Mellitus/genetics , Genome-Wide Association Study , Humans , Infant, Newborn , Intestinal Obstruction/complications , Intestinal Obstruction/genetics
3.
Am J Respir Crit Care Med ; 207(10): 1324-1333, 2023 05 15.
Article in English | MEDLINE | ID: mdl-36921087

ABSTRACT

Rationale: Lung disease is the major cause of morbidity and mortality in persons with cystic fibrosis (pwCF). Variability in CF lung disease has substantial non-CFTR (CF transmembrane conductance regulator) genetic influence. Identification of genetic modifiers has prognostic and therapeutic importance. Objectives: Identify genetic modifier loci and genes/pathways associated with pulmonary disease severity. Methods: Whole-genome sequencing data on 4,248 unique pwCF with pancreatic insufficiency and lung function measures were combined with imputed genotypes from an additional 3,592 patients with pancreatic insufficiency from the United States, Canada, and France. This report describes association of approximately 15.9 million SNPs using the quantitative Kulich normal residual mortality-adjusted (KNoRMA) lung disease phenotype in 7,840 pwCF using premodulator lung function data. Measurements and Main Results: Testing included common and rare SNPs, transcriptome-wide association, gene-level, and pathway analyses. Pathway analyses identified novel associations with genes that have key roles in organ development, and we hypothesize that these genes may relate to dysanapsis and/or variability in lung repair. Results confirmed and extended previous genome-wide association study findings. These whole-genome sequencing data provide finely mapped genetic information to support mechanistic studies. No novel primary associations with common single variants or rare variants were found. Multilocus effects at chr5p13 (SLC9A3/CEP72) and chr11p13 (EHF/APIP) were identified. Variant effect size estimates at associated loci were consistently ordered across the cohorts, indicating possible age or birth cohort effects. Conclusions: This premodulator genomic, transcriptomic, and pathway association study of 7,840 pwCF will facilitate mechanistic and postmodulator genetic studies and the development of novel therapeutics for CF lung disease.


Subject(s)
Cystic Fibrosis , Humans , Cystic Fibrosis/genetics , Genome-Wide Association Study/methods , Cystic Fibrosis Transmembrane Conductance Regulator/genetics , Patient Acuity , Lung , Microtubule-Associated Proteins/genetics
4.
BMC Genet ; 17 Suppl 2: 8, 2016 Feb 03.
Article in English | MEDLINE | ID: mdl-26866982

ABSTRACT

High-density genetic marker data, especially sequence data, imply an immense multiple testing burden. This can be ameliorated by filtering genetic variants, exploiting or accounting for correlations between variants, jointly testing variants, and by incorporating informative priors. Priors can be based on biological knowledge or predicted variant function, or even be used to integrate gene expression or other omics data. Based on Genetic Analysis Workshop (GAW) 19 data, this article discusses diversity and usefulness of functional variant scores provided, for example, by PolyPhen2, SIFT, or RegulomeDB annotations. Incorporating functional scores into variant filters or weights and adjusting the significance level for correlations between variants yielded significant associations with blood pressure traits in a large family study of Mexican Americans (GAW19 data set). Marker rs218966 in gene PHF14 and rs9836027 in MAP4 significantly associated with hypertension; additionally, rare variants in SNUPN significantly associated with systolic blood pressure. Variant weights strongly influenced the power of kernel methods and burden tests. Apart from variant weights in test statistics, prior weights may also be used when combining test statistics or to informatively weight p values while controlling false discovery rate (FDR). Indeed, power improved when gene expression data for FDR-controlled informative weighting of association test p values of genes was used. Finally, approaches exploiting variant correlations included identity-by-descent mapping and the optimal strategy for joint testing rare and common variants, which was observed to depend on linkage disequilibrium structure.


Subject(s)
Genetic Variation , Mexican Americans/genetics , Microtubule-Associated Proteins/genetics , RNA Cap-Binding Proteins/genetics , Receptors, Cytoplasmic and Nuclear/genetics , Transcription Factors/genetics , Blood Pressure/genetics , Genetic Markers/genetics , Humans , Hypertension/genetics , Software
5.
Nat Genet ; 39(9): 1045-51, 2007 Sep.
Article in English | MEDLINE | ID: mdl-17728769

ABSTRACT

The Genetic Association Information Network (GAIN) is a public-private partnership established to investigate the genetic basis of common diseases through a series of collaborative genome-wide association studies. GAIN has used new approaches for project selection, data deposition and distribution, collaborative analysis, publication and protection from premature intellectual property claims. These demonstrate a new commitment to shared scientific knowledge that should facilitate rapid advances in understanding the genetics of complex diseases.


Subject(s)
Biomedical Research/methods , Genetic Predisposition to Disease , Genome, Human/genetics , Information Services/organization & administration , Attention Deficit Disorder with Hyperactivity/genetics , Bipolar Disorder/genetics , Humans , International Cooperation , Models, Organizational , Psoriasis/genetics
6.
Hum Mol Genet ; 22(10): 2119-27, 2013 May 15.
Article in English | MEDLINE | ID: mdl-23314186

ABSTRACT

With white blood cell count emerging as an important risk factor for chronic inflammatory diseases, genetic associations of differential leukocyte types, specifically monocyte count, are providing novel candidate genes and pathways to further investigate. Circulating monocytes play a critical role in vascular diseases such as in the formation of atherosclerotic plaque. We performed a joint and ancestry-stratified genome-wide association analyses to identify variants specifically associated with monocyte count in 11 014 subjects in the electronic Medical Records and Genomics Network. In the joint and European ancestry samples, we identified novel associations in the chromosome 16 interferon regulatory factor 8 (IRF8) gene (P-value = 2.78×10(-16), ß = -0.22). Other monocyte associations include novel missense variants in the chemokine-binding protein 2 (CCBP2) gene (P-value = 1.88×10(-7), ß = 0.30) and a region of replication found in ribophorin I (RPN1) (P-value = 2.63×10(-16), ß = -0.23) on chromosome 3. The CCBP2 and RPN1 region is located near GATA binding protein2 gene that has been previously shown to be associated with coronary heart disease. On chromosome 9, we found a novel association in the prostaglandin reductase 1 gene (P-value = 2.29×10(-7), ß = 0.16), which is downstream from lysophosphatidic acid receptor 1. This region has previously been shown to be associated with monocyte count. We also replicated monocyte associations of genome-wide significance (P-value = 5.68×10(-17), ß = -0.23) at the integrin, alpha 4 gene on chromosome 2. The novel IRF8 results and further replications provide supporting evidence of genetic regions associated with monocyte count.


Subject(s)
Atherosclerosis/blood , Atherosclerosis/genetics , Chromosomes, Human/genetics , Genome-Wide Association Study , Leukocyte Count , Adult , Aged , Chromosomes, Human/metabolism , Female , GATA2 Transcription Factor/genetics , GATA2 Transcription Factor/metabolism , Humans , Integrin alpha4/genetics , Integrin alpha4/metabolism , Interferon Regulatory Factors/genetics , Interferon Regulatory Factors/metabolism , Male , Membrane Proteins/genetics , Membrane Proteins/metabolism , Middle Aged , Monocytes , Mutation, Missense , Receptors, Chemokine/genetics , Receptors, Chemokine/metabolism , Receptors, Lysophosphatidic Acid/genetics , Receptors, Lysophosphatidic Acid/metabolism
7.
Genet Med ; 17(10): 782-8, 2015 Oct.
Article in English | MEDLINE | ID: mdl-25569433

ABSTRACT

PURPOSE: In March 2013 the American College of Medical Genetics and Genomics published a list of 56 genes with the recommendation that pathogenic and likely pathogenic variants detected incidentally by clinical sequencing be reported to patients. As an initial step in determining the practical consequences of this recommendation in the research setting, we searched for variants in these genes in 232 whole-exome sequences from the Baylor-Hopkins Center for Mendelian Genomics. METHODS: We identified rare, nonsynonymous, and splicing single-nucleotide variants and insertions/deletions and assessed variant classification using the Human Gene Mutation, Emory, and ClinVar databases. We analyzed the burden of mutation in each of the 56 genes and determined which variants should be reported to patients. RESULTS: Our filtering resulted in 249 distinct variants, with a mean of 1.69 variants per individual. Half of these were novel missense mutations not classified by any of the three reference databases. Of 101 variants listed in the Human Gene Mutation Database, 48 were also in ClinVar and 3 were also in Emory; half of these shared variants were classified discordantly between databases. Some genes consistently had greater variation than others. In total, 0.86% of individuals had a reportable incidental variant. CONCLUSION: These observations demonstrate some current challenges of assessing phenotypic consequences of incidental variants for counseling patients.


Subject(s)
Exome , Genomics , High-Throughput Nucleotide Sequencing , Incidental Findings , Databases, Nucleic Acid , Female , Genetic Diseases, Inborn/diagnosis , Genetic Diseases, Inborn/genetics , Genomics/methods , Humans , Male , Mutation , Polymorphism, Single Nucleotide
8.
Am J Phys Anthropol ; 158(1): 78-91, 2015 Sep.
Article in English | MEDLINE | ID: mdl-26119360

ABSTRACT

OBJECTIVES: The tool-assisted extractive foraging capabilities of captive (zoo) and semi-captive (sanctuary) bonobo (Pan paniscus) groups were compared to each other and to those known in wild chimpanzee (Pan troglodytes) cultures. MATERIALS AND METHODS: The bonobos were provided with natural raw materials and challenged with tasks not previously encountered, in experimental settings simulating natural contexts where resources requiring special retrieval efforts were hidden. They were shown that food was buried underground or inserted into long bone cavities, and left to tackle the tasks without further intervention. RESULTS: The bonobos used modified branches and unmodified antlers or stones to dig under rocks and in the ground or to break bones to retrieve the food. Antlers, short sticks, long sticks, and rocks were effectively used as mattocks, daggers, levers, and shovels, respectively. One bonobo successively struck a long bone with an angular hammer stone, completely bisecting it longitudinally. Another bonobo modified long branches into spears and used them as attack weapons and barriers. Bonobos in the sanctuary, unlike those in the zoo, used tool sets to perform sequential actions. DISCUSSION: The competent and diverse tool-assisted extractive foraging by the bonobos corroborates and complements the extensive information on similar tool use by chimpanzees, suggesting that such competence is a shared trait. Better performance by the sanctuary bonobos than the zoo group was probably due to differences in their cultural exposure and housing conditions. The bonobos' foraging techniques resembled some of those attributed to Oldowan hominins, implying that they can serve as referential models.


Subject(s)
Feeding Behavior/physiology , Pan paniscus/physiology , Tool Use Behavior/physiology , Animals , Animals, Zoo , Anthropology, Physical , Female , Male
9.
Proc Natl Acad Sci U S A ; 109(36): 14500-3, 2012 Sep 04.
Article in English | MEDLINE | ID: mdl-22912400

ABSTRACT

Using direct percussion, language-competent bonobo-chimpanzees Kanzi and Pan-Banisha produced a significantly wider variety of flint tool types than hitherto reported, and used them task-specifically to break wooden logs or to dig underground for food retrieval. For log breaking, small flakes were rotated drill-like or used as scrapers, whereas thick cortical flakes were used as axes or wedges, leaving consistent wear patterns along the glued slits, the weakest areas of the log. For digging underground, a variety of modified stone tools, as well as unmodified flint nodules, were used as shovels. Such tool production and utilization competencies reported here in Pan indicate that present-day Pan exhibits Homo-like technological competencies.


Subject(s)
Biological Evolution , Learning/physiology , Pan paniscus/physiology , Tool Use Behavior/physiology , Animals , Female , Male , Observation
10.
J Cyst Fibros ; 2024 May 10.
Article in English | MEDLINE | ID: mdl-38734509

ABSTRACT

BACKGROUND: Cystic fibrosis (CF) is caused by deleterious variants in each CFTR gene. We investigated the utility of whole-gene CFTR sequencing when fewer than two pathogenic or likely pathogenic (P/LP) variants were detected by conventional testing (sequencing of exons and flanking introns) of CFTR. METHODS: Individuals with features of CF and a CF-diagnostic sweat chloride concentration with zero or one P/LP variants identified by conventional testing enrolled in the CF Mutation Analysis Program (MAP) underwent whole-gene CFTR sequencing. Replication was performed on individuals enrolled in the CF Genome Project (CFGP), followed by phenotype review and interrogation of other genes. RESULTS: Whole-gene sequencing identified a second P/LP variant in 20/43 MAP enrollees (47 %) and 10/22 CFGP enrollees (45 %) who had one P/LP variant after conventional testing. No P/LP variants were detected when conventional testing was negative (MAP: n = 43; CFGP: n = 13). Genome-wide analysis was unable to find an alternative etiology in CFGP participants with fewer than two P/LP CFTR variants and CF could not be confirmed in 91 % following phenotype re-review. CONCLUSIONS: Whole-gene CFTR analysis is beneficial in individuals with one previously-identified P/LP variant and a CF-diagnostic sweat chloride. Negative conventional CFTR testing indicates that the phenotype should be re-evaluated.

11.
Hum Mol Genet ; 20(24): 5012-23, 2011 Dec 15.
Article in English | MEDLINE | ID: mdl-21926416

ABSTRACT

We performed a multistage genome-wide association study of melanoma. In a discovery cohort of 1804 melanoma cases and 1026 controls, we identified loci at chromosomes 15q13.1 (HERC2/OCA2 region) and 16q24.3 (MC1R) regions that reached genome-wide significance within this study and also found strong evidence for genetic effects on susceptibility to melanoma from markers on chromosome 9p21.3 in the p16/ARF region and on chromosome 1q21.3 (ARNT/LASS2/ANXA9 region). The most significant single-nucleotide polymorphisms (SNPs) in the 15q13.1 locus (rs1129038 and rs12913832) lie within a genomic region that has profound effects on eye and skin color; notably, 50% of variability in eye color is associated with variation in the SNP rs12913832. Because eye and skin colors vary across European populations, we further evaluated the associations of the significant SNPs after carefully adjusting for European substructure. We also evaluated the top 10 most significant SNPs by using data from three other genome-wide scans. Additional in silico data provided replication of the findings from the most significant region on chromosome 1q21.3 rs7412746 (P = 6 × 10(-10)). Together, these data identified several candidate genes for additional studies to identify causal variants predisposing to increased risk for developing melanoma.


Subject(s)
Genetic Loci/genetics , Genetic Predisposition to Disease , Genome-Wide Association Study , Melanoma/genetics , Skin Neoplasms/genetics , Case-Control Studies , Chromosomes, Human, Pair 1/genetics , Genetic Markers , Guanine Nucleotide Exchange Factors/genetics , Humans , Meta-Analysis as Topic , Pigmentation/genetics , Polymorphism, Single Nucleotide/genetics , Ubiquitin-Protein Ligases
12.
Proc Natl Acad Sci U S A ; 107(16): 7401-6, 2010 Apr 20.
Article in English | MEDLINE | ID: mdl-20385819

ABSTRACT

We executed a genome-wide association scan for age-related macular degeneration (AMD) in 2,157 cases and 1,150 controls. Our results validate AMD susceptibility loci near CFH (P < 10(-75)), ARMS2 (P < 10(-59)), C2/CFB (P < 10(-20)), C3 (P < 10(-9)), and CFI (P < 10(-6)). We compared our top findings with the Tufts/Massachusetts General Hospital genome-wide association study of advanced AMD (821 cases, 1,709 controls) and genotyped 30 promising markers in additional individuals (up to 7,749 cases and 4,625 controls). With these data, we identified a susceptibility locus near TIMP3 (overall P = 1.1 x 10(-11)), a metalloproteinase involved in degradation of the extracellular matrix and previously implicated in early-onset maculopathy. In addition, our data revealed strong association signals with alleles at two loci (LIPC, P = 1.3 x 10(-7); CETP, P = 7.4 x 10(-7)) that were previously associated with high-density lipoprotein cholesterol (HDL-c) levels in blood. Consistent with the hypothesis that HDL metabolism is associated with AMD pathogenesis, we also observed association with AMD of HDL-c-associated alleles near LPL (P = 3.0 x 10(-3)) and ABCA1 (P = 5.6 x 10(-4)). Multilocus analysis including all susceptibility loci showed that 329 of 331 individuals (99%) with the highest-risk genotypes were cases, and 85% of these had advanced AMD. Our studies extend the catalog of AMD associated loci, help identify individuals at high risk of disease, and provide clues about underlying cellular pathways that should eventually lead to new therapies.


Subject(s)
Genetic Predisposition to Disease , Lipoproteins, HDL/metabolism , Macular Degeneration/genetics , Tissue Inhibitor of Metalloproteinase-3/genetics , Alleles , Case-Control Studies , Chromosome Mapping , Complement Factor I/genetics , Genetic Variation , Genome-Wide Association Study , Genotype , Humans , Polymorphism, Single Nucleotide , Regression Analysis , Risk , Tissue Inhibitor of Metalloproteinase-3/physiology
13.
Proc Natl Acad Sci U S A ; 107(11): 5082-7, 2010 Mar 16.
Article in English | MEDLINE | ID: mdl-20202923

ABSTRACT

Excessive alcohol consumption is one of the leading causes of preventable death in the United States. Approximately 14% of those who use alcohol meet criteria during their lifetime for alcohol dependence, which is characterized by tolerance, withdrawal, inability to stop drinking, and continued drinking despite serious psychological or physiological problems. We explored genetic influences on alcohol dependence among 1,897 European-American and African-American subjects with alcohol dependence compared with 1,932 unrelated, alcohol-exposed, nondependent controls. Constitutional DNA of each subject was genotyped using the Illumina 1M beadchip. Fifteen SNPs yielded P < 10(-5), but in two independent replication series, no SNP passed a replication threshold of P < 0.05. Candidate gene GABRA2, which encodes the GABA receptor alpha2 subunit, was evaluated independently. Five SNPs at GABRA2 yielded nominal (uncorrected) P < 0.05, with odds ratios between 1.11 and 1.16. Further dissection of the alcoholism phenotype, to disentangle the influence of comorbid substance-use disorders, will be a next step in identifying genetic variants associated with alcohol dependence.


Subject(s)
Alcoholism/genetics , Genome-Wide Association Study , Adult , Case-Control Studies , Family , Female , Humans , Male , Odds Ratio , Polymorphism, Single Nucleotide/genetics , Receptors, GABA-A/genetics , Reproducibility of Results
14.
Hum Hered ; 74(1): 36-44, 2012.
Article in English | MEDLINE | ID: mdl-23154503

ABSTRACT

OBJECTIVE: Custom genotyping of markers in families with familial idiopathic scoliosis were used to fine-map candidate regions on chromosomes 9 and 16 in order to identify candidate genes that contribute to this disorder and prioritize them for next-generation sequence analysis. METHODS: Candidate regions on 9q and 16p-16q, previously identified as linked to familial idiopathic scoliosis in a study of 202 families, were genotyped with a high-density map of single nucleotide polymorphisms. Tests of linkage for fine-mapping and intra-familial tests of association, including tiled regression, were performed on scoliosis as both a qualitative and quantitative trait. RESULTS AND CONCLUSIONS: Nominally significant linkage results were found for markers in both candidate regions. Results from intra-familial tests of association and tiled regression corroborated the linkage findings and identified possible candidate genes suitable for follow-up with next-generation sequencing in these same families. Candidate genes that met our prioritization criteria included FAM129B and CERCAM on chromosome 9 and SYT1, GNAO1, and CDH3 on chromosome 16.


Subject(s)
Chromosomes, Human, Pair 16/genetics , Chromosomes, Human, Pair 9/genetics , Genetic Linkage , Scoliosis/genetics , Adult , Chromosome Mapping/methods , Female , Genetic Heterogeneity , Genetic Predisposition to Disease , Genotype , Humans , Male , Middle Aged , Polymorphism, Single Nucleotide
15.
Genet Epidemiol ; 35 Suppl 1: S12-7, 2011.
Article in English | MEDLINE | ID: mdl-22128052

ABSTRACT

With the advent of novel sequencing technologies, interest in the identification of rare variants that influence common traits has increased rapidly. Standard statistical methods, such as the Cochrane-Armitage trend test or logistic regression, fail in this setting for the analysis of unrelated subjects because of the rareness of the variants. Recently, various alternative approaches have been proposed that circumvent the rareness problem by collapsing rare variants in a defined genetic region or sets of regions. We provide an overview of these collapsing methods for association analysis and discuss the use of permutation approaches for significance testing of the data-adaptive methods.


Subject(s)
Models, Genetic , Models, Statistical , Molecular Epidemiology/methods , Genetic Predisposition to Disease , Humans , Sequence Analysis
16.
Genet Epidemiol ; 35 Suppl 1: S56-60, 2011.
Article in English | MEDLINE | ID: mdl-22128060

ABSTRACT

As part of Genetic Analysis Workshop 17 (GAW17), our group considered the application of novel and standard approaches to the analysis of genotype-phenotype association in next-generation sequencing data. Our group identified a major issue in the analysis of the GAW17 next-generation sequencing data: type I error and false-positive report probability rates higher than those expected based on empirical type I error levels (as high as 90%). Two main causes emerged: population stratification and long-range correlation (gametic phase disequilibrium) between rare variants. Population stratification was expected because of the diverse sample. Correlation between rare variants was attributable to both random causes (e.g., nearly 10,000 of 25,000 markers were private variants, and the sample size was small [n = 697]) and nonrandom causes (more correlation was observed than was expected by random chance). Principal components analysis was used to control for population structure and helped to minimize type I errors, but this was at the expense of identifying fewer causal variants. A novel multiple regression approach showed promise to handle correlation between markers. Further work is needed, first, to identify best practices for the control of type I errors in the analysis of sequencing data and then to explore and compare the many promising new aggregating approaches for identifying markers associated with disease phenotypes.


Subject(s)
Genetic Markers , Molecular Epidemiology/methods , Genetic Association Studies , Genotype , Human Genome Project , Humans , Phenotype , Polymorphism, Single Nucleotide , Principal Component Analysis , Regression Analysis , Sensitivity and Specificity , Sequence Analysis
17.
Genet Epidemiol ; 35(6): 469-78, 2011 Sep.
Article in English | MEDLINE | ID: mdl-21618603

ABSTRACT

Nonsyndromic cleft palate (CP) is a common birth defect with a complex and heterogeneous etiology involving both genetic and environmental risk factors. We conducted a genome-wide association study (GWAS) using 550 case-parent trios, ascertained through a CP case collected in an international consortium. Family-based association tests of single nucleotide polymorphisms (SNP) and three common maternal exposures (maternal smoking, alcohol consumption, and multivitamin supplementation) were used in a combined 2 df test for gene (G) and gene-environment (G × E) interaction simultaneously, plus a separate 1 df test for G × E interaction alone. Conditional logistic regression models were used to estimate effects on risk to exposed and unexposed children. While no SNP achieved genome-wide significance when considered alone, markers in several genes attained or approached genome-wide significance when G × E interaction was included. Among these, MLLT3 and SMC2 on chromosome 9 showed multiple SNPs resulting in an increased risk if the mother consumed alcohol during the peri-conceptual period (3 months prior to conception through the first trimester). TBK1 on chr. 12 and ZNF236 on chr. 18 showed multiple SNPs associated with higher risk of CP in the presence of maternal smoking. Additional evidence of reduced risk due to G × E interaction in the presence of multivitamin supplementation was observed for SNPs in BAALC on chr. 8. These results emphasize the need to consider G × E interaction when searching for genes influencing risk to complex and heterogeneous disorders, such as nonsyndromic CP.


Subject(s)
Cleft Palate/genetics , Alcohol Drinking , Chromosome Mapping , Cleft Palate/chemically induced , Cleft Palate/etiology , Female , Gene-Environment Interaction , Genome-Wide Association Study , Genotype , Humans , Male , Maternal Exposure , Models, Genetic , Parents , Polymorphism, Single Nucleotide , Pregnancy , Risk , Vitamins/therapeutic use
18.
Genet Epidemiol ; 35(8): 887-98, 2011 Dec.
Article in English | MEDLINE | ID: mdl-22125226

ABSTRACT

Genome-wide association studies (GWAS) are a useful approach in the study of the genetic components of complex phenotypes. Aside from large cohorts, GWAS have generally been limited to the study of one or a few diseases or traits. The emergence of biobanks linked to electronic medical records (EMRs) allows the efficient reuse of genetic data to yield meaningful genotype-phenotype associations for multiple phenotypes or traits. Phase I of the electronic MEdical Records and GEnomics (eMERGE-I) Network is a National Human Genome Research Institute-supported consortium composed of five sites to perform various genetic association studies using DNA repositories and EMR systems. Each eMERGE site has developed EMR-based algorithms to comprise a core set of 14 phenotypes for extraction of study samples from each site's DNA repository. Each eMERGE site selected samples for a specific phenotype, and these samples were genotyped at either the Broad Institute or at the Center for Inherited Disease Research using the Illumina Infinium BeadChip technology. In all, approximately 17,000 samples from across the five sites were genotyped. A unified quality control (QC) pipeline was developed by the eMERGE Genomics Working Group and used to ensure thorough cleaning of the data. This process includes examination of sample and marker quality and various batch effects. Upon completion of the genotyping and QC analyses for each site's primary study, eMERGE Coordinating Center merged the datasets from all five sites. This larger merged dataset reentered the established eMERGE QC pipeline. Based on lessons learned during the process, additional analyses and QC checkpoints were added to the pipeline to ensure proper merging. Here, we explore the challenges associated with combining datasets from different genotyping centers and describe the expansion to eMERGE QC pipeline for merged datasets. These additional steps will be useful as the eMERGE project expands to include additional sites in eMERGE-II, and also serve as a starting point for investigators merging multiple genotype datasets accessible through the National Center for Biotechnology Information in the database of Genotypes and Phenotypes. Our experience demonstrates that merging multiple datasets after additional QC can be an efficient use of genotype data despite new challenges that appear in the process.


Subject(s)
Electronic Health Records , Genome-Wide Association Study/standards , Quality Control , Algorithms , Genotype , Humans , National Human Genome Research Institute (U.S.) , Phenotype , United States
19.
Stroke ; 43(4): 980-6, 2012 Apr.
Article in English | MEDLINE | ID: mdl-22363065

ABSTRACT

BACKGROUND AND PURPOSE: Ischemic stroke (IS) shares many common risk factors with coronary artery disease (CAD). We hypothesized that genetic variants associated with myocardial infarction (MI) or CAD may be similarly involved in the etiology of IS. To test this hypothesis, we evaluated whether single-nucleotide polymorphisms (SNPs) at 11 different loci recently associated with MI or CAD through genome-wide association studies were associated with IS. METHODS: Meta-analyses of the associations between the 11 MI-associated SNPs and IS were performed using 6865 cases and 11 395 control subjects recruited from 9 studies. SNPs were either genotyped directly or imputed; in a few cases a surrogate SNP in high linkage disequilibrium was chosen. Logistic regression was performed within each study to obtain study-specific ßs and standard errors. Meta-analysis was conducted using an inverse variance weighted approach assuming a random effect model. RESULTS: Despite having power to detect odds ratio of 1.09-1.14 for overall IS and 1.20-1.32 for major stroke subtypes, none of the SNPs were significantly associated with overall IS and/or stroke subtypes after adjusting for multiple comparisons. CONCLUSIONS: Our results suggest that the major common loci associated with MI risk do not have effects of similar magnitude on overall IS but do not preclude moderate associations restricted to specific IS subtypes. Disparate mechanisms may be critical in the development of acute ischemic coronary and cerebrovascular events.


Subject(s)
Brain Ischemia/genetics , Genome-Wide Association Study , Linkage Disequilibrium , Myocardial Infarction/genetics , Polymorphism, Single Nucleotide , Stroke/genetics , Adolescent , Adult , Aged , Aged, 80 and over , Female , Humans , Male , Middle Aged
20.
Hum Genet ; 131(4): 639-52, 2012 Apr.
Article in English | MEDLINE | ID: mdl-22037903

ABSTRACT

White blood cell count (WBC) is unique among identified inflammatory predictors of chronic disease in that it is routinely measured in asymptomatic patients in the course of routine patient care. We led a genome-wide association analysis to identify variants associated with WBC levels in 13,923 subjects in the electronic Medical Records and Genomics (eMERGE) Network. We identified two regions of interest that were each unique to subjects of genetically determined ancestry to the African continent (AA) or to the European continent (EA). WBC varies among different ancestry groups. Despite being ancestry specific, these regions were identifiable in the combined analysis. In AA subjects, the region surrounding the Duffy antigen/chemokine receptor gene (DARC) on 1q21 exhibited significant association (p value = 6.71e-55). These results validate the previously reported association between WBC and of the regulatory variant rs2814778 in the promoter region, which causes the Duffy negative phenotype (Fy-/-). A second missense variant (rs12075) is responsible for the two principal antigens, Fya and Fyb of the Duffy blood group system. The two variants, consisting of four alleles, act in concert to produce five antigens and subsequent phenotypes. We were able to identify the marginal and novel interaction effects of these two variants on WBC. In the EA subjects, we identified significantly associated SNPs tagging three separate genes in the 17q21 region: (1) GSDMA, (2) MED24, and (3) PSMD3. Variants in this region have been reported to be associated with WBC, neutrophil count, and inflammatory diseases including asthma and Crohn's disease.


Subject(s)
Genetic Variation , Genome-Wide Association Study/methods , Leukocyte Count , Adult , Aged , Aged, 80 and over , Alleles , Black People/genetics , Duffy Blood-Group System/genetics , Female , Gene Frequency , Genome, Human/genetics , Genome-Wide Association Study/statistics & numerical data , Genotype , Humans , Male , Mediator Complex/genetics , Medical Records/statistics & numerical data , Middle Aged , Neoplasm Proteins/genetics , Polymorphism, Single Nucleotide , Principal Component Analysis , Proteasome Endopeptidase Complex/genetics , Receptors, Cell Surface/genetics , White People/genetics , White People/statistics & numerical data
SELECTION OF CITATIONS
SEARCH DETAIL