RESUMO
BACKGROUND: The association between genetic variants on the X chromosome to risk of COPD has not been fully explored. We hypothesize that the X chromosome harbors variants important in determining risk of COPD related phenotypes and may drive sex differences in COPD manifestations. METHODS: Using X chromosome data from three COPD-enriched cohorts of adult smokers, we performed X chromosome specific quality control, imputation, and testing for association with COPD case-control status, lung function, and quantitative emphysema. Analyses were performed among all subjects, then stratified by sex, and subsequently combined in meta-analyses. RESULTS: Among 10,193 subjects of non-Hispanic white or European ancestry, a variant near TMSB4X, rs5979771, reached genome-wide significance for association with lung function measured by FEV1/FVC ([Formula: see text] 0.020, SE 0.004, p 4.97 × 10-08), with suggestive evidence of association with FEV1 ([Formula: see text] 0.092, SE 0.018, p 3.40 × 10-07). Sex-stratified analyses revealed X chromosome variants that were differentially trending in one sex, with significantly different effect sizes or directions. CONCLUSIONS: This investigation identified loci influencing lung function, COPD, and emphysema in a comprehensive genetic association meta-analysis of X chromosome genetic markers from multiple COPD-related datasets. Sex differences play an important role in the pathobiology of complex lung disease, including X chromosome variants that demonstrate differential effects by sex and variants that may be relevant through escape from X chromosome inactivation. Comprehensive interrogation of the X chromosome to better understand genetic control of COPD and lung function is important to further understanding of disease pathology. Trial registration Genetic Epidemiology of COPD Study (COPDGene) is registered at ClinicalTrials.gov, NCT00608764 (Active since January 28, 2008). Evaluation of COPD Longitudinally to Identify Predictive Surrogate Endpoints Study (ECLIPSE), GlaxoSmithKline study code SCO104960, is registered at ClinicalTrials.gov, NCT00292552 (Active since February 16, 2006). Genetics of COPD in Norway Study (GenKOLS) holds GlaxoSmithKline study code RES11080, Genetics of Chronic Obstructive Lung Disease.
Assuntos
Enfisema , Doença Pulmonar Obstrutiva Crônica , Enfisema Pulmonar , Feminino , Masculino , Humanos , Predisposição Genética para Doença/genética , Estudo de Associação Genômica Ampla , Doença Pulmonar Obstrutiva Crônica/diagnóstico , Doença Pulmonar Obstrutiva Crônica/epidemiologia , Doença Pulmonar Obstrutiva Crônica/genética , Fenótipo , Cromossomo XRESUMO
SARS-CoV-2 mortality has been extensively studied in relation to host susceptibility. How sequence variations in the SARS-CoV-2 genome affect pathogenicity is poorly understood. Starting in October 2020, using the methodology of genome-wide association studies (GWAS), we looked at the association between whole-genome sequencing (WGS) data of the virus and COVID-19 mortality as a potential method of early identification of highly pathogenic strains to target for containment. Although continuously updating our analysis, in December 2020, we analyzed 7548 single-stranded SARS-CoV-2 genomes of COVID-19 patients in the GISAID database and associated variants with mortality using a logistic regression. In total, evaluating 29,891 sequenced loci of the viral genome for association with patient/host mortality, two loci, at 12,053 and 25,088 bp, achieved genome-wide significance (p values of 4.09e-09 and 4.41e-23, respectively), though only 25,088 bp remained significant in follow-up analyses. Our association findings were exclusively driven by the samples that were submitted from Brazil (p value of 4.90e-13 for 25,088 bp). The mutation frequency of 25,088 bp in the Brazilian samples on GISAID has rapidly increased from about 0.4 in October/December 2020 to 0.77 in March 2021. Although GWAS methodology is suitable for samples in which mutation frequencies varies between geographical regions, it cannot account for mutation frequencies that change rapidly overtime, rendering a GWAS follow-up analysis of the GISAID samples that have been submitted after December 2020 as invalid. The locus at 25,088 bp is located in the P.1 strain, which later (April 2021) became one of the distinguishing loci (precisely, substitution V1176F) of the Brazilian strain as defined by the Centers for Disease Control. Specifically, the mutations at 25,088 bp occur in the S2 subunit of the SARS-CoV-2 spike protein, which plays a key role in viral entry of target host cells. Since the mutations alter amino acid coding sequences, they potentially imposing structural changes that could enhance viral infectivity and symptom severity. Our analysis suggests that GWAS methodology can provide suitable analysis tools for the real-time detection of new more transmissible and pathogenic viral strains in databases such as GISAID, though new approaches are needed to accommodate rapidly changing mutation frequencies over time, in the presence of simultaneously changing case/control ratios. Improvements of the associated metadata/patient information in terms of quality and availability will also be important to fully utilize the potential of GWAS methodology in this field.
Assuntos
COVID-19 , Glicoproteína da Espícula de Coronavírus , Brasil , Estudo de Associação Genômica Ampla , Humanos , Mutação , Filogenia , SARS-CoV-2 , Glicoproteína da Espícula de Coronavírus/genéticaRESUMO
MOTIVATION: Analysis of rare variants in family-based studies remains a challenge. Transmission-based approaches provide robustness against population stratification, but the evaluation of the significance of test statistics based on asymptotic theory can be imprecise. Also, power will depend heavily on the choice of the test statistic and on the underlying genetic architecture of the locus, which will be generally unknown. RESULTS: In our proposed framework, we utilize the FBAT haplotype algorithm to obtain the conditional offspring genotype distribution under the null hypothesis given the sufficient statistic. Based on this conditional offspring genotype distribution, the significance of virtually any association test statistic can be evaluated based on simulations or exact computations, without the need for asymptotic approximations. Besides standard linear burden-type statistics, this enables our approach to also evaluate other test statistics such as variance components statistics, higher criticism approaches, and maximum-single-variant-statistics, where asymptotic theory might be involved or does not provide accurate approximations for rare variant data. Based on these P-values, combined test statistics such as the aggregated Cauchy association test (ACAT) can also be utilized. In simulation studies, we show that our framework outperforms existing approaches for family-based studies in several scenarios. We also applied our methodology to a TOPMed whole-genome sequencing dataset with 897 asthmatic trios from Costa Rica. AVAILABILITY AND IMPLEMENTATION: FBAT software is available at https://sites.google.com/view/fbatwebpage. Simulation code is available at https://github.com/julianhecker/FBAT_rare_variant_test_simulations. Whole-genome sequencing data for 'NHLBI TOPMed: The Genetic Epidemiology of Asthma in Costa Rica' is available at https://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000988.v4.p1. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
RESUMO
Whole-exome sequencing using family data has identified rare coding variants in Mendelian diseases or complex diseases with Mendelian subtypes, using filters based on variant novelty, functionality, and segregation with the phenotype within families. However, formal statistical approaches are limited. We propose a gene-based segregation test (GESE) that quantifies the uncertainty of the filtering approach. It is constructed using the probability of segregation events under the null hypothesis of Mendelian transmission. This test takes into account different degrees of relatedness in families, the number of functional rare variants in the gene, and their minor allele frequencies in the corresponding population. In addition, a weighted version of this test allows incorporating additional subject phenotypes to improve statistical power. We show via simulations that the GESE and weighted GESE tests maintain appropriate type I error rate, and have greater power than several commonly used region-based methods. We apply our method to whole-exome sequencing data from 49 extended pedigrees with severe, early-onset chronic obstructive pulmonary disease (COPD) in the Boston Early-Onset COPD study (BEOCOPD) and identify several promising candidate genes. Our proposed methods show great potential for identifying rare coding variants of large effect and high penetrance for family-based sequencing data. The proposed tests are implemented in an R package that is available on CRAN (https://cran.r-project.org/web/packages/GESE/).
Assuntos
Variação Genética , Doença Pulmonar Obstrutiva Crônica/genética , Análise de Sequência de DNA/métodos , Idade de Início , Boston , Simulação por Computador , Bases de Dados Genéticas , Família , Genoma Humano , Humanos , Modelos Genéticos , Penetrância , Padrões de ReferênciaRESUMO
RATIONALE: Emphysema has considerable variability in the severity and distribution of parenchymal destruction throughout the lungs. Upper lobe-predominant emphysema has emerged as an important predictor of response to lung volume reduction surgery. Yet, aside from alpha-1 antitrypsin deficiency, the genetic determinants of emphysema distribution remain largely unknown. OBJECTIVES: To identify the genetic influences of emphysema distribution in non-alpha-1 antitrypsin-deficient smokers. METHODS: A total of 11,532 subjects with complete genotype and computed tomography densitometry data in the COPDGene (Genetic Epidemiology of Chronic Obstructive Pulmonary Disease [COPD]; non-Hispanic white and African American), ECLIPSE (Evaluation of COPD Longitudinally to Identify Predictive Surrogate Endpoints), and GenKOLS (Genetics of Chronic Obstructive Lung Disease) studies were analyzed. Two computed tomography scan emphysema distribution measures (difference between upper-third and lower-third emphysema; ratio of upper-third to lower-third emphysema) were tested for genetic associations in all study subjects. Separate analyses in each study population were followed by a fixed effect metaanalysis. Single-nucleotide polymorphism-, gene-, and pathway-based approaches were used. In silico functional evaluation was also performed. MEASUREMENTS AND MAIN RESULTS: We identified five loci associated with emphysema distribution at genome-wide significance. These loci included two previously reported associations with COPD susceptibility (4q31 near HHIP and 15q25 near CHRNA5) and three new associations near SOWAHB, TRAPPC9, and KIAA1462. Gene set analysis and in silico functional evaluation revealed pathways and cell types that may potentially contribute to the pathogenesis of emphysema distribution. CONCLUSIONS: This multicohort genome-wide association study identified new genomic loci associated with differential emphysematous destruction throughout the lungs. These findings may point to new biologic pathways on which to expand diagnostic and therapeutic approaches in chronic obstructive pulmonary disease. Clinical trial registered with www.clinicaltrials.gov (NCT 00608764).
Assuntos
Predisposição Genética para Doença/genética , Estudo de Associação Genômica Ampla/métodos , Enfisema Pulmonar/genética , Estudos de Coortes , Feminino , Humanos , Pulmão/diagnóstico por imagem , Pulmão/fisiopatologia , Masculino , Pessoa de Meia-Idade , Enfisema Pulmonar/diagnóstico por imagem , Enfisema Pulmonar/fisiopatologia , Fumar/fisiopatologia , Tomografia Computadorizada por Raios XRESUMO
RATIONALE: Chronic obstructive pulmonary disease (COPD) susceptibility is in part related to genetic variants. Most genetic studies have been focused on genome-wide common variants without a specific focus on coding variants, but common and rare coding variants may also affect COPD susceptibility. OBJECTIVES: To identify coding variants associated with COPD. METHODS: We tested nonsynonymous, splice, and stop variants derived from the Illumina HumanExome array for association with COPD in five study populations enriched for COPD. We evaluated single variants with a minor allele frequency greater than 0.5% using logistic regression. Results were combined using a fixed effects meta-analysis. We replicated novel single-variant associations in three additional COPD cohorts. MEASUREMENTS AND MAIN RESULTS: We included 6,004 control subjects and 6,161 COPD cases across five cohorts for analysis. Our top result was rs16969968 (P = 1.7 × 10(-14)) in CHRNA5, a locus previously associated with COPD susceptibility and nicotine dependence. Additional top results were found in AGER, MMP3, and SERPINA1. A nonsynonymous variant, rs181206, in IL27 (P = 4.7 × 10(-6)) was just below the level of exome-wide significance but attained exome-wide significance (P = 5.7 × 10(-8)) when combined with results from other cohorts. Gene expression datasets revealed an association of rs181206 and the surrounding locus with expression of multiple genes; several were differentially expressed in COPD lung tissue, including TUFM. CONCLUSIONS: In an exome array analysis of COPD, we identified nonsynonymous variants at previously described loci and a novel exome-wide significant variant in IL27. This variant is at a locus previously described in genome-wide associations with diabetes, inflammatory bowel disease, and obesity and appears to affect genes potentially related to COPD pathogenesis.
Assuntos
Exoma/genética , Predisposição Genética para Doença/genética , Interleucina-27/genética , Doença Pulmonar Obstrutiva Crônica/genética , Adulto , Idoso , Feminino , Frequência do Gene/genética , Humanos , Masculino , Pessoa de Meia-IdadeRESUMO
RATIONALE: Chronic obstructive pulmonary disease (COPD) is defined by the presence of airflow limitation on spirometry, yet subjects with COPD can have marked differences in computed tomography imaging. These differences may be driven by genetic factors. We hypothesized that a genome-wide association study (GWAS) of quantitative imaging would identify loci not previously identified in analyses of COPD or spirometry. In addition, we sought to determine whether previously described genome-wide significant COPD and spirometric loci were associated with emphysema or airway phenotypes. OBJECTIVES: To identify genetic determinants of quantitative imaging phenotypes. METHODS: We performed a GWAS on two quantitative emphysema and two quantitative airway imaging phenotypes in the COPDGene (non-Hispanic white and African American), ECLIPSE (Evaluation of COPD Longitudinally to Identify Predictive Surrogate Endpoints), NETT (National Emphysema Treatment Trial), and GenKOLS (Genetics of COPD, Norway) studies and on percentage gas trapping in COPDGene. We also examined specific loci reported as genome-wide significant for spirometric phenotypes related to airflow limitation or COPD. MEASUREMENTS AND MAIN RESULTS: The total sample size across all cohorts was 12,031, of whom 9,338 were from COPDGene. We identified five loci associated with emphysema-related phenotypes, one with airway-related phenotypes, and two with gas trapping. These loci included previously reported associations, including the HHIP, 15q25, and AGER loci, as well as novel associations near SERPINA10 and DLC1. All previously reported COPD and a significant number of spirometric GWAS loci were at least nominally (P < 0.05) associated with either emphysema or airway phenotypes. CONCLUSIONS: Genome-wide analysis may identify novel risk factors for quantitative imaging characteristics in COPD and also identify imaging features associated with previously identified lung function loci.
Assuntos
Pulmão/diagnóstico por imagem , Enfisema Pulmonar/genética , Idoso , Proteínas de Transporte/genética , Estudos de Coortes , Feminino , Proteínas Ativadoras de GTPase/genética , Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Humanos , Processamento de Imagem Assistida por Computador , Proteína 2 Reguladora do Ferro/genética , Estudos Longitudinais , Masculino , Glicoproteínas de Membrana/genética , Pessoa de Meia-Idade , Proteínas do Tecido Nervoso/genética , Fenótipo , Doença Pulmonar Obstrutiva Crônica/diagnóstico por imagem , Doença Pulmonar Obstrutiva Crônica/genética , Enfisema Pulmonar/diagnóstico por imagem , Receptor para Produtos Finais de Glicação Avançada/genética , Receptores Nicotínicos/genética , Serpinas/genética , Tomografia Computadorizada por Raios X , Proteínas Supressoras de Tumor/genéticaRESUMO
Many correlated disease variables are analyzed jointly in genetic studies in the hope of increasing power to detect causal genetic variants. One approach involves assessing the relationship between each phenotype and each SNP individually and using a Bonferroni correction for the effective number of tests conducted. Alternatively, one can apply a multivariate regression or a dimension reduction technique, such as principal component analysis, and test for the association with the principal components of the phenotypes rather than the individual phenotypes. Inspired by the previous approaches of combining phenotypes to maximize heritability at individual SNPs, in this paper, we propose to construct a maximally heritable (MaxH) phenotype by taking advantage of the estimated total heritability and co-heritability. The heritability and co-heritability only need to be estimated once; therefore, our method is applicable to genome-wide scans. The MaxH phenotype is a linear combination of the individual phenotypes with increased heritability and power over the phenotypes being combined. Simulations show that the heritability and power achieved agree well with the theory for large samples and two phenotypes. We compare our approach with commonly used methods and assess both the heritability and the power of the MaxH phenotype. Moreover, we provide suggestions for how to choose the phenotypes for combination. An application of our approach to a GWAS on chronic obstructive pulmonary disease shows its practical relevance.
Assuntos
Modelos Genéticos , Fenótipo , Estudo de Associação Genômica Ampla , Humanos , Doença Pulmonar Obstrutiva Crônica/genéticaRESUMO
BACKGROUND: Pulmonary function decline is a major contributor to morbidity and mortality among smokers. Post bronchodilator FEV1 and FEV1/FVC ratio are considered the standard assessment of airflow obstruction. We performed a genome-wide association study (GWAS) in 9919 current and former smokers in the COPDGene study (6659 non-Hispanic Whites [NHW] and 3260 African Americans [AA]) to identify associations with spirometric measures (post-bronchodilator FEV1 and FEV1/FVC). We also conducted meta-analysis of FEV1 and FEV1/FVC GWAS in the COPDGene, ECLIPSE, and GenKOLS cohorts (total n = 13,532). RESULTS: Among NHW in the COPDGene cohort, both measures of pulmonary function were significantly associated with SNPs at the 15q25 locus [containing CHRNA3/5, AGPHD1, IREB2, CHRNB4] (lowest p-value = 2.17 × 10(-11)), and FEV1/FVC was associated with a genomic region on chromosome 4 [upstream of HHIP] (lowest p-value = 5.94 × 10(-10)); both regions have been previously associated with COPD. For the meta-analysis, in addition to confirming associations to the regions near CHRNA3/5 and HHIP, genome-wide significant associations were identified for FEV1 on chromosome 1 [TGFB2] (p-value = 8.99 × 10(-9)), 9 [DBH] (p-value = 9.69 × 10(-9)) and 19 [CYP2A6/7] (p-value = 3.49 × 10(-8)) and for FEV1/FVC on chromosome 1 [TGFB2] (p-value = 8.99 × 10(-9)), 4 [FAM13A] (p-value = 3.88 × 10(-12)), 11 [MMP3/12] (p-value = 3.29 × 10(-10)) and 14 [RIN3] (p-value = 5.64 × 10(-9)). CONCLUSIONS: In a large genome-wide association study of lung function in smokers, we found genome-wide significant associations at several previously described loci with lung function or COPD. We additionally identified a novel genome-wide significant locus with FEV1 on chromosome 9 [DBH] in a meta-analysis of three study populations.
Assuntos
População Negra/genética , Loci Gênicos , Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Fumar/genética , População Branca/genética , Broncodilatadores/farmacologia , Cromossomos Humanos Par 15/genética , Cromossomos Humanos Par 4/genética , Estudos de Coortes , Feminino , Volume Expiratório Forçado , Genoma Humano , Humanos , Masculino , Metanálise como Assunto , Pessoa de Meia-Idade , Fatores de Risco , EspirometriaRESUMO
RATIONALE: Previous studies of chronic obstructive pulmonary disease (COPD) have suggested that genetic factors play an important role in the development of disease. However, single-nucleotide polymorphisms that are associated with COPD in genome-wide association studies have been shown to account for only a small percentage of the genetic variance in phenotypes of COPD, such as spirometry and imaging variables. These phenotypes are highly predictive of disease, and family studies have shown that spirometric phenotypes are heritable. OBJECTIVES: To assess the heritability and coheritability of four major COPD-related phenotypes (measurements of FEV1, FEV1/FVC, percent emphysema, and percent gas trapping), and COPD affection status in smokers of non-Hispanic white and African American descent using a population design. METHODS: Single-nucleotide polymorphisms from genome-wide association studies chips were used to calculate the relatedness of pairs of individuals and a mixed model was adopted to estimate genetic variance and covariance. MEASUREMENTS AND MAIN RESULTS: In the non-Hispanic whites, estimated heritabilities of FEV1 and FEV1/FVC were both about 37%, consistent with estimates in the literature from family-based studies. For chest computed tomography scan phenotypes, estimated heritabilities were both close to 25%. Heritability of COPD affection status was estimated as 37.7% in both populations. CONCLUSIONS: This study suggests that a large portion of the genetic risk of COPD is yet to be discovered and gives rationale for additional genetic studies of COPD. The estimates of coheritability (genetic covariance) for pairs of the phenotypes suggest considerable overlap of causal genetic loci.
Assuntos
Doença Pulmonar Obstrutiva Crônica/genética , Fumar/efeitos adversos , Feminino , Volume Expiratório Forçado/genética , Predisposição Genética para Doença/genética , Humanos , Masculino , Pessoa de Meia-Idade , Análise de Sequência com Séries de Oligonucleotídeos , Fenótipo , Polimorfismo de Nucleotídeo Único/genética , Doença Pulmonar Obstrutiva Crônica/etiologia , Doença Pulmonar Obstrutiva Crônica/fisiopatologia , Grupos Raciais/genética , Fumar/fisiopatologia , Capacidade Vital/genéticaRESUMO
Rapid advances in sequencing technologies set the stage for the large-scale medical sequencing efforts to be performed in the near future, with the goal of assessing the importance of rare variants in complex diseases. The discovery of new disease susceptibility genes requires powerful statistical methods for rare variant analysis. The low frequency and the expected large number of such variants pose great difficulties for the analysis of these data. We propose here a robust and powerful testing strategy to study the role rare variants may play in affecting susceptibility to complex traits. The strategy is based on assessing whether rare variants in a genetic region collectively occur at significantly higher frequencies in cases compared with controls (or vice versa). A main feature of the proposed methodology is that, although it is an overall test assessing a possibly large number of rare variants simultaneously, the disease variants can be both protective and risk variants, with moderate decreases in statistical power when both types of variants are present. Using simulations, we show that this approach can be powerful under complex and general disease models, as well as in larger genetic regions where the proportion of disease susceptibility variants may be small. Comparisons with previously published tests on simulated data show that the proposed approach can have better power than the existing methods. An application to a recently published study on Type-1 Diabetes finds rare variants in gene IFIH1 to be protective against Type-1 Diabetes.
Assuntos
Simulação por Computador , Predisposição Genética para Doença , Testes Genéticos/estatística & dados numéricos , Estudo de Associação Genômica Ampla/estatística & dados numéricos , Algoritmos , RNA Helicases DEAD-box/genética , Interpretação Estatística de Dados , Diabetes Mellitus Tipo 1/genética , Variação Genética , Haplótipos/genética , Humanos , Helicase IFIH1 Induzida por Interferon , Fatores de Risco , Análise de Sequência de DNARESUMO
The Human Genome Project and its spin-offs are making it increasingly feasible to determine the genetic basis of complex traits using genome-wide association studies. The statistical challenge of analyzing such studies stems from the severe multiple-comparison problem resulting from the analysis of thousands of SNPs. Our methodology for genome-wide family-based association studies, using single SNPs or haplotypes, can identify associations that achieve genome-wide significance. In relation to developing guidelines for our screening tools, we determined lower bounds for the estimated power to detect the gene underlying the disease-susceptibility locus, which hold regardless of the linkage disequilibrium structure present in the data. We also assessed the power of our approach in the presence of multiple disease-susceptibility loci. Our screening tools accommodate genomic control and use the concept of haplotype-tagging SNPs. Our methods use the entire sample and do not require separate screening and validation samples to establish genome-wide significance, as population-based designs do.
Assuntos
Predisposição Genética para Doença , Desequilíbrio de Ligação , Linhagem , Asma/genética , Simulação por Computador , Genoma Humano , Haplótipos , Humanos , Interleucina-10/genética , Polimorfismo de Nucleotídeo Único , SoftwareRESUMO
BACKGROUND: We previously reported that asthmatic children with GSTM1 null genotype may be more susceptible to the acute effect of ozone on the small airways and might benefit from antioxidant supplementation. This study aims to assess the acute effect of ozone on lung function (FEF(25-75)) in asthmatic children according to dietary intake of vitamin C and the number of putative risk alleles in three antioxidant genes: GSTM1, GSTP1 (rs1695), and NQO1 (rs1800566). METHODS: 257 asthmatic children from two cohort studies conducted in Mexico City were included. Stratified linear mixed models with random intercepts and random slopes on ozone were used. Potential confounding by ethnicity was assessed. Analyses were conducted under single gene and genotype score approaches. RESULTS: The change in FEF(25-75) per interquartile range (60 ppb) of ozone in persistent asthmatic children with low vitamin C intake and GSTM1 null was -91.2 ml/s (p = 0.06). Persistent asthmatic children with 4 to 6 risk alleles and low vitamin C intake showed an average decrement in FEF(25-75) of 97.2 ml/s per 60 ppb of ozone (p = 0.03). In contrast in children with 1 to 3 risk alleles, acute effects of ozone on FEF25-75 did not differ by vitamin C intake. CONCLUSIONS: Our results provide further evidence that asthmatic children predicted to have compromised antioxidant defense by virtue of genetic susceptibility combined with deficient antioxidant intake may be at increased risk of adverse effects of ozone on pulmonary function.
Assuntos
Antioxidantes/administração & dosagem , Ácido Ascórbico/administração & dosagem , Asma/genética , Suplementos Nutricionais , Exposição Ambiental/efeitos adversos , Enzimas/genética , Interação Gene-Ambiente , Predisposição Genética para Doença , Ozônio/efeitos adversos , Fatores Etários , Deficiência de Ácido Ascórbico/tratamento farmacológico , Deficiência de Ácido Ascórbico/epidemiologia , Asma/diagnóstico , Asma/enzimologia , Asma/epidemiologia , Asma/fisiopatologia , Asma/prevenção & controle , Criança , Estudos de Coortes , Método Duplo-Cego , Feminino , Glutationa S-Transferase pi/genética , Glutationa Transferase/genética , Humanos , Modelos Lineares , Pulmão/fisiopatologia , Masculino , Fluxo Máximo Médio Expiratório , México/epidemiologia , NAD(P)H Desidrogenase (Quinona)/genética , Fenótipo , Polimorfismo Genético , Medição de Risco , Fatores de Risco , Saúde da População UrbanaRESUMO
Identifying population stratification and genotyping error are important for candidate gene association studies using the Transmission Disequilibrium Test (TDT). Although the TDT retains the prespecified Type I error in the presence of population stratification, the test may have decreased power in the presence of population stratification. Genotyping error can also cause the TDT to have an elevated Type I error. Differentiating population stratification from genotyping error remains a challenge for geneticists. Both genotyping error and population stratification can result in an increase in the observed homozygosity of a sample relative to that expected assuming Hardy-Weinberg Equilibrium (HWE). We show that when family data are available, even if a limited number of markers are genotyped, evaluating the markers that show statistically significant deviation from HWE with the Mating Type Distortion Test (MTDT)--a test based on the mating type distribution--can reliably differentiate genotyping error from population stratification. We simulate data based on several models of genotyping error in previously published literature, and show how this method could be used in practice to assist in differentiating population stratification from systematic genotyping error.
Assuntos
Técnicas Genéticas , Técnicas de Genotipagem , Família , Estudo de Associação Genômica Ampla , Homozigoto , Humanos , Desequilíbrio de Ligação , Modelos GenéticosRESUMO
Compositional epistasis is said to be present when the effect of a genetic factor at one locus is masked by a variant at another locus. Although such compositional epistasis is not equivalent to the presence of an interaction in a statistical model, non-standard tests can sometimes be used to detect compositional epistasis. In this paper we consider empirical tests for compositional epistasis under models for the joint effect of two genetic factors which place no restrictions on the main effects of each factor but constrain the interactive effects of the two factors so as to be captured by a single parameter in the model. We describe the implications of these tests for cohort, case-control, case-only and family-based study designs and we illustrate the methods using an example of gene-gene interaction already reported in the literature.
Assuntos
Epistasia Genética , Modelos Genéticos , Artrite Reumatoide/genética , Estudos de Casos e Controles , Estudos de Coortes , Família , Antígenos HLA-DR/genética , Cadeias HLA-DRB1 , HumanosRESUMO
Investigators sometimes use information obtained from multiple informants about a given variable. We focus on estimating the effect of a predictor on a continuous outcome, when that (true) predictor cannot be observed directly but is measured by 2 informants. We describe various approaches to using information from 2 informants to estimate a regression or correlation coefficient for the effect of the (true) predictor on the outcome. These approaches include methods we refer to as single informant, simple average, optimal weighted average, principal components analysis, and classical measurement error. Each of these 5 methods effectively uses a weighted average of the informants' reports as a proxy for the true predictor in calculating the correlation or regression coefficient. We compare the performance of these methods in simulation experiments that assume a rounded congeneric measurement model for the relationship between the informants' reports and a true predictor that is a mixture of zeros and positively distributed continuous values. We also compare the methods' performance in a real data example-the relationship between vigorous physical activity (the predictor) and body mass index (the continuous outcome). The results of the simulations and the example suggest that the simple average is a reasonable choice when there are only 2 informants.
Assuntos
Índice de Massa Corporal , Atividade Motora/fisiologia , Obesidade/epidemiologia , Valor Preditivo dos Testes , Exercício Físico/fisiologia , Feminino , Humanos , Modelos Lineares , Masculino , Modelos EstatísticosRESUMO
Longitudinal studies are an important tool for analysing traits that change over time, depending on individual characteristics and environmental exposures. Complex quantitative traits, such as lung function, may change over time and appear to depend on genetic and environmental factors, as well as on potential gene-environment interactions. There is a growing interest in modelling both marginal genetic effects and gene-environment interactions. In an admixed population, the use of traditional statistical models may fail to adjust for confounding by ethnicity, leading to bias in the genetic effect estimates. A variety of methods have been developed to account for the genetic substructure of human populations. Family-based designs provide an important resource for avoiding confounding due to admixture. To date, however, most genetic analyses have been applied to cross-sectional designs. In this paper, we propose a methodology which aims to improve the assessment of main genetic effect and gene-environment interaction effects by combining the advantages of both longitudinal studies for continuous phenotypes, and the family-based designs. This approach is based on an extension of ordinary linear mixed models for quantitative phenotypes, which incorporates information from a case-parent design. Our results indicate that use of this method allows both main genetic and gene-environment interaction effects to be estimated without bias, even in the presence of population substructure.
Assuntos
Meio Ambiente , Genes , Estudos de Associação Genética , Família , Humanos , Estudos Longitudinais , FenótipoRESUMO
It is useful to have robust gene-environment interaction tests that can utilize a variety of family structures in an efficient way. This article focuses on tests for gene-environment interaction in the presence of main genetic and environmental effects. The objective is to develop powerful tests that can combine trio data with parental genotypes and discordant sibships when parents' genotypes are missing. We first make a modest improvement on a method for discordant sibs (discordant on phenotype), but the approach does not allow one to use families when all offspring are affected, e.g., trios. We then make a modest improvement on a Mendelian transmission-based approach that is inefficient when discordant sibs are available, but can be applied to any nuclear family. Finally, we propose a hybrid approach that utilizes the most efficient method for a specific family type, then combines over families. We utilize this hybrid approach to analyze a chronic obstructive pulmonary disorder dataset to test for gene-environment interaction in the Serpine2 gene with smoking. The methods are freely available in the R package fbati.
Assuntos
Interação Gene-Ambiente , Ligação Genética/genética , Predisposição Genética para Doença/genética , Modelos Genéticos , Núcleo Familiar , Serpina E2/genética , Fumar/genética , Simulação por Computador , Predisposição Genética para Doença/epidemiologia , Humanos , Metagenômica/métodos , Fumar/epidemiologiaRESUMO
In clinical trials multiple outcomes are often used to assess treatment interventions. This paper presents an evaluation of likelihood-based methods for jointly testing treatment effects in clinical trials with multiple continuous outcomes. Specifically, we compare the power of joint tests of treatment effects obtained from joint models for the multiple outcomes with univariate tests based on modeling the outcomes separately. We also consider the power and bias of tests when data are missing, a common feature of many trials, especially in psychiatry. Our results suggest that joint tests capitalize on the correlation of multiple outcomes and are more powerful than standard univariate methods, especially when outcomes are missing completely at random. When outcomes are missing at random, test procedures based on correctly specified joint models are unbiased, while standard univariate procedures are not. Results of a simulation study are reported, and the methods are illustrated in an example from the Clinical Antipsychotic Trials of Intervention Effectiveness for schizophrenia.