RESUMO
One in three people has been infected with Mycobacterium tuberculosis (MTB), and the risk for MTB infection in HIV-infected individuals is even higher. We hypothesized that HIV-positive individuals living in tuberculosis-endemic regions who do not get infected by Mycobacterium tuberculosis are genetically resistant. Using an "experiment of nature" design that proved successful in our previous work, we performed a genome-wide association study of tuberculin skin test positivity using 469 HIV-positive patients from prospective study cohorts of tuberculosis from Tanzania and Uganda to identify genetic loci associated with MTB infection in the context of HIV-infection. Among these individuals, 244 tested were tuberculin skin test (TST) positive either at enrollment or during the >8 year follow up, while 225 were not. We identified a genome-wide significant association between a dominant model of rs877356 and binary TST status in the combined cohort (Odds ratio = 0.2671, p = 1.22x10-8). Association was replicated with similar significance when examining TST induration as a continuous trait. The variant lies in the 5q31.1 region, 57kb downstream from IL9. Two-locus analyses of association of variants near rs877356 showed a haplotype comprised of rs877356 and an IL9 missense variant, rs2069885, had the most significant association (p = 1.59x10-12). We also replicated previously linked loci on chromosomes 2, 5, and 11. IL9 is a cytokine produced by mast cells and TH2 cells during inflammatory responses, providing a possible link between airway inflammation and protection from MTB infection. Our results indicate that studying uninfected, HIV-positive participants with extensive exposure increases the power to detect associations in complex infectious disease.
Assuntos
Cromossomos Humanos Par 5/genética , Estudo de Associação Genômica Ampla , Infecções por HIV/genética , Tuberculose/genética , Adulto , Doenças Endêmicas , Feminino , HIV/genética , HIV/patogenicidade , Infecções por HIV/complicações , Infecções por HIV/microbiologia , Infecções por HIV/virologia , Haplótipos/genética , Humanos , Masculino , Mycobacterium tuberculosis/genética , Mycobacterium tuberculosis/patogenicidade , Testes Cutâneos , Tanzânia , Teste Tuberculínico , Tuberculose/complicações , Tuberculose/microbiologia , Tuberculose/virologia , UgandaRESUMO
Immunosuppression resulting from HIV infection increases the risk of progression to active tuberculosis (TB) both in individuals newly exposed to Mycobacterium tuberculosis (MTB) and in those with latent infections. We hypothesized that HIV-positive individuals who do not develop TB, despite living in areas where it is hyperendemic, provide a model of natural resistance. We performed a genome-wide association study of TB resistance by using 581 HIV-positive Ugandans and Tanzanians enrolled in prospective cohort studies of TB; 267 of these individuals developed active TB, and 314 did not. A common variant, rs4921437 at 5q33.3, was significantly associated with TB (odds ratio = 0.37, p = 2.11 × 10(-8)). This variant lies within a genomic region that includes IL12B and is embedded in an H3K27Ac histone mark. The locus also displays consistent patterns of linkage disequilibrium across African populations and has signals of strong selection in populations from equatorial Africa. Along with prior studies demonstrating that therapy with IL-12 (the cytokine encoded in part by IL12B, associated with longer survival following MTB infection in mice deficient in CD4 T cells), our results suggest that this pathway might be an excellent target for the development of new modalities for treating TB, especially for HIV-positive individuals. Our results also indicate that studying extreme disease resistance in the face of extensive exposure can increase the power to detect associations in complex infectious disease.
Assuntos
Loci Gênicos , Predisposição Genética para Doença , Subunidade p40 da Interleucina-12/genética , Tuberculose/genética , Adolescente , Feminino , Frequência do Gene , Estudo de Associação Genômica Ampla , Infecções por HIV/microbiologia , Humanos , Subunidade p40 da Interleucina-12/metabolismo , Desequilíbrio de Ligação , Modelos Logísticos , Masculino , Mycobacterium tuberculosis , Estudos Prospectivos , Fatores de Risco , Tanzânia , Tuberculose/diagnóstico , UgandaRESUMO
Helicobacter pylori is the principal cause of gastric cancer, the second leading cause of cancer mortality worldwide. However, H. pylori prevalence generally does not predict cancer incidence. To determine whether coevolution between host and pathogen influences disease risk, we examined the association between the severity of gastric lesions and patterns of genomic variation in matched human and H. pylori samples. Patients were recruited from two geographically distinct Colombian populations with significantly different incidences of gastric cancer, but virtually identical prevalence of H. pylori infection. All H. pylori isolates contained the genetic signatures of multiple ancestries, with an ancestral African cluster predominating in a low-risk, coastal population and a European cluster in a high-risk, mountain population. The human ancestry of the biopsied individuals also varied with geography, with mostly African ancestry in the coastal region (58%), and mostly Amerindian ancestry in the mountain region (67%). The interaction between the host and pathogen ancestries completely accounted for the difference in the severity of gastric lesions in the two regions of Colombia. In particular, African H. pylori ancestry was relatively benign in humans of African ancestry but was deleterious in individuals with substantial Amerindian ancestry. Thus, coevolution likely modulated disease risk, and the disruption of coevolved human and H. pylori genomes can explain the high incidence of gastric disease in the mountain population.
Assuntos
Suscetibilidade a Doenças , Evolução Molecular , Infecções por Helicobacter/microbiologia , Helicobacter pylori/genética , Gastropatias/microbiologia , Adulto , Idoso , Infecções por Helicobacter/complicações , Humanos , Pessoa de Meia-IdadeRESUMO
Gastric cancer incidence varies considerably among populations, even those with comparable rates of Helicobacter pylori infection. To test the hypothesis that genetic variation plays a role in gastric disease, we assessed the relationship between genotypes and gastric histopathology in a Colombian study population, using a genotyping array of immune-related single nucleotide polymorphisms (SNPs). Two synonymous SNPs (rs6061243 and rs6587239) were associated with progression of premalignant gastric lesions in a dominant-effects model after correction for multiple comparisons (p = 2.63E-07 and p = 7.97E-07, respectively); effect sizes were ß = -0.863 and ß = -0.815, respectively, where ß is an estimate of effect on histopathology scores, which ranged from 1 (normal) to 5 (dysplasia). In our replication cohort, a second Colombian population, both SNPs were associated with histopathology when additively modeled (ß = -0.256, 95 % CI = -0.47, -0.039; and ß = -0.239, 95 % CI = -0.45, -0.024), and rs6587239 was significantly associated in a dominant-effects model (ß = -0.330, 95 % CI = -0.66, 0.00). Because promoter methylation of GATA5 has previously been associated with gastric cancer, we also tested for the association of methylation status with more advanced histopathology scores in our samples and found a significant relationship (p = 0.001). A multivariate regression model revealed that the effects of both the promoter methylation and the exonic SNPs in GATA5 were independent. A SNP-by-methylation interaction term was also significant. This interaction between GATA5 variants and GATA5 promoter methylation indicates that the association of either factor with gastric disease progression is modified by the other.
Assuntos
Metilação de DNA/genética , Epigenômica , Fator de Transcrição GATA5/genética , Infecções por Helicobacter/genética , Neoplasias Gástricas/genética , Adulto , Feminino , Estudos de Associação Genética , Genótipo , Infecções por Helicobacter/microbiologia , Infecções por Helicobacter/patologia , Helicobacter pylori/patogenicidade , Humanos , Masculino , Pessoa de Meia-Idade , Polimorfismo de Nucleotídeo Único , Regiões Promotoras Genéticas , Fatores de Risco , Neoplasias Gástricas/microbiologia , Neoplasias Gástricas/patologiaRESUMO
The number of effectively independent tests performed in genome-wide association studies (GWAS) varies by population, making a universal P-value threshold inappropriate. We estimated the number of independent SNPs in Phase 3 HapMap samples by: (1) the LD-pruning function in PLINK, and (2) an autocorrelation-based approach. Autocorrelation was also used to estimate the number of independent SNPs in whole genome sequences from 1000 Genomes. Both approaches yielded consistent estimates of numbers of independent SNPs, which were used to calculate new population-specific thresholds for genome-wide significance. African populations had the most stringent thresholds (1.49 × 10(-7) for YRI at r(2) = 0.3), East Asian populations the least (3.75 × 10(-7) for JPT at r(2) = 0.3). We also assessed how using population-specific significance thresholds compared to using a single multiple testing threshold at the conventional 5 × 10(-8) cutoff. Applied to a previously published GWAS of melanoma in Caucasians, our approach identified two additional genes, both previously associated with the phenotype. In a Chinese breast cancer GWAS, our approach identified 48 additional genes, 19 of which were in or near genes previously associated with the phenotype. We conclude that the conventional genome-wide significance threshold generates an excess of Type 2 errors, particularly in GWAS performed on more recently founded populations.
Assuntos
Estudo de Associação Genômica Ampla , Projeto HapMap , Melanoma/genética , Povo Asiático/genética , Genética Populacional , Humanos , Polimorfismo de Nucleotídeo Único , População Branca/genéticaRESUMO
Using genetic data from an obesity candidate gene study of self-reported African Americans and European Americans, we investigated the number of Ancestry Informative Markers (AIMs) and candidate gene SNPs necessary to infer continental ancestry. Proportions of African and European ancestry were assessed with STRUCTURE (K = 2), using 276 AIMs. These reference values were compared to estimates derived using 120, 60, 30, and 15 SNP subsets randomly chosen from the 276 AIMs and from 1144 SNPs in 44 candidate genes. All subsets generated estimates of ancestry consistent with the reference estimates, with mean correlations greater than 0.99 for all subsets of AIMs, and mean correlations of 0.99 ± 0.003; 0.98 ± 0.01; 0.93 ± 0.03; and 0.81 ± 0.11 for subsets of 120, 60, 30, and 15 candidate gene SNPs, respectively. Among African Americans, the median absolute difference from reference African ancestry values ranged from 0.01 to 0.03 for the four AIMs subsets and from 0.03 to 0.09 for the four candidate gene SNP subsets. Furthermore, YRI/CEU Fst values provided a metric to predict the performance of candidate gene SNPs. Our results demonstrate that a small number of SNPs randomly selected from candidate genes can be used to estimate admixture proportions in African Americans reliably.
Assuntos
Negro ou Afro-Americano/genética , Polimorfismo de Nucleotídeo Único , População Branca/genética , Feminino , Marcadores Genéticos , Genótipo , Humanos , Masculino , Obesidade/etnologia , Obesidade/genéticaRESUMO
Plasma concentration of plasminogen activator inhibitor-1 (PAI-1) is highly correlated with several cardiovascular disease (CVD) risk factors. It also plays a direct role in CVD, including myocardial infarction and stroke, by impeding the dissolution of thrombi in the blood. Insofar as PAI-1 links CVD's risk factors to its endpoints, genetic variants modulating the relationship between PAI-1 and risk factors may be of particular clinical and biological interest. The high heritability of PAI-1, which has not been explained by genetic association studies, may also, in large part, be due to this relationship with CVD risk factors. Using exome-wide data from 1,032 Ghanaian study participants, we tested for heterogeneity of correlation by genotype between PAI-1 and 4 CVD risk factors (body mass index, triglycerides, mean arterial pressure, and fasting glucose) under the hypothesis that loci involved in the relationship between PAI-1 and other risk factors will also modify their correlational structure. We found more significant heterogeneities of correlation by genotype than we found marginal effects, with no evidence of type I inflation. The most significant result among all univariate and multivariate tests performed in this study was the heterogeneity of correlation between PAI-1 and mean arterial pressure at rs10738554, near SLC24A2, a gene previously associated with high blood pressure in African Americans.
Assuntos
Doenças Cardiovasculares/genética , DNA/genética , Inibidor 1 de Ativador de Plasminogênio/genética , Polimorfismo de Nucleotídeo Único , Medição de Risco , Doenças Cardiovasculares/epidemiologia , Exoma , Genótipo , Gana/epidemiologia , Humanos , Morbidade/tendências , Inibidor 1 de Ativador de Plasminogênio/metabolismo , Fatores de RiscoRESUMO
We present here the draft genomes of 13 Helicobacter pylori strains isolated from Colombian residents on the Pacific coast (n = 6) and in the Andes mountains (n = 7), locations that differ in gastric cancer risk. These 13 strains were obtained from individuals with diagnosed gastric lesions.
RESUMO
BACKGROUND: Metabolic syndrome (MetS) is diagnosed by the presence of at least 3 of the following: obesity, hypertension, hyperglycemia, hypertriglyceridemia, and low high-density lipoprotein. Individuals with MetS also typically have elevated plasma levels of the antifibrinolytic factor, plasminogen activator inhibitor-1 (PAI-1), but the relationships between PAI-1 and MetS diagnostic criteria are not clear. Understanding these relationships can elucidate the relevance of MetS to cardiovascular disease risk, because PAI-1 is associated with ischemic events and directly involved in thrombosis. METHODS AND RESULTS: In a cross-sectional analysis of 2220 Ghanaian men and women from urban and rural locales, we found the age-standardized prevalence of MetS to be as high as 21.4% (urban women). PAI-1 level increased exponentially as the number of diagnostic criteria increased linearly (P<10-13), supporting the conclusion that MetS components have a joint effect that is stronger than their additive contributions. Body mass index, triglycerides, and fasting glucose were more strongly correlated with PAI-1 than with canonical MetS criteria, and this pattern did not change when pair-wise correlations were conditioned on all other risk factors, supporting an independent role for PAI-1 in MetS. Finally, whereas the correlations between conventional risk factors did not vary significantly by sex or across urban and rural environments, correlations with PAI-1 were generally stronger among urban participants. CONCLUSIONS: MetS prevalence in the West African population we studied was comparable to that of the industrialized West. PAI-1 may serve as a key link between MetS, as currently defined, and the endpoints with which it is associated. Whether this association is generalizable will require follow-up.
Assuntos
Síndrome Metabólica/sangue , Inibidor 1 de Ativador de Plasminogênio/sangue , Adolescente , Adulto , Anti-Hipertensivos/uso terapêutico , Glicemia/metabolismo , Pressão Sanguínea , Índice de Massa Corporal , HDL-Colesterol/sangue , Estudos Transversais , Diabetes Mellitus/tratamento farmacológico , Diabetes Mellitus/epidemiologia , Jejum , Feminino , Gana/epidemiologia , Humanos , Hipertensão/tratamento farmacológico , Hipertensão/epidemiologia , Hipoglicemiantes/uso terapêutico , Masculino , Síndrome Metabólica/epidemiologia , Pessoa de Meia-Idade , Prevalência , População Rural/estatística & dados numéricos , Triglicerídeos/sangue , População Urbana/estatística & dados numéricos , Adulto JovemRESUMO
Populations in sub-Saharan Africa are shifting from rural to increasingly urban. Although the burden of cardiovascular disease is expected to increase with this changing landscape, few large studies have assessed a wide range of risk factors in urban and rural populations, particularly in West Africa. We conducted a cross-sectional, population-based survey of 3317 participants from Ghana (≥18 years old), of whom 2265 (57% female) were from a mid-sized city (Sunyani, population ~250,000) and 1052 (55% female) were from surrounding villages (populations <5000). We measured canonical cardiovascular disease risk factors (BMI, blood pressure, fasting glucose, lipids) and fibrinolytic markers (PAI-1 and t-PA), and assessed how their distributions and related clinical outcomes (including obesity, hypertension and diabetes) varied with urban residence and sex. Urban residence was strongly associated with obesity (OR: 7.8, 95% CI: 5.3-11.3), diabetes (OR 3.6, 95% CI: 2.3-5.7), and hypertension (OR 3.2, 95% CI: 2.6-4.0). Among the quantitative measures, most affected were total cholesterol (+0.81 standard deviations, 95% CI 0.73-0.88), LDL cholesterol (+0.89, 95% CI: 0.79-0.99), and t-PA (+0.56, 95% CI: 0.48-0.63). Triglycerides and HDL cholesterol profiles were similarly poor in both urban and rural environments, but significantly worse among rural participants after BMI-adjustment. For most of the risk factors, the strength of the association with urban residence did not vary with sex. Obesity was a major exception, with urban women at particularly high risk (26% age-standardized prevalence) compared to urban men (7%). Overall, urban residents had substantially worse cardiovascular risk profiles, with some risk factors at levels typically seen in the developed world.
Assuntos
Doenças Cardiovasculares/etiologia , Urbanização , Adolescente , Adulto , Idoso , Idoso de 80 Anos ou mais , Glicemia/análise , Pressão Sanguínea , Índice de Massa Corporal , Doenças Cardiovasculares/epidemiologia , Colesterol/sangue , HDL-Colesterol/sangue , LDL-Colesterol/sangue , Estudos Transversais , Diabetes Mellitus Tipo 2/complicações , Diabetes Mellitus Tipo 2/epidemiologia , Feminino , Gana/epidemiologia , Humanos , Hipertensão/complicações , Hipertensão/epidemiologia , Masculino , Pessoa de Meia-Idade , Obesidade/complicações , Obesidade/epidemiologia , Inibidor 1 de Ativador de Plasminogênio/sangue , Prevalência , Fatores de Risco , Fumar , Inquéritos e Questionários , Ativador de Plasminogênio Tecidual/sangue , Triglicerídeos/sangue , Adulto JovemRESUMO
Genetic studies of human diseases have identified many variants associated with pathogenesis and severity. However, most studies have used only statistical association to assess putative relationships to disease, and ignored other factors for evaluation. For example, evolution is a factor that has shaped disease risk, changing allele frequencies as human populations migrated into and inhabited new environments. Since many common variants differ among populations in frequency, as does disease prevalence, we hypothesized that patterns of disease and population structure, taken together, will inform association studies. Thus, the population distributions of allelic risk variants should reflect the distributions of their associated diseases. Evolutionary Triangulation (ET) exploits this evolutionary differentiation by comparing population structure among three populations with variable patterns of disease prevalence. By selecting populations based on patterns where two have similar rates of disease that differ substantially from a third, we performed a proof of principle analysis for this method. We examined three disease phenotypes, lactase persistence, melanoma, and Type 2 diabetes mellitus. We show that for lactase persistence, a phenotype with a simple genetic architecture, ET identifies the key gene, lactase. For melanoma, ET identifies several genes associated with this disease and/or phenotypes related to it, such as skin color genes. ET was less obviously successful for Type 2 diabetes mellitus, perhaps because of the small effect sizes in known risk loci and recent environmental changes that have altered disease risk. Alternatively, ET may have revealed new genes involved in conferring disease risk for diabetes that did not meet nominal GWAS significance thresholds. We also compared ET to another method used to filter for phenotype associated genes, population branch statistic (PBS), and show that ET performs better in identifying genes known to associate with diseases appropriately distributed among populations. Our results indicate that ET can filter association results to improve our ability to discover disease loci.
RESUMO
AIM: We sought to identify potential pleiotropy involving pharmacogenes. METHODS: We tested 184 functional variants in 34 pharmacogenes for associations using a custom grouping of International Classification and Disease, Ninth Revision billing codes extracted from deidentified electronic health records of 6892 patients. RESULTS: We replicated several associations including ABCG2 (rs2231142) and gout (p = 1.73 × 10(-7); odds ratio [OR]: 1.73; 95% CI: 1.40-2.12); and SLCO1B1 (rs4149056) and jaundice (p = 2.50 × 10(-4); OR: 1.67; 95% CI: 1.27-2.20). CONCLUSION: In this systematic screen for phenotypic associations with functional variants, several novel genotype-phenotype combinations also achieved phenome-wide significance, including SLC15A2 rs1143672 and renal osteodystrophy (p = 2.67 × 10(-) (6); OR: 0.61; 95% CI: 0.49-0.75).
Assuntos
Pleiotropia Genética , Farmacogenética , Adulto , Negro ou Afro-Americano , Citocromo P-450 CYP2C19/genética , Feminino , Estudo de Associação Genômica Ampla , Genótipo , Humanos , Masculino , Pessoa de Meia-Idade , Fenótipo , Simportadores/genéticaRESUMO
Plasminogen activator inhibitor 1 (PAI-1), a major modulator of the fibrinolytic system, is an important factor in cardiovascular disease (CVD) susceptibility and severity. PAI-1 is highly heritable, but the few genes associated with it explain only a small portion of its variation. Studies of PAI-1 typically employ linear regression to estimate the effects of genetic variants on PAI-1 levels, but PAI-1 is not normally distributed, even after transformation. Therefore, alternative statistical methods may provide greater power to identify important genetic variants. Additionally, most genetic studies of PAI-1 have been performed on populations of European descent, limiting the generalizability of their results. We analyzed >30,000 variants for association with PAI-1 in a Ghanaian population, using median regression, a non-parametric alternative to linear regression. Three variants associated with median PAI-1, the most significant of which was in the gene arylsulfatase B (ARSB) (p = 1.09 x 10(-7)). We also analyzed the upper quartile of PAI-1, the most clinically relevant part of the distribution, and found 19 SNPs significantly associated in this quartile. Of note an association was found in period circadian clock 3 (PER3). Our results reveal novel associations with median and elevated PAI-1 in an understudied population. The lack of overlap between the two analyses indicates that the genetic effects on PAI-1 are not uniform across its distribution. They also provide evidence of the generalizability of the circadian pathway's effect on PAI-1, as a recent meta-analysis performed in Caucasian populations identified another circadian clock gene (ARNTL).
Assuntos
Inibidor 1 de Ativador de Plasminogênio/genética , Polimorfismo de Nucleotídeo Único/genética , População Branca/genética , Adulto , Doenças Cardiovasculares/genética , Ritmo Circadiano/genética , Feminino , Gana , Humanos , Masculino , N-Acetilgalactosamina-4-Sulfatase/genética , Proteínas Circadianas Period/genéticaRESUMO
A major goal in infectious disease research is to identify the human and pathogenic genetic variants that explain differences in microbial pathogenesis. However, neither pathogenic strain nor human genetic variation in isolation has proven adequate to explain the heterogeneity of disease pathology. We suggest that disrupted co-evolution between a pathogen and its human host can explain variation in disease outcomes, and that genome-by-genome interactions should therefore be incorporated into genetic models of disease caused by infectious agents. Genetic epidemiological studies that fail to take both the pathogen and host into account can lead to false and misleading conclusions about disease etiology. We discuss our model in the context of three pathogens, Helicobacter pylori, Mycobacterium tuberculosis and human papillomavirus, and generalize the conditions under which it may be applicable.
RESUMO
In omic research, such as genome wide association studies, researchers seek to repeat their results in other datasets to reduce false positive findings and thus provide evidence for the existence of true associations. Unfortunately this standard validation approach cannot completely eliminate false positive conclusions, and it can also mask many true associations that might otherwise advance our understanding of pathology. These issues beg the question: How can we increase the amount of knowledge gained from high throughput genetic data? To address this challenge, we present an approach that complements standard statistical validation methods by drawing attention to both potential false negative and false positive conclusions, as well as providing broad information for directing future research. The Diverse Convergent Evidence approach (DiCE) we propose integrates information from multiple sources (omics, informatics, and laboratory experiments) to estimate the strength of the available corroborating evidence supporting a given association. This process is designed to yield an evidence metric that has utility when etiologic heterogeneity, variable risk factor frequencies, and a variety of observational data imperfections might lead to false conclusions. We provide proof of principle examples in which DiCE identified strong evidence for associations that have established biological importance, when standard validation methods alone did not provide support. If used as an adjunct to standard validation methods this approach can leverage multiple distinct data types to improve genetic risk factor discovery/validation, promote effective science communication, and guide future research directions.