RESUMO
Genome-wide association studies (GWASs) have enabled unbiased identification of genetic loci contributing to common complex diseases. Because GWAS loci often harbor many variants and genes, it remains a major challenge to move from GWASs' statistical associations to the identification of causal variants and genes that underlie these association signals. Researchers have applied many statistical and functional fine-mapping strategies to prioritize genetic variants and genes as potential candidates. There is no gold standard in fine-mapping approaches, but consistent results across different approaches can improve confidence in the fine-mapping findings. Here, we combined text mining with a systematic review and formed a catalog of 85 studies with evidence of fine mapping for at least one autoimmune GWAS locus. Across all fine-mapping studies, we compiled 230 GWAS loci with allelic heterogeneity estimates and predictions of causal variants and trait-relevant genes. These 230 loci included 455 combinations of locus-by-disease association signals with 15 autoimmune diseases. Using these estimates, we assessed the probability of mediating disease risk associations across genes in GWAS loci and identified robust signals of causal disease biology. We predict that this comprehensive catalog of GWAS fine-mapping efforts in autoimmune disease will greatly help distill the plethora of information in the field and inform therapeutic strategies.
Assuntos
Doenças Autoimunes/genética , Estudo de Associação Genômica Ampla , HumanosRESUMO
Deciphering the impact of genetic variation on gene regulation is fundamental to understanding common, complex human diseases. Although histone modifications are important markers of gene regulatory elements of the genome, any specific histone modification has not been assayed in more than a few individuals in the human liver. As a result, the effects of genetic variation on histone modification states in the liver are poorly understood. Here, we generate the most comprehensive genome-wide dataset of two epigenetic marks, H3K4me3 and H3K27ac, and annotate thousands of putative regulatory elements in the human liver. We integrate these findings with genome-wide gene expression data collected from the same human liver tissues and high-resolution promoter-focused chromatin interaction maps collected from human liver-derived HepG2 cells. We demonstrate widespread functional consequences of natural genetic variation on putative regulatory element activity and gene expression levels. Leveraging these extensive datasets, we fine-map a total of 74 GWAS loci that have been associated with at least one complex phenotype. Our results reveal a repertoire of genes and regulatory mechanisms governing complex disease development and further the basic understanding of genetic and epigenetic regulation of gene expression in the human liver tissue.
Assuntos
Cromatina/genética , Mapeamento Cromossômico/métodos , Epigênese Genética , Fígado/patologia , Herança Multifatorial/genética , Polimorfismo de Nucleotídeo Único , Locos de Características Quantitativas , Adolescente , Adulto , Idoso , Criança , Cromatina/metabolismo , Feminino , Estudos de Associação Genética , Células Hep G2 , Histonas/genética , Humanos , Fígado/metabolismo , Masculino , Pessoa de Meia-Idade , Fenótipo , Regiões Promotoras Genéticas , Estudos Prospectivos , Sequências Reguladoras de Ácido Nucleico , Adulto JovemRESUMO
Gene regulation shapes the evolution of phenotypic diversity. We investigated the evolution of liver promoters and enhancers in six primate species using ChIP-seq (H3K27ac and H3K4me1) to profile cis-regulatory elements (CREs) and using RNA-seq to characterize gene expression in the same individuals. To quantify regulatory divergence, we compared CRE activity across species by testing differential ChIP-seq read depths directly measured for orthologous sequences. We show that the primate regulatory landscape is largely conserved across the lineage, with 63% of the tested human liver CREs showing similar activity across species. Conserved CRE function is associated with sequence conservation, proximity to coding genes, cell-type specificity, and transcription factor binding. Newly evolved CREs are enriched in immune response and neurodevelopmental functions. We further demonstrate that conserved CREs bind master regulators, suggesting that while CREs contribute to species adaptation to the environment, core functions remain intact. Newly evolved CREs are enriched in young transposable elements (TEs), including Long-Terminal-Repeats (LTRs) and SINE-VNTR-Alus (SVAs), that significantly affect gene expression. Conversely, only 16% of conserved CREs overlap TEs. We tested the cis-regulatory activity of 69 TE subfamilies by luciferase reporter assays, spanning all major TE classes, and showed that 95.6% of tested TEs can function as either transcriptional activators or repressors. In conclusion, we demonstrated the critical role of TEs in primate gene regulation and illustrated potential mechanisms underlying evolutionary divergence among the primate species through the noncoding genome.
Assuntos
Elementos de DNA Transponíveis , Evolução Molecular , Regulação da Expressão Gênica/genética , Elementos Reguladores de Transcrição , Animais , Callithrix , Humanos , Especificidade da EspécieRESUMO
Genome-wide association studies (GWASs) have become a standard tool for dissecting genetic contributions to disease risk. However, these studies typically require extraordinarily large sample sizes to be adequately powered. Strategies that incorporate functional information alongside genetic associations have proved successful in increasing GWAS power. Following this paradigm, we present the results of 20 different genetic association studies for quantitative traits related to complex diseases, conducted in the Hutterites of South Dakota. To boost the power of these association studies, we collected RNA-sequencing data from lymphoblastoid cell lines for 431 Hutterite individuals. We then used Sherlock, a tool that integrates GWAS and expression quantitative trait locus (eQTL) data, to identify weak GWAS signals that are also supported by eQTL data. Using this approach, we found novel associations with quantitative phenotypes related to cardiovascular disease, including carotid intima-media thickness, left atrial volume index, monocyte count and serum YKL-40 levels.
Assuntos
Doenças Cardiovasculares/genética , Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Locos de Características Quantitativas/genética , Doenças Cardiovasculares/patologia , Espessura Intima-Media Carotídea , Regulação da Expressão Gênica/genética , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Fenótipo , Polimorfismo de Nucleotídeo ÚnicoRESUMO
Rhinovirus (RV) is the most prevalent human respiratory virus and is responsible for at least half of all common colds. RV infections may result in a broad spectrum of effects that range from asymptomatic infections to severe lower respiratory illnesses. The basis for inter-individual variation in the response to RV infection is not well understood. In this study, we explored whether host genetic variation is associated with variation in gene expression response to RV infections between individuals. To do so, we obtained genome-wide genotype and gene expression data in uninfected and RV-infected peripheral blood mononuclear cells (PBMCs) from 98 individuals. We mapped local and distant genetic variation that is associated with inter-individual differences in gene expression levels (eQTLs) in both uninfected and RV-infected cells. We focused specifically on response eQTLs (reQTLs), namely, genetic associations with inter-individual variation in gene expression response to RV infection. We identified local reQTLs for 38 genes, including genes with known functions in viral response (UBA7, OAS1, IRF5) and genes that have been associated with immune and RV-related diseases (e.g., ITGA2, MSR1, GSTM3). The putative regulatory regions of genes with reQTLs were enriched for binding sites of virus-activated STAT2, highlighting the role of condition-specific transcription factors in genotype-by-environment interactions. Overall, we suggest that the 38 loci associated with inter-individual variation in gene expression response to RV-infection represent promising candidates for affecting immune and RV-related respiratory diseases.
Assuntos
Resfriado Comum/genética , Loci Gênicos , Variação Genética , Transcriptoma , 2',5'-Oligoadenilato Sintetase/genética , 2',5'-Oligoadenilato Sintetase/metabolismo , Adulto , Resfriado Comum/metabolismo , Feminino , Perfilação da Expressão Gênica , Interação Gene-Ambiente , Glutationa Transferase/genética , Glutationa Transferase/metabolismo , Humanos , Integrina alfa2/genética , Integrina alfa2/metabolismo , Fatores Reguladores de Interferon/genética , Fatores Reguladores de Interferon/metabolismo , Leucócitos Mononucleares/metabolismo , Masculino , Pessoa de Meia-Idade , Fator de Transcrição STAT2/genética , Fator de Transcrição STAT2/metabolismo , Receptores Depuradores Classe A/genética , Receptores Depuradores Classe A/metabolismoRESUMO
BACKGROUND: Both genetic variation at the 17q21 locus and virus-induced respiratory wheezing illnesses are associated with the development of asthma. Our aim was to determine the effects of these two factors on the risk of asthma in the Childhood Origins of Asthma (COAST) and the Copenhagen Prospective Study on Asthma in Childhood (COPSAC) birth cohorts. METHODS: We tested genotypes at the 17q21 locus for associations with asthma and with human rhinovirus (HRV) and respiratory syncytial virus (RSV) wheezing illnesses and tested for interactions between 17q21 genotypes and HRV and RSV wheezing illnesses with respect to the risk of asthma. Finally, we examined genotype-specific expression of 17q21 genes in unstimulated and HRV-stimulated peripheral-blood mononuclear cells (PBMCs). RESULTS: The 17q21 variants were associated with HRV wheezing illnesses in early life, but not with RSV wheezing illnesses. The associations of 17q21 variants with asthma were restricted to children who had had HRV wheezing illnesses, resulting in a significant interaction effect with respect to the risk of asthma. Moreover, the expression levels of ORMDL3 and of GSDMB were significantly increased in HRV-stimulated PBMCs, as compared with unstimulated PBMCs. The expression of these genes was associated with 17q21 variants in both conditions, although the increase with exposure to HRV was not genotype-specific. CONCLUSIONS: Variants at the 17q21 locus were associated with asthma in children who had had HRV wheezing illnesses and with expression of two genes at this locus. The expression levels of both genes increased in response to HRV stimulation, although the relative increase was not associated with the 17q21 genotypes. (Funded by the National Institutes of Health.).
Assuntos
Asma/genética , Resfriado Comum/complicações , Sons Respiratórios , Rhinovirus , Asma/virologia , Criança , Cromossomos Humanos Par 17 , DNA/isolamento & purificação , Expressão Gênica , Predisposição Genética para Doença , Genótipo , Humanos , Leucócitos Mononucleares , Polimorfismo de Nucleotídeo Único , RNA/isolamento & purificação , Sons Respiratórios/genética , Infecções por Vírus Respiratório Sincicial/complicações , Vírus Sinciciais Respiratórios , RiscoRESUMO
Very long-chain fatty acids (VLCFAs, chain length >C20) exist in tissues throughout the body and are synthesized by repetition of the fatty acid (FA) elongation cycle composed of four successive enzymatic reactions. In mammals, the TER gene is the only gene encoding trans-2-enoyl-CoA reductase, which catalyzes the fourth reaction in the FA elongation cycle. The TER P182L mutation is the pathogenic mutation for nonsyndromic mental retardation. This mutation substitutes a leucine for a proline residue at amino acid 182 in the TER enzyme. Currently, the mechanism by which the TER P182L mutation causes nonsyndromic mental retardation is unknown. To understand the effect of this mutation on the TER enzyme and VLCFA synthesis, we have biochemically characterized the TER P182L mutant enzyme using yeast and mammalian cells transfected with the TER P182L mutant gene and analyzed the FA elongation cycle in the B-lymphoblastoid cell line with the homozygous TER P182L mutation (TER(P182L/P182L) B-lymphoblastoid cell line). We have found that TER P182L mutant enzyme exhibits reduced trans-2-enoyl-CoA reductase activity and protein stability, thereby impairing VLCFA synthesis and, in turn, altering the sphingolipid profile (i.e. decreased level of C24 sphingomyelin and C24 ceramide) in the TER(P182L/P182L) B-lymphoblastoid cell line. We have also found that in addition to the TER enzyme-catalyzed fourth reaction, the third reaction in the FA elongation cycle is affected by the TER P182L mutation. These findings provide new insight into the biochemical defects associated with this genetic mutation.
Assuntos
Ceramidas/metabolismo , Ácidos Graxos/biossíntese , Deficiência Intelectual Ligada ao Cromossomo X/genética , Mutação de Sentido Incorreto , Oxirredutases atuantes sobre Doadores de Grupo CH-CH/metabolismo , Esfingomielinas/metabolismo , Estabilidade Enzimática , Células HEK293 , Células HeLa , Homozigoto , Humanos , Oxirredutases atuantes sobre Doadores de Grupo CH-CH/genética , Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/metabolismo , Proteínas de Saccharomyces cerevisiae/genética , Proteínas de Saccharomyces cerevisiae/metabolismoRESUMO
Epstein-Barr virus (EBV) transformed lymphoblastoid cell lines (LCLs) provide a conveniently accessible and renewable resource for functional genomic studies in humans. The ability to accumulate multidimensional data pertaining to the same individual cell lines, from complete genomic sequences to detailed gene regulatory profiles, further enhances the utility of LCLs as a model system. A lingering concern, however, is that the changes associated with EBV transformation of B cells reduce the usefulness of LCLs as a surrogate model for primary tissues. To evaluate the validity of this concern, we compared global gene expression and methylation profiles between CD20+ primary B cells sampled from six individuals and six independent replicates of transformed LCLs derived from each sample. These data allowed us to obtain a detailed catalog of the genes and pathways whose regulation is affected by EBV transformation. We found that the expression levels and promoter methylation profiles of more than half of the studied genes were affected by the EBV transformation, including enrichments of genes involved in transcription regulation, cell cycle and immune response. However, we show that most of the differences in gene expression levels between LCLs and B cells are of small magnitude, and that LCLs can often recapitulate the naturally occurring gene expression variation in primary B cells. Thus, our observations suggest that inference of the genetic architecture that underlies regulatory variation in LCLs can typically be generalized to primary B cells. In contrast, inference based on functional studies in LCLs may be more limited to the cell lines.
Assuntos
Linfócitos B/virologia , Transformação Celular Viral , Perfilação da Expressão Gênica , Herpesvirus Humano 4/fisiologia , Adulto , Linfócitos B/fisiologia , Linhagem Celular Transformada , Metilação de DNA , DNA Mitocondrial/metabolismo , Epigênese Genética , Redes Reguladoras de Genes , Humanos , Pessoa de Meia-Idade , Análise de Componente Principal , Regiões Promotoras Genéticas , Locos de Características Quantitativas , Carga Viral , Adulto JovemRESUMO
Exome sequencing is a powerful tool for discovery of the Mendelian disease genes. Previously, we reported a novel locus for autosomal recessive non-syndromic mental retardation (NSMR) in a consanguineous family [Nolan, D.K., Chen, P., Das, S., Ober, C. and Waggoner, D. (2008) Fine mapping of a locus for nonsyndromic mental retardation on chromosome 19p13. Am. J. Med. Genet. A, 146A, 1414-1422]. Using linkage and homozygosity mapping, we previously localized the gene to chromosome 19p13. The parents of this sibship were recently included in an exome sequencing project. Using a series of filters, we narrowed the putative causal mutation to a single variant site that segregated with NSMR: the mutation was homozygous in five affected siblings but in none of eight unaffected siblings. This mutation causes a substitution of a leucine for a highly conserved proline at amino acid 182 in TECR (trans-2,3-enoyl-CoA reductase), a synaptic glycoprotein. Our results reveal the value of massively parallel sequencing for identification of novel disease genes that could not be found using traditional approaches and identifies only the seventh causal mutation for autosomal recessive NSMR.
Assuntos
Cromossomos Humanos Par 19/genética , Doenças Genéticas Inatas/genética , Deficiência Intelectual/genética , Glicoproteínas de Membrana/genética , Mutação , Oxirredutases/genética , Membranas Sinápticas/genética , Feminino , Doenças Genéticas Inatas/enzimologia , Humanos , Deficiência Intelectual/enzimologia , Masculino , Glicoproteínas de Membrana/metabolismo , Oxirredutases/metabolismo , Linhagem , Membranas Sinápticas/enzimologiaRESUMO
Lung cancer is the leading cause of cancer deaths among both men and women, representing approximately 25% of cancer fatalities each year. The treatment landscape for non-small cell lung cancer (NSCLC) is rapidly evolving due to the progress made in biomarker-driven targeted therapies. While advancements in targeted treatments have improved survival rates for NSCLC patients with actionable biomarkers, long-term survival remains low, with an overall 5-year relative survival rate below 20%. Artificial intelligence/machine learning (AI/ML) algorithms have shown promise in biomarker discovery, yet NSCLC-specific studies capturing the clinical challenges targeted and emerging patterns identified using AI/ML approaches are lacking. Here, we employed a text-mining approach and identified 215 studies that reported potential biomarkers of NSCLC using AI/ML algorithms. We catalogued these studies with respect to BEST (Biomarkers, EndpointS, and other Tools) biomarker sub-types and summarized emerging patterns and trends in AI/ML-driven NSCLC biomarker discovery. We anticipate that our comprehensive review will contribute to the current understanding of AI/ML advances in NSCLC biomarker research and provide an important catalogue that may facilitate clinical adoption of AI/ML-derived biomarkers.
RESUMO
The evolutionary history of variation in the human Rh blood group system, determined by variants in the RHD and RHCE genes, has long been an unresolved puzzle in human genetics. Prior to medical treatments and interventions developed in the last century, the D-positive (RhD positive) children of D-negative (RhD negative) women were at risk for hemolytic disease of the newborn, if the mother produced anti-D antibodies following sensitization to the blood of a previous D-positive child. Given the deleterious fitness consequences of this disease, the appreciable frequencies in European populations of the responsible RHD gene deletion variant (for example, 0.43 in our study) seem surprising. In this study, we used new molecular and genomic data generated from four HapMap population samples to test the idea that positive selection for an as-of-yet unknown fitness benefit of the RHD deletion may have offset the otherwise negative fitness effects of hemolytic disease of the newborn. We found no evidence that positive natural selection affected the frequency of the RHD deletion. Thus, the initial rise to intermediate frequency of the RHD deletion in European populations may simply be explained by genetic drift/founder effect, or by an older or more complex sweep that we are insufficiently powered to detect. However, our simulations recapitulate previous findings that selection on the RHD deletion is frequency dependent and weak or absent near 0.5. Therefore, once such a frequency was achieved, it could have been maintained by a relatively small amount of genetic drift. We unexpectedly observed evidence for positive selection on the C allele of RHCE in non-African populations (on chromosomes with intact copies of the RHD gene) in the form of an unusually high F( ST ) value and the high frequency of a single haplotype carrying the C allele. RhCE function is not well understood, but the C/c antigenic variant is clinically relevant and can result in hemolytic disease of the newborn, albeit much less commonly and severely than that related to the D-negative blood type. Therefore, the potential fitness benefits of the RHCE C allele are currently unknown but merit further exploration.
Assuntos
Eritroblastose Fetal/genética , Sistema do Grupo Sanguíneo Rh-Hr/genética , Alelos , Sequência de Bases , Linhagem Celular , Evolução Molecular , Genótipo , Humanos , Análise de Sequência de DNARESUMO
Epstein-Barr virus (EBV) transformed lymphoblastoid cell lines (LCLs) are a widely used renewable resource for functional genomic studies in humans. The ability to accumulate multidimensional data pertaining to the same individual cell lines, from complete genomic sequences to detailed gene regulatory profiles, further enhances the utility of LCLs as a model system. However, the extent to which LCLs are a faithful model system is relatively unknown. We have previously shown that gene expression profiles of newly established LCLs maintain a strong individual component. Here, we extend our study to investigate the effect of freeze-thaw cycles on gene expression patterns in mature LCLs, especially in the context of inter-individual variation in gene expression. We report a profound difference in the gene expression profiles of newly established and mature LCLs. Once newly established LCLs undergo a freeze-thaw cycle, the individual specific gene expression signatures become much less pronounced as the gene expression levels in LCLs from different individuals converge to a more uniform profile, which reflects a mature transformed B cell phenotype. We found that previously identified eQTLs are enriched among the relatively few genes whose regulations in mature LCLs maintain marked individual signatures. We thus conclude that while insight drawn from gene regulatory studies in mature LCLs may generally not be affected by the artificial nature of the LCL model system, many aspects of primary B cell biology cannot be observed and studied in mature LCL cultures.
Assuntos
Linfócitos B , Congelamento/efeitos adversos , Expressão Gênica , Adulto , Linfócitos B/metabolismo , Linhagem Celular Transformada , Criopreservação , Perfilação da Expressão Gênica , Herpesvirus Humano 4/genética , Herpesvirus Humano 4/metabolismo , Humanos , Análise em Microsséries , Pessoa de Meia-Idade , Fenótipo , Temperatura , Transformação Genética , Adulto JovemRESUMO
Although the North American Hutterites trace their origins to South Tyrol, no attempts have been made to examine the genetic migration history of the Hutterites before emigrating to the United States in the 1870s. To investigate this, we studied 9 microsatellite loci and 11 unique event polymorphism (UEP) markers on the Y-chromosome, the hypervariable region I (HVRI) of the mitochondrial DNA (mtDNA), as well as the complete mtDNA genome of Hutterite and South Tyrolean samples. Only 6 out of 14 Y-chromosome UEP+microsatellite haplotypes and 3 out of 11 mitochondrial haplotypes that were present in the Hutterites were also present in the South Tyrolean population. The phylogenetic relationships inferred from Y-chromosome and mtDNA databases show that the Hutterites have a unique genetic background related to a similar extent to central and eastern European populations. An admixture analysis indicates, however, a relatively high genetic contribution of central European populations to the Hutterite gene pool. These results are consistent with historical records on Hutterite migrations and demographic history. In addition, our data reveal similar numbers of Y and mitochondrial haplotypes in Hutterite male and female founders, respectively. The Hutterite male and female gene pools are similar with respect to genetic diversity and genetic distance measures and comparable with respect to their origins, suggesting a similar evolutionary history.