RESUMEN
Increased blood lipid levels are heritable risk factors of cardiovascular disease with varied prevalence worldwide owing to different dietary patterns and medication use1. Despite advances in prevention and treatment, in particular through reducing low-density lipoprotein cholesterol levels2, heart disease remains the leading cause of death worldwide3. Genome-wideassociation studies (GWAS) of blood lipid levels have led to important biological and clinical insights, as well as new drug targets, for cardiovascular disease. However, most previous GWAS4-23 have been conducted in European ancestry populations and may have missed genetic variants that contribute to lipid-level variation in other ancestry groups. These include differences in allele frequencies, effect sizes and linkage-disequilibrium patterns24. Here we conduct a multi-ancestry, genome-wide genetic discovery meta-analysis of lipid levels in approximately 1.65 million individuals, including 350,000 of non-European ancestries. We quantify the gain in studying non-European ancestries and provide evidence to support the expansion of recruitment of additional ancestries, even with relatively small sample sizes. We find that increasing diversity rather than studying additional individuals of European ancestry results in substantial improvements in fine-mapping functional variants and portability of polygenic prediction (evaluated in approximately 295,000 individuals from 7 ancestry groupings). Modest gains in the number of discovered loci and ancestry-specific variants were also achieved. As GWAS expand emphasis beyond the identification of genes and fundamental biology towards the use of genetic variants for preventive and precision medicine25, we anticipate that increased diversity of participants will lead to more accurate and equitable26 application of polygenic scores in clinical practice.
Asunto(s)
Enfermedades Cardiovasculares , Estudio de Asociación del Genoma Completo , Enfermedades Cardiovasculares/genética , Predisposición Genética a la Enfermedad/genética , Estudio de Asociación del Genoma Completo/métodos , Humanos , Desequilibrio de Ligamiento , Herencia Multifactorial , Polimorfismo de Nucleótido Simple/genética , Grupos de PoblaciónRESUMEN
Phenome-wide association studies (PheWASs) have been a useful tool for testing associations between genetic variations and multiple complex traits or diagnoses. Linking PheWAS-based associations between phenotypes and a variant or a genomic region into a network provides a new way to investigate cross-phenotype associations, and it might broaden the understanding of genetic architecture that exists between diagnoses, genes, and pleiotropy. We created a network of associations from one of the largest PheWASs on electronic health record (EHR)-derived phenotypes across 38,682 unrelated samples from the Geisinger's biobank; the samples were genotyped through the DiscovEHR project. We computed associations between 632,574 common variants and 541 diagnosis codes. Using these associations, we constructed a "disease-disease" network (DDN) wherein pairs of diseases were connected on the basis of shared associations with a given genetic variant. The DDN provides a landscape of intra-connections within the same disease classes, as well as inter-connections across disease classes. We identified clusters of diseases with known biological connections, such as autoimmune disorders (type 1 diabetes, rheumatoid arthritis, and multiple sclerosis) and cardiovascular disorders. Previously unreported relationships between multiple diseases were identified on the basis of genetic associations as well. The network approach applied in this study can be used to uncover interactions between diseases as a result of their shared, potentially pleiotropic SNPs. Additionally, this approach might advance clinical research and even clinical practice by accelerating our understanding of disease mechanisms on the basis of similar underlying genetic associations.
Asunto(s)
Enfermedad/genética , Registros Electrónicos de Salud , Estudios de Asociación Genética , Fenotipo , Polimorfismo de Nucleótido Simple/genética , Enfermedades Autoinmunes/genética , Enfermedades Cardiovasculares/genética , Epigenómica , HumanosRESUMEN
[Purpose] To present a case of non-surgical reduction of thoracic hyperkyphosis utilizing a multimodal rehabilitation program emphasizing the mirror image® concept. [Subject and Methods] A 15-year-old female presented to a rehabilitation office suffering from back and neck pains and headaches. The patient was treated sporadically over a period of 13-months. Treatment consisted of anterior thoracic translation and thoracic extension exercises, spinal traction and spinal manipulation. [Results] After 13-months of treatment the patient displayed a significant reduction in hyperkyphosis and a dramatic correction of her overall posture and spine alignment corresponding to the reduction in back/neck pains, headaches and the simultaneous improvement of various other health issues. [Conclusion] Thoracic hyperkyphosis can be reduced through a multimodal rehabilitation program emphasizing mirror image thoracic extension procedures.
RESUMEN
The purpose of this review is to provide an analysis of the latest developments on the functions of the carbon catabolite-repression 4-Not (Ccr4-Not) complex in regulating eukaryotic gene expression. Ccr4-Not is a nine-subunit protein complex that is conserved in sequence and function throughout the eukaryotic kingdom. Although Ccr4-Not has been studied since the 1980s, our understanding of what it does is constantly evolving. Once thought to solely regulate transcription, it is now clear that it has much broader roles in gene regulation, such as in mRNA decay and quality control, RNA export, translational repression and protein ubiquitylation. The mechanism of actions for each of its functions is still being debated. Some of the difficulty in drawing a clear picture is that it has been implicated in so many processes that regulate mRNAs and proteins in both the cytoplasm and the nucleus. We will describe what is known about the Ccr4-Not complex in yeast and other eukaryotes in an effort to synthesize a unified model for how this complex coordinates multiple steps in gene regulation and provide insights into what questions will be most exciting to answer in the future.
Asunto(s)
Células Eucariotas/metabolismo , Factores de Transcripción/metabolismo , Animales , Regulación de la Expresión Génica/genética , Regulación de la Expresión Génica/fisiología , Humanos , Proteínas de Saccharomyces cerevisiae/química , Proteínas de Saccharomyces cerevisiae/genética , Proteínas de Saccharomyces cerevisiae/metabolismo , Factores de Transcripción/química , Factores de Transcripción/genéticaRESUMEN
(1) Background: This study assessed the relationship between cervical spine parameters taken on standing full-spine lateral radiographic images compared to sectional lateral cervical radiographs. (2) Methods: Full-spine (FS) and sectional lateral cervical (LC) radiographs from four spine treatment facilities across the USA retrospectively provided data collected on 220 persons to assess the comparison of three sagittal cervical radiographic measurements between the two views. The measures included cervical lordosis using the absolute rotation angle from C2-C7, sagittal cervical translation of C2-C7, and atlas plane angle to horizontal. Linear correlation and R2 models were used for statistical comparison of the measures for the two views. (3) Results: The mean values of the three measurements were statistically different from each other: C2-C7 translation (FS = 19.84 ± 11.98 vs. LC = 21.18 ± 11.8), C2-C7 lordosis (FS = -15.3 ± 14.63 vs. LC = -18.32 ± 13.16), and atlas plane (FS = -19.99 ± 8.88 vs. LC = -22.56 ± 8.93), where all values were p < 0.001. Weak-to-moderate-to-strong correlations existed between the full-spine and sectional lateral cervical radiographic variables. The R2 values varied based on the measurement were R2 = 0.768 (p < 0.001) for sagittal cervical translation of C2-C7 (strong), R2 = 0.613 (p < 0.001) for the absolute rotation angle C2-C7 (moderate), and R2 = 0.406 (p < 0.001) for the atlas plane line (weak). Though a linear correlation was identified, there were consistent intra-person differences between the measurements on the full spine versus sectional lateral cervical radiographic views, where the full-spine view consistently underestimated the magnitude of the variables. (4) Conclusion: Key sagittal cervical radiographic measurements on the full spine versus sectional lateral cervical radiographic views show striking intra-person differences. The findings of this study confirm that full spine versus sectional lateral cervical radiographic views provide different biomechanical magnitudes of cervical sagittal alignment, and caution should be exercised by health care providers as these are not interchangeable. We recommend the LC view for measurement of cervical sagittal alignment variables.
RESUMEN
Clinical algorithms, or "pathways," promote the delivery of medical care that is consistent and equitable. Race, ethnicity, and/or ancestry terms are sometimes included in these types of guidelines, but it is unclear if this is appropriate for clinical decision-making. At our institution, we developed and applied a structured framework to determine whether race, ethnicity, or ancestry terms identified in our clinical pathways library should be retained, modified, or removed. First, we reviewed all text and associated reference documents for 132 institutionally-developed clinical pathways and identified 8 pathways that included race, ethnicity, or ancestry terms. Five pathways had clear evidence or a change in institutional policy that supported removal of the term. Multispecialty teams conducted additional in-depth evaluation of the 3 remaining pathways (Acute Viral Illness, Hyperbilirubinemia, and Weight Management) by applying the framework. In total, based on these reviews, race, ethnicity, or ancestry terms were removed (n = 6) or modified (n = 2) in all 8 pathways. Application of the framework established several recommended practices, including: (1) define race, ethnicity, and ancestry rigorously; (2) assess the most likely mechanisms underlying epidemiologic associations; (3) consider whether inclusion of the term is likely to mitigate or exacerbate existing inequities; and (4) exercise caution when applying population-level data to individual patient encounters. This process and framework may be useful to other institutional programs and national organizations in evaluating the inclusion of race, ethnicity, and ancestry in clinical guidelines.
Asunto(s)
Vías Clínicas , Etnicidad , HumanosRESUMEN
Nonalcoholic fatty liver disease is common and highly heritable. Genetic studies of hepatic fat have not sufficiently addressed non-European and rare variants. In a medical biobank, we quantitate hepatic fat from clinical computed tomography (CT) scans via deep learning in 10,283 participants with whole-exome sequences available. We conduct exome-wide associations of single variants and rare predicted loss-of-function (pLOF) variants with CT-based hepatic fat and perform cross-modality replication in the UK Biobank (UKB) by linking whole-exome sequences to MRI-based hepatic fat. We confirm single variants previously associated with hepatic fat and identify several additional variants, including two (FGD5 H600Y and CITED2 S198_G199del) that replicated in UKB. A burden of rare pLOF variants in LMF2 is associated with increased hepatic fat and replicates in UKB. Quantitative phenotypes generated from clinical imaging studies and intersected with genomic data in medical biobanks have the potential to identify molecular pathways associated with human traits and disease.
Asunto(s)
Exoma , Enfermedad del Hígado Graso no Alcohólico , Humanos , Exoma/genética , Bancos de Muestras Biológicas , Fenotipo , Tomografía Computarizada por Rayos X , Enfermedad del Hígado Graso no Alcohólico/diagnóstico por imagen , Enfermedad del Hígado Graso no Alcohólico/genética , Proteínas Represoras/genética , Transactivadores/genéticaRESUMEN
BACKGROUND: Genetic variants within nearly 1000 loci are known to contribute to modulation of blood lipid levels. However, the biological pathways underlying these associations are frequently unknown, limiting understanding of these findings and hindering downstream translational efforts such as drug target discovery. RESULTS: To expand our understanding of the underlying biological pathways and mechanisms controlling blood lipid levels, we leverage a large multi-ancestry meta-analysis (N = 1,654,960) of blood lipids to prioritize putative causal genes for 2286 lipid associations using six gene prediction approaches. Using phenome-wide association (PheWAS) scans, we identify relationships of genetically predicted lipid levels to other diseases and conditions. We confirm known pleiotropic associations with cardiovascular phenotypes and determine novel associations, notably with cholelithiasis risk. We perform sex-stratified GWAS meta-analysis of lipid levels and show that 3-5% of autosomal lipid-associated loci demonstrate sex-biased effects. Finally, we report 21 novel lipid loci identified on the X chromosome. Many of the sex-biased autosomal and X chromosome lipid loci show pleiotropic associations with sex hormones, emphasizing the role of hormone regulation in lipid metabolism. CONCLUSIONS: Taken together, our findings provide insights into the biological mechanisms through which associated variants lead to altered lipid levels and potentially cardiovascular disease risk.
Asunto(s)
Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo , Humanos , Caracteres Sexuales , Fenotipo , Lípidos/genética , Polimorfismo de Nucleótido Simple , Pleiotropía GenéticaRESUMEN
BACKGROUND: Rare variants play an essential role in the etiology of cancer. In this study, we aim to characterize rare germline variants that impact the risk of cancer. METHODS: We performed a genome-wide rare variant analysis using germline whole exome sequencing (WES) data derived from the Geisinger MyCode initiative to discover cancer predisposition variants. The case-control association analysis was conducted by binning variants in 5,538 patients with cancer and 7,286 matched controls in a discovery set and 1,991 patients with cancer and 2,504 matched controls in a validation set across nine cancer types. Further, The Cancer Genome Atlas (TCGA) germline data were used to replicate the findings. RESULTS: We identified 133 significant pathway-cancer pairs (85 replicated) and 90 significant gene-cancer pairs (12 replicated). In addition, we identified 18 genes and 3 pathways that were associated with survival outcome across cancers (Bonferroni P < 0.05). CONCLUSIONS: In this study, we identified potential predisposition genes and pathways based on rare variants in nine cancers. IMPACT: This work adds to the knowledge base and progress being made in precision medicine.
Asunto(s)
Predisposición Genética a la Enfermedad , Neoplasias/genética , Bancos de Muestras Biológicas , Biomarcadores de Tumor/sangre , Estudios de Casos y Controles , Femenino , Estudio de Asociación del Genoma Completo , Mutación de Línea Germinal , Humanos , Masculino , Neoplasias/sangre , Neoplasias/epidemiología , Secuenciación del ExomaRESUMEN
Plasma lipids are known heritable risk factors for cardiovascular disease, but increasing evidence also supports shared genetics with diseases of other organ systems. We devised a comprehensive three-phase framework to identify new lipid-associated genes and study the relationships among lipids, genotypes, gene expression and hundreds of complex human diseases from the Electronic Medical Records and Genomics (347 traits) and the UK Biobank (549 traits). Aside from 67 new lipid-associated genes with strong replication, we found evidence for pleiotropic SNPs/genes between lipids and diseases across the phenome. These include discordant pleiotropy in the HLA region between lipids and multiple sclerosis and putative causal paths between triglycerides and gout, among several others. Our findings give insights into the genetic basis of the relationship between plasma lipids and diseases on a phenome-wide scale and can provide context for future prevention and treatment strategies.
Asunto(s)
Biomarcadores , Susceptibilidad a Enfermedades , Registros Electrónicos de Salud , Lípidos/sangre , Alelos , Bancos de Muestras Biológicas , Estudios de Asociación Genética , Predisposición Genética a la Enfermedad , Humanos , Polimorfismo de Nucleótido Simple , Vigilancia en Salud Pública , Carácter Cuantitativo Heredable , Reino UnidoRESUMEN
Characterizing how variation at the level of individual nucleotides contributes to traits and diseases has been an area of growing interest since the completion of sequencing the first human genome. Our understanding of how a single nucleotide polymorphism (SNP) leads to a pathogenic phenotype on a genome-wide scale is a fruitful endeavor for anyone interested in developing diagnostic tests, therapeutics, or simply wanting to understand the etiology of a disease or trait. To this end, many datasets and algorithms have been developed as resources/tools to annotate SNPs. One of the most common practices is to annotate coding SNPs that affect the protein sequence. Synonymous variants are often grouped as one type of variant, however there are in fact many tools available to dissect their effects on gene expression. More recently, large consortiums like ENCODE and GTEx have made it possible to annotate non-coding regions. Although annotating variants is a common technique among human geneticists, the constant advances in tools and biology surrounding SNPs requires an updated summary of what is known and the trajectory of the field. This review will discuss the history behind SNP annotation, commonly used tools, and newer strategies for SNP annotation. Additionally, we will comment on the caveats that distinguish approaches from one another, along with gaps in the current state of knowledge, and potential future directions. We do not intend for this to be a comprehensive review for any specific area of SNP annotation, but rather it will be an excellent resource for those unfamiliar with computational tools used to functionally characterize SNPs. In summary, this review will help illustrate how each SNP annotation method impacts the way in which the genetic and molecular etiology of a disease is explored in-silico.
RESUMEN
Endometrial cancer is the fourth most commonly diagnosed cancer in women. Family history is a known risk factor for endometrial cancer. The incidence of endometrial cancer in a first-degree relative elevates the relative risk to range between 1.3 and 2.8. It is unclear to what extent or what other novel germline variants are at play in endometrial cancer. We aim to address this question by utilizing whole exome sequencing as a means to identify novel, rare variant associations between exonic regions and endometrial cancer. The MyCode community health initiative is an excellent resource for this study with germline whole exome data for 60,000 patients available in the first phase, and further 30,000 patients independently sequenced in the second phase as part of DiscovEHR study. We conducted exome-wide rare variant association using 472 cases and 4,110 controls in 60,000 patients (discovery cohort); and 261 cases and 1,531 controls from 30,000 patients (replication cohort). After binning rare germline variants into genes, case-control association tests performed using Optimal Unified Approach for Rare-Variant Association, SKAT-O. Seven genes, including RBM12, NDUFB6, ATP6V1A, RECK, SLC35E1, RFX3 (Bonferroni-corrected P < 0.05) and ATP8A1 (suggestive P < 10-5), and one long non-coding RNA, DLGAP4-AS1 (Bonferroni-corrected P < 0.05), were associated with endometrial cancer. Notably, RECK, and ATP8A1 were replicated from the replication cohort (suggestive threshold P < 0.05). Additionally, a pathway-based rare variant analysis, using pathogenic and likely pathogenic variants, identified two significant pathways, pyrimidine metabolism and protein processing in the endoplasmic reticulum (Bonferroni-corrected P < 0.05). In conclusion, our results using the single-source electronic health records (EHR) linked to genomic data highlights candidate genes and pathways associated with endometrial cancer and indicates rare variants involvement in endometrial cancer predisposition, which could help in personalized prognosis and also further our understanding of its genetic etiology.
RESUMEN
BACKGROUND: At least 90% of human genes are alternatively spliced. Alternative splicing has an important function regulating gene expression and miss-splicing can contribute to risk for human diseases, including Alzheimer's disease (AD). METHODS: We developed a splicing decision model as a molecular mechanism to identify functional exon skipping events and genetic variation affecting alternative splicing on a genome-wide scale by integrating genomics, transcriptomics, and neuroimaging data in a systems biology approach. In this study, we analyzed RNA-Seq data of hippocampus brain tissue from Alzheimer's disease (AD; n = 24) and cognitively normal elderly controls (CN; n = 50) and identified three exon skipping events in two genes (RELN and NOS1) as significantly associated with AD (corrected p-value < 0.05 and fold change > 1.5). Next, we identified single-nucleotide polymorphisms (SNPs) affecting exon skipping events using the splicing decision model and then performed an association analysis of SNPs potentially affecting three exon skipping events with a global cortical measure of amyloid-ß deposition measured by [18F] Florbetapir position emission tomography (PET) scan as an AD-related quantitative phenotype. A whole-brain voxel-based analysis was also performed. RESULTS: Two exons in RELN and one exon in NOS1 showed significantly lower expression levels in the AD participants compared to CN participants, suggesting that the exons tend to be skipped more in AD. We also showed the loss of the core protein structure due to the skipped exons using the protein 3D structure analysis. The targeted SNP-based association analysis identified one intronic SNP (rs362771) adjacent to the skipped exon 24 in RELN as significantly associated with cortical amyloid-ß levels (corrected p-value < 0.05). This SNP is within the splicing regulatory element, i.e., intronic splicing enhancer. The minor allele of rs362771 conferred decreases in cortical amyloid-ß levels in the right temporal and bilateral parietal lobes. CONCLUSIONS: Our results suggest that exon skipping events and splicing-affecting SNPs in the human hippocampus may contribute to AD pathogenesis. Integration of multiple omics and neuroimaging data provides insights into possible mechanisms underlying AD pathophysiology through exon skipping and may help identify novel therapeutic targets.
Asunto(s)
Enfermedad de Alzheimer/genética , Exones/genética , Hipocampo/metabolismo , Humanos , Polimorfismo de Nucleótido Simple , Proteína Reelina , Análisis de Secuencia de ARNRESUMEN
BACKGROUND: Endometrial cancer (EMCA) is the fifth most common cancer among women in the world. Identification of potentially pathogenic germline variants from individuals with EMCA will help characterize genetic features that underlie the disease and potentially predispose individuals to its pathogenesis. METHODS: The Geisinger Health System's (GHS) DiscovEHR cohort includes exome sequencing on over 50,000 consenting patients, 297 of whom have evidence of an EMCA diagnosis in their electronic health record. Here, rare variants were annotated as potentially pathogenic. RESULTS: Eight genes were identified as having increased burden in the EMCA cohort relative to the non-cancer control cohort. None of the eight genes had an increased burden in the other hormone related cancer cohort from GHS, suggesting they can help characterize the underlying genetic variation that gives rise to EMCA. Comparing GHS to the cancer genome atlas (TCGA) EMCA germline data illustrated 34 genes with potentially pathogenic variation and eight unique potentially pathogenic variants that were present in both studies. Thus, similar germline variation among genes can be observed in unique EMCA cohorts and could help prioritize genes to investigate for future work. CONCLUSION: In summary, this systematic characterization of potentially pathogenic germline variants describes the genetic underpinnings of EMCA through the use of data from a single hospital system.
Asunto(s)
Registros Electrónicos de Salud , Neoplasias Endometriales/genética , Mutación de Línea Germinal , Adulto , Anciano , Anciano de 80 o más Años , Estudios de Cohortes , Neoplasias Endometriales/patología , Femenino , Humanos , Persona de Mediana Edad , Secuenciación del ExomaRESUMEN
Following publication of the original article [1], the authors reported that Fig. 1 was not correctly processed during the production process. The correct Fig. 1 is given below.
RESUMEN
BACKGROUND: The Cancer Genome Atlas (TCGA) project is a public resource that provides transcriptomic, DNA sequence, methylation, and clinical data for 33 cancer types. Transforming the large size and high complexity of TCGA cancer genome data into integrated knowledge can be useful to promote cancer research. Alternative splicing (AS) is a key regulatory mechanism of genes in human cancer development and in the interaction with epigenetic factors. Therefore, AS-guided integration of existing TCGA data sets will make it easier to gain insight into the genetic architecture of cancer risk and related outcomes. There are already existing tools analyzing and visualizing alternative mRNA splicing patterns for large-scale RNA-seq experiments. However, these existing web-based tools are limited to the analysis of individual TCGA data sets at a time, such as only transcriptomic information. RESULTS: We implemented CAS-viewer (integrative analysis of Cancer genome data based on Alternative Splicing), a web-based tool leveraging multi-cancer omics data from TCGA. It illustrates alternative mRNA splicing patterns along with methylation, miRNAs, and SNPs, and then provides an analysis tool to link differential transcript expression ratio to methylation, miRNA, and splicing regulatory elements for 33 cancer types. Moreover, one can analyze AS patterns with clinical data to identify potential transcripts associated with different survival outcome for each cancer. CONCLUSIONS: CAS-viewer is a web-based application for transcript isoform-driven integration of multi-omics data in multiple cancer types and will aid in the visualization and possible discovery of biomarkers for cancer by integrating multi-omics data from TCGA.
Asunto(s)
Genómica/métodos , Internet , Neoplasias/genética , Empalme del ARN , Humanos , Polimorfismo de Nucleótido Simple , Programas Informáticos , Neoplasias de la Vejiga Urinaria/genéticaRESUMEN
The Ccr4 (carbon catabolite repression 4)-Not complex is a major regulator of stress responses that controls gene expression at multiple levels, from transcription to mRNA decay. Ccr4, a "core" subunit of the complex, is the main cytoplasmic deadenylase in Saccharomyces cerevisiae; however, its mRNA targets have not been mapped on a genome-wide scale. Here, we describe a genome-wide approach, RNA immunoprecipitation (RIP) high-throughput sequencing (RIP-seq), to identify the RNAs bound to Ccr4, and two proteins that associate with it, Dhh1 and Puf5 All three proteins were preferentially bound to lowly abundant mRNAs, most often at the 3' end of the transcript. Furthermore, Ccr4, Dhh1, and Puf5 are recruited to mRNAs that are targeted by other RNA-binding proteins that promote decay and mRNA transport, and inhibit translation. Although Ccr4-Not regulates mRNA transcription and decay, Ccr4 recruitment to mRNAs correlates better with decay rates, suggesting it imparts greater control over transcript abundance through decay. Ccr4-enriched mRNAs are refractory to control by the other deadenylase complex in yeast, Pan2/3, suggesting a division of labor between these deadenylation complexes. Finally, Ccr4 and Dhh1 associate with mRNAs whose abundance increases during nutrient starvation, and those that fluctuate during metabolic and oxygen consumption cycles, which explains the known genetic connections between these factors and nutrient utilization and stress pathways.
Asunto(s)
Regulación Fúngica de la Expresión Génica , Genoma Fúngico , ARN de Hongos/genética , ARN Mensajero/genética , Ribonucleasas/genética , Proteínas de Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/genética , Sitios de Unión , ARN Helicasas DEAD-box/genética , ARN Helicasas DEAD-box/metabolismo , Ontología de Genes , Secuenciación de Nucleótidos de Alto Rendimiento , Inmunoprecipitación , Anotación de Secuencia Molecular , Unión Proteica , Estabilidad del ARN , ARN de Hongos/metabolismo , ARN Mensajero/metabolismo , Proteínas de Unión al ARN/genética , Proteínas de Unión al ARN/metabolismo , Ribonucleasas/metabolismo , Saccharomyces cerevisiae/metabolismo , Proteínas de Saccharomyces cerevisiae/metabolismo , Transcripción GenéticaRESUMEN
Alzheimer's disease (AD) is a neurodegenerative disorder with few biomarkers even though it impacts a relatively large portion of the population and is predicted to affect significantly more individuals in the future. Neuroimaging has been used in concert with genetic information to improve our understanding in relation to how AD arises and how it can be potentially diagnosed. Additionally, evidence suggests synonymous variants can have a functional impact on gene regulatory mechanisms, including those related to AD. Some synonymous codons are preferred over others leading to a codon bias. The bias can arise with respect to codons that are more or less frequently used in the genome. A bias can also result from optimal and non-optimal codons, which have stronger and weaker codon anti-codon interactions, respectively. Although association tests have been utilized before to identify genes associated with AD, it remains unclear how codon bias plays a role and if it can improve rare variant analysis. In this work, rare variants from whole-genome sequencing from the Alzheimer's Disease Neuroimaging Initiative (ADNI) cohort were binned into genes using BioBin. An association analysis of the genes with AD-related neuroimaging biomarker was performed using SKAT-O. While using all synonymous variants we did not identify any genomewide significant associations, using only synonymous variants that affected codon frequency we identified several genes as significantly associated with the imaging phenotype. Additionally, significant associations were found using only rare variants that contains an optimal codon in among minor alleles and a non-optimal codon in the major allele. These results suggest that codon bias may play a role in AD and that it can be used to improve detection power in rare variant association analysis.
Asunto(s)
Enfermedad de Alzheimer/diagnóstico por imagen , Enfermedad de Alzheimer/genética , Codón/genética , Variación Genética , Anciano , Anciano de 80 o más Años , Biomarcadores , Biología Computacional/métodos , Femenino , Estudios de Asociación Genética , Humanos , Imagen por Resonancia Magnética , Masculino , Neuroimagen , Polimorfismo de Nucleótido Simple , Secuenciación Completa del GenomaRESUMEN
Eukaryotic organisms have developed a variety of mechanisms to regulate translation post-transcriptionally, including but not limited to the use of miRNA silencing in many species. One method of post-transcriptional regulation is through miRNAs that bind to the 3' UTRs to regulate mRNA abundance and influence protein expression. Therefore, the diversity of mRNA 3' UTRs mediating miRNA binding sites influence miRNA-mediated regulation. Alternative polyadenylation, by shortening mRNA isoforms, increases the diversity of 3' UTRs; moreover, short mRNA isoforms elude miRNA-medicated repression. Because no current prediction methods for putative miRNA target sites consider whether or not 1) splicing-informed miRNA binding sites and/or 2) the use of 3' UTRs provide higher resolution or functionality, we sought to identify not only the genome-wide impact of using exons in mRNA 3' UTRs but also their functional connection to miRNA regulation and clinical outcomes in cancer. With a genome-wide expression of mRNA and miRNA quantified by 395 bladder cancer cases from The Cancer Genome Atlas (TCGA), we 1) demonstrate the diversity of 3' UTRs affecting miRNA efficiency and 2) identify a set of genes clinically associated with mRNA expression in bladder cancer. Knowledge of 3' UTR diversity will not only be a useful addition to current miRNA target prediction algorithms but also enhance the clinical utility of mRNA isoforms in the expression of mRNA in cancer. Thus, variability among cancer patient's variability in molecular signatures based on these exon usage events in 3' UTR along with miRNAs in bladder cancer may lead to better prognostic/treatment strategies for improved precision medicine.