Búsqueda | Portal de Búsqueda de la BVS

1.

The Parkinson's disease protein alpha-synuclein is a modulator of processing bodies and mRNA stability.

Hallacli, Erinc; Kayatekin, Can; Nazeen, Sumaiya; Wang, Xiou H; Sheinkopf, Zoe; Sathyakumar, Shubhangi; Sarkar, Souvarish; Jiang, Xin; Dong, Xianjun; Di Maio, Roberto; Wang, Wen; Keeney, Matthew T; Felsky, Daniel; Sandoe, Jackson; Vahdatshoar, Aazam; Udeshi, Namrata D; Mani, D R; Carr, Steven A; Lindquist, Susan; De Jager, Philip L; Bartel, David P; Myers, Chad L; Greenamyre, J Timothy; Feany, Mel B; Sunyaev, Shamil R; Chung, Chee Yeun; Khurana, Vikram.

Cell ; 185(12): 2035-2056.e33, 2022 06 09.

Artículo en Inglés | MEDLINE | ID: mdl-35688132

RESUMEN

Alpha-synuclein (αS) is a conformationally plastic protein that reversibly binds to cellular membranes. It aggregates and is genetically linked to Parkinson's disease (PD). Here, we show that αS directly modulates processing bodies (P-bodies), membraneless organelles that function in mRNA turnover and storage. The N terminus of αS, but not other synucleins, dictates mutually exclusive binding either to cellular membranes or to P-bodies in the cytosol. αS associates with multiple decapping proteins in close proximity on the Edc4 scaffold. As αS pathologically accumulates, aberrant interaction with Edc4 occurs at the expense of physiologic decapping-module interactions. mRNA decay kinetics within PD-relevant pathways are correspondingly disrupted in PD patient neurons and brain. Genetic modulation of P-body components alters αS toxicity, and human genetic analysis lends support to the disease-relevance of these interactions. Beyond revealing an unexpected aspect of αS function and pathology, our data highlight the versatility of conformationally plastic proteins with high intrinsic disorder.

Asunto(s)

Enfermedad de Parkinson , alfa-Sinucleína , Humanos , Enfermedad de Parkinson/metabolismo , Cuerpos de Procesamiento , Estabilidad del ARN , alfa-Sinucleína/genética , alfa-Sinucleína/metabolismo

2.

StrVCTVRE: A supervised learning method to predict the pathogenicity of human genome structural variants.

Sharo, Andrew G; Hu, Zhiqiang; Sunyaev, Shamil R; Brenner, Steven E.

Am J Hum Genet ; 109(2): 195-209, 2022 02 03.

Artículo en Inglés | MEDLINE | ID: mdl-35032432

RESUMEN

Whole-genome sequencing resolves many clinical cases where standard diagnostic methods have failed. However, at least half of these cases remain unresolved after whole-genome sequencing. Structural variants (SVs; genomic variants larger than 50 base pairs) of uncertain significance are the genetic cause of a portion of these unresolved cases. As sequencing methods using long or linked reads become more accessible and SV detection algorithms improve, clinicians and researchers are gaining access to thousands of reliable SVs of unknown disease relevance. Methods to predict the pathogenicity of these SVs are required to realize the full diagnostic potential of long-read sequencing. To address this emerging need, we developed StrVCTVRE to distinguish pathogenic SVs from benign SVs that overlap exons. In a random forest classifier, we integrated features that capture gene importance, coding region, conservation, expression, and exon structure. We found that features such as expression and conservation are important but are absent from SV classification guidelines. We leveraged multiple resources to construct a size-matched training set of rare, putatively benign and pathogenic SVs. StrVCTVRE performs accurately across a wide SV size range on independent test sets, which will allow clinicians and researchers to eliminate about half of SVs from consideration while retaining a 90% sensitivity. We anticipate clinicians and researchers will use StrVCTVRE to prioritize SVs in probands where no SV is immediately compelling, empowering deeper investigation into novel SVs to resolve cases and understand new mechanisms of disease. StrVCTVRE runs rapidly and is publicly available.

Asunto(s)

Algoritmos , Genoma Humano , Variación Estructural del Genoma , Programas Informáticos , Aprendizaje Automático Supervisado , Conjuntos de Datos como Asunto , Exones , Genómica/métodos , Humanos , Curva ROC , Secuenciación Completa del Genoma/estadística & datos numéricos

3.

FAVOR: functional annotation of variants online resource and annotator for variation across the human genome.

Zhou, Hufeng; Arapoglou, Theodore; Li, Xihao; Li, Zilin; Zheng, Xiuwen; Moore, Jill; Asok, Abhijith; Kumar, Sushant; Blue, Elizabeth E; Buyske, Steven; Cox, Nancy; Felsenfeld, Adam; Gerstein, Mark; Kenny, Eimear; Li, Bingshan; Matise, Tara; Philippakis, Anthony; Rehm, Heidi L; Sofia, Heidi J; Snyder, Grace; Weng, Zhiping; Neale, Benjamin; Sunyaev, Shamil R; Lin, Xihong.

Nucleic Acids Res ; 51(D1): D1300-D1311, 2023 01 06.

Artículo en Inglés | MEDLINE | ID: mdl-36350676

RESUMEN

Large biobank-scale whole genome sequencing (WGS) studies are rapidly identifying a multitude of coding and non-coding variants. They provide an unprecedented resource for illuminating the genetic basis of human diseases. Variant functional annotations play a critical role in WGS analysis, result interpretation, and prioritization of disease- or trait-associated causal variants. Existing functional annotation databases have limited scope to perform online queries and functionally annotate the genotype data of large biobank-scale WGS studies. We develop the Functional Annotation of Variants Online Resources (FAVOR) to meet these pressing needs. FAVOR provides a comprehensive multi-faceted variant functional annotation online portal that summarizes and visualizes findings of all possible nine billion single nucleotide variants (SNVs) across the genome. It allows for rapid variant-, gene- and region-level queries of variant functional annotations. FAVOR integrates variant functional information from multiple sources to describe the functional characteristics of variants and facilitates prioritizing plausible causal variants influencing human phenotypes. Furthermore, we provide a scalable annotation tool, FAVORannotator, to functionally annotate large-scale WGS studies and efficiently store the genotype and their variant functional annotation data in a single file using the annotated Genomic Data Structure (aGDS) format, making downstream analysis more convenient. FAVOR and FAVORannotator are available at https://favor.genohub.org.

Asunto(s)

Genoma Humano , Programas Informáticos , Humanos , Anotación de Secuencia Molecular , Genómica , Genotipo , Variación Genética

4.

Leveraging pleiotropy to discover and interpret GWAS results for sleep-associated traits.

Chun, Sung; Akle, Sebastian; Teodosiadis, Athanasios; Cade, Brian E; Wang, Heming; Sofer, Tamar; Evans, Daniel S; Stone, Katie L; Gharib, Sina A; Mukherjee, Sutapa; Palmer, Lyle J; Hillman, David; Rotter, Jerome I; Hanis, Craig L; Stamatoyannopoulos, John A; Redline, Susan; Cotsapas, Chris; Sunyaev, Shamil R.

PLoS Genet ; 18(12): e1010557, 2022 12.

Artículo en Inglés | MEDLINE | ID: mdl-36574455

RESUMEN

Genetic association studies of many heritable traits resulting from physiological testing often have modest sample sizes due to the cost and burden of the required phenotyping. This reduces statistical power and limits discovery of multiple genetic associations. We present a strategy to leverage pleiotropy between traits to both discover new loci and to provide mechanistic hypotheses of the underlying pathophysiology. Specifically, we combine a colocalization test with a locus-level test of pleiotropy. In simulations, we show that this approach is highly selective for identifying true pleiotropy driven by the same causative variant, thereby improves the chance to replicate the associations in underpowered validation cohorts and leads to higher interpretability. Here, as an exemplar, we use Obstructive Sleep Apnea (OSA), a common disorder diagnosed using overnight multi-channel physiological testing. We leverage pleiotropy with relevant cellular and cardio-metabolic phenotypes and gene expression traits to map new risk loci in an underpowered OSA GWAS. We identify several pleiotropic loci harboring suggestive associations to OSA and genome-wide significant associations to other traits, and show that their OSA association replicates in independent cohorts of diverse ancestries. By investigating pleiotropic loci, our strategy allows proposing new hypotheses about OSA pathobiology across many physiological layers. For example, we identify and replicate the pleiotropy across the plateletcrit, OSA and an eQTL of DNA primase subunit 1 (PRIM1) in immune cells. We find suggestive links between OSA, a measure of lung function (FEV1/FVC), and an eQTL of matrix metallopeptidase 15 (MMP15) in lung tissue. We also link a previously known genome-wide significant peak for OSA in the hexokinase 1 (HK1) locus to hematocrit and other red blood cell related traits. Thus, the analysis of pleiotropic associations has the potential to assemble diverse phenotypes into a chain of mechanistic hypotheses that provide insight into the pathogenesis of complex human diseases.

Asunto(s)

Estudio de Asociación del Genoma Completo , Apnea Obstructiva del Sueño , Humanos , Estudio de Asociación del Genoma Completo/métodos , Fenotipo , Estudios de Asociación Genética , Sueño , Pleiotropía Genética , Polimorfismo de Nucleótido Simple , ADN Primasa

5.

Purifying selection on noncoding deletions of human regulatory loci detected using their cellular pleiotropy.

Radke, David W; Sul, Jae Hoon; Balick, Daniel J; Akle, Sebastian; Green, Robert C; Sunyaev, Shamil R.

Genome Res ; 31(6): 935-946, 2021 06.

Artículo en Inglés | MEDLINE | ID: mdl-33963077

RESUMEN

Genomic deletions provide a powerful loss-of-function model in noncoding regions to assess the role of purifying selection on genetic variation. Regulatory element function is characterized by nonuniform tissue and cell type activity, necessarily linking the study of fitness consequences from regulatory variants to their corresponding cellular activity. We generated a callset of deletions from genomes in the Alzheimer's Disease Neuroimaging Initiative (ADNI) and used deletions from The 1000 Genomes Project Consortium (1000GP) in order to examine whether purifying selection preserves noncoding sites of chromatin accessibility marked by DNase I hypersensitivity (DHS), histone modification (enhancer, transcribed, Polycomb-repressed, heterochromatin), and chromatin loop anchors. To examine this in a cellular activity-aware manner, we developed a statistical method, pleiotropy ratio score (PlyRS), which calculates a correlation-adjusted count of "cellular pleiotropy" for each noncoding base pair by analyzing shared regulatory annotations across tissues and cell types. By comparing real deletion PlyRS values to simulations in a length-matched framework and by using genomic covariates in analyses, we found that purifying selection acts to preserve both DHS and enhancer noncoding sites. However, we did not find evidence of purifying selection for noncoding transcribed, Polycomb-repressed, or heterochromatin sites beyond that of the noncoding background. Additionally, we found evidence that purifying selection is acting on chromatin loop integrity by preserving colocalized CTCF binding sites. At regions of DHS, enhancer, and CTCF within chromatin loop anchors, we found evidence that both sites of activity specific to a particular tissue or cell type and sites of cellularly pleiotropic activity are preserved by selection.

Asunto(s)

Cromatina , Genómica , Sitios de Unión , Cromatina/genética , Humanos , Proteínas del Grupo Polycomb/metabolismo

6.

Non-parametric Polygenic Risk Prediction via Partitioned GWAS Summary Statistics.

Chun, Sung; Imakaev, Maxim; Hui, Daniel; Patsopoulos, Nikolaos A; Neale, Benjamin M; Kathiresan, Sekar; Stitziel, Nathan O; Sunyaev, Shamil R.

Am J Hum Genet ; 107(1): 46-59, 2020 07 02.

Artículo en Inglés | MEDLINE | ID: mdl-32470373

RESUMEN

In complex trait genetics, the ability to predict phenotype from genotype is the ultimate measure of our understanding of genetic architecture underlying the heritability of a trait. A complete understanding of the genetic basis of a trait should allow for predictive methods with accuracies approaching the trait's heritability. The highly polygenic nature of quantitative traits and most common phenotypes has motivated the development of statistical strategies focused on combining myriad individually non-significant genetic effects. Now that predictive accuracies are improving, there is a growing interest in the practical utility of such methods for predicting risk of common diseases responsive to early therapeutic intervention. However, existing methods require individual-level genotypes or depend on accurately specifying the genetic architecture underlying each disease to be predicted. Here, we propose a polygenic risk prediction method that does not require explicitly modeling any underlying genetic architecture. We start with summary statistics in the form of SNP effect sizes from a large GWAS cohort. We then remove the correlation structure across summary statistics arising due to linkage disequilibrium and apply a piecewise linear interpolation on conditional mean effects. In both simulated and real datasets, this new non-parametric shrinkage (NPS) method can reliably allow for linkage disequilibrium in summary statistics of 5 million dense genome-wide markers and consistently improves prediction accuracy. We show that NPS improves the identification of groups at high risk for breast cancer, type 2 diabetes, inflammatory bowel disease, and coronary heart disease, all of which have available early intervention or prevention treatments.

Asunto(s)

Herencia Multifactorial/genética , Anciano , Estudios de Cohortes , Diabetes Mellitus Tipo 2/genética , Femenino , Estudio de Asociación del Genoma Completo/métodos , Genotipo , Humanos , Desequilibrio de Ligamiento/genética , Masculino , Persona de Mediana Edad , Modelos Genéticos , Fenotipo , Polimorfismo de Nucleótido Simple/genética , Sitios de Carácter Cuantitativo/genética

7.

Associations of variants In the hexokinase 1 and interleukin 18 receptor regions with oxyhemoglobin saturation during sleep.

Cade, Brian E; Chen, Han; Stilp, Adrienne M; Louie, Tin; Ancoli-Israel, Sonia; Arens, Raanan; Barfield, Richard; Below, Jennifer E; Cai, Jianwen; Conomos, Matthew P; Evans, Daniel S; Frazier-Wood, Alexis C; Gharib, Sina A; Gleason, Kevin J; Gottlieb, Daniel J; Hillman, David R; Johnson, W Craig; Lederer, David J; Lee, Jiwon; Loredo, Jose S; Mei, Hao; Mukherjee, Sutapa; Patel, Sanjay R; Post, Wendy S; Purcell, Shaun M; Ramos, Alberto R; Reid, Kathryn J; Rice, Ken; Shah, Neomi A; Sofer, Tamar; Taylor, Kent D; Thornton, Timothy A; Wang, Heming; Yaffe, Kristine; Zee, Phyllis C; Hanis, Craig L; Palmer, Lyle J; Rotter, Jerome I; Stone, Katie L; Tranah, Gregory J; Wilson, James G; Sunyaev, Shamil R; Laurie, Cathy C; Zhu, Xiaofeng; Saxena, Richa; Lin, Xihong; Redline, Susan.

PLoS Genet ; 15(4): e1007739, 2019 04.

Artículo en Inglés | MEDLINE | ID: mdl-30990817

RESUMEN

Sleep disordered breathing (SDB)-related overnight hypoxemia is associated with cardiometabolic disease and other comorbidities. Understanding the genetic bases for variations in nocturnal hypoxemia may help understand mechanisms influencing oxygenation and SDB-related mortality. We conducted genome-wide association tests across 10 cohorts and 4 populations to identify genetic variants associated with three correlated measures of overnight oxyhemoglobin saturation: average and minimum oxyhemoglobin saturation during sleep and the percent of sleep with oxyhemoglobin saturation under 90%. The discovery sample consisted of 8,326 individuals. Variants with p < 1 × 10(-6) were analyzed in a replication group of 14,410 individuals. We identified 3 significantly associated regions, including 2 regions in multi-ethnic analyses (2q12, 10q22). SNPs in the 2q12 region associated with minimum SpO2 (rs78136548 p = 2.70 × 10(-10)). SNPs at 10q22 were associated with all three traits including average SpO2 (rs72805692 p = 4.58 × 10(-8)). SNPs in both regions were associated in over 20,000 individuals and are supported by prior associations or functional evidence. Four additional significant regions were detected in secondary sex-stratified and combined discovery and replication analyses, including a region overlapping Reelin, a known marker of respiratory complex neurons.These are the first genome-wide significant findings reported for oxyhemoglobin saturation during sleep, a phenotype of high clinical interest. Our replicated associations with HK1 and IL18R1 suggest that variants in inflammatory pathways, such as the biologically-plausible NLRP3 inflammasome, may contribute to nocturnal hypoxemia.

Asunto(s)

Hexoquinasa/genética , Subunidad alfa del Receptor de Interleucina-18/genética , Oxihemoglobinas/metabolismo , Sueño/genética , Adolescente , Adulto , Anciano , Anciano de 80 o más Años , Moléculas de Adhesión Celular Neuronal/genética , Biología Computacional , Proteínas de la Matriz Extracelular/genética , Femenino , Redes Reguladoras de Genes , Variación Genética , Estudio de Asociación del Genoma Completo , Humanos , Hipoxia/sangre , Hipoxia/genética , Masculino , Persona de Mediana Edad , Proteína con Dominio Pirina 3 de la Familia NLR/genética , Proteínas del Tejido Nervioso/genética , Oxígeno/sangre , Polimorfismo de Nucleótido Simple , Sitios de Carácter Cuantitativo , Proteína Reelina , Serina Endopeptidasas/genética , Síndromes de la Apnea del Sueño/sangre , Síndromes de la Apnea del Sueño/genética , Adulto Joven

8.

Admixture mapping identifies novel loci for obstructive sleep apnea in Hispanic/Latino Americans.

Wang, Heming; Cade, Brian E; Sofer, Tamar; Sands, Scott A; Chen, Han; Browning, Sharon R; Stilp, Adrienne M; Louie, Tin L; Thornton, Timothy A; Johnson, W Craig; Below, Jennifer E; Conomos, Matthew P; Evans, Daniel S; Gharib, Sina A; Guo, Xiuqing; Wood, Alexis C; Mei, Hao; Yaffe, Kristine; Loredo, Jose S; Ramos, Alberto R; Barrett-Connor, Elizabeth; Ancoli-Israel, Sonia; Zee, Phyllis C; Arens, Raanan; Shah, Neomi A; Taylor, Kent D; Tranah, Gregory J; Stone, Katie L; Hanis, Craig L; Wilson, James G; Gottlieb, Daniel J; Patel, Sanjay R; Rice, Ken; Post, Wendy S; Rotter, Jerome I; Sunyaev, Shamil R; Cai, Jianwen; Lin, Xihong; Purcell, Shaun M; Laurie, Cathy C; Saxena, Richa; Redline, Susan; Zhu, Xiaofeng.

Hum Mol Genet ; 28(4): 675-687, 2019 02 15.

Artículo en Inglés | MEDLINE | ID: mdl-30403821

RESUMEN

Obstructive sleep apnea (OSA) is a common disorder associated with increased risk of cardiovascular disease and mortality. Its prevalence and severity vary across ancestral background. Although OSA traits are heritable, few genetic associations have been identified. To identify genetic regions associated with OSA and improve statistical power, we applied admixture mapping on three primary OSA traits [the apnea hypopnea index (AHI), overnight average oxyhemoglobin saturation (SaO2) and percentage time SaO2 < 90%] and a secondary trait (respiratory event duration) in a Hispanic/Latino American population study of 11 575 individuals with significant variation in ancestral background. Linear mixed models were performed using previously inferred African, European and Amerindian local genetic ancestry markers. Global African ancestry was associated with a lower AHI, higher SaO2 and shorter event duration. Admixture mapping analysis of the primary OSA traits identified local African ancestry at the chromosomal region 2q37 as genome-wide significantly associated with AHI (P < 5.7 × 10-5), and European and Amerindian ancestries at 18q21 suggestively associated with both AHI and percentage time SaO2 < 90% (P < 10-3). Follow-up joint ancestry-SNP association analyses identified novel variants in ferrochelatase (FECH), significantly associated with AHI and percentage time SaO2 < 90% after adjusting for multiple tests (P < 8 × 10-6). These signals contributed to the admixture mapping associations and were replicated in independent cohorts. In this first admixture mapping study of OSA, novel associations with variants in the iron/heme metabolism pathway suggest a role for iron in influencing respiratory traits underlying OSA.

Asunto(s)

Ferroquelatasa/genética , Estudio de Asociación del Genoma Completo , Apnea Obstructiva del Sueño/genética , Anciano , Mapeo Cromosómico , Femenino , Genotipo , Hispánicos o Latinos/genética , Humanos , Masculino , Persona de Mediana Edad , Polimorfismo de Nucleótido Simple/genética , Polisomnografía , Apnea Obstructiva del Sueño/diagnóstico por imagen , Apnea Obstructiva del Sueño/fisiopatología , Población Blanca/genética

9.

Commonalities across computational workflows for uncovering explanatory variants in undiagnosed cases.

Kobren, Shilpa Nadimpalli; Baldridge, Dustin; Velinder, Matt; Krier, Joel B; LeBlanc, Kimberly; Esteves, Cecilia; Pusey, Barbara N; Züchner, Stephan; Blue, Elizabeth; Lee, Hane; Huang, Alden; Bastarache, Lisa; Bican, Anna; Cogan, Joy; Marwaha, Shruti; Alkelai, Anna; Murdock, David R; Liu, Pengfei; Wegner, Daniel J; Paul, Alexander J; Sunyaev, Shamil R; Kohane, Isaac S.

Genet Med ; 23(6): 1075-1085, 2021 06.

Artículo en Inglés | MEDLINE | ID: mdl-33580225

RESUMEN

PURPOSE: Genomic sequencing has become an increasingly powerful and relevant tool to be leveraged for the discovery of genetic aberrations underlying rare, Mendelian conditions. Although the computational tools incorporated into diagnostic workflows for this task are continually evolving and improving, we nevertheless sought to investigate commonalities across sequencing processing workflows to reveal consensus and standard practice tools and highlight exploratory analyses where technical and theoretical method improvements would be most impactful. METHODS: We collected details regarding the computational approaches used by a genetic testing laboratory and 11 clinical research sites in the United States participating in the Undiagnosed Diseases Network via meetings with bioinformaticians, online survey forms, and analyses of internal protocols. RESULTS: We found that tools for processing genomic sequencing data can be grouped into four distinct categories. Whereas well-established practices exist for initial variant calling and quality control steps, there is substantial divergence across sites in later stages for variant prioritization and multimodal data integration, demonstrating a diversity of approaches for solving the most mysterious undiagnosed cases. CONCLUSION: The largest differences across diagnostic workflows suggest that advances in structural variant detection, noncoding variant interpretation, and integration of additional biomedical data may be especially promising for solving chronically undiagnosed cases.

Asunto(s)

Genómica , Enfermedades no Diagnosticadas , Biología Computacional , Pruebas Genéticas , Genoma , Humanos , Programas Informáticos , Flujo de Trabajo

10.

Identification of cis-suppression of human disease mutations by comparative genomics.

Jordan, Daniel M; Frangakis, Stephan G; Golzio, Christelle; Cassa, Christopher A; Kurtzberg, Joanne; Davis, Erica E; Sunyaev, Shamil R; Katsanis, Nicholas.

Nature ; 524(7564): 225-9, 2015 Aug 13.

Artículo en Inglés | MEDLINE | ID: mdl-26123021

RESUMEN

Patterns of amino acid conservation have served as a tool for understanding protein evolution. The same principles have also found broad application in human genomics, driven by the need to interpret the pathogenic potential of variants in patients. Here we performed a systematic comparative genomics analysis of human disease-causing missense variants. We found that an appreciable fraction of disease-causing alleles are fixed in the genomes of other species, suggesting a role for genomic context. We developed a model of genetic interactions that predicts most of these to be simple pairwise compensations. Functional testing of this model on two known human disease genes revealed discrete cis amino acid residues that, although benign on their own, could rescue the human mutations in vivo. This approach was also applied to ab initio gene discovery to support the identification of a de novo disease driver in BTG2 that is subject to protective cis-modification in more than 50 species. Finally, on the basis of our data and models, we developed a computational tool to predict candidate residues subject to compensation. Taken together, our data highlight the importance of cis-genomic context as a contributor to protein evolution; they provide an insight into the complexity of allele effect on phenotype; and they are likely to assist methods for predicting allele pathogenicity.

Asunto(s)

Enfermedad/genética , Genómica , Mutación Missense/genética , Supresión Genética/genética , Proteínas Adaptadoras Transductoras de Señales/genética , Alelos , Animales , Evolución Molecular , Genoma Humano/genética , Humanos , Proteínas Inmediatas-Precoces/genética , Microcefalia/genética , Proteínas Asociadas a Microtúbulos , Fenotipo , Proteínas/genética , Alineación de Secuencia , Proteínas Supresoras de Tumor/genética

11.

Cell-of-origin chromatin organization shapes the mutational landscape of cancer.

Polak, Paz; Karlic, Rosa; Koren, Amnon; Thurman, Robert; Sandstrom, Richard; Lawrence, Michael; Reynolds, Alex; Rynes, Eric; Vlahovicek, Kristian; Stamatoyannopoulos, John A; Sunyaev, Shamil R.

Nature ; 518(7539): 360-364, 2015 Feb 19.

Artículo en Inglés | MEDLINE | ID: mdl-25693567

RESUMEN

Cancer is a disease potentiated by mutations in somatic cells. Cancer mutations are not distributed uniformly along the human genome. Instead, different human genomic regions vary by up to fivefold in the local density of cancer somatic mutations, posing a fundamental problem for statistical methods used in cancer genomics. Epigenomic organization has been proposed as a major determinant of the cancer mutational landscape. However, both somatic mutagenesis and epigenomic features are highly cell-type-specific. We investigated the distribution of mutations in multiple independent samples of diverse cancer types and compared them to cell-type-specific epigenomic features. Here we show that chromatin accessibility and modification, together with replication timing, explain up to 86% of the variance in mutation rates along cancer genomes. The best predictors of local somatic mutation density are epigenomic features derived from the most likely cell type of origin of the corresponding malignancy. Moreover, we find that cell-of-origin chromatin features are much stronger determinants of cancer mutation profiles than chromatin features of matched cancer cell lines. Furthermore, we show that the cell type of origin of a cancer can be accurately determined based on the distribution of mutations along its genome. Thus, the DNA sequence of a cancer genome encompasses a wealth of information about the identity and epigenomic features of its cell of origin.

Asunto(s)

Cromatina/genética , Cromatina/metabolismo , Epigénesis Genética/genética , Mutación/genética , Neoplasias/genética , Neoplasias/patología , Línea Celular Tumoral , Cromatina/química , Momento de Replicación del ADN , Epigenómica , Genoma Humano/genética , Humanos , Melanocitos/metabolismo , Melanocitos/patología , Melanoma/genética , Melanoma/patología , Especificidad de Órganos/genética

12.

Applicability of the Mutation-Selection Balance Model to Population Genetics of Heterozygous Protein-Truncating Variants in Humans.

Weghorn, Donate; Balick, Daniel J; Cassa, Christopher; Kosmicki, Jack A; Daly, Mark J; Beier, David R; Sunyaev, Shamil R.

Mol Biol Evol ; 36(8): 1701-1710, 2019 08 01.

Artículo en Inglés | MEDLINE | ID: mdl-31004148

RESUMEN

The fate of alleles in the human population is believed to be highly affected by the stochastic force of genetic drift. Estimation of the strength of natural selection in humans generally necessitates a careful modeling of drift including complex effects of the population history and structure. Protein-truncating variants (PTVs) are expected to evolve under strong purifying selection and to have a relatively high per-gene mutation rate. Thus, it is appealing to model the population genetics of PTVs under a simple deterministic mutation-selection balance, as has been proposed earlier (Cassa et al. 2017). Here, we investigated the limits of this approximation using both computer simulations and data-driven approaches. Our simulations rely on a model of demographic history estimated from 33,370 individual exomes of the Non-Finnish European subset of the ExAC data set (Lek et al. 2016). Additionally, we compared the African and European subset of the ExAC study and analyzed de novo PTVs. We show that the mutation-selection balance model is applicable to the majority of human genes, but not to genes under the weakest selection.

Asunto(s)

Codón sin Sentido , Flujo Genético , Modelos Genéticos , Selección Genética , Humanos , Crecimiento Demográfico

13.

Genomic variation landscape of the human gut microbiome.

Schloissnig, Siegfried; Arumugam, Manimozhiyan; Sunagawa, Shinichi; Mitreva, Makedonka; Tap, Julien; Zhu, Ana; Waller, Alison; Mende, Daniel R; Kultima, Jens Roat; Martin, John; Kota, Karthik; Sunyaev, Shamil R; Weinstock, George M; Bork, Peer.

Nature ; 493(7430): 45-50, 2013 Jan 03.

Artículo en Inglés | MEDLINE | ID: mdl-23222524

RESUMEN

Whereas large-scale efforts have rapidly advanced the understanding and practical impact of human genomic variation, the practical impact of variation is largely unexplored in the human microbiome. We therefore developed a framework for metagenomic variation analysis and applied it to 252 faecal metagenomes of 207 individuals from Europe and North America. Using 7.4 billion reads aligned to 101 reference species, we detected 10.3 million single nucleotide polymorphisms (SNPs), 107,991 short insertions/deletions, and 1,051 structural variants. The average ratio of non-synonymous to synonymous polymorphism rates of 0.11 was more variable between gut microbial species than across human hosts. Subjects sampled at varying time intervals exhibited individuality and temporal stability of SNP variation patterns, despite considerable composition changes of their gut microbiota. This indicates that individual-specific strains are not easily replaced and that an individual might have a unique metagenomic genotype, which may be exploitable for personalized diet or drug intake.

Asunto(s)

Variación Genética/genética , Intestinos/microbiología , Metagenoma/genética , Europa (Continente) , Heces/microbiología , Genoma Bacteriano/genética , Genotipo , Mapeo Geográfico , Humanos , América del Norte , Polimorfismo de Nucleótido Simple/genética , Estándares de Referencia , Factores de Tiempo

14.

Variants in angiopoietin-2 (ANGPT2) contribute to variation in nocturnal oxyhaemoglobin saturation level.

Wang, Heming; Cade, Brian E; Chen, Han; Gleason, Kevin J; Saxena, Richa; Feng, Tao; Larkin, Emma K; Vasan, Ramachandran S; Lin, Honghuang; Patel, Sanjay R; Tracy, Russell P; Liu, Yongmei; Gottlieb, Daniel J; Below, Jennifer E; Hanis, Craig L; Petty, Lauren E; Sunyaev, Shamil R; Frazier-Wood, Alexis C; Rotter, Jerome I; Post, Wendy; Lin, Xihong; Redline, Susan; Zhu, Xiaofeng.

Hum Mol Genet ; 25(23): 5244-5253, 2016 12 01.

Artículo en Inglés | MEDLINE | ID: mdl-27798093

RESUMEN

Genetic determinants of sleep-disordered breathing (SDB), a common set of disorders that contribute to significant cardiovascular and neuropsychiatric morbidity, are not clear. Overnight nocturnal oxygen saturation (SaO2) is a clinically relevant and easily measured indicator of SDB severity but its genetic contribution has never been studied. Our recent study suggests nocturnal SaO2 is heritable. We performed linkage analysis, association analysis and haplotype analysis of average nocturnal oxyhaemoglobin saturation in participants in the Cleveland Family Study (CFS), followed by gene-based association and additional tests in four independent samples. Linkage analysis identified a peak (LOD = 4.29) on chromosome 8p23. Follow-up association analysis identified two haplotypes in angiopoietin-2 (ANGPT2) that significantly contributed to the variation of SaO2 (P = 8 × 10-5) and accounted for a portion of the linkage evidence. Gene-based association analysis replicated the association of ANGPT2 and nocturnal SaO2. A rare missense SNP rs200291021 in ANGPT2 was associated with serum angiopoietin-2 level (P = 1.29 × 10-4), which was associated with SaO2 (P = 0.002). Our study provides the first evidence for the association of ANGPT2, a gene previously implicated in acute lung injury syndromes, with nocturnal SaO2, suggesting that this gene has a broad range of effects on gas exchange, including influencing oxygenation during sleep.

Asunto(s)

Angiopoyetina 2/genética , Consumo de Oxígeno/genética , Oxihemoglobinas/genética , Síndromes de la Apnea del Sueño/genética , Adulto , Femenino , Estudios de Asociación Genética , Ligamiento Genético , Predisposición Genética a la Enfermedad , Haplotipos/genética , Humanos , Masculino , Oxígeno/metabolismo , Polimorfismo de Nucleótido Simple , Respiración/genética , Sueño/genética , Síndromes de la Apnea del Sueño/metabolismo , Síndromes de la Apnea del Sueño/patología

15.

Leveraging Distant Relatedness to Quantify Human Mutation and Gene-Conversion Rates.

Palamara, Pier Francesco; Francioli, Laurent C; Wilton, Peter R; Genovese, Giulio; Gusev, Alexander; Finucane, Hilary K; Sankararaman, Sriram; Sunyaev, Shamil R; de Bakker, Paul I W; Wakeley, John; Pe'er, Itsik; Price, Alkes L.

Am J Hum Genet ; 97(6): 775-89, 2015 Dec 03.

Artículo en Inglés | MEDLINE | ID: mdl-26581902

RESUMEN

The rate at which human genomes mutate is a central biological parameter that has many implications for our ability to understand demographic and evolutionary phenomena. We present a method for inferring mutation and gene-conversion rates by using the number of sequence differences observed in identical-by-descent (IBD) segments together with a reconstructed model of recent population-size history. This approach is robust to, and can quantify, the presence of substantial genotyping error, as validated in coalescent simulations. We applied the method to 498 trio-phased sequenced Dutch individuals and inferred a point mutation rate of 1.66 × 10(-8) per base per generation and a rate of 1.26 × 10(-9) for <20 bp indels. By quantifying how estimates varied as a function of allele frequency, we inferred the probability that a site is involved in non-crossover gene conversion as 5.99 × 10(-6). We found that recombination does not have observable mutagenic effects after gene conversion is accounted for and that local gene-conversion rates reflect recombination rates. We detected a strong enrichment of recent deleterious variation among mismatching variants found within IBD regions and observed summary statistics of local sharing of IBD segments to closely match previously proposed metrics of background selection; however, we found no significant effects of selection on our mutation-rate estimates. We detected no evidence of strong variation of mutation rates in a number of genomic annotations obtained from several recent studies. Our analysis suggests that a mutation-rate estimate higher than that reported by recent pedigree-based studies should be adopted in the context of DNA-based demographic reconstruction.

Asunto(s)

Genoma Humano , Mutación de Línea Germinal , Modelos Genéticos , Tasa de Mutación , Alelos , Frecuencia de los Genes , Haplotipos , Humanos , Mutación INDEL , Modelos Lineales , Recombinación Genética

16.

Dominance of Deleterious Alleles Controls the Response to a Population Bottleneck.

Balick, Daniel J; Do, Ron; Cassa, Christopher A; Reich, David; Sunyaev, Shamil R.

PLoS Genet ; 11(8): e1005436, 2015 Aug.

Artículo en Inglés | MEDLINE | ID: mdl-26317225

RESUMEN

Population bottlenecks followed by re-expansions have been common throughout history of many populations. The response of alleles under selection to such demographic perturbations has been a subject of great interest in population genetics. On the basis of theoretical analysis and computer simulations, we suggest that this response qualitatively depends on dominance. The number of dominant or additive deleterious alleles per haploid genome is expected to be slightly increased following the bottleneck and re-expansion. In contrast, the number of completely or partially recessive alleles should be sharply reduced. Changes of population size expose differences between recessive and additive selection, potentially providing insight into the prevalence of dominance in natural populations. Specifically, we use a simple statistic, [Formula: see text], where xi represents the derived allele frequency, to compare the number of mutations in different populations, and detail its functional dependence on the strength of selection and the intensity of the population bottleneck. We also provide empirical evidence showing that gene sets associated with autosomal recessive disease in humans may have a BR indicative of recessive selection. Together, these theoretical predictions and empirical observations show that complex demographic history may facilitate rather than impede inference of parameters of natural selection.

Asunto(s)

Frecuencia de los Genes/genética , Genes Dominantes/genética , Genética de Población/estadística & datos numéricos , Dinámica Poblacional/estadística & datos numéricos , Animales , Evolución Biológica , Simulación por Computador , Humanos , Modelos Genéticos , Modelos Estadísticos , Selección Genética

17.

Excess of Deleterious Mutations around HLA Genes Reveals Evolutionary Cost of Balancing Selection.

Lenz, Tobias L; Spirin, Victor; Jordan, Daniel M; Sunyaev, Shamil R.

Mol Biol Evol ; 33(10): 2555-64, 2016 10.

Artículo en Inglés | MEDLINE | ID: mdl-27436009

RESUMEN

Deleterious mutations are expected to evolve under negative selection and are usually purged from the population. However, deleterious alleles segregate in the human population and some disease-associated variants are maintained at considerable frequencies. Here, we test the hypothesis that balancing selection may counteract purifying selection in neighboring regions and thus maintain deleterious variants at higher frequency than expected from their detrimental fitness effect. We first show in realistic simulations that balancing selection reduces the density of polymorphic sites surrounding a locus under balancing selection, but at the same time markedly increases the population frequency of the remaining variants, including even substantially deleterious alleles. To test the predictions of our simulations empirically, we then use whole-exome sequencing data from 6,500 human individuals and focus on the most established example for balancing selection in the human genome, the major histocompatibility complex (MHC). Our analysis shows an elevated frequency of putatively deleterious coding variants in nonhuman leukocyte antigen (non-HLA) genes localized in the MHC region. The mean frequency of these variants declined with physical distance from the classical HLA genes, indicating dependency on genetic linkage. These results reveal an indirect cost of the genetic diversity maintained by balancing selection, which has hitherto been perceived as mostly advantageous, and have implications both for the evolution of recombination and also for the epidemiology of various MHC-associated diseases.

Asunto(s)

Antígenos HLA/genética , Complejo Mayor de Histocompatibilidad/genética , Selección Genética , Eliminación de Secuencia , Alelos , Evolución Biológica , Simulación por Computador , Bases de Datos Genéticas , Evolución Molecular , Frecuencia de los Genes/genética , Variación Genética , Genoma Humano , Haplotipos/genética , Humanos , Polimorfismo Genético/genética

18.

Genome sequencing reveals insights into physiology and longevity of the naked mole rat.

Kim, Eun Bae; Fang, Xiaodong; Fushan, Alexey A; Huang, Zhiyong; Lobanov, Alexei V; Han, Lijuan; Marino, Stefano M; Sun, Xiaoqing; Turanov, Anton A; Yang, Pengcheng; Yim, Sun Hee; Zhao, Xiang; Kasaikina, Marina V; Stoletzki, Nina; Peng, Chunfang; Polak, Paz; Xiong, Zhiqiang; Kiezun, Adam; Zhu, Yabing; Chen, Yuanxin; Kryukov, Gregory V; Zhang, Qiang; Peshkin, Leonid; Yang, Lan; Bronson, Roderick T; Buffenstein, Rochelle; Wang, Bo; Han, Changlei; Li, Qiye; Chen, Li; Zhao, Wei; Sunyaev, Shamil R; Park, Thomas J; Zhang, Guojie; Wang, Jun; Gladyshev, Vadim N.

Nature ; 479(7372): 223-7, 2011 Oct 12.

Artículo en Inglés | MEDLINE | ID: mdl-21993625

RESUMEN

The naked mole rat (Heterocephalus glaber) is a strictly subterranean, extraordinarily long-lived eusocial mammal. Although it is the size of a mouse, its maximum lifespan exceeds 30 years, making this animal the longest-living rodent. Naked mole rats show negligible senescence, no age-related increase in mortality, and high fecundity until death. In addition to delayed ageing, they are resistant to both spontaneous cancer and experimentally induced tumorigenesis. Naked mole rats pose a challenge to the theories that link ageing, cancer and redox homeostasis. Although characterized by significant oxidative stress, the naked mole rat proteome does not show age-related susceptibility to oxidative damage or increased ubiquitination. Naked mole rats naturally reside in large colonies with a single breeding female, the 'queen', who suppresses the sexual maturity of her subordinates. They also live in full darkness, at low oxygen and high carbon dioxide concentrations, and are unable to sustain thermogenesis nor feel certain types of pain. Here we report the sequencing and analysis of the naked mole rat genome, which reveals unique genome features and molecular adaptations consistent with cancer resistance, poikilothermy, hairlessness and insensitivity to low oxygen, and altered visual function, circadian rythms and taste sensing. This information provides insights into the naked mole rat's exceptional longevity and ability to live in hostile conditions, in the dark and at low oxygen. The extreme traits of the naked mole rat, together with the reported genome and transcriptome information, offer opportunities for understanding ageing and advancing other areas of biological and biomedical research.

Asunto(s)

Adaptación Fisiológica/genética , Genoma/genética , Longevidad/genética , Ratas Topo/genética , Ratas Topo/fisiología , Envejecimiento/genética , Secuencia de Aminoácidos , Animales , Regulación de la Temperatura Corporal/genética , Dióxido de Carbono/análisis , Dióxido de Carbono/metabolismo , Ritmo Circadiano/genética , Oscuridad , Genes/genética , Inestabilidad Genómica/genética , Genómica , Humanos , Canales Iónicos/genética , Longevidad/fisiología , Masculino , Proteínas Mitocondriales/genética , Datos de Secuencia Molecular , Mutagénesis/genética , Oxígeno/análisis , Oxígeno/metabolismo , Gusto/genética , Transcriptoma/genética , Proteína Desacopladora 1 , Percepción Visual/genética

19.

Genetic Associations with Obstructive Sleep Apnea Traits in Hispanic/Latino Americans.

Cade, Brian E; Chen, Han; Stilp, Adrienne M; Gleason, Kevin J; Sofer, Tamar; Ancoli-Israel, Sonia; Arens, Raanan; Bell, Graeme I; Below, Jennifer E; Bjonnes, Andrew C; Chun, Sung; Conomos, Matthew P; Evans, Daniel S; Johnson, W Craig; Frazier-Wood, Alexis C; Lane, Jacqueline M; Larkin, Emma K; Loredo, Jose S; Post, Wendy S; Ramos, Alberto R; Rice, Ken; Rotter, Jerome I; Shah, Neomi A; Stone, Katie L; Taylor, Kent D; Thornton, Timothy A; Tranah, Gregory J; Wang, Chaolong; Zee, Phyllis C; Hanis, Craig L; Sunyaev, Shamil R; Patel, Sanjay R; Laurie, Cathy C; Zhu, Xiaofeng; Saxena, Richa; Lin, Xihong; Redline, Susan.

Am J Respir Crit Care Med ; 194(7): 886-897, 2016 Oct 01.

Artículo en Inglés | MEDLINE | ID: mdl-26977737

RESUMEN

RATIONALE: Obstructive sleep apnea is a common disorder associated with increased risk for cardiovascular disease, diabetes, and premature mortality. Although there is strong clinical and epidemiologic evidence supporting the importance of genetic factors in influencing obstructive sleep apnea, its genetic basis is still largely unknown. Prior genetic studies focused on traits defined using the apnea-hypopnea index, which contains limited information on potentially important genetically determined physiologic factors, such as propensity for hypoxemia and respiratory arousability. OBJECTIVES: To define novel obstructive sleep apnea genetic risk loci for obstructive sleep apnea, we conducted genome-wide association studies of quantitative traits in Hispanic/Latino Americans from three cohorts. METHODS: Genome-wide data from as many as 12,558 participants in the Hispanic Community Health Study/Study of Latinos, Multi-Ethnic Study of Atherosclerosis, and Starr County Health Studies population-based cohorts were metaanalyzed for association with the apnea-hypopnea index, average oxygen saturation during sleep, and average respiratory event duration. MEASUREMENTS AND MAIN RESULTS: Two novel loci were identified at genome-level significance (rs11691765, GPR83, P = 1.90 × 10-8 for the apnea-hypopnea index, and rs35424364; C6ORF183/CCDC162P, P = 4.88 × 10-8 for respiratory event duration) and seven additional loci were identified with suggestive significance (P < 5 × 10-7). Secondary sex-stratified analyses also identified one significant and several suggestive associations. Multiple loci overlapped genes with biologic plausibility. CONCLUSIONS: These are the first genome-level significant findings reported for obstructive sleep apnea-related physiologic traits in any population. These findings identify novel associations in inflammatory, hypoxia signaling, and sleep pathways.

20.

Searching for missing heritability: designing rare variant association studies.

Zuk, Or; Schaffner, Stephen F; Samocha, Kaitlin; Do, Ron; Hechter, Eliana; Kathiresan, Sekar; Daly, Mark J; Neale, Benjamin M; Sunyaev, Shamil R; Lander, Eric S.

Proc Natl Acad Sci U S A ; 111(4): E455-64, 2014 Jan 28.

Artículo en Inglés | MEDLINE | ID: mdl-24443550

RESUMEN

Genetic studies have revealed thousands of loci predisposing to hundreds of human diseases and traits, revealing important biological pathways and defining novel therapeutic hypotheses. However, the genes discovered to date typically explain less than half of the apparent heritability. Because efforts have largely focused on common genetic variants, one hypothesis is that much of the missing heritability is due to rare genetic variants. Studies of common variants are typically referred to as genomewide association studies, whereas studies of rare variants are often simply called sequencing studies. Because they are actually closely related, we use the terms common variant association study (CVAS) and rare variant association study (RVAS). In this paper, we outline the similarities and differences between RVAS and CVAS and describe a conceptual framework for the design of RVAS. We apply the framework to address key questions about the sample sizes needed to detect association, the relative merits of testing disruptive alleles vs. missense alleles, frequency thresholds for filtering alleles, the value of predictors of the functional impact of missense alleles, the potential utility of isolated populations, the value of gene-set analysis, and the utility of de novo mutations. The optimal design depends critically on the selection coefficient against deleterious alleles and thus varies across genes. The analysis shows that common variant and rare variant studies require similarly large sample collections. In particular, a well-powered RVAS should involve discovery sets with at least 25,000 cases, together with a substantial replication set.

Asunto(s)

Variación Genética , Frecuencia de los Genes , Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo , Humanos , Mutación

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

RESUMEN

Asunto(s)

ENVIAR RESULTADO:

SELECCIÓN DE REFERENCIAS

DETALLE DE LA BÚSQUEDA