Búsqueda | Portal de Búsqueda de la BVS Ecuador

1.

Uganda Genome Resource Enables Insights into Population History and Genomic Discovery in Africa.

Gurdasani, Deepti; Carstensen, Tommy; Fatumo, Segun; Chen, Guanjie; Franklin, Chris S; Prado-Martinez, Javier; Bouman, Heleen; Abascal, Federico; Haber, Marc; Tachmazidou, Ioanna; Mathieson, Iain; Ekoru, Kenneth; DeGorter, Marianne K; Nsubuga, Rebecca N; Finan, Chris; Wheeler, Eleanor; Chen, Li; Cooper, David N; Schiffels, Stephan; Chen, Yuan; Ritchie, Graham R S; Pollard, Martin O; Fortune, Mary D; Mentzer, Alex J; Garrison, Erik; Bergström, Anders; Hatzikotoulas, Konstantinos; Adeyemo, Adebowale; Doumatey, Ayo; Elding, Heather; Wain, Louise V; Ehret, Georg; Auer, Paul L; Kooperberg, Charles L; Reiner, Alexander P; Franceschini, Nora; Maher, Dermot; Montgomery, Stephen B; Kadie, Carl; Widmer, Chris; Xue, Yali; Seeley, Janet; Asiki, Gershim; Kamali, Anatoli; Young, Elizabeth H; Pomilla, Cristina; Soranzo, Nicole; Zeggini, Eleftheria; Pirie, Fraser; Morris, Andrew P.

Cell ; 179(4): 984-1002.e36, 2019 10 31.

Artículo en Inglés | MEDLINE | ID: mdl-31675503

RESUMEN

Genomic studies in African populations provide unique opportunities to understand disease etiology, human diversity, and population history. In the largest study of its kind, comprising genome-wide data from 6,400 individuals and whole-genome sequences from 1,978 individuals from rural Uganda, we find evidence of geographically correlated fine-scale population substructure. Historically, the ancestry of modern Ugandans was best represented by a mixture of ancient East African pastoralists. We demonstrate the value of the largest sequence panel from Africa to date as an imputation resource. Examining 34 cardiometabolic traits, we show systematic differences in trait heritability between European and African populations, probably reflecting the differential impact of genes and environment. In a multi-trait pan-African GWAS of up to 14,126 individuals, we identify novel loci associated with anthropometric, hematological, lipid, and glycemic traits. We find that several functionally important signals are driven by Africa-specific variants, highlighting the value of studying diverse populations across the region.

Asunto(s)

Población Negra/genética , Predisposición Genética a la Enfermedad , Genoma Humano/genética , Genómica , Femenino , Frecuencia de los Genes/genética , Estudio de Asociación del Genoma Completo , Humanos , Masculino , Polimorfismo de Nucleótido Simple/genética , Uganda/epidemiología , Secuenciación Completa del Genoma

2.

Genome-wide detection of human intronic AG-gain variants located between splicing branchpoints and canonical splice acceptor sites.

Zhang, Peng; Chaldebas, Matthieu; Ogishi, Masato; Al Qureshah, Fahd; Ponsin, Khoren; Feng, Yi; Rinchai, Darawan; Milisavljevic, Baptiste; Han, Ji Eun; Moncada-Vélez, Marcela; Keles, Sevgi; Schröder, Bernd; Stenson, Peter D; Cooper, David N; Cobat, Aurélie; Boisson, Bertrand; Zhang, Qian; Boisson-Dupuis, Stéphanie; Abel, Laurent; Casanova, Jean-Laurent.

Proc Natl Acad Sci U S A ; 120(46): e2314225120, 2023 Nov 14.

Artículo en Inglés | MEDLINE | ID: mdl-37931111

RESUMEN

Human genetic variants that introduce an AG into the intronic region between the branchpoint (BP) and the canonical splice acceptor site (ACC) of protein-coding genes can disrupt pre-mRNA splicing. Using our genome-wide BP database, we delineated the BP-ACC segments of all human introns and found extreme depletion of AG/YAG in the [BP+8, ACC-4] high-risk region. We developed AGAIN as a genome-wide computational approach to systematically and precisely pinpoint intronic AG-gain variants within the BP-ACC regions. AGAIN identified 350 AG-gain variants from the Human Gene Mutation Database, all of which alter splicing and cause disease. Among them, 74% created new acceptor sites, whereas 31% resulted in complete exon skipping. AGAIN also predicts the protein-level products resulting from these two consequences. We performed AGAIN on our exome/genomes database of patients with severe infectious diseases but without known genetic etiology and identified a private homozygous intronic AG-gain variant in the antimycobacterial gene SPPL2A in a patient with mycobacterial disease. AGAIN also predicts a retention of six intronic nucleotides that encode an in-frame stop codon, turning AG-gain into stop-gain. This allele was then confirmed experimentally to lead to loss of function by disrupting splicing. We further showed that AG-gain variants inside the high-risk region led to misspliced products, while those outside the region did not, by two case studies in genes STAT1 and IRF7. We finally evaluated AGAIN on our 14 paired exome-RNAseq samples and found that 82% of AG-gain variants in high-risk regions showed evidence of missplicing. AGAIN is publicly available from https://hgidsoft.rockefeller.edu/AGAIN and https://github.com/casanova-lab/AGAIN.

Asunto(s)

Sitios de Empalme de ARN , Empalme del ARN , Humanos , Intrones , Mutación , Genoma

3.

Deciphering the Role of Rapidly Evolving Conserved Elements in Primate Brain Development and Exploring Their Potential Involvement in Alzheimer's Disease.

Hu, Benxia; Zhuang, Xiao-Lin; Zhou, Long; Zhang, Guojie; Cooper, David N; Wu, Dong-Dong.

Mol Biol Evol ; 41(1)2024 Jan 03.

Artículo en Inglés | MEDLINE | ID: mdl-38175672

RESUMEN

Although previous studies have identified human-specific accelerated regions as playing a key role in the recent evolution of the human brain, the characteristics and cellular functions of rapidly evolving conserved elements (RECEs) in ancestral primate lineages remain largely unexplored. Here, based on large-scale primate genome assemblies, we identify 888 RECEs that have been highly conserved in primates that exhibit significantly accelerated substitution rates in the ancestor of the Simiiformes. This primate lineage exhibits remarkable morphological innovations, including an expanded brain mass. Integrative multiomic analyses reveal that RECEs harbor sequences with potential cis-regulatory functions that are activated in the adult human brain. Importantly, genes linked to RECEs exhibit pronounced expression trajectories in the adult brain relative to the fetal stage. Furthermore, we observed an increase in the chromatin accessibility of RECEs in oligodendrocytes from individuals with Alzheimer's disease (AD) compared to that of a control group, indicating that these RECEs may contribute to brain aging and AD. Our findings serve to expand our knowledge of the genetic underpinnings of brain function during primate evolution.

Asunto(s)

Enfermedad de Alzheimer , Animales , Humanos , Enfermedad de Alzheimer/genética , Evolución Molecular , Primates/genética , Encéfalo

4.

Genome-wide identification of dominant polyadenylation hexamers for use in variant classification.

Shiferaw, Henoke K; Hong, Celine S; Cooper, David N; Johnston, Jennifer J; Biesecker, Leslie G.

Hum Mol Genet ; 32(23): 3211-3224, 2023 Nov 17.

Artículo en Inglés | MEDLINE | ID: mdl-37606238

RESUMEN

Polyadenylation is an essential process for the stabilization and export of mRNAs to the cytoplasm and the polyadenylation signal hexamer (herein referred to as hexamer) plays a key role in this process. Yet, only 14 Mendelian disorders have been associated with hexamer variants. This is likely an under-ascertainment as hexamers are not well defined and not routinely examined in molecular analysis. To facilitate the interrogation of putatively pathogenic hexamer variants, we set out to define functionally important hexamers genome-wide as a resource for research and clinical testing interrogation. We identified predominant polyA sites (herein referred to as pPAS) and putative predominant hexamers across protein coding genes (PAS usage >50% per gene). As a measure of the validity of these sites, the population constraint of 4532 predominant hexamers were measured. The predominant hexamers had fewer observed variants compared to non-predominant hexamers and trimer controls, and CADD scores for variants in these hexamers were significantly higher than controls. Exome data for 1477 individuals were interrogated for hexamer variants and transcriptome data were generated for 76 individuals with 65 variants in predominant hexamers. 3' RNA-seq data showed these variants resulted in alternate polyadenylation events (38%) and in elongated mRNA transcripts (12%). Our list of pPAS and predominant hexamers are available in the UCSC genome browser and on GitHub. We suggest this list of predominant hexamers can be used to interrogate exome and genome data. Variants in these predominant hexamers should be considered candidates for pathogenic variation in human disease, and to that end we suggest pathogenicity criteria for classifying hexamer variants.

Asunto(s)

Genoma , Poliadenilación , Humanos , Poliadenilación/genética

5.

Analysis of missense variants in the human genome reveals widespread gene-specific clustering and improves prediction of pathogenicity.

Quinodoz, Mathieu; Peter, Virginie G; Cisarova, Katarina; Royer-Bertrand, Beryl; Stenson, Peter D; Cooper, David N; Unger, Sheila; Superti-Furga, Andrea; Rivolta, Carlo.

Am J Hum Genet ; 109(3): 457-470, 2022 03 03.

Artículo en Inglés | MEDLINE | ID: mdl-35120630

RESUMEN

We used a machine learning approach to analyze the within-gene distribution of missense variants observed in hereditary conditions and cancer. When applied to 840 genes from the ClinVar database, this approach detected a significant non-random distribution of pathogenic and benign variants in 387 (46%) and 172 (20%) genes, respectively, revealing that variant clustering is widespread across the human exome. This clustering likely occurs as a consequence of mechanisms shaping pathogenicity at the protein level, as illustrated by the overlap of some clusters with known functional domains. We then took advantage of these findings to develop a pathogenicity predictor, MutScore, that integrates qualitative features of DNA substitutions with the new additional information derived from this positional clustering. Using a random forest approach, MutScore was able to identify pathogenic missense mutations with very high accuracy, outperforming existing predictive tools, especially for variants associated with autosomal-dominant disease and cancer. Thus, the within-gene clustering of pathogenic and benign DNA changes is an important and previously underappreciated feature of the human exome, which can be harnessed to improve the prediction of pathogenicity and disambiguation of DNA variants of uncertain significance.

Asunto(s)

Genoma Humano , Mutación Missense , Análisis por Conglomerados , Exoma/genética , Genoma Humano/genética , Humanos , Mutación Missense/genética , Virulencia

6.

Meta-analysis of 46,000 germline de novo mutations linked to human inherited disease.

Lopes-Marques, Mónica; Mort, Matthew; Carneiro, João; Azevedo, António; Amaro, Andreia P; Cooper, David N; Azevedo, Luísa.

Hum Genomics ; 18(1): 20, 2024 Feb 23.

Artículo en Inglés | MEDLINE | ID: mdl-38395944

RESUMEN

BACKGROUND: De novo mutations (DNMs) are variants that occur anew in the offspring of noncarrier parents. They are not inherited from either parent but rather result from endogenous mutational processes involving errors of DNA repair/replication. These spontaneous errors play a significant role in the causation of genetic disorders, and their importance in the context of molecular diagnostic medicine has become steadily more apparent as more DNMs have been reported in the literature. In this study, we examined 46,489 disease-associated DNMs annotated by the Human Gene Mutation Database (HGMD) to ascertain their distribution across gene and disease categories. RESULTS: Most disease-associated DNMs reported to date are found to be associated with developmental and psychiatric disorders, a reflection of the focus of sequencing efforts over the last decade. Of the 13,277 human genes in which DNMs have so far been found, the top-10 genes with the highest proportions of DNM relative to gene size were H3-3 A, DDX3X, CSNK2B, PURA, ZC4H2, STXBP1, SCN1A, SATB2, H3-3B and TUBA1A. The distribution of CADD and REVEL scores for both disease-associated DNMs and those mutations not reported to be de novo revealed a trend towards higher deleteriousness for DNMs, consistent with the likely lower selection pressure impacting them. This contrasts with the non-DNMs, which are presumed to have been subject to continuous negative selection over multiple generations. CONCLUSION: This meta-analysis provides important information on the occurrence and distribution of disease-associated DNMs in association with heritable disease and should make a significant contribution to our understanding of this major type of mutation.

Asunto(s)

Células Germinativas , Padres , Humanos , Mutación

7.

Gene expression across mammalian organ development.

Cardoso-Moreira, Margarida; Halbert, Jean; Valloton, Delphine; Velten, Britta; Chen, Chunyan; Shao, Yi; Liechti, Angélica; Ascenção, Kelly; Rummel, Coralie; Ovchinnikova, Svetlana; Mazin, Pavel V; Xenarios, Ioannis; Harshman, Keith; Mort, Matthew; Cooper, David N; Sandi, Carmen; Soares, Michael J; Ferreira, Paula G; Afonso, Sandra; Carneiro, Miguel; Turner, James M A; VandeBerg, John L; Fallahshahroudi, Amir; Jensen, Per; Behr, Rüdiger; Lisgo, Steven; Lindsay, Susan; Khaitovich, Philipp; Huber, Wolfgang; Baker, Julie; Anders, Simon; Zhang, Yong E; Kaessmann, Henrik.

Nature ; 571(7766): 505-509, 2019 07.

Artículo en Inglés | MEDLINE | ID: mdl-31243369

RESUMEN

The evolution of gene expression in mammalian organ development remains largely uncharacterized. Here we report the transcriptomes of seven organs (cerebrum, cerebellum, heart, kidney, liver, ovary and testis) across developmental time points from early organogenesis to adulthood for human, rhesus macaque, mouse, rat, rabbit, opossum and chicken. Comparisons of gene expression patterns identified correspondences of developmental stages across species, and differences in the timing of key events during the development of the gonads. We found that the breadth of gene expression and the extent of purifying selection gradually decrease during development, whereas the amount of positive selection and expression of new genes increase. We identified differences in the temporal trajectories of expression of individual genes across species, with brain tissues showing the smallest percentage of trajectory changes, and the liver and testis showing the largest. Our work provides a resource of developmental transcriptomes of seven organs across seven species, and comparative analyses that characterize the development and evolution of mammalian organs.

Asunto(s)

Regulación del Desarrollo de la Expresión Génica , Organogénesis/genética , Transcriptoma/genética , Animales , Evolución Biológica , Pollos/genética , Femenino , Humanos , Macaca mulatta/genética , Masculino , Ratones , Zarigüeyas/genética , Conejos , Ratas

8.

Functional genomics analysis reveals the evolutionary adaptation and demographic history of pygmy lorises.

Li, Ming-Li; Wang, Sheng; Xu, Penghui; Tian, Hang-Yu; Bai, Mixue; Zhang, Ya-Ping; Shao, Yong; Xiong, Zi-Jun; Qi, Xiao-Guang; Cooper, David N; Zhang, Guojie; Zhu, He Helen; Wu, Dong-Dong.

Proc Natl Acad Sci U S A ; 119(40): e2123030119, 2022 10 04.

Artículo en Inglés | MEDLINE | ID: mdl-36161902

RESUMEN

Lorises are a group of globally threatened strepsirrhine primates that exhibit many unusual physiological and behavioral features, including a low metabolic rate, slow movement, and hibernation. Here, we assembled a chromosome-level genome sequence of the pygmy loris (Xanthonycticebus pygmaeus) and resequenced whole genomes from 50 pygmy lorises and 6 Bengal slow lorises (Nycticebus bengalensis). We found that many gene families involved in detoxification have been specifically expanded in the pygmy loris, including the GSTA gene family, with many newly derived copies functioning specifically in the liver. We detected many genes displaying evolutionary convergence between pygmy loris and koala, including PITRM1. Significant decreases in PITRM1 enzymatic activity in these two species may have contributed to their characteristic low rate of metabolism. We also detected many evolutionarily convergent genes and positively selected genes in the pygmy loris that are involved in muscle development. Functional assays demonstrated the decreased ability of one positively selected gene, MYOF, to up-regulate the fast-type muscle fiber, consistent with the lower proportion of fast-twitch muscle fibers in the pygmy loris. The protein product of another positively selected gene in the pygmy loris, PER2, exhibited weaker binding to the key circadian core protein CRY, a finding that may be related to this species' unusual circadian rhythm. Finally, population genomics analysis revealed that these two extant loris species, which coexist in the same habitat, have exhibited an inverse relationship in terms of their demography over the past 1 million years, implying strong interspecies competition after speciation.

Asunto(s)

Adaptación Biológica , Evolución Biológica , Lorisidae , Adaptación Biológica/genética , Animales , Demografía , Hibernación , Lorisidae/genética , Metagenómica , Metaloendopeptidasas/genética

9.

Genome-wide detection of human variants that disrupt intronic branchpoints.

Zhang, Peng; Philippot, Quentin; Ren, Weicheng; Lei, Wei-Te; Li, Juan; Stenson, Peter D; Palacín, Pere Soler; Colobran, Roger; Boisson, Bertrand; Zhang, Shen-Ying; Puel, Anne; Pan-Hammarström, Qiang; Zhang, Qian; Cooper, David N; Abel, Laurent; Casanova, Jean-Laurent.

Proc Natl Acad Sci U S A ; 119(44): e2211194119, 2022 11.

Artículo en Inglés | MEDLINE | ID: mdl-36306325

RESUMEN

Pre-messenger RNA splicing is initiated with the recognition of a single-nucleotide intronic branchpoint (BP) within a BP motif by spliceosome elements. Forty-eight rare variants in 43 human genes have been reported to alter splicing and cause disease by disrupting BP. However, until now, no computational approach was available to efficiently detect such variants in massively parallel sequencing data. We established a comprehensive human genome-wide BP database by integrating existing BP data and generating new BP data from RNA sequencing of lariat debranching enzyme DBR1-mutated patients and from machine-learning predictions. We characterized multiple features of BP in major and minor introns and found that BP and BP-2 (two nucleotides upstream of BP) positions exhibit a lower rate of variation in human populations and higher evolutionary conservation than the intronic background, while being comparable to the exonic background. We developed BPHunter as a genome-wide computational approach to systematically and efficiently detect intronic variants that may disrupt BP recognition. BPHunter retrospectively identified 40 of the 48 known pathogenic BP variants, in which we summarized a strategy for prioritizing BP variant candidates. The remaining eight variants all create AG-dinucleotides between the BP and acceptor site, which is the likely reason for missplicing. We demonstrated the practical utility of BPHunter prospectively by using it to identify a novel germline heterozygous BP variant of STAT2 in a patient with critical COVID-19 pneumonia and a novel somatic intronic 59-nucleotide deletion of ITPKB in a lymphoma patient, both of which were validated experimentally. BPHunter is publicly available from https://hgidsoft.rockefeller.edu/BPHunter and https://github.com/casanova-lab/BPHunter.

Asunto(s)

COVID-19 , Humanos , Intrones/genética , Estudios Retrospectivos , COVID-19/genética , Empalme del ARN/genética , Nucleótidos

10.

Integrative Omics Reveals Rapidly Evolving Regulatory Sequences Driving Primate Brain Evolution.

Zhuang, Xiao-Lin; Zhang, Jin-Jin; Shao, Yong; Ye, Yaxin; Chen, Chun-Yan; Zhou, Long; Wang, Zheng-Bo; Luo, Xin; Su, Bing; Yao, Yong-Gang; Cooper, David N; Hu, Ben-Xia; Wang, Lu; Qi, Xiao-Guang; Lin, Jiangwei; Zhang, Guo-Jie; Wang, Wen; Sheng, Nengyin; Wu, Dong-Dong.

Mol Biol Evol ; 40(8)2023 08 03.

Artículo en Inglés | MEDLINE | ID: mdl-37494289

RESUMEN

Although the continual expansion of the brain during primate evolution accounts for our enhanced cognitive capabilities, the drivers of brain evolution have scarcely been explored in these ancestral nodes. Here, we performed large-scale comparative genomic, transcriptomic, and epigenomic analyses to investigate the evolutionary alterations acquired by brain genes and provide comprehensive listings of innovatory genetic elements along the evolutionary path from ancestral primates to human. The regulatory sequences associated with brain-expressed genes experienced rapid change, particularly in the ancestor of the Simiiformes. Extensive comparisons of single-cell and bulk transcriptomic data between primate and nonprimate brains revealed that these regulatory sequences may drive the high expression of certain genes in primate brains. Employing in utero electroporation into mouse embryonic cortex, we show that the primate-specific brain-biased gene BMP7 was recruited, probably in the ancestor of the Simiiformes, to regulate neuronal proliferation in the primate ventricular zone. Our study provides a comprehensive listing of genes and regulatory changes along the brain evolution lineage of ancestral primates leading to human. These data should be invaluable for future functional studies that will deepen our understanding not only of the genetic basis of human brain evolution but also of inherited disease.

Asunto(s)

Encéfalo , Primates , Ratones , Humanos , Animales , Primates/genética , Encéfalo/metabolismo , Evolución Molecular

11.

Large-Scale Chromosomal Changes Lead to Genome-Level Expression Alterations, Environmental Adaptation, and Speciation in the Gayal (Bos frontalis).

Li, Yan; Wang, Sheng; Zhang, Zhe; Luo, Jing; Lin, Guo Liang; Deng, Wei-Dong; Guo, Zhifan; Han, Feng Ming; Wang, Li-Li; Li, Jie; Wu, Shi-Fang; Liu, He-Qun; He, Sheng; Murphy, Robert W; Zhang, Zi-Jie; Cooper, David N; Wu, Dong-Dong; Zhang, Ya-Ping.

Mol Biol Evol ; 40(1)2023 01 04.

Artículo en Inglés | MEDLINE | ID: mdl-36625089

RESUMEN

Determining the functional consequences of karyotypic changes is invariably challenging because evolution tends to obscure many of its own footprints, such as accumulated mutations, recombination events, and demographic perturbations. Here, we describe the assembly of a chromosome-level reference genome of the gayal (Bos frontalis) thereby revealing the structure, at base-pair-level resolution, of a telo/acrocentric-to-telo/acrocentric Robertsonian translocation (2;28) (T/A-to-T/A rob[2;28]). The absence of any reduction in the recombination rate or genetic introgression within the fusion region of gayal served to challenge the long-standing view of a role for fusion-induced meiotic dysfunction in speciation. The disproportionate increase noted in the distant interactions across pro-chr2 and pro-chr28, and the change in open-chromatin accessibility following rob(2;28), may, however, have led to the various gene expression irregularities observed in the gayal. Indeed, we found that many muscle-related genes, located synthetically on pro-chr2 and pro-chr28, exhibited significant changes in expression. This, combined with genome-scale structural variants and expression alterations in genes involved in myofibril composition, may have driven the rapid sarcomere adaptation of gayal to its rugged mountain habitat. Our findings not only suggest that large-scale chromosomal changes can lead to alterations in genome-level expression, thereby promoting both adaptation and speciation, but also illuminate novel avenues for studying the relationship between karyotype evolution and speciation.

Asunto(s)

Cromatina , Genoma , Animales , Bovinos

12.

Genetic evidence for T-wave area from 12-lead electrocardiograms to monitor cardiovascular diseases in patients taking diabetes medications.

Qi, Mengling; Zhang, Haoyang; Xiu, Xuehao; He, Dan; Cooper, David N; Yang, Yuanhao; Zhao, Huiying.

Hum Genet ; 2024 Mar 20.

Artículo en Inglés | MEDLINE | ID: mdl-38507016

RESUMEN

Aims Many studies indicated use of diabetes medications can influence the electrocardiogram (ECG), which remains the simplest and fastest tool for assessing cardiac functions. However, few studies have explored the role of genetic factors in determining the relationship between the use of diabetes medications and ECG trace characteristics (ETC). Methods Genome-wide association studies (GWAS) were performed for 168 ETCs extracted from the 12-lead ECGs of 42,340 Europeans in the UK Biobank. The genetic correlations, causal relationships, and phenotypic relationships of these ETCs with medication usage, as well as the risk of cardiovascular diseases (CVDs), were estimated by linkage disequilibrium score regression (LDSC), Mendelian randomization (MR), and regression model, respectively. Results The GWAS identified 124 independent single nucleotide polymorphisms (SNPs) that were study-wise and genome-wide significantly associated with at least one ETC. Regression model and LDSC identified significant phenotypic and genetic correlations of T-wave area in lead aVR (aVR_T-area) with usage of diabetes medications (ATC code: A10 drugs, and metformin), and the risks of ischemic heart disease (IHD) and coronary atherosclerosis (CA). MR analyses support a putative causal effect of the use of diabetes medications on decreasing aVR_T-area, and on increasing risk of IHD and CA. ConclusionPatients taking diabetes medications are prone to have decreased aVR_T-area and an increased risk of IHD and CA. The aVR_T-area is therefore a potential ECG marker for pre-clinical prediction of IHD and CA in patients taking diabetes medications.

13.

Identification of discriminative gene-level and protein-level features associated with pathogenic gain-of-function and loss-of-function variants.

Sevim Bayrak, Cigdem; Stein, David; Jain, Aayushee; Chaudhary, Kumardeep; Nadkarni, Girish N; Van Vleck, Tielman T; Puel, Anne; Boisson-Dupuis, Stephanie; Okada, Satoshi; Stenson, Peter D; Cooper, David N; Schlessinger, Avner; Itan, Yuval.

Am J Hum Genet ; 108(12): 2301-2318, 2021 12 02.

Artículo en Inglés | MEDLINE | ID: mdl-34762822

RESUMEN

Identifying whether a given genetic mutation results in a gene product with increased (gain-of-function; GOF) or diminished (loss-of-function; LOF) activity is an important step toward understanding disease mechanisms because they may result in markedly different clinical phenotypes. Here, we generated an extensive database of documented germline GOF and LOF pathogenic variants by employing natural language processing (NLP) on the available abstracts in the Human Gene Mutation Database. We then investigated various gene- and protein-level features of GOF and LOF variants and applied machine learning and statistical analyses to identify discriminative features. We found that GOF variants were enriched in essential genes, for autosomal-dominant inheritance, and in protein binding and interaction domains, whereas LOF variants were enriched in singleton genes, for protein-truncating variants, and in protein core regions. We developed a user-friendly web-based interface that enables the extraction of selected subsets from the GOF/LOF database by a broad set of annotated features and downloading of up-to-date versions. These results improve our understanding of how variants affect gene/protein function and may ultimately guide future treatment options.

Asunto(s)

Bases de Datos Genéticas , Mutación con Ganancia de Función , Mutación con Pérdida de Función , Proteínas/genética , Nube Computacional , Predisposición Genética a la Enfermedad , Genoma Humano , Mutación de Línea Germinal , Humanos , Intervención basada en la Internet , Aprendizaje Automático

14.

A platform for curated products from novel open reading frames prompts reinterpretation of disease variants.

Neville, Matthew D C; Kohze, Robin; Erady, Chaitanya; Meena, Narendra; Hayden, Matthew; Cooper, David N; Mort, Matthew; Prabakaran, Sudhakaran.

Genome Res ; 31(2): 327-336, 2021 Feb.

Artículo en Inglés | MEDLINE | ID: mdl-33468550

RESUMEN

Recent evidence from proteomics and deep massively parallel sequencing studies have revealed that eukaryotic genomes contain substantial numbers of as-yet-uncharacterized open reading frames (ORFs). We define these uncharacterized ORFs as novel ORFs (nORFs). nORFs in humans are mostly under 100 codons and are found in diverse regions of the genome, including in long noncoding RNAs, pseudogenes, 3' UTRs, 5' UTRs, and alternative reading frames of canonical protein coding exons. There is therefore a pressing need to evaluate the potential functional importance of these unannotated transcripts and proteins in biological pathways and human disease on a larger scale, rather than one at a time. In this study, we outline the creation of a valuable nORFs data set with experimental evidence of translation for the community, use measures of heritability and selection that reveal signals for functional importance, and show the potential implications for functional interpretation of genetic variants in nORFs. Our results indicate that some variants that were previously classified as being benign or of uncertain significance may have to be reinterpreted.

15.

The genetic structure of the Turkish population reveals high levels of variation and admixture.

Kars, M Ece; Basak, A Nazli; Onat, O Emre; Bilguvar, Kaya; Choi, Jungmin; Itan, Yuval; Çaglar, Caner; Palvadeau, Robin; Casanova, Jean-Laurent; Cooper, David N; Stenson, Peter D; Yavuz, Alper; Bulus, Hakan; Günel, Murat; Friedman, Jeffrey M; Özçelik, Tayfun.

Proc Natl Acad Sci U S A ; 118(36)2021 09 07.

Artículo en Inglés | MEDLINE | ID: mdl-34426522

RESUMEN

The construction of population-based variomes has contributed substantially to our understanding of the genetic basis of human inherited disease. Here, we investigated the genetic structure of Turkey from 3,362 unrelated subjects whose whole exomes (n = 2,589) or whole genomes (n = 773) were sequenced to generate a Turkish (TR) Variome that should serve to facilitate disease gene discovery in Turkey. Consistent with the history of present-day Turkey as a crossroads between Europe and Asia, we found extensive admixture between Balkan, Caucasus, Middle Eastern, and European populations with a closer genetic relationship of the TR population to Europeans than hitherto appreciated. We determined that 50% of TR individuals had high inbreeding coefficients (≥0.0156) with runs of homozygosity longer than 4 Mb being found exclusively in the TR population when compared to 1000 Genomes Project populations. We also found that 28% of exome and 49% of genome variants in the very rare range (allele frequency < 0.005) are unique to the modern TR population. We annotated these variants based on their functional consequences to establish a TR Variome containing alleles of potential medical relevance, a repository of homozygous loss-of-function variants and a TR reference panel for genotype imputation using high-quality haplotypes, to facilitate genome-wide association studies. In addition to providing information on the genetic structure of the modern TR population, these data provide an invaluable resource for future studies to identify variants that are associated with specific phenotypes as well as establishing the phenotypic consequences of mutations in specific genes.

Asunto(s)

Variación Genética/genética , Genoma Humano/genética , Alelos , Consanguinidad , Exoma , Frecuencia de los Genes/genética , Flujo Genético , Genética de Población/métodos , Estudio de Asociación del Genoma Completo/métodos , Genotipo , Haplotipos/genética , Migración Humana/tendencias , Humanos , Turquía/etnología , Secuenciación del Exoma/métodos

16.

Inferring the genetic relationship between brain imaging-derived phenotypes and risk of complex diseases by Mendelian randomization and genome-wide colocalization.

Lin, Siying; Zhang, Haoyang; Qi, Mengling; Cooper, David N; Yang, Yuedong; Yang, Yuanhao; Zhao, Huiying.

Neuroimage ; 279: 120325, 2023 10 01.

Artículo en Inglés | MEDLINE | ID: mdl-37579999

RESUMEN

Observational studies consistently disclose brain imaging-derived phenotypes (IDPs) as critical markers for early diagnosis of both brain disorders and cardiovascular diseases. However, it remains unclear about the shared genetic landscape between brain IDPs and the risk of brain disorders and cardiovascular diseases, restricting the applications of potential diagnostic techniques through brain IDPs. Here, we reported genetic correlations and putative causal relationships between 921 brain IDPs, 20 brain disorders and six cardiovascular diseases by leveraging their large-scale genome-wide association study (GWAS) summary statistics. Applications of Mendelian randomization (MR) identified significant putative causal effects of multiple region-specific brain IDPs in relation to the increased risks for amyotrophic lateral sclerosis (ALS), major depressive disorder (MDD), autism spectrum disorder (ASD) and schizophrenia (SCZ). We also found brain IDPs specifically from temporal lobe as a putatively causal consequence of hypertension. The genome-wide colocalization analysis identified three genomic regions in which MDD, ASD and SCZ colocalized with the brain IDPs, and two novel SNPs to be associated with ASD, SCZ, and multiple brain IDPs. Furthermore, we identified a list of candidate genes involved in the shared genetics underlying pairs of brain IDPs and MDD, ASD, SCZ, ALS and hypertension. Our results provide novel insights into the genetic relationships between brain disorders and cardiovascular diseases and brain IDP, which may server as clues for using brain IDPs to predict risks of diseases.

Asunto(s)

Esclerosis Amiotrófica Lateral , Trastorno del Espectro Autista , Encefalopatías , Enfermedades Cardiovasculares , Trastorno Depresivo Mayor , Hipertensión , Humanos , Trastorno Depresivo Mayor/diagnóstico por imagen , Trastorno Depresivo Mayor/genética , Enfermedades Cardiovasculares/diagnóstico por imagen , Enfermedades Cardiovasculares/genética , Estudio de Asociación del Genoma Completo/métodos , Trastorno del Espectro Autista/diagnóstico por imagen , Trastorno del Espectro Autista/genética , Análisis de la Aleatorización Mendeliana/métodos , Fenotipo , Encefalopatías/diagnóstico por imagen , Encefalopatías/genética , Neuroimagen

17.

Identifying shared genetic factors underlying epilepsy and congenital heart disease in Europeans.

Wu, Yiming; Bayrak, Cigdem Sevim; Dong, Bosi; He, Shixu; Stenson, Peter D; Cooper, David N; Itan, Yuval; Chen, Lei.

Hum Genet ; 142(2): 275-288, 2023 Feb.

Artículo en Inglés | MEDLINE | ID: mdl-36352240

RESUMEN

Epilepsy (EP) and congenital heart disease (CHD) are two apparently unrelated diseases that nevertheless display substantial mutual comorbidity. Thus, while congenital heart defects are associated with an elevated risk of developing epilepsy, the incidence of epilepsy in CHD patients correlates with CHD severity. Although genetic determinants have been postulated to underlie the comorbidity of EP and CHD, the precise genetic etiology is unknown. We performed variant and gene association analyses on EP and CHD patients separately, using whole exomes of genetically identified Europeans from the UK Biobank and Mount Sinai BioMe Biobank. We prioritized biologically plausible candidate genes and investigated the enriched pathways and other identified comorbidities by biological proximity calculation, pathway analyses, and gene-level phenome-wide association studies. Our variant- and gene-level results point to the Voltage-Gated Calcium Channels (VGCC) pathway as being a unifying framework for EP and CHD comorbidity. Additionally, pathway-level analyses indicated that the functions of disease-associated genes partially overlap between the two disease entities. Finally, phenome-wide association analyses of prioritized candidate genes revealed that cerebral blood flow and ulcerative colitis constitute the two main traits associated with both EP and CHD.

Asunto(s)

Epilepsia , Cardiopatías Congénitas , Humanos , Pueblo Europeo , Cardiopatías Congénitas/genética , Epilepsia/epidemiología , Epilepsia/genética , Estudios de Asociación Genética , Fenotipo

18.

Profiling human pathogenic repeat expansion regions by synergistic and multi-level impacts on molecular connections.

Fan, Cong; Chen, Ken; Wang, Yukai; Ball, Edward V; Stenson, Peter D; Mort, Matthew; Bacolla, Albino; Kehrer-Sawatzki, Hildegard; Tainer, John A; Cooper, David N; Zhao, Huiying.

Hum Genet ; 142(2): 245-274, 2023 Feb.

Artículo en Inglés | MEDLINE | ID: mdl-36344696

RESUMEN

Whilst DNA repeat expansions cause numerous heritable human disorders, their origins and underlying pathological mechanisms are often unclear. We collated a dataset comprising 224 human repeat expansions encompassing 203 different genes, and performed a systematic analysis with respect to key topological features at the DNA, RNA and protein levels. Comparison with controls without known pathogenicity and genomic regions lacking repeats, allowed the construction of the first tool to discriminate repeat regions harboring pathogenic repeat expansions (DPREx). At the DNA level, pathogenic repeat expansions exhibited stronger signals for DNA regulatory factors (e.g. H3K4me3, transcription factor-binding sites) in exons, promoters, 5'UTRs and 5'genes but were not significantly different from controls in introns, 3'UTRs and 3'genes. Additionally, pathogenic repeat expansions were also found to be enriched in non-B DNA structures. At the RNA level, pathogenic repeat expansions were characterized by lower free energy for forming RNA secondary structure and were closer to splice sites in introns, exons, promoters and 5'genes than controls. At the protein level, pathogenic repeat expansions exhibited a preference to form coil rather than other types of secondary structure, and tended to encode surface-located protein domains. Guided by these features, DPREx ( http://biomed.nscc-gz.cn/zhaolab/geneprediction/# ) achieved an Area Under the Curve (AUC) value of 0.88 in a test on an independent dataset. Pathogenic repeat expansions are thus located such that they exert a synergistic influence on the gene expression pathway involving inter-molecular connections at the DNA, RNA and protein levels.

Asunto(s)

Expansión de las Repeticiones de ADN , ADN , Humanos , Intrones/genética , ARN , Expansión de Repetición de Trinucleótido

19.

Expanding ACMG variant classification guidelines into a general framework.

Masson, Emmanuelle; Zou, Wen-Bin; Génin, Emmanuelle; Cooper, David N; Le Gac, Gerald; Fichou, Yann; Pu, Na; Rebours, Vinciane; Férec, Claude; Liao, Zhuan; Chen, Jian-Min.

Hum Genomics ; 16(1): 31, 2022 08 16.

Artículo en Inglés | MEDLINE | ID: mdl-35974416

RESUMEN

BACKGROUND: The American College of Medical Genetics and Genomics (ACMG)-recommended five variant classification categories (pathogenic, likely pathogenic, uncertain significance, likely benign, and benign) have been widely used in medical genetics. However, these guidelines are fundamentally constrained in practice owing to their focus upon Mendelian disease genes and their dichotomous classification of variants as being either causal or not. Herein, we attempt to expand the ACMG guidelines into a general variant classification framework that takes into account not only the continuum of clinical phenotypes, but also the continuum of the variants' genetic effects, and the different pathological roles of the implicated genes. MAIN BODY: As a disease model, we employed chronic pancreatitis (CP), which manifests clinically as a spectrum from monogenic to multifactorial. Bearing in mind that any general conceptual proposal should be based upon sound data, we focused our analysis on the four most extensively studied CP genes, PRSS1, CFTR, SPINK1 and CTRC. Based upon several cross-gene and cross-variant comparisons, we first assigned the different genes to two distinct categories in terms of disease causation: CP-causing (PRSS1 and SPINK1) and CP-predisposing (CFTR and CTRC). We then employed two new classificatory categories, "predisposing" and "likely predisposing", to replace ACMG's "pathogenic" and "likely pathogenic" categories in the context of CP-predisposing genes, thereby classifying all pathologically relevant variants in these genes as "predisposing". In the case of CP-causing genes, the two new classificatory categories served to extend the five ACMG categories whilst two thresholds (allele frequency and functional) were introduced to discriminate "pathogenic" from "predisposing" variants. CONCLUSION: Employing CP as a disease model, we expand ACMG guidelines into a five-category classification system (predisposing, likely predisposing, uncertain significance, likely benign, and benign) and a seven-category classification system (pathogenic, likely pathogenic, predisposing, likely predisposing, uncertain significance, likely benign, and benign) in the context of disease-predisposing and disease-causing genes, respectively. Taken together, the two systems constitute a general variant classification framework that, in principle, should span the entire spectrum of variants in any disease-related gene. The maximal compliance of our five-category and seven-category classification systems with the ACMG guidelines ought to facilitate their practical application.

Asunto(s)

Pancreatitis Crónica , Inhibidor de Tripsina Pancreática de Kazal , Regulador de Conductancia de Transmembrana de Fibrosis Quística/genética , Frecuencia de los Genes , Pruebas Genéticas , Variación Genética , Genómica , Humanos , Pancreatitis Crónica/genética , Análisis de Secuencia de ADN , Inhibidor de Tripsina Pancreática de Kazal/genética , Estados Unidos

20.

Classification of PRSS1 variants responsible for chronic pancreatitis: An expert perspective from the Franco-Chinese GREPAN study group.

Masson, Emmanuelle; Zou, Wen-Bin; Pu, Na; Rebours, Vinciane; Génin, Emmanuelle; Wu, Hao; Lin, Jin-Huan; Wang, Yuan-Chen; Li, Zhao-Shen; Cooper, David N; Férec, Claude; Liao, Zhuan; Chen, Jian-Min.

Pancreatology ; 23(5): 491-506, 2023 08.

Artículo en Inglés | MEDLINE | ID: mdl-37581535

RESUMEN

BACKGROUND: PRSS1 was the first reported chronic pancreatitis (CP) gene. The existence of both gain-of-function (GoF) and gain-of-proteotoxicity (GoP) pathological PRSS1 variants, together with the fact that PRSS1 variants have been identified in CP subtypes spanning the range from monogenic to multifactorial, has made the classification of PRSS1 variants very challenging. METHODS: All currently reported PRSS1 variants (derived primarily from two databases) were manually reviewed with respect to their clinical genetics, functional analysis and population allele frequency. They were classified by variant type and pathological mechanism within the framework of our recently proposed ACMG/AMP guidelines-based seven-category system. RESULTS: The total number of distinct germline PRSS1 variants included for analysis was 100, comprising 3 copy number variants (CNVs), 12 5' and 3' variants, 19 intronic variants, 5 nonsense variants, 1 frameshift deletion variant, 6 synonymous variants, 1 in-frame duplication, 3 gene conversions and 50 missense variants. Based upon a combination of clinical genetic and functional analysis, population data and in silico analysis, we classified 26 variants (all 3 CNVs, the in-frame duplication, all 3 gene conversions and 19 missense) as "pathogenic", 3 variants (missense) as "likely pathogenic", 5 variants (four missense and one promoter) as "predisposing", 13 variants (all missense) as "unknown significance", 2 variants (missense) as "likely benign", and all remaining 51 variants as "benign". CONCLUSIONS: We describe an expert classification of the 100 PRSS1 variants reported to date. The results have immediate implications for reclassifying many ClinVar-registered PRSS1 variants as well as providing optimal guidelines/standards for reporting PRSS1 variants.

Asunto(s)

Pueblos del Este de Asia , Pancreatitis Crónica , Humanos , Alelos , Frecuencia de los Genes , Predisposición Genética a la Enfermedad , Mutación/genética , Pancreatitis Crónica/genética , Pancreatitis Crónica/patología , Tripsina/genética , Tripsinógeno/genética , China , Francia

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

RESUMEN

Asunto(s)

RESUMEN

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

ENVIAR RESULTADO:

SELECCIÓN DE REFERENCIAS

DETALLE DE LA BÚSQUEDA