Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 75
Filtrar
1.
J Med Genet ; 2024 Sep 03.
Artículo en Inglés | MEDLINE | ID: mdl-39227160

RESUMEN

BACKGROUND: The 2015 American College of Medical Genetics/Association of Molecular Pathology (ACMG/AMP) variant classification framework specifies that case-control observations can be scored as 'strong' evidence (PS4) towards pathogenicity. METHODS: We developed the PS4-likelihood ratio calculator (PS4-LRCalc) for quantitative evidence assignment based on the observed variant frequencies in cases and controls. Binomial likelihoods are computed for two models, each defined by prespecified OR thresholds. Model 1 represents the hypothesis of association between variant and phenotype (eg, OR≥5) and model 2 represents the hypothesis of non-association (eg, OR≤1). RESULTS: PS4-LRCalc enables continuous quantitation of evidence for variant classification expressed as a likelihood ratio (LR), which can be log-converted into log LR (evidence points). Using PS4-LRCalc, observed data can be used to quantify evidence towards either pathogenicity or benignity. Variants can also be evaluated against models of different penetrance. The approach is applicable to balanced data sets generated for more common phenotypes and smaller data sets more typical in very rare disease variant evaluation. CONCLUSION: PS4-LRCalc enables flexible evidence quantitation on a continuous scale for observed case-control data. The converted LR is amenable to incorporation into the now widely used 2018 updated Bayesian ACMG/AMP framework.

2.
Genet Med ; : 101249, 2024 Sep 03.
Artículo en Inglés | MEDLINE | ID: mdl-39243181

RESUMEN

PURPOSE: Identifying pathogenic non-coding variants is challenging. A single protein-altering variant is often identified in a recessive gene in individuals with developmental disorders (DD), but the prevalence of pathogenic non-coding 'second hits' in trans with these is unknown. METHODS: In 4,073 genetically undiagnosed rare disease trio probands from the 100,000 Genomes project, we identified rare heterozygous protein-altering variants in recessive DD-associated genes. We identified rare non-coding variants on the other haplotype in introns, untranslated regions (UTRs), promoters, and candidate enhancer regions. We clinically evaluated the top candidates for phenotypic fit, and performed functional testing where possible. RESULTS: We identified 3,761 rare heterozygous loss-of-function or ClinVar pathogenic variants in recessive DD-associated genes in 2,430 probands. For 1,366 (36.3%) of these, we identified at least one rare non-coding variant in trans. Bioinformatic filtering and clinical review, revealed seven to be a good clinical fit. After detailed characterisation, we identified likely diagnoses for three probands (in GAA, NPHP3, and PKHD1) and candidate diagnoses in a further three (PAH, LAMA2, IGHMBP2). CONCLUSION: We developed a systematic approach to uncover new diagnoses involving compound heterozygous coding/non-coding variants and conclude that this mechanism is likely to be a rare cause of DDs.

4.
Genome Med ; 16(1): 88, 2024 Jul 11.
Artículo en Inglés | MEDLINE | ID: mdl-38992748

RESUMEN

BACKGROUND: One of the major hurdles in clinical genetics is interpreting the clinical consequences associated with germline missense variants in humans. Recent significant advances have leveraged natural variation observed in large-scale human populations to uncover genes or genomic regions that show a depletion of natural variation, indicative of selection pressure. We refer to this as "genetic constraint". Although existing genetic constraint metrics have been demonstrated to be successful in prioritising genes or genomic regions associated with diseases, their spatial resolution is limited in distinguishing pathogenic variants from benign variants within genes. METHODS: We aim to identify missense variants that are significantly depleted in the general human population. Given the size of currently available human populations with exome or genome sequencing data, it is not possible to directly detect depletion of individual missense variants, since the average expected number of observations of a variant at most positions is less than one. We instead focus on protein domains, grouping homologous variants with similar functional impacts to examine the depletion of natural variations within these comparable sets. To accomplish this, we develop the Homologous Missense Constraint (HMC) score. We utilise the Genome Aggregation Database (gnomAD) 125 K exome sequencing data and evaluate genetic constraint at quasi amino-acid resolution by combining signals across protein homologues. RESULTS: We identify one million possible missense variants under strong negative selection within protein domains. Though our approach annotates only protein domains, it nonetheless allows us to assess 22% of the exome confidently. It precisely distinguishes pathogenic variants from benign variants for both early-onset and adult-onset disorders. It outperforms existing constraint metrics and pathogenicity meta-predictors in prioritising de novo mutations from probands with developmental disorders (DD). It is also methodologically independent of these, adding power to predict variant pathogenicity when used in combination. We demonstrate utility for gene discovery by identifying seven genes newly significantly associated with DD that could act through an altered-function mechanism. CONCLUSIONS: Grouping variants of comparable functional impacts is effective in evaluating their genetic constraint. HMC is a novel and accurate predictor of missense consequence for improved variant interpretation.


Asunto(s)
Mutación Missense , Humanos , Dominios Proteicos , Predisposición Genética a la Enfermedad
5.
J Biol Chem ; 300(8): 107560, 2024 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-39002681

RESUMEN

Lowering expression of prion protein (PrP) is a well-validated therapeutic strategy in prion disease, but additional modalities are urgently needed. In other diseases, small molecules have proven capable of modulating pre-mRNA splicing, sometimes by forcing inclusion of cryptic exons that reduce gene expression. Here, we characterize a cryptic exon located in human PRNP's sole intron and evaluate its potential to reduce PrP expression through incorporation into the 5' untranslated region. This exon is homologous to exon 2 in nonprimate species but contains a start codon that would yield an upstream open reading frame with a stop codon prior to a splice site if included in PRNP mRNA, potentially downregulating PrP expression through translational repression or nonsense-mediated decay. We establish a minigene transfection system and test a panel of splice site alterations, identifying mutants that reduce PrP expression by as much as 78%. Our findings nominate a new therapeutic target for lowering PrP.


Asunto(s)
Exones , Proteínas Priónicas , Sitios de Empalme de ARN , Humanos , Proteínas Priónicas/metabolismo , Proteínas Priónicas/genética , Empalme del ARN , Intrones , Regulación de la Expresión Génica , Animales , Priones/metabolismo , Priones/genética , Enfermedades por Prión/metabolismo , Enfermedades por Prión/genética , Regiones no Traducidas 5'
6.
Cell Genom ; 4(7): 100602, 2024 Jul 10.
Artículo en Inglés | MEDLINE | ID: mdl-38944039

RESUMEN

The phenotypic impact of compound heterozygous (CH) variation has not been investigated at the population scale. We phased rare variants (MAF ∼0.001%) in the UK Biobank (UKBB) exome-sequencing data to characterize recessive effects in 175,587 individuals across 311 common diseases. A total of 6.5% of individuals carry putatively damaging CH variants, 90% of which are only identifiable upon phasing rare variants (MAF < 0.38%). We identify six recessive gene-trait associations (p < 1.68 × 10-7) after accounting for relatedness, polygenicity, nearby common variants, and rare variant burden. Of these, just one is discovered when considering homozygosity alone. Using longitudinal health records, we additionally identify and replicate a novel association between bi-allelic variation in ATP2C2 and an earlier age at onset of chronic obstructive pulmonary disease (COPD) (p < 3.58 × 10-8). Genetic phase contributes to disease risk for gene-trait pairs: ATP2C2-COPD (p = 0.000238), FLG-asthma (p = 0.00205), and USH2A-visual impairment (p = 0.0084). We demonstrate the power of phasing large-scale genetic cohorts to discover phenome-wide consequences of compound heterozygosity.


Asunto(s)
Bancos de Muestras Biológicas , Exoma , Heterocigoto , Fenotipo , Humanos , Reino Unido/epidemiología , Exoma/genética , Predisposición Genética a la Enfermedad , Enfermedad Pulmonar Obstructiva Crónica/genética , Femenino , Masculino , Proteínas Filagrina , Estudio de Asociación del Genoma Completo , Asma/genética , Biobanco del Reino Unido
7.
Genome Res ; 34(4): 530-538, 2024 05 15.
Artículo en Inglés | MEDLINE | ID: mdl-38719470

RESUMEN

The application of ribosome profiling has revealed an unexpected abundance of translation in addition to that responsible for the synthesis of previously annotated protein-coding regions. Multiple short sequences have been found to be translated within single RNA molecules, within both annotated protein-coding and noncoding regions. The biological significance of this translation is a matter of intensive investigation. However, current schematic or annotation-based representations of mRNA translation generally do not account for the apparent multitude of translated regions within the same molecules. They also do not take into account the stochasticity of the process that allows alternative translations of the same RNA molecules by different ribosomes. There is a need for formal representations of mRNA complexity that would enable the analysis of quantitative information on translation and more accurate models for predicting the phenotypic effects of genetic variants affecting translation. To address this, we developed a conceptually novel abstraction that we term ribosome decision graphs (RDGs). RDGs represent translation as multiple ribosome paths through untranslated and translated mRNA segments. We termed the latter "translons." Nondeterministic events, such as initiation, reinitiation, selenocysteine insertion, or ribosomal frameshifting, are then represented as branching points. This representation allows for an adequate representation of eukaryotic translation complexity and focuses on locations critical for translation regulation. We show how RDGs can be used for depicting translated regions and for analyzing genetic variation and quantitative genome-wide data on translation for characterization of regulatory modulators of translation.


Asunto(s)
Biosíntesis de Proteínas , ARN Mensajero , Ribosomas , Ribosomas/metabolismo , Ribosomas/genética , ARN Mensajero/genética , ARN Mensajero/metabolismo , Humanos , Sistemas de Lectura Abierta , Eucariontes/genética
8.
Genome Biol ; 25(1): 111, 2024 04 29.
Artículo en Inglés | MEDLINE | ID: mdl-38685090

RESUMEN

BACKGROUND: Untranslated regions (UTRs) are important mediators of post-transcriptional regulation. The length of UTRs and the composition of regulatory elements within them are known to vary substantially across genes, but little is known about the reasons for this variation in humans. Here, we set out to determine whether this variation, specifically in 5'UTRs, correlates with gene dosage sensitivity. RESULTS: We investigate 5'UTR length, the number of alternative transcription start sites, the potential for alternative splicing, the number and type of upstream open reading frames (uORFs) and the propensity of 5'UTRs to form secondary structures. We explore how these elements vary by gene tolerance to loss-of-function (LoF; using the LOEUF metric), and in genes where changes in dosage are known to cause disease. We show that LOEUF correlates with 5'UTR length and complexity. Genes that are most intolerant to LoF have longer 5'UTRs, greater TSS diversity, and more upstream regulatory elements than their LoF tolerant counterparts. We show that these differences are evident in disease gene-sets, but not in recessive developmental disorder genes where LoF of a single allele is tolerated. CONCLUSIONS: Our results confirm the importance of post-transcriptional regulation through 5'UTRs in tight regulation of mRNA and protein levels, particularly for genes where changes in dosage are deleterious and lead to disease. Finally, to support gene-based investigation we release a web-based browser tool, VuTR, that supports exploration of the composition of individual 5'UTRs and the impact of genetic variation within them.


Asunto(s)
Regiones no Traducidas 5' , Sistemas de Lectura Abierta , Biosíntesis de Proteínas , Humanos , Dosificación de Gen , Regulación de la Expresión Génica , Sitio de Iniciación de la Transcripción , Empalme Alternativo , Conformación de Ácido Nucleico
9.
Genet Med ; 26(2): 101029, 2024 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-37982373

RESUMEN

PURPOSE: The terminology used for gene-disease curation and variant annotation to describe inheritance, allelic requirement, and both sequence and functional consequences of a variant is currently not standardized. There is considerable discrepancy in the literature and across clinical variant reporting in the derivation and application of terms. Here, we standardize the terminology for the characterization of disease-gene relationships to facilitate harmonized global curation and to support variant classification within the ACMG/AMP framework. METHODS: Terminology for inheritance, allelic requirement, and both structural and functional consequences of a variant used by Gene Curation Coalition members and partner organizations was collated and reviewed. Harmonized terminology with definitions and use examples was created, reviewed, and validated. RESULTS: We present a standardized terminology to describe gene-disease relationships, and to support variant annotation. We demonstrate application of the terminology for classification of variation in the ACMG SF 2.0 genes recommended for reporting of secondary findings. Consensus terms were agreed and formalized in both Sequence Ontology (SO) and Human Phenotype Ontology (HPO) ontologies. Gene Curation Coalition member groups intend to use or map to these terms in their respective resources. CONCLUSION: The terminology standardization presented here will improve harmonization, facilitate the pooling of curation datasets across international curation efforts and, in turn, improve consistency in variant classification and genetic test interpretation.


Asunto(s)
Pruebas Genéticas , Variación Genética , Humanos , Alelos , Bases de Datos Genéticas
10.
bioRxiv ; 2023 Nov 12.
Artículo en Inglés | MEDLINE | ID: mdl-37986835

RESUMEN

The application of ribosome profiling has revealed an unexpected abundance of translation in addition to that responsible for the synthesis of previously annotated protein-coding regions. Multiple short sequences have been found to be translated within single RNA molecules, both within annotated protein-coding and non-coding regions. The biological significance of this translation is a matter of intensive investigation. However, current schematic or annotation-based representations of mRNA translation generally do not account for the apparent multitude of translated regions within the same molecules. They also do not take into account the stochasticity of the process that allows alternative translations of the same RNA molecules by different ribosomes. There is a need for formal representations of mRNA complexity that would enable the analysis of quantitative information on translation and more accurate models for predicting the phenotypic effects of genetic variants affecting translation. To address this, we developed a conceptually novel abstraction that we term Ribosome Decision Graphs (RDGs). RDGs represent translation as multiple ribosome paths through untranslated and translated mRNA segments. We termed the later 'translons'. Non-deterministic events, such as initiation, re-initiation, selenocysteine insertion or ribosomal frameshifting are then represented as branching points. This representation allows for an adequate representation of eukaryotic translation complexity and focuses on locations critical for translation regulation. We show how RDGs can be used for depicting translated regions, analysis of genetic variation and quantitative genome-wide data on translation for characterisation of regulatory modulators of translation.

11.
medRxiv ; 2023 Sep 12.
Artículo en Inglés | MEDLINE | ID: mdl-37745552

RESUMEN

Background: Both promoters and untranslated regions (UTRs) have critical regulatory roles, yet variants in these regions are largely excluded from clinical genetic testing due to difficulty in interpreting pathogenicity. The extent to which these regions may harbour diagnoses for individuals with rare disease is currently unknown. Methods: We present a framework for the identification and annotation of potentially deleterious proximal promoter and UTR variants in known dominant disease genes. We use this framework to annotate de novo variants (DNVs) in 8,040 undiagnosed individuals in the Genomics England 100,000 genomes project, which were subject to strict region-based filtering, clinical review, and validation studies where possible. In addition, we performed region and variant annotation-based burden testing in 7,862 unrelated probands against matched unaffected controls. Results: We prioritised eleven DNVs and identified an additional variant overlapping one of the eleven. Ten of these twelve variants (82%) are in genes that are a strong match to the individual's phenotype and six had not previously been identified. Through burden testing, we did not observe a significant enrichment of potentially deleterious promoter and/or UTR variants in individuals with rare disease collectively across any of our region or variant annotations. Conclusions: Overall, we demonstrate the value of screening promoters and UTRs to uncover additional diagnoses for previously undiagnosed individuals with rare disease and provide a framework for doing so without dramatically increasing interpretation burden.

12.
medRxiv ; 2023 Jul 03.
Artículo en Inglés | MEDLINE | ID: mdl-37461573

RESUMEN

Exome-sequencing association studies have successfully linked rare protein-coding variation to risk of thousands of diseases. However, the relationship between rare deleterious compound heterozygous (CH) variation and their phenotypic impact has not been fully investigated. Here, we leverage advances in statistical phasing to accurately phase rare variants (MAF ~ 0.001%) in exome sequencing data from 175,587 UK Biobank (UKBB) participants, which we then systematically annotate to identify putatively deleterious CH coding variation. We show that 6.5% of individuals carry such damaging variants in the CH state, with 90% of variants occurring at MAF < 0.34%. Using a logistic mixed model framework, systematically accounting for relatedness, polygenic risk, nearby common variants, and rare variant burden, we investigate recessive effects in common complex diseases. We find six exome-wide significant (P<1.68×10-7) and 17 nominally significant (P<5.25×10-5) gene-trait associations. Among these, only four would have been identified without accounting for CH variation in the gene. We further incorporate age-at-diagnosis information from primary care electronic health records, to show that genetic phase influences lifetime risk of disease across 20 gene-trait combinations (FDR < 5%). Using a permutation approach, we find evidence for genetic phase contributing to disease susceptibility for a collection of gene-trait pairs, including FLG-asthma (P=0.00205) and USH2A-visual impairment (P=0.0084). Taken together, we demonstrate the utility of phasing large-scale genetic sequencing cohorts for robust identification of the phenome-wide consequences of compound heterozygosity.

13.
Eur Heart J ; 44(48): 5146-5158, 2023 Dec 21.
Artículo en Inglés | MEDLINE | ID: mdl-37431535

RESUMEN

AIMS: Hypertrophic cardiomyopathy (HCM) is characterized by phenotypic heterogeneity that is partly explained by the diversity of genetic variants contributing to disease. Accurate interpretation of these variants constitutes a major challenge for diagnosis and implementing precision medicine, especially in understudied populations. The aim is to define the genetic architecture of HCM in North African cohorts with high consanguinity using ancestry-matched cases and controls. METHODS AND RESULTS: Prospective Egyptian patients (n = 514) and controls (n = 400) underwent clinical phenotyping and genetic testing. Rare variants in 13 validated HCM genes were classified according to standard clinical guidelines and compared with a prospective HCM cohort of majority European ancestry (n = 684). A higher prevalence of homozygous variants was observed in Egyptian patients (4.1% vs. 0.1%, P = 2 × 10-7), with variants in the minor HCM genes MYL2, MYL3, and CSRP3 more likely to present in homozygosity than the major genes, suggesting these variants are less penetrant in heterozygosity. Biallelic variants in the recessive HCM gene TRIM63 were detected in 2.1% of patients (five-fold greater than European patients), highlighting the importance of recessive inheritance in consanguineous populations. Finally, rare variants in Egyptian HCM patients were less likely to be classified as (likely) pathogenic compared with Europeans (40.8% vs. 61.6%, P = 1.6 × 10-5) due to the underrepresentation of Middle Eastern populations in current reference resources. This proportion increased to 53.3% after incorporating methods that leverage new ancestry-matched controls presented here. CONCLUSION: Studying consanguineous populations reveals novel insights with relevance to genetic testing and our understanding of the genetic architecture of HCM.


Asunto(s)
Cardiomiopatía Hipertrófica , Etnicidad , Humanos , Consanguinidad , Estudios Prospectivos , Pruebas Genéticas , Cardiomiopatía Hipertrófica/diagnóstico , Mutación
14.
medRxiv ; 2023 Apr 03.
Artículo en Inglés | MEDLINE | ID: mdl-37066232

RESUMEN

PURPOSE: The terminology used for gene-disease curation and variant annotation to describe inheritance, allelic requirement, and both sequence and functional consequences of a variant is currently not standardized. There is considerable discrepancy in the literature and across clinical variant reporting in the derivation and application of terms. Here we standardize the terminology for the characterization of disease-gene relationships to facilitate harmonized global curation, and to support variant classification within the ACMG/AMP framework. METHODS: Terminology for inheritance, allelic requirement, and both structural and functional consequences of a variant used by Gene Curation Coalition (GenCC) members and partner organizations was collated and reviewed. Harmonized terminology with definitions and use examples was created, reviewed, and validated. RESULTS: We present a standardized terminology to describe gene-disease relationships, and to support variant annotation. We demonstrate application of the terminology for classification of variation in the ACMG SF 2.0 genes recommended for reporting of secondary findings. Consensus terms were agreed and formalized in both sequence ontology (SO) and human phenotype ontology (HPO) ontologies. GenCC member groups intend to use or map to these terms in their respective resources. CONCLUSION: The terminology standardization presented here will improve harmonization, facilitate the pooling of curation datasets across international curation efforts and, in turn, improve consistency in variant classification and genetic test interpretation.

15.
Cell Genom ; 3(4): 100296, 2023 Apr 12.
Artículo en Inglés | MEDLINE | ID: mdl-37082142

RESUMEN

Exons are regions of DNA that are transcribed to RNA and retained after introns are spliced out. However, the term "exon" is often misused as synonymous to "protein coding," including in some literature and textbook definitions. In contrast, only a fraction of exonic sequences are protein coding (<30% in humans). Both exons and introns are also present in untranslated regions (UTRs) and non-coding RNAs. Misuse of the term exon is problematic, for example, "whole-exome sequencing" technology targets <25% of the human exome, primarily regions that are protein coding. Here, we argue for the importance of the original definition of an exon for making functional distinctions in genetics and genomics. Further, we recommend the use of clearer language referring to coding exonic regions and non-coding exonic regions. We propose the use of coding exome sequencing, or CES, to more appropriately describe sequencing approaches that target primarily protein-coding regions rather than all transcribed regions.

16.
Am J Hum Genet ; 110(3): 409, 2023 Mar 02.
Artículo en Inglés | MEDLINE | ID: mdl-36868202

RESUMEN

This article is based on the address given by the author at the 2022 meeting of the American Society of Human Genetics (ASHG) in Los Angeles, California. The video of the original address can be found at the ASHG website.

17.
Circ Genom Precis Med ; 16(1): e003672, 2023 02.
Artículo en Inglés | MEDLINE | ID: mdl-36580316

RESUMEN

BACKGROUND: Truncating variants in desmoplakin (DSPtv) are an important cause of arrhythmogenic cardiomyopathy; however the genetic architecture and genotype-specific risk factors are incompletely understood. We evaluated phenotype, risk factors for ventricular arrhythmias, and underlying genetics of DSPtv cardiomyopathy. METHODS: Individuals with DSPtv and any cardiac phenotype, and their gene-positive family members were included from multiple international centers. Clinical data and family history information were collected. Event-free survival from ventricular arrhythmia was assessed. Variant location was compared between cases and controls, and literature review of reported DSPtv performed. RESULTS: There were 98 probands and 72 family members (mean age at diagnosis 43±8 years, 59% women) with a DSPtv, of which 146 were considered clinically affected. Ventricular arrhythmia (sudden cardiac arrest, sustained ventricular tachycardia, appropriate implantable cardioverter defibrillator therapy) occurred in 56 (33%) individuals. DSPtv location and proband status were independent risk factors for ventricular arrhythmia. Further, gene region was important with variants in cases (cohort n=98; Clinvar n=167) more likely to occur in the regions resulting in nonsense mediated decay of both major DSP isoforms, compared with n=124 genome aggregation database control variants (148 [83.6%] versus 29 [16.4%]; P<0.0001). CONCLUSIONS: In the largest series of individuals with DSPtv, we demonstrate that variant location is a novel risk factor for ventricular arrhythmia, can inform variant interpretation, and provide critical insights to allow for precision-based clinical management.


Asunto(s)
Displasia Ventricular Derecha Arritmogénica , Cardiomiopatías , Desmoplaquinas , Femenino , Humanos , Masculino , Arritmias Cardíacas/genética , Displasia Ventricular Derecha Arritmogénica/diagnóstico , Cardiomiopatías/genética , Desmoplaquinas/genética , Factores de Riesgo
18.
bioRxiv ; 2023 Dec 19.
Artículo en Inglés | MEDLINE | ID: mdl-38187635

RESUMEN

Lowering expression of prion protein (PrP) is a well-validated therapeutic strategy in prion disease, but additional modalities are urgently needed. In other diseases, small molecules have proven capable of modulating pre-mRNA splicing, sometimes by forcing inclusion of cryptic exons that reduce gene expression. Here, we characterize a cryptic exon located in human PRNP's sole intron and evaluate its potential to reduce PrP expression through incorporation into the 5' untranslated region (5'UTR). This exon is homologous to exon 2 in non-primate species, but contains a start codon that would yield an upstream open reading frame (uORF) with a stop codon prior to a splice site if included in PRNP mRNA, potentially downregulating PrP expression through translational repression or nonsense-mediated decay. We establish a minigene transfection system and test a panel of splice site alterations, identifying mutants that reduce PrP expression by as much as 78%. Our findings nominate a new therapeutic target for lowering PrP.

19.
Genome Med ; 14(1): 73, 2022 07 19.
Artículo en Inglés | MEDLINE | ID: mdl-35850704

RESUMEN

BACKGROUND: The majority of clinical genetic testing focuses almost exclusively on regions of the genome that directly encode proteins. The important role of variants in non-coding regions in penetrant disease is, however, increasingly being demonstrated, and the use of whole genome sequencing in clinical diagnostic settings is rising across a large range of genetic disorders. Despite this, there is no existing guidance on how current guidelines designed primarily for variants in protein-coding regions should be adapted for variants identified in other genomic contexts. METHODS: We convened a panel of nine clinical and research scientists with wide-ranging expertise in clinical variant interpretation, with specific experience in variants within non-coding regions. This panel discussed and refined an initial draft of the guidelines which were then extensively tested and reviewed by external groups. RESULTS: We discuss considerations specifically for variants in non-coding regions of the genome. We outline how to define candidate regulatory elements, highlight examples of mechanisms through which non-coding region variants can lead to penetrant monogenic disease, and outline how existing guidelines can be adapted for the interpretation of these variants. CONCLUSIONS: These recommendations aim to increase the number and range of non-coding region variants that can be clinically interpreted, which, together with a compatible phenotype, can lead to new diagnoses and catalyse the discovery of novel disease mechanisms.


Asunto(s)
Variación Genética , Estudio de Asociación del Genoma Completo , Genoma , Sistemas de Lectura Abierta , Secuencias Reguladoras de Ácidos Nucleicos
20.
Mol Cell ; 82(15): 2885-2899.e8, 2022 08 04.
Artículo en Inglés | MEDLINE | ID: mdl-35841888

RESUMEN

Translated small open reading frames (smORFs) can have important regulatory roles and encode microproteins, yet their genome-wide identification has been challenging. We determined the ribosome locations across six primary human cell types and five tissues and detected 7,767 smORFs with translational profiles matching those of known proteins. The human genome was found to contain highly cell-type- and tissue-specific smORFs and a subset that encodes highly conserved amino acid sequences. Changes in the translational efficiency of upstream-encoded smORFs (uORFs) and the corresponding main ORFs predominantly occur in the same direction. Integration with 456 mass-spectrometry datasets confirms the presence of 603 small peptides at the protein level in humans and provides insights into the subcellular localization of these small proteins. This study provides a comprehensive atlas of high-confidence translated smORFs derived from primary human cells and tissues in order to provide a more complete understanding of the translated human genome.


Asunto(s)
Regulación de la Expresión Génica , Ribosomas , Genoma Humano/genética , Humanos , Sistemas de Lectura Abierta/genética , Biosíntesis de Proteínas , Proteínas/metabolismo , ARN/metabolismo , Ribosomas/genética , Ribosomas/metabolismo
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA