Results 1 - 20 of 27
2.
Nat Genet ; 56(1): 152-161, 2024 Jan.
Article in English | MEDLINE | ID: mdl-38057443

ABSTRACT

Recessive diseases arise when both copies of a gene are impacted by a damaging genetic variant. When a patient carries two potentially causal variants in a gene, accurate diagnosis requires determining that these variants occur on different copies of the chromosome (that is, are in trans) rather than on the same copy (that is, in cis). However, current approaches for determining phase, beyond parental testing, are limited in clinical settings. Here we developed a strategy for inferring phase for rare variant pairs within genes, leveraging genotypes observed in the Genome Aggregation Database (v2, n = 125,748 exomes). Our approach estimates phase with 96% accuracy, both in trio data and in patients with Mendelian conditions and presumed causal compound heterozygous variants. We provide a public resource of phasing estimates for coding variants and counts per gene of rare variants in trans that can aid interpretation of rare co-occurring variants in the context of recessive disease.


Subjects
Exome, High-Throughput Nucleotide Sequencing, Humans, Exome/genetics, Exome Sequencing, Genotype
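The abstract above describes inferring phase for a variant pair from genotypes observed in a reference population. One standard way to do this is an expectation-maximization (EM) estimate of two-locus haplotype frequencies. The Python sketch below illustrates that generic technique. It is an assumption that it resembles the authors' pipeline, and the function name and the 3x3 count layout are hypothetical.

```python
def estimate_p_trans(counts, n_iter=1000, tol=1e-12):
    """Estimate P(trans) for a double heterozygote from a 3x3 genotype table.

    counts[i][j] = number of individuals carrying i alternate alleles at
    variant 1 and j alternate alleles at variant 2 (hypothetical layout).
    Haplotypes: AB (ref/ref), Ab (ref/alt), aB (alt/ref), ab (alt/alt).
    """
    n = counts
    # Haplotype counts from genotypes whose phase is unambiguous.
    base = [
        2 * n[0][0] + n[0][1] + n[1][0],  # AB
        2 * n[0][2] + n[0][1] + n[1][2],  # Ab
        2 * n[2][0] + n[1][0] + n[2][1],  # aB
        2 * n[2][2] + n[1][2] + n[2][1],  # ab
    ]
    n_dh = n[1][1]  # double heterozygotes: phase is ambiguous
    total = sum(base) + 2 * n_dh
    if total == 0:
        raise ValueError("genotype table is empty")

    # Start by splitting double heterozygotes evenly between cis and trans.
    freqs = [(b + 0.5 * n_dh) / total for b in base]
    for _ in range(n_iter):
        p_AB, p_Ab, p_aB, p_ab = freqs
        cis_w, trans_w = p_AB * p_ab, p_Ab * p_aB
        denom = cis_w + trans_w
        # E-step: expected fraction of double heterozygotes that are in trans.
        frac_trans = trans_w / denom if denom > 0 else 0.5
        # M-step: re-estimate haplotype frequencies from expected counts.
        new = [
            (base[0] + n_dh * (1 - frac_trans)) / total,
            (base[1] + n_dh * frac_trans) / total,
            (base[2] + n_dh * frac_trans) / total,
            (base[3] + n_dh * (1 - frac_trans)) / total,
        ]
        converged = max(abs(a - b) for a, b in zip(new, freqs)) < tol
        freqs = new
        if converged:
            break

    p_AB, p_Ab, p_aB, p_ab = freqs
    denom = p_AB * p_ab + p_Ab * p_aB
    return p_Ab * p_aB / denom if denom > 0 else float("nan")
```

With very few carriers the estimate can be dominated by the double heterozygotes themselves, so a sketch like this is best read as a population-level prior rather than a replacement for parental testing.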
3.
Nature ; 625(7993): 92-100, 2024 Jan.
Article in English | MEDLINE | ID: mdl-38057664

ABSTRACT

The depletion of disruptive variation caused by purifying natural selection (constraint) has been widely used to investigate protein-coding genes underlying human disorders [1-4], but attempts to assess constraint for non-protein-coding regions have proved more difficult. Here we aggregate, process and release a dataset of 76,156 human genomes from the Genome Aggregation Database (gnomAD), the largest public open-access human genome allele frequency reference dataset, and use it to build a genomic constraint map for the whole genome (genomic non-coding constraint of haploinsufficient variation (Gnocchi)). We present a refined mutational model that incorporates local sequence context and regional genomic features to detect depletions of variation. As expected, the average constraint for protein-coding sequences is stronger than that for non-coding regions. Within the non-coding genome, constrained regions are enriched for known regulatory elements and variants that are implicated in complex human diseases and traits, facilitating the triangulation of biological annotation, disease association and natural selection to non-coding DNA analysis. More constrained regulatory elements tend to regulate more constrained protein-coding genes, which in turn suggests that non-coding constraint can aid the identification of constrained genes that are as yet unrecognized by current gene constraint metrics. We demonstrate that this genome-wide constraint map improves the identification and interpretation of functional human genetic variation.


Subjects
Human Genome, Genomics, Genetic Models, Mutation, Humans, Access to Information, Genetic Databases, Datasets as Topic, Gene Frequency, Human Genome/genetics, Mutation/genetics, Genetic Selection
4.
bioRxiv ; 2023 Aug 21.
Article in English | MEDLINE | ID: mdl-36993580

ABSTRACT

Recessive diseases arise when both the maternal and the paternal copies of a gene are impacted by a damaging genetic variant in the affected individual. When a patient carries two different potentially causal variants in a gene for a given disorder, accurate diagnosis requires determining that these two variants occur on different copies of the chromosome (i.e., are in trans) rather than on the same copy (i.e., in cis). However, current approaches for determining phase, beyond parental testing, are limited in clinical settings. We developed a strategy for inferring phase for rare variant pairs within genes, leveraging genotypes observed in exome sequencing data from the Genome Aggregation Database (gnomAD v2, n = 125,748). When applied to trio data where phase can be determined by transmission, our approach estimates phase with 95.7% accuracy and remains accurate even for very rare variants (allele frequency < 1 × 10⁻⁴). We also correctly phase 95.9% of variant pairs in a set of 293 patients with Mendelian conditions carrying presumed causal compound heterozygous variants. We provide a public resource of phasing estimates from gnomAD, including phasing estimates for coding variants across the genome and counts per gene of rare variants in trans, that can aid interpretation of rare co-occurring variants in the context of recessive disease.

5.
Lancet Infect Dis ; 22(6): 835-844, 2022 06.
Article in English | MEDLINE | ID: mdl-35202600

ABSTRACT

BACKGROUND: Hand hygiene is at the core of effective infection prevention and control (IPC) programmes. 10 years after the development of the WHO Multimodal Hand Hygiene Improvement Strategy, we aimed to ascertain the level of hand hygiene implementation and its drivers in health-care facilities through a global WHO survey. METHODS: From Jan 16 to Dec 31, 2019, IPC professionals were invited through email and campaigns to complete the online Hand Hygiene Self-Assessment Framework (HHSAF). A geospatial clustering algorithm selected unique health-care facility responses and post-stratification weighting was applied to improve representativeness. Weighted median HHSAF scores and IQR were reported. Drivers of the HHSAF score were determined through a generalised estimation equation. FINDINGS: 3206 unique responses from 90 countries (46% of WHO Member States) were included. The HHSAF score indicated an intermediate hand hygiene implementation level (350 points, IQR 248-430), which was positively associated with country income level and health-care facility funding structure. System Change had the highest score (85 points, IQR 55-100), whereby alcohol-based hand rub at the point of care has become standard practice in many health-care facilities, especially in high-income countries. Institutional Safety Climate had the lowest score (55 points, IQR 35-75). From 2015 to 2019, the median HHSAF score in health-care facilities participating in both HHSAF surveys (n=190) stagnated. INTERPRETATION: Most health-care facilities had an intermediate level of hand hygiene implementation or higher, for which health-care facility funding and country income level were important drivers. Availability of resources, leadership, and organisational support are key elements to further improve quality of care and provide access to safe care for all. FUNDING: WHO, Geneva University Hospitals and Faculty of Medicine, and WHO Collaborating Center on Patient Safety, Geneva, Switzerland.


Subjects
Cross Infection, Hand Hygiene, Cross Infection/prevention & control, Guideline Adherence, Hand Disinfection, Hand Hygiene/methods, Health Facilities, Humans, Infection Control/methods, Self-Assessment (Psychology), World Health Organization
11.
Nat Commun ; 11(1): 2539, 2020 05 27.
Article in English | MEDLINE | ID: mdl-32461613

ABSTRACT

Multi-nucleotide variants (MNVs), defined as two or more nearby variants existing on the same haplotype in an individual, are a clinically and biologically important class of genetic variation. However, existing tools typically do not accurately classify MNVs, and understanding of their mutational origins remains limited. Here, we systematically survey MNVs in 125,748 whole exomes and 15,708 whole genomes from the Genome Aggregation Database (gnomAD). We identify 1,792,248 MNVs across the genome with constituent variants falling within 2 bp distance of one another, including 18,756 variants with a novel combined effect on protein sequence. Finally, we estimate the relative impact of known mutational mechanisms (CpG deamination, replication error by polymerase zeta, and polymerase slippage at repeat junctions) on the generation of MNVs. Our results demonstrate the value of haplotype-aware variant annotation, and refine our understanding of genome-wide mutational mechanisms of MNVs.


Subjects
Exome, Genetic Variation, Human Genome, CpG Islands, DNA Mutational Analysis, Genetic Databases, Humans, Mutation
12.
Nat Commun ; 11(1): 2523, 2020 05 27.
Article in English | MEDLINE | ID: mdl-32461616

ABSTRACT

Upstream open reading frames (uORFs) are tissue-specific cis-regulators of protein translation. Isolated reports have shown that variants that create or disrupt uORFs can cause disease. Here, in a systematic genome-wide study using 15,708 whole genome sequences, we show that variants that create new upstream start codons, and variants disrupting stop sites of existing uORFs, are under strong negative selection. This selection signal is significantly stronger for variants arising upstream of genes intolerant to loss-of-function variants. Furthermore, variants creating uORFs that overlap the coding sequence show signals of selection equivalent to coding missense variants. Finally, we identify specific genes where modification of uORFs likely represents an important disease mechanism, and report a novel uORF frameshift variant upstream of NF2 in neurofibromatosis. Our results highlight uORF-perturbing variants as an under-recognised functional class that contributes to penetrant human disease, and demonstrate the power of large-scale population sequencing data in studying non-coding variant classes.


Subjects
5' Untranslated Regions, Genetic Variation, Loss of Function Mutation, Proteins/genetics, Base Sequence, Human Genome, Humans, Open Reading Frames
13.
Nature ; 581(7809): 444-451, 2020 05.
Article in English | MEDLINE | ID: mdl-32461652

ABSTRACT

Structural variants (SVs) rearrange large segments of DNA [1] and can have profound consequences in evolution and human disease [2,3]. As national biobanks, disease-association studies, and clinical genetic testing have grown increasingly reliant on genome sequencing, population references such as the Genome Aggregation Database (gnomAD) [4] have become integral in the interpretation of single-nucleotide variants (SNVs) [5]. However, there are no reference maps of SVs from high-coverage genome sequencing comparable to those for SNVs. Here we present a reference of sequence-resolved SVs constructed from 14,891 genomes across diverse global populations (54% non-European) in gnomAD. We discovered a rich and complex landscape of 433,371 SVs, from which we estimate that SVs are responsible for 25-29% of all rare protein-truncating events per genome. We found strong correlations between natural selection against damaging SNVs and rare SVs that disrupt or duplicate protein-coding sequence, which suggests that genes that are highly intolerant to loss-of-function are also sensitive to increased dosage [6]. We also uncovered modest selection against noncoding SVs in cis-regulatory elements, although selection against protein-truncating SVs was stronger than all noncoding effects. Finally, we identified very large (over one megabase), rare SVs in 3.9% of samples, and estimate that 0.13% of individuals may carry an SV that meets the existing criteria for clinically important incidental findings [7]. This SV resource is freely distributed via the gnomAD browser [8] and will have broad utility in population genetics, disease-association studies, and diagnostic screening.


Subjects
Disease/genetics, Genetic Variation, Medical Genetics/standards, Population Genetics/standards, Human Genome/genetics, Female, Genetic Testing, Genotyping Techniques, Humans, Male, Middle Aged, Mutation, Single Nucleotide Polymorphism/genetics, Racial Groups/genetics, Reference Standards, Genetic Selection, Whole Genome Sequencing
14.
Nature ; 581(7809): 434-443, 2020 05.
Article in English | MEDLINE | ID: mdl-32461654

ABSTRACT

Genetic variants that inactivate protein-coding genes are a powerful source of information about the phenotypic consequences of gene disruption: genes that are crucial for the function of an organism will be depleted of such variants in natural populations, whereas non-essential genes will tolerate their accumulation. However, predicted loss-of-function variants are enriched for annotation errors, and tend to be found at extremely low frequencies, so their analysis requires careful variant annotation and very large sample sizes [1]. Here we describe the aggregation of 125,748 exomes and 15,708 genomes from human sequencing studies into the Genome Aggregation Database (gnomAD). We identify 443,769 high-confidence predicted loss-of-function variants in this cohort after filtering for artefacts caused by sequencing and annotation errors. Using an improved model of human mutation rates, we classify human protein-coding genes along a spectrum that represents tolerance to inactivation, validate this classification using data from model organisms and engineered human cells, and show that it can be used to improve the power of gene discovery for both common and rare diseases.


Subjects
Exome/genetics, Essential Genes/genetics, Genetic Variation/genetics, Human Genome/genetics, Adult, Brain/metabolism, Cardiovascular Diseases/genetics, Cohort Studies, Genetic Databases, Female, Genetic Predisposition to Disease/genetics, Genome-Wide Association Study, Humans, Loss of Function Mutation/genetics, Male, Mutation Rate, Proprotein Convertase 9/genetics, Messenger RNA/genetics, Reproducibility of Results, Exome Sequencing, Whole Genome Sequencing
15.
Eur J Hum Genet ; 27(9): 1456-1465, 2019 09.
Article in English | MEDLINE | ID: mdl-31053783

ABSTRACT

Hearing impairment (HI) is characterized by extensive genetic heterogeneity. To determine the population-specific contribution of known autosomal recessive nonsyndromic HI (ARNSHI) genes and variants to HI etiology, pathogenic and likely pathogenic (PLP) ARNSHI variants were selected from ClinVar and the Deafness Variation Database, and their frequencies were obtained from gnomAD for seven populations. ARNSHI prevalence due to PLP variants varies greatly by population, ranging from 96.9 affected per 100,000 individuals for Ashkenazi Jews to 5.2 affected per 100,000 individuals for Africans/African Americans. For Europeans, Finns have the lowest prevalence due to ARNSHI PLP variants with 9.5 affected per 100,000 individuals. For East Asians, Latinos, non-Finnish Europeans, and South Asians, ARNSHI prevalence due to PLP variants ranges from 17.1 to 33.7 affected per 100,000 individuals. ARNSHI variants that were previously reported in a single ancestry or family were observed in additional populations, e.g., USH1C p.(Q723*) reported in a Chinese family was the most prevalent pathogenic variant observed in gnomAD for Africans/African Americans. Variability between populations is due to how extensively ARNSHI has been studied, ARNSHI prevalence, and ancestry-specific ARNSHI variant architecture, which is impacted by population history. Our study demonstrates that additional gene and variant discovery studies are necessary for all populations and particularly for individuals of African ancestry.


Subjects
Deafness/diagnosis, Deafness/genetics, Recessive Genes, Genetic Predisposition to Disease, Genetic Variation, Hearing Loss/diagnosis, Hearing Loss/genetics, Alleles, Chromosome Mapping, Gene Frequency, Genetic Association Studies, Humans
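The abstract above reports prevalence per 100,000 individuals derived from population allele frequencies, but does not state the formula used. A common approach, assumed here, applies Hardy-Weinberg proportions: within each gene, sum the PLP allele frequencies to get q, take q² as the fraction of people carrying two PLP alleles (homozygotes plus compound heterozygotes), and add across genes. The Python sketch below uses hypothetical gene names and made-up frequencies.

```python
def recessive_prevalence_per_100k(plp_freqs_by_gene):
    """Approximate recessive disease prevalence per 100,000 individuals.

    plp_freqs_by_gene maps a gene name to the allele frequencies of its
    pathogenic/likely pathogenic variants in one population. Assumes
    Hardy-Weinberg proportions and that contributions from different
    genes are small enough to be summed (both are simplifying assumptions).
    """
    affected_fraction = 0.0
    for gene, freqs in plp_freqs_by_gene.items():
        q = sum(freqs)  # combined PLP allele frequency for this gene
        if not 0.0 <= q <= 1.0:
            raise ValueError(f"combined allele frequency out of range for {gene}")
        affected_fraction += q * q  # biallelic carriers under Hardy-Weinberg
    return affected_fraction * 100_000


# Illustrative input only: gene names and frequencies are invented.
example = {"GENE_A": [0.01, 0.01], "GENE_B": [0.01]}
print(recessive_prevalence_per_100k(example))
```

Summing alleles within a gene before squaring is what lets compound heterozygotes count; squaring each variant separately would only capture homozygotes.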
16.
Hum Mutat ; 38(11): 1534-1541, 2017 11.
Article in English | MEDLINE | ID: mdl-28714244

ABSTRACT

The genetic basis combined with the sporadic occurrence of amyotrophic lateral sclerosis (ALS) suggests a role of de novo mutations in disease pathogenesis. Previous studies provided some evidence for this hypothesis; however, results were conflicting: no genes with recurrently occurring de novo mutations were identified and different pathways were postulated. In this study, we analyzed whole-exome data from 82 new patient-parent trios and combined it with the datasets of all previously published ALS trios (173 trios in total). The per-patient de novo rate was not higher than expected based on the general population (P = 0.40). We showed that these mutations are not part of the previously postulated pathways, and gene-gene interaction analysis found no enrichment of interacting genes in this group (P = 0.57). Also, we were able to show that the de novo mutations in ALS patients are located in genes already prone to de novo mutations (P < 1 × 10⁻¹⁵). Although the individual effect of rare de novo mutations in specific genes could not be assessed, our results indicate that, in contrast to previous hypotheses, de novo mutations in general do not impose a major burden on ALS risk.


Subjects
Amyotrophic Lateral Sclerosis/genetics, Genetic Association Studies, Genetic Predisposition to Disease, Mutation, Alleles, Amino Acid Substitution, Amyotrophic Lateral Sclerosis/metabolism, C9orf72 Protein/genetics, Case-Control Studies, Genetic Databases, Female, Humans, Male, Mutation Rate, Protein Interaction Mapping, Protein Interaction Maps, Exome Sequencing, Whole Genome Sequencing
17.
Science ; 356(6337): 539-542, 2017 05 05.
Article in English | MEDLINE | ID: mdl-28473589

ABSTRACT

Negative selection against deleterious alleles produced by mutation influences within-population variation as the most pervasive form of natural selection. However, it is not known whether deleterious alleles affect fitness independently, so that cumulative fitness loss depends exponentially on the number of deleterious alleles, or synergistically, so that each additional deleterious allele results in a larger decrease in relative fitness. Negative selection with synergistic epistasis should produce negative linkage disequilibrium between deleterious alleles and, therefore, an underdispersed distribution of the number of deleterious alleles in the genome. Indeed, we detected underdispersion of the number of rare loss-of-function alleles in eight independent data sets from human and fly populations. Thus, selection against rare protein-disrupting alleles is characterized by synergistic epistasis, which may explain how human and fly populations persist despite high genomic mutation rates.


Subjects
Drosophila melanogaster/genetics, Genetic Epistasis, Human Genome, Insect Genome, Mutation Rate, Genetic Selection, Alleles, Animals, Genetic Fitness, Humans, Linkage Disequilibrium, Missense Mutation
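The abstract above links negative linkage disequilibrium to an underdispersed distribution of per-genome deleterious allele counts. This follows from the identity Var(sum) = sum of per-variant variances + 2 × sum of pairwise covariances: net negative covariance pushes the observed variance of the per-individual count below the sum of per-variant variances. The Python sketch below computes that ratio on a toy genotype matrix. The function name and input layout are hypothetical, and the authors' actual test statistic may differ.

```python
from statistics import pvariance


def dispersion_ratio(genotypes):
    """Ratio of observed to additive variance of per-individual allele counts.

    genotypes is a list of rows, one per individual, each holding the
    alternate-allele count (0, 1 or 2) at every variant. A ratio below 1
    indicates underdispersion (net negative LD), above 1 overdispersion.
    """
    if not genotypes or not genotypes[0]:
        raise ValueError("genotype matrix is empty")
    n_variants = len(genotypes[0])
    # Observed variance of the total allele count carried by each individual.
    burdens = [sum(row) for row in genotypes]
    observed = pvariance(burdens)
    # Variance expected if variants were independent: sum of per-variant variances.
    additive = sum(
        pvariance([row[j] for row in genotypes]) for j in range(n_variants)
    )
    if additive == 0:
        raise ValueError("no variant is polymorphic")
    return observed / additive
```

For example, a matrix in which every individual carries exactly one of two alleles gives a ratio of 0, while a matrix in which the two alleles always travel together gives a ratio of 2.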
18.
Eur J Hum Genet ; 25(2): 227-233, 2017 02.
Article in English | MEDLINE | ID: mdl-27876817

ABSTRACT

Germline mutation detection from human DNA sequence data is challenging due to the rarity of such events relative to the intrinsic error rates of sequencing technologies and the uneven coverage across the genome. We developed PhaseByTransmission (PBT) to identify de novo single nucleotide variants and short insertions and deletions (indels) from sequence data collected in parent-offspring trios. We compute the joint probability of the data given the genotype likelihoods in the individual family members, the known familial relationships and a prior probability for the mutation rate. Candidate de novo mutations (DNMs) are reported along with their posterior probability, providing a systematic way to prioritize them for validation. Our tool is integrated in the Genome Analysis Toolkit and can be used together with the ReadBackedPhasing module to infer the parental origin of DNMs based on phase-informative reads. Using simulated data, we show that PBT outperforms existing tools, especially in low coverage data and on the X chromosome. We further show that PBT displays high validation rates on empirical parent-offspring sequencing data for whole-exome data from 104 trios and X-chromosome data from 249 parent-offspring families. Finally, we demonstrate an association between father's age at conception and the number of DNMs in female offspring's X chromosome, consistent with previous literature reports.


Subjects
Genome-Wide Association Study/methods, Germ-Line Mutation, Pedigree, Single Nucleotide Polymorphism, DNA Sequence Analysis/methods, Software, Adult, Child, Human X Chromosomes/genetics, Exome, Female, Genotype, Humans, Male, Genetic Models
19.
Nat Commun ; 7: 12989, 2016 10 06.
Article in English | MEDLINE | ID: mdl-27708267

ABSTRACT

Structural variation (SV) represents a major source of differences between individual human genomes and has been linked to disease phenotypes. However, the majority of studies neither provide a global view of the full spectrum of these variants nor integrate them into reference panels of genetic variation. Here, we analyse whole genome sequencing data of 769 individuals from 250 Dutch families, and provide a haplotype-resolved map of 1.9 million genome variants across 9 different variant classes, including novel forms of complex indels, and retrotransposition-mediated insertions of mobile elements and processed RNAs. A large proportion are previously underreported variants sized between 21 and 100 bp. We detect 4 megabases of novel sequence, encoding 11 new transcripts. Finally, we show 191 known, trait-associated SNPs to be in strong linkage disequilibrium with SVs and demonstrate that our panel facilitates accurate imputation of SVs in unrelated individuals.


Subjects
Human Genome, Genomic Structural Variation, Genomics, Algorithms, Chromosomes/ultrastructure, Computational Biology, Gene Deletion, Genotype, Haplotypes, Humans, INDEL Mutation, Linkage Disequilibrium, Netherlands, Polymerase Chain Reaction, Single Nucleotide Polymorphism, RNA/metabolism, DNA Sequence Analysis, RNA Sequence Analysis, Software
20.
Am J Hum Genet ; 97(6): 775-89, 2015 Dec 03.
Article in English | MEDLINE | ID: mdl-26581902

ABSTRACT

The rate at which human genomes mutate is a central biological parameter that has many implications for our ability to understand demographic and evolutionary phenomena. We present a method for inferring mutation and gene-conversion rates by using the number of sequence differences observed in identical-by-descent (IBD) segments together with a reconstructed model of recent population-size history. This approach is robust to, and can quantify, the presence of substantial genotyping error, as validated in coalescent simulations. We applied the method to 498 trio-phased sequenced Dutch individuals and inferred a point mutation rate of 1.66 × 10⁻⁸ per base per generation and a rate of 1.26 × 10⁻⁹ for <20 bp indels. By quantifying how estimates varied as a function of allele frequency, we inferred the probability that a site is involved in non-crossover gene conversion as 5.99 × 10⁻⁶. We found that recombination does not have observable mutagenic effects after gene conversion is accounted for and that local gene-conversion rates reflect recombination rates. We detected a strong enrichment of recent deleterious variation among mismatching variants found within IBD regions and observed summary statistics of local sharing of IBD segments to closely match previously proposed metrics of background selection; however, we found no significant effects of selection on our mutation-rate estimates. We detected no evidence of strong variation of mutation rates in a number of genomic annotations obtained from several recent studies. Our analysis suggests that a mutation-rate estimate higher than that reported by recent pedigree-based studies should be adopted in the context of DNA-based demographic reconstruction.


Subjects
Human Genome, Germ-Line Mutation, Genetic Models, Mutation Rate, Alleles, Gene Frequency, Haplotypes, Humans, INDEL Mutation, Linear Models, Genetic Recombination