Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 7 de 7
Filtrar
1.
Proc Natl Acad Sci U S A ; 116(36): 17916-17924, 2019 09 03.
Artigo em Inglês | MEDLINE | ID: mdl-31427530

RESUMO

The sources of human germline mutations are poorly understood. Part of the difficulty is that mutations occur very rarely, and so direct pedigree-based approaches remain limited in the numbers that they can examine. To address this problem, we consider the spectrum of low-frequency variants in a dataset (Genome Aggregation Database, gnomAD) of 13,860 human X chromosomes and autosomes. X-autosome differences are reflective of germline sex differences and have been used extensively to learn about male versus female mutational processes; what is less appreciated is that they also reflect chromosome-level biochemical features that differ between the X and autosomes. We tease these components apart by comparing the mutation spectrum in multiple genomic compartments on the autosomes and between the X and autosomes. In so doing, we are able to ascribe specific mutation patterns to replication timing and recombination and to identify differences in the types of mutations that accrue in males and females. In particular, we identify C > G as a mutagenic signature of male meiotic double-strand breaks on the X, which may result from late repair. Our results show how biochemical processes of damage and repair in the germline interact with sex-specific life history traits to shape mutation patterns on both the X chromosome and autosomes.


Assuntos
Cromossomos Humanos X , Cromossomos Humanos , Período de Replicação do DNA , Recombinação Genética , Feminino , Variação Genética , Mutação em Linhagem Germinativa , Humanos , Masculino , Polimorfismo de Nucleotídeo Único , Fatores Sexuais
2.
Elife ; 122023 01 17.
Artigo em Inglês | MEDLINE | ID: mdl-36648429

RESUMO

Causal loss-of-function (LOF) variants for Mendelian and severe complex diseases are enriched in 'mutation intolerant' genes. We show how such observations can be interpreted in light of a model of mutation-selection balance and use the model to relate the pathogenic consequences of LOF mutations at present to their evolutionary fitness effects. To this end, we first infer posterior distributions for the fitness costs of LOF mutations in 17,318 autosomal and 679 X-linked genes from exome sequences in 56,855 individuals. Estimated fitness costs for the loss of a gene copy are typically above 1%; they tend to be largest for X-linked genes, whether or not they have a Y homolog, followed by autosomal genes and genes in the pseudoautosomal region. We compare inferred fitness effects for all possible de novo LOF mutations to those of de novo mutations identified in individuals diagnosed with one of six severe, complex diseases or developmental disorders. Probands carry an excess of mutations with estimated fitness effects above 10%; as we show by simulation, when sampled in the population, such highly deleterious mutations are typically only a couple of generations old. Moreover, the proportion of highly deleterious mutations carried by probands reflects the typical age of onset of the disease. The study design also has a discernible influence: a greater proportion of highly deleterious mutations is detected in pedigree than case-control studies, and for autism, in simplex than multiplex families and in female versus male probands. Thus, anchoring observations in human genetics to a population genetic model allows us to learn about the fitness effects of mutations identified by different mapping strategies and for different traits.


Assuntos
Transtorno Autístico , Mutação com Perda de Função , Humanos , Masculino , Feminino , Mutação , Transtorno Autístico/genética , Fenótipo , Estudos de Casos e Controles
3.
Elife ; 102021 11 22.
Artigo em Inglês | MEDLINE | ID: mdl-34806592

RESUMO

Whole exome sequences have now been collected for millions of humans, with the related goals of identifying pathogenic mutations in patients and establishing reference repositories of data from unaffected individuals. As a result, we are approaching an important limit, in which datasets are large enough that, in the absence of natural selection, every highly mutable site will have experienced at least one mutation in the genealogical history of the sample. Here, we focus on CpG sites that are methylated in the germline and experience mutations to T at an elevated rate of ~10-7 per site per generation; considering synonymous mutations in a sample of 390,000 individuals, ~ 99 % of such CpG sites harbor a C/T polymorphism. Methylated CpG sites provide a natural mutation saturation experiment for fitness effects: as we show, at nt sample sizes, not seeing a non-synonymous polymorphism is indicative of strong selection against that mutation. We rely on this idea in order to directly identify a subset of CpG transitions that are likely to be highly deleterious, including ~27 % of possible loss-of-function mutations, and up to 20 % of possible missense mutations, depending on the type of functional site in which they occur. Unlike methylated CpGs, most mutation types, with rates on the order of 10-8 or 10-9, remain very far from saturation. We discuss what these findings imply for interpreting the potential clinical relevance of mutations from their presence or absence in reference databases and for inferences about the fitness effects of new mutations.


Assuntos
Ilhas de CpG/genética , Aptidão Genética , Mutação Silenciosa , Humanos
4.
PLoS One ; 16(8): e0255680, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-34347855

RESUMO

New emerging infectious diseases are identified every year, a subset of which become global pandemics like COVID-19. In the case of COVID-19, many governments have responded to the ongoing pandemic by imposing social policies that restrict contacts outside of the home, resulting in a large fraction of the workforce either working from home or not working. To ensure essential services, however, a substantial number of workers are not subject to these limitations, and maintain many of their pre-intervention contacts. To explore how contacts among such "essential" workers, and between essential workers and the rest of the population, impact disease risk and the effectiveness of pandemic control, we evaluated several mathematical models of essential worker contacts within a standard epidemiology framework. The models were designed to correspond to key characteristics of cashiers, factory employees, and healthcare workers. We find in all three models that essential workers are at substantially elevated risk of infection compared to the rest of the population, as has been documented, and that increasing the numbers of essential workers necessitates the imposition of more stringent controls on contacts among the rest of the population to manage the pandemic. Importantly, however, different archetypes of essential workers differ in both their individual probability of infection and impact on the broader pandemic dynamics, highlighting the need to understand and target intervention for the specific risks faced by different groups of essential workers. These findings, especially in light of the massive human costs of the current COVID-19 pandemic, indicate that contingency plans for future epidemics should account for the impacts of essential workers on disease spread.


Assuntos
COVID-19/transmissão , Controle de Infecções , Distanciamento Físico , Recursos Humanos , COVID-19/epidemiologia , Epidemias/prevenção & controle , Pessoal de Saúde/estatística & dados numéricos , Humanos , Controle de Infecções/métodos , Controle de Infecções/normas , Controle de Infecções/estatística & dados numéricos , Modelos Estatísticos , Cidade de Nova Iorque/epidemiologia , Ocupações/estatística & dados numéricos , Pandemias , Quarentena/estatística & dados numéricos , Fatores de Risco , Populações Vulneráveis/estatística & dados numéricos , Recursos Humanos/organização & administração , Recursos Humanos/estatística & dados numéricos
5.
Elife ; 92020 01 30.
Artigo em Inglês | MEDLINE | ID: mdl-31999256

RESUMO

Fields as diverse as human genetics and sociology are increasingly using polygenic scores based on genome-wide association studies (GWAS) for phenotypic prediction. However, recent work has shown that polygenic scores have limited portability across groups of different genetic ancestries, restricting the contexts in which they can be used reliably and potentially creating serious inequities in future clinical applications. Using the UK Biobank data, we demonstrate that even within a single ancestry group (i.e., when there are negligible differences in linkage disequilibrium or in causal alleles frequencies), the prediction accuracy of polygenic scores can depend on characteristics such as the socio-economic status, age or sex of the individuals in which the GWAS and the prediction were conducted, as well as on the GWAS design. Our findings highlight both the complexities of interpreting polygenic scores and underappreciated obstacles to their broad use.


Complex diseases like cancer and heart disease are caused by the interplay of many factors: the variants of genes we inherit, the lifestyles we lead and the environments we inhabit, plus the interaction of all these factors. In fact, almost every trait, even how many years we will spend studying, is influenced both by our environment and our genes. To identify some of the genetic factors at play, scientists perform analyses known as genome-wide association studies, or GWAS for short. In these studies, the genomes from many different people are scanned to look for genetic differences associated with differences in traits. By summing up all the small genetic differences, so-called "polygenic scores" can be calculated. When there is a large genetic component to a trait, polygenic scores can be useful predictive tools. But there is a catch: polygenic scores make less accurate predictions for individuals of a different ancestry than those involved in the GWAS, which limits the use of these tools around the world. Mostafavi, Harpak et al. set out to understand if there are other factors in addition to ancestry that could influence the performance of polygenic scores. Using data from the UK Biobank, an international health resource that pairs genomic data and clinical information, Mostafavi, Harpak et al. examined polygenic scores among individuals that share a single, common ancestry. These polygenic scores were used to predict three traits (blood pressure, body mass index and educational attainment) in individuals and the predictions were then compared to the actual trait values to see how accurate they were. The analysis revealed that even within a group of people with similar ancestry, the accuracy of polygenic scores can vary, depending on characteristics such as the sex, age or socioeconomic status of the individuals. This analysis emphasises how variable GWAS and their predictive value can be even within seemingly similar population groups. It further highlights both the complexities of interpreting polygenic scores and underappreciated obstacles to their broad use in medical and social sciences.


Assuntos
Genética Populacional/métodos , Estudo de Associação Genômica Ampla/métodos , Herança Multifatorial/genética , Adulto , Fatores Etários , Idoso , Feminino , Frequência do Gene/genética , Humanos , Masculino , Pessoa de Meia-Idade , Polimorfismo de Nucleotídeo Único/genética , Fatores Sexuais , Fatores Socioeconômicos , Reino Unido
6.
Genetics ; 208(2): 745-762, 2018 02.
Artigo em Inglês | MEDLINE | ID: mdl-29242286

RESUMO

Functionally redundant genes present a puzzle as to their evolutionary preservation, and offer an interesting opportunity for molecular specialization. In Caenorhabditis elegans, either one of two presenilin genes (sel-12 or hop-1) facilitate Notch activation, providing the catalytic subunit for the γ secretase proteolytic enzyme complex. For all known Notch signaling events, sel-12 can mediate Notch activation, so the conservation of hop-1 remains a mystery. Here, we uncover a novel "late-onset" germline Notch phenotype in which HOP-1-deficient worms fail to maintain proliferating germline stem cells during adulthood. Either SEL-12 or HOP-1 presenilin can impart sufficient Notch signaling for the establishment and expansion of the germline, but maintenance of an adult stem cell pool relies exclusively on HOP-1-mediated Notch signaling. We also show that HOP-1 is necessary for maximum fecundity and reproductive span. The low-fecundity phenotype of hop-1 mutants can be phenocopied by switching off glp-1/Notch function during the last stage of larval development. We propose that at the end of larval development, dual presenilin usage switches exclusively to HOP-1, perhaps offering opportunities for differential regulation of the germline during adulthood. Additional defects in oocyte size and production rate in hop-1 and glp-1 mutants indicate that the process of oogenesis is compromised when germline Notch signaling is switched off. We calculate that in wild-type adults, as much as 86% of cells derived from the stem cell pool function to support oogenesis. This work suggests that an important role for Notch signaling in the adult germline is to furnish a large and continuous supply of nurse cells to support the efficiency of oogenesis.


Assuntos
Caenorhabditis elegans/genética , Células Germinativas/metabolismo , Proteínas de Membrana/deficiência , Receptores Notch/metabolismo , Transdução de Sinais , Alelos , Animais , Proteínas de Caenorhabditis elegans , Proliferação de Células , Mutação com Ganho de Função , Receptor do Peptídeo Semelhante ao Glucagon 1/metabolismo , Gônadas/metabolismo , Masculino , Mutação , Oócitos/metabolismo , Oogênese , Fenótipo , Reprodução/genética
7.
J Healthc Eng ; 2017: 5284145, 2017.
Artigo em Inglês | MEDLINE | ID: mdl-29177026

RESUMO

Motivation. Microbiome sequencing allows defining clusters of samples with shared composition. However, this paradigm poorly accounts for samples whose composition is a mixture of cluster-characterizing ones and which therefore lie in between them in the cluster space. This paper addresses unsupervised learning of 2-way clusters. It defines a mixture model that allows 2-way cluster assignment and describes a variant of generalized k-means for learning such a model. We demonstrate applicability to microbial 16S rDNA sequencing data from the Human Vaginal Microbiome Project.


Assuntos
Microbiota , Distribuição Normal , Algoritmos , Feminino , Humanos , Análise de Sequência de DNA , Vagina/microbiologia
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA