1.
Nat Commun ; 13(1): 4312, 2022 07 25.
Article in English | MEDLINE | ID: mdl-35879308

ABSTRACT

Large-scale genome sequencing has enabled the measurement of strong purifying selection in protein-coding genes. Here we describe a new method, called ExtRaINSIGHT, for measuring such selection in noncoding as well as coding regions of the human genome. ExtRaINSIGHT estimates the prevalence of "ultraselection" by the fractional depletion of rare single-nucleotide variants, after controlling for variation in mutation rates. Applying ExtRaINSIGHT to 71,702 whole genome sequences from gnomAD v3, we find abundant ultraselection in evolutionarily ancient miRNAs and neuronal protein-coding genes, as well as at splice sites. By contrast, we find much less ultraselection in other noncoding RNAs and transcription factor binding sites, and only modest levels in ultraconserved elements. We estimate that ~0.4-0.7% of the human genome is ultraselected, implying ~0.26-0.51 strongly deleterious mutations per generation. Overall, our study sheds new light on the genome-wide distribution of fitness effects by combining deep sequencing data and classical theory from population genetics.
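
The idea of "fractional depletion of rare variants" can be illustrated with a minimal sketch. This is not the ExtRaINSIGHT likelihood model, which the abstract does not spell out; it only assumes that a neutral mutation-rate model supplies, for each site, a probability of carrying a rare variant. The function name and the `neutral_rare_probs` input are hypothetical.

```python
from typing import Sequence


def fractional_depletion(observed_rare: int,
                         neutral_rare_probs: Sequence[float]) -> float:
    """Return 1 - observed/expected rare-variant count.

    neutral_rare_probs is an assumed input: the per-site probability of
    seeing a rare variant under a neutral, mutation-rate-aware model.
    """
    # Expected number of rare variants if the region were neutral.
    expected = sum(neutral_rare_probs)
    if expected <= 0:
        raise ValueError("expected rare-variant count must be positive")
    # A value near 0 means no depletion; a value near 1 means strong depletion.
    return 1.0 - observed_rare / expected


if __name__ == "__main__":
    # Hypothetical region: 40 sites, each with neutral probability 0.25.
    print(fractional_depletion(6, [0.25] * 40))
```

Under these assumptions, a region with fewer rare variants than the neutral expectation yields a positive depletion, which is the quantity the abstract interprets as the prevalence of ultraselection.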


Subjects
Human Genome, Point Mutation, Molecular Evolution, Population Genetics, Human Genome/genetics, Humans, Mutation, Genetic Selection
2.
Genetics ; 220(1)2022 01 04.
Article in English | MEDLINE | ID: mdl-34849832

ABSTRACT

The Patterson F- and D-statistics are commonly used measures for quantifying population relationships and for testing hypotheses about demographic history. These statistics make use of allele frequency information across populations to infer different aspects of population history, such as population structure and introgression events. Inclusion of related or inbred individuals can bias such statistics, which may often lead to the filtering of such individuals. Here, we derive statistical properties of the F- and D-statistics, including their biases due to the inclusion of related or inbred individuals, their variances, and their corresponding mean squared errors. Moreover, for those statistics that are biased, we develop unbiased estimators and evaluate the variances of these new quantities. Comparisons of the new unbiased statistics to the originals demonstrate that our newly derived statistics often have lower error across a wide population parameter space. Furthermore, we apply these unbiased estimators using several global human populations with the inclusion of related individuals to highlight their application on an empirical dataset. Finally, we implement these unbiased estimators in the open-source software package funbiased for easy application by the scientific community.
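
For orientation, the standard frequency-based definitions of these statistics can be sketched as follows. This is a plain illustration that treats population allele frequencies as known; it does not include the relatedness and inbreeding corrections derived in the paper, and it is not the funbiased implementation. Function names are illustrative, and sign conventions for D vary between authors.

```python
from typing import Sequence

Freqs = Sequence[float]  # one derived-allele frequency per site


def f2(p_a: Freqs, p_b: Freqs) -> float:
    """Mean squared allele-frequency difference between two populations."""
    return sum((a - b) ** 2 for a, b in zip(p_a, p_b)) / len(p_a)


def f4(p_a: Freqs, p_b: Freqs, p_c: Freqs, p_d: Freqs) -> float:
    """Mean of (a - b) * (c - d) across sites."""
    terms = [(a - b) * (c - d) for a, b, c, d in zip(p_a, p_b, p_c, p_d)]
    return sum(terms) / len(terms)


def d_statistic(p1: Freqs, p2: Freqs, p3: Freqs, p_out: Freqs) -> float:
    """ABBA-BABA D computed from expected site-pattern frequencies."""
    abba = sum((1 - a) * b * c * (1 - o)
               for a, b, c, o in zip(p1, p2, p3, p_out))
    baba = sum(a * (1 - b) * c * (1 - o)
               for a, b, c, o in zip(p1, p2, p3, p_out))
    if abba + baba == 0:
        raise ValueError("no informative ABBA or BABA sites")
    return (abba - baba) / (abba + baba)


if __name__ == "__main__":
    # Toy frequencies for two sites; P2 shares the derived allele with P3.
    print(d_statistic([0.0, 0.0], [1.0, 0.0], [1.0, 1.0], [0.0, 0.0]))
```

In practice these quantities are estimated from finite samples, which is where the biases studied in the abstract arise.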


Subjects
Gene Frequency
3.
Proc Natl Acad Sci U S A ; 118(26)2021 06 29.
Article in English | MEDLINE | ID: mdl-34162703

ABSTRACT

No endemic Madagascar animal with body mass >10 kg survived a relatively recent wave of extinction on the island. From morphological and isotopic analyses of skeletal "subfossil" remains we can reconstruct some of the biology and behavioral ecology of giant lemurs (primates; up to ∼160 kg) and other extraordinary Malagasy megafauna that survived into the past millennium. Yet, much about the evolutionary biology of these now-extinct species remains unknown, along with persistent phylogenetic uncertainty in some cases. Thankfully, despite the challenges of DNA preservation in tropical and subtropical environments, technical advances have enabled the recovery of ancient DNA from some Malagasy subfossil specimens. Here, we present a nuclear genome sequence (∼2× coverage) for one of the largest extinct lemurs, the koala lemur Megaladapis edwardsi (∼85 kg). To support the testing of key phylogenetic and evolutionary hypotheses, we also generated high-coverage nuclear genomes for two extant lemurs, Eulemur rufifrons and Lepilemur mustelinus, and we aligned these sequences with previously published genomes for three other extant lemurs and 47 nonlemur vertebrates. Our phylogenetic results confirm that Megaladapis is most closely related to the extant Lemuridae (typified in our analysis by E. rufifrons) to the exclusion of L. mustelinus, which contradicts morphology-based phylogenies. Our evolutionary analyses identified significant convergent evolution between M. edwardsi and an extant folivore (a colobine monkey) and an herbivore (horse) in genes encoding proteins that function in plant toxin biodegradation and nutrient absorption. These results suggest that koala lemurs were highly adapted to a leaf-based diet, which may also explain their convergent craniodental morphology with the small-bodied folivore Lepilemur.


Subjects
Cell Nucleus/genetics, Biological Extinction, Genome, Lemur/genetics, Phylogeny, Amino Acids/genetics, Animals, Base Sequence, Molecular Evolution, Genomics, Herbivory/physiology
4.
PLoS Genet ; 16(8): e1008896, 2020 08.
Article in English | MEDLINE | ID: mdl-32853200

ABSTRACT

Identifying regions of positive selection in genomic data remains a challenge in population genetics. Most current approaches rely on comparing values of summary statistics calculated in windows. We present an approach termed SURFDAWave, which translates measures of genetic diversity calculated in genomic windows to functional data. By transforming our discrete data points to be outputs of continuous functions defined over genomic space, we are able to learn the features of these functions that signify selection. This enables us to confidently identify complex modes of natural selection, including adaptive introgression. We are also able to predict important selection parameters that are responsible for shaping the inferred selection events. By applying our model to human population-genomic data, we recapitulate previously identified regions of selective sweeps, such as OCA2 in Europeans, and predict that its beneficial mutation reached a frequency of 0.02 before it swept 1,802 generations ago, a time when humans were relatively new to Europe. In addition, we identify BNC2 in Europeans as a target of adaptive introgression, and predict that it harbors a beneficial mutation that arose in an archaic human population that split from modern humans within the hypothesized modern human-Neanderthal divergence range.


Subjects
Genetic Models, Mutation Rate, White People/genetics, Animals, DNA-Binding Proteins/genetics, Genetic Variation, Humans, Membrane Transport Proteins, Neanderthals/genetics, Genetic Selection, Software
5.
Mol Biol Evol ; 36(2): 252-270, 2019 02 01.
Article in English | MEDLINE | ID: mdl-30398642

ABSTRACT

Identifying genomic locations of natural selection from sequence data is an ongoing challenge in population genetics. Current methods utilizing information combined from several summary statistics typically assume no correlation of summary statistics regardless of the genomic location from which they are calculated. However, due to linkage disequilibrium, summary statistics calculated at nearby genomic positions are highly correlated. We introduce an approach termed Trendsetter that accounts for the similarity of statistics calculated from adjacent genomic regions through trend filtering, while reducing the effects of multicollinearity through regularization. Our penalized regression framework has high power to detect sweeps, is capable of classifying sweep regions as either hard or soft, and can be applied to other selection scenarios as well. We find that Trendsetter is robust to both extensive missing data and strong background selection, and has comparable power to similar current approaches. Moreover, the model learned by Trendsetter can be viewed as a set of curves modeling the spatial distribution of summary statistics in the genome. Application to human genomic data revealed positively selected regions previously discovered such as LCT in Europeans and EDAR in East Asians. We also identified a number of novel candidates and show that populations with greater relatedness share more sweep signals.
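
The trend-filtering idea mentioned above can be sketched as a penalty on discrete differences between coefficients at adjacent genomic windows. The abstract does not give Trendsetter's exact objective, so the functions below are a generic illustration under that assumption; the names `discrete_difference`, `trend_filter_penalty`, `diff_order`, and `lam` are hypothetical.

```python
from typing import List, Sequence


def discrete_difference(values: Sequence[float], diff_order: int) -> List[float]:
    """Apply the first-difference operator diff_order times."""
    out = list(values)
    for _ in range(diff_order):
        out = [b - a for a, b in zip(out, out[1:])]
    return out


def trend_filter_penalty(coefs: Sequence[float],
                         diff_order: int = 1,
                         lam: float = 1.0) -> float:
    """L1 norm of the diff_order-th differences, scaled by lam.

    diff_order=1 favors piecewise-constant coefficient curves across
    adjacent windows; diff_order=2 favors piecewise-linear curves.
    """
    return lam * sum(abs(d) for d in discrete_difference(coefs, diff_order))


if __name__ == "__main__":
    # Coefficients for one summary statistic across four adjacent windows.
    print(trend_filter_penalty([0.0, 0.0, 1.0, 1.0], diff_order=1))
    print(trend_filter_penalty([0.0, 1.0, 2.0, 3.0], diff_order=2))
```

Adding such a term to a regression loss encourages coefficients for neighboring windows to vary smoothly, which is one way to account for the correlation that linkage disequilibrium induces between nearby summary statistics.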


Subjects
Genetic Techniques, Population Genetics/methods, Human Genome, Machine Learning, Genetic Models, Computer Simulation, Humans, Regression Analysis, Software