Search | BVS Regional Portal

An efficient and robust ABC approach to infer the rate and strength of adaptation.

Murga-Moreno, Jesús; Casillas, Sònia; Barbadilla, Antonio; Uricchio, Lawrence; Enard, David.

G3 (Bethesda) ; 14(4), 2024 Apr 03.

Article in English | MEDLINE | ID: mdl-38365205

ABSTRACT

Inferring the effects of positive selection on genomes remains a critical step in characterizing the ultimate and proximate causes of adaptation across species, and quantifying positive selection remains a challenge due to the confounding effects of many other evolutionary processes. Robust and efficient approaches for adaptation inference could help characterize the rate and strength of adaptation in nonmodel species for which demographic history, mutational processes, and recombination patterns are not currently well-described. Here, we introduce an efficient and user-friendly extension of the McDonald-Kreitman test (ABC-MK) for quantifying long-term protein adaptation in specific lineages of interest. We characterize the performance of our approach with forward simulations and find that it is robust to many demographic perturbations and positive selection configurations, demonstrating its suitability for applications to nonmodel genomes. We apply ABC-MK to the human proteome and a set of known virus interacting proteins (VIPs) to test the long-term adaptation in genes interacting with viruses. We find substantially stronger signatures of positive selection on RNA-VIPs than DNA-VIPs, suggesting that RNA viruses may be an important driver of human adaptation over deep evolutionary time scales.

Subjects

Biological Evolution, Genetic Selection, Humans, Genome, Mutation
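
The record above describes an approach based on approximate Bayesian computation (ABC). As a rough illustration of the general idea only, the sketch below runs a plain rejection-ABC loop on a toy model: draw a parameter from a prior, simulate a summary statistic, and keep the draw when the simulated summary falls close to the observed one. The simulator, the uniform prior, the tolerance, and all names (`simulate_summary`, `abc_rejection`) are assumptions made for this example; it is not the ABC-MK implementation and does not reproduce its summary statistics.

```python
import random


def simulate_summary(alpha, n_sites, rng):
    """Toy simulator (hypothetical): count how many of n_sites
    substitutions are adaptive when each is adaptive with probability alpha."""
    return sum(rng.random() < alpha for _ in range(n_sites))


def abc_rejection(observed, n_sites, n_draws=20000, tolerance=2, seed=0):
    """Plain rejection ABC: keep prior draws whose simulated summary
    lies within `tolerance` of the observed summary."""
    rng = random.Random(seed)
    accepted = []
    for _ in range(n_draws):
        alpha = rng.random()  # uniform prior on [0, 1] (an assumption)
        simulated = simulate_summary(alpha, n_sites, rng)
        if abs(simulated - observed) <= tolerance:
            accepted.append(alpha)
    return accepted


# Hypothetical observation: 30 adaptive substitutions out of 100.
posterior = abc_rejection(observed=30, n_sites=100)
posterior_mean = sum(posterior) / len(posterior)
print(f"accepted draws: {len(posterior)}, posterior mean: {posterior_mean:.3f}")
```

The accepted draws approximate the posterior distribution of the parameter; a smaller tolerance gives a closer approximation at the cost of fewer accepted draws.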

An efficient and robust ABC approach to infer the rate and strength of adaptation.

Murga-Moreno, Jesús; Casillas, Sònia; Barbadilla, Antonio; Uricchio, Lawrence; Enard, David.

bioRxiv ; 2023 Sep 28.

Article in English | MEDLINE | ID: mdl-37693550

ABSTRACT

Inferring the effects of positive selection on genomes remains a critical step in characterizing the ultimate and proximate causes of adaptation across species, and quantifying positive selection remains a challenge due to the confounding effects of many other evolutionary processes. Robust and efficient approaches for adaptation inference could help characterize the rate and strength of adaptation in non-model species for which demographic history, mutational processes, and recombination patterns are not currently well-described. Here, we introduce an efficient and user-friendly extension of the McDonald-Kreitman test (ABC-MK) for quantifying long-term protein adaptation in specific lineages of interest. We characterize the performance of our approach with forward simulations and find that it is robust to many demographic perturbations and positive selection configurations, demonstrating its suitability for applications to non-model genomes. We apply ABC-MK to the human proteome and a set of known Virus Interacting Proteins (VIPs) to test the long-term adaptation in genes interacting with viruses. We find substantially stronger signatures of positive selection on RNA-VIPs than DNA-VIPs, suggesting that RNA viruses may be an important driver of human adaptation over deep evolutionary time scales.

impMKT: the imputed McDonald and Kreitman test, a straightforward correction that significantly increases the evidence of positive selection of the McDonald and Kreitman test at the gene level.

Murga-Moreno, Jesús; Coronado-Zamora, Marta; Casillas, Sònia; Barbadilla, Antonio.

G3 (Bethesda) ; 12(10), 2022 09 30.

Article in English | MEDLINE | ID: mdl-35976111

ABSTRACT

The McDonald and Kreitman test is one of the most powerful and widely used methods to detect and quantify recurrent natural selection in DNA sequence data. One of its main limitations is the underestimation of positive selection due to the presence of slightly deleterious variants segregating at low frequencies. Although several approaches have been developed to overcome this limitation, most of them work on gene-pooled analyses. Here, we present the imputed McDonald and Kreitman test (impMKT), a new straightforward approach for the detection of positive selection and other selection components of the distribution of fitness effects at the gene level. We compare the imputed McDonald and Kreitman test with other widely used McDonald and Kreitman test approaches, considering both simulated and empirical data. By applying the imputed McDonald and Kreitman test to human and Drosophila data at the gene level, we substantially increase the statistical evidence of positive selection with respect to previous approaches (e.g. by 50% and 157% compared with the McDonald and Kreitman test in Drosophila and humans, respectively). Finally, we review the minimum number of genes required to obtain a reliable estimate of the proportion of adaptive substitutions (α) in gene-pooled analyses by using the imputed McDonald and Kreitman test compared with other McDonald and Kreitman test implementations. Because of its simplicity and increased power to detect recurrent positive selection on genes, we propose the imputed McDonald and Kreitman test as the first straightforward approach for testing specific evolutionary hypotheses at the gene level. The software implementation and population genomics data are available at the web-server imkt.uab.cat.

Subjects

Biological Evolution, Genetic Selection, Animals, Drosophila/genetics, Molecular Evolution, Humans, Metagenomics, Software
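
The abstract above refers to the proportion of adaptive substitutions (α). As a minimal sketch, the function below computes the standard McDonald and Kreitman estimator, α = 1 - (Ds · Pn) / (Dn · Ps), from counts of nonsynonymous and synonymous divergence (Dn, Ds) and polymorphism (Pn, Ps). This is the uncorrected estimator only; it does not include the imputation step of impMKT, and the function name and the counts used below are hypothetical.

```python
def mk_alpha(dn, ds, pn, ps):
    """Standard (uncorrected) McDonald-Kreitman estimate of alpha.

    dn, ds: nonsynonymous and synonymous fixed differences (divergence)
    pn, ps: nonsynonymous and synonymous polymorphisms
    """
    if dn == 0 or ps == 0:
        raise ValueError("alpha is undefined when Dn or Ps is zero")
    return 1 - (ds * pn) / (dn * ps)


# Hypothetical counts chosen for illustration, not taken from any dataset.
alpha = mk_alpha(dn=40, ds=60, pn=10, ps=50)
print(f"alpha = {alpha:.2f}")
```

Slightly deleterious nonsynonymous polymorphisms inflate Pn and therefore bias this estimate downward, which is the limitation the abstract describes.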

PopHumanVar: an interactive application for the functional characterization and prioritization of adaptive genomic variants in humans.

Colomer-Vilaplana, Aina; Murga-Moreno, Jesús; Canalda-Baltrons, Aleix; Inserte, Clara; Soto, Daniel; Coronado-Zamora, Marta; Barbadilla, Antonio; Casillas, Sònia.

Nucleic Acids Res ; 50(D1): D1069-D1076, 2022 01 07.

Article in English | MEDLINE | ID: mdl-34664660

ABSTRACT

Adaptive challenges that humans faced as they expanded across the globe left specific molecular footprints that can be decoded in our present-day genomes. Different sets of metrics are used to identify genomic regions that have undergone selection. However, there are fewer methods capable of pinpointing the allele ultimately responsible for this selection. Here, we present PopHumanVar, an interactive online application that is designed to facilitate the exploration and thorough analysis of candidate genomic regions by integrating both functional and population genomics data currently available. PopHumanVar generates useful summary reports of prioritized variants that are putatively causal for recent selective sweeps. It compiles data and graphically represents different layers of information, including natural selection statistics, as well as functional annotations and genealogical estimations of variant age, for biallelic single nucleotide variants (SNVs) of the 1000 Genomes Project phase 3. Specifically, PopHumanVar amasses SNV-based information from GEVA, SnpEFF, GWAS Catalog, ClinVar, RegulomeDB and DisGeNET databases, as well as accurate estimations of iHS, nSL and iSAFE statistics. Notably, PopHumanVar can successfully identify known causal variants of frequently reported candidate selection regions, including EDAR in East Asians, ACKR1 (DARC) in Africans and LCT/MCM6 in Europeans. PopHumanVar is open and freely available at https://pophumanvar.uab.cat.

Subjects

Genetic Databases, Human Genome/genetics, Genetic Selection/genetics, Software, Physiological Adaptation/genetics, Computational Biology, Genomics, Humans, Single Nucleotide Polymorphism/genetics

Decreased recent adaptation at human mendelian disease genes as a possible consequence of interference between advantageous and deleterious variants.

Di, Chenlu; Murga Moreno, Jesus; Salazar-Tortosa, Diego F; Lauterbur, M Elise; Enard, David.

Elife ; 10, 2021 10 12.

Article in English | MEDLINE | ID: mdl-34636724

ABSTRACT

Advances in genome sequencing have improved our understanding of the genetic basis of human diseases, and thousands of human genes have been associated with different diseases. Recent genomic adaptation at disease genes has not been well characterized. Here, we compare the rate of strong recent adaptation in the form of selective sweeps between mendelian, non-infectious disease genes and non-disease genes across distinct human populations from the 1000 Genomes Project. We find that mendelian disease genes have experienced far fewer selective sweeps than non-disease genes, especially in Africa. Investigating the possible causes of the sweep deficit at disease genes further, we find that this deficit is very strong at disease genes with both low recombination rates and high numbers of associated disease variants, but is almost non-existent at disease genes with higher recombination rates or lower numbers of associated disease variants. Because segregating recessive deleterious variants have the ability to interfere with adaptive ones, these observations strongly suggest that adaptation has been slowed down by the presence of interfering recessive deleterious variants at disease genes. These results suggest that disease genes suffer from a transient inability to adapt as fast as the rest of the genome.

Subjects

Inborn Genetic Diseases/genetics, Genetic Variation, Human Genome/genetics, Genetic Databases, Molecular Evolution, Humans

Drosophila Evolution over Space and Time (DEST): A New Population Genomics Resource.

Kapun, Martin; Nunez, Joaquin C B; Bogaerts-Márquez, María; Murga-Moreno, Jesús; Paris, Margot; Outten, Joseph; Coronado-Zamora, Marta; Tern, Courtney; Rota-Stabelli, Omar; Guerreiro, Maria P García; Casillas, Sònia; Orengo, Dorcas J; Puerma, Eva; Kankare, Maaria; Ometto, Lino; Loeschcke, Volker; Onder, Banu S; Abbott, Jessica K; Schaeffer, Stephen W; Rajpurohit, Subhash; Behrman, Emily L; Schou, Mads F; Merritt, Thomas J S; Lazzaro, Brian P; Glaser-Schmitt, Amanda; Argyridou, Eliza; Staubach, Fabian; Wang, Yun; Tauber, Eran; Serga, Svitlana V; Fabian, Daniel K; Dyer, Kelly A; Wheat, Christopher W; Parsch, John; Grath, Sonja; Veselinovic, Marija Savic; Stamenkovic-Radak, Marina; Jelic, Mihailo; Buendía-Ruíz, Antonio J; Gómez-Julián, Maria Josefa; Espinosa-Jimenez, Maria Luisa; Gallardo-Jiménez, Francisco D; Patenkovic, Aleksandra; Eric, Katarina; Tanaskovic, Marija; Ullastres, Anna; Guio, Lain; Merenciano, Miriam; Guirao-Rico, Sara; Horváth, Vivien.

Mol Biol Evol ; 38(12): 5782-5805, 2021 12 09.

Article in English | MEDLINE | ID: mdl-34469576

ABSTRACT

Drosophila melanogaster is a leading model in population genetics and genomics, and a growing number of whole-genome data sets from natural populations of this species have been published over the last years. A major challenge is the integration of disparate data sets, often generated using different sequencing technologies and bioinformatic pipelines, which hampers our ability to address questions about the evolution of this species. Here we address these issues by developing a bioinformatics pipeline that maps pooled sequencing (Pool-Seq) reads from D. melanogaster to a hologenome consisting of fly and symbiont genomes and estimates allele frequencies using either a heuristic (PoolSNP) or a probabilistic variant caller (SNAPE-pooled). We use this pipeline to generate the largest data repository of genomic data available for D. melanogaster to date, encompassing 271 previously published and unpublished population samples from over 100 locations in >20 countries on four continents. Several of these locations have been sampled at different seasons across multiple years. This data set, which we call Drosophila Evolution over Space and Time (DEST), is coupled with sampling and environmental metadata. A web-based genome browser and web portal provide easy access to the SNP data set. We further provide guidelines on how to use Pool-Seq data for model-based demographic inference. Our aim is to provide this scalable platform as a community resource which can be easily extended via future efforts for an even more extensive cosmopolitan data set. Our resource will enable population geneticists to analyze spatiotemporal genetic patterns and evolutionary dynamics of D. melanogaster populations in unprecedented detail.

Subjects

Drosophila melanogaster, Metagenomics, Animals, Drosophila melanogaster/genetics, Gene Frequency, Population Genetics, Genomics

iMKT: the integrative McDonald and Kreitman test.

Murga-Moreno, Jesús; Coronado-Zamora, Marta; Hervas, Sergi; Casillas, Sònia; Barbadilla, Antonio.

Nucleic Acids Res ; 47(W1): W283-W288, 2019 07 02.

Article in English | MEDLINE | ID: mdl-31081014

ABSTRACT

The McDonald and Kreitman test (MKT) is one of the most powerful and widely used methods to detect and quantify recurrent natural selection using DNA sequence data. Here we present iMKT (acronym for integrative McDonald and Kreitman test), a novel web-based service performing four distinct MKT types. It allows the detection and estimation of four different selection regimes (adaptive, neutral, strongly deleterious and weakly deleterious) acting on any genomic sequence. iMKT can analyze both users' own population genomic data and pre-loaded Drosophila melanogaster and human sequences of protein-coding genes obtained from the largest population genomic datasets to date. Advanced options in the website allow testing complex hypotheses such as the application example shown here: do genes located in high recombination regions undergo higher rates of adaptation? We intend iMKT to become a reference tool for the study of evolutionary adaptation in massive population genomics datasets, especially in Drosophila and humans. iMKT is a free resource online at https://imkt.uab.cat.

Subjects

Physiological Adaptation/genetics, Drosophila melanogaster/genetics, Genome, Genetic Recombination, Genetic Selection, DNA Sequence Analysis/statistics & numerical data, Alleles, Animals, Biological Evolution, Datasets as Topic, Gene Frequency, Humans, Metagenomics, Genetic Polymorphism

PopHumanScan: the online catalog of human genome adaptation.

Murga-Moreno, Jesús; Coronado-Zamora, Marta; Bodelón, Alejandra; Barbadilla, Antonio; Casillas, Sònia.

Nucleic Acids Res ; 47(D1): D1080-D1089, 2019 01 08.

Article in English | MEDLINE | ID: mdl-30335169

ABSTRACT

Since the migrations that led humans to colonize Earth, our species has faced frequent adaptive challenges that have left signatures in the landscape of genetic variation and that we can identify in our present-day genomes. Here, we (i) apply an outlier approach to eight different population genetic statistics for 22 non-admixed human populations of Phase III of the 1000 Genomes Project to detect selective sweeps at different historical ages, as well as events of recurrent positive selection in the human lineage; and (ii) create PopHumanScan, an online catalog that compiles and annotates all candidate regions under selection to facilitate their validation and thorough analysis. Well-known examples of human genetic adaptation published elsewhere are included in the catalog, as well as hundreds of other attractive candidates that will require further investigation. Designed as a collaborative database, PopHumanScan aims to become a central repository to share information, guide future studies and help advance our understanding of how selection has modeled our genomes as a response to changes in the environment or lifestyle of human populations. PopHumanScan is open and freely available at https://pophumanscan.uab.cat.

Subjects

Physiological Adaptation/genetics, Computational Biology/methods, Population Genetics/methods, Human Genome/genetics, Genetic Selection, Genetic Databases, Molecular Evolution, Genomics/methods, Humans, Internet, Linkage Disequilibrium, Genetic Models
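
The abstract above mentions an outlier approach applied to population genetic statistics. One common way to read that phrase is an empirical-tail scan: rank genomic windows by a statistic and flag the most extreme fraction as candidates. The sketch below shows that idea under stated assumptions; the function name, the window labels, the scores, and the 5% cutoff are all hypothetical and are not taken from PopHumanScan.

```python
def empirical_outliers(scores, top_fraction=0.05):
    """Return the window names whose statistic falls in the top
    `top_fraction` of the empirical distribution (highest values first)."""
    if not scores:
        return []
    # Always keep at least one window so small inputs still yield a candidate.
    n_keep = max(1, round(len(scores) * top_fraction))
    ranked = sorted(scores.items(), key=lambda item: item[1], reverse=True)
    return [name for name, _ in ranked[:n_keep]]


# Hypothetical per-window scores: window i gets score i.
window_scores = {f"win{i}": float(i) for i in range(100)}
candidates = empirical_outliers(window_scores, top_fraction=0.05)
print(candidates)
```

An empirical cutoff of this kind always returns candidates, even in the absence of selection, so flagged regions are usually treated as hypotheses for follow-up rather than as confirmed targets.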
