Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 15 de 15
Filtrar
1.
Mol Biol Evol ; 41(7)2024 Jul 03.
Artigo em Inglês | MEDLINE | ID: mdl-38935581

RESUMO

Segregation distorters (SDs) are genetic elements that distort the Mendelian segregation ratio to favor their own transmission and are able to spread even when they incur fitness costs on organisms carrying them. Depending on the biology of the host organisms and the genetic architecture of the SDs, the population dynamics of SDs can be highly variable. Inbreeding is considered an effective mechanism for inhibiting the spread of SDs in populations, and can evolve as a defense mechanism against SDs in some systems. However, we show that inbreeding in the form of selfing in fact promotes the spread of SDs acting as pollen killers in a toxin-antidote system in hermaphroditic plants by two mechanisms: (i) By reducing the effective recombination rate between killer and antidote loci in the two-locus system and (ii) by increasing the proportion of SD alleles in individual flowers, rather than in the general gene-pool. We also show that in rice (Oryza sativa L.), a typical hermaphroditic plant, all molecularly characterized SDs associated with pollen killing were involved in population hybridization and have introgressed across different species. Paradoxically, these loci, which are associated with hybrid incompatibility and can be thought of as Bateson-Dobzhansky-Muller incompatibility loci are expected to reduce gene-flow between species, in fact cross species boundaries more frequently than random loci, and may act as important drivers of introgression.


Assuntos
Introgressão Genética , Oryza , Oryza/genética , Endogamia , Pólen/genética , Organismos Hermafroditas/genética , Hibridização Genética , Autofertilização
2.
PLoS Genet ; 18(7): e1010281, 2022 07.
Artigo em Inglês | MEDLINE | ID: mdl-35839249

RESUMO

Estimating admixture histories is crucial for understanding the genetic diversity we see in present-day populations. Allele frequency or phylogeny-based methods are excellent for inferring the existence of admixture or its proportions. However, to estimate admixture times, spatial information from admixed chromosomes of local ancestry or the decay of admixture linkage disequilibrium (ALD) is used. One popular method, implemented in the programs ALDER and ROLLOFF, uses two-locus ALD to infer the time of a single admixture event, but is only able to estimate the time of the most recent admixture event based on this summary statistic. To address this limitation, we derive analytical expressions for the expected ALD in a three-locus system and provide a new statistical method based on these results that is able to resolve more complicated admixture histories. Using simulations, we evaluate the performance of this method on a range of different admixture histories. As an example, we apply the method to the Colombian and Mexican samples from the 1000 Genomes project. The implementation of our method is available at https://github.com/Genomics-HSE/LaNeta.


Assuntos
Genética Populacional , Grupos Populacionais , Colômbia , Frequência do Gene/genética , Humanos , Desequilíbrio de Ligação , Modelos Genéticos , Grupos Populacionais/genética
3.
J Math Biol ; 89(5): 47, 2024 Oct 04.
Artigo em Inglês | MEDLINE | ID: mdl-39363040

RESUMO

Estimation of admixture proportions has become one of the most commonly used computational tools in population genomics. However, there is remarkably little population genetic theory on statistical properties of these variables. We develop theoretical results that can accurately predict means and variances of admixture proportions within a population using models with recombination and genetic drift. Based on established theory on measures of multilocus disequilibrium, we show that there is a set of recurrence relations that can be used to derive expectations for higher moments of the admixture proportions distribution. We obtain closed form solutions for some special cases. Using these results, we develop a method for estimating admixture parameters from estimated admixture proportions obtained from programs such as Structure or Admixture. We apply this method to HapMap 3 data and find that the population history of African Americans, as expected, is not best explained by a single admixture event between people of European and African ancestry. The model of constant gene flow starting at 8 generations and ending at 2 generations before present gives the best fit.


Assuntos
Fluxo Gênico , Genética Populacional , Desequilíbrio de Ligação , Conceitos Matemáticos , Modelos Genéticos , Brancos , Humanos , Negro ou Afro-Americano/genética , Deriva Genética , Genética Populacional/estatística & dados numéricos , Recombinação Genética , Brancos/genética
4.
PLoS Comput Biol ; 18(8): e1010409, 2022 08.
Artigo em Inglês | MEDLINE | ID: mdl-36001646

RESUMO

Accurate simulation of complex biological processes is an essential component of developing and validating new technologies and inference approaches. As an effort to help contain the COVID-19 pandemic, large numbers of SARS-CoV-2 genomes have been sequenced from most regions in the world. More than 5.5 million viral sequences are publicly available as of November 2021. Many studies estimate viral genealogies from these sequences, as these can provide valuable information about the spread of the pandemic across time and space. Additionally such data are a rich source of information about molecular evolutionary processes including natural selection, for example allowing the identification of new variants with transmissibility and immunity evasion advantages. To our knowledge, there is no framework that is both efficient and flexible enough to simulate the pandemic to approximate world-scale scenarios and generate viral genealogies of millions of samples. Here, we introduce a new fast simulator VGsim which addresses the problem of simulation genealogies under epidemiological models. The simulation process is split into two phases. During the forward run the algorithm generates a chain of population-level events reflecting the dynamics of the pandemic using an hierarchical version of the Gillespie algorithm. During the backward run a coalescent-like approach generates a tree genealogy of samples conditioning on the population-level events chain generated during the forward run. Our software can model complex population structure, epistasis and immunity escape.


Assuntos
COVID-19 , Pandemias , COVID-19/epidemiologia , Simulação por Computador , Humanos , SARS-CoV-2/genética , Software
5.
Mol Biol Evol ; 38(5): 2152-2165, 2021 05 04.
Artigo em Inglês | MEDLINE | ID: mdl-33502512

RESUMO

Adaptive introgression-the flow of adaptive genetic variation between species or populations-has attracted significant interest in recent years and it has been implicated in a number of cases of adaptation, from pesticide resistance and immunity, to local adaptation. Despite this, methods for identification of adaptive introgression from population genomic data are lacking. Here, we present Ancestry_HMM-S, a hidden Markov model-based method for identifying genes undergoing adaptive introgression and quantifying the strength of selection acting on them. Through extensive validation, we show that this method performs well on moderately sized data sets for realistic population and selection parameters. We apply Ancestry_HMM-S to a data set of an admixed Drosophila melanogaster population from South Africa and we identify 17 loci which show signatures of adaptive introgression, four of which have previously been shown to confer resistance to insecticides. Ancestry_HMM-S provides a powerful method for inferring adaptive introgression in data sets that are typically collected when studying admixed populations. This method will enable powerful insights into the genetic consequences of admixture across diverse populations. Ancestry_HMM-S can be downloaded from https://github.com/jesvedberg/Ancestry_HMM-S/.


Assuntos
Adaptação Biológica/genética , Introgressão Genética , Modelos Genéticos , Seleção Genética , Software , Algoritmos , Animais , Drosophila melanogaster/genética , Cadeias de Markov
6.
PLoS Genet ; 14(9): e1007641, 2018 09.
Artigo em Inglês | MEDLINE | ID: mdl-30226838

RESUMO

Human populations outside of Africa have experienced at least two bouts of introgression from archaic humans, from Neanderthals and Denisovans. In Papuans there is prior evidence of both these introgressions. Here we present a new approach to detect segments of individual genomes of archaic origin without using an archaic reference genome. The approach is based on a hidden Markov model that identifies genomic regions with a high density of single nucleotide variants (SNVs) not seen in unadmixed populations. We show using simulations that this provides a powerful approach to identifying segments of archaic introgression with a low rate of false detection, given data from a suitable outgroup population is available, without the archaic introgression but containing a majority of the variation that arose since initial separation from the archaic lineage. Furthermore our approach is able to infer admixture proportions and the times both of admixture and of initial divergence between the human and archaic populations. We apply the model to detect archaic introgression in 89 Papuans and show how the identified segments can be assigned to likely Neanderthal or Denisovan origin. We report more Denisovan admixture than previous studies and find a shift in size distribution of fragments of Neanderthal and Denisovan origin that is compatible with a difference in admixture time. Furthermore, we identify small amounts of Denisova ancestry in South East Asians and South Asians.


Assuntos
Genoma Humano/genética , Hominidae/genética , Hibridização Genética/genética , Homem de Neandertal/genética , Animais , Povo Asiático/genética , População Negra/genética , Fósseis , Humanos , Havaiano Nativo ou Outro Ilhéu do Pacífico/genética , Filogenia , População Branca/genética
7.
J Math Biol ; 77(5): 1279-1298, 2018 11.
Artigo em Inglês | MEDLINE | ID: mdl-29876645

RESUMO

The number of individuals in a random sample with close relatives in the sample is a quantity of interest when designing Genome Wide Association Studies and other cohort based genetic, and non-genetic, studies. In this paper, we develop expressions for the distribution and expectation of the number of p-th cousins in a sample from a population of size N under two diploid Wright-Fisher models. We also develop simple asymptotic expressions for large values of N. For example, the expected proportion of individuals with at least one p-th cousin in a sample of K individuals, for a diploid dioecious Wright-Fisher model, is approximately [Formula: see text]. Our results show that a substantial fraction of individuals in the sample will have at least a second cousin if the sampling fraction (K / N) is on the order of [Formula: see text]. This confirms that, for large cohort samples, relatedness among individuals cannot easily be ignored.


Assuntos
Família , Genética Populacional/estatística & dados numéricos , Irmãos , Estudos de Coortes , Simulação por Computador , Feminino , Estudo de Associação Genômica Ampla/estatística & dados numéricos , Humanos , Masculino , Conceitos Matemáticos , Modelos Genéticos , Modelos Estatísticos , Linhagem , Probabilidade , Tamanho da Amostra
8.
Viruses ; 15(7)2023 06 22.
Artigo em Inglês | MEDLINE | ID: mdl-37515103

RESUMO

The Omicron variant of SARS-CoV-2 rapidly spread worldwide in late 2021-early 2022, displacing the previously prevalent Delta variant. Before 16 December 2021, community transmission had already been observed in tens of countries globally. However, in Russia, the majority of reported cases at that time had been sporadic and associated with travel. Here, we report an Omicron outbreak at a student dormitory in Saint Petersburg between 16-29 December 2021, which was the earliest known instance of a large-scale community transmission in Russia. Out of the 465 sampled residents of the dormitory, 180 (38.7%) tested PCR-positive. Among the 118 residents for whom the variant had been tested by whole-genome sequencing, 111 (94.1%) were found to carry the Omicron variant. Among these 111 residents, 60 (54.1%) were vaccinated or had reported a previous infection of COVID-19. Phylogenetic analysis confirmed that the outbreak was caused by a single introduction of the BA.1.1 sub-lineage of the Omicron variant. The dormitory-derived clade constituted a significant proportion of BA.1.1 samples in Saint Petersburg and has spread to other regions of Russia and even to other countries. The rapid spread of the Omicron variant in a population with preexisting immunity to previous variants underlines its propensity for immune evasion.


Assuntos
COVID-19 , SARS-CoV-2 , Humanos , SARS-CoV-2/genética , COVID-19/epidemiologia , Filogenia , Surtos de Doenças , Federação Russa/epidemiologia
9.
Virus Evol ; 8(1): veac017, 2022.
Artigo em Inglês | MEDLINE | ID: mdl-35371558

RESUMO

Delta has outcompeted most preexisting variants of SARS-CoV-2, becoming the globally predominant lineage by mid-2021. Its subsequent evolution has led to the emergence of multiple sublineages, most of which are well-mixed between countries. By contrast, here we show that nearly the entire Delta epidemic in Russia has probably descended from a single import event, or from multiple closely timed imports from a single poorly sampled geographic location. Indeed, over 90 per cent of Delta samples in Russia are characterized by the nsp2:K81N + ORF7a:P45L pair of mutations which is rare outside Russia, putting them in the AY.122 sublineage. The AY.122 lineage was frequent in Russia among Delta samples from the start, and has not increased in frequency in other countries where it has been observed, suggesting that its high prevalence in Russia has probably resulted from a random founder effect rather than a transmission advantage. The apartness of the genetic composition of the Delta epidemic in Russia makes Russia somewhat unusual, although not exceptional, among other countries.

10.
Curr Zool ; 67(2): 191-199, 2021 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-33854537

RESUMO

Animals living in extremely high elevations have to adapt to low temperatures and low oxygen availability (hypoxia), but the underlying genetic mechanisms associated with these adaptations are still unclear. The mitochondrial respiratory chain can provide >95% of the ATP in animal cells, and its efficiency is influenced by temperature and oxygen availability. Therefore, the respiratory chain complexes (RCCs) could be important molecular targets for positive selection associated with respiratory adaptation in high-altitude environments. Here, we investigated positive selection in 5 RCCs and their assembly factors by analyzing sequences of 106 genes obtained through RNA-seq of all 15 Chinese Phrynocephalus lizard species, which are distributed from lowlands to the Tibetan plateau (average elevation >4,500 m). Our results indicate that evidence of positive selection on RCC genes is not significantly different from assembly factors, and we found no difference in selective pressures among the 5 complexes. We specifically looked for positive selection in lineages where changes in habitat elevation happened. The group of lineages evolving from low to high altitude show stronger signals of positive selection than lineages evolving from high to low elevations. Lineages evolving from low to high elevation also have more shared codons under positive selection, though the changes are not equivalent at the amino acid level. This study advances our understanding of the genetic basis of animal respiratory metabolism evolution in extreme high environments and provides candidate genes for further confirmation with functional analyses.

11.
medRxiv ; 2021 Dec 02.
Artigo em Inglês | MEDLINE | ID: mdl-33948608

RESUMO

Accurate simulation of complex biological processes is an essential component of developing and validating new technologies and inference approaches. As an effort to help contain the COVID-19 pandemic, large numbers of SARS-CoV-2 genomes have been sequenced from most regions in the world. More than 5.5 million viral sequences are publicly available as of November 2021. Many studies estimate viral genealogies from these sequences, as these can provide valuable information about the spread of the pandemic across time and space. Additionally such data are a rich source of information about molecular evolutionary processes including natural selection, for example allowing the identification of new variants with transmissibility and immunity evasion advantages. To our knowledge, there is no framework that is both efficient and flexible enough to simulate the pandemic to approximate world-scale scenarios and generate viral genealogies of millions of samples. Here, we introduce a new fast simulator VGsim which addresses the problem of simulation genealogies under epidemiological models. The simulation process is split into two phases. During the forward run the algorithm generates a chain of population-level events reflecting the dynamics of the pandemic using an hierarchical version of the Gillespie algorithm. During the backward run a coalescent-like approach generates a tree genealogy of samples conditioning on the population-level events chain generated during the forward run. Our software can model complex population structure, epistasis and immunity escape. The code is freely available at https://github.com/Genomics-HSE/VGsim.

12.
Nat Commun ; 12(1): 649, 2021 01 28.
Artigo em Inglês | MEDLINE | ID: mdl-33510171

RESUMO

The ongoing pandemic of SARS-CoV-2 presents novel challenges and opportunities for the use of phylogenetics to understand and control its spread. Here, we analyze the emergence of SARS-CoV-2 in Russia in March and April 2020. Combining phylogeographic analysis with travel history data, we estimate that the sampled viral diversity has originated from at least 67 closely timed introductions into Russia, mostly in late February to early March. All but one of these introductions were not from China, suggesting that border closure with China has helped delay establishment of SARS-CoV-2 in Russia. These introductions resulted in at least 9 distinct Russian lineages corresponding to domestic transmission. A notable transmission cluster corresponded to a nosocomial outbreak at the Vreden hospital in Saint Petersburg; phylodynamic analysis of this cluster reveals multiple (2-3) introductions each giving rise to a large number of cases, with a high initial effective reproduction number of 3.0 [1.9, 4.3].


Assuntos
Número Básico de Reprodução/estatística & dados numéricos , COVID-19/epidemiologia , COVID-19/transmissão , Genoma Viral/genética , SARS-CoV-2/genética , Humanos , Taxa de Mutação , Filogeografia , Federação Russa/epidemiologia , Sequenciamento Completo do Genoma
13.
medRxiv ; 2021 Dec 05.
Artigo em Inglês | MEDLINE | ID: mdl-34909799

RESUMO

BACKGROUND: Delta has outcompeted most preexisting variants of SARS-CoV-2, becoming the globally predominant lineage by mid-2021. Its subsequent evolution has led to emergence of multiple sublineages, many of which are well-mixed between countries. AIM: Here, we aim to study the emergence and spread of the Delta lineage in Russia. METHODS: We use a phylogeographic approach to infer imports of Delta sublineages into Russia, and phylodynamic models to assess the rate of their spread. RESULTS: We show that nearly the entire Delta epidemic in Russia has probably descended from a single import event despite genetic evidence of multiple Delta imports. Indeed, over 90% of Delta samples in Russia are characterized by the nsp2:K81N+ORF7a:P45L pair of mutations which is rare outside Russia, putting them in the AY.122 sublineage. The AY.122 lineage was frequent in Russia among Delta samples from the start, and has not increased in frequency in other countries where it has been observed, suggesting that its high prevalence in Russia has probably resulted from a random founder effect. CONCLUSION: The apartness of the genetic composition of the Delta epidemic in Russia makes Russia somewhat unusual, although not exceptional, among other countries.

14.
G3 (Bethesda) ; 10(10): 3663-3673, 2020 10 05.
Artigo em Inglês | MEDLINE | ID: mdl-32763953

RESUMO

Admixture is increasingly being recognized as an important factor in evolutionary genetics. The distribution of genomic admixture tracts, and the resulting effects on admixture linkage disequilibrium, can be used to date the timing of admixture between species or populations. However, the theory used for such prediction assumes selective neutrality despite the fact that many famous examples of admixture involve natural selection acting for or against admixture. In this paper, we investigate the effects of positive selection on the distribution of tract lengths. We develop a theoretical framework that relies on approximating the trajectory of the selected allele using a logistic function. By numerically calculating the expected allele trajectory, we also show that the approach can be extended to cases where the logistic approximation is poor due to the effects of genetic drift. Using simulations, we show that the model is highly accurate under most scenarios. We use the model to show that positive selection on average will tend to increase the admixture tract length. However, perhaps counter-intuitively, conditional on the allele frequency at the time of sampling, positive selection will actually produce shorter expected tract lengths. We discuss the consequences of our results in interpreting the timing of the introgression of EPAS1 from Denisovans into the ancestors of Tibetans.


Assuntos
Deriva Genética , Seleção Genética , Alelos , Frequência do Gene , Genética Populacional , Desequilíbrio de Ligação
15.
Genome Biol Evol ; 12(8): 1459-1470, 2020 08 01.
Artigo em Inglês | MEDLINE | ID: mdl-32614437

RESUMO

Detection of positive selection signatures in populations around the world is helping to uncover recent human evolutionary history as well as the genetic basis of diseases. Most human evolutionary genomic studies have been performed in European, African, and Asian populations. However, populations with Native American ancestry have been largely underrepresented. Here, we used a genome-wide local ancestry enrichment approach complemented with neutral simulations to identify postadmixture adaptations underwent by admixed Chileans through gene flow from Europeans into local Native Americans. The top significant hits (P = 2.4×10-7) are variants in a region on chromosome 12 comprising multiple regulatory elements. This region includes rs12821256, which regulates the expression of KITLG, a well-known gene involved in lighter hair and skin pigmentation in Europeans as well as in thermogenesis. Another variant from that region is associated with the long noncoding RNA RP11-13A1.1, which has been specifically involved in the innate immune response against infectious pathogens. Our results suggest that these genes were relevant for adaptation in Chileans following the Columbian exchange.


Assuntos
Adaptação Biológica/genética , Cromossomos Humanos Par 12 , Genoma Humano , Pigmentação/genética , Seleção Genética , Chile , Feminino , Fluxo Gênico , Haplótipos , Humanos , Hibridização Genética , Indígenas Sul-Americanos/genética , Masculino , Termogênese/genética , População Branca/genética
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA