Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 2.493
Filter
1.
BMC Res Notes ; 17(1): 187, 2024 Jul 05.
Article in English | MEDLINE | ID: mdl-38970104

ABSTRACT

OBJECTIVE: This study assesses the accuracy of the IrisPlex system, a genetic eye color prediction tool for forensic analysis, in the Kazakh population. The study compares previously published genotypes of 515 Kazakh individuals from varied geographical and ethnohistorical contexts with phenotypic data on their eye color, introduced for the first time in this research. RESULTS: The IrisPlex panel's effectiveness in predicting eye color in the Kazakh population was validated. It exhibited slightly lower accuracy than in Western European populations but was higher than in Siberian populations. The sensitivity was notably high for brown-eyed individuals (0.99), but further research is needed for blue and intermediate eye colors. This study establishes IrisPlex as a useful predictive tool in the Kazakh population and provides a basis for future investigations into the genetic basis of phenotypic variations in this diverse population.


Subject(s)
Eye Color , Humans , Eye Color/genetics , Kazakhstan , Genetic Variation/genetics , Phenotype , Genotype , Genetics, Population/methods , Asian People/genetics
2.
Mol Genet Genomics ; 299(1): 65, 2024 Jul 07.
Article in English | MEDLINE | ID: mdl-38972030

ABSTRACT

BACKGROUND: A large number of challenging medically relevant genes (CMRGs) are situated in complex or highly repetitive regions of the human genome, hindering comprehensive characterization of genetic variants using next-generation sequencing technologies. In this study, we employed long-read sequencing technology, extensively utilized in studying complex genomic regions, to characterize genetic alterations, including short variants (single nucleotide variants and short insertions and deletions) and copy number variations, in 370 CMRGs across 41 individuals from 19 global populations. RESULTS: Our analysis revealed high levels of genetic variants in CMRGs, with 68.73% exhibiting copy number variations and 65.20% containing short variants that may disrupt protein function across individuals. Such variants can influence pharmacogenomics, genetic disease susceptibility, and other clinical outcomes. We observed significant differences in CMRG variation across populations, with individuals of African ancestry harboring the highest number of copy number variants and short variants compared to samples from other continents. Notably, 15.79% to 33.96% of short variants were exclusively detectable through long-read sequencing. While the T2T-CHM13 reference genome significantly improved the assembly of CMRG regions, thereby facilitating variant detection in these regions, some regions still lacked resolution. CONCLUSION: Our results provide an important reference for future clinical and pharmacogenetic studies, highlighting the need for a comprehensive representation of global genetic diversity in the reference genome and improved variant calling techniques to fully resolve medically relevant genes.


Subject(s)
DNA Copy Number Variations , Genome, Human , High-Throughput Nucleotide Sequencing , Humans , DNA Copy Number Variations/genetics , High-Throughput Nucleotide Sequencing/methods , Genome, Human/genetics , Polymorphism, Single Nucleotide/genetics , Genetic Variation/genetics , Genetic Predisposition to Disease , Genetics, Population/methods , INDEL Mutation
3.
J Math Biol ; 89(2): 23, 2024 Jul 02.
Article in English | MEDLINE | ID: mdl-38954016

ABSTRACT

The embedding problem of Markov matrices in Markov semigroups is a classic problem that regained a lot of impetus and activities through recent needs in phylogeny and population genetics. Here, we give an account for dimensions d ⩽ 4 , including a complete and simplified treatment of the case d = 3 , and derive the results in a systematic fashion, with an eye on the potential applications. Further, we reconsider the setup of the corresponding problem for time-inhomogeneous Markov chains, which is needed for real-world applications because transition rates need not be constant over time. Additional cases of this more general embedding occur for any d ⩾ 3 . We review the known case of d = 3 and describe the setting for future work on d = 4 .


Subject(s)
Markov Chains , Mathematical Concepts , Phylogeny , Genetics, Population/statistics & numerical data , Genetics, Population/methods , Models, Genetic , Humans
4.
Mol Biol Evol ; 41(7)2024 Jul 03.
Article in English | MEDLINE | ID: mdl-38958167

ABSTRACT

Admixture between populations and species is common in nature. Since the influx of new genetic material might be either facilitated or hindered by selection, variation in mixture proportions along the genome is expected in organisms undergoing recombination. Various graph-based models have been developed to better understand these evolutionary dynamics of population splits and mixtures. However, current models assume a single mixture rate for the entire genome and do not explicitly account for linkage. Here, we introduce TreeSwirl, a novel method for inferring branch lengths and locus-specific mixture proportions by using genome-wide allele frequency data, assuming that the admixture graph is known or has been inferred. TreeSwirl builds upon TreeMix that uses Gaussian processes to estimate the presence of gene flow between diverged populations. However, in contrast to TreeMix, our model infers locus-specific mixture proportions employing a hidden Markov model that accounts for linkage. Through simulated data, we demonstrate that TreeSwirl can accurately estimate locus-specific mixture proportions and handle complex demographic scenarios. It also outperforms related D- and f-statistics in terms of accuracy and sensitivity to detect introgressed loci.


Subject(s)
Gene Frequency , Models, Genetic , Genetics, Population/methods , Markov Chains , Gene Flow , Genome , Computer Simulation , Genetic Linkage
5.
Proc Natl Acad Sci U S A ; 121(28): e2307107121, 2024 Jul 09.
Article in English | MEDLINE | ID: mdl-38959040

ABSTRACT

Despite evolutionary biology's obsession with natural selection, few studies have evaluated multigenerational series of patterns of selection on a genome-wide scale in natural populations. Here, we report on a 10-y population-genomic survey of the microcrustacean Daphnia pulex. The genome sequences of [Formula: see text]800 isolates provide insights into patterns of selection that cannot be obtained from long-term molecular-evolution studies, including the following: the pervasiveness of near quasi-neutrality across the genome (mean net selection coefficients near zero, but with significant temporal variance about the mean, and little evidence of positive covariance of selection across time intervals); the preponderance of weak positive selection operating on minor alleles; and a genome-wide distribution of numerous small linkage islands of observable selection influencing levels of nucleotide diversity. These results suggest that interannual fluctuating selection is a major determinant of standing levels of variation in natural populations, challenge the conventional paradigm for interpreting patterns of nucleotide diversity and divergence, and motivate the need for the further development of theoretical expressions for the interpretation of population-genomic data.


Subject(s)
Daphnia , Genome , Selection, Genetic , Animals , Daphnia/genetics , Genome/genetics , Evolution, Molecular , Genetic Variation , Genetics, Population/methods
6.
PLoS Genet ; 20(7): e1011092, 2024 Jul.
Article in English | MEDLINE | ID: mdl-38959269

ABSTRACT

Haplotype estimation, or phasing, has gained significant traction in large-scale projects due to its valuable contributions to population genetics, variant analysis, and the creation of reference panels for imputation and phasing of new samples. To scale with the growing number of samples, haplotype estimation methods designed for population scale rely on highly optimized statistical models to phase genotype data, and usually ignore read-level information. Statistical methods excel in resolving common variants, however, they still struggle at rare variants due to the lack of statistical information. In this study we introduce SAPPHIRE, a new method that leverages whole-genome sequencing data to enhance the precision of haplotype calls produced by statistical phasing. SAPPHIRE achieves this by refining haplotype estimates through the realignment of sequencing reads, particularly targeting low-confidence phase calls. Our findings demonstrate that SAPPHIRE significantly enhances the accuracy of haplotypes obtained from state of the art methods and also provides the subset of phase calls that are validated by sequencing reads. Finally, we show that our method scales to large data sets by its successful application to the extensive 3.6 Petabytes of sequencing data of the last UK Biobank 200,031 sample release.


Subject(s)
Genetics, Population , Haplotypes , Whole Genome Sequencing , Whole Genome Sequencing/methods , Humans , Genetics, Population/methods , Genome, Human , Polymorphism, Single Nucleotide/genetics , Genome-Wide Association Study/methods , Algorithms
7.
BMC Res Notes ; 17(1): 173, 2024 Jun 20.
Article in English | MEDLINE | ID: mdl-38902826

ABSTRACT

OBJECTIVE: The objective of this study was to examine the genetic diversity within and between farmed populations of Onychostoma macrolepis, and to establish a foundation for enhancing the genetic resources of breeding groups through the introduction of new individuals and crossbreeding. A total of 49 individuals were subjected to sequencing using Specific-Locus Amplified Fragment Sequencing (SLAF-seq), one of the restriction site-associated DNA sequencing technologies. The single nucleotide polymorphisms(SNPs)were identified to conduct the analyzation of phylogeny population structure, principal component and genetic diversity. RESULTS: A total of 853,067 SNPs were identified. The results of the phylogenetic analysis revealed that each sample was genetically clustered into three distinct groups: ZhenPing (ZP), LanGao parents (LG), and their progeny population (LG-F1). Each population was observed to be clustered together. Analysis of population genetic diversity revealed that the observed heterozygosity (Ho) ranged from 0.200 to 0.230, the expected heterozygosity (He) ranged from 0.280 to 0.282, and the polymorphic information content (PIC) ranged from 0.228 to 0.230. These results indicate that the genetic diversity of the population is low and the signs of long-term interbreeding are obvious, but there are differences between the populations, and the genetic diversity of the population can be improved by hybridization in different regions.


Subject(s)
Genetic Variation , Phylogeny , Polymorphism, Single Nucleotide , Animals , Polymorphism, Single Nucleotide/genetics , Sequence Analysis, DNA/methods , Genetics, Population/methods , Principal Component Analysis
8.
Brief Bioinform ; 25(4)2024 May 23.
Article in English | MEDLINE | ID: mdl-38856170

ABSTRACT

In the application of genomic prediction, a situation often faced is that there are multiple populations in which genomic prediction (GP) need to be conducted. A common way to handle the multi-population GP is simply to combine the multiple populations into a single population. However, since these populations may be subject to different environments, there may exist genotype-environment interactions which may affect the accuracy of genomic prediction. In this study, we demonstrated that multi-trait genomic best linear unbiased prediction (MTGBLUP) can be used for multi-population genomic prediction, whereby the performances of a trait in different populations are regarded as different traits, and thus multi-population prediction is regarded as multi-trait prediction by employing the between-population genetic correlation. Using real datasets, we proved that MTGBLUP outperformed the conventional multi-population model that simply combines different populations together. We further proposed that MTGBLUP can be improved by partitioning the global between-population genetic correlation into local genetic correlations (LGC). We suggested two LGC models, LGC-model-1 and LGC-model-2, which partition the genome into regions with and without significant LGC (LGC-model-1) or regions with and without strong LGC (LGC-model-2). In analysis of real datasets, we demonstrated that the LGC models could increase universally the prediction accuracy and the relative improvement over MTGBLUP reached up to 163.86% (25.64% on average).


Subject(s)
Genomics , Models, Genetic , Genomics/methods , Genetics, Population/methods , Quantitative Trait Loci , Humans , Algorithms , Genotype
9.
HLA ; 103(6): e15543, 2024 Jun.
Article in English | MEDLINE | ID: mdl-38837862

ABSTRACT

The MHC class I region contains crucial genes for the innate and adaptive immune response, playing a key role in susceptibility to many autoimmune and infectious diseases. Genome-wide association studies have identified numerous disease-associated SNPs within this region. However, these associations do not fully capture the immune-biological relevance of specific HLA alleles. HLA imputation techniques may leverage available SNP arrays by predicting allele genotypes based on the linkage disequilibrium between SNPs and specific HLA alleles. Successful imputation requires diverse and large reference panels, especially for admixed populations. This study employed a bioinformatics approach to call SNPs and HLA alleles in multi-ethnic samples from the 1000 genomes (1KG) dataset and admixed individuals from Brazil (SABE), utilising 30X whole-genome sequencing data. Using HIBAG, we created three reference panels: 1KG (n = 2504), SABE (n = 1171), and the full model (n = 3675) encompassing all samples. In extensive cross-validation of these reference panels, the multi-ethnic 1KG reference exhibited overall superior performance than the reference with only Brazilian samples. However, the best results were achieved with the full model. Additionally, we expanded the scope of imputation by developing reference panels for non-classical, MICA, MICB and HLA-H genes, previously unavailable for multi-ethnic populations. Validation in an independent Brazilian dataset showcased the superiority of our reference panels over the Michigan Imputation Server, particularly in predicting HLA-B alleles among Brazilians. Our investigations underscored the need to enhance or adapt reference panels to encompass the target population's genetic diversity, emphasising the significance of multiethnic references for accurate imputation across different populations.


Subject(s)
Alleles , Ethnicity , Gene Frequency , Polymorphism, Single Nucleotide , Humans , Brazil , Ethnicity/genetics , HLA Antigens/genetics , Linkage Disequilibrium , Genome-Wide Association Study/methods , Genotype , Genetics, Population/methods , Histocompatibility Antigens Class I/genetics , Computational Biology/methods
10.
Mol Biol Evol ; 41(6)2024 Jun 01.
Article in English | MEDLINE | ID: mdl-38839045

ABSTRACT

Human populations harbor a high concentration of deleterious genetic variants. Here, we tested the hypothesis that non-random mating practices affect the distribution of these variants, through exposure in the homozygous state, leading to their purging from the population gene pool. To do so, we produced whole-genome sequencing data for two pairs of Asian populations exhibiting different alliance rules and rates of inbreeding, but with similar effective population sizes. The results show that populations with higher rates of inbred matings do not purge deleterious variants more efficiently. Purging therefore has a low efficiency in human populations, and different mating practices lead to a similar mutational load.


Subject(s)
Asian People , Humans , Asian People/genetics , Genetics, Population/methods , Genetic Variation , Inbreeding
11.
Genes (Basel) ; 15(6)2024 Jun 06.
Article in English | MEDLINE | ID: mdl-38927684

ABSTRACT

Due to its turbulent demographic history, marked by extensive settlement and gene flow from diverse regions of Eurasia, Southeastern Europe (SEE) has consistently served as a genetic crossroads between East and West and a junction for the migrations that reshaped Europe's population. SEE, including modern Croatian territory, was a crucial passage from the Near East and even more distant regions and human populations in this region, as almost any other European population represents a remarkable genetic mixture. Modern humans have continuously occupied this region since the Upper Paleolithic era, and different (pre)historical events have left a distinctive genetic signature on the historical narrative of this region. Our views of its history have been mostly renewed in the last few decades by extraordinary data obtained from Y-chromosome studies. In recent times, the international research community, bringing together geneticists and archaeologists, has steadily released a growing number of ancient genomes from this region, shedding more light on its complex past population dynamics and shaping the genetic pool in Croatia and this part of Europe.


Subject(s)
Chromosomes, Human, Y , Genetics, Population , Humans , Chromosomes, Human, Y/genetics , Croatia , Genetics, Population/methods , Gene Pool , DNA, Ancient/analysis , Gene Flow , Human Migration , Male
12.
Genes (Basel) ; 15(6)2024 Jun 20.
Article in English | MEDLINE | ID: mdl-38927752

ABSTRACT

The Mongolian gerbil is a distinctive experimental animal in China, as its genetic qualities possess significant value in the field of medical biology research. Here, we aimed to establish an economical and efficient panel for genetic quality detection in Mongolian gerbils using single-nucleotide polymorphism (SNP) markers. To search for SNPs, we conducted whole-genome sequencing (WGS) in 40 Mongolian gerbils from outbred populations. Reliable screening criteria were established to preliminarily select SNPs with a wide genome distribution and high levels of polymorphism. Subsequently, a multiple-target regional capture detection system based on second-generation sequencing was developed for SNP genotyping. Based on the results of WGS, 219 SNPs were preliminarily selected, and they were established and optimized in a multiple-amplification system that included 206 SNP loci by genotyping three outbred populations. PopGen.32 analysis revealed that the average effective allele number, Shannon index, observed heterozygosity, expected heterozygosity, average heterozygosity, polymorphism information content, and other population genetic parameters of the Capital Medical University (CMU) gerbils were the highest, followed by those of Zhejiang gerbils and Dalian gerbils. Through scientific screening and optimization, we successfully established a novel, robust, and cost-effective genetic detection system for Mongolian gerbils by utilizing SNP markers for the first time.


Subject(s)
Gerbillinae , Polymorphism, Single Nucleotide , Animals , Gerbillinae/genetics , Polymorphism, Single Nucleotide/genetics , Whole Genome Sequencing/methods , Genotyping Techniques/methods , Genetics, Population/methods , China , Genotype , High-Throughput Nucleotide Sequencing/methods
13.
Bioinformatics ; 40(6)2024 Jun 03.
Article in English | MEDLINE | ID: mdl-38885407

ABSTRACT

MOTIVATION: Single nucleotide polymorphism (SNP) markers are increasingly popular for population genomics and inferring ancestry for individuals of unknown origin. Because large SNP datasets are impractical for rapid and routine analysis, diagnostics rely on panels of highly informative markers. Strategies exist for selecting these markers, however, resources for efficiently evaluating their performance are limited for non-model systems. RESULTS: snpAIMeR is a user-friendly R package that evaluates the efficacy of genomic markers for the cluster assignment of unknown individuals. It is intended to help minimize panel size and genotyping effort by determining the informativeness of candidate diagnostic markers. Provided genotype data from individuals of known origin, it uses leave-one-out cross-validation to determine population assignment rates for individual markers and marker combinations. AVAILABILITY AND IMPLEMENTATION: snpAIMeR is available on CRAN (https://CRAN.R-project.org/package=snpAIMeR).


Subject(s)
Polymorphism, Single Nucleotide , Software , Humans , Genetic Markers , Genetics, Population/methods , Genomics/methods , Genotype
14.
BMC Genom Data ; 25(1): 60, 2024 Jun 14.
Article in English | MEDLINE | ID: mdl-38877416

ABSTRACT

BACKGROUND: Forest geneticists typically use provenances to account for population differences in their improvement schemes; however, the historical records of the imported materials might not be very precise or well-aligned with the genetic clusters derived from advanced molecular techniques. The main objective of this study was to assess the impact of marker-based population structure on genetic parameter estimates related to growth and wood properties and their trade-offs in Norway spruce, by either incorporating it as a fixed effect (model-A) or excluding it entirely from the analysis (model-B). RESULTS: Our results indicate that models incorporating population structure significantly reduce estimates of additive genetic variance, resulting in substantial reduction of narrow-sense heritability. However, these models considerably improve prediction accuracies. This was particularly significant for growth and solid-wood properties, which showed to have the highest population genetic differentiation (QST) among the studied traits. Additionally, although the pattern of correlations remained similar across the models, their magnitude was slightly lower for models that included population structure as a fixed effect. This suggests that selection, consistently performed within populations, might be less affected by unfavourable genetic correlations compared to mass selection conducted without pedigree restrictions. CONCLUSION: We conclude that the results of models properly accounting for population structure are more accurate and less biased compared to those neglecting this effect. This might have practical implications for breeders and forest managers where, decisions based on imprecise selections can pose a high risk to economic efficiency.


Subject(s)
Picea , Wood , Picea/genetics , Picea/growth & development , Wood/genetics , Genetic Markers/genetics , Models, Genetic , Genetics, Population/methods , Genetic Variation/genetics
15.
Mol Biol Evol ; 41(7)2024 Jul 03.
Article in English | MEDLINE | ID: mdl-38874402

ABSTRACT

Inferring the demographic history of populations provides fundamental insights into species dynamics and is essential for developing a null model to accurately study selective processes. However, background selection and selective sweeps can produce genomic signatures at linked sites that mimic or mask signals associated with historical population size change. While the theoretical biases introduced by the linked effects of selection have been well established, it is unclear whether ancestral recombination graph (ARG)-based approaches to demographic inference in typical empirical analyses are susceptible to misinference due to these effects. To address this, we developed highly realistic forward simulations of human and Drosophila melanogaster populations, including empirically estimated variability of gene density, mutation rates, recombination rates, purifying, and positive selection, across different historical demographic scenarios, to broadly assess the impact of selection on demographic inference using a genealogy-based approach. Our results indicate that the linked effects of selection minimally impact demographic inference for human populations, although it could cause misinference in populations with similar genome architecture and population parameters experiencing more frequent recurrent sweeps. We found that accurate demographic inference of D. melanogaster populations by ARG-based methods is compromised by the presence of pervasive background selection alone, leading to spurious inferences of recent population expansion, which may be further worsened by recurrent sweeps, depending on the proportion and strength of beneficial mutations. Caution and additional testing with species-specific simulations are needed when inferring population history with non-human populations using ARG-based approaches to avoid misinference due to the linked effects of selection.


Subject(s)
Drosophila melanogaster , Models, Genetic , Population Density , Selection, Genetic , Animals , Drosophila melanogaster/genetics , Humans , Recombination, Genetic , Genetics, Population/methods , Computer Simulation , Mutation Rate
16.
Sci Rep ; 14(1): 13954, 2024 06 17.
Article in English | MEDLINE | ID: mdl-38886537

ABSTRACT

China, is characterized by its remarkable ethnical diversity, which necessitates whole genome variation data from multiple populations as crucial tools for advancing population genetics and precision medical research. However, there has been a scarcity of research concentrating on the whole genome of ethnic minority groups. To fill this gap, we developed the Guizhou Multi-ethnic Genome Database (GMGD). It comprises whole genome sequencing data from 476 healthy unrelated individuals spanning 11 ethnic minorities groups in Guizhou Province, Southwest China, including Bouyei, Dong, Miao, Yi, Bai, Gelo, Zhuang, Tujia, Yao, Hui, and Sui. The GMGD database comprises more than 16.33 million variants in GRCh38 and 16.20 million variants in GRCh37. Among these, approximately 11.9% (1,956,322) of the variants in GRCh38 and 18.5% (3,009,431) of the variants in GRCh37 are entirely new and do not exist in the dbSNP database. These novel variants shed light on the genetic diversity landscape across these populations, providing valuable insights with an average coverage of 5.5 ×. This makes GMGD the largest genome-wide database encompassing the most diverse ethnic groups to date. The GMGD interactive interface facilitates researchers with multi-dimensional mutation search methods and displays population frequency differences among global populations. Furthermore, GMGD is equipped with a genotype-imputation function, enabling enhanced capabilities for low-depth genomic research or targeted region capture studies. GMGD offers unique insights into the genomic variation landscape of different ethnic groups, which are freely accessible at https://db.cngb.org/pop/gmgd/ .


Subject(s)
Databases, Genetic , Ethnicity , Genome, Human , Humans , Ethnicity/genetics , China/ethnology , Genetics, Population/methods , Whole Genome Sequencing/methods , Genetic Variation , Minority Groups , Polymorphism, Single Nucleotide
17.
Proc Natl Acad Sci U S A ; 121(27): e2406734121, 2024 Jul 02.
Article in English | MEDLINE | ID: mdl-38913897

ABSTRACT

The Merovingian period (5th to 8th cc AD) was a time of demographic, socioeconomic, cultural, and political realignment in Western Europe. Here, we report the whole-genome shotgun sequence data of 30 human skeletal remains from a coastal Late Merovingian site of Koksijde (675 to 750 AD), alongside 18 remains from two Early to Late Medieval sites in present-day Flanders, Belgium. We find two distinct ancestries, one shared with Early Medieval England and the Netherlands, while the other, minor component, reflecting likely continental Gaulish ancestry. Kinship analyses identified no large pedigrees characteristic to elite burials revealing instead a high modularity of distant relationships among individuals of the main ancestry group. In contrast, individuals with >90% Gaulish ancestry had no kinship links among sampled individuals. Evidence for population structure and major differences in the extent of Gaulish ancestry in the main group, including in a mother-daughter pair, suggests ongoing admixture in the community at the time of their burial. The isotopic and genetic evidence combined supports a model by which the burials, representing an established coastal nonelite community, had incorporated migrants from inland populations. The main group of burials at Koksijde shows an abundance of >5 cM long shared allelic intervals with the High Medieval site nearby, implying long-term continuity and suggesting that similarly to Britain, the Early Medieval ancestry shifts left a significant and long-lasting impact on the genetic makeup of the Flemish population. We find substantial allele frequency differences between the two ancestry groups in pigmentation and diet-associated variants, including those linked with lactase persistence, likely reflecting ancestry change rather than local adaptation.


Subject(s)
Pedigree , Humans , History, Medieval , Belgium , Burial/history , Genetics, Population/methods , Female , Male , DNA, Ancient/analysis , England , Human Migration , Archaeology , Netherlands , Genome, Human
18.
Mol Biol Evol ; 41(7)2024 Jul 03.
Article in English | MEDLINE | ID: mdl-38935572

ABSTRACT

Two important characteristics of metapopulations are extinction-(re)colonization dynamics and gene flow between subpopulations. These processes can cause strong shifts in genome-wide allele frequencies that are generally not observed in "classical" (large, stable, and panmictic) populations. Subpopulations founded by one or a few individuals, the so-called propagule model, are initially expected to show intermediate allele frequencies at polymorphic sites until natural selection and genetic drift drive allele frequencies toward a mutation-selection-drift equilibrium characterized by a negative exponential-like distribution of the site frequency spectrum. We followed changes in site frequency spectrum distribution in a natural metapopulation of the cyclically parthenogenetic pond-dwelling microcrustacean Daphnia magna using biannual pool-seq samples collected over a 5-yr period from 118 ponds occupied by subpopulations of known age. As expected under the propagule model, site frequency spectra in newly founded subpopulations trended toward intermediate allele frequencies and shifted toward right-skewed distributions as the populations aged. Immigration and subsequent hybrid vigor altered this dynamic. We show that the analysis of site frequency spectrum dynamics is a powerful approach to understand evolution in metapopulations. It allowed us to disentangle evolutionary processes occurring in a natural metapopulation, where many subpopulations evolve in parallel. Thereby, stochastic processes like founder and immigration events lead to a pattern of subpopulation divergence, while genetic drift leads to converging site frequency spectrum distributions in the persisting subpopulations. The observed processes are well explained by the propagule model and highlight that metapopulations evolve differently from classical populations.


Subject(s)
Daphnia , Gene Frequency , Genetic Drift , Selection, Genetic , Animals , Daphnia/genetics , Gene Flow , Models, Genetic , Genetics, Population/methods , Population Dynamics , Genome , Biological Evolution , Evolution, Molecular
20.
Methods Mol Biol ; 2809: 19-36, 2024.
Article in English | MEDLINE | ID: mdl-38907888

ABSTRACT

The allele frequency net database (AFND, http://www.allelefrequencies.net ) is an online web-based repository that contains information on the frequencies of immune-related genes and their corresponding alleles in worldwide human populations. At present, the website contains data from 1784 population samples in more than 14 million individuals from 129 countries on the frequency of genes from different polymorphic regions including data for the human leukocyte antigen (HLA) system. In addition, over the last four years, AFND has also incorporated genotype raw data from 85,000 individuals comprising 215 population samples from 39 countries. Moreover, more population data sets containing next generation sequencing data spanning >3 million individuals have been added. This resource has been widely used in a variety of contexts such as histocompatibility, immunology, epidemiology, pharmacogenetics, epitope prediction algorithms for population coverage in vaccine development, population genetics, among many others. In this chapter, we present an update of the most used searching mechanisms as described in a previous volume and some of the latest developments included in AFND.


Subject(s)
Databases, Genetic , Gene Frequency , Genetics, Population , Humans , Genetics, Population/methods , HLA Antigens/genetics , Alleles , Computational Biology/methods , Internet , Web Browser , Genotype , High-Throughput Nucleotide Sequencing/methods
SELECTION OF CITATIONS
SEARCH DETAIL