Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 29
Filtrar
1.
PLoS Genet ; 17(1): e1008748, 2021 01.
Artigo em Inglês | MEDLINE | ID: mdl-33493157

RESUMO

The rate at which plants grow is a major functional trait in plant ecology. However, little is known about its evolution in natural populations. Here, we investigate evolutionary and environmental factors shaping variation in the growth rate of Arabidopsis thaliana. We used plant diameter as a proxy to monitor plant growth over time in environments that mimicked latitudinal differences in the intensity of natural light radiation, across a set of 278 genotypes sampled within four broad regions, including an outgroup set of genotypes from China. A field experiment conducted under natural conditions confirmed the ecological relevance of the observed variation. All genotypes markedly expanded their rosette diameter when the light supply was decreased, demonstrating that environmental plasticity is a predominant source of variation to adapt plant size to prevailing light conditions. Yet, we detected significant levels of genetic variation both in growth rate and growth plasticity. Genome-wide association studies revealed that only 2 single nucleotide polymorphisms associate with genetic variation for growth above Bonferroni confidence levels. However, marginally associated variants were significantly enriched among genes with an annotated role in growth and stress reactions. Polygenic scores computed from marginally associated variants confirmed the polygenic basis of growth variation. For both light regimes, phenotypic divergence between the most distantly related population (China) and the various regions in Europe is smaller than the variation observed within Europe, indicating that the evolution of growth rate is likely to be constrained by stabilizing selection. We observed that Spanish genotypes, however, reach a significantly larger size than Northern European genotypes. Tests of adaptive divergence and analysis of the individual burden of deleterious mutations reveal that adaptive processes have played a more important role in shaping regional differences in rosette growth than maladaptive evolution.


Assuntos
Adaptação Fisiológica/genética , Arabidopsis/genética , Herança Multifatorial/genética , Seleção Genética , Aclimatação/genética , Arabidopsis/crescimento & desenvolvimento , China , Europa (Continente) , Variação Genética/genética , Genética Populacional , Genótipo , Fenótipo , Desenvolvimento Vegetal/genética
2.
Theor Appl Genet ; 135(9): 3211-3222, 2022 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-35931838

RESUMO

KEY MESSAGE: Transposon insertion polymorphisms can improve prediction of complex agronomic traits in rice compared to using SNPs only, especially when accessions to be predicted are less related to the training set. Transposon insertion polymorphisms (TIPs) are significant sources of genetic variation. Previous work has shown that TIPs can improve detection of causative loci on agronomic traits in rice. Here, we quantify the fraction of variance explained by single nucleotide polymorphisms (SNPs) compared to TIPs, and we explore whether TIPs can improve prediction of traits when compared to using only SNPs. We used eleven traits of agronomic relevance from by five different rice population groups (Aus, Indica, Aromatic, Japonica, and Admixed), 738 accessions in total. We assess prediction by applying data split validation in two scenarios. In the within-population scenario, we predicted performance of improved Indica varieties using the rest of Indica accessions. In the across population scenario, we predicted all Aromatic and Admixed accessions using the rest of populations. In each scenario, Bayes C and a Bayesian reproducible kernel Hilbert space regression were compared. We find that TIPs can explain an important fraction of total genetic variance and that they also improve genomic prediction. In the across population prediction scenario, TIPs outperformed SNPs in nine out of the eleven traits analyzed. In some traits like leaf senescence or grain width, using TIPs increased predictive correlation by 30-50%. Our results evidence, for the first time, that TIPs genotyping can improve prediction on complex agronomic traits in rice, especially when accessions to be predicted are less related to training accessions.


Assuntos
Oryza , Teorema de Bayes , Elementos de DNA Transponíveis , Oryza/genética , Fenótipo , Polimorfismo de Nucleotídeo Único
3.
Mol Biol Evol ; 34(12): 3299-3302, 2017 Dec 01.
Artigo em Inglês | MEDLINE | ID: mdl-29029172

RESUMO

We present version 6 of the DNA Sequence Polymorphism (DnaSP) software, a new version of the popular tool for performing exhaustive population genetic analyses on multiple sequence alignments. This major upgrade incorporates novel functionalities to analyze large data sets, such as those generated by high-throughput sequencing technologies. Among other features, DnaSP 6 implements: 1) modules for reading and analyzing data from genomic partitioning methods, such as RADseq or hybrid enrichment approaches, 2) faster methods scalable for high-throughput sequencing data, and 3) summary statistics for the analysis of multi-locus population genetics data. Furthermore, DnaSP 6 includes novel modules to perform single- and multi-locus coalescent simulations under a wide range of demographic scenarios. The DnaSP 6 program, with extensive documentation, is freely available at http://www.ub.edu/dnasp.


Assuntos
Polimorfismo Genético/genética , Alinhamento de Sequência/métodos , Análise de Sequência de DNA/métodos , Algoritmos , Sequência de Bases , Genética Populacional , Genoma , Genômica , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Software , Interface Usuário-Computador
4.
Theor Popul Biol ; 123: 70-79, 2018 09.
Artigo em Inglês | MEDLINE | ID: mdl-29964061

RESUMO

We introduce the conditional Site Frequency Spectrum (SFS) for a genomic region linked to a focal mutation of known frequency. An exact expression for its expected value is provided for the neutral model without recombination. Its relation with the expected SFS for two sites, 2-SFS, is discussed. These spectra derive from the coalescent approach of Fu (1995) for finite samples, which is reviewed. Remarkably simple expressions are obtained for the linked SFS of a large population, which are also solutions of the multi-allelic Kolmogorov equations. These formulae are the immediate extensions of the well known single site θ∕f neutral SFS. Besides the general interest in these spectra, they relate to relevant biological cases, such as structural variants and introgressions. As an application, a recipe to adapt Tajima's D and other SFS-based neutrality tests to a non-recombining region containing a neutral marker is presented.


Assuntos
Genética Populacional/métodos , Modelos Genéticos , Taxa de Mutação , Evolução Molecular , Desequilíbrio de Ligação , Seleção Genética
5.
Heredity (Edinb) ; 120(1): 63-76, 2018 01.
Artigo em Inglês | MEDLINE | ID: mdl-29234173

RESUMO

Pigs (Sus scrofa) originated in Southeast Asia and expanded to Europe and North Africa approximately 1 MYA. Analyses of porcine Y-chromosome variation have shown the existence of two main haplogroups that are highly divergent, a result that is consistent with previous mitochondrial and autosomal data showing that the Asian and non-Asian pig populations remained geographically isolated until recently. Paradoxically, one of these Y-chromosome haplogroups is extensively shared by pigs and wild boars from Asia and Europe, an observation that is difficult to reconcile with a scenario of prolonged geographic isolation. To shed light on this issue, we genotyped 33 Y-linked SNPs and one indel in a worldwide sample of pigs and wild boars and sequenced a total of 9903 nucleotide sites from seven loci distributed along the Y-chromosome. Notably, the nucleotide diversity per site at the Y-linked loci (0.0015 in Asian pigs) displayed the same order of magnitude as that described for autosomal loci (~0.0023), a finding compatible with a process of sustained and intense isolation. We performed an approximate Bayesian computation analysis focused on the paternal diversity of wild boars and local pig breeds in which we compared three demographic models: two isolation models (I models) differing in the time of isolation and a model of isolation with recent unidirectional migration (IM model). Our results suggest that the most likely explanation for the extensive sharing of one Y-chromosome haplogroup between non-Asian and Asian populations is a recent and unidirectional (non-Asian > Asian) paternal migration event.


Assuntos
Fluxo Gênico , Variação Genética , Sus scrofa/genética , Cromossomo Y/genética , Migração Animal , Animais , Ásia , Teorema de Bayes , Europa (Continente) , Genética Populacional , Haplótipos , Masculino , Filogenia , Dinâmica Populacional
6.
Mol Biol Evol ; 32(10): 2760-74, 2015 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-26174143

RESUMO

The availability of extensive databases of crop genome sequences should allow analysis of crop variability at an unprecedented scale, which should have an important impact in plant breeding. However, up to now the analysis of genetic variability at the whole-genome scale has been mainly restricted to single nucleotide polymorphisms (SNPs). This is a strong limitation as structural variation (SV) and transposon insertion polymorphisms are frequent in plant species and have had an important mutational role in crop domestication and breeding. Here, we present the first comprehensive analysis of melon genetic diversity, which includes a detailed analysis of SNPs, SV, and transposon insertion polymorphisms. The variability found among seven melon varieties representing the species diversity and including wild accessions and highly breed lines, is relatively high due in part to the marked divergence of some lineages. The diversity is distributed nonuniformly across the genome, being lower at the extremes of the chromosomes and higher in the pericentromeric regions, which is compatible with the effect of purifying selection and recombination forces over functional regions. Additionally, this variability is greatly reduced among elite varieties, probably due to selection during breeding. We have found some chromosomal regions showing a high differentiation of the elite varieties versus the rest, which could be considered as strongly selected candidate regions. Our data also suggest that transposons and SV may be at the origin of an important fraction of the variability in melon, which highlights the importance of analyzing all types of genetic variability to understand crop genome evolution.


Assuntos
Cucurbitaceae/genética , Elementos de DNA Transponíveis/genética , Evolução Molecular , Genoma de Planta , Mutagênese Insercional/genética , Polimorfismo de Nucleotídeo Único/genética , Cucumis sativus/genética , Deleção de Genes , Loci Gênicos , Nucleotídeos/genética , Filogenia , Seleção Genética
7.
BMC Genomics ; 16: 4, 2015 Jan 22.
Artigo em Inglês | MEDLINE | ID: mdl-25612459

RESUMO

BACKGROUND: The genome of the melon (Cucumis melo L.) double-haploid line DHL92 was recently sequenced, with 87.5 and 80.8% of the scaffold assembly anchored and oriented to the 12 linkage groups, respectively. However, insufficient marker coverage and a lack of recombination left several large, gene rich scaffolds unanchored, and some anchored scaffolds unoriented. To improve the anchoring and orientation of the melon genome assembly, we used resequencing data between the parental lines of DHL92 to develop a new set of SNP markers from unanchored scaffolds. RESULTS: A high-resolution genetic map composed of 580 SNPs was used to anchor 354.8 Mb of sequence, contained in 141 scaffolds (average size 2.5 Mb) and corresponding to 98.2% of the scaffold assembly, to the 12 melon chromosomes. Over 325.4 Mb (90%) of the assembly was oriented. The genetic map revealed regions of segregation distortion favoring SC alleles as well as recombination suppression regions coinciding with putative centromere, 45S, and 5S rDNA sites. New chromosome-scale pseudomolecules were created by incorporating to the previous v3.5 version an additional 38.3 Mb of anchored sequence representing 1,837 predicted genes contained in 55 scaffolds. Using fluorescent in situ hybridization (FISH) with BACs that produced chromosome-specific signals, melon chromosomes that correspond to the twelve linkage groups were identified, and a standardized karyotype of melon inbred line T111 was developed. CONCLUSIONS: By utilizing resequencing data and targeted SNP selection combined with a large F2 mapping population, we significantly improved the quantity of anchored and oriented melon scaffold genome assembly. Using genome information combined with FISH mapping provided the first cytogenetic map of an inodorus melon type. With these results it was possible to make inferences on melon chromosome structure by relating zones of recombination suppression to centromeres and 45S and 5S heterochromatic regions. This study represents the first steps towards the integration of the high-resolution genetic and cytogenetic maps with the genomic sequence in melon that will provide more information on genome organization and allow for the improvement of the melon genome draft sequence.


Assuntos
Cucumis melo/genética , Genoma de Planta , Polimorfismo de Nucleotídeo Único , Alelos , Mapeamento Cromossômico , Ligação Genética , Hibridização in Situ Fluorescente , Cariotipagem , RNA Ribossômico/genética , RNA Ribossômico 5S/genética , Recombinação Genética
8.
Mol Biol Evol ; 31(7): 1929-36, 2014 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-24739305

RESUMO

Although many computer programs can perform population genetics calculations, they are typically limited in the analyses and data input formats they offer; few applications can process the large data sets produced by whole-genome resequencing projects. Furthermore, there is no coherent framework for the easy integration of new statistics into existing pipelines, hindering the development and application of new population genetics and genomics approaches. Here, we present PopGenome, a population genomics package for the R software environment (a de facto standard for statistical analyses). PopGenome can efficiently process genome-scale data as well as large sets of individual loci. It reads DNA alignments and single-nucleotide polymorphism (SNP) data sets in most common formats, including those used by the HapMap, 1000 human genomes, and 1001 Arabidopsis genomes projects. PopGenome also reads associated annotation files in GFF format, enabling users to easily define regions or classify SNPs based on their annotation; all analyses can also be applied to sliding windows. PopGenome offers a wide range of diverse population genetics analyses, including neutrality tests as well as statistics for population differentiation, linkage disequilibrium, and recombination. PopGenome is linked to Hudson's MS and Ewing's MSMS programs to assess statistical significance based on coalescent simulations. PopGenome's integration in R facilitates effortless and reproducible downstream analyses as well as the production of publication-quality graphics. Developers can easily incorporate new analyses methods into the PopGenome framework. PopGenome and R are freely available from CRAN (http://cran.r-project.org/) for all major operating systems under the GNU General Public License.


Assuntos
Metagenômica/métodos , Software , Arabidopsis/genética , Variação Genética , Genoma Humano , Genoma de Planta , Humanos , Polimorfismo de Nucleotídeo Único , Navegador
9.
Theor Popul Biol ; 100C: 79-87, 2015 03.
Artigo em Inglês | MEDLINE | ID: mdl-25595553

RESUMO

Several variations of the Watterson estimator of variability for Next Generation Sequencing (NGS) data have been proposed in the literature. We present a unified framework for generalized Watterson estimators based on Maximum Composite Likelihood, which encompasses most of the existing estimators. We propose this class of unbiased estimators as generalized Watterson estimators for a large class of NGS data, including pools and trios. We also discuss the relation with the estimators proposed in the literature and show that they admit two equivalent but seemingly different forms, deriving a set of combinatorial identities as a byproduct. Finally, we give a detailed treatment of Watterson estimators for single or multiple autopolyploid individuals.

10.
BMC Genomics ; 14: 148, 2013 Mar 05.
Artigo em Inglês | MEDLINE | ID: mdl-23497037

RESUMO

BACKGROUND: In contrast to international pig breeds, the Iberian breed has not been admixed with Asian germplasm. This makes it an important model to study both domestication and relevance of Asian genes in the pig. Besides, Iberian pigs exhibit high meat quality as well as appetite and propensity to obesity. Here we provide a genome wide analysis of nucleotide and structural diversity in a reduced representation library from a pool (n=9 sows) and shotgun genomic sequence from a single sow of the highly inbred Guadyerbas strain. In the pool, we applied newly developed tools to account for the peculiarities of these data. RESULTS: A total of 254,106 SNPs in the pool (79.6 Mb covered) and 643,783 in the Guadyerbas sow (1.47 Gb covered) were called. The nucleotide diversity (1.31x10-3 per bp in autosomes) is very similar to that reported in wild boar. A much lower than expected diversity in the X chromosome was confirmed (1.79x10-4 per bp in the individual and 5.83x10-4 per bp in the pool). A strong (0.70) correlation between recombination and variability was observed, but not with gene density or GC content. Multicopy regions affected about 4% of annotated pig genes in their entirety, and 2% of the genes partially. Genes within the lowest variability windows comprised interferon genes and, in chromosome X, genes involved in behavior like HTR2C or MCEP2. A modified Hudson-Kreitman-Aguadé test for pools also indicated an accelerated evolution in genes involved in behavior, as well as in spermatogenesis and in lipid metabolism. CONCLUSIONS: This work illustrates the strength of current sequencing technologies to picture a comprehensive landscape of variability in livestock species, and to pinpoint regions containing genes potentially under selection. Among those genes, we report genes involved in behavior, including feeding behavior, and lipid metabolism. The pig X chromosome is an outlier in terms of nucleotide diversity, which suggests selective constraints. Our data further confirm the importance of structural variation in the species, including Iberian pigs, and allowed us to identify new paralogs for known gene families.


Assuntos
Animais Endogâmicos/genética , Mapeamento Cromossômico , Polimorfismo de Nucleotídeo Único/genética , Suínos/genética , Animais , Cruzamento , Variação Genética , Nucleotídeos/genética
11.
Mol Ecol ; 22(22): 5561-76, 2013 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-24102736

RESUMO

Next generation sequencing of pooled samples is an effective approach for studies of variability and differentiation in populations. In this paper we provide a comprehensive set of estimators of the most common statistics in population genetics based on the frequency spectrum, namely the Watterson estimator θW, nucleotide pairwise diversity Π, Tajima's D, Fu and Li's D and F, Fay and Wu's H, McDonald-Kreitman and HKA tests and FST, corrected for sequencing errors and ascertainment bias. In a simulation study, we show that pool and individual θ estimates are highly correlated and discuss how the performance of the statistics vary with read depth and sample size in different evolutionary scenarios. As an application, we reanalyse sequences from Drosophila mauritiana and from an evolution experiment in Drosophila melanogaster. These methods are useful for population genetic projects with limited budget, study of communities of individuals that are hard to isolate, or autopolyploid species.


Assuntos
Genética Populacional/métodos , Genômica/métodos , Modelos Genéticos , Animais , Simulação por Computador , Drosophila/genética , Sequenciamento de Nucleotídeos em Larga Escala , Modelos Estatísticos , Tamanho da Amostra
12.
Genes (Basel) ; 14(9)2023 08 28.
Artigo em Inglês | MEDLINE | ID: mdl-37761854

RESUMO

One of the main necessities for population geneticists is the availability of sensitive statistical tools that enable to accept or reject the standard Wright-Fisher model of neutral evolution. A number of statistical tests have been developed to detect specific deviations from the null frequency spectrum in different directions (e.g., Tajima's D, Fu and Li's F and D tests, Fay and Wu's H). A general framework exists to generate all neutrality tests that are linear functions of the frequency spectrum. In this framework, it is possible to develop a family of optimal tests with almost maximum power against a specific alternative evolutionary scenario. In this paper we provide a thorough discussion of the structure and properties of linear and nonlinear neutrality tests. First, we present the general framework for linear tests and emphasise the importance of the property of scalability with the sample size (that is, the interpretation of the tests should not depend on the sample size), which, if missing, can lead to errors in interpreting the data. After summarising the motivation and structure of linear optimal tests, we present a more general framework for the optimisation of linear tests, leading to a new family of tunable neutrality tests. In a further generalisation, we extend the framework to nonlinear neutrality tests and we derive nonlinear optimal tests for polynomials of any degree in the frequency spectrum.


Assuntos
Evolução Biológica , Mustelidae , Animais , Deriva Genética , Motivação , Tamanho da Amostra
13.
Plants (Basel) ; 12(2)2023 Jan 05.
Artigo em Inglês | MEDLINE | ID: mdl-36678957

RESUMO

A high-density single nucleotide polymorphism (SNP) array is essential to enable faster progress in plant breeding for new cultivar development. In this regard, we have developed an Axiom 60K almond SNP array by resequencing 81 almond accessions. For the validation of the array, a set of 210 accessions were genotyped and 82.8% of the SNPs were classified in the best recommended SNPs. The rate of missing data was between 0.4% and 2.7% for the almond accessions and less than 15.5% for the few peach and wild accessions, suggesting that this array can be used for peach and interspecific peach × almond genetic studies. The values of the two SNPs linked to the RMja (nematode resistance) and SK (bitterness) genes were consistent. We also genotyped 49 hybrids from an almond F2 progeny and could build a genetic map with a set of 1159 SNPs. Error rates, less than 1%, were evaluated by comparing replicates and by detection of departures from Mendelian inheritance in the F2 progeny. This almond array is commercially available and should be a cost-effective genotyping tool useful in the search for new genes and quantitative traits loci (QTL) involved in the control of agronomic traits.

14.
Mol Ecol ; 19(16): 3364-79, 2010 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-20670364

RESUMO

A. halleri is a psuedometallophyte with a patchy distribution in Europe and is often spread by human activity. To determine the population history and whether this history is consistent with potential human effects, we surveyed nucleotide variation using 24 loci from 12 individuals in a large A. halleri population. The means of total and silent nucleotide variation (theta(W)) are within the range expected for the species. The population genetic neutrality tests Tajima's D and Wall's B had significant composite results rejecting panmixia, and Approximate Bayesian Computation analysis revealed that a subdivision model better explained the variation than the standard neutral model, refugia (or admixture), bottleneck or change of population size models. A categorical regression analysis further supports the subdivision model, and under the subdivision model, the neutrality tests are no longer significant. The best support was for two source populations, a situation consistent with the mixing of two populations possibly mediated by human activity. This scenario might limit the genetic diversity and adaptive potential of the population. The non-neutral population variation described here should be considered in bioinformatic searches for adaptation.


Assuntos
Arabidopsis/genética , Variação Genética , Genética Populacional , Teorema de Bayes , DNA de Plantas/genética , Alemanha , Haplótipos , Modelos Genéticos , Polimorfismo de Nucleotídeo Único , Análise de Sequência de DNA
15.
Animals (Basel) ; 10(12)2020 Nov 27.
Artigo em Inglês | MEDLINE | ID: mdl-33261195

RESUMO

Inbreeding and effective population size (Ne) are fundamental indicators for the management and conservation of genetic diversity in populations. Genomic inbreeding gives accurate estimates of inbreeding, and the Ne determines the rate of the loss of genetic variation. The objective of this work was to study the distribution of runs of homozygosity (ROHs) in order to estimate genomic inbreeding (FROH) and an effective population size using 38,789 Single Nucleotide Polymorphisms (SNPs) from the Illumina Bovine 50K BeadChip in 86 samples from populations of Charolais de Cuba (n = 40) cattle and to compare this information with French (n = 20) and British Charolais (n = 26) populations. In the Cuban, French, and British Charolais populations, the average estimated genomic inbreeding values using the FROH statistics were 5.7%, 3.4%, and 4%, respectively. The dispersion measured by variation coefficient was high at 43.9%, 37.0%, and 54.2%, respectively. The effective population size experienced a very similar decline during the last century in Charolais de Cuba (from 139 to 23 individuals), in French Charolais (from 142 to 12), and in British Charolais (from 145 to 14) for the ~20 last generations. However, the high variability found in the ROH indicators and FROH reveals an opportunity for maintaining the genetic diversity of this breed with an adequate mating strategy, which can be favored with the use of molecular markers. Moreover, the detected ROH were compared to previous results obtained on the detection of signatures of selection in the same breed. Some of the observed signatures were confirmed by the ROHs, emphasizing the process of adaptation to tropical climate experienced by the Charolais de Cuba population.

16.
Viruses ; 12(11)2020 11 14.
Artigo em Inglês | MEDLINE | ID: mdl-33202558

RESUMO

Newcastle disease virus (NDV) infections are well known to harbour quasispecies, due to the error-prone nature of the RNA polymerase. Quasispecies variants in the fusion cleavage site of the virus are known to significantly change its virulence. However, little is known about the genomic patterns of diversity and selection in NDV viral swarms. We analyse deep sequencing data from in vitro and in vivo NDV infections to uncover the genomic patterns of diversity and the signatures of selection within NDV swarms. Variants in viruses from in vitro samples are mostly localised in non-coding regions and 3' and 5' untranslated regions (3'UTRs or 5'UTRs), while in vivo samples contain an order of magnitude more variants. We find different patterns of genomic divergence and diversity among NDV genotypes, as well as differences in the genomic distribution of intra-host variants among in vitro and in vivo infections of the same strain. The frequency spectrum shows clear signatures of intra-host purifying selection in vivo on the matrix protein (M) coding gene and positive or diversifying selection on nucleocapsid (NP) and haemagglutinin-neuraminidase (HN). The comparison between within-host polymorphisms and phylogenetic divergence reveals complex patterns of selective pressure on the NDV genome at between- and within-host level. The M sequence is strongly constrained both between and within hosts, fusion protein (F) coding gene is under intra-host positive selection, and NP and HN show contrasting patterns: HN RNA sequence is positively selected between hosts while its protein sequence is positively selected within hosts, and NP is under intra-host positive selection at the RNA level and negative selection at the protein level.


Assuntos
Evolução Molecular , Variação Genética , Vírus da Doença de Newcastle/genética , Quase-Espécies/genética , Animais , Linhagem Celular , Embrião de Galinha/virologia , Galinhas , Genoma Viral , Genômica , Genótipo , Hemaglutininas/genética , Hemaglutininas/metabolismo , Sequenciamento de Nucleotídeos em Larga Escala , Interações Hospedeiro-Patógeno/genética , Neuraminidase/genética , Neuraminidase/metabolismo , Doença de Newcastle , Vírus da Doença de Newcastle/isolamento & purificação , Filogenia , Doenças das Aves Domésticas/virologia , RNA Viral/genética , RNA Viral/isolamento & purificação , Análise de Sequência de RNA , Proteínas Virais/genética , Proteínas Virais/metabolismo
17.
Front Genet ; 9: 480, 2018.
Artigo em Inglês | MEDLINE | ID: mdl-30405691

RESUMO

The Site Frequency Spectrum (SFS) and the heterozygosity of allelic variants are among the most important summary statistics for population genetic analysis of diploid organisms. We discuss the generalization of these statistics to populations of autopolyploid organisms in terms of the joint Site Frequency/Dosage Spectrum and its expected value for autopolyploid populations that follow the standard neutral model. Based on these results, we present estimators of nucleotide variability from High-Throughput Sequencing (HTS) data of autopolyploids and discuss potential issues related to sequencing errors and variant calling. We use these estimators to generalize Tajima's D and other SFS-based neutrality tests to HTS data from autopolyploid organisms. Finally, we discuss how these approaches fail when the number of individuals is small. In fact, in autopolyploids there are many possible deviations from the Hardy-Weinberg equilibrium, each reflected in a different shape of the individual dosage distribution. The SFS from small samples is often dominated by the shape of these deviations of the dosage distribution from its Hardy-Weinberg expectations.

18.
Genome Biol Evol ; 10(6): 1584-1595, 2018 06 01.
Artigo em Inglês | MEDLINE | ID: mdl-29901717

RESUMO

Transposable elements (TEs) are a major driver of plant genome evolution. A part from being a rich source of new genes and regulatory sequences, TEs can also affect plant genome evolution by modifying genome size and shaping chromosome structure. TEs tend to concentrate in heterochromatic pericentromeric regions and their proliferation may expand these regions. Here, we show that after the split of melon and cucumber, TEs have expanded the pericentromeric regions of melon chromosomes that, probably as a consequence, show a very low recombination frequency. In contrast, TEs have not proliferated to a high extent in cucumber, which has small TE-dense pericentromeric regions and shows a relatively constant recombination rate along chromosomes. These differences in chromosome structure also translate in differences in gene nucleotide diversity. Although gene nucleotide diversity is essentially constant along cucumber chromosomes, melon chromosomes show a bimodal pattern of genetic variability, with a gene-poor region where variability is negatively correlated with gene density. Interestingly, genes are not homogeneously distributed in melon, and the high variable low-recombining pericentromeric regions show a higher concentration of melon-specific genes whereas genes shared with cucumber and other plants are essentially found in gene-rich chromosomal arms. The results presented here suggest that melon pericentromeric regions may allow gene sequences to evolve more freely than in other chromosomal compartments which may allow new ORFs to arise and eventually be selected. These results show that TEs can drastically change the structure of chromosomes creating different chromosomal compartments imposing different constraints for gene evolution.


Assuntos
Cucurbitaceae/genética , Elementos de DNA Transponíveis/genética , Genoma de Planta/genética , Cromossomos de Plantas/genética , Evolução Molecular , Variação Genética/genética , Tamanho do Genoma/genética
19.
Evol Bioinform Online ; 13: 1176934317723884, 2017.
Artigo em Inglês | MEDLINE | ID: mdl-28894353

RESUMO

The accurate estimation of nucleotide variability using next-generation sequencing data is challenged by the high number of sequencing errors produced by new sequencing technologies, especially for nonmodel species, where reference sequences may not be available and the read depth may be low due to limited budgets. The most popular single-nucleotide polymorphism (SNP) callers are designed to obtain a high SNP recovery and low false discovery rate but are not designed to account appropriately the frequency of the variants. Instead, algorithms designed to account for the frequency of SNPs give precise results for estimating the levels and the patterns of variability. These algorithms are focused on the unbiased estimation of the variability and not on the high recovery of SNPs. Here, we implemented a fast and optimized parallel algorithm that includes the method developed by Roesti et al and Lynch, which estimates the genotype of each individual at each site, considering the possibility to call both bases from the genotype, a single one or none. This algorithm does not consider the reference and therefore is independent of biases related to the reference nucleotide specified. The pipeline starts from a BAM file converted to pileup or mpileup format and the software outputs a FASTA file. The new program not only reduces the running times but also, given the improved use of resources, it allows its usage with smaller computers and large parallel computers, expanding its benefits to a wider range of researchers. The output file can be analyzed using software for population genetics analysis, such as the R library PopGenome, the software VariScan, and the program mstatspop for analysis considering positions with missing data.

20.
Genetics ; 207(1): 229-240, 2017 09.
Artigo em Inglês | MEDLINE | ID: mdl-28679545

RESUMO

We investigate the dependence of the site frequency spectrum on the topological structure of genealogical trees. We show that basic population genetic statistics, for instance, estimators of θ or neutrality tests such as Tajima's D, can be decomposed into components of waiting times between coalescent events and of tree topology. Our results clarify the relative impact of the two components on these statistics. We provide a rigorous interpretation of positive or negative values of an important class of neutrality tests in terms of the underlying tree shape. In particular, we show that values of Tajima's D and Fay and Wu's H depend in a direct way on a peculiar measure of tree balance, which is mostly determined by the root balance of the tree. We present a new test for selection in the same class as Fay and Wu's H and discuss its interpretation and power. Finally, we determine the trees corresponding to extreme expected values of these neutrality tests and present formulas for these extreme values as a function of sample size and number of segregating sites.


Assuntos
Modelos Genéticos , Taxa de Mutação , Filogenia , Seleção Genética
SELEÇÃO DE REFERÊNCIAS
Detalhe da pesquisa