Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 17 de 17
Filtrar
Mais filtros

Base de dados
País/Região como assunto
Tipo de documento
Intervalo de ano de publicação
1.
Syst Biol ; 64(6): 1032-47, 2015 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-26227865

RESUMO

Single nucleotide polymorphisms (SNPs) are useful markers for phylogenetic studies owing in part to their ubiquity throughout the genome and ease of collection. Restriction site associated DNA sequencing (RADseq) methods are becoming increasingly popular for SNP data collection, but an assessment of the best practises for using these data in phylogenetics is lacking. We use computer simulations, and new double digest RADseq (ddRADseq) data for the lizard family Phrynosomatidae, to investigate the accuracy of RAD loci for phylogenetic inference. We compare the two primary ways RAD loci are used during phylogenetic analysis, including the analysis of full sequences (i.e., SNPs together with invariant sites), or the analysis of SNPs on their own after excluding invariant sites. We find that using full sequences rather than just SNPs is preferable from the perspectives of branch length and topological accuracy, but not of computational time. We introduce two new acquisition bias corrections for dealing with alignments composed exclusively of SNPs, a conditional likelihood method and a reconstituted DNA approach. The conditional likelihood method conditions on the presence of variable characters only (the number of invariant sites that are unsampled but known to exist is not considered), while the reconstituted DNA approach requires the user to specify the exact number of unsampled invariant sites prior to the analysis. Under simulation, branch length biases increase with the amount of missing data for both acquisition bias correction methods, but branch length accuracy is much improved in the reconstituted DNA approach compared to the conditional likelihood approach. Phylogenetic analyses of the empirical data using concatenation or a coalescent-based species tree approach provide strong support for many of the accepted relationships among phrynosomatid lizards, suggesting that RAD loci contain useful phylogenetic signal across a range of divergence times despite the presence of missing data. Phylogenetic analysis of RAD loci requires careful attention to model assumptions, especially if downstream analyses depend on branch lengths.


Assuntos
Classificação/métodos , Simulação por Computador , Filogenia , Polimorfismo de Nucleotídeo Único/genética , Animais , Lagartos/classificação , Lagartos/genética
2.
Theor Popul Biol ; 100C: 88-97, 2015 03.
Artigo em Inglês | MEDLINE | ID: mdl-25542067

RESUMO

For a one- or two-dimensional lattice of finite length consisting of populations, each of which has the same population size, the classical stepping-stone model has been used to approximate the patterns of variation at neutral loci in geographic regions. In the pioneering papers by Maruyama (1970a, 1970b, 1971) the changes of gene frequency at a locus subject to neutral mutation between two alleles, migration, and random genetic drift were modeled by a vector autoregression model. Maruyama was able to use the spectrum of the migration matrix, but to do this he had to introduce approximations in which there was either extra mutation in the terminal populations, or extra migration from the subterminal population into the terminal population. In this paper a similar vector autoregression model is used, but it proves possible to obtain the eigenvalues and eigenvectors of the migration matrix without those approximations. Approximate formulas for the variances and covariances of gene frequencies in different populations are obtained, and checked by numerical iteration of the exact covariances of the vector autoregression model.

3.
Mol Biol Evol ; 29(8): 1917-32, 2012 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-22422763

RESUMO

The multispecies coalescent provides an elegant theoretical framework for estimating species trees and species demographics from genetic markers. However, practical applications of the multispecies coalescent model are limited by the need to integrate or sample over all gene trees possible for each genetic marker. Here we describe a polynomial-time algorithm that computes the likelihood of a species tree directly from the markers under a finite-sites model of mutation effectively integrating over all possible gene trees. The method applies to independent (unlinked) biallelic markers such as well-spaced single nucleotide polymorphisms, and we have implemented it in SNAPP, a Markov chain Monte Carlo sampler for inferring species trees, divergence dates, and population sizes. We report results from simulation experiments and from an analysis of 1997 amplified fragment length polymorphism loci in 69 individuals sampled from six species of Ourisia (New Zealand native foxglove).


Assuntos
Alelos , Digitalis/genética , Genes de Plantas/genética , Filogenia , Algoritmos , Análise do Polimorfismo de Comprimento de Fragmentos Amplificados , Simulação por Computador , Bases de Dados Genéticas , Marcadores Genéticos , Funções Verossimilhança , Nova Zelândia , Especificidade da Espécie
4.
Am J Hum Genet ; 87(6): 848-56, 2010 Dec 10.
Artigo em Inglês | MEDLINE | ID: mdl-21109223

RESUMO

Determining historical sex ratios throughout human evolution can provide insight into patterns of genomic variation, the structure and composition of ancient populations, and the cultural factors that influence the sex ratio (e.g., sex-specific migration rates). Although numerous studies have suggested that unequal sex ratios have existed in human evolutionary history, a coherent picture of sex-biased processes has yet to emerge. For example, two recent studies compared human X chromosome to autosomal variation to make inferences about historical sex ratios but reached seemingly contradictory conclusions, with one study finding evidence for a male bias and the other study identifying a female bias. Here, we show that a large part of this discrepancy can be explained by methodological differences. Specifically, through reanalysis of empirical data, derivation of explicit analytical formulae, and extensive simulations we demonstrate that two estimators of the effective sex ratio based on population structure and nucleotide diversity preferentially detect biases that have occurred on different timescales. Our results clarify apparently contradictory evidence on the role of sex-biased processes in human evolutionary history and show that extant patterns of human genomic variation are consistent with both a recent male bias and an earlier, persistent female bias.


Assuntos
Razão de Masculinidade , Cromossomos Humanos X , Feminino , Humanos , Masculino
5.
Am Nat ; 179(2): 145-56, 2012 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-22218305

RESUMO

The threshold model developed by Sewall Wright in 1934 can be used to model the evolution of two-state discrete characters along a phylogeny. The model assumes that there is a quantitative character, called liability, that is unobserved and that determines the discrete character according to whether the liability exceeds a threshold value. A Markov chain Monte Carlo algorithm is used to infer the evolutionary covariances of the liabilities for discrete characters, sampling liability values consistent with the phylogeny and with the observed data. The same approach can also be used for continuous characters by assuming that the tip species have values that have been observed. In this way, one can make a comparative-methods analysis that combines both discrete and continuous characters. Simulations are presented showing that the covariances of the liabilities are successfully estimated, although precision can be achieved only by using a large number of species, and we must always worry whether the covariances and the model apply throughout the group. An advantage of the threshold model is that the model can be straightforwardly extended to accommodate within-species phenotypic variation and allows an interface with quantitative-genetics models.


Assuntos
Evolução Biológica , Classificação/métodos , Modelos Biológicos , Fenótipo , Algoritmos , Funções Verossimilhança , Cadeias de Markov , Modelos Genéticos , Método de Monte Carlo , Filogenia , Especificidade da Espécie
6.
Genetics ; 180(2): 1095-105, 2008 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-18780754

RESUMO

We have developed a pruning algorithm for likelihood estimation of a tree of populations. This algorithm enables us to compute the likelihood for large trees. Thus, it gives an efficient way of obtaining the maximum-likelihood estimate (MLE) for a given tree topology. Our method utilizes the differences accumulated by random genetic drift in allele count data from single-nucleotide polymorphisms (SNPs), ignoring the effect of mutation after divergence from the common ancestral population. The computation of the maximum-likelihood tree involves both maximizing likelihood over branch lengths of a given topology and comparing the maximum-likelihood across topologies. Here our focus is the maximization of likelihood over branch lengths of a given topology. The pruning algorithm computes arrays of probabilities at the root of the tree from the data at the tips of the tree; at the root, the arrays determine the likelihood. The arrays consist of probabilities related to the number of coalescences and allele counts for the partially coalesced lineages. Computing these probabilities requires an unusual two-stage algorithm. Our computation is exact and avoids time-consuming Monte Carlo methods. We can also correct for ascertainment bias.


Assuntos
Algoritmos , Evolução Molecular , Genética Populacional/estatística & dados numéricos , Funções Verossimilhança , Polimorfismo de Nucleotídeo Único
7.
Am Nat ; 171(6): 713-25, 2008 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-18419518

RESUMO

Comparative methods analyses have usually assumed that the species phenotypes are the true means for those species. In most analyses, the actual values used are means of samples of modest size. The covariances of contrasts then involve both the covariance of evolutionary changes and a fraction of the within-species phenotypic covariance, the fraction depending on the sample size for that species. Ives et al. have shown how to analyze data in this case when the within-species phenotypic covariances are known. The present model allows them to be unknown and to be estimated from the data. A multivariate normal statistical model is used for multiple characters in samples of finite size from species related by a known phylogeny, under the usual Brownian motion model of change and with equal within-species phenotypic covariances. Contrasts in each character can be obtained both between individuals within a species and between species. Each contrast can be taken for all of the characters. These sets of contrasts, each the same contrast taken for different characters, are independent. The within-set covariances are unequal and depend on the unknown true covariance matrices. An expectation-maximization algorithm is derived for making a reduced maximum likelihood estimate of the covariances of evolutionary change and the within-species phenotypic covariances. It is available in the Contrast program of the PHYLIP package. Computer simulations show that the covariances are biased when the finiteness of sample size is not taken into account and that using the present model corrects the bias. Sampling variation reduces the power of inference of covariation in evolution of different characters. An extension of this method to incorporate estimates of additive genetic covariances from a simple genetic experiment is also discussed.


Assuntos
Evolução Biológica , Modelos Genéticos , Animais , Simulação por Computador , Especiação Genética , Variação Genética , Filogenia , Viés de Seleção
8.
Cladistics ; 2(2): 130-143, 1986 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-34949068

RESUMO

Abstract- Farris (1985) claimed that my assertions about unbiasedness and consistency of estimates of a phylogeny obtained by least squares fitting are in error. The counterexample he constructed violates the assumptions of additivity and independence of distances which were clearly stated in my earlier paper. As such it is not a valid counterexample. It is argued, contrary to Farris's claims, that one need not avoid nonmetric distances, and that one should avoid negative branch lengths in estimates of phylogenies from distance data. Statistical tests of clockness, and, to a limited extent, of alternative phylogenies can be constructed, and these are demonstrated by example. A computer program to infer phylogenies from distance matrices has been in free distribution by me for several years; it seems as effective as the program recently announced by Farris. Information on phylogenies is present in distance data, as in other kinds of data, and statistical methods can be developed to extract it.

9.
Philos Trans R Soc Lond B Biol Sci ; 366(1569): 1410-24, 2011 May 12.
Artigo em Inglês | MEDLINE | ID: mdl-21444315

RESUMO

How do we quantify patterns (such as responses to local selection) sampled across multiple populations within a single species? Key to this question is the extent to which populations within species represent statistically independent data points in our analysis. Comparative analyses across species and higher taxa have long recognized the need to control for the non-independence of species data that arises through patterns of shared common ancestry among them (phylogenetic non-independence), as have quantitative genetic studies of individuals linked by a pedigree. Analyses across populations lacking pedigree information fall in the middle, and not only have to deal with shared common ancestry, but also the impact of exchange of migrants between populations (gene flow). As a result, phenotypes measured in one population are influenced by processes acting on others, and may not be a good guide to either the strength or direction of local selection. Although many studies examine patterns across populations within species, few consider such non-independence. Here, we discuss the sources of non-independence in comparative analysis, and show why the phylogeny-based approaches widely used in cross-species analyses are unlikely to be useful in analyses across populations within species. We outline the approaches (intraspecific contrasts, generalized least squares, generalized linear mixed models and autoregression) that have been used in this context, and explain their specific assumptions. We highlight the power of 'mixed models' in many contexts where problems of non-independence arise, and show that these allow incorporation of both shared common ancestry and gene flow. We suggest what can be done when ideal solutions are inaccessible, highlight the need for incorporation of a wider range of population models in intraspecific comparative methods and call for simulation studies of the error rates associated with alternative approaches.


Assuntos
Evolução Biológica , Fluxo Gênico , Modelos Genéticos , Animais , Genética Populacional , Fenótipo
10.
Mol Biol Evol ; 23(3): 691-700, 2006 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-16364968

RESUMO

A computer simulation study has been made of the accuracy of estimates of Theta = 4Nemu from a sample from a single isolated population of finite size. The accuracies turn out to be well predicted by a formula developed by Fu and Li, who used optimistic assumptions. Their formulas are restated in terms of accuracy, defined here as the reciprocal of the squared coefficient of variation. This should be proportional to sample size when the entities sampled provide independent information. Using these formulas for accuracy, the sampling strategy for estimation of Theta can be investigated. Two models for cost have been used, a cost-per-base model and a cost-per-read model. The former would lead us to prefer to have a very large number of loci, each one base long. The latter, which is more realistic, causes us to prefer to have one read per locus and an optimum sample size which declines as costs of sampling organisms increase. For realistic values, the optimum sample size is 8 or fewer individuals. This is quite close to the results obtained by Pluzhnikov and Donnelly for a cost-per-base model, evaluating other estimators of Theta. It can be understood by considering that the resources spent collecting larger samples prevent us from considering more loci. An examination of the efficiency of Watterson's estimator of Theta was also made, and it was found to be reasonably efficient when the number of mutants per generation in the sequence in the whole population is less than 2.5.


Assuntos
Variação Genética , Genética Populacional , Funções Verossimilhança , Humanos , Matemática , Modelos Genéticos , Filogenia
11.
Philos Trans R Soc Lond B Biol Sci ; 360(1459): 1427-34, 2005 Jul 29.
Artigo em Inglês | MEDLINE | ID: mdl-16048785

RESUMO

Sewall Wright's threshold model has been used in modelling discrete traits that may have a continuous trait underlying them, but it has proven difficult to make efficient statistical inferences with it. The availability of Markov chain Monte Carlo (MCMC) methods makes possible likelihood and Bayesian inference using this model. This paper discusses prospects for the use of the threshold model in morphological systematics to model the evolution of discrete all-or-none traits. There the threshold model has the advantage over 0/1 Markov process models in that it not only accommodates polymorphism within species, but can also allow for correlated evolution of traits with far fewer parameters that need to be inferred. The MCMC importance sampling methods needed to evaluate likelihood ratios for the threshold model are introduced and described in some detail.


Assuntos
Evolução Biológica , Classificação/métodos , Modelos Genéticos , Teorema de Bayes , Funções Verossimilhança , Cadeias de Markov , Método de Monte Carlo , Filogenia , Locos de Características Quantitativas , Especificidade da Espécie
12.
Evolution ; 39(4): 783-791, 1985 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-28561359

RESUMO

The recently-developed statistical method known as the "bootstrap" can be used to place confidence intervals on phylogenies. It involves resampling points from one's own data, with replacement, to create a series of bootstrap samples of the same size as the original data. Each of these is analyzed, and the variation among the resulting estimates taken to indicate the size of the error involved in making estimates from the original data. In the case of phylogenies, it is argued that the proper method of resampling is to keep all of the original species while sampling characters with replacement, under the assumption that the characters have been independently drawn by the systematist and have evolved independently. Majority-rule consensus trees can be used to construct a phylogeny showing all of the inferred monophyletic groups that occurred in a majority of the bootstrap samples. If a group shows up 95% of the time or more, the evidence for it is taken to be statistically significant. Existing computer programs can be used to analyze different bootstrap samples by using weights on the characters, the weight of a character being how many times it was drawn in bootstrap sampling. When all characters are perfectly compatible, as envisioned by Hennig, bootstrap sampling becomes unnecessary; the bootstrap method would show significant evidence for a group if it is defined by three or more characters.

13.
Evolution ; 46(1): 159-173, 1992 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-28564959

RESUMO

Restriction sites data can be analyzed by maximum likelihood to obtain estimates of phylogenies. The likelihood methods of Smouse and Li, who were able to compute likelihoods for up to four species under a simplified model of base change, can be extended numerically to deal with any number of species. The computational methods for doing so are outlined. The resulting algorithms are slow but take multiple gains and losses of restriction sites fully into account, unlike parsimony methods. They allow for the failure to observe potential sites that are absent from all species. Analysis of the five-species hominoid data of Ferris and coworkers confirms the pattern found by Smouse and Li with four species-that a chimpanzee-gorilla clade is favored, but not statistically significantly over other tree topologies. A large data set produced by computer simulation has also been analyzed to confirm that the method works properly. The methods used here do not allow for different rates of transitions and transversions. They can be extended to do so, but only at a cost of considerably slower computations. The present method is available in a computer program.

14.
Evolution ; 38(1): 16-24, 1984 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-28556064
16.
Evolution ; 35(1): 124-138, 1981 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-28563447
17.
Evolution ; 40(4): 883-889, 1986 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-28556149
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA