Results 1 - 6 of 6
1.
BMC Evol Biol ; 17(1): 136, 2017 Jun 12.
Article in English | MEDLINE | ID: mdl-28606055

ABSTRACT

BACKGROUND: Amino acid substitution models play an essential role in inferring phylogenies from mitochondrial protein data. However, only a few empirical models have been estimated, and those from restricted mitochondrial protein data of a hundred species. The existing models are unlikely to represent appropriately the amino acid substitutions in hundreds of thousands of metazoan mitochondrial protein sequences. RESULTS: We selected 125,935 mitochondrial protein sequences from 34,448 species in the metazoan kingdom to estimate new amino acid substitution models targeting metazoan, vertebrate and invertebrate groups. The new models help to find phylogenies with significantly better likelihoods than those found with the existing models. We noted remarkable distances between the phylogenies inferred with the existing models and the maximum likelihood phylogenies, which indicate a considerable number of incorrect bipartitions in the phylogenies inferred with the existing models. Finally, we used the new models and mitochondrial protein data to confirm that Testudines, Aves, and Crocodylia form a separate clade within amniotes. CONCLUSIONS: We introduced new amino acid substitution models for metazoan mitochondrial proteins. The new models outperform the existing models in inferring phylogenies from metazoan mitochondrial protein data. We strongly recommend that researchers use the new models when analysing metazoan mitochondrial protein data.


Subjects
Invertebrates/genetics ; Models, Genetic ; Phylogeny ; Vertebrates/genetics ; Amino Acid Substitution ; Animals ; Evolution, Molecular ; Humans ; Mitochondria/genetics ; Mitochondrial Proteins/genetics ; Sequence Homology, Amino Acid
2.
PLoS Genet ; 9(5): e1003509, 2013 May.
Article in English | MEDLINE | ID: mdl-23717212

ABSTRACT

Combining data from genome-wide association studies (GWAS) conducted at different locations, using genotype imputation and fixed-effects meta-analysis, has been a powerful approach for dissecting complex disease genetics in populations of European ancestry. Here we investigate the feasibility of applying the same approach in Africa, where genetic diversity, both within and between populations, is far more extensive. We analyse genome-wide data from approximately 5,000 individuals with severe malaria and 7,000 population controls from three different locations in Africa. Our results show that the standard approach is well powered to detect known malaria susceptibility loci when sample sizes are large, and that modern methods for association analysis can control the potential confounding effects of population structure. We show that the pattern of association around the haemoglobin S allele differs substantially across populations due to differences in haplotype structure. Motivated by these observations, we consider new approaches to association analysis that might prove valuable for multicentre GWAS in Africa: we relax the assumptions of SNP-based fixed-effect analysis; we apply Bayesian approaches to allow for heterogeneity in the effect of an allele on risk across studies; and we introduce a region-based test to allow for heterogeneity in the location of causal alleles.
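
As a minimal sketch of what a fixed-effects meta-analysis computes, the Python below pools per-site effect estimates by inverse-variance weighting and reports Cochran's Q as a simple heterogeneity check. The abstract does not state which implementation the study used; the function name, the choice of inputs (one log odds ratio and standard error per site) and the example numbers are all hypothetical.

```python
import math

def fixed_effect_meta(betas, ses):
    """Inverse-variance fixed-effect pooling of per-study effect estimates.

    betas: per-study effect estimates (e.g. log odds ratios)
    ses:   their standard errors
    Returns (pooled estimate, pooled standard error, Cochran's Q).
    """
    if not betas or len(betas) != len(ses):
        raise ValueError("betas and ses must be non-empty and of equal length")
    # Each study is weighted by the inverse of its variance.
    weights = [1.0 / (se * se) for se in ses]
    total = sum(weights)
    pooled = sum(w * b for w, b in zip(weights, betas)) / total
    pooled_se = math.sqrt(1.0 / total)
    # Cochran's Q: weighted squared deviations from the pooled estimate.
    q = sum(w * (b - pooled) ** 2 for w, b in zip(weights, betas))
    return pooled, pooled_se, q

# Hypothetical values for three sites (not taken from the study).
pooled, pooled_se, q = fixed_effect_meta([-0.20, -0.35, -0.10], [0.08, 0.10, 0.12])
print(f"pooled log OR = {pooled:.3f}, SE = {pooled_se:.3f}, Q = {q:.2f}")
```

A large Q relative to the number of studies minus one suggests that the effect differs between sites, which is the situation the Bayesian and region-based approaches in the abstract are meant to accommodate.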


Subjects
Black People/genetics ; Genome-Wide Association Study ; Hemoglobin, Sickle/genetics ; Malaria/genetics ; Africa ; Bayes Theorem ; Chromosome Mapping ; Genetic Heterogeneity ; Genetic Predisposition to Disease ; Genetic Variation ; Genetics, Population ; Genome, Human ; Haplotypes ; Humans ; Linkage Disequilibrium ; Malaria/epidemiology ; Malaria/pathology ; Polymorphism, Single Nucleotide
3.
BMC Bioinformatics ; 15: 341, 2014 Oct 24.
Article in English | MEDLINE | ID: mdl-25344302

ABSTRACT

BACKGROUND: Amino acid replacement rate matrices are a crucial component of many protein analysis systems such as sequence similarity search, sequence alignment, and phylogenetic inference. Ideally, the rate matrix reflects the mutational behavior of the actual data under study; however, estimating amino acid replacement rate matrices requires large protein alignments and is computationally expensive and complex. As a compromise, sub-optimal pre-calculated generic matrices are typically used for protein-based phylogeny. Sequence availability has now grown to a point where problem-specific rate matrices can often be calculated if the computational cost can be controlled. RESULTS: The most time-consuming step in estimating rate matrices by maximum likelihood is building maximum likelihood phylogenetic trees from protein alignments. We propose a new procedure, called FastMG, to overcome this obstacle. The key innovation is the alignment-splitting algorithm that splits alignments with many sequences into non-overlapping sub-alignments prior to estimating amino acid replacement rates. Experiments with different large data sets showed that the FastMG procedure was an order of magnitude faster than estimation without splitting. Importantly, there was no apparent loss in matrix quality when an appropriate splitting procedure was used. CONCLUSIONS: FastMG is a simple, fast and accurate procedure to estimate amino acid replacement rate matrices from large data sets. It enables researchers to study the evolutionary relationships for specific groups of proteins or taxa with optimized, data-specific amino acid replacement rate matrices. The programs, data sets, and the new mammalian mitochondrial protein rate matrix are available at http://fastmg.codeplex.com.
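
The idea of splitting a large alignment into non-overlapping sub-alignments can be sketched in Python as below. This is an illustration only: the abstract does not describe FastMG's actual splitting rule, so the random grouping, the function name `split_alignment`, the `max_seqs` parameter and the toy alignment are assumptions made for the example.

```python
import random

def split_alignment(alignment, max_seqs, seed=0):
    """Partition an alignment into disjoint sub-alignments.

    alignment: dict mapping sequence name -> aligned sequence (equal lengths)
    max_seqs:  maximum number of sequences per sub-alignment
    Assumption: sequences are grouped at random; FastMG may group differently.
    """
    if max_seqs < 1:
        raise ValueError("max_seqs must be at least 1")
    names = sorted(alignment)            # deterministic starting order
    random.Random(seed).shuffle(names)   # reproducible random grouping
    # Consecutive chunks of the shuffled names never share a sequence.
    return [
        {name: alignment[name] for name in names[i:i + max_seqs]}
        for i in range(0, len(names), max_seqs)
    ]

# Toy alignment (hypothetical data).
toy = {"s1": "MKV-", "s2": "MKVL", "s3": "MRVL", "s4": "MKIL", "s5": "M-IL"}
for sub in split_alignment(toy, max_seqs=2):
    print(sorted(sub))
```

Each sub-alignment keeps all columns and only a subset of rows, so a tree can be built per sub-alignment at far lower cost than for the full alignment. A practical version would also need to avoid very small groups, such as the single-sequence remainder this sketch can produce.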


Subjects
Algorithms ; Amino Acid Substitution ; Evolution, Molecular ; Likelihood Functions ; Animals ; Phylogeny ; Probability ; Proteins/chemistry ; Proteins/genetics
4.
Bioinformatics ; 27(19): 2758-60, 2011 Oct 01.
Article in English | MEDLINE | ID: mdl-21791535

ABSTRACT

SUMMARY: Amino acid replacement rate matrices are an essential basis of protein studies (e.g. in phylogenetics and alignment). A number of general purpose matrices have been proposed (e.g. JTT, WAG, LG) since the seminal work of Margaret Dayhoff and co-workers. However, it has been shown that matrices specific to certain protein groups (e.g. mitochondrial) or life domains (e.g. viruses) differ significantly from general average matrices, and thus perform better when applied to the data to which they are dedicated. This Web server implements the maximum-likelihood estimation procedure that was used to estimate LG, and provides a number of tools and facilities. Users upload a set of multiple protein alignments from their domain of interest and receive the resulting matrix by email, along with statistics and comparisons with other matrices. A non-parametric bootstrap is performed optionally to assess the variability of replacement rate estimates. Maximum-likelihood trees, inferred using the estimated rate matrix, are also computed optionally for each input alignment. Finely tuned procedures and up-to-date ML software (PhyML 3.0, XRATE) are combined to perform all these heavy calculations on our clusters. AVAILABILITY: http://www.atgc-montpellier.fr/ReplacementMatrix/ CONTACT: olivier.gascuel@lirmm.fr SUPPLEMENTARY INFORMATION: Supplementary data are available at http://www.atgc-montpellier.fr/ReplacementMatrix/
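
A non-parametric bootstrap of an alignment is usually done by resampling its columns with replacement. The Python below is a minimal sketch of that resampling step under this common convention; the abstract does not specify how the server implements it, and the function name `bootstrap_alignment` and the toy data are hypothetical.

```python
import random

def bootstrap_alignment(alignment, seed=None):
    """Return one bootstrap replicate of an alignment.

    alignment: dict mapping sequence name -> aligned sequence (equal lengths)
    Columns (sites) are drawn with replacement until the replicate has the
    same number of sites as the original alignment.
    """
    if not alignment:
        raise ValueError("alignment is empty")
    n_sites = len(next(iter(alignment.values())))
    if any(len(seq) != n_sites for seq in alignment.values()):
        raise ValueError("all sequences must have the same length")
    rng = random.Random(seed)
    # The same sampled column indices are applied to every sequence,
    # so each resampled site stays a genuine column of the original.
    cols = [rng.randrange(n_sites) for _ in range(n_sites)]
    return {name: "".join(seq[c] for c in cols) for name, seq in alignment.items()}

# Toy alignment (hypothetical data).
toy = {"s1": "MKVL", "s2": "MRVL", "s3": "MKIL"}
replicate = bootstrap_alignment(toy, seed=1)
print(replicate)
```

Re-estimating the rate matrix on many such replicates and looking at the spread of each rate gives an indication of how variable the estimates are.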


Subjects
Amino Acids/genetics ; Phylogeny ; Sequence Alignment/statistics & numerical data ; Internet ; Likelihood Functions ; Probability ; Proteins/genetics ; Software
5.
BMC Evol Biol ; 10: 99, 2010 Apr 12.
Article in English | MEDLINE | ID: mdl-20384985

ABSTRACT

BACKGROUND: The amino acid substitution model is the core component of many protein analysis systems such as sequence similarity search, sequence alignment, and phylogenetic inference. Although several general amino acid substitution models have been estimated from large and diverse protein databases, they remain inappropriate for analyzing specific species, e.g., viruses. Emerging epidemics of influenza viruses raise the need for comprehensive studies of these dangerous viruses. We propose an influenza-specific amino acid substitution model to enhance the understanding of the evolution of influenza viruses. RESULTS: A maximum likelihood approach was applied to estimate an amino acid substitution model (FLU) from approximately 113,000 influenza protein sequences, consisting of approximately 20 million residues. FLU outperforms 14 widely used models in constructing maximum likelihood phylogenetic trees for the majority of influenza protein alignments. On average, FLU gains approximately 42 log likelihood points with an alignment of 300 sites. Moreover, topologies of trees constructed using FLU and other models are frequently different. FLU does indeed have an impact on likelihood improvement as well as tree topologies. It was implemented in PhyML and can be downloaded from ftp://ftp.sanger.ac.uk/pub/1000genomes/lsq/FLU or included in PhyML 3.0 server at http://www.atgc-montpellier.fr/phyml/. CONCLUSIONS: FLU should be useful for any influenza protein analysis system which requires an accurate description of amino acid substitutions.


Subjects
Amino Acid Substitution ; Models, Genetic ; Orthomyxoviridae/genetics ; Viral Proteins/genetics ; Humans ; Likelihood Functions ; Viral Proteins/chemistry
6.
Science ; 356(6343), 2017 Jun 16.
Article in English | MEDLINE | ID: mdl-28522690

ABSTRACT

The malaria parasite Plasmodium falciparum invades human red blood cells by a series of interactions between host and parasite surface proteins. By analyzing genome sequence data from human populations, including 1269 individuals from sub-Saharan Africa, we identify a diverse array of large copy-number variants affecting the host invasion receptor genes GYPA and GYPB. We find that a nearby association with severe malaria is explained by a complex structural rearrangement involving the loss of GYPB and gain of two GYPB-A hybrid genes, which encode a serologically distinct blood group antigen known as Dantu. This variant reduces the risk of severe malaria by 40% and has recently increased in frequency in parts of Kenya, yet it appears to be absent from west Africa. These findings link structural variation of red blood cell invasion receptors with natural resistance to severe malaria.


Subjects
Disease Resistance/genetics ; Erythrocytes/parasitology ; Glycophorins ; Host-Parasite Interactions/genetics ; Malaria, Falciparum/genetics ; Models, Molecular ; Adult ; Africa South of the Sahara ; Child ; DNA Copy Number Variations/genetics ; Gene Frequency ; Genome, Human/genetics ; Glycophorins/chemistry ; Glycophorins/genetics ; Glycophorins/metabolism ; Humans ; Protein Structure, Secondary ; Receptors, Cell Surface/chemistry ; Receptors, Cell Surface/genetics