Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 10 de 10
Filtrar
Mais filtros

Base de dados
País/Região como assunto
Tipo de documento
Intervalo de ano de publicação
1.
Bioinformatics ; 30(4): 576-7, 2014 Feb 15.
Artigo em Inglês | MEDLINE | ID: mdl-24336146

RESUMO

SUMMARY: forqs is a forward-in-time simulation of recombination, quantitative traits and selection. It was designed to investigate haplotype patterns resulting from scenarios where substantial evolutionary change has taken place in a small number of generations due to recombination and/or selection on polygenic quantitative traits. AVAILABILITY AND IMPLEMENTATION: forqs is implemented as a command-line C++ program. Source code and binary executables for Linux, OSX and Windows are freely available under a permissive BSD license: https://bitbucket.org/dkessner/forqs.


Assuntos
Locos de Características Quantitativas/genética , Recombinação Genética/genética , Seleção Genética/genética , Software , Algoritmos , Simulação por Computador , Evolução Molecular , Genética Populacional , Haplótipos/genética , Humanos
2.
Mol Biol Evol ; 30(5): 1145-58, 2013 May.
Artigo em Inglês | MEDLINE | ID: mdl-23364324

RESUMO

DNA samples are often pooled, either by experimental design or because the sample itself is a mixture. For example, when population allele frequencies are of primary interest, individual samples may be pooled together to lower the cost of sequencing. Alternatively, the sample itself may be a mixture of multiple species or strains (e.g., bacterial species comprising a microbiome or pathogen strains in a blood sample). We present an expectation-maximization algorithm for estimating haplotype frequencies in a pooled sample directly from mapped sequence reads, in the case where the possible haplotypes are known. This method is relevant to the analysis of pooled sequencing data from selection experiments, as well as the calculation of proportions of different species within a metagenomics sample. Our method outperforms existing methods based on single-site allele frequencies, as well as simple approaches using sequence read data. We have implemented the method in a freely available open-source software tool.


Assuntos
Haplótipos/genética , Funções Verossimilhança , Metagenômica/métodos , Algoritmos , Animais , Frequência do Gene/genética , Genótipo , Humanos , RNA Ribossômico 16S/genética
3.
Mol Cell Proteomics ; 10(1): R110.000133, 2011 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-20716697

RESUMO

Mass spectrometry is a fundamental tool for discovery and analysis in the life sciences. With the rapid advances in mass spectrometry technology and methods, it has become imperative to provide a standard output format for mass spectrometry data that will facilitate data sharing and analysis. Initially, the efforts to develop a standard format for mass spectrometry data resulted in multiple formats, each designed with a different underlying philosophy. To resolve the issues associated with having multiple formats, vendors, researchers, and software developers convened under the banner of the HUPO PSI to develop a single standard. The new data format incorporated many of the desirable technical attributes from the previous data formats, while adding a number of improvements, including features such as a controlled vocabulary with validation tools to ensure consistent usage of the format, improved support for selected reaction monitoring data, and immediately available implementations to facilitate rapid adoption by the community. The resulting standard data format, mzML, is a well tested open-source format for mass spectrometer output files that can be readily utilized by the community and easily adapted for incremental advances in mass spectrometry technology.


Assuntos
Bases de Dados de Proteínas/normas , Espectrometria de Massas/métodos , Espectrometria de Massas/normas , Software/normas , Padrões de Referência , Reprodutibilidade dos Testes
4.
Hum Mutat ; 33(7): 1087-98, 2012 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-22415848

RESUMO

Genetic variation in LRRK2 predisposes to Parkinson disease (PD), which underpins its development as a therapeutic target. Here, we aimed to identify novel genotype-phenotype associations that might support developing LRRK2 therapies for other conditions. We sequenced the 51 exons of LRRK2 in cases comprising 12 common diseases (n = 9,582), and in 4,420 population controls. We identified 739 single-nucleotide variants, 62% of which were observed in only one person, including 316 novel exonic variants. We found evidence of purifying selection for the LRRK2 gene and a trend suggesting that this is more pronounced in the central (ROC-COR-kinase) core protein domains of LRRK2 than the flanking domains. Population genetic analyses revealed that LRRK2 is not especially polymorphic or differentiated in comparison to 201 other drug target genes. Among Europeans, we identified 17 carriers (0.13%) of pathogenic LRRK2 mutations that were not significantly enriched within any disease or in those reporting a family history of PD. Analysis of pathogenic mutations within Europe reveals that the p.Arg1628Pro (c4883G>C) mutation arose independently in Europe and Asia. Taken together, these findings demonstrate how targeted deep sequencing can help to reveal fundamental characteristics of clinically important loci.


Assuntos
Sequenciamento de Nucleotídeos em Larga Escala/métodos , Proteínas Serina-Treonina Quinases/genética , Europa (Continente) , Predisposição Genética para Doença , Genética Populacional , Humanos , Serina-Treonina Proteína Quinase-2 com Repetições Ricas em Leucina , Mutação , Doença de Parkinson/genética , População Branca/genética
5.
Bioinformatics ; 24(21): 2534-6, 2008 Nov 01.
Artigo em Inglês | MEDLINE | ID: mdl-18606607

RESUMO

SUMMARY: The ProteoWizard software project provides a modular and extensible set of open-source, cross-platform tools and libraries. The tools perform proteomics data analyses; the libraries enable rapid tool creation by providing a robust, pluggable development framework that simplifies and unifies data file access, and performs standard proteomics and LCMS dataset computations. The library contains readers and writers of the mzML data format, which has been written using modern C++ techniques and design principles and supports a variety of platforms with native compilers. The software has been specifically released under the Apache v2 license to ensure it can be used in both academic and commercial projects. In addition to the library, we also introduce a rapidly growing set of companion tools whose implementation helps to illustrate the simplicity of developing applications on top of the ProteoWizard library. AVAILABILITY: Cross-platform software that compiles using native compilers (i.e. GCC on Linux, MSVC on Windows and XCode on OSX) is available for download free of charge, at http://proteowizard.sourceforge.net. This website also provides code examples, and documentation. It is our hope the ProteoWizard project will become a standard platform for proteomics development; consequently, code use, contribution and further development are strongly encouraged.


Assuntos
Proteoma/análise , Proteômica/métodos , Software , Bases de Dados de Proteínas , Proteínas/análise , Proteínas/química , Proteoma/química , Interface Usuário-Computador
6.
Genetics ; 199(4): 991-1005, 2015 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-25672748

RESUMO

Evolve and resequence studies combine artificial selection experiments with massively parallel sequencing technology to study the genetic basis for complex traits. In these experiments, individuals are selected for extreme values of a trait, causing alleles at quantitative trait loci (QTL) to increase or decrease in frequency in the experimental population. We present a new analysis of the power of artificial selection experiments to detect and localize quantitative trait loci. This analysis uses a simulation framework that explicitly models whole genomes of individuals, quantitative traits, and selection based on individual trait values. We find that explicitly modeling QTL provides qualitatively different insights than considering independent loci with constant selection coefficients. Specifically, we observe how interference between QTL under selection affects the trajectories and lengthens the fixation times of selected alleles. We also show that a substantial portion of the genetic variance of the trait (50-100%) can be explained by detected QTL in as little as 20 generations of selection, depending on the trait architecture and experimental design. Furthermore, we show that power depends crucially on the opportunity for recombination during the experiment. Finally, we show that an increase in power is obtained by leveraging founder haplotype information to obtain allele frequency estimates.


Assuntos
Genoma de Inseto , Modelos Genéticos , Locos de Características Quantitativas , Seleção Genética , Algoritmos , Animais , Drosophila/genética , Variação Genética , Recombinação Genética
7.
Science ; 337(6090): 100-4, 2012 Jul 06.
Artigo em Inglês | MEDLINE | ID: mdl-22604722

RESUMO

Rare genetic variants contribute to complex disease risk; however, the abundance of rare variants in human populations remains unknown. We explored this spectrum of variation by sequencing 202 genes encoding drug targets in 14,002 individuals. We find rare variants are abundant (1 every 17 bases) and geographically localized, so that even with large sample sizes, rare variant catalogs will be largely incomplete. We used the observed patterns of variation to estimate population growth parameters, the proportion of variants in a given frequency class that are putatively deleterious, and mutation rates for each gene. We conclude that because of rapid population growth and weak purifying selection, human populations harbor an abundance of rare variants, many of which are deleterious and have relevance to understanding disease risk.


Assuntos
Doença/genética , Variação Genética , Genoma Humano , Negro ou Afro-Americano/genética , Povo Asiático , Frequência do Gene , Estudos de Associação Genética , Predisposição Genética para Doença , Geografia , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Terapia de Alvo Molecular , Herança Multifatorial , Taxa de Mutação , Farmacogenética , Fenótipo , Polimorfismo de Nucleotídeo Único , Crescimento Demográfico , Tamanho da Amostra , Seleção Genética , População Branca/genética
8.
Nat Genet ; 43(9): 847-53, 2011 Jul 20.
Artigo em Inglês | MEDLINE | ID: mdl-21775992

RESUMO

Studies of recombination and how it varies depend crucially on accurate recombination maps. We propose a new approach for constructing high-resolution maps of relative recombination rates based on the observation of ancestry switch points among admixed individuals. We show the utility of this approach using simulations and by applying it to SNP genotype data from a sample of 2,565 African Americans and 299 African Caribbeans and detecting several hundred thousand recombination events. Comparison of the inferred map with high-resolution maps from non-admixed populations provides evidence of fine-scale differentiation in recombination rates between populations. Overall, the admixed map is well predicted by the average proportion of admixture and the recombination rate estimates from the source populations. The exceptions to this are in areas surrounding known large chromosomal structural variants, specifically inversions. These results suggest that outside of structurally variable regions, admixture does not substantially disrupt the factors controlling recombination rates in humans.


Assuntos
Mapeamento Cromossômico/métodos , Recombinação Genética , Negro ou Afro-Americano/genética , Haplótipos , Humanos , Linhagem , Polimorfismo de Nucleotídeo Único
9.
J Proteome Res ; 7(9): 4031-9, 2008 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-18707148

RESUMO

Mass spectrometry-based proteomics experiments have become an important tool for studying biological systems. Identifying the proteins in complex mixtures by assigning peptide fragmentation spectra to peptide sequences is an important step in the proteomics process. The 1-2 ppm mass-accuracy of hybrid instruments, like the LTQ-FT, has been cited as a key factor in their ability to identify a larger number of peptides with greater confidence than competing instruments. However, in replicate experiments of an 18-protein mixture, we note parent masses deviate 171 ppm, on average, for ion-trap data directed identifications and 8 ppm, on average, for preview Fourier transform (FT) data directed identifications. These deviations are neither caused by poor calibration nor by excessive ion-loading and are most likely due to errors in parent mass estimation. To improve these deviations, we introduce msPrefix, a program to re-estimate a peptide's parent mass from an associated high-accuracy full-scan survey spectrum. In 18-protein mixture experiments, msPrefix parent mass estimates deviate only 1 ppm, on average, from the identified peptides. In a cell lysate experiment searched with a tolerance of 50 ppm, 2295 peptides were confidently identified using native data and 4560 using msPrefixed data. Likewise, in a plasma experiment searched with a tolerance of 50 ppm, 326 peptides were identified using native data and 1216 using msPrefixed data. msPrefix is also able to determine which MS/MS spectra were possibly derived from multiple precursor ions. In complex mixture experiments, we demonstrate that more than 50% of triggered MS/MS may have had multiple precursor ions and note that spectra with multiple candidate ions are less likely to result in an identification using TANDEM. These results demonstrate integration of msPrefix into traditional shotgun proteomics workflows significantly improves identification results.


Assuntos
Proteínas Sanguíneas/química , Peptídeos/análise , Espectrometria de Massas em Tandem/métodos , Algoritmos , Bases de Dados de Proteínas , Humanos , Espectrometria de Massas em Tandem/instrumentação
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA