Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 3 de 3
Filtrar
Mais filtros

Base de dados
Tipo de documento
Intervalo de ano de publicação
1.
Stat Appl Genet Mol Biol ; 14(5): 497-505, 2015 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-26506100

RESUMO

We are concerned with statistical inference for 2 × C × K contingency tables in the context of genetic case-control association studies. Multivariate methods based on asymptotic Gaussianity of vectors of test statistics require information about the asymptotic correlation structure among these test statistics under the global null hypothesis. In the case of C=2, we show that for a wide variety of test statistics this asymptotic correlation structure is given by the standardized linkage disequilibrium matrix of the K loci under investigation. Three popular choices of test statistics are discussed for illustration. In the case of C=3, the standardized composite linkage disequilibrium matrix is the limiting correlation matrix of the K locus-specific Cochran-Armitage trend test statistics.


Assuntos
Estudos de Associação Genética , Algoritmos , Estudos de Casos e Controles , Interpretação Estatística de Dados , Loci Gênicos , Predisposição Genética para Doença , Humanos , Desequilíbrio de Ligação , Modelos Genéticos , Modelos Estatísticos , Distribuição Normal , Razão de Chances
2.
Genome Med ; 5(7): 69, 2013.
Artigo em Inglês | MEDLINE | ID: mdl-23902830

RESUMO

With exome sequencing becoming a tool for mutation detection in routine diagnostics there is an increasing need for platform-independent methods of quality control. We present a genotype-weighted metric that allows comparison of all the variant calls of an exome to a high-quality reference dataset of an ethnically matched population. The exome-wide genotyping accuracy is estimated from the distance to this reference set, and does not require any further knowledge about data generation or the bioinformatics involved. The distances of our metric are visualized by non-metric multidimensional scaling and serve as an intuitive, standardizable score for the quality assessment of exome data.

3.
Nucleic Acids Res ; 40(6): 2426-31, 2012 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-22127862

RESUMO

With the availability of next-generation sequencing (NGS) technology, it is expected that sequence variants may be called on a genomic scale. Here, we demonstrate that a deeper understanding of the distribution of the variant call frequencies at heterozygous loci in NGS data sets is a prerequisite for sensitive variant detection. We model the crucial steps in an NGS protocol as a stochastic branching process and derive a mathematical framework for the expected distribution of alleles at heterozygous loci before measurement that is sequencing. We confirm our theoretical results by analyzing technical replicates of human exome data and demonstrate that the variance of allele frequencies at heterozygous loci is higher than expected by a simple binomial distribution. Due to this high variance, mutation callers relying on binomial distributed priors are less sensitive for heterozygous variants that deviate strongly from the expected mean frequency. Our results also indicate that error rates can be reduced to a greater degree by technical replicates than by increasing sequencing depth.


Assuntos
Frequência do Gene , Sequenciamento de Nucleotídeos em Larga Escala , Análise de Sequência de DNA , Alelos , Exoma , Heterozigoto , Humanos , Processos Estocásticos
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA