Pesquisa | BVS IEC

Family-based tests for associating haplotypes with general phenotype data: Improving the FBAT-haplotype algorithm.

Hecker, Julian; Xu, Xin; Townes, F William; Loehlein Fier, Heide; Corcoran, Chris; Laird, Nan; Lange, Christoph.

Genet Epidemiol ; 42(1): 123-126, 2018 02.

Artigo em Inglês | MEDLINE | ID: mdl-29159827

RESUMO

For family-based association studies, Horvath et al. proposed an algorithm for the association analysis between haplotypes and arbitrary phenotypes when the phase of the haplotypes is unknown, that is, genotype data is given. Their approach to haplotype analysis maintains the original features of the TDT/FBAT-approach, that is, complete robustness against genetic confounding and misspecification of the phenotype. The algorithm has been implemented in the FBAT and PBAT software package and has been used in numerous substantive manuscripts. Here, we propose a simplification of the original algorithm that maintains the original approach but reduces the computational burden of the approach substantially and gives valuable insights regarding the conditional distribution. With the modified algorithm, the application to whole-genome sequencing (WGS) studies becomes feasible; for example, in sliding window approaches or spatial-clustering approaches. The reduction of the computational burden that our modification provides is especially dramatic when both parental genotypes are missing. For example, for eight variants and 441 nuclear families with mostly offspring-only families, in a WGS study at the APOE locus, the running time decreased from approximately 21 hr for the original algorithm to 0.11 sec after our modification.

Assuntos

Algoritmos , Haplótipos , Núcleo Familiar , Fenótipo , Apolipoproteínas E/genética , Análise por Conglomerados , Feminino , Humanos , Masculino , Modelos Genéticos , Fatores de Tempo , Sequenciamento Completo do Genoma

On the association analysis of genome-sequencing data: A spatial clustering approach for partitioning the entire genome into nonoverlapping windows.

Loehlein Fier, Heide; Prokopenko, Dmitry; Hecker, Julian; Cho, Michael H; Silverman, Edwin K; Weiss, Scott T; Tanzi, Rudolph E; Lange, Christoph.

Genet Epidemiol ; 41(4): 332-340, 2017 05.

Artigo em Inglês | MEDLINE | ID: mdl-28318110

RESUMO

For the association analysis of whole-genome sequencing (WGS) studies, we propose an efficient and fast spatial-clustering algorithm. Compared to existing analysis approaches for WGS data, that define the tested regions either by sliding or consecutive windows of fixed sizes along variants, a meaningful grouping of nearby variants into consecutive regions has the advantage that, compared to sliding window approaches, the number of tested regions is likely to be smaller. In comparison to consecutive, fixed-window approaches, our approach is likely to group nearby variants together. Given existing biological evidence that disease-associated mutations tend to physically cluster in specific regions along the chromosome, the identification of meaningful groups of nearby located variants could thus lead to a potential power gain for association analysis. Our algorithm defines consecutive genomic regions based on the physical positions of the variants, assuming an inhomogeneous Poisson process and groups together nearby variants. As parameters are estimated locally, the algorithm takes the differing variant density along the chromosome into account and provides locally optimal partitioning of variants into consecutive regions. An R-implementation of the algorithm is provided. We discuss the theoretical advances of our algorithm compared to existing, window-based approaches and show the performance and advantage of our introduced algorithm in a simulation study and by an application to Alzheimer's disease WGS data. Our analysis identifies a region in the ITGB3 gene that potentially harbors disease susceptibility loci for Alzheimer's disease. The region-based association signal of ITGB3 replicates in an independent data set and achieves formally genome-wide significance. Software Implementation: An implementation of the algorithm in R is available at: https://github.com/heidefier/cluster_wgs_data.

Assuntos

Estudo de Associação Genômica Ampla , Genoma , Análise de Sequência de DNA , Algoritmos , Doença de Alzheimer/genética , Análise por Conglomerados , Simulação por Computador , Genômica , Humanos , Modelos Genéticos , Software

Using Network Methodology to Infer Population Substructure.

Prokopenko, Dmitry; Hecker, Julian; Silverman, Edwin; Nöthen, Markus M; Schmid, Matthias; Lange, Christoph; Loehlein Fier, Heide.

PLoS One ; 10(6): e0130708, 2015.

Artigo em Inglês | MEDLINE | ID: mdl-26098940

RESUMO

One of the main caveats of association studies is the possible affection by bias due to population stratification. Existing methods rely on model-based approaches like structure and ADMIXTURE or on principal component analysis like EIGENSTRAT. Here we provide a novel visualization technique and describe the problem of population substructure from a graph-theoretical point of view. We group the sequenced individuals into triads, which depict the relational structure, on the basis of a predefined pairwise similarity measure. We then merge the triads into a network and apply community detection algorithms in order to identify homogeneous subgroups or communities, which can further be incorporated as covariates into logistic regression. We apply our method to populations from different continents in the 1000 Genomes Project and evaluate the type 1 error based on the empirical p-values. The application to 1000 Genomes data suggests that the network approach provides a very fine resolution of the underlying ancestral population structure. Besides we show in simulations, that in the presence of discrete population structures, our developed approach maintains the type 1 error more precisely than existing approaches.

Assuntos

Algoritmos , Modelos Genéticos , População/genética , Humanos , Polimorfismo de Nucleotídeo Único

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA