Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 12 de 12
Filtrar
Más filtros












Base de datos
Intervalo de año de publicación
1.
Bioinformatics ; 32(17): 2598-603, 2016 09 01.
Artículo en Inglés | MEDLINE | ID: mdl-27187203

RESUMEN

MOTIVATION: For genetic studies, statistically significant variants explain far less trait variance than 'sub-threshold' association signals. To dimension follow-up studies, researchers need to accurately estimate 'true' effect sizes at each SNP, e.g. the true mean of odds ratios (ORs)/regression coefficients (RRs) or Z-score noncentralities. Naïve estimates of effect sizes incur winner's curse biases, which are reduced only by laborious winner's curse adjustments (WCAs). Given that Z-scores estimates can be theoretically translated on other scales, we propose a simple method to compute WCA for Z-scores, i.e. their true means/noncentralities. RESULTS: WCA of Z-scores shrinks these towards zero while, on P-value scale, multiple testing adjustment (MTA) shrinks P-values toward one, which corresponds to the zero Z-score value. Thus, WCA on Z-scores scale is a proxy for MTA on P-value scale. Therefore, to estimate Z-score noncentralities for all SNPs in genome scans, we propose F: DR I: nverse Q: uantile T: ransformation (FIQT). It (i) performs the simpler MTA of P-values using FDR and (ii) obtains noncentralities by back-transforming MTA P-values on Z-score scale. When compared to competitors, realistic simulations suggest that FIQT is more (i) accurate and (ii) computationally efficient by orders of magnitude. Practical application of FIQT to Psychiatric Genetic Consortium schizophrenia cohort predicts a non-trivial fraction of sub-threshold signals which become significant in much larger supersamples. CONCLUSIONS: FIQT is a simple, yet accurate, WCA method for Z-scores (and ORs/RRs, via simple transformations). AVAILABILITY AND IMPLEMENTATION: A 10 lines R function implementation is available at https://github.com/bacanusa/FIQT CONTACT: sabacanu@vcu.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Estudio de Asociación del Genoma Completo , Polimorfismo de Nucleótido Simple , Sesgo , Interpretación Estadística de Datos , Humanos , Fenotipo
2.
Bioinformatics ; 32(2): 295-7, 2016 Jan 15.
Artículo en Inglés | MEDLINE | ID: mdl-26428293

RESUMEN

MOTIVATION: To increase detection power, gene level analysis methods are used to aggregate weak signals. To greatly increase computational efficiency, most methods use as input summary statistics from genome-wide association studies (GWAS). Subsequently, gene statistics are constructed using linkage disequilibrium (LD) patterns from a relevant reference panel. However, all methods, including our own Joint Effect on Phenotype of eQTL/functional single nucleotide polymorphisms (SNPs) associated with a Gene (JEPEG), assume homogeneous panels, e.g. European. However, this renders these tools unsuitable for the analysis of large cosmopolitan cohorts. RESULTS: We propose a JEPEG extension, JEPEGMIX, which similar to one of our software tools, Direct Imputation of summary STatistics of unmeasured SNPs from MIXed ethnicity cohorts, is capable of estimating accurate LD patterns for cosmopolitan cohorts. JEPEGMIX uses this accurate LD estimates to (i) impute the summary statistics at unmeasured functional variants and (ii) test for the joint effect of all measured and imputed functional variants which are associated with a gene. We illustrate the performance of our tool by analyzing the GWAS meta-analysis summary statistics from the multi-ethnic Psychiatric Genomics Consortium Schizophrenia stage 2 cohort. This practical application supports the immune system being one of the main drivers of the process leading to schizophrenia. AVAILABILITY AND IMPLEMENTATION: Software, annotation database and examples are available at http://dleelab.github.io/jepegmix/. CONTACT: donghyung.lee@vcuhealth.org SUPPLEMENTARY INFORMATION: Supplementary material is available at Bioinformatics online.


Asunto(s)
Etnicidad/genética , Pruebas Genéticas , Genética de Población , Polimorfismo de Nucleótido Simple/genética , Esquizofrenia/genética , Programas Informáticos , Estudios de Cohortes , Estudio de Asociación del Genoma Completo/métodos , Genómica/métodos , Humanos , Desequilibrio de Ligamiento , Fenotipo
3.
Bioinformatics ; 31(19): 3099-104, 2015 Oct 01.
Artículo en Inglés | MEDLINE | ID: mdl-26059716

RESUMEN

MOTIVATION: To increase the signal resolution for large-scale meta-analyses of genome-wide association studies, genotypes at unmeasured single nucleotide polymorphisms (SNPs) are commonly imputed using large multi-ethnic reference panels. However, the ever increasing size and ethnic diversity of both reference panels and cohorts makes genotype imputation computationally challenging for moderately sized computer clusters. Moreover, genotype imputation requires subject-level genetic data, which unlike summary statistics provided by virtually all studies, is not publicly available. While there are much less demanding methods which avoid the genotype imputation step by directly imputing SNP statistics, e.g. Directly Imputing summary STatistics (DIST) proposed by our group, their implicit assumptions make them applicable only to ethnically homogeneous cohorts. RESULTS: To decrease computational and access requirements for the analysis of cosmopolitan cohorts, we propose DISTMIX, which extends DIST capabilities to the analysis of mixed ethnicity cohorts. The method uses a relevant reference panel to directly impute unmeasured SNP statistics based only on statistics at measured SNPs and estimated/user-specified ethnic proportions. Simulations show that the proposed method adequately controls the Type I error rates. The 1000 Genomes panel imputation of summary statistics from the ethnically diverse Psychiatric Genetic Consortium Schizophrenia Phase 2 suggests that, when compared to genotype imputation methods, DISTMIX offers comparable imputation accuracy for only a fraction of computational resources. AVAILABILITY AND IMPLEMENTATION: DISTMIX software, its reference population data, and usage examples are publicly available at http://code.google.com/p/distmix. CONTACT: dlee4@vcu.edu SUPPLEMENTARY INFORMATION: Supplementary Data are available at Bioinformatics online.


Asunto(s)
Biología Computacional/métodos , Etnicidad/genética , Polimorfismo de Nucleótido Simple/genética , Programas Informáticos , Estadística como Asunto , Estudios de Cohortes , Simulación por Computador , Bases de Datos Genéticas , Estudio de Asociación del Genoma Completo , Humanos
4.
Schizophr Res ; 164(1-3): 181-6, 2015 May.
Artículo en Inglés | MEDLINE | ID: mdl-25778617

RESUMEN

Empirically derived phenotypic measurements have the potential to enhance gene-finding efforts in schizophrenia. Previous research based on factor analyses of symptoms has typically included schizoaffective cases. Deriving factor loadings from analysis of only narrowly defined schizophrenia cases could yield more sensitive factor scores for gene pathway and gene ontology analyses. Using an Irish family sample, this study 1) factor analyzed clinician-rated Operational Criteria Checklist items in cases with schizophrenia only, 2) scored the full sample based on these factor loadings, and 3) implemented genome-wide association, gene-based, and gene-pathway analysis of these SCZ-based symptom factors (final N=507). Three factors emerged from the analysis of the schizophrenia cases: a manic, a depressive, and a positive symptom factor. In gene-based analyses of these factors, multiple genes had q<0.01. Of particular interest are findings for PTPRG and WBP1L, both of which were previously implicated by the Psychiatric Genomics Consortium study of SCZ; results from this study suggest that variants in these genes might also act as modifiers of SCZ symptoms. Gene pathway analyses of the first factor indicated over-representation of glutamatergic transmission, GABA-A receptor, and cyclic GMP pathways. Results suggest that these pathways may have differential influence on affective symptom presentation in schizophrenia.


Asunto(s)
Redes Reguladoras de Genes/genética , Estudio de Asociación del Genoma Completo , Polimorfismo de Nucleótido Simple/genética , Trastornos Psicóticos/etiología , Trastornos Psicóticos/genética , Esquizofrenia/complicaciones , Análisis Factorial , Femenino , Genotipo , Humanos , Masculino , Esquizofrenia/diagnóstico
5.
Bioinformatics ; 31(8): 1176-82, 2015 Apr 15.
Artículo en Inglés | MEDLINE | ID: mdl-25505091

RESUMEN

MOTIVATION: Gene expression is influenced by variants commonly known as expression quantitative trait loci (eQTL). On the basis of this fact, researchers proposed to use eQTL/functional information univariately for prioritizing single nucleotide polymorphisms (SNPs) signals from genome-wide association studies (GWAS). However, most genes are influenced by multiple eQTLs which, thus, jointly affect any downstream phenotype. Therefore, when compared with the univariate prioritization approach, a joint modeling of eQTL action on phenotypes has the potential to substantially increase signal detection power. Nonetheless, a joint eQTL analysis is impeded by (i) not measuring all eQTLs in a gene and/or (ii) lack of access to individual genotypes. RESULTS: We propose joint effect on phenotype of eQTL/functional SNPs associated with a gene (JEPEG), a novel software tool which uses only GWAS summary statistics to (i) impute the summary statistics at unmeasured eQTLs and (ii) test for the joint effect of all measured and imputed eQTLs in a gene. We illustrate the behavior/performance of the developed tool by analysing the GWAS meta-analysis summary statistics from the Psychiatric Genomics Consortium Stage 1 and the Genetic Consortium for Anorexia Nervosa. CONCLUSIONS: Applied analyses results suggest that JEPEG complements commonly used univariate GWAS tools by: (i) increasing signal detection power via uncovering (a) novel genes or (b) known associated genes in smaller cohorts and (ii) assisting in fine-mapping of challenging regions, e.g. major histocompatibility complex for schizophrenia. AVAILABILITY AND IMPLEMENTATION: JEPEG, its associated database of eQTL SNPs and usage examples are publicly available at http://code.google.com/p/jepeg/. CONTACT: dlee4@vcu.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Anorexia Nerviosa/genética , Biomarcadores/análisis , Regulación de la Expresión Génica , Estudio de Asociación del Genoma Completo , Polimorfismo de Nucleótido Simple/genética , Sitios de Carácter Cuantitativo , Programas Informáticos , Estudios de Cohortes , Perfilación de la Expresión Génica , Genómica/métodos , Genotipo , Humanos , Metaanálisis como Asunto , Fenotipo
6.
Twin Res Hum Genet ; 17(3): 143-50, 2014 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-24739319

RESUMEN

With the dramatic technological developments of genome-wide association single-nucleotide polymorphism (SNP) chips and next generation sequencing, human geneticists now have the ability to assay genetic variation at ever-rarer allele frequencies. To fully understand the impact of these rare variants on common, complex diseases, we must be able to accurately assess their statistical significance. However, it is well established that classical association tests are not appropriate for the analysis of low-frequency variation, giving spurious findings when observed counts are too few. To further our understanding of the asymptotic properties of traditional association tests, we conducted a range of simulations of a typical rare variant (~1%) under the null hypothesis and tested the allelic χ2, Cochran-Armitage trend, Wald, and Fisher's exact tests. We demonstrate that rare variation shows marked deviation from the expected distributional behavior for each test, with fewer minor alleles corresponding to a greater degree of test statistics deflation. The effect becomes more pronounced at progressively smaller α levels. We also show that the Wald test is particularly deflated at α levels consistent with genome-wide association significance, much more so than the other association tests considered. In general, these classical association tests are inappropriate for the analysis of variants for which the minor allele is observed fewer than 80 times, largely irrespective of sample size.


Asunto(s)
Frecuencia de los Genes , Marcadores Genéticos , Variación Genética/genética , Modelos Estadísticos , Estudios de Casos y Controles , Simulación por Computador , Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Modelos Genéticos
7.
Schizophr Bull ; 40(1): 60-5, 2014 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-23970557

RESUMEN

BACKGROUND: Early descriptive work and controlled family and adoption studies support the hypothesis that a range of personality and nonschizophrenic psychotic disorders aggregate in families of schizophrenic probands. Can we validate, using molecular polygene scores from genome-wide association studies (GWAS), this schizophrenia spectrum? METHODS: The predictive value of polygenic findings reported by the Psychiatric GWAS Consortium (PGC) was applied to 4 groups of relatives from the Irish Study of High-Density Schizophrenia Families (ISHDSF; N = 836) differing on their assignment within the schizophrenia spectrum. Genome-wide single nucleotide polymorphism data for affected and unaffected relatives were used to construct per-individual polygene risk scores based on the PGC stage-I results. We compared mean polygene scores in the ISHDSF with mean scores in ethnically matched population controls (N = 929). RESULTS: The schizophrenia polygene score differed significantly across diagnostic categories and was highest in those with narrow schizophrenia spectrum, lowest in those with no psychiatric illness, and in-between in those classified in the intermediate, broad, and very broad schizophrenia spectrum. Relatives of all of these groups of affected subjects, including those with no diagnosis, had schizophrenia polygene scores significantly higher than the control sample. CONCLUSIONS: In the relatives of high-density families, the observed pattern of enrichment of molecular indices of schizophrenia risk suggests an underlying, continuous liability distribution and validates, using aggregate common risk alleles, a genetic basis for the schizophrenia spectrum disorders. In addition, as predicted by genetic theory, nonpsychotic members of multiply-affected schizophrenia families are significantly enriched for replicated, polygenic risk variants compared with the general population.


Asunto(s)
Predisposición Genética a la Enfermedad/genética , Estudio de Asociación del Genoma Completo , Herencia Multifactorial/genética , Linaje , Esquizofrenia/genética , Genotipo , Humanos , Irlanda/epidemiología , Irlanda del Norte/epidemiología , Polimorfismo de Nucleótido Simple/genética , Valor Predictivo de las Pruebas , Esquizofrenia/clasificación , Esquizofrenia/epidemiología
8.
Am J Med Genet B Neuropsychiatr Genet ; 162B(8): 898-906, 2013 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-24123842

RESUMEN

BACKGROUND: Prior genome-scans of bipolar disorder have revealed chromosome 6q22 as a promising candidate region. However, linkage disequilibrium (LD) mapping studies have yet to identify replicated susceptibility loci. METHODS: We analyzed 1,422 LD-tagging single nucleotide polymorphisms (SNPs) in 83 genes to test single-marker and locus-wide evidence of association with bipolar disorder in the NIMH Genetics Initiative bipolar pedigrees and the Portuguese Island Collection (PIC) (N = 1,093 in 528 informative pairs). Both studies previously demonstrated significant evidence of linkage to 6q. SNPs were genotyped using an Illumina iSelect genotyping array which employs the Infinium assay. Evidence of single-marker association was assessed using the generalized disequilibrium test (GDT). Empirical estimates of gene-wide significance were obtained by permutation (via 100,000 gene-dropping simulations) of Fisher's combined test of P-values for each locus. RESULTS: No single variant yielded significant experiment-wide evidence of association, for either the combined sample or in each subsample. Our gene-dropping simulations identified nominally significant gene-wide associations with multiple loci, of which NT5DC1 in the NIMH subsample and CCNC in the PIC were the strongest candidates. However, no one gene consistently exceeded empirical significance criteria in both independent samples or survived Bonferroni correction for the number of genes tested. CONCLUSIONS: Using a gene-based approach to family-based association, we identified gene-wide associations with several genes, though no single locus was significantly associated with bipolar disorder in both cohorts. This suggests that chromosome 6q may harbor multiple susceptibility loci or that complex patterns of LD in this region may confound approaches based on common SNPs. Published 2013. This article is a U.S. Government work and is in the public domain in the USA.


Asunto(s)
Algoritmos , Trastorno Bipolar/genética , Cromosomas Humanos Par 6/genética , Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo , Marcadores Genéticos , Humanos , Desequilibrio de Ligamiento/genética , Portugal
9.
PLoS One ; 8(7): e67776, 2013.
Artículo en Inglés | MEDLINE | ID: mdl-23922650

RESUMEN

Integrating evidence from multiple domains is useful in prioritizing disease candidate genes for subsequent testing. We ranked all known human genes (n=3819) under linkage peaks in the Irish Study of High-Density Schizophrenia Families using three different evidence domains: 1) a meta-analysis of microarray gene expression results using the Stanley Brain collection, 2) a schizophrenia protein-protein interaction network, and 3) a systematic literature search. Each gene was assigned a domain-specific p-value and ranked after evaluating the evidence within each domain. For comparison to this ranking process, a large-scale candidate gene hypothesis was also tested by including genes with Gene Ontology terms related to neurodevelopment. Subsequently, genotypes of 3725 SNPs in 167 genes from a custom Illumina iSelect array were used to evaluate the top ranked vs. hypothesis selected genes. Seventy-three genes were both highly ranked and involved in neurodevelopment (category 1) while 42 and 52 genes were exclusive to neurodevelopment (category 2) or highly ranked (category 3), respectively. The most significant associations were observed in genes PRKG1, PRKCE, and CNTN4 but no individual SNPs were significant after correction for multiple testing. Comparison of the approaches showed an excess of significant tests using the hypothesis-driven neurodevelopment category. Random selection of similar sized genes from two independent genome-wide association studies (GWAS) of schizophrenia showed the excess was unlikely by chance. In a further meta-analysis of three GWAS datasets, four candidate SNPs reached nominal significance. Although gene ranking using integrated sources of prior information did not enrich for significant results in the current experiment, gene selection using an a priori hypothesis (neurodevelopment) was superior to random selection. As such, further development of gene ranking strategies using more carefully selected sources of information is warranted.


Asunto(s)
Algoritmos , Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo , Modelos Genéticos , Sistema Nervioso/crecimiento & desarrollo , Sistema Nervioso/patología , Esquizofrenia/genética , Bases de Datos Genéticas , Humanos , Metaanálisis como Asunto , Polimorfismo de Nucleótido Simple/genética , Edición
10.
Bioinformatics ; 29(22): 2925-7, 2013 Nov 15.
Artículo en Inglés | MEDLINE | ID: mdl-23990413

RESUMEN

MOTIVATION: Genotype imputation methods are used to enhance the resolution of genome-wide association studies, and thus increase the detection rate for genetic signals. Although most studies report all univariate summary statistics, many of them limit the access to subject-level genotypes. Because such an access is required by all genotype imputation methods, it is helpful to develop methods that impute summary statistics without going through the interim step of imputing genotypes. Even when subject-level genotypes are available, due to the substantial computational cost of the typical genotype imputation, there is a need for faster imputation methods. RESULTS: Direct Imputation of summary STatistics (DIST) imputes the summary statistics of untyped variants without first imputing their subject-level genotypes. This is achieved by (i) using the conditional expectation formula for multivariate normal variates and (ii) using the correlation structure from a relevant reference population. When compared with genotype imputation methods, DIST (i) requires only a fraction of their computational resources, (ii) has comparable imputation accuracy for independent subjects and (iii) is readily applicable to the imputation of association statistics coming from large pedigree data. Thus, the proposed application is useful for a fast imputation of summary results for (i) studies of unrelated subjects, which (a) do not provide subject-level genotypes or (b) have a large size and (ii) family association studies. AVAILABILITY AND IMPLEMENTATION: Pre-compiled executables built under commonly used operating systems are publicly available at http://code.google.com/p/dist/. CONTACT: dlee4@vcu.edu .


Asunto(s)
Polimorfismo de Nucleótido Simple , Programas Informáticos , Interpretación Estadística de Datos , Genoma Humano , Técnicas de Genotipaje , Humanos
12.
PLoS One ; 6(12): e21440, 2011.
Artículo en Inglés | MEDLINE | ID: mdl-22220189

RESUMEN

BACKGROUND: Prior genomewide scans of schizophrenia support evidence of linkage to regions of chromosome 20. However, association analyses have yet to provide support for any etiologically relevant variants. METHODS: We analyzed 2988 LD-tagging single nucleotide polymorphisms (SNPs) in 327 genes on chromosome 20, to test for association with schizophrenia in 270 Irish high-density families (ISHDSF, N = 270 families, 1408 subjects). These SNPs were genotyped using an Illumina iSelect genotyping array which employs the Infinium assay. Given a previous report of novel linkage with chromosome 20p using latent classes of psychotic illness in this sample, association analysis was also conducted for each of five factor-derived scores based on the Operational Criteria Checklist for Psychotic Illness (delusions, hallucinations, mania, depression, and negative symptoms). Tests of association were conducted using the PDTPHASE and QPDTPHASE packages of UNPHASED. Empirical estimates of gene-wise significance were obtained by adaptive permutation of a) the smallest observed P-value and b) the threshold-truncated product of P-values for each locus. RESULTS: While no single variant was significant after LD-corrected Bonferroni-correction, our gene-dropping analyses identified loci which exceeded empirical significance criteria for both gene-based tests. Namely, R3HDML and C20orf39 are significantly associated with depressive symptoms of schizophrenia (P(emp)<2×10⁻5) based on the minimum P-value and truncated-product methods, respectively. CONCLUSIONS: Using a gene-based approach to family-based association, R3HDML and C20orf39 were found to be significantly associated with clinical dimensions of schizophrenia. These findings demonstrate the efficacy of gene-based analysis and support previous evidence that chromosome 20 may harbor schizophrenia susceptibility or modifier loci.


Asunto(s)
Cromosomas Humanos Par 20/genética , Depresión/genética , Estudios de Asociación Genética , Ligamiento Genético/genética , Sitios Genéticos/genética , Predisposición Genética a la Enfermedad , Trastornos Psicóticos/genética , Simulación por Computador , Depresión/complicaciones , Depresión/diagnóstico , Femenino , Marcadores Genéticos , Humanos , Desequilibrio de Ligamiento/genética , Masculino , Linaje , Polimorfismo de Nucleótido Simple/genética , Trastornos Psicóticos/complicaciones , Trastornos Psicóticos/diagnóstico , Factores de Riesgo
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...