RESUMO
Lithium is the mainstay prophylactic treatment for bipolar disorder (BD), but treatment response varies considerably across individuals. Patients who respond well to lithium treatment might represent a relatively homogeneous subtype of this genetically and phenotypically diverse disorder. Here, we performed genome-wide association studies (GWAS) to identify (i) specific genetic variations influencing lithium response and (ii) genetic variants associated with risk for lithium-responsive BD. Patients with BD and controls were recruited from Sweden and the United Kingdom. GWAS were performed on 2698 patients with subjectively defined (self-reported) lithium response and 1176 patients with objectively defined (clinically documented) lithium response. We next conducted GWAS comparing lithium responders with healthy controls (1639 subjective responders and 8899 controls; 323 objective responders and 6684 controls). Meta-analyses of Swedish and UK results revealed no significant associations with lithium response within the bipolar subjects. However, when comparing lithium-responsive patients with controls, two imputed markers attained genome-wide significant associations, among which one was validated in confirmatory genotyping (rs116323614, P=2.74 × 10(-8)). It is an intronic single-nucleotide polymorphism (SNP) on chromosome 2q31.2 in the gene SEC14 and spectrin domains 1 (SESTD1), which encodes a protein involved in regulation of phospholipids. Phospholipids have been strongly implicated as lithium treatment targets. Furthermore, we estimated the proportion of variance for lithium-responsive BD explained by common variants ('SNP heritability') as 0.25 and 0.29 using two definitions of lithium response. Our results revealed a genetic variant in SESTD1 associated with risk for lithium-responsive BD, suggesting that the understanding of BD etiology could be furthered by focusing on this subtype of BD.
Assuntos
Transtorno Bipolar/genética , Proteínas de Transporte/genética , Adulto , Antimaníacos/uso terapêutico , Biomarcadores Farmacológicos/sangue , Transtorno Bipolar/metabolismo , Proteínas de Transporte/metabolismo , Feminino , Predisposição Genética para Doença/genética , Variação Genética , Estudo de Associação Genômica Ampla/métodos , Genótipo , Humanos , Lítio/metabolismo , Lítio/uso terapêutico , Compostos de Lítio/uso terapêutico , Masculino , Pessoa de Meia-Idade , Polimorfismo de Nucleotídeo Único/genética , Fatores de Risco , Autorrelato , Suécia , Reino UnidoRESUMO
Late-onset Alzheimer's disease (AD) is 50-70% heritable with complex genetic underpinnings. In addition to Apoliprotein E (APOE) ε4, the major genetic risk factor, recent genome-wide association studies (GWAS) have identified a growing list of sequence variations associated with the disease. Building on a prior large-scale AD GWAS, we used a recently developed analytic method to compute a polygenic score that involves up to 26 independent common sequence variants and is associated with AD dementia, above and beyond APOE. We then examined the associations between the polygenic score and the magnetic resonance imaging-derived thickness measurements across AD-vulnerable cortex in clinically normal (CN) human subjects (N = 104). AD-specific cortical thickness was correlated with the polygenic risk score, even after controlling for APOE genotype and cerebrospinal fluid (CSF) levels of ß-amyloid (Aß(1-42)). Furthermore, the association remained significant in CN subjects with levels of CSF Aß(1-)(42) in the normal range and in APOE ε3 homozygotes. The observation that genetic risk variants are associated with thickness across AD-vulnerable regions of interest in CN older individuals, suggests that the combination of polygenic risk profile, neuroimaging, and CSF biomarkers may hold synergistic potential to aid in the prediction of future cognitive decline.
Assuntos
Doença de Alzheimer/genética , Doença de Alzheimer/patologia , Córtex Cerebral/patologia , Idoso , Peptídeos beta-Amiloides/genética , Peptídeos beta-Amiloides/metabolismo , Apolipoproteínas E/genética , Bases de Dados Genéticas , Progressão da Doença , Feminino , Estudo de Associação Genômica Ampla , Genótipo , Humanos , Imageamento por Ressonância Magnética , Masculino , Doenças Neurodegenerativas/genética , Doenças Neurodegenerativas/patologia , Neuroimagem , Fragmentos de Peptídeos/genética , Polimorfismo de Nucleotídeo Único/genética , Valores de ReferênciaRESUMO
Genome-wide association studies (GWAS) have identified several susceptibility loci for bipolar disorder (BP), most notably ANK3. However, most of the inherited risk for BP remains unexplained. One reason for the limited success may be the genetic heterogeneity of BP. Clinical sub-phenotypes of BP may identify more etiologically homogeneous subsets of patients, which can be studied with increased power to detect genetic variation. Here, we report on a mega-analysis of two widely studied sub-phenotypes of BP, age at onset and psychotic symptoms, which are familial and clinically significant. We combined data from three GWAS: NIMH Bipolar Disorder Genetic Association Information Network (GAIN-BP), NIMH Bipolar Disorder Genome Study (BiGS), and a German sample. The combined sample consisted of 2,836 BP cases with information on sub-phenotypes and 2,744 controls. Imputation was performed, resulting in 2.3 million SNPs available for analysis. No SNP reached genome-wide significance for either sub-phenotype. In addition, no SNP reached genome-wide significance in a meta-analysis with an independent replication sample. We had 80% power to detect associations with a common SNP at an OR of 1.6 for psychotic symptoms and a mean difference of 1.8 years in age at onset. Age at onset and psychotic symptoms in BP may be influenced by many genes of smaller effect sizes or other variants not measured well by SNP arrays, such as rare alleles.
Assuntos
Transtorno Bipolar/epidemiologia , Transtorno Bipolar/genética , Estudo de Associação Genômica Ampla , Transtornos Psicóticos/epidemiologia , Transtornos Psicóticos/genética , Idade de Início , Transtorno Bipolar/complicações , Demografia , Feminino , Alemanha/epidemiologia , Humanos , Masculino , Transtornos Psicóticos/complicações , Adulto JovemRESUMO
MOTIVATION: Identifying single nucleotide polymorphisms (SNPs) that underlie common and complex human diseases, such as cancer, is of major interest in current molecular epidemiology. Nevertheless, the tremendous number of SNPs on the human genome requires computational methods for prioritizing SNPs according to their potentially deleterious effects to human health, and as such, for expediting genotyping and analysis. As of yet, little has been done to quantitatively assess the possible deleterious effects of SNPs for effective association studies. RESULTS: We propose a new integrative scoring system for prioritizing SNPs based on their possible deleterious effects within a probabilistic framework. We applied our system to 580 disease-susceptibility genes obtained from the OMIM (Online Mendelian Inheritance in Man) database, which is one of the most widely used databases of human genes and genetic disorders. The scoring results clearly show that the distribution of the functional significance (FS) scores for already known disease-related SNPs is significantly different from that of neutral SNPs. In addition, we summarize distinct features of potentially deleterious SNPs based on their FS score, such as functional genomic regions where they occur or bio-molecular functions that they mainly affect. We also demonstrate, through a comparative study, that our system improves upon other function-assessment systems for SNPs, by assigning significantly higher FS scores to already known disease-related SNPs than to neutral SNPs.
Assuntos
Biologia Computacional/métodos , Polimorfismo de Nucleotídeo Único , Genoma Humano , HumanosRESUMO
The Functional Single Nucleotide Polymorphism (F-SNP) database integrates information obtained from 16 bioinformatics tools and databases about the functional effects of SNPs. These effects are predicted and indicated at the splicing, transcriptional, translational and post-translational level. As such, the database helps identify and focus on SNPs with potential deleterious effect to human health. In particular, users can retrieve SNPs that disrupt genomic regions known to be functional, including splice sites and transcriptional regulatory regions. Users can also identify non-synonymous SNPs that may have deleterious effects on protein structure or function, interfere with protein translation or impede post-translational modification. A web interface enables easy navigation for obtaining information through multiple starting points and exploration routes (e.g. starting from SNP identifier, genomic region, gene or target disease). The F-SNP database is available at http://compbio.cs.queensu.ca/F-SNP/.
Assuntos
Bases de Dados de Ácidos Nucleicos , Doenças Genéticas Inatas/genética , Polimorfismo de Nucleotídeo Único , Biologia Computacional , Predisposição Genética para Doença , Genômica , Humanos , Internet , Proteínas/genética , Software , Interface Usuário-ComputadorRESUMO
Selecting a representative set of single nucleotide polymorphism (SNP) markers for facilitating association studies is an important step to uncover the genetic basis of human disease. Tag SNP selection and functional SNP selection are the two main approaches for addressing the SNP selection problem. However, little was done so far to effectively combine these distinct and possibly competing approaches. Here, we present a new multiobjective optimization framework for identifying SNPs that are both informative tagging and have functional significance (FS). Our selection algorithm is based on the notion of Pareto optimality, which has been extensively used for addressing multiobjective optimization problems in game theory, economics, and engineering. We applied our method to 34 disease-susceptibility genes for lung cancer and compared the performance with that of other systems which support both tag SNP selection and functional SNP selection methods. The comparison shows that our algorithm always finds a subset of SNPs that improves upon the subset selected by other state-of-the-art systems with respect to both selection objectives.
Assuntos
Polimorfismo de Nucleotídeo Único , Algoritmos , Biologia Computacional , Predisposição Genética para Doença , Humanos , Desequilíbrio de Ligação , Neoplasias Pulmonares/genéticaRESUMO
Genetic variation analysis holds much promise as a basis for disease-gene association. However, due to the tremendous number of candidate single nucleotide polymorphisms (SNPs), there is a clear need to expedite genotyping by selecting and considering only a subset of all SNPs. This process is known as tagging SNP selection. Several methods for tagging SNP selection have been proposed, and have shown promising results. However, most of them rely on strong assumptions such as prior block-partitioning, bi-allelic SNPs, or a fixed number or location of tagging SNPs. We introduce BNTagger, a new method for tagging SNP selection, based on conditional independence among SNPs. Using the formalism of Bayesian networks (BNs), our system aims to select a subset of independent and highly predictive SNPs. Similar to previous prediction-based methods, we aim to maximize the prediction accuracy of tagging SNPs, but unlike them, we neither fix the number nor the location of predictive tagging SNPs, nor require SNPs to be bi-allelic. In addition, for newly-genotyped samples, BNTagger directly uses genotype data as input, while producing as output haplotype data of all SNPs. Using three public data sets, we compare the prediction performance of our method to that of three state-of-the-art tagging SNP selection methods. The results demonstrate that our method consistently improves upon previous methods in terms of prediction accuracy. Moreover, our method retains its good performance even when a very small number of tagging SNPs are used.
Assuntos
Algoritmos , Análise Mutacional de DNA/métodos , Etiquetas de Sequências Expressas , Polimorfismo de Nucleotídeo Único/genética , Alinhamento de Sequência/métodos , Análise de Sequência de DNA/métodos , Software , Sequência de Bases , Teorema de Bayes , Modelos Logísticos , Dados de Sequência Molecular , Reconhecimento Automatizado de Padrão/métodosRESUMO
Identifying single nucleotide polymorphisms (SNPs) that are responsible for common and complex diseases, such as cancer, is of major interest in current molecular epidemiology. However, due to the tremendous number of SNPs on the human genome, to expedite genotyping and analysis, there is a clear need to prioritize SNPs according to their potentially deleterious effects to human health. As of yet, there have been few efforts to quantitatively assess the possible deleterious effects of SNPs for effective association studies. Here we propose a new integrative scoring system for prioritizing SNPs based on their possible deleterious effects in a probabilistic framework. We also provide the evaluation result of our system on the OMIM (Online Mendelian Inheritance in Man) database, which is one of the most widely-used databases of human genes and genetic disorders.
Assuntos
Mapeamento Cromossômico/métodos , Análise Mutacional de DNA/métodos , Predisposição Genética para Doença/epidemiologia , Predisposição Genética para Doença/genética , Polimorfismo de Nucleotídeo Único/genética , Modelos de Riscos Proporcionais , Análise de Sequência de DNA/métodos , Humanos , Incidência , Medição de Risco/métodos , Fatores de RiscoRESUMO
Pervasive Developmental Disorders (PDD) are neurodevelopmental disorders characterized by impairments in social interaction, communication and behavior. Given the diversity and varying severity of PDD, diagnostic tools attempt to identify homogeneous subtypes within PDD. Identifying subtypes can lead to targeted etiology studies and to effective type-specific intervention. Cluster analysis can suggest coherent subsets in data; however, different methods and assumptions lead to different results. Several previous studies applied clustering to PDD data, varying in number and characteristics of the produced subtypes. Most studies used a relatively small dataset (fewer than 150 subjects), and all applied only a single clustering method. Here we study a relatively large dataset (358 PDD patients), using an ensemble of three clustering methods. The results are evaluated using several validation methods, and consolidated through an integration step. Four clusters are identified, analyzed and compared to subtypes previously defined by the widely used diagnostic tool DSM-IV.
Assuntos
Transtornos Globais do Desenvolvimento Infantil/classificação , Análise por Conglomerados , Teorema de Bayes , Criança , Transtornos Globais do Desenvolvimento Infantil/diagnóstico , Manual Diagnóstico e Estatístico de Transtornos Mentais , HumanosRESUMO
MOTIVATION: Inferring the genetic interaction mechanism using Bayesian networks has recently drawn increasing attention due to its well-established theoretical foundation and statistical robustness. However, the relative insufficiency of experiments with respect to the number of genes leads to many false positive inferences. RESULTS: We propose a novel method to infer genetic networks by alleviating the shortage of available mRNA expression data with prior knowledge. We call the proposed method 'modularized network learning' (MONET). Firstly, the proposed method divides a whole gene set to overlapped modules considering biological annotations and expression data together. Secondly, it infers a Bayesian network for each module, and integrates the learned subnetworks to a global network. An algorithm that measures a similarity between genes based on hierarchy, specificity and multiplicity of biological annotations is presented. The proposed method draws a global picture of inter-module relationships as well as a detailed look of intra-module interactions. We applied the proposed method to analyze Saccharomyces cerevisiae stress data, and found several hypotheses to suggest putative functions of unclassified genes. We also compared the proposed method with a whole-set-based approach and two expression-based clustering approaches.