Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 22
Filtrar
1.
Nat Neurosci ; 22(9): 1521-1532, 2019 09.
Artigo em Inglês | MEDLINE | ID: mdl-31455884

RESUMO

One fundamental but understudied mechanism of gene regulation in disease is allele-specific expression (ASE), the preferential expression of one allele. We leveraged RNA-sequencing data from human brain to assess ASE in autism spectrum disorder (ASD). When ASE is observed in ASD, the allele with lower population frequency (minor allele) is preferentially more highly expressed than the major allele, opposite to the canonical pattern. Importantly, genes showing ASE in ASD are enriched in those downregulated in ASD postmortem brains and in genes harboring de novo mutations in ASD. Two regions, 14q32 and 15q11, containing all known orphan C/D box small nucleolar RNAs (snoRNAs), are particularly enriched in shifts to higher minor allele expression. We demonstrate that this allele shifting enhances snoRNA-targeted splicing changes in ASD-related target genes in idiopathic ASD and 15q11-q13 duplication syndrome. Together, these results implicate allelic imbalance and dysregulation of orphan C/D box snoRNAs in ASD pathogenesis.


Assuntos
Desequilíbrio Alélico/genética , Transtorno do Espectro Autista/genética , Encéfalo , Transcriptoma/genética , Alelos , Perfilação da Expressão Gênica , Humanos , RNA Nucleolar Pequeno/genética
2.
Genetics ; 209(3): 685-698, 2018 07.
Artigo em Inglês | MEDLINE | ID: mdl-29752291

RESUMO

Over the past few years, genome-wide association studies have identified many trait-associated loci that have different effects on females and males, which increased attention to the genetic architecture differences between the sexes. The between-sex differences in genetic architectures can cause a variety of phenomena such as differences in the effect sizes at trait-associated loci, differences in the magnitudes of polygenic background effects, and differences in the phenotypic variances. However, current association testing approaches for dealing with sex, such as including sex as a covariate, cannot fully account for these phenomena and can be suboptimal in statistical power. We present a novel association mapping framework, MetaSex, that can comprehensively account for the genetic architecture differences between the sexes. Through simulations and applications to real data, we show that our framework has superior performance than previous approaches in association mapping.


Assuntos
Mapeamento Cromossômico/métodos , Biologia Computacional/métodos , Estudo de Associação Genômica Ampla/métodos , Caracteres Sexuais , Algoritmos , Feminino , Humanos , Masculino , Herança Multifatorial , Locos de Características Quantitativas
3.
Genet Epidemiol ; 42(1): 49-63, 2018 02.
Artigo em Inglês | MEDLINE | ID: mdl-29114909

RESUMO

BACKGROUND: Epistasis and gene-environment interactions are known to contribute significantly to variation of complex phenotypes in model organisms. However, their identification in human association studies remains challenging for myriad reasons. In the case of epistatic interactions, the large number of potential interacting sets of genes presents computational, multiple hypothesis correction, and other statistical power issues. In the case of gene-environment interactions, the lack of consistently measured environmental covariates in most disease studies precludes searching for interactions and creates difficulties for replicating studies. RESULTS: In this work, we develop a new statistical approach to address these issues that leverages genetic ancestry, defined as the proportion of ancestry derived from each ancestral population (e.g., the fraction of European/African ancestry in African Americans), in admixed populations. We applied our method to gene expression and methylation data from African American and Latino admixed individuals, respectively, identifying nine interactions that were significant at P<5×10-8. We show that two of the interactions in methylation data replicate, and the remaining six are significantly enriched for low P-values (P<1.8×10-6). CONCLUSION: We show that genetic ancestry can be a useful proxy for unknown and unmeasured covariates in the search for interaction effects. These results have important implications for our understanding of the genetic architecture of complex traits.


Assuntos
População Negra/genética , Negro ou Afro-Americano/genética , Epistasia Genética/genética , Interação Gene-Ambiente , Hispânico ou Latino/genética , Modelos Genéticos , População Branca/genética , Metilação de DNA , Humanos , Fenótipo
4.
Bioinformatics ; 33(14): i67-i74, 2017 Jul 15.
Artigo em Inglês | MEDLINE | ID: mdl-28881962

RESUMO

MOTIVATION: There is recent interest in using gene expression data to contextualize findings from traditional genome-wide association studies (GWAS). Conditioned on a tissue, expression quantitative trait loci (eQTLs) are genetic variants associated with gene expression, and eGenes are genes whose expression levels are associated with genetic variants. eQTLs and eGenes provide great supporting evidence for GWAS hits and important insights into the regulatory pathways involved in many diseases. When a significant variant or a candidate gene identified by GWAS is also an eQTL or eGene, there is strong evidence to further study this variant or gene. Multi-tissue gene expression datasets like the Gene Tissue Expression (GTEx) data are used to find eQTLs and eGenes. Unfortunately, these datasets often have small sample sizes in some tissues. For this reason, there have been many meta-analysis methods designed to combine gene expression data across many tissues to increase power for finding eQTLs and eGenes. However, these existing techniques are not scalable to datasets containing many tissues, like the GTEx data. Furthermore, these methods ignore a biological insight that the same variant may be associated with the same gene across similar tissues. RESULTS: We introduce a meta-analysis model that addresses these problems in existing methods. We focus on the problem of finding eGenes in gene expression data from many tissues, and show that our model is better than other types of meta-analyses. AVAILABILITY AND IMPLEMENTATION: Source code is at https://github.com/datduong/RECOV . CONTACT: eeskin@cs.ucla.edu or datdb@cs.ucla.edu. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Biologia Computacional/métodos , Variação Genética , Locos de Características Quantitativas , Software , Perfilação da Expressão Gênica/métodos , Estudo de Associação Genômica Ampla/métodos , Humanos , Metanálise como Assunto , Modelos Genéticos
5.
Proc Natl Acad Sci U S A ; 114(38): 10166-10171, 2017 09 19.
Artigo em Inglês | MEDLINE | ID: mdl-28874526

RESUMO

Prediction of human physical traits and demographic information from genomic data challenges privacy and data deidentification in personalized medicine. To explore the current capabilities of phenotype-based genomic identification, we applied whole-genome sequencing, detailed phenotyping, and statistical modeling to predict biometric traits in a cohort of 1,061 participants of diverse ancestry. Individually, for a large fraction of the traits, their predictive accuracy beyond ancestry and demographic information is limited. However, we have developed a maximum entropy algorithm that integrates multiple predictions to determine which genomic samples and phenotype measurements originate from the same person. Using this algorithm, we have reidentified an average of >8 of 10 held-out individuals in an ethnically mixed cohort and an average of 5 of either 10 African Americans or 10 Europeans. This work challenges current conceptions of personal privacy and may have far-reaching ethical and legal implications.


Assuntos
Confidencialidade , Impressões Digitais de DNA , Modelos Genéticos , Fenótipo , Sequenciamento Completo do Genoma , Adulto , Fatores Etários , Algoritmos , Tamanho Corporal , Estudos de Coortes , Anonimização de Dados , Feminino , Humanos , Masculino , Pessoa de Meia-Idade , Pigmentação/genética , Adulto Jovem
6.
G3 (Bethesda) ; 7(8): 2545-2558, 2017 08 07.
Artigo em Inglês | MEDLINE | ID: mdl-28620084

RESUMO

Epilepsy has many causes and comorbidities affecting as many as 4% of people in their lifetime. Both idiopathic and symptomatic epilepsies are highly heritable, but genetic factors are difficult to characterize among humans due to complex disease etiologies. Rodent genetic studies have been critical to the discovery of seizure susceptibility loci, including Kcnj10 mutations identified in both mouse and human cohorts. However, genetic analyses of epilepsy phenotypes in mice to date have been carried out as acute studies in seizure-naive animals or in Mendelian models of epilepsy, while humans with epilepsy have a history of recurrent seizures that also modify brain physiology. We have applied a repeated seizure model to a genetic reference population, following seizure susceptibility over a 36-d period. Initial differences in generalized seizure threshold among the Hybrid Mouse Diversity Panel (HMDP) were associated with a well-characterized seizure susceptibility locus found in mice: Seizure susceptibility 1 Remarkably, Szs1 influence diminished as subsequent induced seizures had diminishing latencies in certain HMDP strains. Administration of eight seizures, followed by an incubation period and an induced retest seizure, revealed novel associations within the calmodulin-binding transcription activator 1, Camta1 Using systems genetics, we have identified four candidate genes that are differentially expressed between seizure-sensitive and -resistant strains close to our novel Epileptogenesis susceptibility factor 1 (Esf1) locus that may act individually or as a coordinated response to the neuronal stress of seizures.


Assuntos
Epilepsia/genética , Loci Gênicos , Predisposição Genética para Doença , Variação Genética , Convulsões/genética , Alelos , Animais , Encéfalo/metabolismo , Encéfalo/patologia , Cromossomos de Mamíferos/genética , Cruzamentos Genéticos , Modelos Animais de Doenças , Epistasia Genética , Feminino , Flurotila , Estudo de Associação Genômica Ampla , Excitação Neurológica/genética , Masculino , Camundongos , Polimorfismo de Nucleotídeo Único/genética , Locos de Características Quantitativas/genética , Análise de Regressão
7.
Genetics ; 204(4): 1379-1390, 2016 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-27770036

RESUMO

A typical genome-wide association study tests correlation between a single phenotype and each genotype one at a time. However, single-phenotype analysis might miss unmeasured aspects of complex biological networks. Analyzing many phenotypes simultaneously may increase the power to capture these unmeasured aspects and detect more variants. Several multivariate approaches aim to detect variants related to more than one phenotype, but these current approaches do not consider the effects of population structure. As a result, these approaches may result in a significant amount of false positive identifications. Here, we introduce a new methodology, referred to as GAMMA for generalized analysis of molecular variance for mixed-model analysis, which is capable of simultaneously analyzing many phenotypes and correcting for population structure. In a simulated study using data implanted with true genetic effects, GAMMA accurately identifies these true effects without producing false positives induced by population structure. In simulations with this data, GAMMA is an improvement over other methods which either fail to detect true effects or produce many false positive identifications. We further apply our method to genetic studies of yeast and gut microbiome from mice and show that GAMMA identifies several variants that are likely to have true biological mechanisms.


Assuntos
Algoritmos , Estudo de Associação Genômica Ampla/métodos , Fenótipo , Animais , Humanos , Camundongos , Polimorfismo de Nucleotídeo Único , População/genética , Sensibilidade e Especificidade , Leveduras/genética
8.
Genetics ; 204(3): 1057-1064, 2016 11.
Artigo em Inglês | MEDLINE | ID: mdl-27765809

RESUMO

The study of the genetics of gene expression is of considerable importance to understanding the nature of common, complex diseases. The most widely applied approach to identifying relationships between genetic variation and gene expression is the expression quantitative trait loci (eQTL) approach. Here, we increased the computational power of eQTL with an alternative and complementary approach based on analyzing allele specific expression (ASE). We designed a novel analytical method to identify cis-acting regulatory variants based on genome sequencing and measurements of ASE from RNA-sequencing (RNA-seq) data. We evaluated the power and resolution of our method using simulated data. We then applied the method to map regulatory variants affecting gene expression in lymphoblastoid cell lines (LCLs) from 77 unrelated northern and western European individuals (CEU), which were part of the HapMap project. A total of 2309 SNPs were identified as being associated with ASE patterns. The SNPs associated with ASE were enriched within promoter regions and were significantly more likely to signal strong evidence for a regulatory role. Finally, among the candidate regulatory SNPs, we identified 108 SNPs that were previously associated with human immune diseases. With further improvements in quantifying ASE from RNA-seq, the application of our method to other datasets is expected to accelerate our understanding of the biological basis of common diseases.


Assuntos
Estudo de Associação Genômica Ampla/métodos , Polimorfismo de Nucleotídeo Único , Transcriptoma , Algoritmos , Alelos , Linhagem Celular Tumoral , Europa (Continente) , Projeto HapMap , Humanos , Doenças do Sistema Imunitário/genética , Regiões Promotoras Genéticas , Locos de Características Quantitativas , População Branca/genética
9.
PLoS Genet ; 12(9): e1006303, 2016 09.
Artigo em Inglês | MEDLINE | ID: mdl-27631375

RESUMO

Malaria transmission is dependent on the propensity of Anopheles mosquitoes to bite humans (anthropophily) instead of other dead end hosts. Recent increases in the usage of Long Lasting Insecticide Treated Nets (LLINs) in Africa have been associated with reductions in highly anthropophilic and endophilic vectors such as Anopheles gambiae s.s., leaving species with a broader host range, such as Anopheles arabiensis, as the most prominent remaining source of transmission in many settings. An. arabiensis appears to be more of a generalist in terms of its host choice and resting behavior, which may be due to phenotypic plasticity and/or segregating allelic variation. To investigate the genetic basis of host choice and resting behavior in An. arabiensis we sequenced the genomes of 23 human-fed and 25 cattle-fed mosquitoes collected both in-doors and out-doors in the Kilombero Valley, Tanzania. We identified a total of 4,820,851 SNPs, which were used to conduct the first genome-wide estimates of "SNP heritability" for host choice and resting behavior in this species. A genetic component was detected for host choice (human vs cow fed; permuted P = 0.002), but there was no evidence of a genetic component for resting behavior (indoors versus outside; permuted P = 0.465). A principal component analysis (PCA) segregated individuals based on genomic variation into three groups which were characterized by differences at the 2Rb and/or 3Ra paracentromeric chromosome inversions. There was a non-random distribution of cattle-fed mosquitoes between the PCA clusters, suggesting that alleles linked to the 2Rb and/or 3Ra inversions may influence host choice. Using a novel inversion genotyping assay, we detected a significant enrichment of the standard arrangement (non-inverted) of 3Ra among cattle-fed mosquitoes (N = 129) versus all non-cattle-fed individuals (N = 234; χ2, p = 0.007). Thus, tracking the frequency of the 3Ra in An. arabiensis populations may be of use to infer selection on host choice behavior within these vector populations; possibly in response to vector control. Controlled host-choice assays are needed to discern whether the observed genetic component has a direct relationship with innate host preference. A better understanding of the genetic basis for host feeding behavior in An. arabiensis may also open avenues for novel vector control strategies based on driving genes for zoophily into wild mosquito populations.


Assuntos
Anopheles/genética , Interações Hospedeiro-Patógeno/genética , Insetos Vetores/genética , Malária/genética , África , Animais , Anopheles/parasitologia , Comportamento Animal/fisiologia , Bovinos , Genótipo , Humanos , Insetos Vetores/parasitologia , Inseticidas/uso terapêutico , Malária/epidemiologia , Malária/parasitologia , Malária/transmissão , Controle de Mosquitos , Polimorfismo de Nucleotídeo Único
10.
Am J Hum Genet ; 99(1): 89-103, 2016 Jul 07.
Artigo em Inglês | MEDLINE | ID: mdl-27292110

RESUMO

Genome-wide association studies (GWASs) have been successful in detecting variants correlated with phenotypes of clinical interest. However, the power to detect these variants depends on the number of individuals whose phenotypes are collected, and for phenotypes that are difficult to collect, the sample size might be insufficient to achieve the desired statistical power. The phenotype of interest is often difficult to collect, whereas surrogate phenotypes or related phenotypes are easier to collect and have already been collected in very large samples. This paper demonstrates how we take advantage of these additional related phenotypes to impute the phenotype of interest or target phenotype and then perform association analysis. Our approach leverages the correlation structure between phenotypes to perform the imputation. The correlation structure can be estimated from a smaller complete dataset for which both the target and related phenotypes have been collected. Under some assumptions, the statistical power can be computed analytically given the correlation structure of the phenotypes used in imputation. In addition, our method can impute the summary statistic of the target phenotype as a weighted linear combination of the summary statistics of related phenotypes. Thus, our method is applicable to datasets for which we have access only to summary statistics and not to the raw genotypes. We illustrate our approach by analyzing associated loci to triglycerides (TGs), body mass index (BMI), and systolic blood pressure (SBP) in the Northern Finland Birth Cohort dataset.


Assuntos
Estudo de Associação Genômica Ampla/métodos , Fenótipo , Animais , Pressão Sanguínea/genética , Índice de Massa Corporal , Estudos de Coortes , Conjuntos de Dados como Assunto , Finlândia , Genótipo , Humanos , Camundongos , Modelos Genéticos , Herança Multifatorial , Reprodutibilidade dos Testes , Projetos de Pesquisa , Tamanho da Amostra , Triglicerídeos/sangue
11.
G3 (Bethesda) ; 6(7): 1793-8, 2016 07 07.
Artigo em Inglês | MEDLINE | ID: mdl-27194809

RESUMO

Meta-analysis has become a popular tool for genetic association studies to combine different genetic studies. A key challenge in meta-analysis is heterogeneity, or the differences in effect sizes between studies. Heterogeneity complicates the interpretation of meta-analyses. In this paper, we describe ForestPMPlot, a flexible visualization tool for analyzing studies included in a meta-analysis. The main feature of the tool is visualizing the differences in the effect sizes of the studies to understand why the studies exhibit heterogeneity for a particular phenotype and locus pair under different conditions. We show the application of this tool to interpret a meta-analysis of 17 mouse studies, and to interpret a multi-tissue eQTL study.


Assuntos
Heterogeneidade Genética , Metanálise como Assunto , Modelos Estatísticos , Locos de Características Quantitativas , Software , Animais , Gráficos por Computador , Humanos , Camundongos , Fenótipo , Tamanho da Amostra , Viés de Seleção
12.
Genome Res ; 25(10): 1558-69, 2015 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-26260972

RESUMO

Genetics provides a potentially powerful approach to dissect host-gut microbiota interactions. Toward this end, we profiled gut microbiota using 16s rRNA gene sequencing in a panel of 110 diverse inbred strains of mice. This panel has previously been studied for a wide range of metabolic traits and can be used for high-resolution association mapping. Using a SNP-based approach with a linear mixed model, we estimated the heritability of microbiota composition. We conclude that, in a controlled environment, the genetic background accounts for a substantial fraction of abundance of most common microbiota. The mice were previously studied for response to a high-fat, high-sucrose diet, and we hypothesized that the dietary response was determined in part by gut microbiota composition. We tested this using a cross-fostering strategy in which a strain showing a modest response, SWR, was seeded with microbiota from a strain showing a strong response, A×B19. Consistent with a role of microbiota in dietary response, the cross-fostered SWR pups exhibited a significantly increased response in weight gain. To examine specific microbiota contributing to the response, we identified various genera whose abundance correlated with dietary response. Among these, we chose Akkermansia muciniphila, a common anaerobe previously associated with metabolic effects. When administered to strain A×B19 by gavage, the dietary response was significantly blunted for obesity, plasma lipids, and insulin resistance. In an effort to further understand host-microbiota interactions, we mapped loci controlling microbiota composition and prioritized candidate genes. Our publicly available data provide a resource for future studies.


Assuntos
Microbioma Gastrointestinal/genética , Animais , Dieta , Dieta Hiperlipídica , Meio Ambiente , Feminino , Estudo de Associação Genômica Ampla , Hereditariedade , Masculino , Camundongos , Camundongos Endogâmicos , Obesidade/microbiologia , RNA Ribossômico 16S , Sacarose/metabolismo
13.
Genetics ; 198(2): 497-508, 2014 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-25104515

RESUMO

Although genome-wide association studies have successfully identified thousands of risk loci for complex traits, only a handful of the biologically causal variants, responsible for association at these loci, have been successfully identified. Current statistical methods for identifying causal variants at risk loci either use the strength of the association signal in an iterative conditioning framework or estimate probabilities for variants to be causal. A main drawback of existing methods is that they rely on the simplifying assumption of a single causal variant at each risk locus, which is typically invalid at many risk loci. In this work, we propose a new statistical framework that allows for the possibility of an arbitrary number of causal variants when estimating the posterior probability of a variant being causal. A direct benefit of our approach is that we predict a set of variants for each locus that under reasonable assumptions will contain all of the true causal variants with a high confidence level (e.g., 95%) even when the locus contains multiple causal variants. We use simulations to show that our approach provides 20-50% improvement in our ability to identify the causal variants compared to the existing methods at loci harboring multiple causal variants. We validate our approach using empirical data from an expression QTL study of CHI3L2 to identify new causal variants that affect gene expression at this locus. CAVIAR is publicly available online at http://genetics.cs.ucla.edu/caviar/.


Assuntos
Estudos de Associação Genética , Estudos de Casos e Controles , Quitinases/genética , Doença da Artéria Coronariana/genética , Predisposição Genética para Doença , Humanos , Desequilíbrio de Ligação , Modelos Genéticos , Polimorfismo de Nucleotídeo Único , Locos de Características Quantitativas
14.
J Assoc Res Otolaryngol ; 15(3): 335-52, 2014 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-24570207

RESUMO

Age-related hearing loss (AHL) is characterized by a symmetric sensorineural hearing loss primarily in high frequencies and individuals have different levels of susceptibility to AHL. Heritability studies have shown that the sources of this variance are both genetic and environmental, with approximately half of the variance attributable to hereditary factors as reported by Huag and Tang (Eur Arch Otorhinolaryngol 267(8):1179-1191, 2010). Only a limited number of large-scale association studies for AHL have been undertaken in humans, to date. An alternate and complementary approach to these human studies is through the use of mouse models. Advantages of mouse models include that the environment can be more carefully controlled, measurements can be replicated in genetically identical animals, and the proportion of the variability explained by genetic variation is increased. Complex traits in mouse strains have been shown to have higher heritability and genetic loci often have stronger effects on the trait compared to humans. Motivated by these advantages, we have performed the first genome-wide association study of its kind in the mouse by combining several data sets in a meta-analysis to identify loci associated with age-related hearing loss. We identified five genome-wide significant loci (<10(-6)). One of these loci confirmed a previously identified locus (ahl8) on distal chromosome 11 and greatly narrowed the candidate region. Specifically, the most significant associated SNP is located 450 kb upstream of Fscn2. These data confirm the utility of this approach and provide new high-resolution mapping information about variation within the mouse genome associated with hearing loss.


Assuntos
Envelhecimento/fisiologia , Estudo de Associação Genômica Ampla , Perda Auditiva Neurossensorial/genética , Animais , Modelos Animais de Doenças , Potenciais Evocados Auditivos do Tronco Encefálico , Feminino , Humanos , Masculino , Camundongos , Probabilidade , Locos de Características Quantitativas
15.
PLoS Genet ; 10(1): e1004022, 2014 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-24415945

RESUMO

Identifying environmentally-specific genetic effects is a key challenge in understanding the structure of complex traits. Model organisms play a crucial role in the identification of such gene-by-environment interactions, as a result of the unique ability to observe genetically similar individuals across multiple distinct environments. Many model organism studies examine the same traits but under varying environmental conditions. For example, knock-out or diet-controlled studies are often used to examine cholesterol in mice. These studies, when examined in aggregate, provide an opportunity to identify genomic loci exhibiting environmentally-dependent effects. However, the straightforward application of traditional methodologies to aggregate separate studies suffers from several problems. First, environmental conditions are often variable and do not fit the standard univariate model for interactions. Additionally, applying a multivariate model results in increased degrees of freedom and low statistical power. In this paper, we jointly analyze multiple studies with varying environmental conditions using a meta-analytic approach based on a random effects model to identify loci involved in gene-by-environment interactions. Our approach is motivated by the observation that methods for discovering gene-by-environment interactions are closely related to random effects models for meta-analysis. We show that interactions can be interpreted as heterogeneity and can be detected without utilizing the traditional uni- or multi-variate approaches for discovery of gene-by-environment interactions. We apply our new method to combine 17 mouse studies containing in aggregate 4,965 distinct animals. We identify 26 significant loci involved in High-density lipoprotein (HDL) cholesterol, many of which are consistent with previous findings. Several of these loci show significant evidence of involvement in gene-by-environment interactions. An additional advantage of our meta-analysis approach is that our combined study has significantly higher power and improved resolution compared to any single study thus explaining the large number of loci discovered in the combined study.


Assuntos
HDL-Colesterol/genética , Interação Gene-Ambiente , Locos de Características Quantitativas/genética , Animais , Meio Ambiente , Genoma , Camundongos , Modelos Teóricos
16.
Bioinformatics ; 30(2): 206-13, 2014 Jan 15.
Artigo em Inglês | MEDLINE | ID: mdl-24158599

RESUMO

MOTIVATION: Recently, investigators have proposed state-of-the-art Identity-by-descent (IBD) mapping methods to detect IBD segments between purportedly unrelated individuals. The IBD information can then be used for association testing in genetic association studies. One approach for this IBD association testing strategy is to test for excessive IBD between pairs of cases ('pairwise method'). However, this approach is inefficient because it requires a large number of permutations. Moreover, a limited number of permutations define a lower bound for P-values, which makes fine-mapping of associated regions difficult because, in practice, a much larger genomic region is implicated than the region that is actually associated. RESULTS: In this article, we introduce a new pairwise method 'Fast-Pairwise'. Fast-Pairwise uses importance sampling to improve efficiency and enable approximation of extremely small P-values. Fast-Pairwise method takes only days to complete a genome-wide scan. In the application to the WTCCC type 1 diabetes data, Fast-Pairwise successfully fine-maps a known human leukocyte antigen gene that is known to cause the disease. AVAILABILITY: Fast-Pairwise is publicly available at: http://genetics.cs.ucla.edu/graphibd.


Assuntos
Diabetes Mellitus Tipo 1/genética , Genoma Humano , Estudo de Associação Genômica Ampla , Antígenos HLA/genética , Característica Quantitativa Herdável , Algoritmos , Estudos de Casos e Controles , Mapeamento Cromossômico , Simulação por Computador , Humanos , Polimorfismo de Nucleotídeo Único/genética
17.
Sci Rep ; 3: 1815, 2013.
Artigo em Inglês | MEDLINE | ID: mdl-23657357

RESUMO

Applications of linear mixed models (LMMs) to problems in genomics include phenotype prediction, correction for confounding in genome-wide association studies, estimation of narrow sense heritability, and testing sets of variants (e.g., rare variants) for association. In each of these applications, the LMM uses a genetic similarity matrix, which encodes the pairwise similarity between every two individuals in a cohort. Although ideally these similarities would be estimated using strictly variants relevant to the given phenotype, the identity of such variants is typically unknown. Consequently, relevant variants are excluded and irrelevant variants are included, both having deleterious effects. For each application of the LMM, we review known effects and describe new effects showing how variable selection can be used to mitigate them.


Assuntos
Variação Genética , Genômica , Modelos Teóricos , Fenótipo , Característica Quantitativa Herdável , Estudo de Associação Genômica Ampla , Humanos , Seleção Genética
18.
Bioinformatics ; 29(12): 1526-33, 2013 Jun 15.
Artigo em Inglês | MEDLINE | ID: mdl-23599503

RESUMO

MOTIVATION: Approaches for testing sets of variants, such as a set of rare or common variants within a gene or pathway, for association with complex traits are important. In particular, set tests allow for aggregation of weak signal within a set, can capture interplay among variants and reduce the burden of multiple hypothesis testing. Until now, these approaches did not address confounding by family relatedness and population structure, a problem that is becoming more important as larger datasets are used to increase power. RESULTS: We introduce a new approach for set tests that handles confounders. Our model is based on the linear mixed model and uses two random effects-one to capture the set association signal and one to capture confounders. We also introduce a computational speedup for two random-effects models that makes this approach feasible even for extremely large cohorts. Using this model with both the likelihood ratio test and score test, we find that the former yields more power while controlling type I error. Application of our approach to richly structured Genetic Analysis Workshop 14 data demonstrates that our method successfully corrects for population structure and family relatedness, whereas application of our method to a 15 000 individual Crohn's disease case-control cohort demonstrates that it additionally recovers genes not recoverable by univariate analysis. AVAILABILITY: A Python-based library implementing our approach is available at http://mscompbio.codeplex.com.


Assuntos
Marcadores Genéticos , Estudo de Associação Genômica Ampla/métodos , Algoritmos , Estudos de Casos e Controles , Doença de Crohn/genética , Humanos , Modelos Lineares , Fenótipo , Polimorfismo de Nucleotídeo Único
19.
Mamm Genome ; 23(9-10): 680-92, 2012 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-22892838

RESUMO

We have developed an association-based approach using classical inbred strains of mice in which we correct for population structure, which is very extensive in mice, using an efficient mixed-model algorithm. Our approach includes inbred parental strains as well as recombinant inbred strains in order to capture loci with effect sizes typical of complex traits in mice (in the range of 5% of total trait variance). Over the last few years, we have typed the hybrid mouse diversity panel (HMDP) strains for a variety of clinical traits as well as intermediate phenotypes and have shown that the HMDP has sufficient power to map genes for highly complex traits with resolution that is in most cases less than a megabase. In this essay, we review our experience with the HMDP, describe various ongoing projects, and discuss how the HMDP may fit into the larger picture of common diseases and different approaches.


Assuntos
Camundongos Endogâmicos/genética , Animais , Bases de Dados Genéticas , Camundongos
20.
Genetics ; 191(3): 959-67, 2012 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-22505625

RESUMO

Genetic studies in mouse models have played an integral role in the discovery of the mechanisms underlying many human diseases. The primary mode of discovery has been the application of linkage analysis to mouse crosses. This approach results in high power to identify regions that affect traits, but in low resolution, making it difficult to identify the precise genomic location harboring the causal variant. Recently, a panel of mice referred to as the hybrid mouse diversity panel (HMDP) has been developed to overcome this problem. However, power in this panel is limited by the availability of inbred strains. Previous studies have suggested combining results across multiple panels as a means to increase power, but the methods employed may not be well suited to structured populations, such as the HMDP. In this article, we introduce a meta-analysis-based method that may be used to combine HMDP studies with F2 cross studies to gain power, while increasing resolution. Due to the drastically different genetic structure of F2s and the HMDP, the best way to combine two studies for a given SNP depends on the strain distribution pattern in each study. We show that combining results, while accounting for these patterns, leads to increased power and resolution. Using our method to map bone mineral density, we find that two previously implicated loci are replicated with increased significance and that the size of the associated is decreased. We also map HDL cholesterol and show a dramatic increase in the significance of a previously identified result.


Assuntos
Mapeamento Cromossômico , Metanálise como Assunto , Camundongos/genética , Modelos Genéticos , Animais , Densidade Óssea/genética , HDL-Colesterol/genética , Ligação Genética , Variação Genética/genética , Hibridização Genética , Camundongos/metabolismo , Camundongos/fisiologia , Fenótipo
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA