Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 19 de 19
Filtrar
1.
Am J Hum Genet ; 100(2): 228-237, 2017 02 02.
Artigo em Inglês | MEDLINE | ID: mdl-28065468

RESUMO

We analyzed the mRNA levels for 36,778 transcript expression traits (probes) from 2,765 individuals to comprehensively investigate the genetic architecture and degree of missing heritability for gene expression in peripheral blood. We identified 11,204 cis and 3,791 trans independent expression quantitative trait loci (eQTL) by using linear mixed models to perform genome-wide association analyses. Furthermore, using information on both closely and distantly related individuals, heritability was estimated for all expression traits. Of the set of expressed probes (15,966), 10,580 (66%) had an estimated narrow-sense heritability (h2) greater than zero with a mean (median) value of 0.192 (0.142). Across these probes, on average the proportion of genetic variance explained by all eQTL (hCOJO2) was 31% (0.060/0.192), meaning that 69% is missing, with the sentinel SNP of the largest eQTL explaining 87% (0.052/0.060) of the variance attributed to all identified cis- and trans-eQTL. For the same set of probes, the genetic variance attributed to genome-wide common (MAF > 0.01) HapMap 3 SNPs (hg2) accounted for on average 48% (0.093/0.192) of h2. Taken together, the evidence suggests that approximately half the genetic variance for gene expression is not tagged by common SNPs, and of the variance that is tagged by common SNPs, a large proportion can be attributed to identifiable eQTL of large effect, typically in cis. Finally, we present evidence that, compared with a meta-analysis, using individual-level data results in an increase of approximately 50% in power to detect eQTL.


Assuntos
Expressão Gênica , Padrões de Herança , Locos de Características Quantitativas , RNA Mensageiro/sangue , Estudos de Associação Genética , Genoma Humano , Genótipo , Projeto HapMap , Humanos , Modelos Lineares , Desequilíbrio de Ligação , Modelos Genéticos , Fenótipo , Polimorfismo de Nucleotídeo Único , RNA Mensageiro/genética
2.
Neural Comput ; 28(12): 2585-2593, 2016 12.
Artigo em Inglês | MEDLINE | ID: mdl-27626962

RESUMO

The mixture-of-experts (MoE) model is a popular neural network architecture for nonlinear regression and classification. The class of MoE mean functions is known to be uniformly convergent to any unknown target function, assuming that the target function is from a Sobolev space that is sufficiently differentiable and that the domain of estimation is a compact unit hypercube. We provide an alternative result, which shows that the class of MoE mean functions is dense in the class of all continuous functions over arbitrary compact domains of estimation. Our result can be viewed as a universal approximation theorem for MoE models. The theorem we present allows MoE users to be confident in applying such models for estimation when data arise from nonlinear and nondifferentiable generative processes.

3.
Biometrics ; 72(4): 1255-1265, 2016 12.
Artigo em Inglês | MEDLINE | ID: mdl-27123964

RESUMO

Understanding how aquatic species grow is fundamental in fisheries because stock assessment often relies on growth dependent statistical models. Length-frequency-based methods become important when more applicable data for growth model estimation are either not available or very expensive. In this article, we develop a new framework for growth estimation from length-frequency data using a generalized von Bertalanffy growth model (VBGM) framework that allows for time-dependent covariates to be incorporated. A finite mixture of normal distributions is used to model the length-frequency cohorts of each month with the means constrained to follow a VBGM. The variances of the finite mixture components are constrained to be a function of mean length, reducing the number of parameters and allowing for an estimate of the variance at any length. To optimize the likelihood, we use a minorization-maximization (MM) algorithm with a Nelder-Mead sub-step. This work was motivated by the decline in catches of the blue swimmer crab (BSC) (Portunus armatus) off the east coast of Queensland, Australia. We test the method with a simulation study and then apply it to the BSC fishery data.


Assuntos
Braquiúros/crescimento & desenvolvimento , Pesqueiros/estatística & dados numéricos , Modelos Biológicos , Modelos Estatísticos , Algoritmos , Animais , Distribuição Normal , Fatores de Tempo
5.
Evol Appl ; 16(4): 911-935, 2023 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-37124084

RESUMO

Effective management of protected species requires information on appropriate evolutionary and geographic population boundaries and knowledge of how the physical environment and life-history traits combine to shape the population structure and connectivity. Saltwater crocodiles (Crocodylus porosus) are the largest and most widely distributed of living crocodilians, extending from Sri Lanka to Southeast Asia and down to northern Australia. Given the long-distance movement capabilities reported for C. porosus, management units are hypothesised to be highly connected by migration. However, the magnitude, scale, and consistency of connection across managed populations are not fully understood. Here we used an efficient genotyping method that combines DArTseq and sequence capture to survey ≈ 3000 high-quality genome-wide single nucleotide polymorphisms from 1176 C. porosus sampled across nearly the entire range of the species in Queensland, Australia. We investigated historical and present-day connectivity patterns using fixation and diversity indices coupled with clustering methods and the spatial distribution of kin pairs. We inferred kinship using forward simulation coupled with a kinship estimation method that is robust to unspecified population structure. The results demonstrated that the C. porosus population has substantial genetic structure with six broad populations correlated with geographical location. The rate of gene flow was highly correlated with spatial distance, with greater differentiation along the east coast compared to the west. Kinship analyses revealed evidence of reproductive philopatry and limited dispersal, with approximately 90% of reported first and second-degree relatives showing a pairwise distance of <50 km between sampling locations. Given the limited dispersal, lack of suitable habitat, low densities of crocodiles and the high proportion of immature animals in the population, future management and conservation interventions should be considered at regional and state-wide scales.

6.
Sci Rep ; 13(1): 12512, 2023 08 02.
Artigo em Inglês | MEDLINE | ID: mdl-37532795

RESUMO

Reliable information on population size is fundamental to the management of threatened species. For wild species, mark-recapture methods are a cornerstone of abundance estimation. Here, we show the first application of the close-kin mark-recapture (CKMR) method to a terrestrial species of high conservation value; the Christmas Island flying-fox (CIFF). The CIFF is the island's last remaining native terrestrial mammal and was recently listed as critically endangered. CKMR is a powerful tool for estimating the demographic parameters central to CIFF management and circumvents the complications arising from the species' cryptic nature, mobility, and difficult-to-survey habitat. To this end, we used genetic data from 450 CIFFs captured between 2015 and 2019 to detect kin pairs. We implemented a novel CKMR model that estimates sex-specific abundance, trend, and mortality and accommodates observations from the kin-pair distribution of male reproductive skew and mate persistence. CKMR estimated CIFF total adult female abundance to be approximately 2050 individuals (95% CI (950, 4300)). We showed that on average only 23% of the adult male population contributed to annual reproduction and strong evidence for between-year mate fidelity, an observation not previously quantified for a Pteropus species in the wild. Critically, our population estimates provide the most robust understanding of the status of this critically endangered population, informing immediate and future conservation initiatives.


Assuntos
Quirópteros , Conservação dos Recursos Naturais , Humanos , Animais , Masculino , Feminino , Espécies em Perigo de Extinção , Densidade Demográfica , Ecossistema , Mamíferos
7.
PLoS One ; 17(7): e0271930, 2022.
Artigo em Inglês | MEDLINE | ID: mdl-35901047

RESUMO

Monitoring programs are fundamental to understanding the state and trend of aquatic ecosystems. Sampling designs are a crucial component of monitoring programs and ensure that measurements evaluate progress toward clearly stated management objectives, which provides a mechanism for adaptive management. Here, we use a well-established marine monitoring program for inshore water quality in the Great Barrier Reef (GBR), Australia to investigate whether a sampling re-design has increased the program's capacity to meet its primary objectives. Specifically, we use bootstrap resampling to assess the change in statistical power to detect temporal water quality trends in a 15-year inshore marine water quality data set that includes data from both before and after the sampling re-design. We perform a comprehensive power analysis for six water quality analytes at four separate study areas in the GBR Marine Park and find that the sampling re-design (i) increased power to detect trends in 23 of the 24 analyte-study area combinations, and (ii) resulted in an average increase in power of 34% to detect increasing or decreasing trends in water quality analytes. This increase in power is attributed more to the addition of sampling locations than increasing the sampling rate. Therefore, the sampling re-design has substantially increased the capacity of the program to detect temporal trends in inshore marine water quality. Further improvements in sampling design need to focus on the program's capability to reliably detect trends within realistic timeframes where inshore improvements to water quality can be expected to occur.


Assuntos
Recifes de Corais , Qualidade da Água , Austrália , Ecossistema , Monitoramento Ambiental/métodos
8.
Nat Commun ; 12(1): 1164, 2021 02 19.
Artigo em Inglês | MEDLINE | ID: mdl-33608517

RESUMO

Understanding how natural selection has shaped genetic architecture of complex traits is of importance in medical and evolutionary genetics. Bayesian methods have been developed using individual-level GWAS data to estimate multiple genetic architecture parameters including selection signature. Here, we present a method (SBayesS) that only requires GWAS summary statistics. We analyse data for 155 complex traits (n = 27k-547k) and project the estimates onto those obtained from evolutionary simulations. We estimate that, on average across traits, about 1% of human genome sequence are mutational targets with a mean selection coefficient of ~0.001. Common diseases, on average, show a smaller number of mutational targets and have been under stronger selection, compared to other traits. SBayesS analyses incorporating functional annotations reveal that selection signatures vary across genomic regions, among which coding regions have the strongest selection signature and are enriched for both the number of associated variants and the magnitude of effect sizes.


Assuntos
Genoma , Herança Multifatorial/genética , Herança Multifatorial/fisiologia , Seleção Genética/genética , Seleção Genética/fisiologia , Teorema de Bayes , Evolução Molecular , Variação Genética , Estudo de Associação Genômica Ampla , Genômica , Humanos , Modelos Biológicos , Fenótipo , Polimorfismo de Nucleotídeo Único
9.
Nat Commun ; 10(1): 3009, 2019 07 08.
Artigo em Inglês | MEDLINE | ID: mdl-31285442

RESUMO

Quantitative genetics theory predicts that X-chromosome dosage compensation (DC) will have a detectable effect on the amount of genetic and therefore phenotypic trait variances at associated loci in males and females. Here, we systematically examine the role of DC in humans in 20 complex traits in a sample of more than 450,000 individuals from the UK Biobank and 1600 gene expression traits from a sample of 2000 individuals as well as across-tissue gene expression from the GTEx resource. We find approximately twice as much X-linked genetic variation across the UK Biobank traits in males (mean h2SNP = 0.63%) compared to females (mean h2SNP = 0.30%), confirming the predicted DC effect. Our DC estimates for complex traits and gene expression are consistent with a small proportion of genes escaping X-inactivation in a trait- and tissue-dependent manner. Finally, we highlight examples of biologically relevant X-linked heterogeneity between the sexes that bias DC estimates if unaccounted for.


Assuntos
Genes Ligados ao Cromossomo X/genética , Loci Gênicos/genética , Variação Genética/genética , Herança Multifatorial/genética , Inativação do Cromossomo X/genética , Conjuntos de Dados como Assunto , Feminino , Estudo de Associação Genômica Ampla , Humanos , Masculino , Modelos Genéticos , Fenótipo , Fatores Sexuais
10.
Nat Commun ; 10(1): 5086, 2019 11 08.
Artigo em Inglês | MEDLINE | ID: mdl-31704910

RESUMO

Accurate prediction of an individual's phenotype from their DNA sequence is one of the great promises of genomics and precision medicine. We extend a powerful individual-level data Bayesian multiple regression model (BayesR) to one that utilises summary statistics from genome-wide association studies (GWAS), SBayesR. In simulation and cross-validation using 12 real traits and 1.1 million variants on 350,000 individuals from the UK Biobank, SBayesR improves prediction accuracy relative to commonly used state-of-the-art summary statistics methods at a fraction of the computational resources. Furthermore, using summary statistics for variants from the largest GWAS meta-analysis (n ≈ 700, 000) on height and BMI, we show that on average across traits and two independent data sets that SBayesR improves prediction R2 by 5.2% relative to LDpred and by 26.5% relative to clumping and p value thresholding.


Assuntos
Teorema de Bayes , Herança Multifatorial/genética , Análise de Regressão , Tecido Adiposo , Alopecia/genética , Metabolismo Basal/genética , Bancos de Espécimes Biológicos , Peso ao Nascer/genética , Composição Corporal/genética , Estatura/genética , Índice de Massa Corporal , Densidade Óssea/genética , Diabetes Mellitus Tipo 2/genética , Volume Expiratório Forçado/genética , Estudos de Associação Genética , Estudo de Associação Genômica Ampla , Humanos , Polimorfismo de Nucleotídeo Único , Estatística como Assunto , Capacidade Vital/genética , Relação Cintura-Quadril
11.
Genetics ; 212(3): 905-918, 2019 07.
Artigo em Inglês | MEDLINE | ID: mdl-31123039

RESUMO

Expression QTL (eQTL) detection has emerged as an important tool for unraveling the relationship between genetic risk factors and disease or clinical phenotypes. Most studies are predicated on the assumption that only a single causal variant explains the association signal in each interval. This greatly simplifies the statistical modeling, but is liable to biases in scenarios where multiple local causal-variants are responsible. Here, our primary goal was to address the prevalence of secondary cis-eQTL signals regulating peripheral blood gene expression locally, utilizing two large human cohort studies, each >2500 samples with accompanying whole genome genotypes. The CAGE (Consortium for the Architecture of Gene Expression) dataset is a compendium of Illumina microarray studies, and the Framingham Heart Study is a two-generation Affymetrix dataset. We also describe Bayesian colocalization analysis of the extent of sharing of cis-eQTL detected in both studies as well as with the BIOS RNAseq dataset. Stepwise conditional modeling demonstrates that multiple eQTL signals are present for ∼40% of over 3500 eGenes in both microarray datasets, and that the number of loci with additional signals reduces by approximately two-thirds with each conditioning step. Although <20% of the peak signals across platforms fine map to the same credible interval, the colocalization analysis finds that as many as 50-60% of the primary eQTL are actually shared. Subsequently, colocalization of eQTL signals with GWAS hits detected 1349 genes whose expression in peripheral blood is associated with 591 human phenotype traits or diseases, including enrichment for genes with regulatory functions. At least 10%, and possibly as many as 40%, of eQTL-trait colocalized signals are due to nonprimary cis-eQTL peaks, but just one-quarter of these colocalization signals replicated across the gene expression datasets. Our results are provided as a web-based resource for visualization of multi-site regulation of gene expression and its association with human complex traits and disease states.


Assuntos
Estudo de Associação Genômica Ampla/métodos , Locos de Características Quantitativas , Algoritmos , Perfilação da Expressão Gênica/métodos , Predisposição Genética para Doença , Humanos , Desequilíbrio de Ligação , Herança Multifatorial
12.
Genetics ; 208(4): 1397-1408, 2018 04.
Artigo em Inglês | MEDLINE | ID: mdl-29429966

RESUMO

Genome-wide association studies (GWAS) have identified thousands of loci that are robustly associated with complex diseases. The use of linear mixed model (LMM) methodology for GWAS is becoming more prevalent due to its ability to control for population structure and cryptic relatedness and to increase power. The odds ratio (OR) is a common measure of the association of a disease with an exposure (e.g., a genetic variant) and is readably available from logistic regression. However, when the LMM is applied to all-or-none traits it provides estimates of genetic effects on the observed 0-1 scale, a different scale to that in logistic regression. This limits the comparability of results across studies, for example in a meta-analysis, and makes the interpretation of the magnitude of an effect from an LMM GWAS difficult. In this study, we derived transformations from the genetic effects estimated under the LMM to the OR that only rely on summary statistics. To test the proposed transformations, we used real genotypes from two large, publicly available data sets to simulate all-or-none phenotypes for a set of scenarios that differ in underlying model, disease prevalence, and heritability. Furthermore, we applied these transformations to GWAS summary statistics for type 2 diabetes generated from 108,042 individuals in the UK Biobank. In both simulation and real-data application, we observed very high concordance between the transformed OR from the LMM and either the simulated truth or estimates from logistic regression. The transformations derived and validated in this study improve the comparability of results from prospective and already performed LMM GWAS on complex diseases by providing a reliable transformation to a common comparative scale for the genetic effects.


Assuntos
Estudos de Associação Genética , Modelos Lineares , Modelos Genéticos , Herança Multifatorial , Característica Quantitativa Herdável , Algoritmos , Simulação por Computador , Diabetes Mellitus Tipo 2/genética , Diabetes Mellitus Tipo 2/metabolismo , Estudo de Associação Genômica Ampla , Humanos , Razão de Chances
13.
Nat Commun ; 9(1): 918, 2018 03 02.
Artigo em Inglês | MEDLINE | ID: mdl-29500431

RESUMO

The identification of genes and regulatory elements underlying the associations discovered by GWAS is essential to understanding the aetiology of complex traits (including diseases). Here, we demonstrate an analytical paradigm of prioritizing genes and regulatory elements at GWAS loci for follow-up functional studies. We perform an integrative analysis that uses summary-level SNP data from multi-omics studies to detect DNA methylation (DNAm) sites associated with gene expression and phenotype through shared genetic effects (i.e., pleiotropy). We identify pleiotropic associations between 7858 DNAm sites and 2733 genes. These DNAm sites are enriched in enhancers and promoters, and >40% of them are mapped to distal genes. Further pleiotropic association analyses, which link both the methylome and transcriptome to 12 complex traits, identify 149 DNAm sites and 66 genes, indicating a plausible mechanism whereby the effect of a genetic variant on phenotype is mediated by genetic regulation of transcription through DNAm.


Assuntos
Genômica/métodos , Característica Quantitativa Herdável , Metilação de DNA/genética , Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Humanos , Especificidade de Órgãos , Locos de Características Quantitativas/genética , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , Sequências Reguladoras de Ácido Nucleico/genética , Esquizofrenia/genética , Transcriptoma/genética
14.
Nat Commun ; 9(1): 2941, 2018 07 27.
Artigo em Inglês | MEDLINE | ID: mdl-30054458

RESUMO

Type 2 diabetes (T2D) is a very common disease in humans. Here we conduct a meta-analysis of genome-wide association studies (GWAS) with ~16 million genetic variants in 62,892 T2D cases and 596,424 controls of European ancestry. We identify 139 common and 4 rare variants associated with T2D, 42 of which (39 common and 3 rare variants) are independent of the known variants. Integration of the gene expression data from blood (n = 14,115 and 2765) with the GWAS results identifies 33 putative functional genes for T2D, 3 of which were targeted by approved drugs. A further integration of DNA methylation (n = 1980) and epigenomic annotation data highlight 3 genes (CAMK1D, TP53INP1, and ATP5G1) with plausible regulatory mechanisms, whereby a genetic variant exerts an effect on T2D through epigenetic regulation of gene expression. Our study uncovers additional loci, proposes putative genetic regulatory mechanisms for T2D, and provides evidence of purifying selection for T2D-associated variants.


Assuntos
Diabetes Mellitus Tipo 2/genética , Regulação da Expressão Gênica/fisiologia , Predisposição Genética para Doença , Variação Genética , Estudo de Associação Genômica Ampla/métodos , Proteína Quinase Tipo 1 Dependente de Cálcio-Calmodulina/genética , Proteínas de Transporte/genética , Metilação de DNA , Diabetes Mellitus Tipo 2/etnologia , Epigênese Genética , Epigenômica , Genótipo , Proteínas de Choque Térmico/genética , Humanos , ATPases Mitocondriais Próton-Translocadoras/genética , Risco , População Branca/genética
15.
Nat Genet ; 50(5): 746-753, 2018 05.
Artigo em Inglês | MEDLINE | ID: mdl-29662166

RESUMO

We develop a Bayesian mixed linear model that simultaneously estimates single-nucleotide polymorphism (SNP)-based heritability, polygenicity (proportion of SNPs with nonzero effects), and the relationship between SNP effect size and minor allele frequency for complex traits in conventionally unrelated individuals using genome-wide SNP data. We apply the method to 28 complex traits in the UK Biobank data (N = 126,752) and show that on average, 6% of SNPs have nonzero effects, which in total explain 22% of phenotypic variance. We detect significant (P < 0.05/28) signatures of natural selection in the genetic architecture of 23 traits, including reproductive, cardiovascular, and anthropometric traits, as well as educational attainment. The significant estimates of the relationship between effect size and minor allele frequency in complex traits are consistent with a model of negative (or purifying) selection, as confirmed by forward simulation. We conclude that negative selection acts pervasively on the genetic variants associated with human complex traits.


Assuntos
Seleção Genética/genética , Teorema de Bayes , Frequência do Gene/genética , Estudo de Associação Genômica Ampla/métodos , Genótipo , Humanos , Modelos Lineares , Modelos Genéticos , Herança Multifatorial/genética , Fenótipo , Polimorfismo de Nucleotídeo Único/genética , Característica Quantitativa Herdável
16.
Genetics ; 206(2): 1113-1126, 2017 06.
Artigo em Inglês | MEDLINE | ID: mdl-28381588

RESUMO

Genetic association studies in admixed populations are underrepresented in the genomics literature, with a key concern for researchers being the adequate control of spurious associations due to population structure. Linear mixed models (LMMs) are well suited for genome-wide association studies (GWAS) because they account for both population stratification and cryptic relatedness and achieve increased statistical power by jointly modeling all genotyped markers. Additionally, Bayesian LMMs allow for more flexible assumptions about the underlying distribution of genetic effects, and can concurrently estimate the proportion of phenotypic variance explained by genetic markers. Using three recently published Bayesian LMMs, Bayes R, BSLMM, and BOLT-LMM, we investigate an existing data set on eye (n = 625) and skin (n = 684) color from Cape Verde, an island nation off West Africa that is home to individuals with a broad range of phenotypic values for eye and skin color due to the mix of West African and European ancestry. We use simulations to demonstrate the utility of Bayesian LMMs for mapping loci and studying the genetic architecture of quantitative traits in admixed populations. The Bayesian LMMs provide evidence for two new pigmentation loci: one for eye color (AHRR) and one for skin color (DDB1).


Assuntos
Olho , Genética Populacional , Pigmentos Biológicos/genética , Pigmentação da Pele/genética , África Ocidental , Teorema de Bayes , Cor , Estudo de Associação Genômica Ampla , Humanos , Polimorfismo de Nucleotídeo Único/genética , Locos de Características Quantitativas
17.
G3 (Bethesda) ; 7(8): 2533-2544, 2017 08 07.
Artigo em Inglês | MEDLINE | ID: mdl-28600440

RESUMO

Expression quantitative trait locus (eQTL) detection has emerged as an important tool for unraveling of the relationship between genetic risk factors and disease or clinical phenotypes. Most studies use single marker linear regression to discover primary signals, followed by sequential conditional modeling to detect secondary genetic variants affecting gene expression. However, this approach assumes that functional variants are sparsely distributed and that close linkage between them has little impact on estimation of their precise location and the magnitude of effects. We describe a series of simulation studies designed to evaluate the impact of linkage disequilibrium (LD) on the fine mapping of causal variants with typical eQTL effect sizes. In the presence of multisite regulation, even though between 80 and 90% of modeled eSNPs associate with normally distributed traits, up to 10% of all secondary signals could be statistical artifacts, and at least 5% but up to one-quarter of credible intervals of SNPs within r2 > 0.8 of the peak may not even include a causal site. The Bayesian methods eCAVIAR and DAP (Deterministic Approximation of Posteriors) provide only modest improvement in resolution. Given the strong empirical evidence that gene expression is commonly regulated by more than one variant, we conclude that the fine mapping of causal variants needs to be adjusted for multisite influences, as conditional estimates can be highly biased by interference among linked sites, but ultimately experimental verification of individual effects is needed. Presumably similar conclusions apply not just to eQTL mapping, but to multisite influences on fine mapping of most types of quantitative trait.


Assuntos
Regulação da Expressão Gênica , Mapeamento Físico do Cromossomo , Locos de Características Quantitativas/genética , Alelos , Teorema de Bayes , Simulação por Computador , Humanos , Modelos Genéticos , Polimorfismo de Nucleotídeo Único/genética
18.
Nat Commun ; 8(1): 483, 2017 09 07.
Artigo em Inglês | MEDLINE | ID: mdl-28883458

RESUMO

Transcript co-expression is regulated by a combination of shared genetic and environmental factors. Here, we estimate the proportion of co-expression that is due to shared genetic variance. To do so, we estimated the genetic correlations between each pairwise combination of 2469 transcripts that are highly heritable and expressed in whole blood in 1748 unrelated individuals of European ancestry. We identify 556 pairs with a significant genetic correlation of which 77% are located on different chromosomes, and report 934 expression quantitative trait loci, identified in an independent cohort, with significant effects on both transcripts in a genetically correlated pair. We show significant enrichment for transcription factor control and physical proximity through chromatin interactions as possible mechanisms of shared genetic control. Finally, we construct networks of interconnected transcripts and identify their underlying biological functions. Using genetic correlations to investigate transcriptional co-regulation provides valuable insight into the nature of the underlying genetic architecture of gene regulation.Covariance of gene expression pairs is due to a combination of shared genetic and environmental factors. Here the authors estimate the genetic correlation between highly heritable pairs and identify transcription factor control and chromatin interactions as possible mechanisms of correlation.


Assuntos
Sangue/metabolismo , Regulação da Expressão Gênica , Expressão Gênica , Variação Genética , Redes Reguladoras de Genes , Humanos , Locos de Características Quantitativas , RNA Mensageiro/metabolismo , Fatores de Transcrição/genética , Fatores de Transcrição/fisiologia , Transcrição Gênica
19.
Nat Genet ; 49(8): 1174-1181, 2017 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-28692066

RESUMO

Obesity is a worldwide epidemic, with major health and economic costs. Here we estimate heritability for body mass index (BMI) in 172,000 sibling pairs and 150,832 unrelated individuals and explore the contribution of genotype-covariate interaction effects at common SNP loci. We find evidence for genotype-age interaction (likelihood ratio test (LRT) = 73.58, degrees of freedom (df) = 1, P = 4.83 × 10-18), which contributed 8.1% (1.4% s.e.) to BMI variation. Across eight self-reported lifestyle factors, including diet and exercise, we find genotype-environment interaction only for smoking behavior (LRT = 19.70, P = 5.03 × 10-5 and LRT = 30.80, P = 1.42 × 10-8), which contributed 4.0% (0.8% s.e.) to BMI variation. Bayesian association analysis suggests that BMI is highly polygenic, with 75% of the SNP heritability attributable to loci that each explain <0.01% of the phenotypic variance. Our findings imply that substantially larger sample sizes across ages and lifestyles are required to understand the full genetic architecture of BMI.


Assuntos
Índice de Massa Corporal , Obesidade/genética , Adolescente , Adulto , Idoso , Envelhecimento/genética , Teorema de Bayes , Feminino , Interação Gene-Ambiente , Genótipo , Humanos , Estilo de Vida , Masculino , Pessoa de Meia-Idade , Herança Multifatorial , Polimorfismo de Nucleotídeo Único , Caracteres Sexuais , Gêmeos/genética , Adulto Jovem
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA