RESUMEN
BACKGROUND: Cohorts of individuals that have been genotyped and phenotyped for genomic selection programs offer the opportunity to better understand genetic variation associated with complex traits. Here, we performed an association study for traits related to body size and muscular development in intensively selected beef cattle. We leveraged multiple trait information to refine and interpret the significant associations. RESULTS: After a multiple-step genotype imputation to the sequence-level for 14,762 Belgian Blue beef (BBB) cows, we performed a genome-wide association study (GWAS) for 11 traits related to muscular development and body size. The 37 identified genome-wide significant quantitative trait loci (QTL) could be condensed in 11 unique QTL regions based on their position. Evidence for pleiotropic effects was found in most of these regions (e.g., correlated association signals, overlap between credible sets (CS) of candidate variants). Thus, we applied a multiple-trait approach to combine information from different traits to refine the CS. In several QTL regions, we identified strong candidate genes known to be related to growth and height in other species such as LCORL-NCAPG or CCND2. For some of these genes, relevant candidate variants were identified in the CS, including three new missense variants in EZH2, PAPPA2 and ADAM12, possibly two additional coding variants in LCORL, and candidate regulatory variants linked to CCND2 and ARMC12. Strikingly, four other QTL regions associated with dimension or muscular development traits were related to five (recessive) deleterious coding variants previously identified. CONCLUSIONS: Our study further supports that a set of common genes controls body size across mammalian species. In particular, we added new genes to the list of those associated with height in both humans and cattle. We also identified new strong candidate causal variants in some of these genes, strengthening the evidence of their causality. Several breed-specific recessive deleterious variants were identified in our QTL regions, probably as a result of the extreme selection for muscular development in BBB cattle.
Asunto(s)
Estudio de Asociación del Genoma Completo , Sitios de Carácter Cuantitativo , Humanos , Femenino , Bovinos/genética , Animales , Estudio de Asociación del Genoma Completo/veterinaria , Bélgica , Fenotipo , Tamaño Corporal/genética , Mamíferos/genética , Polimorfismo de Nucleótido SimpleRESUMEN
Inflammatory bowel diseases are chronic gastrointestinal inflammatory disorders that affect millions of people worldwide. Genome-wide association studies have identified 200 inflammatory bowel disease-associated loci, but few have been conclusively resolved to specific functional variants. Here we report fine-mapping of 94 inflammatory bowel disease loci using high-density genotyping in 67,852 individuals. We pinpoint 18 associations to a single causal variant with greater than 95% certainty, and an additional 27 associations to a single variant with greater than 50% certainty. These 45 variants are significantly enriched for protein-coding changes (n = 13), direct disruption of transcription-factor binding sites (n = 3), and tissue-specific epigenetic marks (n = 10), with the last category showing enrichment in specific immune cells among associations stronger in Crohn's disease and in gut mucosa among associations stronger in ulcerative colitis. The results of this study suggest that high-resolution fine-mapping in large samples can convert many discoveries from genome-wide association studies into statistically convincing causal variants, providing a powerful substrate for experimental elucidation of disease mechanisms.
Asunto(s)
Predisposición Genética a la Enfermedad/genética , Variación Genética/genética , Enfermedades Inflamatorias del Intestino/genética , Sitios de Carácter Cuantitativo/genética , Adolescente , Adulto , Anciano , Anciano de 80 o más Años , Sitios de Unión , Cromatina/genética , Colitis Ulcerosa/genética , Enfermedad de Crohn/genética , Epigénesis Genética/genética , Femenino , Estudio de Asociación del Genoma Completo , Genotipo , Humanos , Desequilibrio de Ligamiento/genética , Masculino , Persona de Mediana Edad , Proteína smad3/genética , Factores de Transcripción/metabolismo , Adulto JovenRESUMEN
BACKGROUND: Genomic selection has been successfully implemented in many livestock and crop species. The genomic best linear unbiased predictor (GBLUP) approach, assigning equal variance to all SNP effects, is one of the reference methods. When large-effect variants contribute to complex traits, it has been shown that genomic prediction methods that assign a higher variance to subsets of SNP effects can achieve higher prediction accuracy. We herein compared the efficiency of several such approaches, including the Adaptive MultiBLUP (AM-BLUP) that uses local genomic relationship matrices (GRM) to automatically identify and weight genomic regions with large effects, to predict genetic merit in Belgian Blue beef cattle. RESULTS: We used a population of approximately 10,000 genotyped cows and their phenotypes for 14 traits, mostly related to muscular development and body dimensions. According to the trait, we found that 4 to 25% of the genetic variance could be associated with 2 to 12 genomic regions harbouring large-effect variants. Noteworthy, three previously identified recessive deleterious variants presented heterozygote advantage and were among the most significant SNPs for several traits. The AM-BLUP resulted in increased reliability of genomic predictions compared to GBLUP (+ 2%), but Bayesian methods proved more efficient (+ 3%). Overall, the reliability gains remained thus limited although higher gains were observed for skin thickness, a trait affected by two genomic regions having particularly large effects. Higher accuracies than those from the original AM-BLUP were achieved when applying the Bayesian Sparse Linear Mixed Model to pre-select groups of SNPs with large effects and subsequently use their estimated variance to build a weighted GRM. Finally, the single-step GBLUP performed best and could be further improved (+ 3% prediction accuracy) by using these weighted GRM. CONCLUSIONS: The AM-BLUP is an attractive method to automatically identify and weight genomic regions with large effects on complex traits. However, the method was less accurate than Bayesian methods. Overall, weighted methods achieved modest accuracy gains compared to GBLUP. Nevertheless, the computational efficiency of the AM-BLUP might be valuable at higher marker density, including with whole-genome sequencing data. Furthermore, weighted GRM are particularly useful to account for large variance loci in the single-step GBLUP.
Asunto(s)
Genoma , Genómica , Animales , Teorema de Bayes , Bélgica , Bovinos/genética , Femenino , Genotipo , Modelos Genéticos , Fenotipo , Polimorfismo de Nucleótido Simple , Reproducibilidad de los ResultadosRESUMEN
BACKGROUND: Inbreeding coefficients can be estimated either from pedigree data or from genomic data, and with genomic data, they are either global or local (when the linkage map is used). Recently, we developed a new hidden Markov model (HMM) that estimates probabilities of homozygosity-by-descent (HBD) at each marker position and automatically partitions autozygosity in multiple age-related classes (based on the length of HBD segments). Our objectives were to: (1) characterize inbreeding with our model in an intensively selected population such as the Belgian Blue Beef (BBB) cattle breed; (2) compare the properties of the model at different marker densities; and (3) compare our model with other methods. RESULTS: When using 600 K single nucleotide polymorphisms (SNPs), the inbreeding coefficient (probability of sampling an HBD locus in an individual) was on average 0.303 (ranging from 0.258 to 0.375). HBD-classes associated to historical ancestors (with small segments ≤ 200 kb) accounted for 21.6% of the genome length (71.4% of the total length of the genome in HBD segments), whereas classes associated to more recent ancestors accounted for only 22.6% of the total length of the genome in HBD segments. However, these recent classes presented more individual variation than more ancient classes. Although inbreeding coefficients obtained with low SNP densities (7 and 32 K) were much lower (0.060 and 0.093), they were highly correlated with those obtained at higher density (r = 0.934 and 0.975, respectively), indicating that they captured most of the individual variation. At higher SNP density, smaller HBD segments are identified and, thus, more past generations can be explored. We observed very high correlations between our estimates and those based on homozygosity (r = 0.95) or on runs-of-homozygosity (r = 0.95). As expected, pedigree-based estimates were mainly correlated with recent HBD-classes (r = 0.56). CONCLUSIONS: Although we observed high levels of autozygosity associated with small HBD segments in BBB cattle, recent inbreeding accounted for most of the individual variation. Recent autozygosity can be captured efficiently with low-density SNP arrays and relatively simple models (e.g., two HBD classes). The HMM framework provides local HBD probabilities that are still useful at lower SNP densities.
Asunto(s)
Bovinos/genética , Genómica/métodos , Endogamia/métodos , Modelos Estadísticos , Polimorfismo de Nucleótido Simple/genética , Animales , Genoma , Genotipo , Homocigoto , Masculino , LinajeRESUMEN
GWAS have identified >200 risk loci for Inflammatory Bowel Disease (IBD). The majority of disease associations are known to be driven by regulatory variants. To identify the putative causative genes that are perturbed by these variants, we generate a large transcriptome data set (nine disease-relevant cell types) and identify 23,650 cis-eQTL. We show that these are determined by â¼9720 regulatory modules, of which â¼3000 operate in multiple tissues and â¼970 on multiple genes. We identify regulatory modules that drive the disease association for 63 of the 200 risk loci, and show that these are enriched in multigenic modules. Based on these analyses, we resequence 45 of the corresponding 100 candidate genes in 6600 Crohn disease (CD) cases and 5500 controls, and show with burden tests that they include likely causative genes. Our analyses indicate that ≥10-fold larger sample sizes will be required to demonstrate the causality of individual genes using this approach.
Asunto(s)
Enfermedades Inflamatorias del Intestino/genética , Herencia Multifactorial , Adulto , Anciano , Anciano de 80 o más Años , Estudios de Cohortes , Enfermedad de Crohn/genética , Femenino , Perfilación de la Expresión Génica , Estudios de Asociación Genética , Predisposición Genética a la Enfermedad , Genotipo , Humanos , Masculino , Persona de Mediana Edad , Polimorfismo de Nucleótido Simple , Sitios de Carácter Cuantitativo , Análisis de Secuencia de ADNRESUMEN
AIMS: To report normal reference ranges for echocardiographic dimensions of the proximal aorta obtained in a large group of healthy volunteers recruited using state-of-the-art cardiac ultrasound equipment, considering different measurement conventions, and taking into account gender, age, and body size of individuals. METHODS AND RESULTS: A total of 704 (mean age: 46.0 ± 13.5 years) healthy volunteers (310 men and 394 women) were prospectively recruited from the collaborating institutions of the Normal Reference Ranges for Echocardiography (NORRE) study. A comprehensive echocardiographic examination was obtained in all subjects following pre-defined protocols. Aortic dimensions were obtained in systole and diastole, following both the leading-edge to leading-edge and the inner-edge to inner-edge conventions. Diameters were measured at four levels: ventricular-arterial junction, sinuses of Valsalva, sino-tubular junction, and proximal tubular ascending aorta. Measures of aortic root in the short-axis view following the orientation of each of the three sinuses were also performed. Men had significantly larger body sizes when compared with women, and showed larger aortic dimensions independently of the measurement method used. Dimensions indexed by height and body surface area are provided, and stratification by age ranges is also displayed. In multivariable analysis, the independent predictors of aortic dimensions were age, gender, and height or body surface area. CONCLUSION: The NORRE study provides normal values of proximal aorta dimensions as assessed by echocardiography. Reference ranges for different anatomical levels using different (i) measurement conventions and (ii) at different times of the cardiac cycle (i.e. mid-systole and end-diastole) are provided. Age, gender, and body size were significant determinants of aortic dimensions.