Your browser doesn't support javascript.
loading
: 20 | 50 | 100
1 - 20 de 141
1.
Commun Biol ; 7(1): 724, 2024 Jun 12.
Article En | MEDLINE | ID: mdl-38866948

Most genetic variants associated with fertility in mammals fall in non-coding regions of the genome and it is unclear how these variants affect fertility. Here we use genome-wide association summary statistics for Heifer puberty (pubertal or not at 600 days) from 27,707 Bos indicus, Bos taurus and crossbred cattle; multi-trait GWAS signals from 2119 indicine cattle for four fertility traits, including days to calving, age at first calving, pregnancy status, and foetus age in weeks (assessed by rectal palpation of the foetus); and expression quantitative trait locus for whole blood from 489 indicine cattle, to identify 87 putatively functional genes affecting cattle fertility. Our analysis reveals a significant overlap between the set of cattle and previously reported human fertility-related genes, impling the existence of a shared pool of genes that regulate fertility in mammals. These findings are crucial for developing approaches to improve fertility in cattle and potentially other mammals.


Fertility , Genome-Wide Association Study , Quantitative Trait Loci , Animals , Cattle/genetics , Fertility/genetics , Genome-Wide Association Study/veterinary , Female , Polymorphism, Single Nucleotide
2.
Nat Genet ; 56(5): 767-777, 2024 May.
Article En | MEDLINE | ID: mdl-38689000

We develop a method, SBayesRC, that integrates genome-wide association study (GWAS) summary statistics with functional genomic annotations to improve polygenic prediction of complex traits. Our method is scalable to whole-genome variant analysis and refines signals from functional annotations by allowing them to affect both causal variant probability and causal effect distribution. We analyze 50 complex traits and diseases using ∼7 million common single-nucleotide polymorphisms (SNPs) and 96 annotations. SBayesRC improves prediction accuracy by 14% in European ancestry and up to 34% in cross-ancestry prediction compared to the baseline method SBayesR, which does not use annotations, and outperforms other methods, including LDpred2, LDpred-funct, MegaPRS, PolyPred-S and PRS-CSx. Investigation of factors affecting prediction accuracy identifies a significant interaction between SNP density and annotation information, suggesting whole-genome sequence variants with annotations may further improve prediction. Functional partitioning analysis highlights a major contribution of evolutionary constrained regions to prediction accuracy and the largest per-SNP contribution from nonsynonymous SNPs.


Genome-Wide Association Study , Molecular Sequence Annotation , Multifactorial Inheritance , Polymorphism, Single Nucleotide , Multifactorial Inheritance/genetics , Genome-Wide Association Study/methods , Humans , Molecular Sequence Annotation/methods , Genomics/methods , Genome, Human , Models, Genetic
3.
Genet Sel Evol ; 56(1): 22, 2024 Mar 28.
Article En | MEDLINE | ID: mdl-38549172

BACKGROUND: Bovine lactoferrin (Lf) is an iron absorbing whey protein with antibacterial, antiviral, and antifungal activity. Lactoferrin is economically valuable and has an extremely variable concentration in milk, partly driven by environmental influences such as milking frequency, involution, or mastitis. A significant genetic influence has also been previously observed to regulate lactoferrin content in milk. Here, we conducted genetic mapping of lactoferrin protein concentration in conjunction with RNA-seq, ChIP-seq, and ATAC-seq data to pinpoint candidate causative variants that regulate lactoferrin concentrations in milk. RESULTS: We identified a highly-significant lactoferrin protein quantitative trait locus (pQTL), as well as a cis lactotransferrin (LTF) expression QTL (cis-eQTL) mapping to the LTF locus. Using ChIP-seq and ATAC-seq datasets representing lactating mammary tissue samples, we also report a number of regions where the openness of chromatin is under genetic influence. Several of these also show highly significant QTL with genetic signatures similar to those highlighted through pQTL and eQTL analysis. By performing correlation analysis between these QTL, we revealed an ATAC-seq peak in the putative promotor region of LTF, that highlights a set of 115 high-frequency variants that are potentially responsible for these effects. One of the 115 variants (rs110000337), which maps within the ATAC-seq peak, was predicted to alter binding sites of transcription factors known to be involved in lactation-related pathways. CONCLUSIONS: Here, we report a regulatory haplotype of 115 variants with conspicuously large impacts on milk lactoferrin concentration. These findings could enable the selection of animals for high-producing specialist herds.


Lactation , Lactoferrin , Milk , Animals , Female , Haplotypes , Lactation/genetics , Lactoferrin/genetics , Lactoferrin/analysis , Lactoferrin/metabolism , Milk/chemistry , Milk/metabolism , Cattle
4.
Cell Genom ; 3(10): 100385, 2023 Oct 11.
Article En | MEDLINE | ID: mdl-37868035

Many quantitative trait loci (QTLs) are in non-coding regions. Therefore, QTLs are assumed to affect gene regulation. Gene expression and RNA splicing are primary steps of transcription, so DNA variants changing gene expression (eVariants) or RNA splicing (sVariants) are expected to significantly affect phenotypes. We quantify the contribution of eVariants and sVariants detected from 16 tissues (n = 4,725) to 37 traits of ∼120,000 cattle (average magnitude of genetic correlation between traits = 0.13). Analyzed in Bayesian mixture models, averaged across 37 traits, cis and trans eVariants and sVariants detected from 16 tissues jointly explain 69.2% (SE = 0.5%) of heritability, 44% more than expected from the same number of random variants. This 69.2% includes an average of 24% from trans e-/sVariants (14% more than expected). Averaged across 56 lipidomic traits, multi-tissue cis and trans e-/sVariants also explain 71.5% (SE = 0.3%) of heritability, demonstrating the essential role of proximal and distal regulatory variants in shaping mammalian phenotypes.

5.
Sci Rep ; 13(1): 15596, 2023 09 20.
Article En | MEDLINE | ID: mdl-37730913

Allele specific expression (ASE) is widespread in many species including cows. Therefore, regulatory regions which control gene expression should show cis-regulatory variation which mirrors this differential expression within the animal. ChIP-seq peaks for histone modifications and transcription factors measure activity at functional regions and the height of some peaks have been shown to correlate across tissues with the expression of particular genes, suggesting these peaks are putative regulatory regions. In this study we identified ASE in the bovine genome in multiple tissues and investigated whether ChIP-seq peaks for four histone modifications and the transcription factor CTCF show allele specific binding (ASB) differences in the same tissues. We then investigate whether peak height and gene expression, which correlates across tissues, also correlates within the animal by investigating whether the direction of ASB in putative regulatory regions, mirrors that of the ASE in the genes they are putatively regulating. We found that ASE and ASB were widespread in the bovine genome but vary in extent between tissues. However, even when the height of a peak was positively correlated across tissues with expression of an exon, ASE of the exon and ASB of the peak were in the same direction only half the time. A likely explanation for this finding is that the correlations between peak height and exon expression do not indicate that the height of the peak causes the extent of exon expression, at least in some cases.


Histone Code , Transcription Factors , Female , Animals , Cattle , Histone Code/genetics , Transcription Factors/genetics , Alleles , Chromatin Immunoprecipitation Sequencing , Exons/genetics
7.
Genet Sel Evol ; 55(1): 9, 2023 Jan 31.
Article En | MEDLINE | ID: mdl-36721111

Studies have demonstrated that structural variants (SV) play a substantial role in the evolution of species and have an impact on Mendelian traits in the genome. However, unlike small variants (< 50 bp), it has been challenging to accurately identify and genotype SV at the population scale using short-read sequencing. Long-read sequencing technologies are becoming competitively priced and can address several of the disadvantages of short-read sequencing for the discovery and genotyping of SV. In livestock species, analysis of SV at the population scale still faces challenges due to the lack of resources, high costs, technological barriers, and computational limitations. In this review, we summarize recent progress in the characterization of SV in the major livestock species, the obstacles that still need to be overcome, as well as the future directions in this growing field. It seems timely that research communities pool resources to build global population-scale long-read sequencing consortiums for the major livestock species for which the application of genomic tools has become cost-effective.


Genomics , Livestock , Animals , Livestock/genetics , Genotype , Phenotype
8.
BMC Genomics ; 23(1): 815, 2022 Dec 08.
Article En | MEDLINE | ID: mdl-36482302

BACKGROUND: Causal variants for complex traits, such as eQTL are often found in non-coding regions of the genome, where they are hypothesised to influence phenotypes by regulating gene expression. Many regulatory regions are marked by histone modifications, which can be assayed by chromatin immunoprecipitation followed by sequencing (ChIP-seq). Sequence reads from ChIP-seq form peaks at putative regulatory regions, which may reflect the amount of regulatory activity at this region. Therefore, eQTL which are also associated with differences in histone modifications are excellent candidate causal variants. RESULTS: We assayed the histone modifications H3K4Me3, H3K4Me1 and H3K27ac and mRNA in the mammary gland of up to 400 animals. We identified QTL for peak height (histone QTL), exon expression (eeQTL), allele specific expression (aseQTL) and allele specific binding (asbQTL). By intersecting these results, we identify variants which may influence gene expression by altering regulatory regions of the genome, and may be causal variants for other traits. Lastly, we find that these variants are found in putative transcription factor binding sites, identifying a mechanism for the effect of many eQTL. CONCLUSIONS: We find that allele specific and traditional QTL analysis often identify the same genetic variants and provide evidence that many eQTL are regulatory variants which alter activity at regulatory regions of the bovine genome. Our work provides methodological and biological updates on how regulatory mechanisms interplay at multi-omics levels.


Histone Code , Multiomics , Cattle/genetics , Animals , Genetic Variation , Gene Expression
9.
Genet Sel Evol ; 54(1): 60, 2022 Sep 06.
Article En | MEDLINE | ID: mdl-36068488

BACKGROUND: Sharing individual phenotype and genotype data between countries is complex and fraught with potential errors, while sharing summary statistics of genome-wide association studies (GWAS) is relatively straightforward, and thus would be especially useful for traits that are expensive or difficult-to-measure, such as feed efficiency. Here we examined: (1) the sharing of individual cow data from international partners; and (2) the use of sequence variants selected from GWAS of international cow data to evaluate the accuracy of genomic estimated breeding values (GEBV) for residual feed intake (RFI) in Australian cows. RESULTS: GEBV for RFI were estimated using genomic best linear unbiased prediction (GBLUP) with 50k or high-density single nucleotide polymorphisms (SNPs), from a training population of 3797 individuals in univariate to trivariate analyses where the three traits were RFI phenotypes calculated using 584 Australian lactating cows (AUSc), 824 growing heifers (AUSh), and 2526 international lactating cows (OVE). Accuracies of GEBV in AUSc were evaluated by either cohort-by-birth-year or fourfold random cross-validations. GEBV of AUSc were also predicted using only the AUS training population with a weighted genomic relationship matrix constructed with SNPs from the 50k array and sequence variants selected from a meta-GWAS that included only international datasets. The genomic heritabilities estimated using the AUSc, OVE and AUSh datasets were moderate, ranging from 0.20 to 0.36. The genetic correlations (rg) of traits between heifers and cows ranged from 0.30 to 0.95 but were associated with large standard errors. The mean accuracies of GEBV in Australian cows were up to 0.32 and almost doubled when either overseas cows, or both overseas cows and AUS heifers were included in the training population. They also increased when selected sequence variants were combined with 50k SNPs, but with a smaller relative increase. CONCLUSIONS: The accuracy of RFI GEBV increased when international data were used or when selected sequence variants were combined with 50k SNP array data. This suggests that if direct sharing of data is not feasible, a meta-analysis of summary GWAS statistics could provide selected SNPs for custom panels to use in genomic selection programs. However, since this finding is based on a small cross-validation study, confirmation through a larger study is recommended.


Cattle , Lactation , Animals , Australia , Cattle/genetics , Female , Genome-Wide Association Study , Genomics , Genotype , Phenotype , Polymorphism, Single Nucleotide
10.
Commun Biol ; 5(1): 661, 2022 07 05.
Article En | MEDLINE | ID: mdl-35790806

Bayesian methods, such as BayesR, for predicting the genetic value or risk of individuals from their genotypes, such as Single Nucleotide Polymorphisms (SNP), are often implemented using a Markov Chain Monte Carlo (MCMC) process. However, the generation of Markov chains is computationally slow. We introduce a form of blocked Gibbs sampling for estimating SNP effects from Markov chains that greatly reduces computational time by sampling each SNP effect iteratively n-times from conditional block posteriors. Subsequent iteration over all blocks m-times produces chains of length m × n. We use this strategy to solve large-scale genomic prediction and fine-mapping problems using the Bayesian MCMC mixed-effects genetic model, BayesR3. We validate the method using simulated data, followed by analysis of empirical dairy cattle data using high dimension milk mid infra-red spectra data as an example of "omics" data and show its use to increase the precision of mapping variants affecting milk, fat, and protein yields relative to a univariate analysis of milk, fat, and protein.


Genome , Genomics , Animals , Bayes Theorem , Cattle , Genomics/methods , Markov Chains , Phenotype
12.
Nat Genet ; 54(7): 934-939, 2022 07.
Article En | MEDLINE | ID: mdl-35817969

The quantitative geneticist W. G. ('Bill') Hill, awardee of the 2018 Darwin Medal of the Royal Society and the 2019 Mendel Medal of the Genetics Society (United Kingdom), died on 17 December 2021 at the age of 81 years. Here, we pay tribute to his multiple key scientific contributions, which span population and evolutionary genetics, animal and plant breeding and human genetics. We discuss his theoretical research on the role of linkage disequilibrium (LD) and mutational variance in the response to selection, the origin of the widely used LD metric r2 in genomic association studies, the genetic architecture of complex traits, the quantification of the variation in realized relationships given a pedigree relationship and much more. We demonstrate that basic theoretical research in quantitative and statistical genetics has led to profound insights into the genetics and evolution of complex traits and made predictions that were subsequently empirically validated, often decades later.


Genome , Plant Breeding , Animals , Genome-Wide Association Study , Genomics , Humans , Linkage Disequilibrium
13.
Eur Heart J ; 43(19): 1864-1877, 2022 05 14.
Article En | MEDLINE | ID: mdl-35567557

AIMS: Inflammation is a key factor in atherosclerosis. The transcription factor interferon regulatory factor-5 (IRF5) drives macrophages towards a pro-inflammatory state. We investigated the role of IRF5 in human atherosclerosis and plaque stability. METHODS AND RESULTS: Bulk RNA sequencing from the Carotid Plaque Imaging Project biobank were used to mine associations between major macrophage associated genes and transcription factors and human symptomatic carotid disease. Immunohistochemistry, proximity extension assays, and Helios cytometry by time of flight (CyTOF) were used for validation. The effect of IRF5 deficiency on carotid plaque phenotype and rupture in ApoE-/- mice was studied in an inducible model of plaque rupture. Interferon regulatory factor-5 and ITGAX/CD11c were identified as the macrophage associated genes with the strongest associations with symptomatic carotid disease. Expression of IRF5 and ITGAX/CD11c correlated with the vulnerability index, pro-inflammatory plaque cytokine levels, necrotic core area, and with each other. Macrophages were the predominant CD11c-expressing immune cells in the plaque by CyTOF and immunohistochemistry. Interferon regulatory factor-5 immunopositive areas were predominantly found within CD11c+ areas with a predilection for the shoulder region, the area of the human plaque most prone to rupture. Accordingly, an inducible plaque rupture model of ApoE-/-Irf5-/- mice had significantly lower frequencies of carotid plaque ruptures, smaller necrotic cores, and less CD11c+ macrophages than their IRF5-competent counterparts. CONCLUSION: Using complementary evidence from data from human carotid endarterectomies and a murine model of inducible rupture of carotid artery plaque in IRF5-deficient mice, we demonstrate a mechanistic link between the pro-inflammatory transcription factor IRF5, macrophage phenotype, plaque inflammation, and its vulnerability to rupture.


Atherosclerosis , Interferon Regulatory Factors , Macrophages , Plaque, Atherosclerotic , Animals , Apolipoproteins E/genetics , Atherosclerosis/metabolism , Atherosclerosis/pathology , Humans , Inflammation/metabolism , Interferon Regulatory Factors/metabolism , Macrophages/immunology , Mice , Necrosis , Plaque, Atherosclerotic/metabolism , Plaque, Atherosclerotic/pathology
15.
Nat Genet ; 54(3): 263-273, 2022 03.
Article En | MEDLINE | ID: mdl-35256806

Analyses of data from genome-wide association studies on unrelated individuals have shown that, for human traits and diseases, approximately one-third to two-thirds of heritability is captured by common SNPs. However, it is not known whether the remaining heritability is due to the imperfect tagging of causal variants by common SNPs, in particular whether the causal variants are rare, or whether it is overestimated due to bias in inference from pedigree data. Here we estimated heritability for height and body mass index (BMI) from whole-genome sequence data on 25,465 unrelated individuals of European ancestry. The estimated heritability was 0.68 (standard error 0.10) for height and 0.30 (standard error 0.10) for body mass index. Low minor allele frequency variants in low linkage disequilibrium (LD) with neighboring variants were enriched for heritability, to a greater extent for protein-altering variants, consistent with negative selection. Our results imply that rare variants, in particular those in regions of low linkage disequilibrium, are a major source of the still missing heritability of complex traits and disease.


Genome-Wide Association Study , Multifactorial Inheritance , Alleles , Genome-Wide Association Study/methods , Humans , Linkage Disequilibrium , Multifactorial Inheritance/genetics , Polymorphism, Single Nucleotide/genetics
16.
Evolution ; 76(4): 817-820, 2022 Apr.
Article En | MEDLINE | ID: mdl-35192732
17.
Nat Commun ; 13(1): 660, 2022 02 03.
Article En | MEDLINE | ID: mdl-35115518

Many traits are subject to assortative mating, with recent molecular genetic findings confirming longstanding theoretical predictions that assortative mating induces long range dependence across causal variants. However, all marker-based heritability estimators implicitly assume mating is random. We provide mathematical and simulation-based evidence demonstrating that both method-of-moments and likelihood-based estimators are biased in the presence of assortative mating and derive corrected heritability estimators for traits subject to assortment. Finally, we demonstrate that the empirical patterns of estimates across methods and sample sizes for real traits subject to assortative mating are congruent with expected assortative mating-induced biases. For example, marker-based heritability estimates for height are 14% - 23% higher than corrected estimates using UK Biobank data.


Algorithms , Genetics, Population/methods , Models, Genetic , Reproduction/genetics , Bias , Computer Simulation , Female , Genome-Wide Association Study/methods , Humans , Likelihood Functions , Linkage Disequilibrium , Male , Mendelian Randomization Analysis/methods , Phenotype , Polymorphism, Single Nucleotide , Quantitative Trait, Heritable
18.
Nat Commun ; 13(1): 215, 2022 01 11.
Article En | MEDLINE | ID: mdl-35017526

Macrophages are integral to the pathogenesis of atherosclerosis, but the contribution of distinct macrophage subsets to disease remains poorly defined. Using single cell technologies and conditional ablation via a LysMCre+ Clec4a2flox/DTR mouse strain, we demonstrate that the expression of the C-type lectin receptor CLEC4A2 is a distinguishing feature of vascular resident macrophages endowed with athero-protective properties. Through genetic deletion and competitive bone marrow chimera experiments, we identify CLEC4A2 as an intrinsic regulator of macrophage tissue adaptation by promoting a bias in monocyte-to-macrophage in situ differentiation towards colony stimulating factor 1 (CSF1) in vascular health and disease. During atherogenesis, CLEC4A2 deficiency results in loss of resident vascular macrophages and their homeostatic properties causing dysfunctional cholesterol metabolism and enhanced toll-like receptor triggering, exacerbating disease. Our study demonstrates that CLEC4A2 licenses monocytes to join the vascular resident macrophage pool, and that CLEC4A2-mediated macrophage homeostasis is critical to combat cardiovascular disease.


Apolipoproteins E/genetics , Atherosclerosis/genetics , Blood Vessels/metabolism , Lectins, C-Type/genetics , Macrophages/metabolism , Animals , Apolipoproteins E/deficiency , Atherosclerosis/metabolism , Atherosclerosis/pathology , Blood Vessels/pathology , Bone Marrow Cells/metabolism , Bone Marrow Cells/pathology , Cell Death/genetics , Cell Differentiation , Cell Lineage/genetics , Cholesterol/metabolism , Disease Models, Animal , Gene Expression Regulation , Homeostasis/genetics , Humans , Lectins, C-Type/deficiency , Macrophage Colony-Stimulating Factor/genetics , Macrophage Colony-Stimulating Factor/metabolism , Macrophages/pathology , Mice , Mice, Inbred C57BL , Mice, Knockout , Monocytes/metabolism , Monocytes/pathology , Signal Transduction , Single-Cell Analysis
19.
Commun Biol ; 4(1): 1353, 2021 12 02.
Article En | MEDLINE | ID: mdl-34857886

Mutant alleles (MAs) that have been classically recognised have large effects on phenotype and tend to be deleterious to traits and fitness. Is this the case for mutations with small effects? We infer MAs for 8 million sequence variants in 113k cattle and quantify the effects of MA on 37 complex traits. Heterozygosity for variants at genomic sites conserved across 100 vertebrate species increase fertility, stature, and milk production, positively associating these traits with fitness. MAs decrease stature and fat and protein concentration in milk, but increase gestation length and somatic cell count in milk (the latter indicative of mastitis). However, the frequency of MAs decreasing stature and fat and protein concentration, increasing gestation length and somatic cell count were lower than the frequency of MAs with the opposite effect. These results suggest bias in the mutations direction of effect (e.g. towards reduced protein in milk), but selection operating to reduce the frequency of these MAs. Taken together, our results imply two classes of genomic sites subject to long-term selection: sites conserved across vertebrates show hybrid vigour while sites subject to less long-term selection show a bias in mutation towards undesirable alleles.


Alleles , Cattle/genetics , Genetic Fitness , Mutation , Phenotype , Animals , Body Size/genetics , Female , Glycolipids/genetics , Glycoproteins/genetics , Lipid Droplets , Milk/chemistry , Milk Proteins/genetics , Pregnancy
...