RESUMO
BackgroundLynch syndrome (LS) is an inherited cancer predisposition syndrome caused by genetic variants affecting DNA mismatch repair (MMR) genes MLH1, MSH2, MSH6 and PMS2 Cancer risk in LS is estimated from cohorts of individuals ascertained by individual or family history of cancer, which may upwardly bias estimates. METHODS: 830 carriers of pathogenic or likely pathogenic (path_MMR) MMR gene variants classified by InSiGHT were identified in 454 756 UK Biobank (UKB) participants using whole-exome sequence. Nelson-Aalen survival analysis was used to estimate cumulative incidence of colorectal, endometrial and breast cancer (BC). RESULTS: Cumulative incidence of colorectal and endometrial cancer (EC) by age 70 years was elevated in path_MMR carriers compared with non-carriers (colorectal: 11.8% (95% confidence interval (CI): 9.5% to 14.6%) vs 1.7% (95% CI: 1.6% to 1.7%), endometrial: 13.4% (95% CI: 10.2% to 17.6%) vs 1.0% (95% CI: 0.9% to 1.0%)), but the magnitude of this increase differed between genes. Cumulative BC incidence by age 70 years was not elevated in path_MMR carriers compared with non-carriers (8.9% (95% CI: 6.3% to 12.4%) vs 7.5% (95% CI: 7.4% to 7.6%)). Cumulative cancer incidence estimates in UKB were similar to estimates from the Prospective Lynch Syndrome Database for all genes and cancers, except there was no evidence for elevated EC risk in carriers of pathogenic PMS2 variants in UKB. CONCLUSION: These results support offering incidentally identified carriers of any path_MMR surveillance to manage colorectal cancer risk. Incidentally identified carriers of pathogenic variants in MLH1, MSH2 and MSH6 would also benefit from interventions to reduce EC risk. The results suggest that BC is not an LS-related cancer.
Assuntos
Bancos de Espécimes Biológicos , Neoplasias Colorretais Hereditárias sem Polipose , Reparo de Erro de Pareamento de DNA , Predisposição Genética para Doença , Humanos , Neoplasias Colorretais Hereditárias sem Polipose/genética , Neoplasias Colorretais Hereditárias sem Polipose/epidemiologia , Feminino , Reino Unido/epidemiologia , Pessoa de Meia-Idade , Reparo de Erro de Pareamento de DNA/genética , Idoso , Heterozigoto , Sequenciamento do Exoma , Incidência , Adulto , Neoplasias da Mama/genética , Neoplasias da Mama/epidemiologia , Neoplasias do Endométrio/genética , Neoplasias do Endométrio/epidemiologia , Proteínas de Ligação a DNA/genética , Proteína 1 Homóloga a MutL/genética , Biobanco do Reino Unido , Proteína 2 Homóloga a MutSRESUMO
Estimates of narrow sense heritability derived from genomic data that contain related individuals may be biased due to the within-family effects such as dominance, epistasis and common environmental factors. However, for many wild populations, removal of related individuals from the data would result in small sample sizes. In 2013, Zaitlen et al. proposed a method to estimate heritability in populations that include close relatives by simultaneously fitting an identity-by-state (IBS) genomic relatedness matrix (GRM) and an identity-by-descent (IBD) GRM. The IBD GRM is identical to the IBS GRM, except relatedness estimates below a specified threshold are set to 0. We applied this method to a sample of 8557 wild Soay sheep from St. Kilda, with genotypic information for 419,281 single nucleotide polymorphisms. We aimed to see how this method would partition heritability into population-level (IBS) and family-associated (IBD) variance for a range of genetic architectures, and so we focused on a mixture of polygenic and monogenic traits. We also implemented a variant of the model in which the IBD GRM was replaced by a GRM constructed from SNPs with low minor allele frequency to examine whether any additive genetic variance is captured by rare alleles. Whilst the inclusion of the IBD GRM did not significantly improve the fit of the model for the monogenic traits, it improved the fit for some of the polygenic traits, suggesting that dominance, epistasis and/or common environment not already captured by the non-genetic random effects fitted in our models may influence these traits.
Assuntos
Genoma , Polimorfismo de Nucleotídeo Único , Humanos , Ovinos/genética , Animais , Linhagem , Genótipo , Genômica , Fenótipo , Carneiro Doméstico/genética , Modelos GenéticosRESUMO
The relative importance between additive and non-additive genetic variance has been widely argued in quantitative genetics. By approaching this question from an evolutionary perspective we show that, while additive variance can be maintained under selection at a low level for some patterns of epistasis, the majority of the genetic variance that will persist is actually non-additive. We propose that one reason that the problem of the "missing heritability" arises is because the additive genetic variation that is estimated to be contributing to the variance of a trait will most likely be an artefact of the non-additive variance that can be maintained over evolutionary time. In addition, it can be shown that even a small reduction in linkage disequilibrium between causal variants and observed SNPs rapidly erodes estimates of epistatic variance, leading to an inflation in the perceived importance of additive effects. We demonstrate that the perception of independent additive effects comprising the majority of the genetic architecture of complex traits is biased upwards and that the search for causal variants in complex traits under selection is potentially underpowered by parameterising for additive effects alone. Given dense SNP panels the detection of causal variants through genome-wide association studies may be improved by searching for epistatic effects explicitly.
Assuntos
Evolução Biológica , Epistasia Genética , Estudo de Associação Genômica Ampla , Seleção Genética/genética , Algoritmos , Frequência do Gene , Humanos , Desequilíbrio de Ligação/genética , Modelos Teóricos , Polimorfismo de Nucleotídeo ÚnicoRESUMO
BACKGROUND: Comparisons of quantitative trait loci (QTL) for growth and parameters of growth curves assist in understanding the genetics and ultimately the physiology of growth. Records of body weight at 3, 6, 12, 24, 48 and 72 weeks of age and growth rate between successive age intervals of about 500 F2 female chickens of the Roslin broiler-layer cross were available for analysis. These data were analysed to detect and compare QTL for body weight, growth rate and parameters of the Gompertz growth function. RESULTS: Over 50 QTL were identified for body weight at specific ages and most were also detected in the nearest preceding and/or subsequent growth stage. The sum of the significant and suggestive additive effects for bodyweight at specific ages accounted for 23-43% of the phenotypic variation. A single QTL for body weight on chromosome 4 at 48 weeks of age had the largest additive effect (550.4 ± 68.0 g, 11.5% of the phenotypic variation) and a QTL at a similar position accounted 14.5% of the phenotypic variation at 12 weeks of age. Age specific QTL for growth rate were detected suggesting that there are specific genes that affect developmental processes during the different stages of growth. Relatively few QTL influencing Gompertz growth curve parameters were detected and overlapped with loci affecting growth rate. Dominance effects were generally not significant but from 12 weeks of age they exceeded the additive effect in a few cases. No evidence for epistatic QTL pairs was found. CONCLUSIONS: The results confirm the location for body weight and body weight gain during growth that were identified in previous studies and were consistent with QTL for the parameters of the Gompertz growth function. Chromosome 4 explained a relatively large proportion of the observed growth variation across the different ages, and also harboured most of the detected QTL for Gompertz parameters, confirming its importance in controlling growth. Very few QTL were detected for body weight or gain at 48 and 72 weeks of age, probably reflecting the effect of differences in reproduction and random environmental effects.
Assuntos
Galinhas/crescimento & desenvolvimento , Galinhas/genética , Locos de Características Quantitativas , Envelhecimento , Animais , Peso Corporal , Galinhas/fisiologia , Feminino , Masculino , Modelos BiológicosRESUMO
Understanding the genetic architecture underpinning quantitative traits in wild populations is pivotal to understanding the processes behind trait evolution. The 'animal model' is a popular method for estimating quantitative genetic parameters such as heritability and genetic correlation and involves fitting an estimate of relatedness between individuals in the study population. Genotypes at genome-wide markers can be used to estimate relatedness; however, relatedness estimates vary with marker density, potentially affecting results. Increasing density of markers is also expected to increase the power to detect quantitative trait loci (QTL). In order to understand how the density of genetic markers affects the results of quantitative genetic analyses, we estimated heritability and performed genome-wide association studies (GWAS) on five body size traits in an unmanaged population of Soay sheep using two different SNP densities: a dataset of 37,037 genotyped SNPs and an imputed dataset of 417,373 SNPs. Heritability estimates did not differ between the two SNP densities, but the high-density imputed SNP dataset revealed four new SNP-trait associations that were not found with the lower density dataset, as well as confirming all previously-found QTL. We also demonstrated that fitting fixed and random effects in the same step as performing GWAS is a more powerful approach than pre-correcting for covariates in a separate model.
RESUMO
Human isolates have been postulated as a good resource for the identification of QTL due to reduced genetic diversity and a more homogeneous environment. Isolates may also have increased linkage disequilibrium (LD) due to small effective population size and, either loss or increase in frequency of alleles that are rare in the general population from which they originate. Here we investigate the difference in allele and genotype frequencies, LD and homozygous tracts between an isolate-several villages from the island of Vis in Croatia-and an outbred population of European origin: the Hapmap CEPH founders. Using the HumanHap300 v1 Genotyping BeadChip, we show that our population does not differ greatly from the reference CEU outbred population despite having a slightly higher proportion of monomorphic loci, a slightly higher long-range LD, and a greater proportion of individuals with long homozygous tracts. We conclude that genotyping arrays should perform equally well in our isolate as in outbred European populations for disease mapping studies and that SNP-trait associations discovered in our well-characterized Croatian isolate should be valid in the general European population from which they descend.
Assuntos
Genética Populacional , Desequilíbrio de Ligação/genética , População Branca/genética , Mapeamento Cromossômico/métodos , Croácia , Efeito Fundador , Frequência do Gene , Variação Genética , Genótipo , Humanos , Modelos Genéticos , Modelos Estatísticos , Locos de Características Quantitativas , UtahRESUMO
Genes for height have gained interest for decades, but only recently have candidate genes started to be identified. We have performed linkage analysis and genome-wide association for height in approximately 4000 individuals from five European populations. A total of five chromosomal regions showed suggestive linkage and in one of these regions, two SNPs (rs849140 and rs1635852) were associated with height (nominal P = 7.0 x 10(-8) and P = 9.6 x 10(-7), respectively). In total, five SNPs across the genome showed an association with height that reached the threshold of genome-wide significance (nominal P < 1.6 x 10(-7)). The association with height was replicated for two SNPs (rs1635852 and rs849140) using three independent studies (n = 31 077, n=1268 and n = 5746) with overall meta P-values of 9.4 x 10(-10) and 5.3 x 10(-8). These SNPs are located in the JAZF1 gene, which has recently been associated with type II diabetes, prostate and endometrial cancer. JAZF1 is a transcriptional repressor of NR2C2, which results in low IGF1 serum concentrations, perinatal and early postnatal hypoglycemia and growth retardation when knocked out in mice. Both the linkage and association analyses independently identified the JAZF1 region affecting human height. We have demonstrated, through replication in additional independent populations, the consistency of the effect of the JAZF1 SNPs on height. Since this gene also has a key function in the metabolism of growth, JAZF1 represents one of the strongest candidates influencing human height identified so far.
Assuntos
Estatura/genética , Ligação Genética , Estudo de Associação Genômica Ampla , Proteínas de Neoplasias/genética , Proteínas Correpressoras , Proteínas de Ligação a DNA , Feminino , Humanos , Masculino , Pessoa de Meia-Idade , Polimorfismo de Nucleotídeo Único , População Branca/genéticaRESUMO
We describe a genome-wide analytical approach, SNP and Haplotype Regional Heritability Mapping (SNHap-RHM), that provides regional estimates of the heritability across locally defined regions in the genome. This approach utilises relationship matrices that are based on sharing of SNP and haplotype alleles at local haplotype blocks delimited by recombination boundaries in the genome. We implemented the approach on simulated data and show that the haplotype-based regional GRMs capture variation that is complementary to that captured by SNP-based regional GRMs, and thus justifying the fitting of the two GRMs jointly in a single analysis (SNHap-RHM). SNHap-RHM captures regions in the genome contributing to the phenotypic variation that existing genome-wide analysis methods may fail to capture. We further demonstrate that there are real benefits to be gained from this approach by applying it to real data from about 20,000 individuals from the Generation Scotland: Scottish Family Health Study. We analysed height and major depressive disorder (MDD). We identified seven genomic regions that are genome-wide significant for height, and three regions significant at a suggestive threshold (p-value < 1 × 10-5) for MDD. These significant regions have genes mapped to within 400 kb of them. The genes mapped for height have been reported to be associated with height in humans. Similarly, those mapped for MDD have been reported to be associated with major depressive disorder and other psychiatry phenotypes. The results show that SNHap-RHM presents an exciting new opportunity to analyse complex traits by allowing the joint mapping of novel genomic regions tagged by either SNPs or haplotypes, potentially leading to the recovery of some of the "missing" heritability.
RESUMO
Genome-wide association (GWA) studies have identified around 20 common genetic variants influencing the risk of type 2 diabetes (T2D). Likewise, a number of variants have been associated with diabetes-related quantitative glycaemic traits, but to date the overlap between these genes and variants has been low. The majority of genetic studies have focused on fasting plasma glucose levels; however, this measure is highly variable. We have conducted a GWA meta-analysis of glycated haemoglobin (HbA1(C) ) levels within three healthy nondiabetic populations. This phenotype provides an estimate of mean glucose levels over 2-3 months and is a more stable predictor of future diabetes risk. Participants were from three isolated populations: the Orkney Isles in the north of Scotland, the Dalmatian islands of Vis, and Korcula in Croatia (total of 1782 nondiabetic subjects). Association was tested in each population and results combined by meta-analysis. The strongest association was with the TCF7L2 gene (rs7903146, P= 1.48 × 10â»7). This is also the strongest common genetic risk factor for T2D but it has not been identified in previous genome-wide studies of glycated haemoglobin.
Assuntos
Diabetes Mellitus Tipo 2/genética , Estudo de Associação Genômica Ampla , Hemoglobinas Glicadas/genética , Proteína 2 Semelhante ao Fator 7 de Transcrição/genética , Adolescente , Adulto , Idoso , Idoso de 80 Anos ou mais , Croácia/epidemiologia , Diabetes Mellitus Tipo 2/sangue , Feminino , Genótipo , Hemoglobinas Glicadas/metabolismo , Humanos , Masculino , Pessoa de Meia-Idade , Fenótipo , Escócia/epidemiologia , Adulto JovemRESUMO
MOTIVATION: Unravelling the genetic architecture of complex traits requires large amounts of data, sophisticated models and large computational resources. The lack of user-friendly software incorporating all these requisites is delaying progress in the analysis of complex traits. METHODS: Linkage disequilibrium and linkage analysis (LDLA) is a high-resolution gene mapping approach based on sophisticated mixed linear models, applicable to any population structure. LDLA can use population history information in addition to pedigree and molecular markers to decompose traits into genetic components. Analyses are distributed in parallel over a large public grid of computers in the UK. RESULTS: We have proven the performance of LDLA with analyses of simulated data. There are real gains in statistical power to detect quantitative trait loci when using historical information compared with traditional linkage analysis. Moreover, the use of a grid of computers significantly increases computational speed, hence allowing analyses that would have been prohibitive on a single computer. AVAILABILITY: The authors have implemented LDLA within the freely available GridQTL software (www.gridqtl.org.uk).
Assuntos
Biologia Computacional/métodos , Ligação Genética/genética , Desequilíbrio de Ligação/genética , Software , Bases de Dados Genéticas , Internet , Locos de Características Quantitativas , Interface Usuário-ComputadorRESUMO
BACKGROUND: Infectious disease of livestock continues to be a cause of substantial economic loss and has adverse welfare consequences in both the developing and developed world. New solutions to control disease are needed and research focused on the genetic loci determining variation in immune-related traits has the potential to deliver solutions. However, identifying selectable markers and the causal genes involved in disease resistance and vaccine response is not straightforward. The aims of this study were to locate regions of the bovine genome that control the immune response post immunisation. 195 F2 and backcross Holstein Charolais cattle were immunised with a 40-mer peptide derived from foot-and-mouth disease virus (FMDV). T cell and antibody (IgG1 and IgG2) responses were measured at several time points post immunisation. All experimental animals (F0, F1 and F2, n = 982) were genotyped with 165 microsatellite markers for the genome scan. RESULTS: Considerable variability in the immune responses across time was observed and sire, dam and age had significant effects on responses at specific time points. There were significant correlations within traits across time, and between IgG1 and IgG2 traits, also some weak correlations were detected between T cell and IgG2 responses. The whole genome scan detected 77 quantitative trait loci (QTL), on 22 chromosomes, including clusters of QTL on BTA 4, 5, 6, 20, 23 and 25. Two QTL reached 5% genome wide significance (on BTA 6 and 24) and one on BTA 20 reached 1% genome wide significance. CONCLUSIONS: A proportion of the variance in the T cell and antibody response post immunisation with an FDMV peptide has a genetic component. Even though the antigen was relatively simple, the humoral and cell mediated responses were clearly under complex genetic control, with the majority of QTL located outside the MHC locus. The results suggest that there may be specific genes or loci that impact on variation in both the primary and secondary immune responses, whereas other loci may be specifically important for early or later phases of the immune response. Future fine mapping of the QTL clusters identified has the potential to reveal the causal variations underlying the variation in immune response observed.
Assuntos
Proteínas do Capsídeo/imunologia , Bovinos/genética , Febre Aftosa/genética , Febre Aftosa/imunologia , Fragmentos de Peptídeos/imunologia , Locos de Características Quantitativas , Animais , Anticorpos Antivirais/sangue , Formação de Anticorpos , Antígenos Virais/imunologia , Feminino , Vírus da Febre Aftosa/imunologia , Genótipo , Imunidade Celular , Imunoglobulina G/sangue , Masculino , Repetições de Microssatélites , Linfócitos T/imunologia , Vacinas Virais/imunologiaRESUMO
INTRODUCTION: Variance component QTL methodology was used to analyse three candidate regions on chicken chromosomes 1, 4 and 5 for dominant and parent-of-origin QTL effects. Data were available for bodyweight and conformation score measured at 40 days from a two-generation commercial broiler dam line. One hundred dams were nested in 46 sires with phenotypes and genotypes on 2708 offspring. Linear models were constructed to simultaneously estimate fixed, polygenic and QTL effects. Different genetic models were compared using likelihood ratio test statistics derived from the comparison of full with reduced or null models. Empirical thresholds were derived by permutation analysis. RESULTS: Dominant QTL were found for bodyweight on chicken chromosome 4 and for bodyweight and conformation score on chicken chromosome 5. Suggestive evidence for a maternally expressed QTL for bodyweight and conformation score was found on chromosome 1 in a region corresponding to orthologous imprinted regions in the human and mouse. CONCLUSION: Initial results suggest that variance component analysis can be applied within commercial populations for the direct detection of segregating dominant and parent of origin effects.
Assuntos
Galinhas/genética , Mapeamento Cromossômico , Impressão Genômica , Locos de Características Quantitativas , Animais , Peso Corporal , Cromossomos/genética , Cruzamentos Genéticos , Feminino , Humanos , Masculino , Camundongos , LinhagemRESUMO
In natural populations, genetic variation affects resistance to disease. Whether that genetic variation comprises lots of small-effect polymorphisms or a small number of large-effect polymorphisms has implications for adaptation, selection and how genetic variation is maintained in populations. Furthermore, how much genetic variation there is, and the genes that underlie this variation, affects models of co-evolution between parasites and their hosts. We are studying the genetic variation that affects the resistance of Drosophila melanogaster to its natural pathogen--the vertically transmitted sigma virus. We have carried out three separate quantitative trait locus mapping analyses to map gene variants on the second chromosome that cause variation in the rate at which males transmit the infection to their offspring. All three crosses identified a locus in a similar chromosomal location that causes a large drop in the rate at which the virus is transmitted. We also found evidence for an additional smaller-effect quantitative trait locus elsewhere on the chromosome. Our data, together with previous experiments on the sigma virus and parasitoid wasps, indicate that the resistance of D. melanogaster to co-evolved pathogens is controlled by a limited number of major-effect polymorphisms.
Assuntos
Drosophila melanogaster/genética , Drosophila melanogaster/virologia , Interações Hospedeiro-Parasita/genética , Locos de Características Quantitativas , Infecções por Rhabdoviridae/genética , Animais , Mapeamento Cromossômico , Genes de Insetos , Genética Populacional , Genótipo , Transmissão Vertical de Doenças Infecciosas , Modelos Lineares , Masculino , Modelos Genéticos , Fenótipo , Polimorfismo Genético , Característica Quantitativa Herdável , Rhabdoviridae/genética , Infecções por Rhabdoviridae/transmissãoRESUMO
Dominance is an important source of variation in complex traits. Here, we have carried out the first thorough investigation of quantitative trait locus (QTL) detection using variance component (VC) models extended to incorporate both additive and dominant QTL effects. Simulation results showed that the empirical distribution of the test statistic when testing for dominant QTL effects did not behave in accordance with existing theoretical expectations and varied with pedigree structure. Extensive simulations were carried out to assess accuracy of estimates, type 1 error and statistical power in two-generation human-, poultry- and pig-type pedigrees each with 1900 progeny in small-, medium- and large-sized families, respectively. The distribution of the likelihood-ratio test statistic was heavily dependent on family structure, with empirical thresholds lower for human pedigrees. Power to detect QTL was high (0.84-1.0) in pig and poultry scenarios for dominance effects accounting for >7% of phenotypic variance but much lower (0.42) in human-type pedigrees. Maternal or common environment effects can be partially confounded with dominance and must be fitted in the QTL model. Including dominance in the QTL model did not affect power to detect additive QTL effects. Also, detection of spurious dominance QTL effects only occurred when maternal effects were not included in the QTL model. When dominance effects were present in the data but not in the analysis model, this resulted in spurious detection of additive QTL or inflated estimates of additive QTL effects. The study demonstrates that dominance can be included routinely in QTL analysis of general pedigrees; however, optimal power is dependent on selection of the appropriate thresholds for pedigree structure.
Assuntos
Linhagem , Locos de Características Quantitativas/genética , Análise de Variância , Animais , Simulação por Computador , Ligação Genética , Humanos , Aves Domésticas/genéticaRESUMO
We propose a novel approach to analyze genomic data that incorporates haplotype information for detecting rare variants within a regional heritability mapping framework. The performance of our approach was tested in a simulation study based on human genotypes. The phenotypes were simulated by generating regional variance using either SNP(s) or haplotype(s). Regional genomic relationship matrices, constructed with either a SNP-based or a haplotype-based estimator, were employed to estimate the regional variance. The results from the study show that haplotype heritability mapping captures the regional effect, with its relative performance decreasing with increasing analysis window size. The SNP-based regional mapping approach often misses the effect of causal haplotype(s); however, it has a greater power to detect simulated SNP-based-variants. Heritability estimates suggest that the haplotype heritability mapping estimates the simulated regional heritability accurately for all phenotypes and analysis windows. However, the SNP-based analysis overestimates the regional heritability and performs less well than our haplotype-based approach for the simulated rare haplotype-based-variant. We conclude that haplotype heritability mapping is a useful tool to capture the effect of rare variants, and explain a proportion of the missing heritability.
Assuntos
Mapeamento Cromossômico/métodos , Genoma Humano/genética , Haplótipos , Modelos Genéticos , Herança Multifatorial/genética , Croácia , Humanos , Polimorfismo de Nucleotídeo Único/genética , Locos de Características Quantitativas/genéticaRESUMO
There is currently considerable interest in genetic analysis of quantitative traits such as blood pressure and body mass index. Despite the fact that these traits change throughout life they are commonly analyzed only at a single time point. The genetic basis of such traits can be better understood by collecting and effectively analyzing longitudinal data. Analyses of these data are complicated by the need to incorporate information from complex pedigree structures and genetic markers. We propose conducting longitudinal quantitative trait locus (QTL) analyses on such data sets by using a flexible random regression estimation technique. The relationship between genetic effects at different ages is efficiently modeled using covariance functions (CFs). Using simulated data we show that the change in genetic effects over time can be well characterized using CFs and that including parameters to model the change in effect with age can provide substantial increases in power to detect QTL compared with repeated measure or univariate techniques. The asymptotic distributions of the methods used are investigated and methods for overcoming the practical difficulties in fitting CFs are discussed. The CF-based techniques should allow efficient multivariate analyses of many data sets in human and natural population genetics.
Assuntos
Modelos Genéticos , Linhagem , Locos de Características Quantitativas , Característica Quantitativa Herdável , Mapeamento Cromossômico/estatística & dados numéricos , Simulação por Computador , HumanosRESUMO
Linkage analysis (either parametric or nonparametric) is commonly applied to identify chromosomal regions using related individuals affected by disease. In complex disease the incomplete relationship between phenotype and genotype can be modeled using a phenocopy parameter, the probability that an individual is affected given they do not carry the disease mutation of interest, and a nonpenetrance parameter, the probability that an individual is not affected given they do carry the disease mutation of interest. If the linkage phase between multiple markers and a putative disease locus is known, then haplotypes carrying the mutation can, in principle, be identified by comparing the chromosome segments that are shared identical-by-descent (IBD) across affected individuals. We consider here the effect of a nonzero phenocopy rate on the linkage peak and hence upon the identification of disease haplotypes that are shared IBD between affected individuals. We show, by theory and computer simulation, that in diseases for which there is a nonzero phenocopy rate, the chromosomal regions identified may not include the true disease locus. We utilize a LOD-1 confidence interval for a widely used nonparametric linkage statistic. We find that in small/moderate samples this confidence interval may be inappropriate. We give specific examples where the phenocopy rates are nonnegligible in some complex diseases. The success of further work to identify the causal mutations underlying the linkage peaks in these diseases will depend on researchers allowing for the presence of phenocopies by examining appropriately wide regions around the initial positive linkage finding.
Assuntos
Ligação Genética , Haplótipos/genética , Modelos Genéticos , Alelos , Simulação por Computador , Marcadores Genéticos , Genótipo , Humanos , Padrões de Herança , Escore Lod , Mutação/genética , Fenótipo , Estatísticas não ParamétricasRESUMO
The Framingham Heart Study offspring cohort, a complex data set with irregularly spaced longitudinal phenotype data, was made available as part of Genetic Analysis Workshop 13. To allow an analysis of all of the data simultaneously, a mixed-model- based random-regression (RR) approach was used. The RR accounted for the variation in genetic effects (including marker-specific quantitative trait locus (QTL) effects) across time by fitting polynomials of age. The use of a mixed model allowed both fixed (such as sex) and random (such as familial environment) effects to be accounted for appropriately. Using this method we performed a QTL analysis of all of the available adult phenotype data (26,106 phenotypic records). In addition to RR, conventional univariate variance component techniques were applied. The traits of interest were BMI, HDLC, total cholesterol, and height. The longitudinal method allowed the characterization of the change in QTL effects with aging. A QTL affecting BMI was shown to act mainly at early ages.
Assuntos
Modelos Estatísticos , Adulto , Filhos Adultos , Fatores Etários , Idoso , Idoso de 80 Anos ou mais , Doenças Cardiovasculares/epidemiologia , Doenças Cardiovasculares/genética , Mapeamento Cromossômico/métodos , Cromossomos Humanos Par 12/genética , Cromossomos Humanos Par 16/genética , Cromossomos Humanos Par 20/genética , Interpretação Estatística de Dados , Feminino , Humanos , Desequilíbrio de Ligação/genética , Estudos Longitudinais , Masculino , Pessoa de Meia-Idade , Herança Multifatorial/genética , Análise Multivariada , Locos de Características Quantitativas/genéticaRESUMO
We surveyed gene-gene interactions (epistasis) in human body mass index (BMI) in four European populations (n<1200) via exhaustive pair-wise genome scans where interactions were computed as F ratios by testing a linear regression model fitting two single-nucleotide polymorphisms (SNPs) with interactions against the one without. Before the association tests, BMI was corrected for sex and age, normalised and adjusted for relatedness. Neither single SNPs nor SNP interactions were genome-wide significant in either cohort based on the consensus threshold (P=5.0E-08) and a Bonferroni corrected threshold (P=1.1E-12), respectively. Next we compared sub genome-wide significant SNP interactions (P<5.0E-08) across cohorts to identify common epistatic signals, where SNPs were annotated to genes to test for gene ontology (GO) enrichment. Among the epistatic genes contributing to the commonly enriched GO terms, 19 were shared across study cohorts of which 15 are previously published genome-wide association loci, including CDH13 (cadherin 13) associated with height and SORCS2 (sortilin-related VPS10 domain containing receptor 2) associated with circulating insulin-like growth factor 1 and binding protein 3. Interactions between the 19 shared epistatic genes and those involving BMI candidate loci (P<5.0E-08) were tested across cohorts and found eight replicated at the SNP level (P<0.05) in at least one cohort, which were further tested and showed limited replication in a separate European population (n>5000). We conclude that genome-wide analysis of epistasis in multiple populations is an effective approach to provide new insights into the genetic regulation of BMI but requires additional efforts to confirm the findings.
Assuntos
Índice de Massa Corporal , Epistasia Genética , Estudo de Associação Genômica Ampla , População Branca/genética , Adolescente , Adulto , Idoso , Idoso de 80 Anos ou mais , Estudos de Coortes , Feminino , Humanos , Masculino , Pessoa de Meia-Idade , Polimorfismo de Nucleotídeo Único , Adulto JovemRESUMO
Genome analysis provides a powerful approach to test for evidence of genetic variation within and between geographical regions and local populations. Copy number variants which comprise insertions, deletions and duplications of genomic sequence provide one such convenient and informative source. Here, we investigate copy number variants from genome wide scans of single nucleotide polymorphisms in three European population isolates, the island of Vis in Croatia, the islands of Orkney in Scotland and the South Tyrol in Italy. We show that whereas the overall copy number variant frequencies are similar between populations, their distribution is highly specific to the population of origin, a finding which is supported by evidence for increased kinship correlation for specific copy number variants within populations.