Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 64
Filtrar
1.
BMC Bioinformatics ; 25(1): 202, 2024 May 30.
Artigo em Inglês | MEDLINE | ID: mdl-38816801

RESUMO

INTODUCTION: In systems biology, an organism is viewed as a system of interconnected molecular entities. To understand the functioning of organisms it is essential to integrate information about the variations in the concentrations of those molecular entities. This information can be structured as a set of networks with interconnections and with some hierarchical relations between them. Few methods exist for the reconstruction of integrative networks. OBJECTIVE: In this work, we propose an integrative network reconstruction method in which the network organization for a particular type of omics data is guided by the network structure of a related type of omics data upstream in the omic cascade. The structure of these guiding data can be either already known or be estimated from the guiding data themselves. METHODS: The method consists of three steps. First a network structure for the guiding data should be provided. Next, responses in the target set are regressed on the full set of predictors in the guiding data with a Lasso penalty to reduce the number of predictors and an L2 penalty on the differences between coefficients for predictors that share edges in the network for the guiding data. Finally, a network is reconstructed on the fitted target responses as functions of the predictors in the guiding data. This way we condition the target network on the network of the guiding data. CONCLUSIONS: We illustrate our approach on two examples in Arabidopsis. The method detects groups of metabolites that have a similar genetic or transcriptomic basis.


Assuntos
Arabidopsis , Arabidopsis/genética , Arabidopsis/metabolismo , Biologia de Sistemas/métodos , Redes Reguladoras de Genes , Algoritmos , Biologia Computacional/métodos , Multiômica
2.
BMC Plant Biol ; 24(1): 562, 2024 Jun 15.
Artigo em Inglês | MEDLINE | ID: mdl-38877425

RESUMO

BACKGROUND: On tropical regions, phosphorus (P) fixation onto aluminum and iron oxides in soil clays restricts P diffusion from the soil to the root surface, limiting crop yields. While increased root surface area favors P uptake under low-P availability, the relationship between the three-dimensional arrangement of the root system and P efficiency remains elusive. Here, we simultaneously assessed allelic effects of loci associated with a variety of root and P efficiency traits, in addition to grain yield under low-P availability, using multi-trait genome-wide association. We also set out to establish the relationship between root architectural traits assessed in hydroponics and in a low-P soil. Our goal was to better understand the influence of root morphology and architecture in sorghum performance under low-P availability. RESULT: In general, the same alleles of associated SNPs increased root and P efficiency traits including grain yield in a low-P soil. We found that sorghum P efficiency relies on pleiotropic loci affecting root traits, which enhance grain yield under low-P availability. Root systems with enhanced surface area stemming from lateral root proliferation mostly up to 40 cm soil depth are important for sorghum adaptation to low-P soils, indicating that differences in root morphology leading to enhanced P uptake occur exactly in the soil layer where P is found at the highest concentration. CONCLUSION: Integrated QTLs detected in different mapping populations now provide a comprehensive molecular genetic framework for P efficiency studies in sorghum. This indicated extensive conservation of P efficiency QTL across populations and emphasized the terminal portion of chromosome 3 as an important region for P efficiency in sorghum. Increases in root surface area via enhancement of lateral root development is a relevant trait for sorghum low-P soil adaptation, impacting the overall architecture of the sorghum root system. In turn, particularly concerning the critical trait for water and nutrient uptake, root surface area, root system development in deeper soil layers does not occur at the expense of shallow rooting, which may be a key reason leading to the distinctive sorghum adaptation to tropical soils with multiple abiotic stresses including low P availability and drought.


Assuntos
Estudo de Associação Genômica Ampla , Fósforo , Raízes de Plantas , Locos de Características Quantitativas , Sorghum , Sorghum/genética , Sorghum/metabolismo , Sorghum/crescimento & desenvolvimento , Fósforo/metabolismo , Raízes de Plantas/crescimento & desenvolvimento , Raízes de Plantas/metabolismo , Raízes de Plantas/genética , Raízes de Plantas/anatomia & histologia , Mapeamento Cromossômico , Polimorfismo de Nucleotídeo Único , Solo/química , Fenótipo
3.
Heredity (Edinb) ; 2024 Jul 09.
Artigo em Inglês | MEDLINE | ID: mdl-38982296

RESUMO

Chromosome substitution lines (CSLs) are tentatively supreme resources to investigate non-allelic genetic interactions. However, the difficulty of generating such lines in most species largely yielded imperfect CSL panels, prohibiting a systematic dissection of epistasis. Here, we present the development and use of a unique and complete panel of CSLs in Arabidopsis thaliana, allowing the full factorial analysis of epistatic interactions. A first comparison of reciprocal single chromosome substitutions revealed a dependency of QTL detection on different genetic backgrounds. The subsequent analysis of the complete panel of CSLs enabled the mapping of the genetic interactors and identified multiple two- and three-way interactions for different traits. Some of the detected epistatic effects were as large as any observed main effect, illustrating the impact of epistasis on quantitative trait variation. We, therefore, have demonstrated the high power of detection and mapping of genome-wide epistasis, confirming the assumed supremacy of comprehensive CSL sets.

4.
Bioinformatics ; 38(22): 5134-5136, 2022 11 15.
Artigo em Inglês | MEDLINE | ID: mdl-36193999

RESUMO

MOTIVATION: Multi-parent populations (MPPs) are popular for QTL mapping because they combine wide genetic diversity in parents with easy control of population structure, but a limited number of software tools for QTL mapping are specifically developed for general MPP designs. RESULTS: We developed an R package called statgenMPP, adopting a unified identity-by-descent (IBD)-based mixed model approach for QTL analysis in MPPs. The package offers easy-to-use functionalities of IBD calculations, mixed model solutions and visualizations for QTL mapping in a wide range of MPP designs, including diallele, nested-association mapping populations, multi-parent advanced genetic inter-cross populations and other complicated MPPs with known crossing schemes. AVAILABILITY AND IMPLEMENTATION: The R package statgenMPP is open-source and freely available on CRAN at https://CRAN.R-project.org/package=statgenMPP. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Software , Mapeamento Cromossômico
5.
Theor Appl Genet ; 135(6): 2059-2082, 2022 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-35524815

RESUMO

KEY MESSAGE: We evaluate self-organizing maps (SOM) to identify adaptation zones and visualize multi-environment genotypic responses. We apply SOM to multiple traits and crop growth model output of large-scale European sunflower data. Genotype-by-environment interactions (G × E) complicate the selection of well-adapted varieties. A possible solution is to group trial locations into adaptation zones with G × E occurring mainly between zones. By selecting for good performance inside those zones, response to selection is increased. In this paper, we present a two-step procedure to identify adaptation zones that starts from a self-organizing map (SOM). In the SOM, trials across locations and years are assigned to groups, called units, that are organized on a two-dimensional grid. Units that are further apart contain more distinct trials. In an iterative process of reweighting trial contributions to units, the grid configuration is learnt simultaneously with the trial assignment to units. An aggregation of the units in the SOM by hierarchical clustering then produces environment types, i.e. trials with similar growing conditions. Adaptation zones can subsequently be identified by grouping trial locations with similar distributions of environment types across years. For the construction of SOMs, multiple data types can be combined. We compared environment types and adaptation zones obtained for European sunflower from quantitative traits like yield, oil content, phenology and disease scores with those obtained from environmental indices calculated with the crop growth model Sunflo. We also show how results are affected by input data organization and user-defined weights for genotypes and traits. Adaptation zones for European sunflower as identified by our SOM-based strategy captured substantial genotype-by-location interaction and pointed to trials in Spain, Turkey and South Bulgaria as inducing different genotypic responses.


Assuntos
Helianthus , Adaptação Fisiológica , Algoritmos , Análise por Conglomerados , Genótipo , Helianthus/genética
6.
Mol Breed ; 42(12): 76, 2022 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-37313326

RESUMO

Genome-wide association studies (GWAS) are a useful tool to unravel the genetic architecture of complex traits, but the results can be difficult to interpret. Population structure, genetic heterogeneity, and rare alleles easily result in false positive or false negative associations. This paper describes the analysis of a GWAS panel combined with three bi-parental mapping populations to validate GWAS results, using phenotypic data for steroidal glycoalkaloid (SGA) accumulation and the ratio (SGR) between the two major glycoalkaloids α-solanine and α-chaconine in potato tubers. SGAs are secondary metabolites in the Solanaceae family, functional as a defence against various pests and pathogens and in high quantities toxic for humans. With GWAS, we identified five quantitative trait loci (QTL) of which Sga1.1, Sgr8.1, and Sga11.1 were validated, but not Sga3.1 and Sgr7.1. In the bi-parental populations, Sga5.1 and Sga7.1 were mapped, but these were not identified with GWAS. The QTLs Sga1.1, Sga7.1, Sgr7.1, and Sgr8.1 co-localize with genes GAME9, GAME 6/GAME 11, SGT1, and SGT2, respectively. For other genes involved in SGA synthesis, no QTLs were identified. The results of this study illustrate a number of pitfalls in GWAS of which population structure seems the most important. We also show that introgression breeding for disease resistance has introduced new haplotypes to the gene pool involved in higher SGA levels in certain pedigrees. Finally, we show that high SGA levels remain unpredictable in potato but that α-solanine/α-chaconine ratio has a predictable outcome with specific SGT1 and SGT2 haplotypes. Supplementary Information: The online version contains supplementary material available at 10.1007/s11032-022-01344-2.

7.
Theor Appl Genet ; 134(11): 3643-3660, 2021 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-34342658

RESUMO

KEY MESSAGE: The identity-by-descent (IBD)-based mixed model approach introduced in this study can detect quantitative trait loci (QTLs) referring to the parental origin and simultaneously account for multilevel relatedness of individuals within and across families. This unified approach is proved to be a powerful approach for all kinds of multiparental population (MPP) designs. Multiparental populations (MPPs) have become popular for quantitative trait loci (QTL) detection. Tools for QTL mapping in MPPs are mostly developed for specific MPPs and do not generalize well to other MPPs. We present an IBD-based mixed model approach for QTL mapping in all kinds of MPP designs, e.g., diallel, Nested Association Mapping (NAM), and Multiparental Advanced Generation Intercross (MAGIC) designs. The first step is to compute identity-by-descent (IBD) probabilities using a general Hidden Markov model framework, called reconstructing ancestry blocks bit by bit (RABBIT). Next, functions of IBD information are used as design matrices, or genetic predictors, in a mixed model approach to estimate variance components for multiallelic genetic effects associated with parents. Family-specific residual genetic effects are added, and a polygenic effect is structured by kinship relations between individuals. Case studies of simulated diallel, NAM, and MAGIC designs proved that the advanced IBD-based multi-QTL mixed model approach incorporating both kinship relations and family-specific residual variances (IBD.MQMkin_F) is robust across a variety of MPP designs and allele segregation patterns in comparison to a widely used benchmark association mapping method, and in most cases, outperformed or behaved at least as well as other tools developed for specific MPP designs in terms of mapping power and resolution. Successful analyses of real data cases confirmed the wide applicability of our IBD-based mixed model methodology.


Assuntos
Mapeamento Cromossômico , Modelos Genéticos , Locos de Características Quantitativas , Alelos , Simulação por Computador , Modelos Lineares , Cadeias de Markov , Plantas/genética
8.
Theor Appl Genet ; 134(3): 897-908, 2021 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-33367942

RESUMO

Much has been published on QTL detection for complex traits using bi-parental and multi-parental crosses (linkage analysis) or diversity panels (GWAS studies). While successful for detection, transferability of results to real applications has proven more difficult. Here, we combined a QTL detection approach using a pre-breeding populations which utilized intensive phenotypic selection for the target trait across multiple plant generations, combined with rapid generation turnover (i.e. "speed breeding") to allow cycling of multiple plant generations each year. The reasoning is that QTL mapping information would complement the selection process by identifying the genome regions under selection within the relevant germplasm. Questions to answer were the location of the genomic regions determining response to selection and the origin of the favourable alleles within the pedigree. We used data from a pre-breeding program that aimed at pyramiding different resistance sources to Fusarium crown rot into elite (but susceptible) wheat backgrounds. The population resulted from a complex backcrossing scheme involving multiple resistance donors and multiple elite backgrounds, akin to a MAGIC population (985 genotypes in total, with founders, and two major offspring layers within the pedigree). A significant increase in the resistance level was observed (i.e. a positive response to selection) after the selection process, and 17 regions significantly associated with that response were identified using a GWAS approach. Those regions included known QTL as well as potentially novel regions contributing resistance to Fusarium crown rot. In addition, we were able to trace back the sources of the favourable alleles for each QTL. We demonstrate that QTL detection using breeding populations under selection for the target trait can identify QTL controlling the target trait and that the frequency of the favourable alleles was increased as a response to selection, thereby validating the QTL detected. This is a valuable opportunistic approach that can provide QTL information that is more easily transferred to breeding applications.


Assuntos
Resistência à Doença/genética , Fusarium/fisiologia , Marcadores Genéticos , Melhoramento Vegetal , Doenças das Plantas/genética , Locos de Características Quantitativas , Triticum/genética , Alelos , Mapeamento Cromossômico/métodos , Cromossomos de Plantas/genética , Resistência à Doença/imunologia , Ligação Genética , Doenças das Plantas/microbiologia , Triticum/imunologia , Triticum/microbiologia
9.
Plant J ; 99(6): 1172-1191, 2019 09.
Artigo em Inglês | MEDLINE | ID: mdl-31108005

RESUMO

Broadening the genetic base of crops is crucial for developing varieties to respond to global agricultural challenges such as climate change. Here, we analysed a diverse panel of 371 domesticated lines of the model crop barley to explore the genetics of crop adaptation. We first collected exome sequence data and phenotypes of key life history traits from contrasting multi-environment common garden trials. Then we applied refined statistical methods, including some based on exomic haplotype states, for genotype-by-environment (G×E) modelling. Sub-populations defined from exomic profiles were coincident with barley's biology, geography and history, and explained a high proportion of trial phenotypic variance. Clear G×E interactions indicated adaptation profiles that varied for landraces and cultivars. Exploration of circadian clock-related genes, associated with the environmentally adaptive days to heading trait (crucial for the crop's spread from the Fertile Crescent), illustrated complexities in G×E effect directions, and the importance of latitudinally based genic context in the expression of large-effect alleles. Our analysis supports a gene-level scientific understanding of crop adaption and leads to practical opportunities for crop improvement, allowing the prioritisation of genomic regions and particular sets of lines for breeding efforts seeking to cope with climate change and other stresses.


Assuntos
Aclimatação/genética , Produtos Agrícolas/genética , Exoma , Hordeum/genética , Relógios Circadianos/genética , Variação Genética , Estudo de Associação Genômica Ampla , Genótipo , Geografia , Haplótipos , Desequilíbrio de Ligação , Fenótipo , Melhoramento Vegetal , Polimorfismo de Nucleotídeo Único , Locos de Características Quantitativas , Sequenciamento do Exoma
10.
J Nutr ; 150(3): 634-643, 2020 03 01.
Artigo em Inglês | MEDLINE | ID: mdl-31858107

RESUMO

BACKGROUND: In nutritional epidemiology, dealing with confounding and complex internutrient relations are major challenges. An often-used approach is dietary pattern analyses, such as principal component analysis, to deal with internutrient correlations, and to more closely resemble the true way nutrients are consumed. However, despite these improvements, these approaches still require subjective decisions in the preselection of food groups. Moreover, they do not make efficient use of multivariate dietary data, because they detect only marginal associations. We propose the use of copula graphical models (CGMs) to model and make statistical inferences regarding complex associations among variables in multivariate data, where associations between all variables can be learned simultaneously. OBJECTIVE: We aimed to reconstruct nutritional intake and physical functioning networks in Dutch older adults by applying a CGM. METHODS: We addressed this issue by uncovering the pairwise associations between variables while correcting for the effect of remaining variables. More specifically, we used a CGM to infer the precision matrix, which contains all the conditional independence relations between nodes in the graph. The nonzero elements of the precision matrix indicate the presence of a direct association. We applied this method to reconstruct nutrient-physical functioning networks from the combined data of 4 studies (Nu-Age, ProMuscle, ProMO, and V-Fit, total n = 662, mean ± SD age = 75 ± 7 y). The method was implemented in the R package nutriNetwork which is freely available at https://cran.r-project.org/web/packages/nutriNetwork. RESULTS: Greater intakes of vegetable protein and vitamin B-6 were partially correlated with higher scores on the total Short Physical Performance Battery (SPPB) and the chair rise test. Greater intakes of vitamin B-12 and folate were partially correlated with higher scores on the chair rise test and the total SPPB, respectively. CONCLUSIONS: We determined that vegetable protein, vitamin B-6, folate, and vitamin B-12 intakes are partially correlated with improved functional outcome measurements in Dutch older adults.


Assuntos
Ácido Fólico/administração & dosagem , Modelos Teóricos , Desempenho Físico Funcional , Proteínas de Vegetais Comestíveis/administração & dosagem , Vitamina B 12/administração & dosagem , Vitamina B 6/administração & dosagem , Idoso , Idoso de 80 Anos ou mais , Índice de Massa Corporal , Idoso Fragilizado , Humanos , Países Baixos
11.
Theor Appl Genet ; 133(3): 1009-1018, 2020 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-31907563

RESUMO

KEY MESSAGE: Multi-environment models using marker-based kinship information for both additive and dominance effects can accurately predict hybrid performance in different environments. Sorghum is an important hybrid crop that is grown extensively in many subtropical and tropical regions including Northern NSW and Queensland in Australia. The highly varying weather patterns in the Australian summer months mean that sorghum hybrids exhibit a great deal of variation in yield between locations. To ultimately enable prediction of the outcome of crossing parental lines, both additive effects on yield performance and dominance interaction effects need to be characterised. This paper demonstrates that fitting a linear mixed model that includes both types of effects calculated using genetic markers in relationship matrices improves predictions. Genotype by environment interactions was investigated by comparing FA1 (single-factor analytic) and FA2 (two-factor analytic) structures. The G×E causes a change in hybrid rankings between trials with a difference of up to 25% of the hybrids in the top 10% of each trial. The prediction accuracies increased with the addition of the dominance term (over and above that achieved with an additive effect alone) by an average of 15% and a maximum of 60%. The percentage of dominance of the total genetic variance varied between trials with the trials with higher broad-sense heritability having the greater percentage of dominance. The inclusion of dominance in the factor analytic models improves the accuracy of the additive effects. Breeders selecting high yielding parents for crossing need to be aware of effects due to environment and dominance.


Assuntos
Melhoramento Vegetal , Sorghum/genética , Austrália , Clima , Epistasia Genética , Genes Dominantes , Estudos de Associação Genética , Marcadores Genéticos , Variação Genética , Genômica , Genótipo , Modelos Genéticos , Linhagem , Fenótipo , Polimorfismo de Nucleotídeo Único , Seleção Genética , Sorghum/crescimento & desenvolvimento
12.
Theor Appl Genet ; 132(7): 2055-2067, 2019 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-30968160

RESUMO

KEY MESSAGE: The use of a kinship matrix integrating pedigree- and marker-based relationships optimized the performance of genomic prediction in sorghum, especially for traits of lower heritability. Selection based on genome-wide markers has become an active breeding strategy in crops. Genomic prediction models can make use of pedigree information to account for the residual polygenic effects not captured by markers. Our aim was to evaluate the impact of using pedigree and genomic information on prediction quality of breeding values for different traits in sorghum. We explored BLUP models that use weighted combinations of pedigree and genomic relationship matrices. The optimal weighting factor was empirically determined in order to maximize predictive ability after evaluating a range of candidate weights. The phenotypic data consisted of testcross evaluations of sorghum parental lines across multiple environments. All lines were genotyped, and full pedigree information was available. The performance of the best predictive combined matrix was compared to that of models fitting the component matrices independently. Model performance was assessed using cross-validation technique. Fitting a combined pedigree-genomic matrix with the optimal weight always yielded the largest increases in predictive ability and the largest reductions in prediction bias relative to the simple G-BLUP. However, the weight that optimized prediction varied across traits. The benefits of including pedigree information in the genomic model were more relevant for traits with lower heritability, such as grain yield and stay-green. Our results suggest that the combination of pedigree and genomic relatedness can be used to optimize predictions of complex traits in crops when the additive variation is not fully explained by markers.


Assuntos
Genômica/métodos , Modelos Genéticos , Linhagem , Melhoramento Vegetal , Sorghum/genética , Genótipo , Fenótipo
13.
Genet Sel Evol ; 51(1): 2, 2019 Jan 24.
Artigo em Inglês | MEDLINE | ID: mdl-30678638

RESUMO

BACKGROUND: Use of whole-genome sequence data (WGS) is expected to improve identification of quantitative trait loci (QTL). However, this requires imputation to WGS, often with a limited number of sequenced animals for the target population. The objective of this study was to investigate imputation to WGS in two pig lines using a multi-line reference population and, subsequently, to investigate the effect of using these imputed WGS (iWGS) for GWAS. METHODS: Phenotypes and genotypes were available on 12,184 Large White pigs (LW-line) and 4943 Dutch Landrace pigs (DL-line). Imputed 660 K and 80 K genotypes for the LW-line and DL-line, respectively, were imputed to iWGS using Beagle v.4.1. Since only 32 LW-line and 12 DL-line boars were sequenced, 142 animals from eight commercial lines were added. GWAS were performed for each line using the 80 K and 660 K SNPs, the genotype scores of iWGS SNPs that had an imputation accuracy (Beagle R2) higher than 0.6, and the dosage scores of all iWGS SNPs. RESULTS: For the DL-line (LW-line), imputation of 80 K genotypes to iWGS resulted in an average Beagle R2 of 0.39 (0.49). After quality control, 2.5 × 106 (3.5 × 106) SNPs had a Beagle R2 higher than 0.6, resulting in an average Beagle R2 of 0.83 (0.93). Compared to the 80 K and 660 K genotypes, using iWGS led to the identification of 48.9 and 64.4% more QTL regions, for the DL-line and LW-line, respectively, and the most significant SNPs in the QTL regions explained a higher proportion of phenotypic variance. Using dosage instead of genotype scores improved the identification of QTL, because the model accounted for uncertainty of imputation, and all SNPs were used in the analysis. CONCLUSIONS: Imputation to WGS using the multi-line reference population resulted in relatively poor imputation, especially when imputing from 80 K (DL-line). In spite of the poor imputation accuracies, using iWGS instead of a lower density SNP chip increased the number of detected QTL and the estimated proportion of phenotypic variance explained by these QTL, especially when dosage scores were used instead of genotype scores. Thus, iWGS, even with poor imputation accuracy, can be used to identify possible interesting regions for fine mapping.


Assuntos
Estudo de Associação Genômica Ampla/métodos , Suínos/genética , Sequenciamento Completo do Genoma/métodos , Animais , Estudo de Associação Genômica Ampla/normas , Estudo de Associação Genômica Ampla/veterinária , Polimorfismo de Nucleotídeo Único , Locos de Características Quantitativas , Sequenciamento Completo do Genoma/normas , Sequenciamento Completo do Genoma/veterinária
14.
J Anim Breed Genet ; 136(6): 418-429, 2019 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-31215703

RESUMO

Significance testing for genome-wide association study (GWAS) with increasing SNP density up to whole-genome sequence data (WGS) is not straightforward, because of strong LD between SNP and population stratification. Therefore, the objective of this study was to investigate genomic control and different significance testing procedures using data from a commercial pig breeding scheme. A GWAS was performed in GCTA with data of 4,964 Large White pigs using medium density, high density or imputed whole-genome sequence data, fitting a genomic relationship matrix based on a leave-one-chromosome-out approach to account for population structure. Subsequently, genomic inflation factors were assessed on whole-genome level and the chromosome level. To establish a significance threshold, permutation testing, Bonferroni corrections using either the total number of SNPs or the number of independent chromosome fragments, and false discovery rates (FDR) using either the Benjamini-Hochberg procedure or the Benjamini and Yekutieli procedure were evaluated. We found that genomic inflation factors did not differ between different density genotypes but do differ between chromosomes. Also, the leave-one-chromosome-out approach for GWAS or using the pedigree relationships did not account appropriately for population stratification and gave strong genomic inflation. Regarding different procedures for significance testing, when the aim is to find QTL regions that are associated with a trait of interest, we recommend applying the FDR following the Benjamini and Yekutieli approach to establish a significance threshold that is adjusted for multiple testing. When the aim is to pinpoint a specific mutation, the more conservative Bonferroni correction based on the total number of SNPs is more appropriate, till an appropriate method is established to adjust for the number of independent tests.


Assuntos
Estudo de Associação Genômica Ampla , Genômica , Genótipo , Sequenciamento Completo do Genoma , Animais , Cruzamento , Polimorfismo de Nucleotídeo Único , Locos de Características Quantitativas/genética , Suínos/genética
15.
New Phytol ; 213(3): 1346-1362, 2017 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-27699793

RESUMO

Plants are exposed to combinations of various biotic and abiotic stresses, but stress responses are usually investigated for single stresses only. Here, we investigated the genetic architecture underlying plant responses to 11 single stresses and several of their combinations by phenotyping 350 Arabidopsis thaliana accessions. A set of 214 000 single nucleotide polymorphisms (SNPs) was screened for marker-trait associations in genome-wide association (GWA) analyses using tailored multi-trait mixed models. Stress responses that share phytohormonal signaling pathways also share genetic architecture underlying these responses. After removing the effects of general robustness, for the 30 most significant SNPs, average quantitative trait locus (QTL) effect sizes were larger for dual stresses than for single stresses. Plants appear to deploy broad-spectrum defensive mechanisms influencing multiple traits in response to combined stresses. Association analyses identified QTLs with contrasting and with similar responses to biotic vs abiotic stresses, and below-ground vs above-ground stresses. Our approach allowed for an unprecedented comprehensive genetic analysis of how plants deal with a wide spectrum of stress conditions.


Assuntos
Arabidopsis/genética , Arabidopsis/fisiologia , Mapeamento Cromossômico , Estudo de Associação Genômica Ampla , Estresse Fisiológico/genética , DNA Bacteriano/genética , Genes de Plantas , Estudos de Associação Genética , Padrões de Herança/genética , Modelos Genéticos , Mutação/genética , Fenótipo , Reguladores de Crescimento de Plantas/metabolismo , Locos de Características Quantitativas/genética , Reprodutibilidade dos Testes
16.
Theor Appl Genet ; 130(2): 433-444, 2017 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-27921120

RESUMO

KEY MESSAGE: Probabilistic graphical models show great potential for robust and reliable construction of linkage maps. We show how to use probabilistic graphical models to construct high-quality linkage maps in the face of data perturbations caused by genotyping errors and reciprocal translocations. It has been shown that linkage map construction can be hampered by the presence of genotyping errors and chromosomal rearrangements such as inversions and translocations. Here, we report a novel method for linkage map construction using probabilistic graphical models. The method is proven, both theoretically and practically, to be effective in filtering out markers that contain genotyping errors. In particular, it carries out marker filtering and ordering simultaneously, and is therefore superior to the standard post hoc filtering using nearest-neighbour stress. Furthermore, we demonstrate empirically that the proposed method offers a promising solution to linkage map construction in the case of a reciprocal translocation.


Assuntos
Mapeamento Cromossômico , Ligação Genética , Modelos Genéticos , Modelos Estatísticos , Algoritmos , Alelos , Inversão Cromossômica , Cromossomos de Plantas , Cucumis sativus/genética , Técnicas de Genotipagem , Haplótipos , Hordeum/genética , Fenótipo , Translocação Genética
17.
Theor Appl Genet ; 130(1): 123-135, 2017 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-27699464

RESUMO

KEY MESSAGE: The number of SNPs required for QTL discovery is justified by the distance at which linkage disequilibrium has decayed. Simulations and real potato SNP data showed how to estimate and interpret LD decay. The magnitude of linkage disequilibrium (LD) and its decay with genetic distance determine the resolution of association mapping, and are useful for assessing the desired numbers of SNPs on arrays. To study LD and LD decay in tetraploid potato, we simulated autotetraploid genotypes and used it to explore the dependence on: (1) the number of haplotypes in the population (the amount of genetic variation) and (2) the percentage of haplotype specific SNPs (hs-SNPs). Several estimators for short-range LD were explored, such as the average r 2, median r 2, and other percentiles of r 2 (80, 90, and 95 %). For LD decay, we looked at LD½,90, the distance at which the short-range LD is halved when using the 90 % percentile of r 2 at short range, as estimator for LD. Simulations showed that the performance of various estimators for LD decay strongly depended on the number of haplotypes, although the real value of LD decay was not influenced very much by this number. The estimator LD½,90 was chosen to evaluate LD decay in 537 tetraploid varieties. LD½,90 values were 1.5 Mb for varieties released before 1945 and 0.6 Mb in varieties released after 2005. LD½,90 values within three different subpopulations ranged from 0.7 to 0.9 Mb. LD½,90 was 2.5 Mb for introgressed regions, indicating large haplotype blocks. In pericentromeric heterochromatin, LD decay was negligible. This study demonstrates that several related factors influencing LD decay could be disentangled, that no universal approach can be suggested, and that the estimation of LD decay has to be performed with great care and knowledge of the sampled material.


Assuntos
Desequilíbrio de Ligação , Polimorfismo de Nucleotídeo Único , Solanum tuberosum/genética , Tetraploidia , Frequência do Gene , Genética Populacional , Genótipo , Haplótipos , Modelos Genéticos
18.
Theor Appl Genet ; 130(7): 1375-1392, 2017 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-28374049

RESUMO

KEY MESSAGE: A flexible and user-friendly spatial method called SpATS performed comparably to more elaborate and trial-specific spatial models in a series of sorghum breeding trials. Adjustment for spatial trends in plant breeding field trials is essential for efficient evaluation and selection of genotypes. Current mixed model methods of spatial analysis are based on a multi-step modelling process where global and local trends are fitted after trying several candidate spatial models. This paper reports the application of a novel spatial method that accounts for all types of continuous field variation in a single modelling step by fitting a smooth surface. The method uses two-dimensional P-splines with anisotropic smoothing formulated in the mixed model framework, referred to as SpATS model. We applied this methodology to a series of large and partially replicated sorghum breeding trials. The new model was assessed in comparison with the more elaborate standard spatial models that use autoregressive correlation of residuals. The improvements in precision and the predictions of genotypic values produced by the SpATS model were equivalent to those obtained using the best fitting standard spatial models for each trial. One advantage of the approach with SpATS is that all patterns of spatial trend and genetic effects were modelled simultaneously by fitting a single model. Furthermore, we used a flexible model to adequately adjust for field trends. This strategy reduces potential parameter identification problems and simplifies the model selection process. Therefore, the new method should be considered as an efficient and easy-to-use alternative for routine analyses of plant breeding trials.


Assuntos
Modelos Genéticos , Melhoramento Vegetal/métodos , Sorghum/genética , Algoritmos , Genótipo , Análise Espacial
19.
BMC Med Res Methodol ; 16(1): 139, 2016 10 13.
Artigo em Inglês | MEDLINE | ID: mdl-27737637

RESUMO

BACKGROUND: Measurement error in self-reported dietary intakes is known to bias the association between dietary intake and a health outcome of interest such as risk of a disease. The association can be distorted further by mismeasured confounders, leading to invalid results and conclusions. It is, however, difficult to adjust for the bias in the association when there is no internal validation data. METHODS: We proposed a method to adjust for the bias in the diet-disease association (hereafter, association), due to measurement error in dietary intake and a mismeasured confounder, when there is no internal validation data. The method combines prior information on the validity of the self-report instrument with the observed data to adjust for the bias in the association. We compared the proposed method with the method that ignores the confounder effect, and with the method that ignores measurement errors completely. We assessed the sensitivity of the estimates to various magnitudes of measurement error, error correlations and uncertainty in the literature-reported validation data. We applied the methods to fruits and vegetables (FV) intakes, cigarette smoking (confounder) and all-cause mortality data from the European Prospective Investigation into Cancer and Nutrition study. RESULTS: Using the proposed method resulted in about four times increase in the strength of association between FV intake and mortality. For weakly correlated errors, measurement error in the confounder minimally affected the hazard ratio estimate for FV intake. The effect was more pronounced for strong error correlations. CONCLUSIONS: The proposed method permits sensitivity analysis on measurement error structures and accounts for uncertainties in the reported validity coefficients. The method is useful in assessing the direction and quantifying the magnitude of bias in the association due to measurement errors in the confounders.


Assuntos
Neoplasias/epidemiologia , Viés , Dieta/efeitos adversos , Humanos , Estudos Multicêntricos como Assunto , Análise Multivariada , Neoplasias/etiologia , Modelos de Riscos Proporcionais , Estudos Prospectivos , Medição de Risco , Autorrelato , Sensibilidade e Especificidade , Fumar/efeitos adversos , Estudos de Validação como Assunto
20.
Biom J ; 58(4): 766-82, 2016 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-27003183

RESUMO

Dietary questionnaires are prone to measurement error, which bias the perceived association between dietary intake and risk of disease. Short-term measurements are required to adjust for the bias in the association. For foods that are not consumed daily, the short-term measurements are often characterized by excess zeroes. Via a simulation study, the performance of a two-part calibration model that was developed for a single-replicate study design was assessed by mimicking leafy vegetable intake reports from the multicenter European Prospective Investigation into Cancer and Nutrition (EPIC) study. In part I of the fitted two-part calibration model, a logistic distribution was assumed; in part II, a gamma distribution was assumed. The model was assessed with respect to the magnitude of the correlation between the consumption probability and the consumed amount (hereafter, cross-part correlation), the number and form of covariates in the calibration model, the percentage of zero response values, and the magnitude of the measurement error in the dietary intake. From the simulation study results, transforming the dietary variable in the regression calibration to an appropriate scale was found to be the most important factor for the model performance. Reducing the number of covariates in the model could be beneficial, but was not critical in large-sample studies. The performance was remarkably robust when fitting a one-part rather than a two-part model. The model performance was minimally affected by the cross-part correlation.


Assuntos
Exposição Dietética , Modelos de Riscos Proporcionais , Calibragem/normas , Simulação por Computador , Humanos , Análise de Regressão , Reprodutibilidade dos Testes , Autorrelato , Inquéritos e Questionários
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA