Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 62
Filtrar
Más filtros

Bases de datos
Tipo del documento
Intervalo de año de publicación
1.
BMC Bioinformatics ; 25(1): 202, 2024 May 30.
Artículo en Inglés | MEDLINE | ID: mdl-38816801

RESUMEN

INTODUCTION: In systems biology, an organism is viewed as a system of interconnected molecular entities. To understand the functioning of organisms it is essential to integrate information about the variations in the concentrations of those molecular entities. This information can be structured as a set of networks with interconnections and with some hierarchical relations between them. Few methods exist for the reconstruction of integrative networks. OBJECTIVE: In this work, we propose an integrative network reconstruction method in which the network organization for a particular type of omics data is guided by the network structure of a related type of omics data upstream in the omic cascade. The structure of these guiding data can be either already known or be estimated from the guiding data themselves. METHODS: The method consists of three steps. First a network structure for the guiding data should be provided. Next, responses in the target set are regressed on the full set of predictors in the guiding data with a Lasso penalty to reduce the number of predictors and an L2 penalty on the differences between coefficients for predictors that share edges in the network for the guiding data. Finally, a network is reconstructed on the fitted target responses as functions of the predictors in the guiding data. This way we condition the target network on the network of the guiding data. CONCLUSIONS: We illustrate our approach on two examples in Arabidopsis. The method detects groups of metabolites that have a similar genetic or transcriptomic basis.


Asunto(s)
Arabidopsis , Arabidopsis/genética , Arabidopsis/metabolismo , Biología de Sistemas/métodos , Redes Reguladoras de Genes , Algoritmos , Biología Computacional/métodos , Multiómica
2.
Heredity (Edinb) ; 133(3): 198-205, 2024 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-38982296

RESUMEN

Chromosome substitution lines (CSLs) are tentatively supreme resources to investigate non-allelic genetic interactions. However, the difficulty of generating such lines in most species largely yielded imperfect CSL panels, prohibiting a systematic dissection of epistasis. Here, we present the development and use of a unique and complete panel of CSLs in Arabidopsis thaliana, allowing the full factorial analysis of epistatic interactions. A first comparison of reciprocal single chromosome substitutions revealed a dependency of QTL detection on different genetic backgrounds. The subsequent analysis of the complete panel of CSLs enabled the mapping of the genetic interactors and identified multiple two- and three-way interactions for different traits. Some of the detected epistatic effects were as large as any observed main effect, illustrating the impact of epistasis on quantitative trait variation. We, therefore, have demonstrated the high power of detection and mapping of genome-wide epistasis, confirming the assumed supremacy of comprehensive CSL sets.


Asunto(s)
Arabidopsis , Mapeo Cromosómico , Cromosomas de las Plantas , Epistasis Genética , Sitios de Carácter Cuantitativo , Arabidopsis/genética , Mapeo Cromosómico/métodos , Cromosomas de las Plantas/genética , Genoma de Planta , Fenotipo
3.
Bioinformatics ; 38(22): 5134-5136, 2022 11 15.
Artículo en Inglés | MEDLINE | ID: mdl-36193999

RESUMEN

MOTIVATION: Multi-parent populations (MPPs) are popular for QTL mapping because they combine wide genetic diversity in parents with easy control of population structure, but a limited number of software tools for QTL mapping are specifically developed for general MPP designs. RESULTS: We developed an R package called statgenMPP, adopting a unified identity-by-descent (IBD)-based mixed model approach for QTL analysis in MPPs. The package offers easy-to-use functionalities of IBD calculations, mixed model solutions and visualizations for QTL mapping in a wide range of MPP designs, including diallele, nested-association mapping populations, multi-parent advanced genetic inter-cross populations and other complicated MPPs with known crossing schemes. AVAILABILITY AND IMPLEMENTATION: The R package statgenMPP is open-source and freely available on CRAN at https://CRAN.R-project.org/package=statgenMPP. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Programas Informáticos , Mapeo Cromosómico
4.
Theor Appl Genet ; 135(6): 2059-2082, 2022 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-35524815

RESUMEN

KEY MESSAGE: We evaluate self-organizing maps (SOM) to identify adaptation zones and visualize multi-environment genotypic responses. We apply SOM to multiple traits and crop growth model output of large-scale European sunflower data. Genotype-by-environment interactions (G × E) complicate the selection of well-adapted varieties. A possible solution is to group trial locations into adaptation zones with G × E occurring mainly between zones. By selecting for good performance inside those zones, response to selection is increased. In this paper, we present a two-step procedure to identify adaptation zones that starts from a self-organizing map (SOM). In the SOM, trials across locations and years are assigned to groups, called units, that are organized on a two-dimensional grid. Units that are further apart contain more distinct trials. In an iterative process of reweighting trial contributions to units, the grid configuration is learnt simultaneously with the trial assignment to units. An aggregation of the units in the SOM by hierarchical clustering then produces environment types, i.e. trials with similar growing conditions. Adaptation zones can subsequently be identified by grouping trial locations with similar distributions of environment types across years. For the construction of SOMs, multiple data types can be combined. We compared environment types and adaptation zones obtained for European sunflower from quantitative traits like yield, oil content, phenology and disease scores with those obtained from environmental indices calculated with the crop growth model Sunflo. We also show how results are affected by input data organization and user-defined weights for genotypes and traits. Adaptation zones for European sunflower as identified by our SOM-based strategy captured substantial genotype-by-location interaction and pointed to trials in Spain, Turkey and South Bulgaria as inducing different genotypic responses.


Asunto(s)
Helianthus , Adaptación Fisiológica , Algoritmos , Análisis por Conglomerados , Genotipo , Helianthus/genética
5.
Mol Breed ; 42(12): 76, 2022 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-37313326

RESUMEN

Genome-wide association studies (GWAS) are a useful tool to unravel the genetic architecture of complex traits, but the results can be difficult to interpret. Population structure, genetic heterogeneity, and rare alleles easily result in false positive or false negative associations. This paper describes the analysis of a GWAS panel combined with three bi-parental mapping populations to validate GWAS results, using phenotypic data for steroidal glycoalkaloid (SGA) accumulation and the ratio (SGR) between the two major glycoalkaloids α-solanine and α-chaconine in potato tubers. SGAs are secondary metabolites in the Solanaceae family, functional as a defence against various pests and pathogens and in high quantities toxic for humans. With GWAS, we identified five quantitative trait loci (QTL) of which Sga1.1, Sgr8.1, and Sga11.1 were validated, but not Sga3.1 and Sgr7.1. In the bi-parental populations, Sga5.1 and Sga7.1 were mapped, but these were not identified with GWAS. The QTLs Sga1.1, Sga7.1, Sgr7.1, and Sgr8.1 co-localize with genes GAME9, GAME 6/GAME 11, SGT1, and SGT2, respectively. For other genes involved in SGA synthesis, no QTLs were identified. The results of this study illustrate a number of pitfalls in GWAS of which population structure seems the most important. We also show that introgression breeding for disease resistance has introduced new haplotypes to the gene pool involved in higher SGA levels in certain pedigrees. Finally, we show that high SGA levels remain unpredictable in potato but that α-solanine/α-chaconine ratio has a predictable outcome with specific SGT1 and SGT2 haplotypes. Supplementary Information: The online version contains supplementary material available at 10.1007/s11032-022-01344-2.

6.
Theor Appl Genet ; 134(11): 3643-3660, 2021 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-34342658

RESUMEN

KEY MESSAGE: The identity-by-descent (IBD)-based mixed model approach introduced in this study can detect quantitative trait loci (QTLs) referring to the parental origin and simultaneously account for multilevel relatedness of individuals within and across families. This unified approach is proved to be a powerful approach for all kinds of multiparental population (MPP) designs. Multiparental populations (MPPs) have become popular for quantitative trait loci (QTL) detection. Tools for QTL mapping in MPPs are mostly developed for specific MPPs and do not generalize well to other MPPs. We present an IBD-based mixed model approach for QTL mapping in all kinds of MPP designs, e.g., diallel, Nested Association Mapping (NAM), and Multiparental Advanced Generation Intercross (MAGIC) designs. The first step is to compute identity-by-descent (IBD) probabilities using a general Hidden Markov model framework, called reconstructing ancestry blocks bit by bit (RABBIT). Next, functions of IBD information are used as design matrices, or genetic predictors, in a mixed model approach to estimate variance components for multiallelic genetic effects associated with parents. Family-specific residual genetic effects are added, and a polygenic effect is structured by kinship relations between individuals. Case studies of simulated diallel, NAM, and MAGIC designs proved that the advanced IBD-based multi-QTL mixed model approach incorporating both kinship relations and family-specific residual variances (IBD.MQMkin_F) is robust across a variety of MPP designs and allele segregation patterns in comparison to a widely used benchmark association mapping method, and in most cases, outperformed or behaved at least as well as other tools developed for specific MPP designs in terms of mapping power and resolution. Successful analyses of real data cases confirmed the wide applicability of our IBD-based mixed model methodology.


Asunto(s)
Mapeo Cromosómico , Modelos Genéticos , Sitios de Carácter Cuantitativo , Alelos , Simulación por Computador , Modelos Lineales , Cadenas de Markov , Plantas/genética
7.
Theor Appl Genet ; 134(3): 897-908, 2021 Mar.
Artículo en Inglés | MEDLINE | ID: mdl-33367942

RESUMEN

Much has been published on QTL detection for complex traits using bi-parental and multi-parental crosses (linkage analysis) or diversity panels (GWAS studies). While successful for detection, transferability of results to real applications has proven more difficult. Here, we combined a QTL detection approach using a pre-breeding populations which utilized intensive phenotypic selection for the target trait across multiple plant generations, combined with rapid generation turnover (i.e. "speed breeding") to allow cycling of multiple plant generations each year. The reasoning is that QTL mapping information would complement the selection process by identifying the genome regions under selection within the relevant germplasm. Questions to answer were the location of the genomic regions determining response to selection and the origin of the favourable alleles within the pedigree. We used data from a pre-breeding program that aimed at pyramiding different resistance sources to Fusarium crown rot into elite (but susceptible) wheat backgrounds. The population resulted from a complex backcrossing scheme involving multiple resistance donors and multiple elite backgrounds, akin to a MAGIC population (985 genotypes in total, with founders, and two major offspring layers within the pedigree). A significant increase in the resistance level was observed (i.e. a positive response to selection) after the selection process, and 17 regions significantly associated with that response were identified using a GWAS approach. Those regions included known QTL as well as potentially novel regions contributing resistance to Fusarium crown rot. In addition, we were able to trace back the sources of the favourable alleles for each QTL. We demonstrate that QTL detection using breeding populations under selection for the target trait can identify QTL controlling the target trait and that the frequency of the favourable alleles was increased as a response to selection, thereby validating the QTL detected. This is a valuable opportunistic approach that can provide QTL information that is more easily transferred to breeding applications.


Asunto(s)
Resistencia a la Enfermedad/genética , Fusarium/fisiología , Marcadores Genéticos , Fitomejoramiento , Enfermedades de las Plantas/genética , Sitios de Carácter Cuantitativo , Triticum/genética , Alelos , Mapeo Cromosómico/métodos , Cromosomas de las Plantas/genética , Resistencia a la Enfermedad/inmunología , Ligamiento Genético , Enfermedades de las Plantas/microbiología , Triticum/inmunología , Triticum/microbiología
8.
Plant J ; 99(6): 1172-1191, 2019 09.
Artículo en Inglés | MEDLINE | ID: mdl-31108005

RESUMEN

Broadening the genetic base of crops is crucial for developing varieties to respond to global agricultural challenges such as climate change. Here, we analysed a diverse panel of 371 domesticated lines of the model crop barley to explore the genetics of crop adaptation. We first collected exome sequence data and phenotypes of key life history traits from contrasting multi-environment common garden trials. Then we applied refined statistical methods, including some based on exomic haplotype states, for genotype-by-environment (G×E) modelling. Sub-populations defined from exomic profiles were coincident with barley's biology, geography and history, and explained a high proportion of trial phenotypic variance. Clear G×E interactions indicated adaptation profiles that varied for landraces and cultivars. Exploration of circadian clock-related genes, associated with the environmentally adaptive days to heading trait (crucial for the crop's spread from the Fertile Crescent), illustrated complexities in G×E effect directions, and the importance of latitudinally based genic context in the expression of large-effect alleles. Our analysis supports a gene-level scientific understanding of crop adaption and leads to practical opportunities for crop improvement, allowing the prioritisation of genomic regions and particular sets of lines for breeding efforts seeking to cope with climate change and other stresses.


Asunto(s)
Aclimatación/genética , Productos Agrícolas/genética , Exoma , Hordeum/genética , Relojes Circadianos/genética , Variación Genética , Estudio de Asociación del Genoma Completo , Genotipo , Geografía , Haplotipos , Desequilibrio de Ligamiento , Fenotipo , Fitomejoramiento , Polimorfismo de Nucleótido Simple , Sitios de Carácter Cuantitativo , Secuenciación del Exoma
9.
J Nutr ; 150(3): 634-643, 2020 03 01.
Artículo en Inglés | MEDLINE | ID: mdl-31858107

RESUMEN

BACKGROUND: In nutritional epidemiology, dealing with confounding and complex internutrient relations are major challenges. An often-used approach is dietary pattern analyses, such as principal component analysis, to deal with internutrient correlations, and to more closely resemble the true way nutrients are consumed. However, despite these improvements, these approaches still require subjective decisions in the preselection of food groups. Moreover, they do not make efficient use of multivariate dietary data, because they detect only marginal associations. We propose the use of copula graphical models (CGMs) to model and make statistical inferences regarding complex associations among variables in multivariate data, where associations between all variables can be learned simultaneously. OBJECTIVE: We aimed to reconstruct nutritional intake and physical functioning networks in Dutch older adults by applying a CGM. METHODS: We addressed this issue by uncovering the pairwise associations between variables while correcting for the effect of remaining variables. More specifically, we used a CGM to infer the precision matrix, which contains all the conditional independence relations between nodes in the graph. The nonzero elements of the precision matrix indicate the presence of a direct association. We applied this method to reconstruct nutrient-physical functioning networks from the combined data of 4 studies (Nu-Age, ProMuscle, ProMO, and V-Fit, total n = 662, mean ± SD age = 75 ± 7 y). The method was implemented in the R package nutriNetwork which is freely available at https://cran.r-project.org/web/packages/nutriNetwork. RESULTS: Greater intakes of vegetable protein and vitamin B-6 were partially correlated with higher scores on the total Short Physical Performance Battery (SPPB) and the chair rise test. Greater intakes of vitamin B-12 and folate were partially correlated with higher scores on the chair rise test and the total SPPB, respectively. CONCLUSIONS: We determined that vegetable protein, vitamin B-6, folate, and vitamin B-12 intakes are partially correlated with improved functional outcome measurements in Dutch older adults.


Asunto(s)
Ácido Fólico/administración & dosificación , Modelos Teóricos , Rendimiento Físico Funcional , Proteínas de Vegetales Comestibles/administración & dosificación , Vitamina B 12/administración & dosificación , Vitamina B 6/administración & dosificación , Anciano , Anciano de 80 o más Años , Índice de Masa Corporal , Anciano Frágil , Humanos , Países Bajos
10.
Theor Appl Genet ; 133(3): 1009-1018, 2020 Mar.
Artículo en Inglés | MEDLINE | ID: mdl-31907563

RESUMEN

KEY MESSAGE: Multi-environment models using marker-based kinship information for both additive and dominance effects can accurately predict hybrid performance in different environments. Sorghum is an important hybrid crop that is grown extensively in many subtropical and tropical regions including Northern NSW and Queensland in Australia. The highly varying weather patterns in the Australian summer months mean that sorghum hybrids exhibit a great deal of variation in yield between locations. To ultimately enable prediction of the outcome of crossing parental lines, both additive effects on yield performance and dominance interaction effects need to be characterised. This paper demonstrates that fitting a linear mixed model that includes both types of effects calculated using genetic markers in relationship matrices improves predictions. Genotype by environment interactions was investigated by comparing FA1 (single-factor analytic) and FA2 (two-factor analytic) structures. The G×E causes a change in hybrid rankings between trials with a difference of up to 25% of the hybrids in the top 10% of each trial. The prediction accuracies increased with the addition of the dominance term (over and above that achieved with an additive effect alone) by an average of 15% and a maximum of 60%. The percentage of dominance of the total genetic variance varied between trials with the trials with higher broad-sense heritability having the greater percentage of dominance. The inclusion of dominance in the factor analytic models improves the accuracy of the additive effects. Breeders selecting high yielding parents for crossing need to be aware of effects due to environment and dominance.


Asunto(s)
Fitomejoramiento , Sorghum/genética , Australia , Clima , Epistasis Genética , Genes Dominantes , Estudios de Asociación Genética , Marcadores Genéticos , Variación Genética , Genómica , Genotipo , Modelos Genéticos , Linaje , Fenotipo , Polimorfismo de Nucleótido Simple , Selección Genética , Sorghum/crecimiento & desarrollo
11.
Theor Appl Genet ; 132(7): 2055-2067, 2019 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-30968160

RESUMEN

KEY MESSAGE: The use of a kinship matrix integrating pedigree- and marker-based relationships optimized the performance of genomic prediction in sorghum, especially for traits of lower heritability. Selection based on genome-wide markers has become an active breeding strategy in crops. Genomic prediction models can make use of pedigree information to account for the residual polygenic effects not captured by markers. Our aim was to evaluate the impact of using pedigree and genomic information on prediction quality of breeding values for different traits in sorghum. We explored BLUP models that use weighted combinations of pedigree and genomic relationship matrices. The optimal weighting factor was empirically determined in order to maximize predictive ability after evaluating a range of candidate weights. The phenotypic data consisted of testcross evaluations of sorghum parental lines across multiple environments. All lines were genotyped, and full pedigree information was available. The performance of the best predictive combined matrix was compared to that of models fitting the component matrices independently. Model performance was assessed using cross-validation technique. Fitting a combined pedigree-genomic matrix with the optimal weight always yielded the largest increases in predictive ability and the largest reductions in prediction bias relative to the simple G-BLUP. However, the weight that optimized prediction varied across traits. The benefits of including pedigree information in the genomic model were more relevant for traits with lower heritability, such as grain yield and stay-green. Our results suggest that the combination of pedigree and genomic relatedness can be used to optimize predictions of complex traits in crops when the additive variation is not fully explained by markers.


Asunto(s)
Genómica/métodos , Modelos Genéticos , Linaje , Fitomejoramiento , Sorghum/genética , Genotipo , Fenotipo
12.
Genet Sel Evol ; 51(1): 2, 2019 Jan 24.
Artículo en Inglés | MEDLINE | ID: mdl-30678638

RESUMEN

BACKGROUND: Use of whole-genome sequence data (WGS) is expected to improve identification of quantitative trait loci (QTL). However, this requires imputation to WGS, often with a limited number of sequenced animals for the target population. The objective of this study was to investigate imputation to WGS in two pig lines using a multi-line reference population and, subsequently, to investigate the effect of using these imputed WGS (iWGS) for GWAS. METHODS: Phenotypes and genotypes were available on 12,184 Large White pigs (LW-line) and 4943 Dutch Landrace pigs (DL-line). Imputed 660 K and 80 K genotypes for the LW-line and DL-line, respectively, were imputed to iWGS using Beagle v.4.1. Since only 32 LW-line and 12 DL-line boars were sequenced, 142 animals from eight commercial lines were added. GWAS were performed for each line using the 80 K and 660 K SNPs, the genotype scores of iWGS SNPs that had an imputation accuracy (Beagle R2) higher than 0.6, and the dosage scores of all iWGS SNPs. RESULTS: For the DL-line (LW-line), imputation of 80 K genotypes to iWGS resulted in an average Beagle R2 of 0.39 (0.49). After quality control, 2.5 × 106 (3.5 × 106) SNPs had a Beagle R2 higher than 0.6, resulting in an average Beagle R2 of 0.83 (0.93). Compared to the 80 K and 660 K genotypes, using iWGS led to the identification of 48.9 and 64.4% more QTL regions, for the DL-line and LW-line, respectively, and the most significant SNPs in the QTL regions explained a higher proportion of phenotypic variance. Using dosage instead of genotype scores improved the identification of QTL, because the model accounted for uncertainty of imputation, and all SNPs were used in the analysis. CONCLUSIONS: Imputation to WGS using the multi-line reference population resulted in relatively poor imputation, especially when imputing from 80 K (DL-line). In spite of the poor imputation accuracies, using iWGS instead of a lower density SNP chip increased the number of detected QTL and the estimated proportion of phenotypic variance explained by these QTL, especially when dosage scores were used instead of genotype scores. Thus, iWGS, even with poor imputation accuracy, can be used to identify possible interesting regions for fine mapping.


Asunto(s)
Estudio de Asociación del Genoma Completo/métodos , Porcinos/genética , Secuenciación Completa del Genoma/métodos , Animales , Estudio de Asociación del Genoma Completo/normas , Estudio de Asociación del Genoma Completo/veterinaria , Polimorfismo de Nucleótido Simple , Sitios de Carácter Cuantitativo , Secuenciación Completa del Genoma/normas , Secuenciación Completa del Genoma/veterinaria
13.
J Anim Breed Genet ; 136(6): 418-429, 2019 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-31215703

RESUMEN

Significance testing for genome-wide association study (GWAS) with increasing SNP density up to whole-genome sequence data (WGS) is not straightforward, because of strong LD between SNP and population stratification. Therefore, the objective of this study was to investigate genomic control and different significance testing procedures using data from a commercial pig breeding scheme. A GWAS was performed in GCTA with data of 4,964 Large White pigs using medium density, high density or imputed whole-genome sequence data, fitting a genomic relationship matrix based on a leave-one-chromosome-out approach to account for population structure. Subsequently, genomic inflation factors were assessed on whole-genome level and the chromosome level. To establish a significance threshold, permutation testing, Bonferroni corrections using either the total number of SNPs or the number of independent chromosome fragments, and false discovery rates (FDR) using either the Benjamini-Hochberg procedure or the Benjamini and Yekutieli procedure were evaluated. We found that genomic inflation factors did not differ between different density genotypes but do differ between chromosomes. Also, the leave-one-chromosome-out approach for GWAS or using the pedigree relationships did not account appropriately for population stratification and gave strong genomic inflation. Regarding different procedures for significance testing, when the aim is to find QTL regions that are associated with a trait of interest, we recommend applying the FDR following the Benjamini and Yekutieli approach to establish a significance threshold that is adjusted for multiple testing. When the aim is to pinpoint a specific mutation, the more conservative Bonferroni correction based on the total number of SNPs is more appropriate, till an appropriate method is established to adjust for the number of independent tests.


Asunto(s)
Estudio de Asociación del Genoma Completo , Genómica , Genotipo , Secuenciación Completa del Genoma , Animales , Cruzamiento , Polimorfismo de Nucleótido Simple , Sitios de Carácter Cuantitativo/genética , Porcinos/genética
14.
New Phytol ; 213(3): 1346-1362, 2017 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-27699793

RESUMEN

Plants are exposed to combinations of various biotic and abiotic stresses, but stress responses are usually investigated for single stresses only. Here, we investigated the genetic architecture underlying plant responses to 11 single stresses and several of their combinations by phenotyping 350 Arabidopsis thaliana accessions. A set of 214 000 single nucleotide polymorphisms (SNPs) was screened for marker-trait associations in genome-wide association (GWA) analyses using tailored multi-trait mixed models. Stress responses that share phytohormonal signaling pathways also share genetic architecture underlying these responses. After removing the effects of general robustness, for the 30 most significant SNPs, average quantitative trait locus (QTL) effect sizes were larger for dual stresses than for single stresses. Plants appear to deploy broad-spectrum defensive mechanisms influencing multiple traits in response to combined stresses. Association analyses identified QTLs with contrasting and with similar responses to biotic vs abiotic stresses, and below-ground vs above-ground stresses. Our approach allowed for an unprecedented comprehensive genetic analysis of how plants deal with a wide spectrum of stress conditions.


Asunto(s)
Arabidopsis/genética , Arabidopsis/fisiología , Mapeo Cromosómico , Estudio de Asociación del Genoma Completo , Estrés Fisiológico/genética , ADN Bacteriano/genética , Genes de Plantas , Estudios de Asociación Genética , Patrón de Herencia/genética , Modelos Genéticos , Mutación/genética , Fenotipo , Reguladores del Crecimiento de las Plantas/metabolismo , Sitios de Carácter Cuantitativo/genética , Reproducibilidad de los Resultados
15.
Theor Appl Genet ; 130(2): 433-444, 2017 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-27921120

RESUMEN

KEY MESSAGE: Probabilistic graphical models show great potential for robust and reliable construction of linkage maps. We show how to use probabilistic graphical models to construct high-quality linkage maps in the face of data perturbations caused by genotyping errors and reciprocal translocations. It has been shown that linkage map construction can be hampered by the presence of genotyping errors and chromosomal rearrangements such as inversions and translocations. Here, we report a novel method for linkage map construction using probabilistic graphical models. The method is proven, both theoretically and practically, to be effective in filtering out markers that contain genotyping errors. In particular, it carries out marker filtering and ordering simultaneously, and is therefore superior to the standard post hoc filtering using nearest-neighbour stress. Furthermore, we demonstrate empirically that the proposed method offers a promising solution to linkage map construction in the case of a reciprocal translocation.


Asunto(s)
Mapeo Cromosómico , Ligamiento Genético , Modelos Genéticos , Modelos Estadísticos , Algoritmos , Alelos , Inversión Cromosómica , Cromosomas de las Plantas , Cucumis sativus/genética , Técnicas de Genotipaje , Haplotipos , Hordeum/genética , Fenotipo , Translocación Genética
16.
Theor Appl Genet ; 130(1): 123-135, 2017 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-27699464

RESUMEN

KEY MESSAGE: The number of SNPs required for QTL discovery is justified by the distance at which linkage disequilibrium has decayed. Simulations and real potato SNP data showed how to estimate and interpret LD decay. The magnitude of linkage disequilibrium (LD) and its decay with genetic distance determine the resolution of association mapping, and are useful for assessing the desired numbers of SNPs on arrays. To study LD and LD decay in tetraploid potato, we simulated autotetraploid genotypes and used it to explore the dependence on: (1) the number of haplotypes in the population (the amount of genetic variation) and (2) the percentage of haplotype specific SNPs (hs-SNPs). Several estimators for short-range LD were explored, such as the average r 2, median r 2, and other percentiles of r 2 (80, 90, and 95 %). For LD decay, we looked at LD½,90, the distance at which the short-range LD is halved when using the 90 % percentile of r 2 at short range, as estimator for LD. Simulations showed that the performance of various estimators for LD decay strongly depended on the number of haplotypes, although the real value of LD decay was not influenced very much by this number. The estimator LD½,90 was chosen to evaluate LD decay in 537 tetraploid varieties. LD½,90 values were 1.5 Mb for varieties released before 1945 and 0.6 Mb in varieties released after 2005. LD½,90 values within three different subpopulations ranged from 0.7 to 0.9 Mb. LD½,90 was 2.5 Mb for introgressed regions, indicating large haplotype blocks. In pericentromeric heterochromatin, LD decay was negligible. This study demonstrates that several related factors influencing LD decay could be disentangled, that no universal approach can be suggested, and that the estimation of LD decay has to be performed with great care and knowledge of the sampled material.


Asunto(s)
Desequilibrio de Ligamiento , Polimorfismo de Nucleótido Simple , Solanum tuberosum/genética , Tetraploidía , Frecuencia de los Genes , Genética de Población , Genotipo , Haplotipos , Modelos Genéticos
17.
Theor Appl Genet ; 130(7): 1375-1392, 2017 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-28374049

RESUMEN

KEY MESSAGE: A flexible and user-friendly spatial method called SpATS performed comparably to more elaborate and trial-specific spatial models in a series of sorghum breeding trials. Adjustment for spatial trends in plant breeding field trials is essential for efficient evaluation and selection of genotypes. Current mixed model methods of spatial analysis are based on a multi-step modelling process where global and local trends are fitted after trying several candidate spatial models. This paper reports the application of a novel spatial method that accounts for all types of continuous field variation in a single modelling step by fitting a smooth surface. The method uses two-dimensional P-splines with anisotropic smoothing formulated in the mixed model framework, referred to as SpATS model. We applied this methodology to a series of large and partially replicated sorghum breeding trials. The new model was assessed in comparison with the more elaborate standard spatial models that use autoregressive correlation of residuals. The improvements in precision and the predictions of genotypic values produced by the SpATS model were equivalent to those obtained using the best fitting standard spatial models for each trial. One advantage of the approach with SpATS is that all patterns of spatial trend and genetic effects were modelled simultaneously by fitting a single model. Furthermore, we used a flexible model to adequately adjust for field trends. This strategy reduces potential parameter identification problems and simplifies the model selection process. Therefore, the new method should be considered as an efficient and easy-to-use alternative for routine analyses of plant breeding trials.


Asunto(s)
Modelos Genéticos , Fitomejoramiento/métodos , Sorghum/genética , Algoritmos , Genotipo , Análisis Espacial
18.
BMC Med Res Methodol ; 16(1): 139, 2016 10 13.
Artículo en Inglés | MEDLINE | ID: mdl-27737637

RESUMEN

BACKGROUND: Measurement error in self-reported dietary intakes is known to bias the association between dietary intake and a health outcome of interest such as risk of a disease. The association can be distorted further by mismeasured confounders, leading to invalid results and conclusions. It is, however, difficult to adjust for the bias in the association when there is no internal validation data. METHODS: We proposed a method to adjust for the bias in the diet-disease association (hereafter, association), due to measurement error in dietary intake and a mismeasured confounder, when there is no internal validation data. The method combines prior information on the validity of the self-report instrument with the observed data to adjust for the bias in the association. We compared the proposed method with the method that ignores the confounder effect, and with the method that ignores measurement errors completely. We assessed the sensitivity of the estimates to various magnitudes of measurement error, error correlations and uncertainty in the literature-reported validation data. We applied the methods to fruits and vegetables (FV) intakes, cigarette smoking (confounder) and all-cause mortality data from the European Prospective Investigation into Cancer and Nutrition study. RESULTS: Using the proposed method resulted in about four times increase in the strength of association between FV intake and mortality. For weakly correlated errors, measurement error in the confounder minimally affected the hazard ratio estimate for FV intake. The effect was more pronounced for strong error correlations. CONCLUSIONS: The proposed method permits sensitivity analysis on measurement error structures and accounts for uncertainties in the reported validity coefficients. The method is useful in assessing the direction and quantifying the magnitude of bias in the association due to measurement errors in the confounders.


Asunto(s)
Neoplasias/epidemiología , Sesgo , Dieta/efectos adversos , Humanos , Estudios Multicéntricos como Asunto , Análisis Multivariante , Neoplasias/etiología , Modelos de Riesgos Proporcionales , Estudios Prospectivos , Medición de Riesgo , Autoinforme , Sensibilidad y Especificidad , Fumar/efectos adversos , Estudios de Validación como Asunto
19.
Biom J ; 58(4): 766-82, 2016 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-27003183

RESUMEN

Dietary questionnaires are prone to measurement error, which bias the perceived association between dietary intake and risk of disease. Short-term measurements are required to adjust for the bias in the association. For foods that are not consumed daily, the short-term measurements are often characterized by excess zeroes. Via a simulation study, the performance of a two-part calibration model that was developed for a single-replicate study design was assessed by mimicking leafy vegetable intake reports from the multicenter European Prospective Investigation into Cancer and Nutrition (EPIC) study. In part I of the fitted two-part calibration model, a logistic distribution was assumed; in part II, a gamma distribution was assumed. The model was assessed with respect to the magnitude of the correlation between the consumption probability and the consumed amount (hereafter, cross-part correlation), the number and form of covariates in the calibration model, the percentage of zero response values, and the magnitude of the measurement error in the dietary intake. From the simulation study results, transforming the dietary variable in the regression calibration to an appropriate scale was found to be the most important factor for the model performance. Reducing the number of covariates in the model could be beneficial, but was not critical in large-sample studies. The performance was remarkably robust when fitting a one-part rather than a two-part model. The model performance was minimally affected by the cross-part correlation.


Asunto(s)
Exposición Dietética , Modelos de Riesgos Proporcionales , Calibración/normas , Simulación por Computador , Humanos , Análisis de Regresión , Reproducibilidad de los Resultados , Autoinforme , Encuestas y Cuestionarios
20.
J Exp Bot ; 66(18): 5389-401, 2015 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-26071534

RESUMEN

In the last decade cheaper and faster sequencing methods have resulted in an enormous increase in genomic data. High throughput genotyping, genotyping by sequencing and genomic breeding are becoming a standard in plant breeding. As a result, the collection of phenotypic data is increasingly becoming a limiting factor in plant breeding. Genetic studies on root traits are being hampered by the complexity of these traits and the inaccessibility of the rhizosphere. With an increasing interest in phenotyping, breeders and scientists try to overcome these limitations, resulting in impressive developments in automated phenotyping platforms. Recently, many such platforms have been thoroughly described, yet their efficiency to increase genetic gain often remains undiscussed. This efficiency depends on the heritability of the phenotyped traits as well as the correlation of these traits with agronomically relevant breeding targets. This review provides an overview of the latest developments in root phenotyping and describes the environmental and genetic factors influencing root phenotype and heritability. It also intends to give direction to future phenotyping and breeding strategies for optimizing root system functioning. A quantitative framework to determine the efficiency of phenotyping platforms for genetic gain is described. By increasing heritability, managing effects caused by interactions between genotype and environment and by quantifying the genetic relation between traits phenotyped in platforms and ultimate breeding targets, phenotyping platforms can be utilized to their maximum potential.


Asunto(s)
Productos Agrícolas/genética , Interacción Gen-Ambiente , Genómica/métodos , Fitomejoramiento , Raíces de Plantas/genética
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA