Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 111
Filtrar
1.
BMC Bioinformatics ; 25(1): 202, 2024 May 30.
Artículo en Inglés | MEDLINE | ID: mdl-38816801

RESUMEN

INTODUCTION: In systems biology, an organism is viewed as a system of interconnected molecular entities. To understand the functioning of organisms it is essential to integrate information about the variations in the concentrations of those molecular entities. This information can be structured as a set of networks with interconnections and with some hierarchical relations between them. Few methods exist for the reconstruction of integrative networks. OBJECTIVE: In this work, we propose an integrative network reconstruction method in which the network organization for a particular type of omics data is guided by the network structure of a related type of omics data upstream in the omic cascade. The structure of these guiding data can be either already known or be estimated from the guiding data themselves. METHODS: The method consists of three steps. First a network structure for the guiding data should be provided. Next, responses in the target set are regressed on the full set of predictors in the guiding data with a Lasso penalty to reduce the number of predictors and an L2 penalty on the differences between coefficients for predictors that share edges in the network for the guiding data. Finally, a network is reconstructed on the fitted target responses as functions of the predictors in the guiding data. This way we condition the target network on the network of the guiding data. CONCLUSIONS: We illustrate our approach on two examples in Arabidopsis. The method detects groups of metabolites that have a similar genetic or transcriptomic basis.


Asunto(s)
Arabidopsis , Arabidopsis/genética , Arabidopsis/metabolismo , Biología de Sistemas/métodos , Redes Reguladoras de Genes , Algoritmos , Biología Computacional/métodos , Multiómica
2.
J Exp Bot ; 75(7): 2084-2099, 2024 Mar 27.
Artículo en Inglés | MEDLINE | ID: mdl-38134290

RESUMEN

Crop growth and phenology are driven by seasonal changes in environmental variables, with temperature as one important factor. However, knowledge about genotype-specific temperature response and its influence on phenology is limited. Such information is fundamental to improve crop models and adapt selection strategies. We measured the increase in height of 352 European winter wheat varieties in 4 years to quantify phenology, and fitted an asymptotic temperature response model. The model used hourly fluctuations in temperature to parameterize the base temperature (Tmin), the temperature optimum (rmax), and the steepness (lrc) of growth responses. Our results show that higher Tmin and lrc relate to an earlier start and end of stem elongation. A higher rmax relates to an increased final height. Both final height and rmax decreased for varieties originating from the continental east of Europe towards the maritime west. A genome-wide association study (GWAS) indicated a quantitative inheritance and a large degree of independence among loci. Nevertheless, genomic prediction accuracies (GBLUPs) for Tmin and lrc were low (r≤0.32) compared with other traits (r≥0.59). As well as known, major genes related to vernalization, photoperiod, or dwarfing, the GWAS indicated additional, as yet unknown loci that dominate the temperature response.


Asunto(s)
Estudio de Asociación del Genoma Completo , Triticum , Triticum/genética , Temperatura , Sitios de Carácter Cuantitativo , Fitomejoramiento , Fenotipo
3.
Heredity (Edinb) ; 2024 Jul 09.
Artículo en Inglés | MEDLINE | ID: mdl-38982296

RESUMEN

Chromosome substitution lines (CSLs) are tentatively supreme resources to investigate non-allelic genetic interactions. However, the difficulty of generating such lines in most species largely yielded imperfect CSL panels, prohibiting a systematic dissection of epistasis. Here, we present the development and use of a unique and complete panel of CSLs in Arabidopsis thaliana, allowing the full factorial analysis of epistatic interactions. A first comparison of reciprocal single chromosome substitutions revealed a dependency of QTL detection on different genetic backgrounds. The subsequent analysis of the complete panel of CSLs enabled the mapping of the genetic interactors and identified multiple two- and three-way interactions for different traits. Some of the detected epistatic effects were as large as any observed main effect, illustrating the impact of epistasis on quantitative trait variation. We, therefore, have demonstrated the high power of detection and mapping of genome-wide epistasis, confirming the assumed supremacy of comprehensive CSL sets.

4.
Bioinformatics ; 38(22): 5134-5136, 2022 11 15.
Artículo en Inglés | MEDLINE | ID: mdl-36193999

RESUMEN

MOTIVATION: Multi-parent populations (MPPs) are popular for QTL mapping because they combine wide genetic diversity in parents with easy control of population structure, but a limited number of software tools for QTL mapping are specifically developed for general MPP designs. RESULTS: We developed an R package called statgenMPP, adopting a unified identity-by-descent (IBD)-based mixed model approach for QTL analysis in MPPs. The package offers easy-to-use functionalities of IBD calculations, mixed model solutions and visualizations for QTL mapping in a wide range of MPP designs, including diallele, nested-association mapping populations, multi-parent advanced genetic inter-cross populations and other complicated MPPs with known crossing schemes. AVAILABILITY AND IMPLEMENTATION: The R package statgenMPP is open-source and freely available on CRAN at https://CRAN.R-project.org/package=statgenMPP. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Programas Informáticos , Mapeo Cromosómico
5.
Theor Appl Genet ; 135(6): 2059-2082, 2022 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-35524815

RESUMEN

KEY MESSAGE: We evaluate self-organizing maps (SOM) to identify adaptation zones and visualize multi-environment genotypic responses. We apply SOM to multiple traits and crop growth model output of large-scale European sunflower data. Genotype-by-environment interactions (G × E) complicate the selection of well-adapted varieties. A possible solution is to group trial locations into adaptation zones with G × E occurring mainly between zones. By selecting for good performance inside those zones, response to selection is increased. In this paper, we present a two-step procedure to identify adaptation zones that starts from a self-organizing map (SOM). In the SOM, trials across locations and years are assigned to groups, called units, that are organized on a two-dimensional grid. Units that are further apart contain more distinct trials. In an iterative process of reweighting trial contributions to units, the grid configuration is learnt simultaneously with the trial assignment to units. An aggregation of the units in the SOM by hierarchical clustering then produces environment types, i.e. trials with similar growing conditions. Adaptation zones can subsequently be identified by grouping trial locations with similar distributions of environment types across years. For the construction of SOMs, multiple data types can be combined. We compared environment types and adaptation zones obtained for European sunflower from quantitative traits like yield, oil content, phenology and disease scores with those obtained from environmental indices calculated with the crop growth model Sunflo. We also show how results are affected by input data organization and user-defined weights for genotypes and traits. Adaptation zones for European sunflower as identified by our SOM-based strategy captured substantial genotype-by-location interaction and pointed to trials in Spain, Turkey and South Bulgaria as inducing different genotypic responses.


Asunto(s)
Helianthus , Adaptación Fisiológica , Algoritmos , Análisis por Conglomerados , Genotipo , Helianthus/genética
6.
Mol Breed ; 42(12): 76, 2022 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-37313326

RESUMEN

Genome-wide association studies (GWAS) are a useful tool to unravel the genetic architecture of complex traits, but the results can be difficult to interpret. Population structure, genetic heterogeneity, and rare alleles easily result in false positive or false negative associations. This paper describes the analysis of a GWAS panel combined with three bi-parental mapping populations to validate GWAS results, using phenotypic data for steroidal glycoalkaloid (SGA) accumulation and the ratio (SGR) between the two major glycoalkaloids α-solanine and α-chaconine in potato tubers. SGAs are secondary metabolites in the Solanaceae family, functional as a defence against various pests and pathogens and in high quantities toxic for humans. With GWAS, we identified five quantitative trait loci (QTL) of which Sga1.1, Sgr8.1, and Sga11.1 were validated, but not Sga3.1 and Sgr7.1. In the bi-parental populations, Sga5.1 and Sga7.1 were mapped, but these were not identified with GWAS. The QTLs Sga1.1, Sga7.1, Sgr7.1, and Sgr8.1 co-localize with genes GAME9, GAME 6/GAME 11, SGT1, and SGT2, respectively. For other genes involved in SGA synthesis, no QTLs were identified. The results of this study illustrate a number of pitfalls in GWAS of which population structure seems the most important. We also show that introgression breeding for disease resistance has introduced new haplotypes to the gene pool involved in higher SGA levels in certain pedigrees. Finally, we show that high SGA levels remain unpredictable in potato but that α-solanine/α-chaconine ratio has a predictable outcome with specific SGT1 and SGT2 haplotypes. Supplementary Information: The online version contains supplementary material available at 10.1007/s11032-022-01344-2.

7.
Plant J ; 102(4): 872-882, 2020 05.
Artículo en Inglés | MEDLINE | ID: mdl-31856318

RESUMEN

Natural variation has become a prime resource to identify genetic variants that contribute to phenotypic variation. The regional mapping (RegMap) population is one of the most important populations for studying natural variation in Arabidopsis thaliana, and has been used in a large number of association studies and in studies on climatic adaptation. However, only 413 RegMap accessions have been completely sequenced, as part of the 1001 Genomes (1001G) Project, while the remaining 894 accessions have only been genotyped with the Affymetrix 250k chip. As a consequence, most association studies involving the RegMap are either restricted to the sequenced accessions, reducing power, or rely on a limited set of SNPs. Here we impute millions of SNPs to the 894 accessions that are exclusive to the RegMap, using the 1135 accessions of the 1001G Project as the reference panel. We assess imputation accuracy using a novel cross-validation scheme, which we show provides a more reliable measure of accuracy than existing methods. After filtering out low accuracy SNPs, we obtain high-quality genotypic information for 2029 accessions and 3 million markers. To illustrate the benefits of these imputed data, we reconducted genome-wide association studies on five stress-related traits and could identify novel candidate genes.


Asunto(s)
Arabidopsis/genética , Genoma de Planta/genética , Polimorfismo de Nucleótido Simple/genética , Arabidopsis/fisiología , Estudio de Asociación del Genoma Completo , Genotipo , Análisis de Secuencia por Matrices de Oligonucleótidos , Fenotipo , Estrés Fisiológico
8.
Plant J ; 103(3): 1189-1204, 2020 08.
Artículo en Inglés | MEDLINE | ID: mdl-32369642

RESUMEN

Tomato (Solanum lycopersicum L.) has become a popular model for genetic studies of fruit flavor in the last two decades. In this article we present a study of tomato fruit flavor, including an analysis of the genetic, metabolic and sensorial variation of a collection of contemporary commercial glasshouse tomato cultivars, followed by a validation of the associations found by quantitative trait locus (QTL) analysis of representative biparental segregating populations. This led to the identification of the major sensorial and chemical components determining fruit flavor variation and detection of the underlying QTLs. The high representation of QTL haplotypes in the breeders' germplasm suggests that there is great potential for applying these QTLs in current breeding programs aimed at improving tomato flavor. A QTL on chromosome 4 was found to affect the levels of the phenylalanine-derived volatiles (PHEVs) 2-phenylethanol, phenylacetaldehyde and 1-nitro-2-phenylethane. Fruits of near-isogenic lines contrasting for this locus and in the composition of PHEVs significantly differed in the perception of fruity and rose-hip-like aroma. The PHEV locus was fine mapped, which allowed for the identification of FLORAL4 as a candidate gene for PHEV regulation. Using a gene-editing-based (CRISPR-CAS9) reverse-genetics approach, FLORAL4 was demonstrated to be the key factor in this QTL affecting PHEV accumulation in tomato fruit.


Asunto(s)
Boratos/metabolismo , Fructosa/análogos & derivados , Genes de Plantas/genética , Sitios de Carácter Cuantitativo/genética , Solanum lycopersicum/genética , Boratos/normas , Proteína 9 Asociada a CRISPR , Sistemas CRISPR-Cas , Mapeo Cromosómico , Cromosomas de las Plantas/genética , Calidad de los Alimentos , Fructosa/metabolismo , Fructosa/normas , Edición Génica , Genes de Plantas/fisiología , Solanum lycopersicum/metabolismo , Solanum lycopersicum/normas , Fenilalanina/metabolismo , Carácter Cuantitativo Heredable , Compuestos Orgánicos Volátiles/metabolismo
9.
Theor Appl Genet ; 134(3): 897-908, 2021 Mar.
Artículo en Inglés | MEDLINE | ID: mdl-33367942

RESUMEN

Much has been published on QTL detection for complex traits using bi-parental and multi-parental crosses (linkage analysis) or diversity panels (GWAS studies). While successful for detection, transferability of results to real applications has proven more difficult. Here, we combined a QTL detection approach using a pre-breeding populations which utilized intensive phenotypic selection for the target trait across multiple plant generations, combined with rapid generation turnover (i.e. "speed breeding") to allow cycling of multiple plant generations each year. The reasoning is that QTL mapping information would complement the selection process by identifying the genome regions under selection within the relevant germplasm. Questions to answer were the location of the genomic regions determining response to selection and the origin of the favourable alleles within the pedigree. We used data from a pre-breeding program that aimed at pyramiding different resistance sources to Fusarium crown rot into elite (but susceptible) wheat backgrounds. The population resulted from a complex backcrossing scheme involving multiple resistance donors and multiple elite backgrounds, akin to a MAGIC population (985 genotypes in total, with founders, and two major offspring layers within the pedigree). A significant increase in the resistance level was observed (i.e. a positive response to selection) after the selection process, and 17 regions significantly associated with that response were identified using a GWAS approach. Those regions included known QTL as well as potentially novel regions contributing resistance to Fusarium crown rot. In addition, we were able to trace back the sources of the favourable alleles for each QTL. We demonstrate that QTL detection using breeding populations under selection for the target trait can identify QTL controlling the target trait and that the frequency of the favourable alleles was increased as a response to selection, thereby validating the QTL detected. This is a valuable opportunistic approach that can provide QTL information that is more easily transferred to breeding applications.


Asunto(s)
Resistencia a la Enfermedad/genética , Fusarium/fisiología , Marcadores Genéticos , Fitomejoramiento , Enfermedades de las Plantas/genética , Sitios de Carácter Cuantitativo , Triticum/genética , Alelos , Mapeo Cromosómico/métodos , Cromosomas de las Plantas/genética , Resistencia a la Enfermedad/inmunología , Ligamiento Genético , Enfermedades de las Plantas/microbiología , Triticum/inmunología , Triticum/microbiología
10.
Theor Appl Genet ; 134(11): 3643-3660, 2021 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-34342658

RESUMEN

KEY MESSAGE: The identity-by-descent (IBD)-based mixed model approach introduced in this study can detect quantitative trait loci (QTLs) referring to the parental origin and simultaneously account for multilevel relatedness of individuals within and across families. This unified approach is proved to be a powerful approach for all kinds of multiparental population (MPP) designs. Multiparental populations (MPPs) have become popular for quantitative trait loci (QTL) detection. Tools for QTL mapping in MPPs are mostly developed for specific MPPs and do not generalize well to other MPPs. We present an IBD-based mixed model approach for QTL mapping in all kinds of MPP designs, e.g., diallel, Nested Association Mapping (NAM), and Multiparental Advanced Generation Intercross (MAGIC) designs. The first step is to compute identity-by-descent (IBD) probabilities using a general Hidden Markov model framework, called reconstructing ancestry blocks bit by bit (RABBIT). Next, functions of IBD information are used as design matrices, or genetic predictors, in a mixed model approach to estimate variance components for multiallelic genetic effects associated with parents. Family-specific residual genetic effects are added, and a polygenic effect is structured by kinship relations between individuals. Case studies of simulated diallel, NAM, and MAGIC designs proved that the advanced IBD-based multi-QTL mixed model approach incorporating both kinship relations and family-specific residual variances (IBD.MQMkin_F) is robust across a variety of MPP designs and allele segregation patterns in comparison to a widely used benchmark association mapping method, and in most cases, outperformed or behaved at least as well as other tools developed for specific MPP designs in terms of mapping power and resolution. Successful analyses of real data cases confirmed the wide applicability of our IBD-based mixed model methodology.


Asunto(s)
Mapeo Cromosómico , Modelos Genéticos , Sitios de Carácter Cuantitativo , Alelos , Simulación por Computador , Modelos Lineales , Cadenas de Markov , Plantas/genética
11.
Plant J ; 99(6): 1172-1191, 2019 09.
Artículo en Inglés | MEDLINE | ID: mdl-31108005

RESUMEN

Broadening the genetic base of crops is crucial for developing varieties to respond to global agricultural challenges such as climate change. Here, we analysed a diverse panel of 371 domesticated lines of the model crop barley to explore the genetics of crop adaptation. We first collected exome sequence data and phenotypes of key life history traits from contrasting multi-environment common garden trials. Then we applied refined statistical methods, including some based on exomic haplotype states, for genotype-by-environment (G×E) modelling. Sub-populations defined from exomic profiles were coincident with barley's biology, geography and history, and explained a high proportion of trial phenotypic variance. Clear G×E interactions indicated adaptation profiles that varied for landraces and cultivars. Exploration of circadian clock-related genes, associated with the environmentally adaptive days to heading trait (crucial for the crop's spread from the Fertile Crescent), illustrated complexities in G×E effect directions, and the importance of latitudinally based genic context in the expression of large-effect alleles. Our analysis supports a gene-level scientific understanding of crop adaption and leads to practical opportunities for crop improvement, allowing the prioritisation of genomic regions and particular sets of lines for breeding efforts seeking to cope with climate change and other stresses.


Asunto(s)
Aclimatación/genética , Productos Agrícolas/genética , Exoma , Hordeum/genética , Relojes Circadianos/genética , Variación Genética , Estudio de Asociación del Genoma Completo , Genotipo , Geografía , Haplotipos , Desequilibrio de Ligamiento , Fenotipo , Fitomejoramiento , Polimorfismo de Nucleótido Simple , Sitios de Carácter Cuantitativo , Secuenciación del Exoma
12.
J Nutr ; 150(3): 634-643, 2020 03 01.
Artículo en Inglés | MEDLINE | ID: mdl-31858107

RESUMEN

BACKGROUND: In nutritional epidemiology, dealing with confounding and complex internutrient relations are major challenges. An often-used approach is dietary pattern analyses, such as principal component analysis, to deal with internutrient correlations, and to more closely resemble the true way nutrients are consumed. However, despite these improvements, these approaches still require subjective decisions in the preselection of food groups. Moreover, they do not make efficient use of multivariate dietary data, because they detect only marginal associations. We propose the use of copula graphical models (CGMs) to model and make statistical inferences regarding complex associations among variables in multivariate data, where associations between all variables can be learned simultaneously. OBJECTIVE: We aimed to reconstruct nutritional intake and physical functioning networks in Dutch older adults by applying a CGM. METHODS: We addressed this issue by uncovering the pairwise associations between variables while correcting for the effect of remaining variables. More specifically, we used a CGM to infer the precision matrix, which contains all the conditional independence relations between nodes in the graph. The nonzero elements of the precision matrix indicate the presence of a direct association. We applied this method to reconstruct nutrient-physical functioning networks from the combined data of 4 studies (Nu-Age, ProMuscle, ProMO, and V-Fit, total n = 662, mean ± SD age = 75 ± 7 y). The method was implemented in the R package nutriNetwork which is freely available at https://cran.r-project.org/web/packages/nutriNetwork. RESULTS: Greater intakes of vegetable protein and vitamin B-6 were partially correlated with higher scores on the total Short Physical Performance Battery (SPPB) and the chair rise test. Greater intakes of vitamin B-12 and folate were partially correlated with higher scores on the chair rise test and the total SPPB, respectively. CONCLUSIONS: We determined that vegetable protein, vitamin B-6, folate, and vitamin B-12 intakes are partially correlated with improved functional outcome measurements in Dutch older adults.


Asunto(s)
Ácido Fólico/administración & dosificación , Modelos Teóricos , Rendimiento Físico Funcional , Proteínas de Vegetales Comestibles/administración & dosificación , Vitamina B 12/administración & dosificación , Vitamina B 6/administración & dosificación , Anciano , Anciano de 80 o más Años , Índice de Masa Corporal , Anciano Frágil , Humanos , Países Bajos
13.
Theor Appl Genet ; 133(9): 2627-2638, 2020 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-32518992

RESUMEN

KEY MESSAGE: Multi-parent populations multi-environment QTL experiments data should be analysed jointly to estimate the QTL effect variation within the population and between environments. Commonly, QTL detection in multi-parent populations (MPPs) data measured in multiple environments (ME) is done by analyzing genotypic values 'averaged' across environments. This method ignores the environment-specific QTL (QTLxE) effects. Running separate single environment analyses is a possibility to measure QTLxE effects, but those analyses do not model the genetic covariance due to the use of the same genotype in different environments. In this paper, we propose methods to analyse MPP-ME QTL experiments using simultaneously the data from several environments and modelling the genotypic covariance. Using data from the EU-NAM Flint population, we show that these methods estimate the QTLxE effects and that they can improve the quality of the QTL detection. Those methods also have a larger inference power. For example, they can be extended to integrate environmental indices like temperature or precipitation to better understand the mechanisms behind the QTLxE effects. Therefore, our methodology allows the exploitation of the full MPP-ME data potential: to estimate QTL effect variation (a) within the MPP between sub-populations due to different genetic backgrounds and (b) between environments.


Asunto(s)
Cruzamientos Genéticos , Ambiente , Modelos Genéticos , Sitios de Carácter Cuantitativo , Zea mays/genética , Interacción Gen-Ambiente , Genotipo
14.
Theor Appl Genet ; 133(3): 1009-1018, 2020 Mar.
Artículo en Inglés | MEDLINE | ID: mdl-31907563

RESUMEN

KEY MESSAGE: Multi-environment models using marker-based kinship information for both additive and dominance effects can accurately predict hybrid performance in different environments. Sorghum is an important hybrid crop that is grown extensively in many subtropical and tropical regions including Northern NSW and Queensland in Australia. The highly varying weather patterns in the Australian summer months mean that sorghum hybrids exhibit a great deal of variation in yield between locations. To ultimately enable prediction of the outcome of crossing parental lines, both additive effects on yield performance and dominance interaction effects need to be characterised. This paper demonstrates that fitting a linear mixed model that includes both types of effects calculated using genetic markers in relationship matrices improves predictions. Genotype by environment interactions was investigated by comparing FA1 (single-factor analytic) and FA2 (two-factor analytic) structures. The G×E causes a change in hybrid rankings between trials with a difference of up to 25% of the hybrids in the top 10% of each trial. The prediction accuracies increased with the addition of the dominance term (over and above that achieved with an additive effect alone) by an average of 15% and a maximum of 60%. The percentage of dominance of the total genetic variance varied between trials with the trials with higher broad-sense heritability having the greater percentage of dominance. The inclusion of dominance in the factor analytic models improves the accuracy of the additive effects. Breeders selecting high yielding parents for crossing need to be aware of effects due to environment and dominance.


Asunto(s)
Fitomejoramiento , Sorghum/genética , Australia , Clima , Epistasis Genética , Genes Dominantes , Estudios de Asociación Genética , Marcadores Genéticos , Variación Genética , Genómica , Genotipo , Modelos Genéticos , Linaje , Fenotipo , Polimorfismo de Nucleótido Simple , Selección Genética , Sorghum/crecimiento & desarrollo
15.
Phytopathology ; 110(3): 633-647, 2020 Mar.
Artículo en Inglés | MEDLINE | ID: mdl-31680652

RESUMEN

Common bean (Phaseolus vulgaris) is one of the most consumed legume crops in the world, and Fusarium wilt, caused by the fungus Fusarium oxysporum f. sp. phaseoli, is one of the major diseases affecting its production. Portugal holds a very promising common bean germplasm with an admixed genetic background that may reveal novel genetic resistance combinations between the original Andean and Mesoamerican gene pools. To identify new sources of Fusarium wilt resistance and detect resistance-associated single-nucleotide polymorphisms (SNPs), we explored, for the first time, a diverse collection of the underused Portuguese common bean germplasm by using genome-wide association analyses. The collection was evaluated for Fusarium wilt resistance under growth chamber conditions, with the highly virulent F. oxysporum f. sp. phaseoli strain FOP-SP1 race 6. Fourteen of the 162 Portuguese accessions evaluated were highly resistant and 71 intermediate. The same collection was genotyped with DNA sequencing arrays, and SNP-resistance associations were tested via a mixed linear model accounting for the genetic relatedness between accessions. The results from the association mapping revealed nine SNPs associated with resistance on chromosomes Pv04, Pv05, Pv07, and Pv08, indicating that Fusarium wilt resistance is under oligogenic control. Putative candidate genes related to phytoalexin biosynthesis, hypersensitive response, and plant primary metabolism were identified. The results reported here highlight the importance of exploring underused germplasm for new sources of resistance and provide new genomic targets for the development of functional markers to support selection in future disease resistance breeding programs.


Asunto(s)
Fusarium , Phaseolus , Resistencia a la Enfermedad , Estudio de Asociación del Genoma Completo , Humanos , Enfermedades de las Plantas , Portugal
16.
BMC Plant Biol ; 19(1): 123, 2019 Apr 02.
Artículo en Inglés | MEDLINE | ID: mdl-30940081

RESUMEN

BACKGROUND: Maize is a crop in high demand for food purposes and consumers worldwide are increasingly concerned with food quality. However, breeding for improved quality is a complex task and therefore developing tools to select for better quality products is of great importance. Kernel composition, flour pasting behavior, and flour particle size have been previously identified as crucial for maize-based food quality. In this work we carried out a genome-wide association study to identify genomic regions controlling compositional and pasting properties of maize wholemeal flour. RESULTS: A collection of 132 diverse inbred lines, with a considerable representation of the food used Portuguese unique germplasm, was trialed during two seasons, and harvested samples characterized for main compositional traits, flour pasting parameters and mean particle size. The collection was genotyped with the MaizeSNP50 array. SNP-trait associations were tested using a mixed linear model accounting for genetic relatedness. Fifty-seven genomic regions were identified, associated with the 11 different quality-related traits evaluated. Regions controlling multiple traits were detected and potential candidate genes identified. As an example, for two viscosity parameters that reflect the capacity of the starch to absorb water and swell, the strongest common associated region was located near the dull endosperm 1 gene that encodes a starch synthase and is determinant on the starch endosperm structure in maize. CONCLUSIONS: This study allowed for identifying relevant regions on the maize genome affecting maize kernel composition and flour pasting behavior, candidate genes for the majority of the quality-associated genomic regions, or the most promising target regions to develop molecular tools to increase efficacy and efficiency of quality traits selection (such as "breadability") within maize breeding programs.


Asunto(s)
Estudio de Asociación del Genoma Completo , Almidón/metabolismo , Zea mays/genética , Endospermo/genética , Endospermo/metabolismo , Harina , Genómica , Genotipo , Valor Nutritivo , Fenotipo , Fitomejoramiento , Semillas/genética , Semillas/metabolismo , Zea mays/metabolismo
17.
Theor Appl Genet ; 132(7): 2055-2067, 2019 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-30968160

RESUMEN

KEY MESSAGE: The use of a kinship matrix integrating pedigree- and marker-based relationships optimized the performance of genomic prediction in sorghum, especially for traits of lower heritability. Selection based on genome-wide markers has become an active breeding strategy in crops. Genomic prediction models can make use of pedigree information to account for the residual polygenic effects not captured by markers. Our aim was to evaluate the impact of using pedigree and genomic information on prediction quality of breeding values for different traits in sorghum. We explored BLUP models that use weighted combinations of pedigree and genomic relationship matrices. The optimal weighting factor was empirically determined in order to maximize predictive ability after evaluating a range of candidate weights. The phenotypic data consisted of testcross evaluations of sorghum parental lines across multiple environments. All lines were genotyped, and full pedigree information was available. The performance of the best predictive combined matrix was compared to that of models fitting the component matrices independently. Model performance was assessed using cross-validation technique. Fitting a combined pedigree-genomic matrix with the optimal weight always yielded the largest increases in predictive ability and the largest reductions in prediction bias relative to the simple G-BLUP. However, the weight that optimized prediction varied across traits. The benefits of including pedigree information in the genomic model were more relevant for traits with lower heritability, such as grain yield and stay-green. Our results suggest that the combination of pedigree and genomic relatedness can be used to optimize predictions of complex traits in crops when the additive variation is not fully explained by markers.


Asunto(s)
Genómica/métodos , Modelos Genéticos , Linaje , Fitomejoramiento , Sorghum/genética , Genotipo , Fenotipo
18.
Genet Sel Evol ; 51(1): 2, 2019 Jan 24.
Artículo en Inglés | MEDLINE | ID: mdl-30678638

RESUMEN

BACKGROUND: Use of whole-genome sequence data (WGS) is expected to improve identification of quantitative trait loci (QTL). However, this requires imputation to WGS, often with a limited number of sequenced animals for the target population. The objective of this study was to investigate imputation to WGS in two pig lines using a multi-line reference population and, subsequently, to investigate the effect of using these imputed WGS (iWGS) for GWAS. METHODS: Phenotypes and genotypes were available on 12,184 Large White pigs (LW-line) and 4943 Dutch Landrace pigs (DL-line). Imputed 660 K and 80 K genotypes for the LW-line and DL-line, respectively, were imputed to iWGS using Beagle v.4.1. Since only 32 LW-line and 12 DL-line boars were sequenced, 142 animals from eight commercial lines were added. GWAS were performed for each line using the 80 K and 660 K SNPs, the genotype scores of iWGS SNPs that had an imputation accuracy (Beagle R2) higher than 0.6, and the dosage scores of all iWGS SNPs. RESULTS: For the DL-line (LW-line), imputation of 80 K genotypes to iWGS resulted in an average Beagle R2 of 0.39 (0.49). After quality control, 2.5 × 106 (3.5 × 106) SNPs had a Beagle R2 higher than 0.6, resulting in an average Beagle R2 of 0.83 (0.93). Compared to the 80 K and 660 K genotypes, using iWGS led to the identification of 48.9 and 64.4% more QTL regions, for the DL-line and LW-line, respectively, and the most significant SNPs in the QTL regions explained a higher proportion of phenotypic variance. Using dosage instead of genotype scores improved the identification of QTL, because the model accounted for uncertainty of imputation, and all SNPs were used in the analysis. CONCLUSIONS: Imputation to WGS using the multi-line reference population resulted in relatively poor imputation, especially when imputing from 80 K (DL-line). In spite of the poor imputation accuracies, using iWGS instead of a lower density SNP chip increased the number of detected QTL and the estimated proportion of phenotypic variance explained by these QTL, especially when dosage scores were used instead of genotype scores. Thus, iWGS, even with poor imputation accuracy, can be used to identify possible interesting regions for fine mapping.


Asunto(s)
Estudio de Asociación del Genoma Completo/métodos , Porcinos/genética , Secuenciación Completa del Genoma/métodos , Animales , Estudio de Asociación del Genoma Completo/normas , Estudio de Asociación del Genoma Completo/veterinaria , Polimorfismo de Nucleótido Simple , Sitios de Carácter Cuantitativo , Secuenciación Completa del Genoma/normas , Secuenciación Completa del Genoma/veterinaria
19.
J Anim Breed Genet ; 136(6): 418-429, 2019 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-31215703

RESUMEN

Significance testing for genome-wide association study (GWAS) with increasing SNP density up to whole-genome sequence data (WGS) is not straightforward, because of strong LD between SNP and population stratification. Therefore, the objective of this study was to investigate genomic control and different significance testing procedures using data from a commercial pig breeding scheme. A GWAS was performed in GCTA with data of 4,964 Large White pigs using medium density, high density or imputed whole-genome sequence data, fitting a genomic relationship matrix based on a leave-one-chromosome-out approach to account for population structure. Subsequently, genomic inflation factors were assessed on whole-genome level and the chromosome level. To establish a significance threshold, permutation testing, Bonferroni corrections using either the total number of SNPs or the number of independent chromosome fragments, and false discovery rates (FDR) using either the Benjamini-Hochberg procedure or the Benjamini and Yekutieli procedure were evaluated. We found that genomic inflation factors did not differ between different density genotypes but do differ between chromosomes. Also, the leave-one-chromosome-out approach for GWAS or using the pedigree relationships did not account appropriately for population stratification and gave strong genomic inflation. Regarding different procedures for significance testing, when the aim is to find QTL regions that are associated with a trait of interest, we recommend applying the FDR following the Benjamini and Yekutieli approach to establish a significance threshold that is adjusted for multiple testing. When the aim is to pinpoint a specific mutation, the more conservative Bonferroni correction based on the total number of SNPs is more appropriate, till an appropriate method is established to adjust for the number of independent tests.


Asunto(s)
Estudio de Asociación del Genoma Completo , Genómica , Genotipo , Secuenciación Completa del Genoma , Animales , Cruzamiento , Polimorfismo de Nucleótido Simple , Sitios de Carácter Cuantitativo/genética , Porcinos/genética
20.
New Phytol ; 213(3): 1346-1362, 2017 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-27699793

RESUMEN

Plants are exposed to combinations of various biotic and abiotic stresses, but stress responses are usually investigated for single stresses only. Here, we investigated the genetic architecture underlying plant responses to 11 single stresses and several of their combinations by phenotyping 350 Arabidopsis thaliana accessions. A set of 214 000 single nucleotide polymorphisms (SNPs) was screened for marker-trait associations in genome-wide association (GWA) analyses using tailored multi-trait mixed models. Stress responses that share phytohormonal signaling pathways also share genetic architecture underlying these responses. After removing the effects of general robustness, for the 30 most significant SNPs, average quantitative trait locus (QTL) effect sizes were larger for dual stresses than for single stresses. Plants appear to deploy broad-spectrum defensive mechanisms influencing multiple traits in response to combined stresses. Association analyses identified QTLs with contrasting and with similar responses to biotic vs abiotic stresses, and below-ground vs above-ground stresses. Our approach allowed for an unprecedented comprehensive genetic analysis of how plants deal with a wide spectrum of stress conditions.


Asunto(s)
Arabidopsis/genética , Arabidopsis/fisiología , Mapeo Cromosómico , Estudio de Asociación del Genoma Completo , Estrés Fisiológico/genética , ADN Bacteriano/genética , Genes de Plantas , Estudios de Asociación Genética , Patrón de Herencia/genética , Modelos Genéticos , Mutación/genética , Fenotipo , Reguladores del Crecimiento de las Plantas/metabolismo , Sitios de Carácter Cuantitativo/genética , Reproducibilidad de los Resultados
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA