RESUMO
Despite the negative impact of common scab (Streptomyces spp.) on the potato industry, little is known about the genetic architecture of resistance to this bacterial disease in the crop. We evaluated a mapping population (â¼150 full sibs) derived from a cross between two tetraploid potatoes ('Atlantic' × B1829-5) in three environments (MN11, PA11, ME12) under natural common scab pressure. Three measures to common scab reaction, namely percentage of scabby tubers and disease area and lesion indices, were found to be highly correlated (>0.76). Because of the large environmental effect, heritability values were zero for all three traits in MN11, but moderate to high in PA11 and ME12 (â¼0.44 to 0.79). We identified a single quantitative trait locus (QTL) for lesion index in PA11, ME12, and joint analyses on linkage group 3, explaining â¼22 to 30% of the total variation. The identification of QTL haplotypes and candidate genes contributing to disease resistance can support genomics-assisted breeding approaches in the crop.[Formula: see text] Copyright © 2021 The Author(s). This is an open access article distributed under the CC BY 4.0 International license.
Assuntos
Solanum tuberosum , Mapeamento Cromossômico , Tubérculos/genética , Locos de Características Quantitativas/genética , Solanum tuberosum/genética , TetraploidiaRESUMO
Despite advances in sequencing technologies, assembly of complex plant genomes remains elusive due to polyploidy and high repeat content. Here we report PolyGembler for grouping and ordering contigs into pseudomolecules by genetic linkage analysis. Our approach also provides an accurate method with which to detect and fix assembly errors. Using simulated data, we demonstrate that our approach is of high accuracy and outperforms three existing state-of-the-art genetic mapping tools. Particularly, our approach is more robust to the presence of missing genotype data and genotyping errors. We used our method to construct pseudomolecules for allotetraploid lawn grass utilizing PacBio long reads in combination with restriction site-associated DNA sequencing, and for diploid Ipomoea trifida and autotetraploid potato utilizing contigs assembled from Illumina reads in combination with genotype data generated by single-nucleotide polymorphism arrays and genotyping by sequencing, respectively. We resolved 13 assembly errors for a published I. trifida genome assembly and anchored eight unplaced scaffolds in the published potato genome.
Assuntos
Algoritmos , Cromossomos de Plantas , Ligação Genética , Genoma de Planta , Poliploidia , Simulação por Computador , Genótipo , Ipomoea/genética , Melhoramento Vegetal , Poaceae/genética , Análise Serial de Proteínas , Solanum tuberosum/genéticaRESUMO
KEY MESSAGE: Polypoid crop breeders can balance resources between density and sequencing depth, dosage information and fewer highly informative SNPs recommended, non-additive models and QTL advantages on prediction dependent on trait architecture. The autopolyploid nature of potato and sweetpotato ensures a wide range of meiotic configurations and linkage phases leading to complex gene-action and pose problems in genotype data quality and genomic selection analyses. We used a 315-progeny biparental F1 population of hexaploid sweetpotato and a diversity panel of 380 tetraploid potato, genotyped using different platforms to answer the following questions: (i) do polyploid crop breeders need to invest more for additional sequencing depth? (ii) how many markers are required to make selection decisions? (iii) does considering non-additive genetic effects improve predictive ability (PA)? (iv) does considering dosage or quantitative trait loci (QTL) offer significant improvement to PA? Our results show that only a small number of highly informative single nucleotide polymorphisms (SNPs; ≤ 1000) are adequate for prediction in the type of populations we analyzed. We also show that considering dosage information and models considering only additive effects had the best PA for most traits, while the comparative advantage of considering non-additive genetic effects and including known QTL in the predictive model depended on trait architecture. We conclude that genomic selection can help accelerate the rate of genetic gains in potato and sweetpotato. However, application of genomic selection should be considered as part of optimizing the entire breeding program. Additionally, since the predictions in the current study are based on single populations, further studies on the effects of haplotype structure and inheritance on PA should be studied in actual multi-generation breeding populations.
Assuntos
Produtos Agrícolas/genética , Genótipo , Ipomoea batatas/genética , Melhoramento Vegetal , Polimorfismo de Nucleotídeo Único , Poliploidia , Seleção Genética , Produtos Agrícolas/crescimento & desenvolvimento , Ipomoea batatas/crescimento & desenvolvimento , Fenótipo , Locos de Características Quantitativas , Análise de Sequência de DNARESUMO
In developing countries, the sweetpotato, Ipomoea batatas (L.) Lam. [Formula: see text], is an important autopolyploid species, both socially and economically. However, quantitative trait loci (QTL) mapping has remained limited due to its genetic complexity. Current fixed-effect models can fit only a single QTL and are generally hard to interpret. Here, we report the use of a random-effect model approach to map multiple QTL based on score statistics in a sweetpotato biparental population ('Beauregard' × 'Tanzania') with 315 full-sibs. Phenotypic data were collected for eight yield component traits in six environments in Peru, and jointly adjusted means were obtained using mixed-effect models. An integrated linkage map consisting of 30,684 markers distributed along 15 linkage groups (LGs) was used to obtain the genotype conditional probabilities of putative QTL at every centiMorgan position. Multiple interval mapping was performed using our R package QTLpoly and detected a total of 13 QTL, ranging from none to four QTL per trait, which explained up to 55% of the total variance. Some regions, such as those on LGs 3 and 15, were consistently detected among root number and yield traits, and provided a basis for candidate gene search. In addition, some QTL were found to affect commercial and noncommercial root traits distinctly. Further best linear unbiased predictions were decomposed into additive allele effects and were used to compute multiple QTL-based breeding values for selection. Together with quantitative genotyping and its appropriate usage in linkage analyses, this QTL mapping methodology will facilitate the use of genomic tools in sweetpotato breeding as well as in other autopolyploids.
Assuntos
Mapeamento Cromossômico/métodos , Estudo de Associação Genômica Ampla/métodos , Ipomoea batatas/genética , Poliploidia , Locos de Características Quantitativas , Melhoramento Vegetal/métodosRESUMO
KEY MESSAGE: ß-Carotene content in sweetpotato is associated with the Orange and phytoene synthase genes; due to physical linkage of phytoene synthase with sucrose synthase, ß-carotene and starch content are negatively correlated. In populations depending on sweetpotato for food security, starch is an important source of calories, while ß-carotene is an important source of provitamin A. The negative association between the two traits contributes to the low nutritional quality of sweetpotato consumed, especially in sub-Saharan Africa. Using a biparental mapping population of 315 F1 progeny generated from a cross between an orange-fleshed and a non-orange-fleshed sweetpotato variety, we identified two major quantitative trait loci (QTL) on linkage group (LG) three (LG3) and twelve (LG12) affecting starch, ß-carotene, and their correlated traits, dry matter and flesh color. Analysis of parental haplotypes indicated that these two regions acted pleiotropically to reduce starch content and increase ß-carotene in genotypes carrying the orange-fleshed parental haplotype at the LG3 locus. Phytoene synthase and sucrose synthase, the rate-limiting and linked genes located within the QTL on LG3 involved in the carotenoid and starch biosynthesis, respectively, were differentially expressed in Beauregard versus Tanzania storage roots. The Orange gene, the molecular switch for chromoplast biogenesis, located within the QTL on LG12 while not differentially expressed was expressed in developing roots of the parental genotypes. We conclude that these two QTL regions act together in a cis and trans manner to inhibit starch biosynthesis in amyloplasts and enhance chromoplast biogenesis, carotenoid biosynthesis, and accumulation in orange-fleshed sweetpotato. Understanding the genetic basis of this negative association between starch and ß-carotene will inform future sweetpotato breeding strategies targeting sweetpotato for food and nutritional security.
Assuntos
Regulação da Expressão Gênica de Plantas , Ipomoea batatas/genética , Poliploidia , Locos de Características Quantitativas/genética , Amido/metabolismo , beta Caroteno/metabolismo , Alelos , Meio Ambiente , Estudos de Associação Genética , Fenótipo , Raízes de Plantas/genética , Raízes de Plantas/crescimento & desenvolvimento , Característica Quantitativa HerdávelRESUMO
The hexaploid sweetpotato (Ipomoea batatas (L.) Lam., 2n = 6x = 90) is an important staple food crop worldwide and plays a vital role in alleviating famine in developing countries. Due to its high ploidy level, genetic studies in sweetpotato lag behind major diploid crops significantly. We built an ultra-dense multilocus integrated genetic map and characterized the inheritance system in a sweetpotato full-sib family using our newly developed software, MAPpoly. The resulting genetic map revealed 96.5% collinearity between I. batatas and its diploid relative I. trifida We computed the genotypic probabilities across the whole genome for all individuals in the mapping population and inferred their complete hexaploid haplotypes. We provide evidence that most of the meiotic configurations (73.3%) were resolved in bivalents, although a small portion of multivalent signatures (15.7%), among other inconclusive configurations (11.0%), were also observed. Except for low levels of preferential pairing in linkage group 2, we observed a hexasomic inheritance mechanism in all linkage groups. We propose that the hexasomic-bivalent inheritance promotes stability to the allelic transmission in sweetpotato.
Assuntos
Cromossomos de Plantas/genética , Ligação Genética , Ipomoea batatas/genética , Poliploidia , Mapeamento Cromossômico/métodos , Pareamento Cromossômico , Loci Gênicos , HaplótiposRESUMO
Genomic selection is an efficient approach to get shorter breeding cycles in recurrent selection programs and greater genetic gains with selection of superior individuals. Despite advances in genotyping techniques, genetic studies for polyploid species have been limited to a rough approximation of studies in diploid species. The major challenge is to distinguish the different types of heterozygotes present in polyploid populations. In this work, we evaluated different genomic prediction models applied to a recurrent selection population of 530 genotypes of Panicum maximum, an autotetraploid forage grass. We also investigated the effect of the allele dosage in the prediction, i.e., considering tetraploid (GS-TD) or diploid (GS-DD) allele dosage. A longitudinal linear mixed model was fitted for each one of the six phenotypic traits, considering different covariance matrices for genetic and residual effects. A total of 41,424 genotyping-by-sequencing markers were obtained using 96-plex and Pst1 restriction enzyme, and quantitative genotype calling was performed. Six predictive models were generalized to tetraploid species and predictive ability was estimated by a replicated fivefold cross-validation process. GS-TD and GS-DD models were performed considering 1,223 informative markers. Overall, GS-TD data yielded higher predictive abilities than with GS-DD data. However, different predictive models had similar predictive ability performance. In this work, we provide bioinformatic and modeling guidelines to consider tetraploid dosage and observed that genomic selection may lead to additional gains in recurrent selection program of P. maximum.
Assuntos
Alelos , Dosagem de Genes , Genoma de Planta , Genômica , Panicum/genética , Algoritmos , Genômica/métodos , Fenótipo , Melhoramento Vegetal , Poliploidia , Seleção GenéticaRESUMO
BACKGROUND: Microarray techniques provide promising tools for cancer diagnosis using gene expression profiles. However, molecular diagnosis based on high-throughput platforms presents great challenges due to the overwhelming number of variables versus the small sample size and the complex nature of multi-type tumors. Support vector machines (SVMs) have shown superior performance in cancer classification due to their ability to handle high dimensional low sample size data. The multi-class SVM algorithm of Crammer and Singer provides a natural framework for multi-class learning. Despite its effective performance, the procedure utilizes all variables without selection. In this paper, we propose to improve the procedure by imposing shrinkage penalties in learning to enforce solution sparsity. RESULTS: The original multi-class SVM of Crammer and Singer is effective for multi-class classification but does not conduct variable selection. We improved the method by introducing soft-thresholding type penalties to incorporate variable selection into multi-class classification for high dimensional data. The new methods were applied to simulated data and two cancer gene expression data sets. The results demonstrate that the new methods can select a small number of genes for building accurate multi-class classification rules. Furthermore, the important genes selected by the methods overlap significantly, suggesting general agreement among different variable selection schemes. CONCLUSIONS: High accuracy and sparsity make the new methods attractive for cancer diagnostics with gene expression data and defining targets of therapeutic intervention. AVAILABILITY: The source MATLAB code are available from http://math.arizona.edu/~hzhang/software.html.
RESUMO
UNLABELLED: Statins are widely prescribed for reducing LDL-cholesterol (C) and risk for cardiovascular disease (CVD), but there is considerable variation in therapeutic response. We used a gas chromatography-time-of-flight mass-spectrometry-based metabolomics platform to evaluate global effects of simvastatin on intermediary metabolism. Analyses were conducted in 148 participants in the Cholesterol and Pharmacogenetics study who were profiled pre and six weeks post treatment with 40 mg/day simvastatin: 100 randomly selected from the full range of the LDL-C response distribution and 24 each from the top and bottom 10% of this distribution ("good" and "poor" responders, respectively). The metabolic signature of drug exposure in the full range of responders included essential amino acids, lauric acid (p<0.0055, q<0.055), and alpha-tocopherol (p<0.0003, q<0.017). Using the HumanCyc database and pathway enrichment analysis, we observed that the metabolites of drug exposure were enriched for the pathway class amino acid degradation (p<0.0032). Metabolites whose change correlated with LDL-C lowering response to simvastatin in the full range responders included cystine, urea cycle intermediates, and the dibasic amino acids ornithine, citrulline and lysine. These dibasic amino acids share plasma membrane transporters with arginine, the rate-limiting substrate for nitric oxide synthase (NOS), a critical mediator of cardiovascular health. Baseline metabolic profiles of the good and poor responders were analyzed by orthogonal partial least square discriminant analysis so as to determine the metabolites that best separated the two response groups and could be predictive of LDL-C response. Among these were xanthine, 2-hydroxyvaleric acid, succinic acid, stearic acid, and fructose. Together, the findings from this study indicate that clusters of metabolites involved in multiple pathways not directly connected with cholesterol metabolism may play a role in modulating the response to simvastatin treatment. TRIAL REGISTRATION: ClinicalTrials.gov NCT00451828.
Assuntos
Aminoácidos Essenciais/sangue , Biomarcadores Farmacológicos/sangue , LDL-Colesterol/sangue , Inibidores de Hidroximetilglutaril-CoA Redutases/farmacologia , Metabolômica , Sinvastatina/farmacologia , Adulto , Citrulina/sangue , Cistina/sangue , Feminino , Frutose/sangue , Cromatografia Gasosa-Espectrometria de Massas , Humanos , Inibidores de Hidroximetilglutaril-CoA Redutases/uso terapêutico , Ácidos Láuricos/sangue , Masculino , Óxido Nítrico Sintase/sangue , Ornitina/sangue , Sinvastatina/uso terapêutico , Ácidos Esteáricos/sangue , Ácido Succínico/sangue , Xantina/sangue , alfa-Tocoferol/sangueRESUMO
Although statins are widely prescribed medications, there remains considerable variability in therapeutic response. Genetics can explain only part of this variability. Metabolomics is a global biochemical approach that provides powerful tools for mapping pathways implicated in disease and in response to treatment. Metabolomics captures net interactions between genome, microbiome and the environment. In this study, we used a targeted GC-MS metabolomics platform to measure a panel of metabolites within cholesterol synthesis, dietary sterol absorption, and bile acid formation to determine metabolite signatures that may predict variation in statin LDL-C lowering efficacy. Measurements were performed in two subsets of the total study population in the Cholesterol and Pharmacogenetics (CAP) study: Full Range of Response (FR), and Good and Poor Responders (GPR) were 100 individuals randomly selected from across the entire range of LDL-C responses in CAP. GPR were 48 individuals, 24 each from the top and bottom 10% of the LDL-C response distribution matched for body mass index, race, and gender. We identified three secondary, bacterial-derived bile acids that contribute to predicting the magnitude of statin-induced LDL-C lowering in good responders. Bile acids and statins share transporters in the liver and intestine; we observed that increased plasma concentration of simvastatin positively correlates with higher levels of several secondary bile acids. Genetic analysis of these subjects identified associations between levels of seven bile acids and a single nucleotide polymorphism (SNP), rs4149056, in the gene encoding the organic anion transporter SLCO1B1. These findings, along with recently published results that the gut microbiome plays an important role in cardiovascular disease, indicate that interactions between genome, gut microbiome and environmental influences should be considered in the study and management of cardiovascular disease. Metabolic profiles could provide valuable information about treatment outcomes and could contribute to a more personalized approach to therapy.
Assuntos
Trato Gastrointestinal/efeitos dos fármacos , Trato Gastrointestinal/microbiologia , Metabolômica , Metagenoma/efeitos dos fármacos , Ácidos e Sais Biliares/metabolismo , LDL-Colesterol/sangue , Demografia , Feminino , Trato Gastrointestinal/metabolismo , Humanos , Transportador 1 de Ânion Orgânico Específico do Fígado , Masculino , Pessoa de Meia-Idade , Modelos Biológicos , Transportadores de Ânions Orgânicos/genética , Farmacogenética , Polimorfismo de Nucleotídeo Único/genética , Sinvastatina/farmacologia , Resultado do TratamentoRESUMO
Gene expression data has been used in lieu of phenotype in both classical and quantitative genetic settings. These two disciplines have separate approaches to measuring and interpreting epistasis, which is the interaction between alleles at different loci. We propose a framework for estimating and interpreting epistasis from a classical experiment that combines the strengths of each approach. A regression analysis step accommodates the quantitative nature of expression measurements by estimating the effect of gene deletions plus any interaction. Effects are selected by significance such that a reduced model describes each expression trait. We show how the resulting models correspond to specific hierarchical relationships between two regulator genes and a target gene. These relationships are the basic units of genetic pathways and genomic system diagrams. Our approach can be extended to analyze data from a variety of experiments, multiple loci, and multiple environments.