Results 1 - 13 of 13
1.
BMC Genomics ; 25(1): 274, 2024 Mar 12.
Article in English | MEDLINE | ID: mdl-38475714

ABSTRACT

BACKGROUND: Tuber starch and steroidal glycoalkaloid (SGA)-related traits have been consistently prioritized in potato breeding, while the allelic variation patterns of the genes that underlie these traits are less explored. RESULTS: Here, we focused on the genes involved in two important metabolic pathways in the potato: starch metabolism and SGA biosynthesis. We identified 119 genes consisting of 81 involved in starch metabolism and 38 in the biosynthesis of steroidal glycoalkaloids, and discovered 96,166 allelic variants among 2,169 gene haplotypes in six autotetraploid potato genomes. Comparative analyses revealed an uneven distribution of allelic variants among gene haplotypes and that the vast majority of deleterious mutations in these genes are retained in a heterozygous state in the autotetraploid potato genomes. Leveraging full-length cDNA sequencing data, we find that approximately 70% of haplotypes of the 119 genes are transcribable. Population genetic analyses identify starch and SGA biosynthetic genes that are potentially conserved or diverged between potato varieties with varying starch or SGA content. CONCLUSIONS: These results deepen the understanding of haplotypic diversity within functionally important genes in autotetraploid genomes and may facilitate functional characterization of genes or haplotypes contributing to traits related to starch and SGA in potato.


Subject(s)
Solanum tuberosum, Solanum tuberosum/genetics, Starch/metabolism, Plant Breeding, Alleles, Phenotype, Steroids
2.
Genome Biol ; 25(1): 26, 2024 Jan 19.
Article in English | MEDLINE | ID: mdl-38243222

ABSTRACT

Potato is one of the world's major staple crops, and like many important crop plants, it has a polyploid genome. Polyploid haplotype assembly poses a major computational challenge. We introduce a novel strategy for the assembly of polyploid genomes and present an assembly of the autotetraploid potato cultivar Altus. Our method uses low-depth sequencing data from an offspring population to achieve chromosomal clustering and haplotype phasing on the assembly graph. Our approach generates high-quality assemblies of individual chromosomes with haplotype-specific sequence resolution of whole chromosome arms and can be applied in common breeding scenarios where collections of offspring are available.


Subject(s)
Solanum tuberosum, Tetraploidy, Humans, Haplotypes, DNA Sequence Analysis, Solanum tuberosum/genetics, Plant Breeding, Polyploidy
3.
Mol Plant ; 15(3): 520-536, 2022 03 07.
Article in English | MEDLINE | ID: mdl-35026436

ABSTRACT

Cultivated potato is a clonally propagated autotetraploid species with a highly heterogeneous genome. Phased assemblies of six cultivars including two chromosome-scale phased genome assemblies revealed extensive allelic diversity, including altered coding and transcript sequences, preferential allele expression, and structural variation that collectively result in a highly complex transcriptome and predicted proteome, which are distributed across the homologous chromosomes. Wild species contribute to the extensive allelic diversity in tetraploid cultivars, demonstrating ancestral introgressions predating modern breeding efforts. Because tetraploid potato is a clonally propagated autotetraploid that undergoes limited meiosis, dysfunctional and deleterious alleles are not purged. Nearly a quarter of the loci bore mutations predicted to have a high negative impact on protein function, complicating breeders' efforts to reduce genetic load. The StCDF1 locus controls maturity, and analysis of six tetraploid genomes revealed that 12 allelic variants of StCDF1 are correlated with maturity in a dosage-dependent manner. Knowledge of the complexity of the tetraploid potato genome with its rampant structural variation and embedded deleterious and dysfunctional alleles will be key not only to implementing precision breeding of tetraploid cultivars but also to the construction of homozygous, diploid potato germplasm containing favorable alleles to capitalize on heterosis in F1 hybrids.


Subject(s)
Solanum tuberosum, Tetraploidy, Alleles, Chromosomes, Plant Breeding, Proteome/genetics, Solanum tuberosum/genetics, Transcriptome/genetics
4.
G3 (Bethesda) ; 11(9)2021 09 06.
Article in English | MEDLINE | ID: mdl-34544132

ABSTRACT

Onion is an important vegetable crop with an estimated genome size of 16 Gb. We describe the de novo assembly and ab initio annotation of the genome of a doubled haploid onion line DHCU066619, which resulted in a final assembly of 14.9 Gb with an N50 of 464 Kb. Of this, 2.4 Gb was ordered into eight pseudomolecules using four genetic linkage maps. The remainder of the genome is available in 89.6 K scaffolds. Only 72.4% of the genome could be identified as repetitive sequences, which consist, to a large extent, of (retro) transposons. In addition, an estimated 20% of the putative (retro) transposons had accumulated a large number of mutations, hampering their identification, but facilitating their assembly. These elements are probably already quite old. The ab initio gene prediction indicated 540,925 putative gene models, which is far more than expected, possibly due to the presence of pseudogenes. Of these models, 47,066 showed RNASeq support. No gene-rich regions were found; genes are uniformly distributed over the genome. Analysis of synteny with Allium sativum (garlic) showed collinearity but also major rearrangements between both species. This assembly is the first high-quality genome sequence available for the study of onion and will be a valuable resource for further research.


Subject(s)
Onions, Nucleic Acid Repetitive Sequences, Genome Size, Onions/genetics
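
The N50 quoted in the abstract above is a standard contiguity statistic: the length L such that sequences of length L or longer together cover at least half of the total assembly. A minimal sketch of that calculation follows. The function name `n50` and the toy lengths are illustrative assumptions, not data from the onion assembly.

```python
def n50(lengths):
    """Return the N50 of a collection of contig or scaffold lengths."""
    if not lengths:
        raise ValueError("need at least one sequence length")
    half_total = sum(lengths) / 2
    running = 0
    # Walk from the longest sequence down until half the assembly is covered.
    for length in sorted(lengths, reverse=True):
        running += length
        if running >= half_total:
            return length


# Hypothetical toy lengths (in kb); the total is 20, so half is 10.
toy_lengths = [2, 3, 4, 5, 6]
print(n50(toy_lengths))
```

Sorting in descending order keeps the sketch short; for very large assemblies a streaming or histogram-based approach may be preferable.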
5.
BMC Plant Biol ; 21(1): 198, 2021 Apr 24.
Article in English | MEDLINE | ID: mdl-33894758

ABSTRACT

BACKGROUND: Scientific literature carries a wealth of information crucial for research, but only a fraction of it is present as structured information in databases and therefore can be analyzed using traditional data analysis tools. Natural language processing (NLP) is often and successfully employed to support humans by distilling relevant information from large corpora of free text and structuring it in a way that lends itself to further computational analyses. For this pilot, we developed a pipeline that uses NLP on biological literature to produce knowledge networks. We focused on the flesh color of potato, a well-studied trait with known associations, and we investigated whether these knowledge networks can assist us in formulating new hypotheses on the underlying biological processes. RESULTS: We trained an NLP model based on a manually annotated corpus of 34 full-text potato articles, to recognize relevant biological entities and relationships between them in text (genes, proteins, metabolites and traits). This model detected the number of biological entities with a precision of 97.65% and a recall of 88.91% on the training set. We conducted a time series analysis on 4023 PubMed abstracts of plant genetics-based articles which focus on 4 major Solanaceous crops (tomato, potato, eggplant and capsicum), to determine that the networks contained both previously known and contemporaneously unknown leads to subsequently discovered biological phenomena relating to flesh color. A novel time-based analysis of these networks indicates a connection between our trait and a candidate gene (zeaxanthin epoxidase) already two years prior to explicit statements of that connection in the literature. CONCLUSIONS: Our time-based analysis indicates that network-assisted hypothesis generation shows promise for knowledge discovery, data integration and hypothesis generation in scientific research.


Subject(s)
Data Mining, Natural Language Processing, Plant Tubers/physiology, Solanum tuberosum/physiology, Color, Biological Pigments
6.
G3 (Bethesda) ; 10(10): 3489-3495, 2020 10 05.
Article in English | MEDLINE | ID: mdl-32759330

ABSTRACT

With the rapid expansion of the application of genomics and sequencing in plant breeding, there is a constant drive for better reference genomes. In potato (Solanum tuberosum), the third largest food crop in the world, the related species S. phureja, designated "DM", has been used as the most popular reference genome for the last 10 years. Here, we introduce the de novo sequenced genome of Solyntus as the next standard reference in potato genome studies. A true Solanum tuberosum made up of 116 contigs that is also highly homozygous, diploid, vigorous and self-compatible, Solyntus provides a more direct and contiguous reference than ever before available. It was constructed by sequencing with state-of-the-art long and short read technology and assembled with Canu. The 116 contigs were assembled into scaffolds to form each pseudochromosome, with three contigs to 17 contigs per chromosome. This assembly contains 93.7% of the single-copy gene orthologs from the Solanaceae set and has an N50 of 63.7 Mbp. The genome and related files can be found at https://www.plantbreeding.wur.nl/Solyntus/. With the release of this research line and its draft genome we anticipate many exciting developments in (diploid) potato research.


Subject(s)
Solanum tuberosum, Solanum, Base Sequence, Plant Genome, Plant Breeding, Solanum/genetics, Solanum tuberosum/genetics
7.
Bioinformatics ; 35(18): 3279-3286, 2019 09 15.
Article in English | MEDLINE | ID: mdl-30689725

ABSTRACT

SUMMARY: Haplotype assembly of polyploids is an open issue in plant genomics. Recent experimental studies on highly heterozygous autotetraploid potato have shown that available methods do not deliver satisfying results in practice. We propose an optimal method to assemble haplotypes of highly heterozygous polyploids from Illumina short-sequencing reads. Our method is based on a generalization of the existing minimum fragment removal model to the polyploid case and on new integer linear programs to reconstruct optimal haplotypes. We validate our methods experimentally by means of a combined evaluation on simulated and experimental data based on 83 previously sequenced autotetraploid potato cultivars. Results on simulated data show that our methods produce highly accurate haplotype assemblies, while results on experimental data confirm a sensible improvement over the state of the art. AVAILABILITY AND IMPLEMENTATION: Executables for Linux at http://github.com/Computational Genomics/HaplotypeAssembler. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Solanum tuberosum, Algorithms, Haplotypes, Linear Programming, DNA Sequence Analysis, Software
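
To make the minimum fragment removal idea in the abstract above concrete, the sketch below solves a toy instance by brute force: it looks for the smallest number of read fragments to discard so that the remaining ones can be split into k mutually consistent groups, one per haplotype. This is not the authors' integer linear program. The fragment encoding (a dict from SNP position to allele) and all function names are assumptions made for illustration, and the exhaustive search is only feasible for a handful of fragments.

```python
from itertools import combinations, product


def conflict(a, b):
    """Two fragments conflict if they disagree at any shared SNP position."""
    return any(a[p] != b[p] for p in a.keys() & b.keys())


def k_partitionable(frags, k):
    """True if the fragments can be split into k groups with no internal conflicts."""
    pairs = list(combinations(range(len(frags)), 2))
    for assign in product(range(k), repeat=len(frags)):
        if all(assign[i] != assign[j] or not conflict(frags[i], frags[j])
               for i, j in pairs):
            return True
    return False


def min_fragment_removal(frags, k):
    """Smallest number of fragments to drop so the rest fit k haplotypes."""
    for r in range(len(frags) + 1):
        for drop in combinations(range(len(frags)), r):
            kept = [f for i, f in enumerate(frags) if i not in drop]
            if k_partitionable(kept, k):
                return r


# Hypothetical fragments over two SNP positions (alleles coded 0/1).
toy = [{1: 0, 2: 0}, {1: 0, 2: 1}, {1: 1, 2: 0}, {1: 1, 2: 1}, {1: 0, 2: 0}]
for ploidy in (2, 3, 4):
    print(ploidy, min_fragment_removal(toy, ploidy))
```

The toy set contains four distinct allele patterns, so it needs no removals at ploidy 4 and progressively more as the ploidy drops.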
8.
BMC Bioinformatics ; 19(1): 183, 2018 05 25.
Article in English | MEDLINE | ID: mdl-29801439

ABSTRACT

BACKGROUND: A quantitative trait locus (QTL) is a genomic region that correlates with a phenotype. Most of the experimental information about QTL mapping studies is described in tables of scientific publications. Traditional text mining techniques aim to extract information from unstructured text rather than from tables. We present QTLTableMiner++ (QTM), a table mining tool that extracts and semantically annotates QTL information buried in (heterogeneous) tables of plant science literature. QTM is a command line tool written in the Java programming language. This tool takes scientific articles from the Europe PMC repository as input, extracts QTL tables using keyword matching and ontology-based concept identification. The tables are further normalized using rules derived from table properties such as captions, column headers and table footers. Furthermore, table columns are classified into three categories namely column descriptors, properties and values based on column headers and data types of cell entries. Abbreviations found in the tables are expanded using the Schwartz and Hearst algorithm. Finally, the content of QTL tables is semantically enriched with domain-specific ontologies (e.g. Crop Ontology, Plant Ontology and Trait Ontology) using the Apache Solr search platform and the results are stored in a relational database and a text file. RESULTS: The performance of the QTM tool was assessed by precision and recall based on the information retrieved from two manually annotated corpora of open access articles, i.e. QTL mapping studies in tomato (Solanum lycopersicum) and in potato (S. tuberosum). In summary, QTM detected QTL statements in tomato with 74.53% precision and 92.56% recall and in potato with 82.82% precision and 98.94% recall. CONCLUSION: QTM is a unique tool that aids in providing QTL information in machine-readable and semantically interoperable formats.


Subject(s)
Data Mining/methods, Quantitative Trait Loci, Software, Algorithms, Computer Graphics, Factual Databases, Solanum lycopersicum/genetics, Publications, Semantics, Solanum tuberosum/genetics
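
The precision and recall figures in the abstract above follow the usual definitions over true positives, false positives and false negatives. The sketch below shows those definitions and, as a derived illustration only, the harmonic mean (F1) of the reported tomato and potato values. F1 is not reported in the abstract, and the function names and toy counts are assumptions.

```python
def precision_recall(tp, fp, fn):
    """Precision = TP / (TP + FP); recall = TP / (TP + FN)."""
    return tp / (tp + fp), tp / (tp + fn)


def f1_score(precision, recall):
    """Harmonic mean of precision and recall."""
    return 2 * precision * recall / (precision + recall)


# Hypothetical counts, only to show the definitions at work.
print(precision_recall(tp=8, fp=2, fn=2))

# Precision and recall as reported in the abstract; F1 is derived here.
reported = {"tomato": (0.7453, 0.9256), "potato": (0.8282, 0.9894)}
for crop, (p, r) in reported.items():
    print(crop, round(f1_score(p, r), 4))
```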
9.
Brief Bioinform ; 19(3): 387-403, 2018 05 01.
Article in English | MEDLINE | ID: mdl-28065918

ABSTRACT

Haplotypes are the units of inheritance in an organism, and many genetic analyses depend on their precise determination. Methods for haplotyping single individuals use the phasing information available in next-generation sequencing reads, by matching overlapping single-nucleotide polymorphisms while penalizing post hoc nucleotide corrections made. Haplotyping diploids is relatively easy, but the complexity of the problem increases drastically for polyploid genomes, which are found in both model organisms and in economically relevant plant and animal species. Although a number of tools are available for haplotyping polyploids, the effects of the genomic makeup and the sequencing strategy followed on the accuracy of these methods have hitherto not been thoroughly evaluated. We developed the simulation pipeline haplosim to evaluate the performance of three haplotype estimation algorithms for polyploids: HapCompass, HapTree and SDhaP, in settings varying in sequencing approach, ploidy levels and genomic diversity, using tetraploid potato as the model. Our results show that sequencing depth is the major determinant of haplotype estimation quality, that 1 kb PacBio circular consensus sequencing reads and Illumina reads with large insert-sizes are competitive and that all methods fail to produce good haplotypes when ploidy levels increase. Comparing the three methods, HapTree produces the most accurate estimates, but also consumes the most resources. There is clearly room for improvement in polyploid haplotyping algorithms.


Subject(s)
Computer Simulation, Haplotypes, High-Throughput Nucleotide Sequencing/methods, Polyploidy, DNA Sequence Analysis/methods, Solanum tuberosum/genetics, Algorithms, Plant Genome, Genomics
10.
Theor Appl Genet ; 128(10): 1987-97, 2015 Oct.
Article in English | MEDLINE | ID: mdl-26152571

ABSTRACT

KEY MESSAGE: A chromosomal inversion associated with the tomato Ty-2 gene for TYLCV resistance is the cause of severe suppression of recombination in a tomato Ty-2 introgression line. Among tomato and its wild relatives inversions are often observed, which result in suppression of recombination. Such inversions hamper the transfer of important traits from a related species to the crop by introgression breeding. Suppression of recombination was reported for the TYLCV resistance gene, Ty-2, which has been introgressed in cultivated tomato (Solanum lycopersicum) from the wild relative S. habrochaites accession B6013. Ty-2 was mapped to a 300-kb region on the long arm of chromosome 11. The suppression of recombination in the Ty-2 region could be caused by chromosomal rearrangements in S. habrochaites compared with S. lycopersicum. With the aim of visualizing the genome structure of the Ty-2 region, we compared the draft de novo assembly of S. habrochaites accession LYC4 with the sequence of cultivated tomato ('Heinz'). Furthermore, using populations derived from intraspecific crosses of S. habrochaites accessions, the order of markers in the Ty-2 region was studied. Results showed the presence of an inversion of approximately 200 kb in the Ty-2 region when comparing S. lycopersicum and S. habrochaites. By sequencing a BAC clone from the Ty-2 introgression line, one inversion breakpoint was identified. Finally, the obtained results are discussed with respect to introgression breeding and the importance of a priori de novo sequencing of the species involved.


Subject(s)
Chromosome Inversion, Disease Resistance/genetics, Solanum lycopersicum/genetics, Solanum/genetics, Chromosome Mapping, Bacterial Artificial Chromosomes, Plant Chromosomes, Molecular Cloning, Plant DNA/genetics, Genetic Markers, Solanum lycopersicum/virology, Mosaic Viruses, Plant Breeding, Plant Diseases/genetics, Plant Diseases/virology, Genetic Recombination, Sequence Alignment, DNA Sequence Analysis, Solanum/virology
11.
BMC Genomics ; 15: 1152, 2014 Dec 20.
Article in English | MEDLINE | ID: mdl-25526885

ABSTRACT

BACKGROUND: A RIL population between Solanum lycopersicum cv. Moneymaker and S. pimpinellifolium G1.1554 was genotyped with a custom made SNP array. Additionally, a subset of the lines was genotyped by sequencing (GBS). RESULTS: A total of 1974 polymorphic SNPs were selected to develop a linkage map of 715 unique genetic loci. We generated plots for visualizing the recombination patterns of the population relating physical and genetic positions along the genome. This linkage map was used to identify two QTLs for TYLCV resistance which contained favourable alleles derived from S. pimpinellifolium. Further GBS was used to saturate regions of interest, and the mapping resolution of the two QTLs was improved. The analysis showed highest significance on Chromosome 11 close to the region of 51.3 Mb (qTy-p11) and another on Chromosome 3 near 46.5 Mb (qTy-p3). Furthermore, we explored the population using untargeted metabolic profiling, and the most significant differences between susceptible and resistant plants were mainly associated with sucrose and flavonoid glycosides. CONCLUSIONS: The SNP information obtained from an array allowed a first QTL screening of our RIL population. With additional SNP data of a subset of RILs, obtained through GBS, we were able to perform an in silico mapping improvement to further confirm regions associated with our trait of interest. With the combination of different -omics platforms we provide valuable insight into the genetics of S. pimpinellifolium-derived TYLCV resistance.


Subject(s)
Chromosome Mapping, Disease Resistance/genetics, Genotyping Techniques, Plant Diseases/virology, Plant Viruses/physiology, Solanum/genetics, Solanum/virology, Alleles, Computer Simulation, Plant Genome/genetics, Inbreeding, Metabolome, Plant Diseases/immunology, Single Nucleotide Polymorphism, Quantitative Trait Loci/genetics, Sequence Analysis, Solanum/immunology, Solanum/metabolism
12.
BMC Plant Biol ; 11: 116, 2011 Aug 18.
Article in English | MEDLINE | ID: mdl-21851635

ABSTRACT

BACKGROUND: The cultivated potato (Solanum tuberosum L.) is an important food crop, but highly susceptible to many pathogens. The major threat to potato production is the Irish famine pathogen Phytophthora infestans, which causes the devastating late blight disease. Potato breeding makes use of germplasm from wild relatives (wild germplasm) to introduce resistances into cultivated potato. The Solanum section Petota comprises tuber-bearing species that are potential donors of new disease resistance genes. The aim of this study was to explore Solanum section Petota for resistance genes and generate a widely accessible resource that is useful for studying and implementing disease resistance in potato. DESCRIPTION: The SolRgene database contains data on resistance to P. infestans and presence of R genes and R gene homologs in Solanum section Petota. We have explored Solanum section Petota for resistance to late blight in high throughput disease tests under various laboratory conditions and in field trials. From resistant wild germplasm, segregating populations were generated and assessed for the presence of resistance genes. All these data have been entered into the SolRgene database. To facilitate genetic and resistance gene evolution studies, phylogenetic data of the entire SolRgene collection are included, as well as a tool for generating phylogenetic trees of selected groups of germplasm. Data from resistance gene allele-mining studies are incorporated, which enables detection of R gene homologs in related germplasm. Using these resources, various resistance genes have been detected and some of these have been cloned, whereas others are in the cloning pipeline. All this information is stored in the online SolRgene database, which allows users to query resistance data, sequences, passport data of the accessions, and phylogenetic classifications. CONCLUSION: Solanum section Petota forms the basis of the SolRgene database, which contains a collection of resistance data of an unprecedented size and precision. Complemented with R gene sequence data and phylogenetic tools, SolRgene can be considered the primary resource for information on R genes from potato and wild tuber-bearing relatives.


Subject(s)
Genetic Databases, Disease Resistance/genetics, Plant Genes, Solanum/genetics, Base Sequence, Biological Evolution, Agricultural Crops/genetics, Agricultural Crops/immunology, Disease Resistance/immunology, Molecular Sequence Data, Phylogeny, Phytophthora infestans/immunology, Plant Diseases/genetics, Plant Diseases/immunology, Solanum/immunology, Solanum tuberosum/genetics, Solanum tuberosum/immunology
13.
Theor Appl Genet ; 114(6): 1071-80, 2007 Apr.
Article in English | MEDLINE | ID: mdl-17273845

ABSTRACT

Tomato (Solanum lycopersicum) is susceptible to grey mold (Botrytis cinerea). Partial resistance to this fungus has been identified in accessions of wild relatives of tomato such as Solanum habrochaites LYC4. In a previous F2 mapping study, three QTLs conferring resistance to B. cinerea (Rbcq1, Rbcq2 and Rbcq4a) were identified. As it was probable that this study had not identified all QTLs involved in resistance, we developed an introgression line (IL) population (n = 30), each line containing a S. habrochaites introgression in the S. lycopersicum cv. Moneymaker genetic background. On average each IL contained 5.2% of the S. habrochaites genome and together the lines provide an estimated coverage of 95%. The level of susceptibility to B. cinerea for each of the ILs was assessed in a greenhouse trial and compared to the susceptible parent S. lycopersicum cv. Moneymaker. The effect of the three previously identified loci could be confirmed and seven additional loci were detected. Some ILs contain multiple QTLs and the increased resistance to B. cinerea in these ILs is in line with a completely additive model. We conclude that this set of QTLs offers good perspectives for breeding of B. cinerea resistant cultivars and that screening an IL population is more sensitive for detection of QTLs conferring resistance to B. cinerea than the analysis in an F2 population.


Subject(s)
Botrytis/pathogenicity, Population Genetics, Innate Immunity/genetics, Quantitative Trait Loci, Solanum/genetics, Solanum/immunology, Botrytis/classification, Chromosome Mapping, Plant Chromosomes, Genetic Crosses, Plant DNA/genetics, Plant DNA/isolation & purification, Genetic Markers, Plant Genome, Heterozygote, Homozygote, Genetic Models, Nucleic Acid Amplification Techniques, Genetic Polymorphism, Genetic Recombination, Seeds/genetics, Software, Solanum/classification