Results 1 - 20 of 46
1.
Plant J ; 113(6): 1192-1210, 2023 03.
Article in English | MEDLINE | ID: mdl-36626115

ABSTRACT

Meiotic recombination is crucial for assuring proper segregation of parental chromosomes and generation of novel allelic combinations. As this process is tightly regulated, identifying factors influencing rate, and distribution of meiotic crossovers (COs) is of major importance, notably for plant breeding programs. However, high-resolution recombination maps are sparse in most crops including the Brassica genus and knowledge about intraspecific variation and sex differences is lacking. Here, we report fine-scale resolution recombination landscapes for 10 female and 10 male crosses in Brassica oleracea, by analyzing progenies of five large four-way-cross populations from two reciprocally crossed F1s per population. Parents are highly diverse inbred lines representing major crops, including broccoli, cauliflower, cabbage, kohlrabi, and kale. We produced approximately 4.56T Illumina data from 1248 progenies and identified 15 353 CO across the 10 reciprocal crosses, 51.13% of which being mapped to <10 kb. We revealed fairly similar Mb-scale recombination landscapes among all cross combinations and between the sexes, and provided evidence that these landscapes are largely independent of sequence divergence. We evidenced strong influence of gene density and large structural variations on CO formation in B. oleracea. Moreover, we found extensive variations in CO number depending on the direction and combination of the initial parents crossed with, for the first time, a striking interdependency between these factors. These data improve our current knowledge on meiotic recombination and are important for Brassica breeders.


Subjects
Brassica , Meiosis , Brassica/classification , Brassica/cytology , Brassica/genetics , Plant Breeding , Genetic Recombination , Plant Chromosomes
2.
Plant J ; 116(6): 1667-1680, 2023 Dec.
Article in English | MEDLINE | ID: mdl-37682777

ABSTRACT

Eggplant (Solanum melongena) is an important Solanaceous crop, widely cultivated and consumed in Asia, the Mediterranean basin, and Southeast Europe. Its domestication centers and migration and diversification routes are still a matter of debate. We report the largest georeferenced and genotyped collection to this date for eggplant and its wild relatives, consisting of 3499 accessions from seven worldwide genebanks, originating from 105 countries in five continents. The combination of genotypic and passport data points to the existence of at least two main centers of domestication, in Southeast Asia and the Indian subcontinent, with limited genetic exchange between them. The wild and weedy eggplant ancestor S. insanum shows admixture with domesticated S. melongena, similar to what was described for other fruit-bearing Solanaceous crops such as tomato and pepper and their wild ancestors. After domestication, migration and admixture of eggplant populations from different regions have been less conspicuous with respect to tomato and pepper, thus better preserving 'local' phenotypic characteristics. The data allowed the identification of misclassified and putatively duplicated accessions, facilitating genebank management. All the genetic, phenotypic, and passport data have been deposited in the Open Access G2P-SOL database, and constitute an invaluable resource for understanding the domestication, migration and diversification of this cosmopolitan vegetable.


Subjects
Solanum lycopersicum , Solanum melongena , Solanum melongena/genetics , Domestication , Fruit/genetics , Asia
3.
Plant J ; 116(4): 1136-1151, 2023 Nov.
Article in English | MEDLINE | ID: mdl-37150955

ABSTRACT

Tomato (Solanum lycopersicum) is a prominent fruit with rich genetic resources for crop improvement. By using a phenotype-guided screen of over 7900 tomato accessions from around the world, we identified new associations for complex traits such as fruit weight and total soluble solids (Brix). Here, we present the phenotypic data from several years of trials. To illustrate the power of this dataset we use two case studies. First, evaluation of color revealed allelic variation in phytoene synthase 1 that resulted in differently colored or even bicolored fruit. Secondly, in view of the negative relationship between fruit weight and Brix, we pre-selected a subset of the collection that includes high and low Brix values in each category of fruit size. Genome-wide association analysis allowed us to detect novel loci associated with total soluble solid content and fruit weight. In addition, we developed eight F2 biparental intraspecific populations. Furthermore, by taking a phenotype-guided approach we were able to isolate individuals with high Brix values that were not compromised in terms of yield. In addition, the demonstration of novel results despite the high number of previous genome-wide association studies of these traits in tomato suggests that adoption of a phenotype-guided pre-selection of germplasm may represent a useful strategy for finding target genes for breeding.


Subjects
Solanum lycopersicum , Humans , Solanum lycopersicum/genetics , Quantitative Trait Loci/genetics , Genome-Wide Association Study , Plant Breeding , Phenotype , Fruit/genetics
4.
Plant J ; 116(5): 1508-1528, 2023 Dec.
Article in English | MEDLINE | ID: mdl-37602679

ABSTRACT

Investigating crop diversity through genome-wide association studies (GWAS) on core collections helps in deciphering the genetic determinants of complex quantitative traits. Using the G2P-SOL project world collection of 10 038 wild and cultivated Capsicum accessions from 10 major genebanks, we assembled a core collection of 423 accessions representing the known genetic diversity. Since complex traits are often highly dependent upon environmental variables and genotype-by-environment (G × E) interactions, multi-environment GWAS with a 10 195-marker genotypic matrix were conducted on a highly diverse subset of 350 Capsicum annuum accessions, extensively phenotyped in up to six independent trials from five climatically differing countries. Environment-specific and multi-environment quantitative trait loci (QTLs) were detected for 23 diverse agronomic traits. We identified 97 candidate genes potentially implicated in 53 of the most robust and high-confidence QTLs for fruit flavor, color, size, and shape traits, and for plant productivity, vigor, and earliness traits. Investigating the genetic architecture of agronomic traits in this way will assist the development of genetic markers and pave the way for marker-assisted selection. The G2P-SOL pepper core collection will be available upon request as a unique and universal resource for further exploitation in future gene discovery and marker-assisted breeding efforts by the pepper community.


Subjects
Capsicum , Quantitative Trait Loci , Quantitative Trait Loci/genetics , Capsicum/genetics , Genome-Wide Association Study , Plant Breeding , Phenotype , Vegetables/genetics
5.
BMC Genomics ; 25(1): 274, 2024 Mar 12.
Article in English | MEDLINE | ID: mdl-38475714

ABSTRACT

BACKGROUND: Tuber starch and steroidal glycoalkaloid (SGA)-related traits have been consistently prioritized in potato breeding, while allelic variation pattern of genes that underlie these traits is less explored. RESULTS: Here, we focused on the genes involved in two important metabolic pathways in the potato: starch metabolism and SGA biosynthesis. We identified 119 genes consisting of 81 involved in starch metabolism and 38 in the biosynthesis of steroidal glycoalkaloids, and discovered 96,166 allelic variants among 2,169 gene haplotypes in six autotetraploid potato genomes. Comparative analyses revealed an uneven distribution of allelic variants among gene haplotypes and that the vast majority of deleterious mutations in these genes are retained in heterozygous state in the autotetraploid potato genomes. Leveraging full-length cDNA sequencing data, we find that approximately 70% of haplotypes of the 119 genes are transcribable. Population genetic analyses identify starch and SGA biosynthetic genes that are potentially conserved or diverged between potato varieties with varying starch or SGA content. CONCLUSIONS: These results deepen the understanding of haplotypic diversity within functionally important genes in autotetraploid genomes and may facilitate functional characterization of genes or haplotypes contributing to traits related to starch and SGA in potato.


Subjects
Solanum tuberosum , Solanum tuberosum/genetics , Starch/metabolism , Plant Breeding , Alleles , Phenotype , Steroids
6.
Proc Natl Acad Sci U S A ; 118(34)2021 08 24.
Article in English | MEDLINE | ID: mdl-34400501

ABSTRACT

Genebanks collect and preserve vast collections of plants and detailed passport information, with the aim of preserving genetic diversity for conservation and breeding. Genetic characterization of such collections has the potential to elucidate the genetic histories of important crops, use marker-trait associations to identify loci controlling traits of interest, search for loci undergoing selection, and contribute to genebank management by identifying taxonomic misassignments and duplicates. We conducted a genomic scan with genotyping by sequencing (GBS) derived single nucleotide polymorphisms (SNPs) of 10,038 pepper (Capsicum spp.) accessions from worldwide genebanks and investigated the recent history of this iconic staple. Genomic data detected up to 1,618 duplicate accessions within and between genebanks and showed that taxonomic ambiguity and misclassification often involve interspecific hybrids that are difficult to classify morphologically. We deeply interrogated the genetic diversity of the commonly consumed Capsicum annuum to investigate its history, finding that the kinds of peppers collected in broad regions across the globe overlap considerably. The method ReMIXTURE-using genetic data to quantify the similarity between the complement of peppers from a focal region and those from other regions-was developed to supplement traditional population genetic analyses. The results reflect a vision of pepper as a highly desirable and tradable cultural commodity, spreading rapidly throughout the globe along major maritime and terrestrial trade routes. Marker associations and possible selective sweeps affecting traits such as pungency were observed, and these traits were shown to be distributed nonuniformly across the globe, suggesting that human preferences exerted a primary influence over domesticated pepper genetic structure.


Subjects
Capsicum/genetics , Plant Chromosomes/genetics , Population Genetics , Plant Genome , Plant Breeding , Single Nucleotide Polymorphism , Quantitative Trait Loci , Capsicum/growth & development , Genomics
7.
J Exp Bot ; 74(18): 5896-5916, 2023 09 29.
Article in English | MEDLINE | ID: mdl-37527560

ABSTRACT

European traditional tomato varieties have been selected by farmers given their consistent performance and adaptation to local growing conditions. Here we developed a multipurpose core collection, comprising 226 accessions representative of the genotypic, phenotypic, and geographical diversity present in European traditional tomatoes, to investigate the basis of their phenotypic variation, gene×environment interactions, and stability for 33 agro-morphological traits. Comparison of the traditional varieties with a modern reference panel revealed that some traditional varieties displayed excellent agronomic performance and high trait stability, as good as or better than that of their modern counterparts. We conducted genome-wide association and genome-wide environment interaction studies and detected 141 quantitative trait loci (QTLs). Out of those, 47 QTLs were associated with the phenotype mean (meanQTLs), 41 with stability (stbQTLs), and 53 QTL-by-environment interactions (QTIs). Most QTLs displayed additive gene actions, with the exception of stbQTLs, which were mostly recessive and overdominant QTLs. Both common and specific loci controlled the phenotype mean and stability variation in traditional tomato; however, a larger proportion of specific QTLs was observed, indicating that the stability gene regulatory model is the predominant one. Developmental genes tended to map close to meanQTLs, while genes involved in stress response, hormone metabolism, and signalling were found within regions affecting stability. A total of 137 marker-trait associations for phenotypic means and stability were novel, and therefore our study enhances the understanding of the genetic basis of valuable agronomic traits and opens up a new avenue for an exploitation of the allelic diversity available within European traditional tomato germplasm.


Subjects
Solanum lycopersicum , Chromosome Mapping , Solanum lycopersicum/genetics , Genome-Wide Association Study , Quantitative Trait Loci , Phenotype
8.
J Exp Bot ; 73(11): 3431-3445, 2022 06 02.
Article in English | MEDLINE | ID: mdl-35358313

ABSTRACT

A comprehensive collection of 1254 tomato accessions, corresponding to European traditional and modern varieties, early domesticated varieties, and wild relatives, was analyzed by genotyping by sequencing. A continuous genetic gradient between the traditional and modern varieties was observed. European traditional tomatoes displayed very low genetic diversity, with only 298 polymorphic loci (95% threshold) out of 64 943 total variants. European traditional tomatoes could be classified into several genetic groups. Two main clusters consisting of Spanish and Italian accessions showed higher genetic diversity than the remaining varieties, suggesting that these regions might be independent secondary centers of diversity with a different history. Other varieties seem to be the result of a more recent complex pattern of migrations and hybridizations among the European regions. Several polymorphic loci were associated in a genome-wide association study with fruit morphological traits in the European traditional collection. The corresponding alleles were found to contribute to the distinctive phenotypic characteristic of the genetic varietal groups. The few highly polymorphic loci associated with morphological traits in an otherwise a low-diversity population suggests a history of balancing selection, in which tomato farmers likely maintained the morphological variation by inadvertently applying a high selective pressure within different varietal types.


Subjects
Solanum lycopersicum , Alleles , Farmers , Genetic Variation , Genome-Wide Association Study , Humans , Solanum lycopersicum/genetics , Phenotype , Single Nucleotide Polymorphism
9.
Plant J ; 103(3): 1189-1204, 2020 08.
Article in English | MEDLINE | ID: mdl-32369642

ABSTRACT

Tomato (Solanum lycopersicum L.) has become a popular model for genetic studies of fruit flavor in the last two decades. In this article we present a study of tomato fruit flavor, including an analysis of the genetic, metabolic and sensorial variation of a collection of contemporary commercial glasshouse tomato cultivars, followed by a validation of the associations found by quantitative trait locus (QTL) analysis of representative biparental segregating populations. This led to the identification of the major sensorial and chemical components determining fruit flavor variation and detection of the underlying QTLs. The high representation of QTL haplotypes in the breeders' germplasm suggests that there is great potential for applying these QTLs in current breeding programs aimed at improving tomato flavor. A QTL on chromosome 4 was found to affect the levels of the phenylalanine-derived volatiles (PHEVs) 2-phenylethanol, phenylacetaldehyde and 1-nitro-2-phenylethane. Fruits of near-isogenic lines contrasting for this locus and in the composition of PHEVs significantly differed in the perception of fruity and rose-hip-like aroma. The PHEV locus was fine mapped, which allowed for the identification of FLORAL4 as a candidate gene for PHEV regulation. Using a gene-editing-based (CRISPR-CAS9) reverse-genetics approach, FLORAL4 was demonstrated to be the key factor in this QTL affecting PHEV accumulation in tomato fruit.


Subjects
Borates/metabolism , Fructose/analogs & derivatives , Plant Genes/genetics , Quantitative Trait Loci/genetics , Solanum lycopersicum/genetics , Borates/standards , CRISPR-Associated Protein 9 , CRISPR-Cas Systems , Chromosome Mapping , Plant Chromosomes/genetics , Food Quality , Fructose/metabolism , Fructose/standards , Gene Editing , Plant Genes/physiology , Solanum lycopersicum/metabolism , Solanum lycopersicum/standards , Phenylalanine/metabolism , Heritable Quantitative Trait , Volatile Organic Compounds/metabolism
10.
BMC Plant Biol ; 21(1): 198, 2021 Apr 24.
Article in English | MEDLINE | ID: mdl-33894758

ABSTRACT

BACKGROUND: Scientific literature carries a wealth of information crucial for research, but only a fraction of it is present as structured information in databases and therefore can be analyzed using traditional data analysis tools. Natural language processing (NLP) is often and successfully employed to support humans by distilling relevant information from large corpora of free text and structuring it in a way that lends itself to further computational analyses. For this pilot, we developed a pipeline that uses NLP on biological literature to produce knowledge networks. We focused on the flesh color of potato, a well-studied trait with known associations, and we investigated whether these knowledge networks can assist us in formulating new hypotheses on the underlying biological processes. RESULTS: We trained an NLP model based on a manually annotated corpus of 34 full-text potato articles, to recognize relevant biological entities and relationships between them in text (genes, proteins, metabolites and traits). This model detected the number of biological entities with a precision of 97.65% and a recall of 88.91% on the training set. We conducted a time series analysis on 4023 PubMed abstract of plant genetics-based articles which focus on 4 major Solanaceous crops (tomato, potato, eggplant and capsicum), to determine that the networks contained both previously known and contemporaneously unknown leads to subsequently discovered biological phenomena relating to flesh color. A novel time-based analysis of these networks indicates a connection between our trait and a candidate gene (zeaxanthin epoxidase) already two years prior to explicit statements of that connection in the literature. CONCLUSIONS: Our time-based analysis indicates that network-assisted hypothesis generation shows promise for knowledge discovery, data integration and hypothesis generation in scientific research.


Subjects
Data Mining , Natural Language Processing , Plant Tubers/physiology , Solanum tuberosum/physiology , Color , Biological Pigments
11.
Brief Bioinform ; 19(3): 387-403, 2018 05 01.
Article in English | MEDLINE | ID: mdl-28065918

ABSTRACT

Haplotypes are the units of inheritance in an organism, and many genetic analyses depend on their precise determination. Methods for haplotyping single individuals use the phasing information available in next-generation sequencing reads, by matching overlapping single-nucleotide polymorphisms while penalizing post hoc nucleotide corrections made. Haplotyping diploids is relatively easy, but the complexity of the problem increases drastically for polyploid genomes, which are found in both model organisms and in economically relevant plant and animal species. Although a number of tools are available for haplotyping polyploids, the effects of the genomic makeup and the sequencing strategy followed on the accuracy of these methods have hitherto not been thoroughly evaluated. We developed the simulation pipeline haplosim to evaluate the performance of three haplotype estimation algorithms for polyploids: HapCompass, HapTree and SDhaP, in settings varying in sequencing approach, ploidy levels and genomic diversity, using tetraploid potato as the model. Our results show that sequencing depth is the major determinant of haplotype estimation quality, that 1 kb PacBio circular consensus sequencing reads and Illumina reads with large insert-sizes are competitive and that all methods fail to produce good haplotypes when ploidy levels increase. Comparing the three methods, HapTree produces the most accurate estimates, but also consumes the most resources. There is clearly room for improvement in polyploid haplotyping algorithms.


Subjects
Computer Simulation , Haplotypes , High-Throughput Nucleotide Sequencing/methods , Polyploidy , DNA Sequence Analysis/methods , Solanum tuberosum/genetics , Algorithms , Plant Genome , Genomics
12.
Bioinformatics ; 35(18): 3279-3286, 2019 09 15.
Article in English | MEDLINE | ID: mdl-30689725

ABSTRACT

SUMMARY: Haplotype assembly of polyploids is an open issue in plant genomics. Recent experimental studies on highly heterozygous autotetraploid potato have shown that available methods do not deliver satisfying results in practice. We propose an optimal method to assemble haplotypes of highly heterozygous polyploids from Illumina short-sequencing reads. Our method is based on a generalization of the existing minimum fragment removal model to the polyploid case and on new integer linear programs to reconstruct optimal haplotypes. We validate our methods experimentally by means of a combined evaluation on simulated and experimental data based on 83 previously sequenced autotetraploid potato cultivars. Results on simulated data show that our methods produce highly accurate haplotype assemblies, while results on experimental data confirm a sensible improvement over the state of the art. AVAILABILITY AND IMPLEMENTATION: Executables for Linux at http://github.com/ComputationalGenomics/HaplotypeAssembler. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subjects
Solanum tuberosum , Algorithms , Haplotypes , Linear Programming , DNA Sequence Analysis , Software
13.
Bioinformatics ; 35(20): 4147-4155, 2019 10 15.
Article in English | MEDLINE | ID: mdl-30903186

ABSTRACT

MOTIVATION: Modern genomic breeding methods rely heavily on very large amounts of phenotyping and genotyping data, presenting new challenges in effective data management and integration. Recently, the size and complexity of datasets have increased significantly, with the result that data are often stored on multiple systems. As analyses of interest increasingly require aggregation of datasets from diverse sources, data exchange between disparate systems becomes a challenge. RESULTS: To facilitate interoperability among breeding applications, we present the public plant Breeding Application Programming Interface (BrAPI). BrAPI is a standardized web service API specification. The development of BrAPI is a collaborative, community-based initiative involving a growing global community of over a hundred participants representing several dozen institutions and companies. Development of such a standard is recognized as critical to a number of important large breeding system initiatives as a foundational technology. The focus of the first version of the API is on providing services for connecting systems and retrieving basic breeding data including germplasm, study, observation, and marker data. A number of BrAPI-enabled applications, termed BrAPPs, have been written, that take advantage of the emerging support of BrAPI by many databases. AVAILABILITY AND IMPLEMENTATION: More information on BrAPI, including links to the specification, test suites, BrAPPs, and sample implementations is available at https://brapi.org/. The BrAPI specification and the developer tools are provided as free and open source.


Subjects
Plant Breeding , Software , User-Computer Interface , Genomics
14.
New Phytol ; 227(1): 260-273, 2020 07.
Article in English | MEDLINE | ID: mdl-32171029

ABSTRACT

Enabling data reuse and knowledge discovery is increasingly critical in modern science, and requires an effort towards standardising data publication practices. This is particularly challenging in the plant phenotyping domain, due to its complexity and heterogeneity. We have produced the MIAPPE 1.1 release, which enhances the existing MIAPPE standard in coverage, to support perennial plants, in structure, through an explicit data model, and in clarity, through definitions and examples. We evaluated MIAPPE 1.1 by using it to express several heterogeneous phenotyping experiments in a range of different formats, to demonstrate its applicability and the interoperability between the various implementations. Furthermore, the extended coverage is demonstrated by the fact that one of the datasets could not have been described under MIAPPE 1.0. MIAPPE 1.1 marks a major step towards enabling plant phenotyping data reusability, thanks to its extended coverage, and especially the formalisation of its data model, which facilitates its implementation in different formats. Community feedback has been critical to this development, and will be a key part of ensuring adoption of the standard.


Subjects
Phenomics , Plants , Plants/genetics
15.
Bioinformatics ; 34(22): 3864-3872, 2018 11 15.
Article in English | MEDLINE | ID: mdl-29868858

ABSTRACT

Motivation: Knowledge of haplotypes, i.e. phased and ordered marker alleles on a chromosome, is essential to answer many questions in genetics and genomics. By generating short pieces of DNA sequence, high-throughput modern sequencing technologies make estimation of haplotypes possible for single individuals. In polyploids, however, haplotype estimation methods usually require deep coverage to achieve sufficient accuracy. This often renders sequencing-based approaches too costly to be applied to large populations needed in studies of Quantitative Trait Loci. Results: We propose a novel haplotype estimation method for polyploids, TriPoly, that combines sequencing data with Mendelian inheritance rules to infer haplotypes in parent-offspring trios. Using realistic simulations of both short and long-read sequencing data for banana (Musa acuminata) and potato (Solanum tuberosum) trios, we show that TriPoly yields more accurate progeny haplotypes at low coverages compared to existing methods that work on single individuals. We also apply TriPoly to phase Single Nucleotide Polymorphisms on chromosome 5 for a family of tetraploid potato with 2 parents and 37 offspring sequenced with an RNA capture approach. We show that TriPoly haplotype estimates differ from those of the other methods mainly in regions with imperfect sequencing or mapping difficulties, as it does not rely solely on sequence reads and aims to avoid phasings that are not likely to have been passed from the parents to the offspring. Availability and implementation: TriPoly has been implemented in Python 3.5.2 (also compatible with Python 2.7.3 and higher) and can be freely downloaded at https://github.com/EhsanMotazedi/TriPoly. Supplementary information: Supplementary data are available at Bioinformatics online.


Subjects
Algorithms , Polyploidy , Alleles , Haplotypes , High-Throughput Nucleotide Sequencing , DNA Sequence Analysis
16.
BMC Bioinformatics ; 19(1): 183, 2018 05 25.
Article in English | MEDLINE | ID: mdl-29801439

ABSTRACT

BACKGROUND: A quantitative trait locus (QTL) is a genomic region that correlates with a phenotype. Most of the experimental information about QTL mapping studies is described in tables of scientific publications. Traditional text mining techniques aim to extract information from unstructured text rather than from tables. We present QTLTableMiner++ (QTM), a table mining tool that extracts and semantically annotates QTL information buried in (heterogeneous) tables of plant science literature. QTM is a command line tool written in the Java programming language. This tool takes scientific articles from the Europe PMC repository as input, extracts QTL tables using keyword matching and ontology-based concept identification. The tables are further normalized using rules derived from table properties such as captions, column headers and table footers. Furthermore, table columns are classified into three categories namely column descriptors, properties and values based on column headers and data types of cell entries. Abbreviations found in the tables are expanded using the Schwartz and Hearst algorithm. Finally, the content of QTL tables is semantically enriched with domain-specific ontologies (e.g. Crop Ontology, Plant Ontology and Trait Ontology) using the Apache Solr search platform and the results are stored in a relational database and a text file. RESULTS: The performance of the QTM tool was assessed by precision and recall based on the information retrieved from two manually annotated corpora of open access articles, i.e. QTL mapping studies in tomato (Solanum lycopersicum) and in potato (S. tuberosum). In summary, QTM detected QTL statements in tomato with 74.53% precision and 92.56% recall and in potato with 82.82% precision and 98.94% recall. CONCLUSION: QTM is a unique tool that aids in providing QTL information in machine-readable and semantically interoperable formats.


Subjects
Data Mining/methods , Quantitative Trait Loci , Software , Algorithms , Computer Graphics , Factual Databases , Solanum lycopersicum/genetics , Publications , Semantics , Solanum tuberosum/genetics
17.
BMC Genomics ; 19(Suppl 2): 110, 2018 May 09.
Article in English | MEDLINE | ID: mdl-29764364

ABSTRACT

BACKGROUND: Inference of haplotypes, or the sequence of alleles along the same chromosomes, is a fundamental problem in genetics and is a key component for many analyses including admixture mapping, identifying regions of identity by descent and imputation. Haplotype phasing based on sequencing reads has attracted a lot of attention. Diploid haplotype phasing, where the two haplotypes are complementary, has been studied extensively. In this work, we focused on polyploid haplotype phasing, where we aim to phase more than two haplotypes at the same time from sequencing data. The problem is much more complicated as the search space becomes much larger and the haplotypes do not need to be complementary any more. RESULTS: We proposed two algorithms: (1) Poly-Harsh, a Gibbs sampling based algorithm which alternately samples haplotypes and the read assignments to minimize the mismatches between the reads and the phased haplotypes; (2) an efficient algorithm to concatenate haplotype blocks into contiguous haplotypes. CONCLUSIONS: Our experiments showed that our method is able to improve the quality of the phased haplotypes over the state-of-the-art methods. To our knowledge, our algorithm for haplotype blocks concatenation is the first algorithm that leverages the shared information across multiple individuals to construct contiguous haplotypes. Our experiments showed that it is both efficient and effective.


Subjects
Genomics/methods , Haplotypes , Polyploidy , Algorithms , Genome , DNA Sequence Analysis
18.
Plant J ; 80(1): 136-48, 2014 Oct.
Article in English | MEDLINE | ID: mdl-25039268

ABSTRACT

We explored genetic variation by sequencing a selection of 84 tomato accessions and related wild species representative of the Lycopersicon, Arcanum, Eriopersicon and Neolycopersicon groups, which has yielded a huge amount of precious data on sequence diversity in the tomato clade. Three new reference genomes were reconstructed to support our comparative genome analyses. Comparative sequence alignment revealed group-, species- and accession-specific polymorphisms, explaining characteristic fruit traits and growth habits in the various cultivars. Using gene models from the annotated Heinz 1706 reference genome, we observed differences in the ratio between non-synonymous and synonymous SNPs (dN/dS) in fruit diversification and plant growth genes compared to a random set of genes, indicating positive selection and differences in selection pressure between crop accessions and wild species. In wild species, the number of single-nucleotide polymorphisms (SNPs) exceeds 10 million, i.e. 20-fold higher than found in most of the crop accessions, indicating dramatic genetic erosion of crop and heirloom tomatoes. In addition, the highest levels of heterozygosity were found for allogamous self-incompatible wild species, while facultative and autogamous self-compatible species display a lower heterozygosity level. Using whole-genome SNP information for maximum-likelihood analysis, we achieved complete tree resolution, whereas maximum-likelihood trees based on SNPs from ten fruit and growth genes show incomplete resolution for the crop accessions, partly due to the effect of heterozygous SNPs. Finally, results suggest that phylogenetic relationships are correlated with habitat, indicating the occurrence of geographical races within these groups, which is of practical importance for Solanum genome evolution studies.


Subjects
Genetic Variation , Genome, Plant/genetics , Solanum lycopersicum/genetics , Breeding , Chromosome Mapping , DNA, Plant/chemistry , DNA, Plant/genetics , Fruit/genetics , High-Throughput Nucleotide Sequencing , Molecular Sequence Data , Phenotype , Phylogeny , Polymorphism, Single Nucleotide , Sequence Alignment , Sequence Analysis, DNA , Species Specificity
20.
Theor Appl Genet ; 128(10): 1987-97, 2015 Oct.
Article in English | MEDLINE | ID: mdl-26152571

ABSTRACT

KEY MESSAGE: A chromosomal inversion associated with the tomato Ty-2 gene for TYLCV resistance is the cause of severe suppression of recombination in a tomato Ty-2 introgression line. Among tomato and its wild relatives inversions are often observed, which result in suppression of recombination. Such inversions hamper the transfer of important traits from a related species to the crop by introgression breeding. Suppression of recombination was reported for the TYLCV resistance gene, Ty-2, which has been introgressed in cultivated tomato (Solanum lycopersicum) from the wild relative S. habrochaites accession B6013. Ty-2 was mapped to a 300-kb region on the long arm of chromosome 11. The suppression of recombination in the Ty-2 region could be caused by chromosomal rearrangements in S. habrochaites compared with S. lycopersicum. With the aim of visualizing the genome structure of the Ty-2 region, we compared the draft de novo assembly of S. habrochaites accession LYC4 with the sequence of cultivated tomato ('Heinz'). Furthermore, using populations derived from intraspecific crosses of S. habrochaites accessions, the order of markers in the Ty-2 region was studied. Results showed the presence of an inversion of approximately 200 kb in the Ty-2 region when comparing S. lycopersicum and S. habrochaites. By sequencing a BAC clone from the Ty-2 introgression line, one inversion breakpoint was identified. Finally, the obtained results are discussed with respect to introgression breeding and the importance of a priori de novo sequencing of the species involved.


Subjects
Chromosome Inversion , Disease Resistance/genetics , Solanum lycopersicum/genetics , Solanum/genetics , Chromosome Mapping , Chromosomes, Artificial, Bacterial , Chromosomes, Plant , Cloning, Molecular , DNA, Plant/genetics , Genetic Markers , Solanum lycopersicum/virology , Mosaic Viruses , Plant Breeding , Plant Diseases/genetics , Plant Diseases/virology , Recombination, Genetic , Sequence Alignment , Sequence Analysis, DNA , Solanum/virology