Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 24
Filtrar
Mais filtros








Base de dados
Intervalo de ano de publicação
1.
Plant Genome ; : e20447, 2024 Apr 17.
Artigo em Inglês | MEDLINE | ID: mdl-38628142

RESUMO

Sesame (Sesamum indicum L.) is an ancient oilseed crop belonging to the family Pedaliaceae and a globally cultivated crop for its use as oil and food. In this study, 2496 sesame accessions, being conserved at the National Genebank of ICAR-National Bureau of Plant Genetic Resources (NBPGR), were genotyped using genomics-assisted double-digest restriction-associated DNA sequencing (ddRAD-seq) approach. A total of 64,910 filtered single-nucleotide polymorphisms (SNPs) were utilized to assess the genome-scale diversity. Applications of this genome-scale information (reduced representation using restriction enzymes) are demonstrated through the development of a molecular core collection (CC) representing maximal SNP diversity. This information is also applied in developing a mid-density panel (MDP) comprising 2515 hyper-variable SNPs, representing almost equally the genic and non-genic regions. The sesame CC comprising 384 accessions, a representative set of accessions with maximal diversity, was identified using multiple criteria such as k-mer (subsequence of length "k" in a sequence read) diversity, observed heterozygosity, CoreHunter3, GenoCore, and genetic differentiation. The coreset constituted around 15% of the total accessions studied, and this small subset had captured >60% SNP diversity of the entire population. In the coreset, the admixture analysis shows reduced genetic complexity, increased nucleotide diversity (π), and is geographically distributed without any repetitiveness in the CC germplasm. Within the CC, India-originated accessions exhibit higher diversity (as expected based on the center of diversity concept), than those accessions that were procured from various other countries. The identified CC set and the MDP will be a valuable resource for genomics-assisted accelerated sesame improvement program.

2.
Life (Basel) ; 13(8)2023 Jul 31.
Artigo em Inglês | MEDLINE | ID: mdl-37629524

RESUMO

Sequencing technologies have rapidly evolved over the past two decades, and new technologies are being continually developed and commercialized. The emerging sequencing technologies target generating more data with fewer inputs and at lower costs. This has also translated to an increase in the number and type of corresponding applications in genomics besides enhanced computational capacities (both hardware and software). Alongside the evolving DNA sequencing landscape, bioinformatics research teams have also evolved to accommodate the increasingly demanding techniques used to combine and interpret data, leading to many researchers moving from the lab to the computer. The rich history of DNA sequencing has paved the way for new insights and the development of new analysis methods. Understanding and learning from past technologies can help with the progress of future applications. This review focuses on the evolution of sequencing technologies, their significant enabling role in generating plant genome assemblies and downstream applications, and the parallel development of bioinformatics tools and skills, filling the gap in data analysis techniques.

3.
PLoS One ; 18(6): e0286599, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-37267340

RESUMO

To reduce the genome sequence representation, restriction site-associated DNA sequencing (RAD-seq) protocols is being widely used either with single-digest or double-digest methods. In this study, we genotyped the sesame population (48 sample size) in a pilot scale to compare single and double-digest RAD-seq (sd and ddRAD-seq) methods. We analysed the resulting short-read data generated from both protocols and assessed their performance impacting the downstream analysis using various parameters. The distinct k-mer count and gene presence absence variation (PAV) showed a significant difference between the sesame samples studied. Additionally, the variant calling from both datasets (sdRAD-seq and ddRAD-seq) exhibits a significant difference between them. The combined variants from both datasets helped in identifying the most diverse samples and possible sub-groups in the sesame population. The most diverse samples identified from each analysis (k-mer, gene PAV, SNP count, Heterozygosity, NJ and PCA) can possibly be representative samples holding major diversity of the small sesame population used in this study. The best possible strategies with suggested inputs for modifications to utilize the RAD-seq strategy efficiently on a large dataset containing thousands of samples to be subjected to molecular analysis like diversity, population structure and core development studies were discussed.


Assuntos
Sesamum , Sesamum/genética , Genoma , Genótipo , Análise de Sequência de DNA/métodos , Sequência de Bases
4.
Front Plant Sci ; 14: 1143512, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-37008459

RESUMO

Due to evolutionary divergence, sorghum race populations exhibit significant genetic and morphological variation. A k-mer-based sorghum race sequence comparison identified the conserved k-mers of all 272 accessions from sorghum and the race-specific genetic signatures identified the gene variability in 10,321 genes (PAVs). To understand sorghum race structure, diversity and domestication, a deep learning-based variant calling approach was employed in a set of genotypic data derived from a diverse panel of 272 sorghum accessions. The data resulted in 1.7 million high-quality genome-wide SNPs and identified selective signature (both positive and negative) regions through a genome-wide scan with different (iHS and XP-EHH) statistical methods. We discovered 2,370 genes associated with selection signatures including 179 selective sweep regions distributed over 10 chromosomes. Co-localization of these regions undergoing selective pressure with previously reported QTLs and genes revealed that the signatures of selection could be related to the domestication of important agronomic traits such as biomass and plant height. The developed k-mer signatures will be useful in the future to identify the sorghum race and for trait and SNP markers for assisting in plant breeding programs.

5.
Methods Mol Biol ; 2443: 273-284, 2022.
Artigo em Inglês | MEDLINE | ID: mdl-35037212

RESUMO

With the emerging sequencing technologies and cost reduction, the sequence data generation has accelerated from a single individual to multiple (thousands of) individuals of a species. The terabytes of sequence data generated from thousands of individuals include the majority of the redundant sequence which depends on the level of sequence similarity within the population of individuals. Managing large datasets and creating the unique catalogue sequence from such a large population is challenging to analyze, store, and retrieve the information. In this chapter, we discuss the practical haplotype graph (PHG) which addresses the above said challenges and also able to retrieve required information such as variants and sequences more efficiently, which enable researchers to manage and assess large genomic data.


Assuntos
Genoma , Genômica , Haplótipos/genética , Análise de Sequência de DNA
6.
Plant Genome ; 15(1): e20166, 2022 03.
Artigo em Inglês | MEDLINE | ID: mdl-34786880

RESUMO

Achieving yield potential in chickpea (Cicer arietinum L.) is limited by many constraints that include biotic and abiotic stresses. Combining next-generation sequencing technology with advanced statistical modeling has the potential to increase genetic gain efficiently. Whole genome resequencing data was obtained from 315 advanced chickpea breeding lines from the Australian chickpea breeding program resulting in more than 298,000 single nucleotide polymorphisms (SNPs) discovered. Analysis of population structure revealed a distinct group of breeding lines with many alleles that are absent from recently released Australian cultivars. Genome-wide association studies (GWAS) using these Australian breeding lines identified 20 SNPs significantly associated with grain yield in multiple field environments. A reduced level of nucleotide diversity and extended linkage disequilibrium suggested that some regions in these chickpea genomes may have been through selective breeding for yield or other traits. A large introgression segment that introduced from C. echinospermum for phytophthora root rot resistance was identified on chromosome 6, yet it also has unintended consequences of reducing yield due to linkage drag. We further investigated the effect of genotype by environment interaction on genomic prediction of yield. We found that the training set had better prediction accuracy when phenotyped under conditions relevant to the targeted environments. We also investigated the effect of SNP functional annotation on prediction accuracy using different subsets of SNPs based on their genomic locations: regulatory regions, exome, and alternative splice sites. Compared with the whole SNP dataset, a subset of SNPs did not significantly decrease prediction accuracy for grain yield despite consisting of a smaller number of SNPs.


Assuntos
Cicer , Austrália , Mapeamento Cromossômico/métodos , Cicer/genética , Estudo de Associação Genômica Ampla , Melhoramento Vegetal , Polimorfismo de Nucleotídeo Único
7.
Front Plant Sci ; 12: 666342, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-34140962

RESUMO

Sorghum (Sorghum bicolor L.) is a staple food crops in the arid and rainfed production ecologies. Sorghum plays a critical role in resilient farming and is projected as a smart crop to overcome the food and nutritional insecurity in the developing world. The development and characterisation of the sorghum pan-genome will provide insight into genome diversity and functionality, supporting sorghum improvement. We built a sorghum pan-genome using reference genomes as well as 354 genetically diverse sorghum accessions belonging to different races. We explored the structural and functional characteristics of the pan-genome and explain its utility in supporting genetic gain. The newly-developed pan-genome has a total of 35,719 genes, a core genome of 16,821 genes and an average of 32,795 genes in each cultivar. The variable genes are enriched with environment responsive genes and classify the sorghum accessions according to their race. We show that 53% of genes display presence-absence variation, and some of these variable genes are predicted to be functionally associated with drought adaptation traits. Using more than two million SNPs from the pan-genome, association analysis identified 398 SNPs significantly associated with important agronomic traits, of which, 92 were in genes. Drought gene expression analysis identified 1,788 genes that are functionally linked to different conditions, of which 79 were absent from the reference genome assembly. This study provides comprehensive genomic diversity resources in sorghum which can be used in genome assisted crop improvement.

8.
Plant Biotechnol J ; 18(9): 1946-1954, 2020 09.
Artigo em Inglês | MEDLINE | ID: mdl-32020732

RESUMO

Pigeon pea (Cajanus cajan) is an important orphan crop mainly grown by smallholder farmers in India and Africa. Here, we present the first pigeon pea pangenome based on 89 accessions mainly from India and the Philippines, showing that there is significant genetic diversity in Philippine individuals that is not present in Indian individuals. Annotation of variable genes suggests that they are associated with self-fertilization and response to disease. We identified 225 SNPs associated with nine agronomically important traits over three locations and two different time points, with SNPs associated with genes for transcription factors and kinases. These results will lead the way to an improved pigeon pea breeding programme.


Assuntos
Cajanus , África , Cajanus/genética , Índia , Pisum sativum/genética
9.
Front Plant Sci ; 9: 190, 2018.
Artigo em Inglês | MEDLINE | ID: mdl-29515606

RESUMO

Drought tolerance is a complex trait that involves numerous genes. Identifying key causal genes or linked molecular markers can facilitate the fast development of drought tolerant varieties. Using a whole-genome resequencing approach, we sequenced 132 chickpea varieties and advanced breeding lines and found more than 144,000 single nucleotide polymorphisms (SNPs). We measured 13 yield and yield-related traits in three drought-prone environments of Western Australia. The genotypic effects were significant for all traits, and many traits showed highly significant correlations, ranging from 0.83 between grain yield and biomass to -0.67 between seed weight and seed emergence rate. To identify candidate genes, the SNP and trait data were incorporated into the SUPER genome-wide association study (GWAS) model, a modified version of the linear mixed model. We found that several SNPs from auxin-related genes, including auxin efflux carrier protein (PIN3), p-glycoprotein, and nodulin MtN21/EamA-like transporter, were significantly associated with yield and yield-related traits under drought-prone environments. We identified four genetic regions containing SNPs significantly associated with several different traits, which was an indication of pleiotropic effects. We also investigated the possibility of incorporating the GWAS results into a genomic selection (GS) model, which is another approach to deal with complex traits. Compared to using all SNPs, application of the GS model using subsets of SNPs significantly associated with the traits under investigation increased the prediction accuracies of three yield and yield-related traits by more than twofold. This has important implication for implementing GS in plant breeding programs.

10.
Front Plant Sci ; 8: 359, 2017.
Artigo em Inglês | MEDLINE | ID: mdl-28367154

RESUMO

Ascochyta blight (AB) is a fungal disease that can significantly reduce chickpea production in Australia and other regions of the world. In this study, 69 chickpea genotypes were sequenced using whole genome re-sequencing (WGRS) methods. They included 48 Australian varieties differing in their resistance ranking to AB, 16 advanced breeding lines from the Australian chickpea breeding program, four landraces, and one accession representing the wild chickpea species Cicer reticulatum. More than 800,000 single nucleotide polymorphisms (SNPs) were identified. Population structure analysis revealed relatively narrow genetic diversity amongst recently released Australian varieties and two groups of varieties separated by the level of AB resistance. Several regions of the chickpea genome were under positive selection based on Tajima's D test. Both Fst genome- scan and genome-wide association studies (GWAS) identified a 100 kb region (AB4.1) on chromosome 4 that was significantly associated with AB resistance. The AB4.1 region co-located to a large QTL interval of 7 Mb∼30 Mb identified previously in three different mapping populations which were genotyped at relatively low density with SSR or SNP markers. The AB4.1 region was validated by GWAS in an additional collection of 132 advanced breeding lines from the Australian chickpea breeding program, genotyped with approximately 144,000 SNPs. The reduced level of nucleotide diversity and long extent of linkage disequilibrium also suggested the AB4.1 region may have gone through selective sweeps probably caused by selection of the AB resistance trait in breeding. In total, 12 predicted genes were located in the AB4.1 QTL region, including those annotated as: NBS-LRR receptor-like kinase, wall-associated kinase, zinc finger protein, and serine/threonine protein kinases. One significant SNP located in the conserved catalytic domain of a NBS-LRR receptor-like kinase led to amino acid substitution. Transcriptional analysis using qPCR showed that some predicted genes were significantly induced in resistant lines after inoculation compared to non-inoculated plants. This study demonstrates the power of combining WGRS data with relatively simple traits to rapidly develop "functional makers" for marker-assisted selection and genomic selection.

11.
Plant Methods ; 12: 2, 2016.
Artigo em Inglês | MEDLINE | ID: mdl-26793268

RESUMO

BACKGROUND: There has been an exponential growth in the number of genome sequencing projects since the introduction of next generation DNA sequencing technologies. Genome projects have increasingly involved assembly of whole genome data which produces inferior assemblies compared to traditional Sanger sequencing of genomic fragments cloned into bacterial artificial chromosomes (BACs). While whole genome shotgun sequencing using next generation sequencing (NGS) is relatively fast and inexpensive, this method is extremely challenging for highly complex genomes, where polyploidy or high repeat content confounds accurate assembly, or where a highly accurate 'gold' reference is required. Several attempts have been made to improve genome sequencing approaches by incorporating NGS methods, to variable success. RESULTS: We present the application of a novel BAC sequencing approach which combines indexed pools of BACs, Illumina paired read sequencing, a sequence assembler specifically designed for complex BAC assembly, and a custom bioinformatics pipeline. We demonstrate this method by sequencing and assembling BAC cloned fragments from bread wheat and sugarcane genomes. CONCLUSIONS: We demonstrate that our assembly approach is accurate, robust, cost effective and scalable, with applications for complete genome sequencing in large and complex genomes.

12.
Methods Mol Biol ; 1374: 233-40, 2016.
Artigo em Inglês | MEDLINE | ID: mdl-26519409

RESUMO

Second-generation sequencing (SGS) technology has enabled the sequencing of genomes and identification of genes. However, large complex plant genomes remain particularly difficult for de novo assembly. Access to the vast quantity of raw sequence data may facilitate discoveries; however the volume of this data makes access difficult. This chapter discusses the Web-based tool TAGdb that enables researchers to identify paired read second-generation DNA sequence data that share identity with a submitted query sequence. The identified reads can be used for PCR amplification of genomic regions to identify genes and promoters without the need for genome assembly.


Assuntos
Biologia Computacional/métodos , Genômica/métodos , Análise de Sequência de DNA/métodos , Software , Bases de Dados Genéticas , Sequenciamento de Nucleotídeos em Larga Escala , Plantas/genética , Navegador
13.
Sci Rep ; 5: 15296, 2015 Oct 19.
Artigo em Inglês | MEDLINE | ID: mdl-26478518

RESUMO

A combination of two approaches, namely QTL analysis and gene enrichment analysis were used to identify candidate genes in the "QTL-hotspot" region for drought tolerance present on the Ca4 pseudomolecule in chickpea. In the first approach, a high-density bin map was developed using 53,223 single nucleotide polymorphisms (SNPs) identified in the recombinant inbred line (RIL) population of ICC 4958 (drought tolerant) and ICC 1882 (drought sensitive) cross. QTL analysis using recombination bins as markers along with the phenotyping data for 17 drought tolerance related traits obtained over 1-5 seasons and 1-5 locations split the "QTL-hotspot" region into two subregions namely "QTL-hotspot_a" (15 genes) and "QTL-hotspot_b" (11 genes). In the second approach, gene enrichment analysis using significant marker trait associations based on SNPs from the Ca4 pseudomolecule with the above mentioned phenotyping data, and the candidate genes from the refined "QTL-hotspot" region showed enrichment for 23 genes. Twelve genes were found common in both approaches. Functional validation using quantitative real-time PCR (qRT-PCR) indicated four promising candidate genes having functional implications on the effect of "QTL-hotspot" for drought tolerance in chickpea.


Assuntos
Adaptação Biológica/genética , Cicer/genética , Cicer/metabolismo , Secas , Genes de Plantas , Locos de Características Quantitativas , Estresse Fisiológico/genética , Pontos de Quebra do Cromossomo , Mapeamento Cromossômico , Regulação da Expressão Gênica de Plantas , Ligação Genética , Genoma de Planta , Estudo de Associação Genômica Ampla , Genômica , Genótipo , Sequenciamento de Nucleotídeos em Larga Escala , Endogamia , Polimorfismo de Nucleotídeo Único , Característica Quantitativa Herdável , Recombinação Genética , Reprodutibilidade dos Testes
14.
Artigo em Inglês | MEDLINE | ID: mdl-26289427

RESUMO

Molecular markers are valuable tools for breeders to help accelerate crop improvement. High throughput sequencing technologies facilitate the discovery of large-scale variations such as single nucleotide polymorphisms (SNPs) and simple sequence repeats (SSRs). Sequencing of chickpea genome along with re-sequencing of several chickpea lines has enabled the discovery of 4.4 million variations including SNPs and InDels. Here we report a repository of 1.9 million variations (SNPs and InDels) anchored on eight pseudomolecules in a custom database, referred as CicArVarDB that can be accessed at http://cicarvardb.icrisat.org/. It includes an easy interface for users to select variations around specific regions associated with quantitative trait loci, with embedded webBLAST search and JBrowse visualisation. We hope that this database will be immensely useful for the chickpea research community for both advancing genetics research as well as breeding applications for crop improvement. Database URL: http://cicarvardb.icrisat.org.


Assuntos
Cicer/genética , Produtos Agrícolas/genética , Bases de Dados de Ácidos Nucleicos , Mutação INDEL , Melhoramento Vegetal , Polimorfismo de Nucleotídeo Único , Locos de Características Quantitativas , Análise de Sequência de DNA/instrumentação , Análise de Sequência de DNA/métodos
15.
Theor Appl Genet ; 128(6): 1039-47, 2015 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-25754422

RESUMO

KEY MESSAGE: We characterise the distribution of crossover and non-crossover recombination in Brassica napus and Cicer arietinum using a low-coverage genotyping by sequencing pipeline SkimGBS. The growth of next-generation DNA sequencing technologies has led to a rapid increase in sequence-based genotyping for applications including diversity assessment, genome structure validation and gene-trait association. We have established a skim-based genotyping by sequencing method for crop plants and applied this approach to genotype-segregating populations of Brassica napus and Cicer arietinum. Comparison of progeny genotypes with those of the parental individuals allowed the identification of crossover and non-crossover (gene conversion) events. Our results identify the positions of recombination events with high resolution, permitting the mapping and frequency assessment of recombination in segregating populations.


Assuntos
Brassica napus/genética , Cicer/genética , Troca Genética , Conversão Gênica , Técnicas de Genotipagem , Mapeamento Cromossômico , Genoma de Planta , Genótipo , Sequenciamento de Nucleotídeos em Larga Escala , Polimorfismo de Nucleotídeo Único
16.
G3 (Bethesda) ; 5(4): 559-67, 2015 Feb 06.
Artigo em Inglês | MEDLINE | ID: mdl-25660167

RESUMO

Genetic structure can be altered by chemical mutagenesis, which is a common method applied in molecular biology and genetics. Second-generation sequencing provides a platform to reveal base alterations occurring in the whole genome due to mutagenesis. A model legume, Lotus japonicus ecotype Miyakojima, was chemically mutated with alkylating ethyl methanesulfonate (EMS) for the scanning of DNA lesions throughout the genome. Using second-generation sequencing, two individually mutated third-generation progeny (M3, named AM and AS) were sequenced and analyzed to identify single nucleotide polymorphisms and reveal the effects of EMS on nucleotide sequences in these mutant genomes. Single-nucleotide polymorphisms were found in every 208 kb (AS) and 202 kb (AM) with a bias mutation of G/C-to-A/T changes at low percentage. Most mutations were intergenic. The mutation spectrum of the genomes was comparable in their individual chromosomes; however, each mutated genome has unique alterations, which are useful to identify causal mutations for their phenotypic changes. The data obtained demonstrate that whole genomic sequencing is applicable as a high-throughput tool to investigate genomic changes due to mutagenesis. The identification of these single-point mutations will facilitate the identification of phenotypically causative mutations in EMS-mutated germplasm.


Assuntos
Alquilantes/química , Metanossulfonato de Etila/química , Genoma de Planta , Lotus/genética , Pareamento de Bases , DNA de Plantas/análise , DNA de Plantas/química , Frequência do Gene , Loci Gênicos , Sequenciamento de Nucleotídeos em Larga Escala , Fenótipo , Polimorfismo de Nucleotídeo Único , Análise de Sequência de DNA
17.
Plant Biotechnol J ; 13(1): 97-104, 2015 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-25147022

RESUMO

Despite being a major international crop, our understanding of the wheat genome is relatively poor due to its large size and complexity. To gain a greater understanding of wheat genome diversity, we have identified single nucleotide polymorphisms between 16 Australian bread wheat varieties. Whole-genome shotgun Illumina paired read sequence data were mapped to the draft assemblies of chromosomes 7A, 7B and 7D to identify more than 4 million intervarietal SNPs. SNP density varied between the three genomes, with much greater density observed on the A and B genomes than the D genome. This variation may be a result of substantial gene flow from the tetraploid Triticum turgidum, which possesses A and B genomes, during early co-cultivation of tetraploid and hexaploid wheat. In addition, we examined SNP density variation along the chromosome syntenic builds and identified genes in low-density regions which may have been selected during domestication and breeding. This study highlights the impact of evolution and breeding on the bread wheat genome and provides a substantial resource for trait association and crop improvement. All SNP data are publically available on a generic genome browser GBrowse at www.wheatgenome.info.


Assuntos
Pão , Cromossomos de Plantas/genética , Polimorfismo de Nucleotídeo Único/genética , Triticum/genética , Austrália , Genoma de Planta , Filogenia , Reprodutibilidade dos Testes
18.
Methods Mol Biol ; 1245: 29-47, 2015.
Artigo em Inglês | MEDLINE | ID: mdl-25373747

RESUMO

With the advent of sequencing technology, next-generation sequencing (NGS) technology has dramatically revolutionized plant genomics. NGS technology combined with new software tools enables the discovery, validation, and assessment of genetic markers on a large scale. Among different markers systems, simple sequence repeats (SSRs) and Single nucleotide polymorphisms (SNPs) are the markers of choice for genetics and plant breeding. SSR markers have been a choice for large-scale characterization of germplasm collections, construction of genetic maps, and QTL identification. Similarly, SNPs are the most abundant genetic variations with higher frequencies throughout the genome of plant species. This chapter discusses various tools available for genome assembly and widely focuses on SSR and SNP marker discovery.


Assuntos
Biologia Computacional/métodos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Sequência de Bases , Marcadores Genéticos , Repetições de Microssatélites/genética , Dados de Sequência Molecular , Polimorfismo de Nucleotídeo Único/genética , Triticum/genética
19.
PLoS One ; 9(7): e101673, 2014.
Artigo em Inglês | MEDLINE | ID: mdl-25006804

RESUMO

Resistance to pod shattering (shatter resistance) is a target trait for global rapeseed (canola, Brassica napus L.), improvement programs to minimise grain loss in the mature standing crop, and during windrowing and mechanical harvest. We describe the genetic basis of natural variation for shatter resistance in B. napus and show that several quantitative trait loci (QTL) control this trait. To identify loci underlying shatter resistance, we used a novel genotyping-by-sequencing approach DArT-Seq. QTL analysis detected a total of 12 significant QTL on chromosomes A03, A07, A09, C03, C04, C06, and C08; which jointly account for approximately 57% of the genotypic variation in shatter resistance. Through Genome-Wide Association Studies, we show that a large number of loci, including those that are involved in shattering in Arabidopsis, account for variation in shatter resistance in diverse B. napus germplasm. Our results indicate that genetic diversity for shatter resistance genes in B. napus is limited; many of the genes that might control this trait were not included during the natural creation of this species, or were not retained during the domestication and selection process. We speculate that valuable diversity for this trait was lost during the natural creation of B. napus. To improve shatter resistance, breeders will need to target the introduction of useful alleles especially from genotypes of other related species of Brassica, such as those that we have identified.


Assuntos
Brassica napus/genética , Genes de Plantas , Sementes/genética , Brassica napus/anatomia & histologia , Brassica napus/fisiologia , Mapeamento Cromossômico , Estudos de Associação Genética , Marcadores Genéticos , Polimorfismo de Nucleotídeo Único , Locos de Características Quantitativas , Sementes/anatomia & histologia , Sementes/fisiologia
20.
PLoS One ; 9(7): e101754, 2014.
Artigo em Inglês | MEDLINE | ID: mdl-25003610

RESUMO

Open source single nucleotide polymorphism (SNP) discovery pipelines for next generation sequencing data commonly requires working knowledge of command line interface, massive computational resources and expertise which is a daunting task for biologists. Further, the SNP information generated may not be readily used for downstream processes such as genotyping. Hence, a comprehensive pipeline has been developed by integrating several open source next generation sequencing (NGS) tools along with a graphical user interface called Integrated SNP Mining and Utilization (ISMU) for SNP discovery and their utilization by developing genotyping assays. The pipeline features functionalities such as pre-processing of raw data, integration of open source alignment tools (Bowtie2, BWA, Maq, NovoAlign and SOAP2), SNP prediction (SAMtools/SOAPsnp/CNS2snp and CbCC) methods and interfaces for developing genotyping assays. The pipeline outputs a list of high quality SNPs between all pairwise combinations of genotypes analyzed, in addition to the reference genome/sequence. Visualization tools (Tablet and Flapjack) integrated into the pipeline enable inspection of the alignment and errors, if any. The pipeline also provides a confidence score or polymorphism information content value with flanking sequences for identified SNPs in standard format required for developing marker genotyping (KASP and Golden Gate) assays. The pipeline enables users to process a range of NGS datasets such as whole genome re-sequencing, restriction site associated DNA sequencing and transcriptome sequencing data at a fast speed. The pipeline is very useful for plant genetics and breeding community with no computational expertise in order to discover SNPs and utilize in genomics, genetics and breeding studies. The pipeline has been parallelized to process huge datasets of next generation sequencing. It has been developed in Java language and is available at http://hpc.icrisat.cgiar.org/ISMU as a standalone free software.


Assuntos
Mineração de Dados/métodos , Sequenciamento de Nucleotídeos em Larga Escala , Polimorfismo de Nucleotídeo Único , Software , Cruzamento , Biologia Computacional/métodos , Genoma de Planta , Genômica/métodos , Internet , Plantas/genética
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA