RESUMO
The reniform nematode (RN), a major agricultural pest particularly on cotton in the United States, is among the major plant-parasitic nematodes for which limited genomic information exists. In this study, over 380 Mb of sequence data were generated from pooled DNA of four adult female RNs and assembled into 67,317 contigs, including 25,904 (38.5%) predicted coding contigs and 41,413 (61.5%) noncoding contigs. Most of the characterized repeats were of low complexity (88.9%), and 0.9% of the contigs matched with 53.2% of GenBank ESTs. The most frequent Gene Ontology (GO) terms for molecular function and biological process were protein binding (32%) and embryonic development (20%). Further analysis showed that 741 (1.1%), 94 (0.1%), and 169 (0.25%) RN genomic contigs matched with 1328 (13.9%), 1480 (5.4%), and 1330 (7.4%) supercontigs of Meloidogyne incognita, Brugia malayi, and Pristionchus pacificus, respectively. Chromosome 5 of Caenorhabditis elegans had the highest number of hits to the RN contigs. Seven putative detoxification genes and three carbohydrate-active enzymes (CAZymes) involved in cell wall degradation were studied in more detail. Additionally, kinases, G protein-coupled receptors, and neuropeptides functioning in physiological, developmental, and regulatory processes were identified in the RN genome.
Assuntos
Genoma Helmíntico , Genômica , Nematoides/genética , Animais , Biologia Computacional/métodos , Bases de Dados Genéticas , Feminino , Perfilação da Expressão Gênica , Ontologia Genética , Gossypium/parasitologia , Anotação de Sequência Molecular , Nematoides/classificação , Análise de Sequência de DNA , TranscriptomaRESUMO
U.S. cotton production is suffering from the yield loss caused by the reniform nematode (RN), Rotylenchulus reniformis. Management of this devastating pest is of utmost importance because, no upland cotton cultivar exhibits adequate resistance to RN. Nine populations of RN from distinct regions in Alabama and one population from Mississippi were studied and thirteen morphometric features were measured on 20 male and 20 female nematodes from each population. Highly correlated variables (positive) in female and male RN morphometric parameters were observed for body length (L) and distance of vulva from the lip region (V) (r = 0.7) and tail length (TL) and c' (r = 0.8), respectively. The first and second principal components for the female and male populations showed distinct clustering into three groups. These results show pattern of sub-groups within the RN populations in Alabama. A one-way ANOVA on female and male RN populations showed significant differences (p ≤ 0.05) among the variables. Multiple sequence alignment (MSA) of 18S rRNA sequences (421) showed lengths of 653 bp. Sites within the aligned sequences were conserved (53%), parsimony-informative (17%), singletons (28%), and indels (2%), respectively. Neighbor-Joining analysis showed intra and inter-nematodal variations within the populations as clone sequences from different nematodes irrespective of the sex of nematode isolate clustered together. Morphologically, the three groups (I, II and III) could not be distinctly associated with the molecular data from the 18S rRNA sequences. The three groups may be identified as being non-geographically contiguous.
RESUMO
BACKGROUND: Earlier comparative maps between the genomes of rice (Oryza sativa L.), barley (Hordeum vulgare L.) and wheat (Triticum aestivum L.) were linkage maps based on cDNA-RFLP markers. The low number of polymorphic RFLP markers has limited the development of dense genetic maps in wheat and the number of available anchor points in comparative maps. Higher density comparative maps using PCR-based anchor markers are necessary to better estimate the conservation of colinearity among cereal genomes. The purposes of this study were to characterize the proportion of transcribed DNA sequences containing simple sequence repeats (SSR or microsatellites) by length and motif for wheat, barley and rice and to determine in-silico rice genome locations for primer sets developed for wheat and barley Expressed Sequence Tags. RESULTS: The proportions of SSR types (di-, tri-, tetra-, and penta-nucleotide repeats) and motifs varied with the length of the SSRs within and among the three species, with trinucleotide SSRs being the most frequent. Distributions of genomic microsatellites (gSSRs), EST-derived microsatellites (EST-SSRs), and transcribed regions in the contiguous sequence of rice chromosome 1 were highly correlated. More than 13,000 primer pairs were developed for use by the cereal research community as potential markers in wheat, barley and rice. CONCLUSION: Trinucleotide SSRs were the most common type in each of the species; however, the relative proportions of SSR types and motifs differed among rice, wheat, and barley. Genomic microsatellites were found to be primarily located in gene-rich regions of the rice genome. Microsatellite markers derived from the use of non-redundant EST-SSRs are an economic and efficient alternative to RFLP for comparative mapping in cereals.
Assuntos
Etiquetas de Sequências Expressas , Genes de Plantas , Hordeum/genética , Repetições de Microssatélites , Oryza/genética , Triticum/genética , Motivos de Aminoácidos , Mapeamento Cromossômico , Cromossomos de Plantas , Primers do DNA/química , Ligação Genética , Marcadores Genéticos , Genoma de Planta , Modelos Genéticos , Modelos Estatísticos , Nucleotídeos/genética , Reação em Cadeia da Polimerase , Polimorfismo Genético , Polimorfismo de Fragmento de Restrição , Análise de Regressão , Transcrição GênicaRESUMO
The 18S rRNA gene is fundamental to cellular and organismal protein synthesis and because of its stable persistence through generations it is also used in phylogenetic analysis among taxa. Sequence variation in this gene within a single species is rare, but it has been observed in few metazoan organisms. More frequently it has mostly been reported in the non-transcribed spacer region. Here, we have identified two sequence variants within the near full coding region of 18S rRNA gene from a single reniform nematode (RN) Rotylenchulus reniformis labeled as reniform nematode variant 1 (RN_VAR1) and variant 2 (RN_VAR2). All sequences from three of the four isolates had both RN variants in their sequences; however, isolate 13B had only RN variant 2 sequence. Specific variable base sites (96 or 5.5%) were found within the 18S rRNA gene that can clearly distinguish the two 18S rDNA variants of RN, in 11 (25.0%) and 33 (75.0%) of the 44 RN clones, for RN_VAR1 and RN_VAR2, respectively. Neighbor-joining trees show that the RN_VAR1 is very similar to the previously existing R. reniformis sequence in GenBank, while the RN_VAR2 sequence is more divergent. This is the first report of the identification of two major variants of the 18S rRNA gene in the same single RN, and documents the specific base variation between the two variants, and hypothesizes on simultaneous co-existence of these two variants for this gene.
Assuntos
Variação Genética , Filogenia , RNA Ribossômico 18S/genética , Tylenchida/genética , Animais , Sequência de Bases , Análise por Conglomerados , Primers do DNA/genética , Dados de Sequência Molecular , Alinhamento de Sequência , Análise de Sequência de DNARESUMO
Although new and emerging next-generation sequencing (NGS) technologies have reduced sequencing costs significantly, much work remains to implement them for de novo sequencing of complex and highly repetitive genomes such as the tetraploid genome of Upland cotton (Gossypium hirsutum L.). Herein we report the results from implementing a novel, hybrid Sanger/454-based BAC-pool sequencing strategy using minimum tiling path (MTP) BACs from Ctg-3301 and Ctg-465, two large genomic segments in A12 and D12 homoeologous chromosomes (Ctg). To enable generation of longer contig sequences in assembly, we implemented a hybrid assembly method to process ~35x data from 454 technology and 2.8-3x data from Sanger method. Hybrid assemblies offered higher sequence coverage and better sequence assemblies. Homology studies revealed the presence of retrotransposon regions like Copia and Gypsy elements in these contigs and also helped in identifying new genomic SSRs. Unigenes were anchored to the sequences in Ctg-3301 and Ctg-465 to support the physical map. Gene density, gene structure and protein sequence information derived from protein prediction programs were used to obtain the functional annotation of these genes. Comparative analysis of both contigs with Arabidopsis genome exhibited synteny and microcollinearity with a conserved gene order in both genomes. This study provides insight about use of MTP-based BAC-pool sequencing approach for sequencing complex polyploid genomes with limited constraints in generating better sequence assemblies to build reference scaffold sequences. Combining the utilities of MTP-based BAC-pool sequencing with current longer and short read NGS technologies in multiplexed format would provide a new direction to cost-effectively and precisely sequence complex plant genomes.
Assuntos
Cromossomos Artificiais Bacterianos/genética , Cromossomos de Plantas/genética , DNA de Plantas/genética , Gossypium/genética , Análise de Sequência de DNA/métodos , Mapeamento de Sequências Contíguas , DNA de Plantas/química , Genoma de Planta/genética , Biblioteca Genômica , Poliploidia , Reprodutibilidade dos Testes , Retroelementos/genéticaRESUMO
New source of molecular markers accelerate the efforts in improving cotton fiber traits and aid in developing high-density integrated genetic maps. We developed new markers based on candidate genes and G. arboreum EST sequences that were used for polymorphism detection followed by genetic and physical mapping. Nineteen gene-based markers were surveyed for polymorphism detection in 26 Gossypium species. Cluster analysis generated a phylogenetic tree with four major sub-clusters for 23 species while three species branched out individually. CAP method enhanced the rate of polymorphism of candidate gene-based markers between G. hirsutum and G. barbadense. Two hundred A-genome based SSR markers were designed after datamining of G. arboreum EST sequences (Mississippi Gossypium arboreum EST-SSR: MGAES). Over 70% of MGAES markers successfully produced amplicons while 65 of them demonstrated polymorphism between the parents of G. hirsutum and G. barbadense RIL population and formed 14 linkage groups. Chromosomal localization of both candidate gene-based and MGAES markers was assisted by euploid and hypoaneuploid CS-B analysis. Gene-based and MGAES markers were highly informative as they were designed from candidate genes and fiber transcriptome with a potential to be integrated into the existing cotton genetic and physical maps.
RESUMO
Tef [Eragrostis tef (Zucc.) Trotter] is the major cereal crop in Ethiopia. Tef is an allotetraploid with a base chromosome number of 10 (2n = 4x = 40) and a genome size of 730 Mbp. Ninety-four F(9) recombinant inbred lines (RIL) derived from the interspecific cross, Eragrostis tef cv. Kaye Murri x Eragrostis pilosa (accession 30-5), were mapped using restriction fragment length polymorphisms (RFLP), simple sequence repeats derived from expressed sequence tags (EST-SSR), single nucleotide polymorphism/insertion and deletion (SNP/INDEL), intron fragment length polymorphism (IFLP) and inter-simple sequence repeat amplification (ISSR). A total of 156 loci from 121 markers was grouped into 21 linkage groups at LOD 4, and the map covered 2,081.5 cM with a mean density of 12.3 cM per locus. Three putative homoeologous groups were identified based on multi-locus markers. Sixteen percent of the loci deviated from normal segregation with a predominance of E. tef alleles, and a majority of the distorted loci were clustered on three linkage groups. This map will be useful for further genetic studies in tef including mapping of loci controlling quantitative traits (QTL), and comparative analysis with other cereal crops.
Assuntos
Mapeamento Cromossômico , Eragrostis/genética , Ligação Genética , Etiquetas de Sequências Expressas , Deleção de Genes , Marcadores Genéticos , Íntrons , Repetições Minissatélites , Polimorfismo de Fragmento de Restrição , Polimorfismo de Nucleotídeo ÚnicoRESUMO
Plant genomics projects involving model species and many agriculturally important crops are resulting in a rapidly increasing database of genomic and expressed DNA sequences. The publicly available collection of expressed sequence tags (ESTs) from several grass species can be used in the analysis of both structural and functional relationships in these genomes. We analyzed over 260000 EST sequences from five different cereals for their potential use in developing simple sequence repeat (SSR) markers. The frequency of SSR-containing ESTs (SSR-ESTs) in this collection varied from 1.5% for maize to 4.7% for rice. In addition, we identified several ESTs that are related to the SSR-ESTs by BLAST analysis. The SSR-ESTs and the related sequences were clustered within each species in order to reduce the redundancy and to produce a longer consensus sequence. The consensus and singleton sequences from each species were pooled and clustered to identify cross-species matches. Overall a reduction in the redundancy by 85% was observed when the resulting consensus and singleton sequences (3569) were compared to the total number of SSR-EST and related sequences analyzed (24 606). This information can be useful for the development of SSR markers that can amplify across the grass genera for comparative mapping and genetics. Functional analysis may reveal their role in plant metabolism and gene evolution.
Assuntos
Etiquetas de Sequências Expressas , Repetições de Microssatélites/genética , Poaceae/genética , Biologia Computacional , Bases de Dados Factuais , Marcadores Genéticos , Hordeum/genética , Oryza/genética , Homologia de Sequência do Ácido Nucleico , Triticum/genética , Zea mays/genéticaRESUMO
The use of DNA sequence-based comparative genomics for evolutionary studies and for transferring information from model species to crop species has revolutionized molecular genetics and crop improvement strategies. This study compared 4485 expressed sequence tags (ESTs) that were physically mapped in wheat chromosome bins, to the public rice genome sequence data from 2251 ordered BAC/PAC clones using BLAST. A rice genome view of homologous wheat genome locations based on comparative sequence analysis revealed numerous chromosomal rearrangements that will significantly complicate the use of rice as a model for cross-species transfer of information in nonconserved regions.