Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 7 de 7
Filtrar
Más filtros










Base de datos
Intervalo de año de publicación
1.
Genetics ; 192(3): 819-29, 2012 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-22904035

RESUMEN

Tef (Eragrostis tef) is a major cereal crop in Ethiopia. Lodging is the primary constraint to increasing productivity in this allotetraploid species, accounting for losses of ∼15-45% in yield each year. As a first step toward identifying semi-dwarf varieties that might have improved lodging resistance, an ∼6× fosmid library was constructed and used to identify both homeologues of the dw3 semi-dwarfing gene of Sorghum bicolor. An EMS mutagenized population, consisting of ∼21,210 tef plants, was planted and leaf materials were collected into 23 superpools. Two dwarfing candidate genes, homeologues of dw3 of sorghum and rht1 of wheat, were sequenced directly from each superpool with 454 technology, and 120 candidate mutations were identified. Out of 10 candidates tested, six independent mutations were validated by Sanger sequencing, including two predicted detrimental mutations in both dw3 homeologues with a potential to improve lodging resistance in tef through further breeding. This study demonstrates that high-throughput sequencing can identify potentially valuable mutations in under-studied plant species like tef and has provided mutant lines that can now be combined and tested in breeding programs for improved lodging resistance.


Asunto(s)
Eragrostis/genética , Genes de Plantas , Secuenciación de Nucleótidos de Alto Rendimiento , Mutación , Secuencia de Bases , Etiopía , Estudios de Asociación Genética , Datos de Secuencia Molecular , Mutagénesis/efectos de los fármacos , Mutágenos/farmacología , Fenotipo , Reproducibilidad de los Resultados
2.
J Bacteriol ; 190(7): 2597-606, 2008 Apr.
Artículo en Inglés | MEDLINE | ID: mdl-18245285

RESUMEN

Escherichia coli DH10B was designed for the propagation of large insert DNA library clones. It is used extensively, taking advantage of properties such as high DNA transformation efficiency and maintenance of large plasmids. The strain was constructed by serial genetic recombination steps, but the underlying sequence changes remained unverified. We report the complete genomic sequence of DH10B by using reads accumulated from the bovine sequencing project at Baylor College of Medicine and assembled with DNAStar's SeqMan genome assembler. The DH10B genome is largely colinear with that of the wild-type K-12 strain MG1655, although it is substantially more complex than previously appreciated, allowing DH10B biology to be further explored. The 226 mutated genes in DH10B relative to MG1655 are mostly attributable to the extensive genetic manipulations the strain has undergone. However, we demonstrate that DH10B has a 13.5-fold higher mutation rate than MG1655, resulting from a dramatic increase in insertion sequence (IS) transposition, especially IS150. IS elements appear to have remodeled genome architecture, providing homologous recombination sites for a 113,260-bp tandem duplication and an inversion. DH10B requires leucine for growth on minimal medium due to the deletion of leuLABCD and harbors both the relA1 and spoT1 alleles causing both sensitivity to nutritional downshifts and slightly lower growth rates relative to the wild type. Finally, while the sequence confirms most of the reported alleles, the sequence of deoR is wild type, necessitating reexamination of the assumed basis for the high transformability of DH10B.


Asunto(s)
Escherichia coli/genética , Genoma Bacteriano , Escherichia coli/crecimiento & desarrollo , Genes Bacterianos , Modelos Genéticos , Mutación , Análisis de Secuencia de ADN
3.
Nature ; 440(7088): 1194-8, 2006 Apr 27.
Artículo en Inglés | MEDLINE | ID: mdl-16641997

RESUMEN

After the completion of a draft human genome sequence, the International Human Genome Sequencing Consortium has proceeded to finish and annotate each of the 24 chromosomes comprising the human genome. Here we describe the sequencing and analysis of human chromosome 3, one of the largest human chromosomes. Chromosome 3 comprises just four contigs, one of which currently represents the longest unbroken stretch of finished DNA sequence known so far. The chromosome is remarkable in having the lowest rate of segmental duplication in the genome. It also includes a chemokine receptor gene cluster as well as numerous loci involved in multiple human cancers such as the gene encoding FHIT, which contains the most common constitutive fragile site in the genome, FRA3B. Using genomic sequence from chimpanzee and rhesus macaque, we were able to characterize the breakpoints defining a large pericentric inversion that occurred some time after the split of Homininae from Ponginae, and propose an evolutionary history of the inversion.


Asunto(s)
Cromosomas Humanos Par 3/genética , Animales , Secuencia de Bases , Rotura Cromosómica/genética , Inversión Cromosómica/genética , Mapeo Contig , Islas de CpG/genética , ADN Complementario/genética , Evolución Molecular , Etiquetas de Secuencia Expresada , Proyecto Genoma Humano , Humanos , Macaca mulatta/genética , Datos de Secuencia Molecular , Pan troglodytes/genética , Análisis de Secuencia de ADN , Sintenía/genética
4.
Nature ; 440(7082): 346-51, 2006 Mar 16.
Artículo en Inglés | MEDLINE | ID: mdl-16541075

RESUMEN

Human chromosome 12 contains more than 1,400 coding genes and 487 loci that have been directly implicated in human disease. The q arm of chromosome 12 contains one of the largest blocks of linkage disequilibrium found in the human genome. Here we present the finished sequence of human chromosome 12, which has been finished to high quality and spans approximately 132 megabases, representing approximately 4.5% of the human genome. Alignment of the human chromosome 12 sequence across vertebrates reveals the origin of individual segments in chicken, and a unique history of rearrangement through rodent and primate lineages. The rate of base substitutions in recent evolutionary history shows an overall slowing in hominids compared with primates and rodents.


Asunto(s)
Cromosomas Humanos Par 12/genética , Animales , Composición de Base , Islas de CpG/genética , Evolución Molecular , Etiquetas de Secuencia Expresada , Genes/genética , Humanos , Desequilibrio de Ligamiento/genética , Repeticiones de Microsatélite/genética , Datos de Secuencia Molecular , Mutagénesis Insercional/genética , Pan troglodytes/genética , Análisis de Secuencia de ADN , Eliminación de Secuencia/genética , Elementos de Nucleótido Esparcido Corto/genética , Sintenía/genética
5.
Plant Physiol ; 139(3): 1323-37, 2005 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-16244158

RESUMEN

In the fully sequenced Arabidopsis (Arabidopsis thaliana) genome, many gene models are annotated as "hypothetical protein," whose gene structures are predicted solely by computer algorithms with no support from either expressed sequence matches from Arabidopsis, or nucleic acid or protein homologs from other species. In order to confirm their existence and predicted gene structures, a high-throughput method of rapid amplification of cDNA ends (RACE) was used to obtain their cDNA sequences from 11 cDNA populations. Primers from all of the 797 hypothetical genes on chromosome 2 were designed, and, through 5' and 3' RACE, clones from 506 genes were sequenced and cDNA sequences from 399 target genes were recovered. The cDNA sequences were obtained by assembling their 5' and 3' RACE polymerase chain reaction products. These sequences revealed that (1) the structures of 151 hypothetical genes were different from their predictions; (2) 116 hypothetical genes had alternatively spliced transcripts and 187 genes displayed polyadenylation sites; and (3) there were transcripts arising from both strands, from the strand opposite to that of the prediction and possible dicistronic transcripts. Promoters from five randomly chosen hypothetical genes (At2g02540, At2g31270, At2g33640, At2g35550, and At2g36340) were cloned into report constructs, and their expressions are tissue or development stage specific. Our results indicate at least 50% of hypothetical genes on chromosome 2 are expressed in the cDNA populations with about 38% of the gene structures differing from their predictions. Thus, by using this targeted approach, high-throughput RACE, we revealed numerous transcripts including many uncharacterized variants from these hypothetical genes.


Asunto(s)
Arabidopsis/genética , Cromosomas de las Plantas/genética , ADN Complementario/genética , Genes de Plantas/genética , Transcripción Genética/genética , Empalme Alternativo/genética , Arabidopsis/anatomía & histología , Codón Iniciador/genética , Codón de Terminación/genética , Genes Reporteros/genética , Genoma de Planta , Sistemas de Lectura Abierta/genética , Regiones Promotoras Genéticas/genética , Alineación de Secuencia , Análisis de Secuencia de ADN , Homología de Secuencia de Ácido Nucleico
6.
Genome Res ; 15(4): 487-95, 2005 Apr.
Artículo en Inglés | MEDLINE | ID: mdl-15805490

RESUMEN

Through comparative studies of the model organism Arabidopsis thaliana and its close relative Brassica oleracea, we have identified conserved regions that represent potentially functional sequences overlooked by previous Arabidopsis genome annotation methods. A total of 454,274 whole genome shotgun sequences covering 283 Mb (0.44 x) of the estimated 650 Mb Brassica genome were searched against the Arabidopsis genome, and conserved Arabidopsis genome sequences (CAGSs) were identified. Of these 229,735 conserved regions, 167,357 fell within or intersected existing gene models, while 60,378 were located in previously unannotated regions. After removal of sequences matching known proteins, CAGSs that were close to one another were chained together as potentially comprising portions of the same functional unit. This resulted in 27,347 chains of which 15,686 were sufficiently distant from existing gene annotations to be considered a novel conserved unit. Of 192 conserved regions examined, 58 were found to be expressed in our cDNA populations. Rapid amplification of cDNA ends (RACE) was used to obtain potentially full-length transcripts from these 58 regions. The resulting sequences led to the creation of 21 gene models at 17 new Arabidopsis loci and the addition of splice variants or updates to another 19 gene structures. In addition, CAGSs overlapping already annotated genes in Arabidopsis can provide guidance for manual improvement of existing gene models. Published genome-wide expression data based on whole genome tiling arrays and massively parallel signature sequencing were overlaid on the Brassica-Arabidopsis conserved sequences, and 1399 regions of intersection were identified. Collectively our results and these data sets suggest that several thousand new Arabidopsis genes remain to be identified and annotated.


Asunto(s)
Arabidopsis/genética , Brassica/genética , Genes de Plantas/genética , Genoma de Planta , Genómica/métodos , Mapeo Cromosómico , Cromosomas de las Plantas , Secuencia Conservada , ADN Complementario/genética , ADN de Plantas/genética , Bases de Datos Genéticas , Perfilación de la Expresión Génica , Modelos Genéticos
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...