Search | BVS CLAP/SMR-OPAS/OMS

The sequence of the human genome.

Venter, J C; Adams, M D; Myers, E W; Li, P W; Mural, R J; Sutton, G G; Smith, H O; Yandell, M; Evans, C A; Holt, R A; Gocayne, J D; Amanatides, P; Ballew, R M; Huson, D H; Wortman, J R; Zhang, Q; Kodira, C D; Zheng, X H; Chen, L; Skupski, M; Subramanian, G; Thomas, P D; Zhang, J; Gabor Miklos, G L; Nelson, C; Broder, S; Clark, A G; Nadeau, J; McKusick, V A; Zinder, N; Levine, A J; Roberts, R J; Simon, M; Slayman, C; Hunkapiller, M; Bolanos, R; Delcher, A; Dew, I; Fasulo, D; Flanigan, M; Florea, L; Halpern, A; Hannenhalli, S; Kravitz, S; Levy, S; Mobarry, C; Reinert, K; Remington, K; Abu-Threideh, J; Beasley, E.

Science ; 291(5507): 1304-51, 2001 Feb 16.

Article in English | MEDLINE | ID: mdl-11181995

ABSTRACT

A 2.91-billion base pair (bp) consensus sequence of the euchromatic portion of the human genome was generated by the whole-genome shotgun sequencing method. The 14.8-billion bp DNA sequence was generated over 9 months from 27,271,853 high-quality sequence reads (5.11-fold coverage of the genome) from both ends of plasmid clones made from the DNA of five individuals. Two assembly strategies (a whole-genome assembly and a regional chromosome assembly) were used, each combining sequence data from Celera and the publicly funded genome effort. The public data were shredded into 550-bp segments to create a 2.9-fold coverage of those genome regions that had been sequenced, without including biases inherent in the cloning and assembly procedure used by the publicly funded group. This brought the effective coverage in the assemblies to eightfold, reducing the number and size of gaps in the final assembly over what would be obtained with 5.11-fold coverage. The two assembly strategies yielded very similar results that largely agree with independent mapping data. The assemblies effectively cover the euchromatic regions of the human chromosomes. More than 90% of the genome is in scaffold assemblies of 100,000 bp or more, and 25% of the genome is in scaffolds of 10 million bp or larger. Analysis of the genome sequence revealed 26,588 protein-encoding transcripts for which there was strong corroborating evidence and approximately 12,000 additional computationally derived genes with mouse matches or other weak supporting evidence. Although gene-dense clusters are obvious, almost half the genes are dispersed in low G+C sequence separated by large tracts of apparently noncoding sequence. Only 1.1% of the genome is spanned by exons, whereas 24% is in introns, with 75% of the genome being intergenic DNA. Duplications of segmental blocks, ranging in size up to chromosomal lengths, are abundant throughout the genome and reveal a complex evolutionary history. Comparative genomic analysis indicates vertebrate expansions of genes associated with neuronal function, with tissue-specific developmental regulation, and with the hemostasis and immune systems. DNA sequence comparisons between the consensus sequence and publicly funded genome data provided locations of 2.1 million single-nucleotide polymorphisms (SNPs). A random pair of human haploid genomes differed at a rate of 1 bp per 1250 on average, but there was marked heterogeneity in the level of polymorphism across the genome. Less than 1% of all SNPs resulted in variation in proteins, but the task of determining which SNPs have functional consequences remains an open challenge.
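
The coverage figures quoted in this abstract can be cross-checked with simple arithmetic. The sketch below is not taken from the paper: the function names are hypothetical, and adding the two coverage values is only an approximation, since the 2.9-fold public coverage applies to the regions the public effort had sequenced rather than to the whole genome.

```python
# Back-of-the-envelope check of figures quoted in the abstract.
# All constants come from the abstract; the helper names are illustrative only.

TOTAL_READ_BP = 14.8e9          # total bases in the Celera reads
NUM_READS = 27_271_853          # number of high-quality reads
CELERA_COVERAGE = 5.11          # fold coverage from Celera reads
PUBLIC_SHRED_COVERAGE = 2.9     # fold coverage from shredded public data
CONSENSUS_BP = 2.91e9           # consensus sequence length
BP_PER_DIFFERENCE = 1250        # one difference per 1250 bp between haploid genomes


def mean_read_length(total_bp: float, num_reads: int) -> float:
    """Average read length implied by total bases and read count."""
    return total_bp / num_reads


def implied_genome_size(total_bp: float, fold_coverage: float) -> float:
    """Genome size implied by coverage = total bases / genome size."""
    return total_bp / fold_coverage


def combined_coverage(*fold_coverages: float) -> float:
    """Approximate effective coverage as the sum of independent coverages."""
    return sum(fold_coverages)


def expected_pairwise_differences(genome_bp: float, bp_per_difference: float) -> float:
    """Expected number of differing sites between two haploid genomes."""
    return genome_bp / bp_per_difference


if __name__ == "__main__":
    print(f"mean read length: {mean_read_length(TOTAL_READ_BP, NUM_READS):.0f} bp")
    print(f"implied genome size: {implied_genome_size(TOTAL_READ_BP, CELERA_COVERAGE) / 1e9:.2f} Gbp")
    print(f"combined coverage: {combined_coverage(CELERA_COVERAGE, PUBLIC_SHRED_COVERAGE):.2f}-fold")
    print(f"expected pairwise differences: {expected_pairwise_differences(CONSENSUS_BP, BP_PER_DIFFERENCE):,.0f}")
```

Under these assumptions the summed coverage is close to the "eightfold" effective coverage stated in the abstract, and the implied genome size is close to the 2.91-billion bp consensus length.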

Subjects

Genome, Human; Human Genome Project; Sequence Analysis, DNA; Algorithms; Animals; Chromosome Banding; Chromosome Mapping; Chromosomes, Artificial, Bacterial; Computational Biology; Consensus Sequence; CpG Islands; DNA, Intergenic; Databases, Factual; Evolution, Molecular; Exons; Female; Gene Duplication; Genes; Genetic Variation; Humans; Introns; Male; Phenotype; Physical Chromosome Mapping; Polymorphism, Single Nucleotide; Proteins/genetics; Proteins/physiology; Pseudogenes; Repetitive Sequences, Nucleic Acid; Retroelements; Sequence Analysis, DNA/methods; Species Specificity

Comparative genomics of the eukaryotes.

Rubin, G M; Yandell, M D; Wortman, J R; Gabor Miklos, G L; Nelson, C R; Hariharan, I K; Fortini, M E; Li, P W; Apweiler, R; Fleischmann, W; Cherry, J M; Henikoff, S; Skupski, M P; Misra, S; Ashburner, M; Birney, E; Boguski, M S; Brody, T; Brokstein, P; Celniker, S E; Chervitz, S A; Coates, D; Cravchik, A; Gabrielian, A; Galle, R F; Gelbart, W M; George, R A; Goldstein, L S; Gong, F; Guan, P; Harris, N L; Hay, B A; Hoskins, R A; Li, J; Li, Z; Hynes, R O; Jones, S J; Kuehl, P M; Lemaitre, B; Littleton, J T; Morrison, D K; Mungall, C; O'Farrell, P H; Pickeral, O K; Shue, C; Vosshall, L B; Zhang, J; Zhao, Q; Zheng, X H; Lewis, S.

Science ; 287(5461): 2204-15, 2000 Mar 24.

Article in English | MEDLINE | ID: mdl-10731134

ABSTRACT

A comparative analysis of the genomes of Drosophila melanogaster, Caenorhabditis elegans, and Saccharomyces cerevisiae, and of the proteins they are predicted to encode, was undertaken in the context of cellular, developmental, and evolutionary processes. The nonredundant protein sets of flies and worms are similar in size and are only twice that of yeast, but different gene families are expanded in each genome, and the multidomain proteins and signaling pathways of the fly and worm are far more complex than those of yeast. The fly has orthologs to 177 of the 289 human disease genes examined and provides the foundation for rapid analysis of some of the basic processes involved in human disease.

Subjects

Caenorhabditis elegans/genetics; Drosophila melanogaster/genetics; Genome; Proteome; Saccharomyces cerevisiae/genetics; Animals; Apoptosis/genetics; Biological Evolution; Caenorhabditis elegans/chemistry; Caenorhabditis elegans/physiology; Cell Adhesion/genetics; Cell Cycle/genetics; Drosophila melanogaster/chemistry; Drosophila melanogaster/physiology; Fungal Proteins/chemistry; Fungal Proteins/genetics; Genes, Duplicate; Genetic Diseases, Inborn/genetics; Genetics, Medical; Helminth Proteins/chemistry; Helminth Proteins/genetics; Humans; Immunity/genetics; Insect Proteins/chemistry; Insect Proteins/genetics; Multigene Family; Neoplasms/genetics; Protein Structure, Tertiary; Saccharomyces cerevisiae/chemistry; Saccharomyces cerevisiae/physiology; Signal Transduction/genetics

Suppressed recombination and a pairing anomaly on the mating-type chromosome of Neurospora tetrasperma.

Gallegos, A; Jacobson, D J; Raju, N B; Skupski, M P; Natvig, D O.

Genetics ; 154(2): 623-33, 2000 Feb.

Article in English | MEDLINE | ID: mdl-10655216

ABSTRACT

Neurospora crassa and related heterothallic ascomycetes produce eight homokaryotic self-sterile ascospores per ascus. In contrast, asci of N. tetrasperma contain four self-fertile ascospores each with nuclei of both mating types (matA and mata). The self-fertile ascospores of N. tetrasperma result from first-division segregation of mating type and nuclear spindle overlap at the second meiotic division and at a subsequent mitotic division. Recently, Merino et al. presented population-genetic evidence that crossing over is suppressed on the mating-type chromosome of N. tetrasperma, thereby preventing second-division segregation of mating type and the formation of self-sterile ascospores. The present study experimentally confirmed suppressed crossing over for a large segment of the mating-type chromosome by examining segregation of markers in crosses of wild strains. Surprisingly, our study also revealed a region on the far left arm where recombination is obligatory. In cytological studies, we demonstrated that suppressed recombination correlates with an extensive unpaired region at pachytene. Taken together, these results suggest an unpaired region adjacent to one or more paired regions, analogous to the nonpairing and pseudoautosomal regions of animal sex chromosomes. The observed pairing and obligate crossover likely reflect mechanisms to ensure chromosome disjunction.

Subjects

Chromosomes, Fungal; Neurospora/genetics; Recombination, Genetic; Base Sequence; Crosses, Genetic; Crossing Over, Genetic; DNA Primers

A survey of human disease gene counterparts in the Drosophila genome.

Fortini, M E; Skupski, M P; Boguski, M S; Hariharan, I K.

J Cell Biol ; 150(2): F23-30, 2000 Jul 24.

Article in English | MEDLINE | ID: mdl-10908582

Subjects

Disease Models, Animal; Drosophila melanogaster/genetics; Genetic Diseases, Inborn/genetics; Genome; Genomic Library; Animals; Humans; Sequence Homology, Nucleic Acid

Phylogenetic analysis of heterothallic Neurospora species.

Skupski, M P; Jackson, D A; Natvig, D O.

Fungal Genet Biol ; 21(1): 153-62, 1997 Feb.

Article in English | MEDLINE | ID: mdl-9126624

ABSTRACT

We examined the phylogenetic relationships among five heterothallic species of Neurospora using restriction fragment polymorphisms derived from cosmid probes and sequence data from the upstream regions of two genes, al-1 and frq. Distance, maximum likelihood, and parsimony trees derived from the data support the hypothesis that strains assigned to N. sitophila, N. discreta, and N. tetrasperma form respective monophyletic groups. Strains assigned to N. intermedia and N. crassa, however, did not form two respective monophyletic groups, consistent with a previous suggestion based on analysis of mitochondrial DNAs that N. crassa and N. intermedia may be incompletely resolved sister taxa. Trees derived from restriction fragments and the al-1 sequence position N. tetrasperma as the sister species of N. sitophila. None of the trees produced by our data supported a previous analysis of sequences in the region of the mating type idiomorph that grouped N. crassa and N. sitophila as sister taxa, as well as N. intermedia and N. tetrasperma as sister taxa. Moreover, sequences from al-1, frq, and the mating-type region produced different trees when analyzed separately. The lack of consensus obtained with different sequences could result from the sorting of ancestral polymorphism during speciation or gene flow across species boundaries, or both.

Subjects

Neurospora/genetics; Phylogeny; Genes, Fungal/genetics; Genes, Mating Type, Fungal; Molecular Sequence Data; Polymorphism, Restriction Fragment Length; Sequence Analysis, DNA

The genome sequence DataBase.

Harger, C; Chen, G; Farmer, A; Huang, W; Inman, J; Kiphart, D; Schilkey, F; Skupski, M P; Weller, J.

Nucleic Acids Res ; 28(1): 31-2, 2000 Jan 01.

Article in English | MEDLINE | ID: mdl-10592174

ABSTRACT

The Genome Sequence DataBase (GSDB) is a database of publicly available nucleotide sequences and their associated biological and bibliographic information. Several notable changes have occurred in the past year: GSDB stopped accepting data submissions from researchers; ownership of data submitted to GSDB was transferred to GenBank; sequence analysis capabilities were expanded to include Smith-Waterman and Frame Search; and Sequence Viewer became available to Mac users. The content of GSDB remains up-to-date because publicly available data is acquired from the International Nucleotide Sequence Database Collaboration databases (IC) on a nightly basis. This allows GSDB to continue providing researchers with the ability to analyze, query and retrieve nucleotide sequences in the database. GSDB and its related tools are freely accessible from the URL: http://www.ncgr.org

Subjects

Databases, Factual; Genome; Information Storage and Retrieval; Ownership; Sequence Analysis

The Genome Sequence DataBase version 1.0 (GSDB): from low pass sequences to complete genomes.

Harger, C; Skupski, M; Allen, E; Clark, C; Crowley, D; Dickinson, E; Easley, D; Espinosa-Lujan, A; Farmer, A; Fields, C; Flores, L; Harris, L; Keen, G; Manning, M; McLeod, M; O'Neill, J; Pumilia, M; Reinert, R; Rider, D; Rohrlich, J; Romero, Y; Schwertfeger, J; Seluja, G; Siepel, A; Schad, P A.

Nucleic Acids Res ; 25(1): 18-23, 1997 Jan 01.

Article in English | MEDLINE | ID: mdl-9016496

ABSTRACT

The Genome Sequence DataBase (GSDB) has completed its conversion to an improved relational database. The new database, GSDB 1.0, is fully operational and publicly available. Data contributions, including both original sequence submissions and community annotation, are being accomplished through the use of a graphical client-server interface tool, the GSDB Annotator, and via GIO (GSDB Input/Output) files. Data retrieval services are being provided through a new Web Query Tool and direct SQL. All methods of data contribution and data retrieval fully support the new data types that have been incorporated into GSDB, including discontiguous sequences, multiple sequence alignments, and community annotation.

Subjects

Base Sequence; Databases, Factual; Animals; Humans; Private Sector; Software

The Genome Sequence DataBase: towards an integrated functional genomics resource.

Skupski, M P; Booker, M; Farmer, A; Harpold, M; Huang, W; Inman, J; Kiphart, D; Kodira, C; Root, S; Schilkey, F; Schwertfeger, J; Siepel, A; Stamper, D; Thayer, N; Thompson, R; Wortman, J; Zhuang, J J; Harger, C.

Nucleic Acids Res ; 27(1): 35-8, 1999 Jan 01.

Article in English | MEDLINE | ID: mdl-9847136

ABSTRACT

During 1998 the primary focus of the Genome Sequence DataBase (GSDB; http://www.ncgr.org/gsdb) located at the National Center for Genome Resources (NCGR) has been to improve data quality, improve data collections, and provide new methods and tools to access and analyze data. Data quality has been improved by extensive curation of certain data fields necessary for maintaining data collections and for using certain tools. Data quality has also been increased by improvements to the suite of programs that import data from the International Nucleotide Sequence Database Collaboration (IC). The Sequence Tag Alignment and Consensus Knowledgebase (STACK), a database of human expressed gene sequences developed by the South African National Bioinformatics Institute (SANBI), became available within the last year, allowing public access to this valuable resource of expressed sequences. Data access was improved by the addition of the Sequence Viewer, a platform-independent graphical viewer for GSDB sequence data. This tool has also been integrated with other searching and data retrieval tools. A BLAST homology search service was also made available, allowing researchers to search all of the data, including the unique data, that are available from GSDB. These improvements are designed to make GSDB more accessible to users, extend the rich searching capability already present in GSDB, and facilitate the transition to an integrated system containing many different types of biological data.

Subjects

Base Sequence; Databases, Factual; Genome; Information Storage and Retrieval; Animals; Computational Biology; Consensus Sequence; Gene Expression; Genome, Human; Humans; Sequence Alignment

The Genome Sequence DataBase (GSDB): improving data quality and data access.

Harger, C; Skupski, M; Bingham, J; Farmer, A; Hoisie, S; Hraber, P; Kiphart, D; Krakowski, L; McLeod, M; Schwertfeger, J; Seluja, G; Siepel, A; Singh, G; Stamper, D; Steadman, P; Thayer, N; Thompson, R; Wargo, P; Waugh, M; Zhuang, J J; Schad, P A.

Nucleic Acids Res ; 26(1): 21-6, 1998 Jan 01.

Article in English | MEDLINE | ID: mdl-9399793

ABSTRACT

In 1997 the primary focus of the Genome Sequence DataBase (GSDB; www.ncgr.org/gsdb) located at the National Center for Genome Resources was to improve data quality and accessibility. Efforts to increase the quality of data within the database included two major projects: one to identify and remove all vector contamination from sequences in the database and one to create premier sequence sets (including both alignments and discontiguous sequences). Data accessibility was improved during the course of the last year in several ways. First, a graphical database sequence viewer was made available to researchers. Second, an update process was implemented for the web-based query tool, Maestro. Third, a web-based tool, Excerpt, was developed to retrieve selected regions of any sequence in the database. Lastly, a GSDB flatfile that contains annotation unique to GSDB (e.g., sequence analysis and alignment data) was developed. Additionally, the GSDB web site provides a tool for the detection of matrix attachment regions (MARs), which can be used to identify regions of high coding potential. The ultimate goal of this work is to make GSDB a more useful resource for genomic comparison studies and gene level studies by improving data quality and by providing data access capabilities that are consistent with the needs of both types of studies.

Subjects

Databases, Factual; Genome; Base Sequence; Computer Communication Networks; Forecasting; Information Storage and Retrieval
