Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 14 de 14
Filtrar
Mais filtros

Base de dados
Tipo de documento
Intervalo de ano de publicação
1.
Nature ; 437(7058): 551-5, 2005 Sep 22.
Artigo em Inglês | MEDLINE | ID: mdl-16177791

RESUMO

Chromosome 18 appears to have the lowest gene density of any human chromosome and is one of only three chromosomes for which trisomic individuals survive to term. There are also a number of genetic disorders stemming from chromosome 18 trisomy and aneuploidy. Here we report the finished sequence and gene annotation of human chromosome 18, which will allow a better understanding of the normal and disease biology of this chromosome. Despite the low density of protein-coding genes on chromosome 18, we find that the proportion of non-protein-coding sequences evolutionarily conserved among mammals is close to the genome-wide average. Extending this analysis to the entire human genome, we find that the density of conserved non-protein-coding sequences is largely uncorrelated with gene density. This has important implications for the nature and roles of non-protein-coding sequence elements.


Assuntos
Cromossomos Humanos Par 18/genética , DNA/genética , Aneuploidia , Animais , Sequência Conservada/genética , Ilhas de CpG/genética , Éxons/genética , Etiquetas de Sequências Expressas , Genes/genética , Genoma Humano , Humanos , Íntrons/genética , Dados de Sequência Molecular , Análise de Sequência de DNA , Sintenia
2.
Nucleic Acids Res ; 30(1): 169-71, 2002 Jan 01.
Artigo em Inglês | MEDLINE | ID: mdl-11752283

RESUMO

Genew, the Human Gene Nomenclature Database, is the only resource that provides data for all human genes which have approved symbols. It is managed by the HUGO Gene Nomenclature Committee (HGNC) as a confidential database, containing over 16 000 records, 80% of which are represented on the Web by searchable text files. The data in Genew are highly curated by HGNC editors and gene records can be searched on the Web by symbol or name to directly retrieve information on gene symbol, gene name, cytogenetic location, OMIM number and PubMed ID. Data are integrated with other human gene databases, e.g. GDB, LocusLink and SWISS-PROT, and approved gene symbols are carefully co-ordinated with the Mouse Genome Database (MGD). Approved gene symbols are available for querying and browsing at http://www.gene.ucl.ac.uk/cgi-bin/nomenclature/searchgenes.pl.


Assuntos
Bases de Dados Genéticas , Terminologia como Assunto , Confidencialidade , Coleta de Dados , Sistemas de Gerenciamento de Base de Dados , Genes , Humanos , Armazenamento e Recuperação da Informação , Internet
3.
Nucleic Acids Res ; 32(Database issue): D255-7, 2004 Jan 01.
Artigo em Inglês | MEDLINE | ID: mdl-14681406

RESUMO

Genew, the Human Gene Nomenclature Database http://www.gene.ucl.ac.uk/cgi-bin/nomenclature/searchgenes.pl is the only resource that provides data for all human genes that have approved symbols. It is managed by the HUGO Gene Nomenclature Committee (HGNC) as a confidential database, containing over 22 000 records, 75% of which are represented online by a publicly searchable text file. Since 2002, there have been significant improvements to the Genew search engine. Additionally we have increased our capacity to analyse confidential sequence data, which has enabled us to manage the large numbers of gene symbol requests that we receive from the chromosome sequencing consortia.


Assuntos
Bases de Dados Genéticas , Genes , Terminologia como Assunto , Animais , Biologia Computacional , Humanos , Armazenamento e Recuperação da Informação , Internet , Interface Usuário-Computador
4.
BMC Bioinformatics ; 6: 149, 2005 Jun 16.
Artigo em Inglês | MEDLINE | ID: mdl-15958172

RESUMO

BACKGROUND: Massive text mining of the biological literature holds great promise of relating disparate information and discovering new knowledge. However, disambiguation of gene symbols is a major bottleneck. RESULTS: We developed a simple thesaurus-based disambiguation algorithm that can operate with very little training data. The thesaurus comprises the information from five human genetic databases and MeSH. The extent of the homonym problem for human gene symbols is shown to be substantial (33% of the genes in our combined thesaurus had one or more ambiguous symbols), not only because one symbol can refer to multiple genes, but also because a gene symbol can have many non-gene meanings. A test set of 52,529 Medline abstracts, containing 690 ambiguous human gene symbols taken from OMIM, was automatically generated. Overall accuracy of the disambiguation algorithm was up to 92.7% on the test set. CONCLUSION: The ambiguity of human gene symbols is substantial, not only because one symbol may denote multiple genes but particularly because many symbols have other, non-gene meanings. The proposed disambiguation approach resolves most ambiguities in our test set with high accuracy, including the important gene/not a gene decisions. The algorithm is fast and scalable, enabling gene-symbol disambiguation in massive text mining applications.


Assuntos
Algoritmos , Genes , Armazenamento e Recuperação da Informação/métodos , Terminologia como Assunto , Vocabulário Controlado , Bases de Dados Genéticas , Humanos , Simbolismo
5.
Pharmacogenetics ; 14(1): 1-18, 2004 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-15128046

RESUMO

OBJECTIVES: Completion of both the mouse and human genome sequences in the private and public sectors has prompted comparison between the two species at multiple levels. This review summarizes the cytochrome P450 (CYP) gene superfamily. For the first time, we have the ability to compare complete sets of CYP genes from two mammals. Use of the mouse as a model mammal, and as a surrogate for human biology, assumes reasonable similarity between the two. It is therefore of interest to catalog the genetic similarities and differences, and to clarify the limits of extrapolation from mouse to human. METHODS: Data-mining methods have been used to find all the mouse and human CYP sequences; this includes 102 putatively functional genes and 88 pseudogenes in the mouse, and 57 putatively functional genes and 58 pseudogenes in the human. Comparison is made between all these genes, especially the seven main CYP gene clusters. RESULTS AND CONCLUSIONS: The seven CYP clusters are greatly expanded in the mouse with 72 functional genes versus only 27 in the human, while many pseudogenes are present; presumably this phenomenon will be seen in many other gene superfamily clusters. Complete identification of all pseudogene sequences is likely to be clinically important, because some of these highly similar exons can interfere with PCR-based genotyping assays. A naming procedure for each of four categories of CYP pseudogenes is proposed, and we encourage various gene nomenclature committees to consider seriously the adoption and application of this pseudogene nomenclature system.


Assuntos
Processamento Alternativo , Sistema Enzimático do Citocromo P-450/genética , Pseudogenes , Terminologia como Assunto , Animais , Humanos , Camundongos , Família Multigênica
7.
Genomics ; 86(2): 242-51, 2005 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-15922553

RESUMO

The ARID is an ancient DNA-binding domain that is conserved throughout the evolution of higher eukaryotes. The ARID consensus sequence spans about 100 amino acid residues, and structural studies identify the major groove contact site as a modified helix-turn-helix motif. ARID-containing proteins exhibit a range of cellular functions, including participation in chromatin remodeling, and regulation of gene expression during cell growth, differentiation, and development. A subset of ARID family proteins binds DNA specifically at AT-rich sites; the remainder bind DNA nonspecifically. Orthologs to each of the seven distinct subfamilies of mammalian ARID-containing proteins are found in insect genomes, indicating the minimum age for the organization of these higher metazoan subfamilies. Many of these ancestral genes were duplicated and fixed over time to yield the 15 ARID-containing genes that are found in the human, mouse, and dog genomes. This paper describes a nomenclature, recommended by the Mouse Genomic Nomenclature Committee (MGNC) and accepted by the Human Genome Organization (HUGO) Gene Nomenclature Committee, for these mammalian ARID-containing genes that reflects this evolutionary history.


Assuntos
Proteínas de Ligação a DNA/química , Proteínas de Ligação a DNA/classificação , Proteínas de Ligação a DNA/genética , Sequência de Aminoácidos , Animais , Sítios de Ligação , Evolução Biológica , DNA/química , Evolução Molecular , Genoma , Humanos , Cooperação Internacional , Camundongos , Modelos Moleculares , Dados de Sequência Molecular , Filogenia , Conformação Proteica , Estrutura Secundária de Proteína , Estrutura Terciária de Proteína , Homologia de Sequência de Aminoácidos , Terminologia como Assunto
8.
Genomics ; 81(6): 618-22, 2003 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-12782131

RESUMO

Nucleic acid helicases are characterized by the presence of the helicase domain containing eight motifs. The sequence of the helicase domain is used to classify helicases into families. To identify members of the DEAD and DEAH families of human RNA helicases, we used the helicase domain sequences to search the nonredundant peptide sequence database. We report the identification of 36 and 14 members of the DEAD and DEAH families of putative RNA helicases, including several novel genes. The gene symbol DDX had been used previously for both DEAD- and DEAH-box families. We have now adopted DDX and DHX symbols to denote DEAD- and DEAH-box families, respectively. Members of human DDX and DHX families of putative RNA helicases play roles in differentiation and carcinogenesis.


Assuntos
Família Multigênica , RNA Helicases/genética , Motivos de Aminoácidos , Sequência de Aminoácidos , RNA Helicases DEAD-box , Bases de Dados de Proteínas , Humanos , Proteínas de Membrana/genética , Proteínas de Neoplasias/genética , RNA Helicases/classificação
9.
Curr Protoc Hum Genet ; Appendix 1: Appendix 1C, 2003 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-18428336

RESUMO

Standard genetic nomenclature is necessary to help researchers, clinicians, and the public to access data on their genes of interest, and to communicate in a globally understood language of approved gene symbols. In both human and mouse, one unique symbol (acronym/abbreviation) and one name are assigned for each gene. Co-ordination between human and mouse gene nomenclature is a successful endeavor, due in part to the historical interaction between the two nomenclature committee groups. This interaction grew out of the Human Gene Mapping (HGM) Workshops. This appendix discusses development and organization of gene nomenclature, how to find a gene and how to name a new gene.


Assuntos
Genes , Animais , Humanos , Camundongos
10.
Hum Genomics ; 1(1): 66-71, 2003 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-15601535

RESUMO

Why is agreeing on one particular name for each gene important? As one genome after another becomes sequenced, it is imperative to consider the complexity of genes, genetic architecture, gene expression, gene-gene and gene-product interactions and evolutionary relatedness across species. To agree on a particular gene name not only makes one's own research easier, but will also be helpful to the present generation, as well as future generations, of graduate students and postdoctoral fellows who are about to enter genomics research.


Assuntos
Genoma Humano , Terminologia como Assunto , Bases de Dados Factuais , Evolução Molecular , Humanos , Internet , Dados de Sequência Molecular
11.
Nat Rev Genet ; 5(12): 889-99, 2004 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-15573121

RESUMO

The major histocompatibility complex (MHC) is the most important region in the vertebrate genome with respect to infection and autoimmunity, and is crucial in adaptive and innate immunity. Decades of biomedical research have revealed many MHC genes that are duplicated, polymorphic and associated with more diseases than any other region of the human genome. The recent completion of several large-scale studies offers the opportunity to assimilate the latest data into an integrated gene map of the extended human MHC. Here, we present this map and review its content in relation to paralogy, polymorphism, immune function and disease.


Assuntos
Genoma Humano , Complexo Principal de Histocompatibilidade , Doenças Autoimunes/genética , Mapeamento Cromossômico , Cromossomos Humanos Par 6 , Humanos , Imunidade , Família Multigênica , Polimorfismo Genético , RNA de Transferência/genética
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA