Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 6 de 6
Filtrar
Mais filtros

Base de dados
País/Região como assunto
Tipo de documento
Intervalo de ano de publicação
1.
Nucleic Acids Res ; 40(1): 270-83, 2012 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-21911356

RESUMO

RNA polymerase (Pol) III synthesizes the tRNAs, the 5S ribosomal RNA and a small number of untranslated RNAs. In vitro, it also transcribes short interspersed nuclear elements (SINEs). We investigated the distribution of Pol III and its associated transcription factors on the genome of mouse embryonic stem cells using a highly specific tandem ChIP-Seq method. Only a subset of the annotated class III genes was bound and thus transcribed. A few hundred SINEs were associated with the Pol III transcription machinery. We observed that Pol III and its transcription factors were present at 30 unannotated sites on the mouse genome, only one of which was conserved in human. An RNA was associated with >80% of these regions. More than 2200 regions bound by TFIIIC transcription factor were devoid of Pol III. These sites were associated with cohesins and often located close to CTCF-binding sites, suggesting that TFIIIC might cooperate with these factors to organize the chromatin. We also investigated the genome-wide distribution of the ubiquitous TFIIS variant, TCEA1. We found that, as in Saccharomyces cerevisiae, TFIIS is associated with class III genes and also with SINEs suggesting that TFIIS is a Pol III transcription factor in mammals.


Assuntos
Células-Tronco Embrionárias/metabolismo , RNA Polimerase III/metabolismo , Transcrição Gênica , Fatores de Elongação da Transcrição/metabolismo , Animais , Sítios de Ligação , Fator 1 de Resposta a Butirato , Linhagem Celular , Cromatina/metabolismo , Imunoprecipitação da Cromatina/métodos , Genoma , Camundongos , Proteínas Nucleares/metabolismo , RNA Polimerase II/metabolismo , RNA Nuclear Pequeno/genética , RNA de Transferência/genética , Proteínas de Ligação a RNA/metabolismo , Análise de Sequência de DNA , Elementos Nucleotídeos Curtos e Dispersos , Fator de Transcrição TFIIIB/metabolismo , Fatores de Transcrição TFIII/metabolismo
2.
Nucleic Acids Res ; 30(1): 273-5, 2002 Jan 01.
Artigo em Inglês | MEDLINE | ID: mdl-11752313

RESUMO

SBASE (http://www.icgeb.trieste.it/sbase) is an online resource of protein domain sequences designed to facilitate detection of domain homologies based on a simple database search. The ninth release of the SBASE library of protein domain sequences contains 320 000 annotated structural, functional, ligand-binding and topogenic segments of proteins clustered into over 3481 domain groups and 483 protein families. Domain identification and functional prediction are based on a comparison of BLAST search outputs with a knowledge base of within-group ('self') and out-of-group ('non-self') similarities of the known domain groups. This is a memory-based approach wherein class-specific similarity functions are automatically learned from the database [Stanfill,C. and Waltz,D. (1986) COMMUN: ACM, 29, 1213-1228].


Assuntos
Bases de Dados de Proteínas , Estrutura Terciária de Proteína , Proteínas/química , Animais , Gráficos por Computador , Armazenamento e Recuperação da Informação , Internet , Ligantes , Proteínas/fisiologia , Homologia de Sequência de Aminoácidos , Relação Estrutura-Atividade
3.
Nucleic Acids Res ; 31(1): 403-5, 2003 Jan 01.
Artigo em Inglês | MEDLINE | ID: mdl-12520034

RESUMO

SBASE (http://www.icgeb.trieste.it/sbase) is an on-line collection of protein domain sequences and related computational tools designed to facilitate detection of domain homologies based on simple database search. The 10th 'jubilee release' of the SBASE library of protein domain sequences contains 1 052 904 protein sequence segments annotated by structure, function, ligand-binding or cellular topology, clustered into over 6000 domain groups. Domain identification and functional prediction are based on a comparison of BLAST search outputs with a knowledge base of biologically significant similarities extracted from known domain groups. The knowledge base is generated automatically for each domain group from the comparison of within-group ('self') and out-of-group ('non-self') similarities. This is a memory-based approach wherein group-specific similarity functions are automatically learned from the database.


Assuntos
Bases de Dados de Proteínas , Estrutura Terciária de Proteína , Proteínas/química , Animais , Homologia Estrutural de Proteína , Interface Usuário-Computador
4.
Nucleic Acids Res ; 30(10): 2212-23, 2002 May 15.
Artigo em Inglês | MEDLINE | ID: mdl-12000841

RESUMO

A computational method was developed for delineating connected gene neighborhoods in bacterial and archaeal genomes. These gene neighborhoods are not typically present, in their entirety, in any single genome, but are held together by overlapping, partially conserved gene arrays. The procedure was applied to comparing the orders of orthologous genes, which were extracted from the database of Clusters of Orthologous Groups of proteins (COGs), in 31 prokaryotic genomes and resulted in the identification of 188 clusters of gene arrays, which included 1001 of 2890 COGs. These clusters were projected onto actual genomes to produce extended neighborhoods including additional genes, which are adjacent to the genes from the clusters and are transcribed in the same direction, which resulted in a total of 2387 COGs being included in the neighborhoods. Most of the neighborhoods consist predominantly of genes united by a coherent functional theme, but also include a minority of genes without an obvious functional connection to the main theme. We hypothesize that although some of the latter genes might have unsuspected roles, others are maintained within gene arrays because of the advantage of expression at a level that is typical of the given neighborhood. We designate this phenomenon 'genomic hitchhiking'. The largest neighborhood includes 79 genes (COGs) and consists of overlapping, rearranged ribosomal protein superoperons; apparent genome hitchhiking is particularly typical of this neighborhood and other neighborhoods that consist of genes coding for translation machinery components. Several neighborhoods involve previously undetected connections between genes, allowing new functional predictions. Gene neighborhoods appear to evolve via complex rearrangement, with different combinations of genes from a neighborhood fixed in different lineages.


Assuntos
Genes Arqueais/genética , Genes Bacterianos/genética , Genoma Arqueal , Genoma Bacteriano , Algoritmos , Proteínas Arqueais/genética , Proteínas de Bactérias/genética , Bases de Dados Factuais , Ordem dos Genes , Proteínas Ribossômicas/genética
5.
Genetics ; 166(1): 351-72, 2004 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-15020430

RESUMO

We have studied a genome-wide set of single-nucleotide polymorphism (SNP) allele frequency measures for African-American, East Asian, and European-American samples. For this analysis we derived a simple, closed mathematical formulation for the spectrum of expected allele frequencies when the sampled populations have experienced nonstationary demographic histories. The direct calculation generates the spectrum orders of magnitude faster than coalescent simulations do and allows us to generate spectra for a large number of alternative histories on a multidimensional parameter grid. Model-fitting experiments using this grid reveal significant population-specific differences among the demographic histories that best describe the observed allele frequency spectra. European and Asian spectra show a bottleneck-shaped history: a reduction of effective population size in the past followed by a recent phase of size recovery. In contrast, the African-American spectrum shows a history of moderate but uninterrupted population expansion. These differences are expected to have profound consequences for the design of medical association studies. The analytical methods developed for this study, i.e., a closed mathematical formulation for the allele frequency spectrum, correcting the ascertainment bias introduced by shallow SNP sampling, and dealing with variable sample sizes provide a general framework for the analysis of public variation data.


Assuntos
Variação Genética , Genoma Humano , Negro ou Afro-Americano/genética , Alelos , América , Ásia , Biometria , Europa (Continente) , Frequência do Gene , Genética Populacional , Humanos , Desequilíbrio de Ligação , Modelos Genéticos , Polimorfismo de Nucleotídeo Único , Densidade Demográfica
6.
Proc Natl Acad Sci U S A ; 100(1): 376-81, 2003 Jan 07.
Artigo em Inglês | MEDLINE | ID: mdl-12502794

RESUMO

Single-nucleotide polymorphisms (SNPs) constitute the great majority of variations in the human genome, and as heritable variable landmarks they are useful markers for disease mapping and resolving population structure. Redundant coverage in overlaps of large-insert genomic clones, sequenced as part of the Human Genome Project, comprises a quarter of the genome, and it is representative in terms of base compositional and functional sequence features. We mined these regions to produce 500,000 high-confidence SNP candidates as a uniform resource for describing nucleotide diversity and its regional variation within the genome. Distributions of marker density observed at different overlap length scales under a model of recombination and population size change show that the history of the population represented by the public genome sequence is one of collapse followed by a recent phase of mild size recovery. The inferred times of collapse and recovery are Upper Paleolithic, in agreement with archaeological evidence of the initial modern human colonization of Europe.


Assuntos
Bases de Dados de Ácidos Nucleicos , Variação Genética , Genoma Humano , Frequência do Gene , Marcadores Genéticos , Genética Populacional , Humanos , Modelos Genéticos , Recombinação Genética , Reprodutibilidade dos Testes , Análise de Sequência de DNA , Fatores de Tempo
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA