Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 28
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
Pharmacogenomics J ; 17(2): 137-145, 2017 03.
Artigo em Inglês | MEDLINE | ID: mdl-26856248

RESUMO

Variation in the expression level and activity of genes involved in drug disposition and action ('pharmacogenes') can affect drug response and toxicity, especially when in tissues of pharmacological importance. Previous studies have relied primarily on microarrays to understand gene expression differences, or have focused on a single tissue or small number of samples. The goal of this study was to use RNA-sequencing (RNA-seq) to determine the expression levels and alternative splicing of 389 Pharmacogenomics Research Network pharmacogenes across four tissues (liver, kidney, heart and adipose) and lymphoblastoid cell lines, which are used widely in pharmacogenomics studies. Analysis of RNA-seq data from 139 different individuals across the 5 tissues (20-45 individuals per tissue type) revealed substantial variation in both expression levels and splicing across samples and tissue types. Comparison with GTEx data yielded a consistent picture. This in-depth exploration also revealed 183 splicing events in pharmacogenes that were previously not annotated. Overall, this study serves as a rich resource for the research community to inform biomarker and drug discovery and use.


Assuntos
Processamento Alternativo , Biologia Computacional , Sequenciamento de Nucleotídeos em Larga Escala , Farmacogenética , Variantes Farmacogenômicos , Análise de Sequência de RNA , Transcriptoma , Tecido Adiposo/metabolismo , Linhagem Celular , Bases de Dados Genéticas , Genótipo , Humanos , Rim/metabolismo , Fígado/metabolismo , Miocárdio/metabolismo , Fenótipo
2.
Science ; 309(5740): 1559-63, 2005 Sep 02.
Artigo em Inglês | MEDLINE | ID: mdl-16141072

RESUMO

This study describes comprehensive polling of transcription start and termination sites and analysis of previously unidentified full-length complementary DNAs derived from the mouse genome. We identify the 5' and 3' boundaries of 181,047 transcripts with extensive variation in transcripts arising from alternative promoter usage, splicing, and polyadenylation. There are 16,247 new mouse protein-coding transcripts, including 5154 encoding previously unidentified proteins. Genomic mapping of the transcriptome reveals transcriptional forests, with overlapping transcription on both strands, separated by deserts in which few transcripts are observed. The data provide a comprehensive platform for the comparative analysis of mammalian transcriptional regulation in differentiation and development.


Assuntos
Genoma , Camundongos/genética , Regiões Terminadoras Genéticas , Sítio de Iniciação de Transcrição , Transcrição Gênica , Regiões 3' não Traduzidas , Animais , Sequência de Bases , Sequência Conservada , DNA Complementar/química , Genoma Humano , Genômica , Humanos , Regiões Promotoras Genéticas , Proteínas/genética , RNA/química , RNA/classificação , Splicing de RNA , RNA não Traduzido/química , Sequências Reguladoras de Ácido Ribonucleico
3.
Nat Rev Genet ; 2(10): 801-9, 2001 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-11584296

RESUMO

Structural genomics projects aim to provide an experimental or computational three-dimensional model structure for all of the tractable macromolecules that are encoded by complete genomes. To this end, pilot centres worldwide are now exploring the feasibility of large-scale structure determination. Their experimental structures and computational models are expected to yield insight into the molecular function and mechanism of thousands of proteins. The pervasiveness of this information is likely to change the use of structure in molecular biology and biochemistry.


Assuntos
Biologia Computacional/métodos , Genômica/métodos , Modelos Moleculares , Conformação Proteica , Sequência de Aminoácidos , Animais , Biologia Computacional/organização & administração , Cristalografia por Raios X , Bases de Dados Factuais , Bases de Dados de Proteínas , Estudos de Viabilidade , Previsões , Humanos , Laboratórios , Modelos Estruturais , Dados de Sequência Molecular , Ressonância Magnética Nuclear Biomolecular , Projetos Piloto , Estrutura Terciária de Proteína , Proteínas/química , Proteínas/genética , Proteínas/isolamento & purificação , Homologia de Sequência de Aminoácidos , Relação Estrutura-Atividade
4.
Genome Biol ; 2(12): RESEARCH0050, 2001.
Artigo em Inglês | MEDLINE | ID: mdl-11790253

RESUMO

BACKGROUND: Sequestration of transcription factors in the membrane is emerging as an important mechanism for the regulation of gene expression. A handful of membrane-spanning transcription factors has been previously identified whose access to the nucleus is regulated by proteolytic cleavage from the membrane. To investigate the existence of other transmembrane transcription factors, we analyzed computationally all proteins in SWISS-PROT/TrEMBL for the combined presence of a DNA-binding domain and a transmembrane segment. RESULTS: Using Pfam hidden Markov models and four transmembrane-prediction programs, we identified with high confidence 76 membrane-spanning transcription factors in SWISS-PROT/TrEMBL. Analysis of the distribution of two proteins predicted by our method, MTJ1 and DMRT2, confirmed their localization to intracellular membrane compartments. Furthermore, elimination of the predicted transmembrane segment led to nuclear localization for each of these proteins. CONCLUSIONS: Our analysis uncovered a wealth of predicted membrane-spanning transcription factors that are structurally and taxonomically diverse, 56 of which lack experimental annotation. Seventy-five of the proteins are modular in structure, suggesting that a single proteolysis may be sufficient to liberate a DNA-binding domain from the membrane. This study provides grounds for investigations into the stimuli and mechanisms that release this intriguing class of transcription factors from membranes.


Assuntos
Biologia Computacional/métodos , Fatores de Transcrição/química , Fatores de Transcrição/metabolismo , Transporte Ativo do Núcleo Celular , Animais , Células COS , Núcleo Celular/metabolismo , Sequência Conservada , Bases de Dados de Proteínas , Membranas Intracelulares/metabolismo , Cadeias de Markov , Estrutura Terciária de Proteína , Deleção de Sequência , Fatores de Transcrição/genética
5.
Nat Struct Biol ; 7 Suppl: 967-9, 2000 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-11104002

RESUMO

Structural genomics aims to use high-throughput structure determination and computational analysis to provide three-dimensional models of every tractable protein. The process of choosing proteins for experimental structure characterization is known as target selection. In this nomenclature, the targets are regions of proteins to be studied by crystallography or NMR. Selection of the targets is principally a computational process of restricting candidate proteins to those that are tractable and of unknown structure, and prioritizing according to expected interest and accessibility.


Assuntos
Biologia Computacional/métodos , Genômica , Proteínas/química , Proteínas/classificação , Cristalografia por Raios X , Bases de Dados como Assunto , Internet , Modelos Moleculares , Ressonância Magnética Nuclear Biomolecular , Conformação Proteica , Proteínas/genética , Proteínas/metabolismo , Relação Estrutura-Atividade
6.
Protein Sci ; 9(1): 197-200, 2000 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-10739263

RESUMO

Structural genomics projects aim to provide an experimental structure or a good model for every protein in all completed genomes. Most of the experimental work for these projects will be directed toward proteins whose fold cannot be readily recognized by simple sequence comparison with proteins of known structure. Based on the history of proteins classified in the SCOP structure database, we expect that only about a quarter of the early structural genomics targets will have a new fold. Among the remaining ones, about half are likely to be evolutionarily related to proteins of known structure, even though the homology could not be readily detected by sequence analysis.


Assuntos
Genoma , Proteínas/química , Bases de Dados Factuais , Dobramento de Proteína , Proteínas/genética
7.
Nucleic Acids Res ; 28(1): 254-6, 2000 Jan 01.
Artigo em Inglês | MEDLINE | ID: mdl-10592239

RESUMO

The ASTRAL compendium provides several databases and tools to aid in the analysis of protein structures, particularly through the use of their sequences. The SPACI scores included in the system summarize the overall characteristics of a protein structure. A structural alignments database indicates residue equivalencies in superimposed protein domain structures. The PDB sequence-map files provide a linkage between the amino acid sequence of the molecule studied (SEQRES records in a database entry) and the sequence of the atoms experimentally observed in the structure (ATOM records). These maps are combined with information in the SCOPdatabase to provide sequences of protein domains. Selected subsets of the domain database, with varying degrees of similarity measured in several different ways, are also available. ASTRALmay be accessed at http://astral.stanford.edu/


Assuntos
Sistemas de Gerenciamento de Base de Dados , Bases de Dados Factuais , Conformação Proteica , Sequência de Aminoácidos , Homologia de Sequência de Aminoácidos
8.
Nucleic Acids Res ; 28(1): 257-9, 2000 Jan 01.
Artigo em Inglês | MEDLINE | ID: mdl-10592240

RESUMO

The Structural Classification of Proteins (SCOP) database provides a detailed and comprehensive description of the relationships of known protein structures. The classification is on hierarchical levels: the first two levels, family and superfamily, describe near and distant evolutionary relationships; the third, fold, describes geometrical relationships. The distinction between evolutionary relationships and those that arise from the physics and chemistry of proteins is a feature that is unique to this database so far. The sequences of proteins in SCOP provide the basis of the ASTRAL sequence libraries that can be used as a source of data to calibrate sequence search algorithms and for the generation of statistics on, or selections of, protein structures. Links can be made from SCOP to PDB-ISL: a library containing sequences homologous to proteins of known structure. Sequences of proteins of unknown structure can be matched to distantly related proteins of known structure by using pairwise sequence comparison methods to find homologues in PDB-ISL. The database and its associated files are freely accessible from a number of WWW sites mirrored from URL http://scop.mrc-lmb.cam.ac.uk/scop/


Assuntos
Bases de Dados Factuais , Conformação Proteica , Evolução Molecular , Armazenamento e Recuperação da Informação , Internet , Proteínas/química , Proteínas/genética
10.
Genome Res ; 9(1): 17-26, 1999 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-9927481

RESUMO

A sensitive protein-fold recognition procedure was developed on the basis of iterative database search using the PSI-BLAST program. A collection of 1193 position-dependent weight matrices that can be used as fold identifiers was produced. In the completely sequenced genomes, folds could be automatically identified for 20%-30% of the proteins, with 3%-6% more detectable by additional analysis of conserved motifs. The distribution of the most common folds is very similar in bacteria and archaea but distinct in eukaryotes. Within the bacteria, this distribution differs between parasitic and free-living species. In all analyzed genomes, the P-loop NTPases are the most abundant fold. In bacteria and archaea, the next most common folds are ferredoxin-like domains, TIM-barrels, and methyltransferases, whereas in eukaryotes, the second to fourth places belong to protein kinases, beta-propellers and TIM-barrels. The observed diversity of protein folds in different proteomes is approximately twice as high as it would be expected from a simple stochastic model describing a proteome as a finite sample from an infinite pool of proteins with an exponential distribution of the fold fractions. Distribution of the number of domains with different folds in one protein fits the geometric model, which is compatible with the evolution of multidomain proteins by random combination of domains. [Fold predictions for proteins from 14 proteomes are available on the World Wide Web at. The FIDs are available by anonymous ftp at the same location.]


Assuntos
Archaea/química , Bactérias/química , Células Eucarióticas/química , Dobramento de Proteína , Distribuições Estatísticas , Algoritmos , Biologia Computacional
11.
Nucleic Acids Res ; 27(1): 251-3, 1999 Jan 01.
Artigo em Inglês | MEDLINE | ID: mdl-9847193

RESUMO

The PRESAGE database is a collaborative resource for structural genomics. It provides a database of proteins to which researchers add annotations indicating current experimental status, structural predictions and suggestions. The database is intended to enhance communication among structural genomics researchers and aid dissemination of their results. The PRESAGE database may be accessed at http://presage.stanford.edu/


Assuntos
Bases de Dados Factuais , Proteínas/química , Proteínas/genética , Armazenamento e Recuperação da Informação , Internet , Modelos Moleculares , Conformação Proteica , Pesquisa , Homologia de Sequência de Aminoácidos
12.
Nucleic Acids Res ; 27(1): 254-6, 1999 Jan 01.
Artigo em Inglês | MEDLINE | ID: mdl-9847194

RESUMO

The Structural Classification of Proteins (SCOP) database provides a detailed and comprehensive description of the relationships of all known proteins structures. The classification is on hierarchical levels: the first two levels, family and superfamily, describe near and far evolutionary relationships; the third, fold, describes geometrical relationships. The distinction between evolutionary relationships and those that arise from the physics and chemistry of proteins is a feature that is unique to this database, so far. The database can be used as a source of data to calibrate sequence search algorithms and for the generation of population statistics on protein structures. The database and its associated files are freely accessible from a number of WWW sites mirrored from URL http://scop. mrc-lmb.cam.ac.uk/scop/


Assuntos
Bases de Dados Factuais , Conformação Proteica , Proteínas/química , Proteínas/classificação , Algoritmos , Evolução Molecular , Armazenamento e Recuperação da Informação , Internet , Dobramento de Proteína , Proteínas/genética , Alinhamento de Sequência , Homologia de Sequência de Aminoácidos , Estatística como Assunto
14.
Proc Natl Acad Sci U S A ; 95(11): 6073-8, 1998 May 26.
Artigo em Inglês | MEDLINE | ID: mdl-9600919

RESUMO

Pairwise sequence comparison methods have been assessed using proteins whose relationships are known reliably from their structures and functions, as described in the SCOP database [Murzin, A. G., Brenner, S. E., Hubbard, T. & Chothia C. (1995) J. Mol. Biol. 247, 536-540]. The evaluation tested the programs BLAST [Altschul, S. F., Gish, W., Miller, W., Myers, E. W. & Lipman, D. J. (1990). J. Mol. Biol. 215, 403-410], WU-BLAST2 [Altschul, S. F. & Gish, W. (1996) Methods Enzymol. 266, 460-480], FASTA [Pearson, W. R. & Lipman, D. J. (1988) Proc. Natl. Acad. Sci. USA 85, 2444-2448], and SSEARCH [Smith, T. F. & Waterman, M. S. (1981) J. Mol. Biol. 147, 195-197] and their scoring schemes. The error rate of all algorithms is greatly reduced by using statistical scores to evaluate matches rather than percentage identity or raw scores. The E-value statistical scores of SSEARCH and FASTA are reliable: the number of false positives found in our tests agrees well with the scores reported. However, the P-values reported by BLAST and WU-BLAST2 exaggerate significance by orders of magnitude. SSEARCH, FASTA ktup = 1, and WU-BLAST2 perform best, and they are capable of detecting almost all relationships between proteins whose sequence identities are >30%. For more distantly related proteins, they do much less well; only one-half of the relationships between proteins with 20-30% identity are found. Because many homologs have low sequence similarity, most distant relationships cannot be detected by any pairwise comparison method; however, those which are identified may be used with confidence.


Assuntos
Evolução Molecular , Proteínas/genética , Alinhamento de Sequência/métodos , Algoritmos , Animais , Bases de Dados Factuais , Humanos , Proteínas/química
15.
Acta Crystallogr D Biol Crystallogr ; 54(Pt 6 Pt 1): 1147-54, 1998 Nov 01.
Artigo em Inglês | MEDLINE | ID: mdl-10089491

RESUMO

The Structural Classification of Proteins (SCOP) database provides a detailed and comprehensive description of the relationships of all known protein structures. The classification is on hierarchical levels: the first two levels, family and superfamily, describe near and far evolutionary relationships; the third, fold, describes geometrical relationships. The distinction between evolutionary relationships and those that arise from the physics and chemistry of proteins is a feature that is unique to this database, so far. The database can be used as a source of data to calibrate sequence search algorithms and for the generation of population statistics on protein structures. The database and its associated files are freely accessible from a number of WWW sites mirrored from URL http://scop. mrc-lmb.cam.ac.uk/scop/.


Assuntos
Bases de Dados Factuais , Conformação Proteica , Alinhamento de Sequência/normas , Algoritmos , Sequência de Aminoácidos , Sistemas de Gerenciamento de Base de Dados , Estudos de Avaliação como Assunto , Dados de Sequência Molecular , Dobramento de Proteína , Alinhamento de Sequência/métodos
16.
Curr Opin Struct Biol ; 7(3): 369-76, 1997 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-9204279

RESUMO

Structural classifications aid the interpretation of proteins by describing degrees of structural and evolutionary relatedness. They have also recently revealed strikingly skewed distributions at all levels; for example, a small number of folds are far more common than others, and just a few superfamilies are known to have diverged widely. The classifications also provide an indication of the total number of superfamilies in nature.


Assuntos
Proteínas/química , Proteínas/classificação , Sítios de Ligação , Evolução Molecular , Conformação Proteica , Dobramento de Proteína , Proteínas/metabolismo
17.
Nucleic Acids Res ; 25(1): 236-9, 1997 Jan 01.
Artigo em Inglês | MEDLINE | ID: mdl-9016544

RESUMO

The Structural Classification of Proteins (SCOP) database provides a detailed and comprehensive description of the relationships of all known proteins structures. The classification is on hierarchical levels: the first two levels, family and superfamily, describe near and far evolutionary relationships; the third, fold, describes geometrical relationships. The distinction between evolutionary relationships and those that arise from the physics and chemistry of proteins is a feature that is unique to this database, so far. SCOP also provides for each structure links to atomic co-ordinates, images of the structures, interactive viewers, sequence data, data on any conformational changes related to function and literature references. The database is freely accessible on the World Wide Web (WWW) with an entry point at URL http://scop.mrc-lmb.cam.ac.uk/scop/


Assuntos
Bases de Dados Factuais , Estrutura Terciária de Proteína , Proteínas/química , Sequência de Aminoácidos , Dobramento de Proteína , Estrutura Secundária de Proteína
20.
FEBS Lett ; 372(2-3): 215-21, 1995 Sep 25.
Artigo em Inglês | MEDLINE | ID: mdl-7556672

RESUMO

We have systematically compared structures of multi-helical DNA-binding domains (DBDs) which have been determined by crystallography or NMR spectroscopy. All the known multi-helical DBDs are very similar. The core of these structures consists of two alpha-helices in the helix-turn-helix combination, associated with one or two other helices. The structures can be classified according to either additional structural compositions or the configuration of the helices. Many DBDs, whose structures are currently unknown, have sequences which resemble those of known structures, permitting outlines of the new structures to be predicted.


Assuntos
Proteínas de Ligação a DNA/química , Dobramento de Proteína , Sequência de Aminoácidos , Sítios de Ligação , Proteínas de Ligação a DNA/classificação , Dados de Sequência Molecular , Análise de Sequência
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA