Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 10 de 10
Filtrar
Mais filtros

Base de dados
Tipo de documento
País de afiliação
Intervalo de ano de publicação
1.
Nucleic Acids Res ; 52(D1): D138-D144, 2024 Jan 05.
Artigo em Inglês | MEDLINE | ID: mdl-37933855

RESUMO

The Gene Expression Omnibus (GEO) is an international public repository that archives gene expression and epigenomics data sets generated by next-generation sequencing and microarray technologies. Data are typically submitted to GEO by researchers in compliance with widespread journal and funder mandates to make generated data publicly accessible. The resource handles raw data files, processed data files and descriptive metadata for over 200 000 studies and 6.5 million samples, all of which are indexed, searchable and downloadable. Additionally, GEO offers web-based tools that facilitate analysis and visualization of differential gene expression. This article presents the current status and recent advancements in GEO, including the generation of consistently computed gene expression count matrices for thousands of RNA-seq studies, and new interactive graphical plots in GEO2R that help users identify differentially expressed genes and assess data set quality. The GEO repository is built and maintained by the National Center for Biotechnology Information (NCBI), a division of the National Library of Medicine (NLM), and is publicly accessible at https://www.ncbi.nlm.nih.gov/geo/.


Assuntos
Epigenômica , Perfilação da Expressão Gênica , Expressão Gênica , Bases de Dados Genéticas , Análise de Sequência com Séries de Oligonucleotídeos
2.
Nucleic Acids Res ; 41(Database issue): D991-5, 2013 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-23193258

RESUMO

The Gene Expression Omnibus (GEO, http://www.ncbi.nlm.nih.gov/geo/) is an international public repository for high-throughput microarray and next-generation sequence functional genomic data sets submitted by the research community. The resource supports archiving of raw data, processed data and metadata which are indexed, cross-linked and searchable. All data are freely available for download in a variety of formats. GEO also provides several web-based tools and strategies to assist users to query, analyse and visualize data. This article reports current status and recent database developments, including the release of GEO2R, an R-based web application that helps users analyse GEO data.


Assuntos
Bases de Dados Genéticas , Perfilação da Expressão Gênica , Genômica , Sequenciamento de Nucleotídeos em Larga Escala , Internet , Análise de Sequência com Séries de Oligonucleotídeos
3.
Nucleic Acids Res ; 39(Database issue): D225-9, 2011 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-21109532

RESUMO

NCBI's Conserved Domain Database (CDD) is a resource for the annotation of protein sequences with the location of conserved domain footprints, and functional sites inferred from these footprints. CDD includes manually curated domain models that make use of protein 3D structure to refine domain models and provide insights into sequence/structure/function relationships. Manually curated models are organized hierarchically if they describe domain families that are clearly related by common descent. As CDD also imports domain family models from a variety of external sources, it is a partially redundant collection. To simplify protein annotation, redundant models and models describing homologous families are clustered into superfamilies. By default, domain footprints are annotated with the corresponding superfamily designation, on top of which specific annotation may indicate high-confidence assignment of family membership. Pre-computed domain annotation is available for proteins in the Entrez/Protein dataset, and a novel interface, Batch CD-Search, allows the computation and download of annotation for large sets of protein queries. CDD can be accessed via http://www.ncbi.nlm.nih.gov/Structure/cdd/cdd.shtml.


Assuntos
Bases de Dados de Proteínas , Estrutura Terciária de Proteína , Sequência de Aminoácidos , Sequência Conservada , Modelos Biológicos , Proteínas/classificação , Análise de Sequência de Proteína
4.
Nucleic Acids Res ; 37(Database issue): D205-10, 2009 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-18984618

RESUMO

NCBI's Conserved Domain Database (CDD) is a collection of multiple sequence alignments and derived database search models, which represent protein domains conserved in molecular evolution. The collection can be accessed at http://www.ncbi.nlm.nih.gov/Structure/cdd/cdd.shtml, and is also part of NCBI's Entrez query and retrieval system, cross-linked to numerous other resources. CDD provides annotation of domain footprints and conserved functional sites on protein sequences. Precalculated domain annotation can be retrieved for protein sequences tracked in NCBI's Entrez system, and CDD's collection of models can be queried with novel protein sequences via the CD-Search service at http://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi. Starting with the latest version of CDD, v2.14, information from redundant and homologous domain models is summarized at a superfamily level, and domain annotation on proteins is flagged as either 'specific' (identifying molecular function with high confidence) or as 'non-specific' (identifying superfamily membership only).


Assuntos
Bases de Dados de Proteínas , Estrutura Terciária de Proteína , Sequência de Aminoácidos , Sequência Conservada , Proteínas/classificação , Alinhamento de Sequência , Análise de Sequência de Proteína
5.
Proteins ; 74(1): 122-32, 2009 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-18615713

RESUMO

The cyclin-dependent kinase 2 (cdk2) is a serine/threonine protein kinase that plays a key role in the cell cycle control system of all eukaryotic organisms. It has been a much studied drug target for potential anticancer therapy. Most cdk2 inhibitors in clinical development target almost exclusively the catalytic ATP-binding pocket of cdk2. However, several five amino-acid peptide inhibitors that are directed towards a noncatalytic binding pocket of cdk2 are reported here. Upon binding to this new pocket located at the cdk2 and cyclin interface, these peptide inhibitors are found to disrupt the cdk2/cyclin E complex partially and diminish its kinase activity in vitro.


Assuntos
Quinase 2 Dependente de Ciclina/antagonistas & inibidores , Quinase 2 Dependente de Ciclina/metabolismo , Sítios de Ligação , Simulação por Computador , Ciclina E/metabolismo , Quinase 2 Dependente de Ciclina/química , Humanos , Ligação Proteica
6.
Nucleic Acids Res ; 35(Database issue): D298-300, 2007 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-17135201

RESUMO

Three-dimensional (3D) structure is now known for a large fraction of all protein families. Thus, it has become rather likely that one will find a homolog with known 3D structure when searching a sequence database with an arbitrary query sequence. Depending on the extent of similarity, such neighbor relationships may allow one to infer biological function and to identify functional sites such as binding motifs or catalytic centers. Entrez's 3D-structure database, the Molecular Modeling Database (MMDB), provides easy access to the richness of 3D structure data and its large potential for functional annotation. Entrez's search engine offers several tools to assist biologist users: (i) links between databases, such as between protein sequences and structures, (ii) pre-computed sequence and structure neighbors, (iii) visualization of structure and sequence/structure alignment. Here, we describe an annotation service that combines some of these tools automatically, Entrez's 'Related Structure' links. For all proteins in Entrez, similar sequences with known 3D structure are detected by BLAST and alignments are recorded. The 'Related Structure' service summarizes this information and presents 3D views mapping sequence residues onto all 3D structures available in MMDB (http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=structure).


Assuntos
Bases de Dados de Proteínas , Modelos Moleculares , Conformação Proteica , Análise de Sequência de Proteína , Internet , Alinhamento de Sequência , Interface Usuário-Computador
7.
J Comput Chem ; 29(11): 1762-71, 2008 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-18351599

RESUMO

We adapt a combinatorial optimization algorithm, extremal optimization (EO), for the search problem in computational protein design. This algorithm takes advantage of the knowledge of local energy information and systematically improves on the residues that have high local energies. Power-law probability distributions are used to select the backbone sites to be improved on and the rotamer choices to be changed to. We compare this method with simulated annealing (SA) and motivate and present an improved method, which we call reference energy extremal optimization (REEO). REEO uses reference energies to convert a problem with a structured local-energy profile to one with more random profile, and extremal optimization proves to be extremely efficient for the latter problem. We show in detail the large improvement we have achieved using REEO as compared to simulated annealing and discuss a number of other heuristics we have attempted to date.


Assuntos
Algoritmos , Proteínas/química , Processos Estocásticos
8.
J Comput Chem ; 29(7): 1153-62, 2008 May.
Artigo em Inglês | MEDLINE | ID: mdl-18074340

RESUMO

Our goal is to develop accurate electrostatic models that can be implemented in current computational protein design protocols. To this end, we improve upon a previously reported pairwise decomposable, finite difference Poisson-Boltzmann (FDPB) model for protein design (Marshall et al., Protein Sci 2005, 14, 1293). The improvement involves placing generic sidechains at positions with unknown amino acid identity and explicitly capturing two-body perturbations to the dielectric environment. We compare the original and improved FDPB methods to standard FDPB calculations in which the dielectric environment is completely determined by protein atoms. The generic sidechain approach yields a two to threefold increase in accuracy per residue or residue pair over the original pairwise FDPB implementation, with no additional computational cost. Distance dependent dielectric and solvent-exclusion models were also compared with standard FDPB energies. The accuracy of the new pairwise FDPB method is shown to be superior to these models, even after reparameterization of the solvent-exclusion model.


Assuntos
Biologia Computacional/métodos , Simulação por Computador , Modelos Químicos , Proteínas/química , Aminoácidos/química , Desenho Assistido por Computador , Distribuição de Poisson , Reprodutibilidade dos Testes , Eletricidade Estática , Termodinâmica
9.
Proteins ; 57(3): 565-76, 2004 Nov 15.
Artigo em Inglês | MEDLINE | ID: mdl-15382246

RESUMO

Protein solvation energies are often taken to be proportional to solvent-accessible surface areas. Computation of these areas is numerically demanding and may become a bottleneck for folding and design applications. Fast graph-based methods, such as dead-end elimination (DEE), become possible if all energies, including solvation energies, are expressed as single-residue and pair-residue terms. To this end, Street and Mayo originated a pair-residue approximation for solvent-accessible surface areas (Street AG, Mayo SL. Pairwise calculation of protein solvent accessible surface areas. Fold Des 1998;3:253-258). The dominant source of error in this method is the overlapping burial of side-chain surfaces in the protein core. Here we report a new pair-residue approximation, which greatly reduces this overlap error by the use of optimized generic side-chains. We have tested the generic-side-chain method for the ten proteins studied by Street and Mayo and for 377 single-domain proteins from the CATH database (Orengo CA, Michie AD, Jones S, Jones DT, Swindells MB, Thornton JM. CATH-A hierarchic classification of protein domain structures. Structure 1997;5:1093-1108). With little additional cost in computation, the new method consistently reduces error for total areas and residue-by-residue areas by more than a factor of two. For example, the residue-by-residue error (for buried area) is reduced from 7.42 A(2) to 3.70 A(2). This difference translates into a solvation energy difference of approximately 0.2 kcal/mol per residue, amounting to a reduction in root-mean-square energy error of 2 kcal/mol for a 100 residue chain, a potentially critical difference for both protein folding and design applications.


Assuntos
Biologia Computacional/métodos , Proteínas/química , Proteínas/metabolismo , Solventes/metabolismo , Bases de Dados de Proteínas , Desenho de Fármacos , Dobramento de Proteína , Projetos de Pesquisa , Sensibilidade e Especificidade , Solventes/química , Propriedades de Superfície , Termodinâmica , Fatores de Tempo
10.
Virology ; 345(2): 373-89, 2006 Feb 20.
Artigo em Inglês | MEDLINE | ID: mdl-16289656

RESUMO

Although the introduction of highly active antiretroviral therapy (HAART) has led to a significant reduction in AIDS-related morbidity and mortality, unfortunately, many patients discontinue their initial HAART regimen, resulting in development of viral resistance. During HIV infection, the viral activator Tat is needed for viral progeny formation, and the basic and core domains of Tat are the most conserved parts of the protein. Here, we show that a Tat 41/44 peptide from the core domain can inhibit HIV-1 gene expression and replication. The peptides are not toxic to cells and target the Cdk2/Cyclin E complex, inhibiting the phosphorylation of serine 5 of RNAPII. Using the Cdk2 X-ray crystallography structure, we found that the low-energy wild-type peptides could bind to the ATP binding pocket, whereas the mutant peptide bound to the Cdk2 interface. Finally, we show that these peptides do not allow loading of the catalytic domain of the cdk/cyclin complex onto the HIV-1 promoter in vivo.


Assuntos
Produtos do Gene tat/química , Produtos do Gene tat/farmacologia , HIV-1/efeitos dos fármacos , Peptídeos/farmacologia , Replicação Viral/efeitos dos fármacos , Sequência de Aminoácidos , Sítios de Ligação , Simulação por Computador , Quinase 2 Dependente de Ciclina/química , Quinase 2 Dependente de Ciclina/metabolismo , Expressão Gênica/efeitos dos fármacos , Produtos do Gene tat/genética , Infecções por HIV/virologia , HIV-1/fisiologia , Humanos , Leucócitos Mononucleares/virologia , Dados de Sequência Molecular , Peptídeos/síntese química , Peptídeos/química , Solubilidade , Produtos do Gene tat do Vírus da Imunodeficiência Humana
SELEÇÃO DE REFERÊNCIAS
Detalhe da pesquisa