Pesquisa | Biblioteca Virtual em Saúde

The CATH database: an extended protein family resource for structural and functional genomics.

Pearl, F M G; Bennett, C F; Bray, J E; Harrison, A P; Martin, N; Shepherd, A; Sillitoe, I; Thornton, J; Orengo, C A.

Nucleic Acids Res ; 31(1): 452-5, 2003 Jan 01.

Artigo em Inglês | MEDLINE | ID: mdl-12520050

RESUMO

The CATH database of protein domain structures (http://www.biochem.ucl.ac.uk/bsm/cath_new) currently contains 34 287 domain structures classified into 1383 superfamilies and 3285 sequence families. Each structural family is expanded with domain sequence relatives recruited from GenBank using a variety of efficient sequence search protocols and reliable thresholds. This extended resource, known as the CATH-protein family database (CATH-PFDB) contains a total of 310 000 domain sequences classified into 26 812 sequence families. New sequence search protocols have been designed, based on these intermediate sequence libraries, to allow more regular updating of the classification. Further developments include the adaptation of a recently developed method for rapid structure comparison, based on secondary structure matching, for domain boundary assignment. The philosophy behind CATHEDRAL is the recognition of recurrent folds already classified in CATH. Benchmarking of CATHEDRAL, using manually validated domain assignments, demonstrated that 43% of domains boundaries could be completely automatically assigned. This is an improvement on a previous consensus approach for which only 10-20% of domains could be reliably processed in a completely automated fashion. Since domain boundary assignment is a significant bottleneck in the classification of new structures, CATHEDRAL will also help to increase the frequency of CATH updates.

Assuntos

Bases de Dados de Proteínas , Estrutura Terciária de Proteína , Proteínas/classificação , Animais , Automação , Genômica , Dobramento de Proteína , Estrutura Secundária de Proteína , Proteínas/química , Proteínas/fisiologia , Homologia de Sequência de Aminoácidos , Homologia Estrutural de Proteína

The CATH domain structure database.

Orengo, C A; Pearl, F M G; Thornton, J M.

Methods Biochem Anal ; 44: 249-71, 2003.

Artigo em Inglês | MEDLINE | ID: mdl-12647390

Assuntos

Bases de Dados de Proteínas , Proteínas/química , Biologia Computacional , Bases de Dados de Proteínas/história , História do Século XX , Internet , Modelos Moleculares , Estrutura Molecular , Filogenia , Dobramento de Proteína , Estrutura Terciária de Proteína , Proteínas/classificação , Proteínas/genética , Alinhamento de Sequência , Design de Software

RESUMO

Assuntos

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA