Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 9 de 9
Filtrar
Mais filtros








Base de dados
Intervalo de ano de publicação
1.
J Mol Biol ; 429(3): 356-364, 2017 02 03.
Artigo em Inglês | MEDLINE | ID: mdl-27561707

RESUMO

abYsis is a web-based antibody research system that includes an integrated database of antibody sequence and structure data. The system can be interrogated in numerous ways-from simple text and sequence searches to sophisticated queries that apply 3D structural constraints. The publicly available version includes pre-analyzed sequence data from the European Molecular Biology Laboratory European Nucleotide Archive (EMBL-ENA) and Kabat as well as structure data from the Protein Data Bank. A researcher's own sequences can also be analyzed through the web interface. A defining characteristic of abYsis is that the sequences are automatically numbered with a series of popular schemes such as Kabat and Chothia and then annotated with key information such as complementarity-determining regions and potential post-translational modifications. A unique aspect of abYsis is a set of residue frequency tables for each position in an antibody, allowing "unusual residues" (those rarely seen at a particular position) to be highlighted and decisions to be made on which mutations may be acceptable. This is especially useful when comparing antibodies from different species. abYsis is useful for any researcher specializing in antibody engineering, especially those developing antibodies as drugs. abYsis is available at www.abysis.org.


Assuntos
Anticorpos/química , Bases de Dados de Proteínas , Sequência de Aminoácidos , Animais , Regiões Determinantes de Complementaridade , Biologia Computacional , Humanos , Internet , Processamento de Proteína Pós-Traducional
2.
Mol Biosyst ; 8(8): 2076-84, 2012 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-22692068

RESUMO

Over the past two decades, many ingenious efforts have been made in protein remote homology detection. Because homologous proteins often diversify extensively in sequence, it is challenging to demonstrate such relatedness through entirely sequence-driven searches. Here, we describe a computational method for the generation of 'protein-like' sequences that serves to bridge gaps in protein sequence space. Sequence profile information, as embodied in a position-specific scoring matrix of multiply aligned sequences of bona fide family members, serves as the starting point in this algorithm. The observed amino acid propensity and the selection of a random number dictate the selection of a residue for each position in the sequence. In a systematic manner, and by applying a 'roulette-wheel' selection approach at each position, we generate parent family-like sequences and thus facilitate an enlargement of sequence space around the family. When generated for a large number of families, we demonstrate that they expand the utility of natural intermediately related sequences in linking distant proteins. In 91% of the assessed examples, inclusion of designed sequences improved fold coverage by 5-10% over searches made in their absence. Furthermore, with several examples from proteins adopting folds such as TIM, globin, lipocalin and others, we demonstrate that the success of including designed sequences in a database positively sensitized methods such as PSI-BLAST and Cascade PSI-BLAST and is a promising opportunity for enormously improved remote homology recognition using sequence information alone.


Assuntos
Proteínas/química , Algoritmos , Sequência de Aminoácidos , Biologia Computacional , Bases de Dados de Proteínas , Dados de Sequência Molecular , Análise de Sequência de Proteína , Homologia de Sequência de Aminoácidos
3.
Protein Eng Des Sel ; 23(9): 689-97, 2010 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-20591902

RESUMO

The packing of V(H) and V(L) domains in antibodies can vary, influencing the topography of the antigen-combining site. However, until recently, this has largely been ignored in modelling antibody structure. We present an analysis of the degree of variability observed in known structures together with a machine-learning approach to predict the packing angle. A neural network was trained on sets of interface residues and a genetic algorithm designed to perform 'feature selection' to define which sets of interface residues could be used most successfully to perform the prediction. While this training procedure was very computationally intensive, prediction is performed in a matter of seconds. Thus, not only do we provide a rapid method for predicting the packing angle, but also we define a set of residues that may be important in antibody humanization in order to obtain the correct binding site topography.


Assuntos
Cadeias Pesadas de Imunoglobulinas/química , Cadeias Leves de Imunoglobulina/química , Região Variável de Imunoglobulina/química , Algoritmos , Humanos , Cadeias Pesadas de Imunoglobulinas/metabolismo , Cadeias Leves de Imunoglobulina/metabolismo , Região Variável de Imunoglobulina/metabolismo , Modelos Moleculares , Redes Neurais de Computação , Conformação Proteica , Análise de Regressão
4.
Comp Funct Genomics ; : 365637, 2009.
Artigo em Inglês | MEDLINE | ID: mdl-19809514

RESUMO

Protein Kinase-Like Non-kinases (PKLNKs), which are closely related to protein kinases, lack the crucial catalytic aspartate in the catalytic loop, and hence cannot function as protein kinase, have been analysed. Using various sensitive sequence analysis methods, we have recognized 82 PKLNKs from four higher eukaryotic organisms, namely, Homo sapiens, Mus musculus, Rattus norvegicus, and Drosophila melanogaster. On the basis of their domain combination and function, PKLNKs have been classified mainly into four categories: (1) Ligand binding PKLNKs, (2) PKLNKs with extracellular protein-protein interaction domain, (3) PKLNKs involved in dimerization, and (4) PKLNKs with cytoplasmic protein-protein interaction module. While members of the first two classes of PKLNKs have transmembrane domain tethered to the PKLNK domain, members of the other two classes of PKLNKs are cytoplasmic in nature. The current classification scheme hopes to provide a convenient framework to classify the PKLNKs from other eukaryotes which would be helpful in deciphering their roles in cellular processes.

5.
Mol Immunol ; 45(14): 3832-9, 2008 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-18614234

RESUMO

In analysing protein sequence and structure, standardized numbering schemes allow comparison of features without explicit alignment. This has proved particularly valuable in the case of antibodies. The most widely used schemes (Kabat: sequence-based; Chothia: structure-based) differ only in the numbering of the complementarity determining regions (CDRs). We have analyzed the numbered annotations in the widely used Kabat database and found that approximately 10% of entries contain errors or inconsistencies. Further analysis of sequence alignments in the context of structure suggest that the sites of the insertions in some framework regions in the Kabat and Chothia schemes are incorrect. We therefore propose a corrected version of the Chothia scheme which is structurally correct throughout the CDRs and frameworks. To perform this analysis, we have developed, and made available, a tool for the automatic application of Kabat, Chothia and modified-Chothia numbering schemes and have carefully benchmarked the performance of this tool.


Assuntos
Anticorpos/química , Bases de Dados Factuais , Alinhamento de Sequência/métodos , Design de Software , Software , Sequência de Aminoácidos , Animais , Anticorpos/genética , Automação/métodos , Humanos , Modelos Moleculares , Dados de Sequência Molecular , Estrutura Terciária de Proteína/genética
6.
J Mol Biol ; 369(3): 852-62, 2007 Jun 08.
Artigo em Inglês | MEDLINE | ID: mdl-17442342

RESUMO

Genetically engineered mouse antibodies are now commonly in clinical use. However, their development is limited because the human immune system tends to regard them as foreign and this triggers an immune response. The solution is to make engineered antibodies appear more human. Here, we propose a method to assess the "degree of humanness" of antibody sequences providing a tool that may contribute to predictions of antigenicity. We analyzed sequences of antibodies belonging to various chains/classes in human and mouse. Our analysis of metrics based on percentage sequence identity between antibody sequences shows distinct differences between human and mouse sequences. Based on mean sequence identity and standard deviation, we calculated Z-scores for data sets of antibody sequences extracted from the Kabat database. We applied the analysis to a set of humanized and chimeric antibodies and to human germline sequences. We conclude that this approach may aid in the selection of more suitable mouse variable domains for antibody engineering to render them more human but in general, we find that typicality of a sequence compared with the expressed human repertoire is not well correlated with antigenicity. We have provided a Web server allowing humanness to be assigned for a sequence.


Assuntos
Anticorpos/química , Especificidade de Anticorpos/genética , Região Variável de Imunoglobulina/genética , Engenharia de Proteínas , Animais , Anticorpos Monoclonais/química , Humanos , Imunossupressores/química , Camundongos , Modelos Estatísticos , Estrutura Terciária de Proteína , Especificidade da Espécie
7.
Nucleic Acids Res ; 34(Web Server issue): W143-6, 2006 Jul 01.
Artigo em Inglês | MEDLINE | ID: mdl-16844978

RESUMO

Owing to high evolutionary divergence, it is not always possible to identify distantly related protein domains by sequence search techniques. Intermediate sequences possess sequence features of more than one protein and facilitate detection of remotely related proteins. We have demonstrated recently the employment of Cascade PSI-BLAST where we perform PSI-BLAST for many 'generations', initiating searches from new homologues as well. Such a rigorous propagation through generations of PSI-BLAST employs effectively the role of intermediates in detecting distant similarities between proteins. This approach has been tested on a large number of folds and its performance in detecting superfamily level relationships is approximately 35% better than simple PSI-BLAST searches. We present a web server for this search method that permits users to perform Cascade PSI-BLAST searches against the Pfam, SCOP and SwissProt databases. The URL for this server is http://crick.mbu.iisc.ernet.in/~CASCADE/CascadeBlast.html.


Assuntos
Estrutura Terciária de Proteína , Homologia de Sequência de Aminoácidos , Software , Internet
8.
J Biomol Struct Dyn ; 23(3): 283-98, 2005 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-16218755

RESUMO

Profile-based sequence search procedures are commonly employed to detect remote relationships between proteins. We provide an assessment of a Cascade PSI-BLAST protocol that rigorously employs intermediate sequences in detecting remote relationships between proteins. In this approach we detect using PSI-BLAST, which involves multiple rounds of iteration, an initial set of homologues for a protein in a 'first generation' search by querying a database. We propagate a 'second generation' search in the database, involving multiple runs of PSI-BLAST using each of the homologues identified in the previous generation as queries to recognize homologues not detected earlier. This non-directed search process can be viewed as an iteration of iterations that is continued to detect further homologues until no new hits are detectable. We present an assessment of the coverage of this 'cascaded' intermediate sequence search on diverse folds and find that searches for up to three generations detect most known homologues of a query. Our assessments show that this approach appears to perform better than the traditional use of PSI-BLAST by detecting 15% more relationships within a family and 35% more relationships within a superfamily. We show that such searches can be performed on generalized sequence databases and non-trivial relationships between proteins can be detected effectively. Such a propagation of searches maximizes the chances of detecting distant homologies by effectively scanning protein "fold space".


Assuntos
Bases de Dados de Proteínas , Conformação Proteica , Proteínas , Análise de Sequência de Proteína/métodos , Sequência de Aminoácidos , Modelos Moleculares , Dados de Sequência Molecular , Dobramento de Proteína , Proteínas/química , Proteínas/genética , Alinhamento de Sequência , Análise de Sequência de Proteína/instrumentação , Homologia de Sequência de Aminoácidos
9.
Nucleic Acids Res ; 32(Database issue): D153-5, 2004 Jan 01.
Artigo em Inglês | MEDLINE | ID: mdl-14681382

RESUMO

The KinG database is a comprehensive collection of serine/threonine/tyrosine-specific kinases and their homologues identified in various completed genomes using sequence and profile search methods. The database hosted at http://hodgkin. mbu.iisc.ernet.in/ approximately king provides the amino acid sequences, functional domain assignments and classification of gene products containing protein kinase domains. A search tool enabling the retrieval of protein kinases with specified subfamily and domain combinations is one of the key features of the resource. Identification of a kinase catalytic domain in the user's query sequence is possible using another search tool. The occurrence and location of critical catalytic residues if the query has a catalytic kinase domain, recognition of non-kinase domains in the sequence and subfamily classification of the kinase in the query will help in deciphering the biological role of the kinase. This online compilation can also be used to compare the protein kinases of a given subfamily and domain combinations across various genomes. Another exclusive feature of the database is the collection of the Ser/Thr/Tyr protein kinases and similar sequences encoded in the genomes of archaea and bacteria.


Assuntos
Bases de Dados de Proteínas , Genoma , Proteínas Quinases/química , Animais , Domínio Catalítico , Genômica , Humanos , Internet , Proteínas Quinases/metabolismo , Estrutura Terciária de Proteína , Transdução de Sinais
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA