Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 36
Filtrar
Mais filtros

Base de dados
País/Região como assunto
Tipo de documento
Intervalo de ano de publicação
1.
Bioinformatics ; 33(24): 3991-3992, 2017 Dec 15.
Artigo em Inglês | MEDLINE | ID: mdl-28961944

RESUMO

MOTIVATION: ALIGNSEC is a module within ANTHEPROT designed for the interactive display, edition and printing of large-scale multiple alignments integrating secondary structure predictions. AVAILABILITY AND IMPLEMENTATION: The ALIGNSEC module is part of the ANTHEPROT package (http://antheprot-pbil.ibcp.fr) which can be used freely for academic users. It is running on Windows Operating systems. For commercial use, please contact the author. CONTACT: gilbert.deleage@ibcp.fr.


Assuntos
Estrutura Secundária de Proteína , Alinhamento de Sequência/métodos , Software , Simulação por Computador
2.
J Biol Chem ; 291(32): 16553-66, 2016 08 05.
Artigo em Inglês | MEDLINE | ID: mdl-27311711

RESUMO

Thyroid hormones modulate not only multiple functions in vertebrates (energy metabolism, central nervous system function, seasonal changes in physiology, and behavior) but also in some non-vertebrates where they control critical post-embryonic developmental transitions such as metamorphosis. Despite their obvious biological importance, the thyroid hormone precursor protein, thyroglobulin (Tg), has been experimentally investigated only in mammals. This may bias our view of how thyroid hormones are produced in other organisms. In this study we searched genomic databases and found Tg orthologs in all vertebrates including the sea lamprey (Petromyzon marinus). We cloned a full-size Tg coding sequence from western clawed frog (Xenopus tropicalis) and zebrafish (Danio rerio). Comparisons between the representative mammal, amphibian, teleost fish, and basal vertebrate indicate that all of the different domains of Tg, as well as Tg regional structure, are conserved throughout the vertebrates. Indeed, in Xenopus, zebrafish, and lamprey Tgs, key residues, including the hormonogenic tyrosines and the disulfide bond-forming cysteines critical for Tg function, are well conserved despite overall divergence of amino acid sequences. We uncovered upstream sequences that include start codons of zebrafish and Xenopus Tgs and experimentally proved that these are full-length secreted proteins, which are specifically recognized by antibodies against rat Tg. By contrast, we have not been able to find any orthologs of Tg among non-vertebrate species. Thus, Tg appears to be a novel protein elaborated as a single event at the base of vertebrates and virtually unchanged thereafter.


Assuntos
Evolução Molecular , Lampreias/genética , Tireoglobulina/genética , Proteínas de Xenopus/genética , Proteínas de Peixe-Zebra/genética , Peixe-Zebra/genética , Animais , Ratos , Xenopus
3.
J Virol ; 88(1): 574-82, 2014 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-24173223

RESUMO

Last-generation nucleoside/nucleotide analogues are potent against hepatitis B virus (HBV) and have a high barrier to resistance. However, delayed responses have been observed in patients previously exposed to other drugs of the same class, long-term resistance is possible, and cure of infection cannot be achieved with these therapies, emphasizing the need for alternative therapeutic approaches. The HBV RNase H represents an interesting target because its enzyme activity is essential to the HBV life cycle. The goal of our study was to characterize the structure of the HBV RNase H by computing a 3-dimensional molecular model derived from E. coli RNase H and analyzing 2,326 sequences of all HBV genotypes available in public databases and 958,000 sequences generated by means of ultradeep pyrosequencing of sequences from a homogenous population of 73 treatment-naive patients infected with HBV genotype D. Our data revealed that (i) the putative 4th catalytic residue displays unexpected variability that could be explained by the overlap of the HBx gene and has no apparent impact on HBV replicative capacity and that (ii) the C-helix-containing basic protrusion, which is required to guide the RNA/DNA heteroduplex into the catalytic site, is highly conserved and bears unique structural properties that can be used to target HBV-specific RNase H inhibitors without cross-species activity. The model shows substantial differences from other known RNases H and paves the way for functional and structural studies as a prerequisite to the development of new inhibitors of the HBV cell cycle specifically targeting RNase H activity.


Assuntos
Vírus da Hepatite B/enzimologia , Sequenciamento de Nucleotídeos em Larga Escala , Ribonuclease H/genética , Sequência de Aminoácidos , Antivirais/farmacologia , Genótipo , Vírus da Hepatite B/efeitos dos fármacos , Vírus da Hepatite B/genética , Modelos Moleculares , Dados de Sequência Molecular , Ribonuclease H/química , Homologia de Sequência de Aminoácidos
4.
Nucleic Acids Res ; 41(Database issue): D566-70, 2013 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-23125365

RESUMO

We have developed a specialized database, HBVdb (http://hbvdb.ibcp.fr), allowing the researchers to investigate the genetic variability of Hepatitis B Virus (HBV) and viral resistance to treatment. HBV is a major health problem worldwide with more than 350 million individuals being chronically infected. HBV is an enveloped DNA virus that replicates by reverse transcription of an RNA intermediate. HBV genome is optimized, being circular and encoding four overlapping reading frames. Indeed, each nucleotide of the genome takes part in the coding of at least one protein. However, HBV shows some genome variability leading to at least eight different genotypes and recombinant forms. The main drugs used to treat infected patients are nucleos(t)ides analogs (reverse transcriptase inhibitors). Unfortunately, HBV mutants resistant to these drugs may be selected and be responsible for treatment failure. HBVdb contains a collection of computer-annotated sequences based on manually annotated reference genomes. The database can be accessed through a web interface that allows static and dynamic queries and offers integrated generic sequence analysis tools and specialized analysis tools (e.g. annotation, genotyping, drug resistance profiling).


Assuntos
Bases de Dados Genéticas , Vírus da Hepatite B/genética , Farmacorresistência Viral/genética , Variação Genética , Genoma Viral , Técnicas de Genotipagem , Vírus da Hepatite B/efeitos dos fármacos , Internet , Anotação de Sequência Molecular , Interface Usuário-Computador , Proteínas Virais/genética
5.
Nucleic Acids Res ; 40(Database issue): D321-4, 2012 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-22080550

RESUMO

Bacterial tyrosine-kinases share no resemblance with their eukaryotic counterparts and they have been unified in a new protein family named BY-kinases. These enzymes have been shown to control several biological functions in the bacterial cells. In recent years biochemical studies, sequence analyses and structure resolutions allowed the deciphering of a common signature. However, BY-kinase sequence annotations in primary databases remain incomplete. This prompted us to develop a specialized database of computer-annotated BY-kinase sequences: the Bacterial protein tyrosine-kinase database (BYKdb). BY-kinase sequences are first identified, thanks to a workflow developed in a previous work. A second workflow annotates the UniProtKB entries in order to provide the BYKdb entries. The database can be accessed through a web interface that allows static and dynamic queries and offers integrated sequence analysis tools. BYKdb can be found at http://bykdb.ibcp.fr.


Assuntos
Bactérias/enzimologia , Proteínas de Bactérias/química , Bases de Dados de Proteínas , Proteínas Tirosina Quinases/química , Anotação de Sequência Molecular , Análise de Sequência de Proteína , Interface Usuário-Computador
6.
Hum Mutat ; 31(2): 127-35, 2010 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-19921752

RESUMO

Understanding how genetic alterations affect gene products at the molecular level represents a first step in the elucidation of the complex relationships between genotypic and phenotypic variations, and is thus a major challenge in the postgenomic era. Here, we present SM2PH-db (http://decrypthon.igbmc.fr/sm2ph), a new database designed to investigate structural and functional impacts of missense mutations and their phenotypic effects in the context of human genetic diseases. A wealth of up-to-date interconnected information is provided for each of the 2,249 disease-related entry proteins (August 2009), including data retrieved from biological databases and data generated from a Sequence-Structure-Evolution Inference in Systems-based approach, such as multiple alignments, three-dimensional structural models, and multidimensional (physicochemical, functional, structural, and evolutionary) characterizations of mutations. SM2PH-db provides a robust infrastructure associated with interactive analysis tools supporting in-depth study and interpretation of the molecular consequences of mutations, with the more long-term goal of elucidating the chain of events leading from a molecular defect to its pathology. The entire content of SM2PH-db is regularly and automatically updated thanks to a computational grid data federation facilities provided in the context of the Decrypthon program.


Assuntos
Bases de Dados de Proteínas , Doenças Genéticas Inatas/genética , Mutação de Sentido Incorreto/genética , Software , Humanos , Internet , Fenótipo , Proteínas , Interface Usuário-Computador
7.
J Mol Graph Model ; 98: 107610, 2020 07.
Artigo em Inglês | MEDLINE | ID: mdl-32302938

RESUMO

Hepatitis B, one of the world's most common liver infections, is caused by the Hepatitis B Virus (HBV). Via the infected cells, this virus generates non pathogen particles with similar surface structures as those found in the full virus. These particles are used in a recombinant form (HBsAg) to produce efficient vaccines. The atomic structure of the HBsAg particles is currently unsolved, and the only existing structural data for the full particle were obtained by electronic microscopy with a maximum resolution of 12 Å. As many vaccines, HBsAg is a complex bio-system. This complexity results from numerous sources of heterogeneity, and traditional bio-immuno-chemistry analytic tools are often limited in their ability to fully describe the molecular surface or the particle. For the Hepatitis B vaccine particle (HBsAg), no atomic data are available so far. In this study, we used the principal well-known elements of HBsAg structure to reconstitute and model the full HBsAg particle assembly at a molecular level (protein assembly, particle formation and maturation). Full HBsAg particle atomic models were built based on an exhaustive experimental data review, amino acid sequence analysis, iterative threading modeling, and molecular dynamic approaches.


Assuntos
Antígenos de Superfície da Hepatite B , Vacinas contra Hepatite B , Vírus da Hepatite B
8.
Bioinformatics ; 24(21): 2427-30, 2008 Nov 01.
Artigo em Inglês | MEDLINE | ID: mdl-18772155

RESUMO

MOTIVATION: Most of the protein tyrosine kinases found in bacteria have been recently classified in a new family, termed BY-kinase. Indeed, they share no sequence homology with their eukaryotic counterparts and have no known eukaryotic homologues. They are involved in several biological functions (e.g. capsule biosynthesis, antibiotic resistance, virulence mechanism). Thus, they can be considered interesting therapeutic targets to develop new drugs to treat infectious diseases. However, their identification is rendered difficult due to slow progress in their structural characterization and comes most often from biochemical experiments. Moreover BY-kinase sequences are related to many other bacterial proteins involved in several biological functions (e.g. ParA family proteins). Accordingly, their annotations in generalist databases, sequence analysis and classification remain partial and inhomogeneous and there is no bioinformatics resource dedicated to these proteins. RESULTS: The combination of similarity search with sequence-profile alignment, pattern matching and sliding window computation to detect the tyrosine cluster was used to identify BY-kinase sequences in UniProt Knowledgebase. Cross-validations with keywords searches, pattern matching with several patterns and checking of motifs conservation in multiple sequence alignments were performed. Our pipeline identified 640 sequences as BY-kinases and allowed the definition of a PROSITE pattern that is the signature of the BY-kinases. The sequences identified by our pipeline as BY-kinases share a good sequence similarity with BY-kinases that have already been biochemically characterized, and they all bear the characteristic motifs of the catalytic domain, including the three Walker-like motifs followed by a tyrosine cluster. AVAILABILITY: http://bykdb.ibcp.fr


Assuntos
Proteínas de Bactérias/química , Proteínas Tirosina Quinases/química , Motivos de Aminoácidos , Sequência de Aminoácidos , Biologia Computacional , Bases de Dados de Proteínas , Dados de Sequência Molecular , Proteínas/química , Alinhamento de Sequência
9.
Bioinformatics ; 24(23): 2782-3, 2008 Dec 01.
Artigo em Inglês | MEDLINE | ID: mdl-18826958

RESUMO

MOTIVATION: The technique of chemical cross-linking followed by mass spectrometry has proven to bring valuable information about the protein structure and interactions between proteic subunits. It is an effective and efficient way to experimentally investigate some aspects of a protein structure when NMR and X-ray crystallography data are lacking. RESULTS: We introduce MSX-3D, a tool specifically geared to validate protein models using mass spectrometry. In addition to classical peptides identifications, it allows an interactive 3D visualization of the distance constraints derived from a cross-linking experiment. AVAILABILITY: Freely available at http://proteomics-pbil.ibcp.fr


Assuntos
Espectrometria de Massas/métodos , Conformação Proteica , Software , Simulação por Computador , Bases de Dados de Proteínas , Modelos Moleculares , Proteínas/química , Proteômica/métodos
10.
Nucleic Acids Res ; 35(Database issue): D363-6, 2007 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-17142229

RESUMO

The hepatitis C virus (HCV) genome shows remarkable sequence variability, leading to the classification of at least six major genotypes, numerous subtypes and a myriad of quasispecies within a given host. A database allowing researchers to investigate the genetic and structural variability of all available HCV sequences is an essential tool for studies on the molecular virology and pathogenesis of hepatitis C as well as drug design and vaccine development. We describe here the European Hepatitis C Virus Database (euHCVdb, http://euhcvdb.ibcp.fr), a collection of computer-annotated sequences based on reference genomes. The annotations include genome mapping of sequences, use of recommended nomenclature, subtyping as well as three-dimensional (3D) molecular models of proteins. A WWW interface has been developed to facilitate database searches and the export of data for sequence and structure analyses. As part of an international collaborative effort with the US and Japanese databases, the European HCV Database (euHCVdb) is mainly dedicated to HCV protein sequences, 3D structures and functional analyses.


Assuntos
Bases de Dados de Proteínas , Hepacivirus/genética , Proteínas Virais/química , Proteínas Virais/genética , Genoma Viral , Genômica , Internet , Modelos Moleculares , Conformação Proteica , Análise de Sequência de Proteína , Interface Usuário-Computador
11.
BMC Bioinformatics ; 8: 62, 2007 Feb 23.
Artigo em Inglês | MEDLINE | ID: mdl-17319945

RESUMO

BACKGROUND: The post-genomic era is characterised by a torrent of biological information flooding the public databases. As a direct consequence, similarity searches starting with a single query sequence frequently lead to the identification of hundreds, or even thousands of potential homologues. The huge volume of data renders the subsequent structural, functional and evolutionary analyses very difficult. It is therefore essential to develop new strategies for efficient sampling of this large sequence space, in order to reduce the number of sequences to be processed. At the same time, it is important to retain the most pertinent sequences for structural and functional studies. RESULTS: An exhaustive analysis on a large scale test set (284 protein families) was performed to compare the efficiency of four different sampling methods aimed at selecting the most pertinent sequences. These four methods sample the proteins detected by BlastP searches and can be divided into two categories: two customisable methods where the user defines either the maximal number or the percentage of sequences to be selected; two automatic methods in which the number of sequences selected is determined by the program. We focused our analysis on the potential information content of the sampled sets of sequences using multiple alignment of complete sequences as the main validation tool. The study considered two criteria: the total number of sequences in BlastP and their associated E-values. The subsequent analyses investigated the influence of the sampling methods on the E-value distributions, the sequence coverage, the final multiple alignment quality and the active site characterisation at various residue conservation thresholds as a function of these criteria. CONCLUSION: The comparative analysis of the four sampling methods allows us to propose a suitable sampling strategy that significantly reduces the number of homologous sequences required for alignment, while at the same time maintaining the relevant information concerning the active site residues.


Assuntos
Algoritmos , Bases de Dados de Proteínas , Armazenamento e Recuperação da Informação/métodos , Proteínas/química , Proteínas/metabolismo , Alinhamento de Sequência/métodos , Análise de Sequência de Proteína/métodos , Sequência de Aminoácidos , Sequência Conservada , Sistemas de Gerenciamento de Base de Dados , Dados de Sequência Molecular , Homologia de Sequência de Aminoácidos , Relação Estrutura-Atividade
12.
Nucleic Acids Res ; 33(Database issue): D641-6, 2005 Jan 01.
Artigo em Inglês | MEDLINE | ID: mdl-15608279

RESUMO

Genomic projects heavily depend on genome annotations and are limited by the current deficiencies in the published predictions of gene structure and function. It follows that, improved annotation will allow better data mining of genomes, and more secure planning and design of experiments. The purpose of the GeneFarm project is to obtain homogeneous, reliable, documented and traceable annotations for Arabidopsis nuclear genes and gene products, and to enter them into an added-value database. This re-annotation project is being performed exhaustively on every member of each gene family. Performing a family-wide annotation makes the task easier and more efficient than a gene-by-gene approach since many features obtained for one gene can be extrapolated to some or all the other genes of a family. A complete annotation procedure based on the most efficient prediction tools available is being used by 16 partner laboratories, each contributing annotated families from its field of expertise. A database, named GeneFarm, and an associated user-friendly interface to query the annotations have been developed. More than 3000 genes distributed over 300 families have been annotated and are available at http://genoplante-info.infobiogen.fr/Genefarm/. Furthermore, collaboration with the Swiss Institute of Bioinformatics is underway to integrate the GeneFarm data into the protein knowledgebase Swiss-Prot.


Assuntos
Proteínas de Arabidopsis/genética , Arabidopsis/genética , Bases de Dados Genéticas , Genes de Plantas , Proteínas de Arabidopsis/química , Proteínas de Arabidopsis/fisiologia , Filosofia , Integração de Sistemas , Interface Usuário-Computador
13.
BMC Bioinformatics ; 7: 255, 2006 May 16.
Artigo em Inglês | MEDLINE | ID: mdl-16704727

RESUMO

BACKGROUND: Membrane proteins are estimated to represent about 25% of open reading frames in fully sequenced genomes. However, the experimental study of proteins remains difficult. Considerable efforts have thus been made to develop prediction methods. Most of these were conceived to detect transmembrane helices in polytopic proteins. Alternatively, a membrane protein can be monotopic and anchored via an amphipathic helix inserted in a parallel way to the membrane interface, so-called in-plane membrane (IPM) anchors. This type of membrane anchor is still poorly understood and no suitable prediction method is currently available. RESULTS: We report here the "AmphipaSeeK" method developed to predict IPM anchors. It uses a set of 21 reported examples of IPM anchored proteins. The method is based on a pattern recognition Support Vector Machine with a dedicated kernel. CONCLUSION: AmphipaSeeK was shown to be highly specific, in contrast with classically used methods (e.g. hydrophobic moment). Additionally, it has been able to retrieve IPM anchors in naively tested sets of transmembrane proteins (e.g. PagP). AmphipaSeek and the list of the 21 IPM anchored proteins is available on NPS@, our protein sequence analysis server.


Assuntos
Biologia Computacional/métodos , Proteínas de Membrana/metabolismo , Reconhecimento Automatizado de Padrão/métodos , Algoritmos , Bases de Dados de Proteínas , Proteínas de Membrana/química , Proteínas de Membrana/genética , Estrutura Secundária de Proteína , Reprodutibilidade dos Testes , Alinhamento de Sequência/métodos , Análise de Sequência de Proteína/métodos
14.
Nucleic Acids Res ; 31(13): 3393-9, 2003 Jul 01.
Artigo em Inglês | MEDLINE | ID: mdl-12824334

RESUMO

The World Wide Web server of the PBIL (Pôle Bioinformatique Lyonnais) provides on-line access to sequence databanks and to many tools of nucleic acid and protein sequence analyses. This server allows to query nucleotide sequence banks in the EMBL and GenBank formats and protein sequence banks in the SWISS-PROT and PIR formats. The query engine on which our data bank access is based is the ACNUC system. It allows the possibility to build complex queries to access functional zones of biological interest and to retrieve large sequence sets. Of special interest are the unique features provided by this system to query the data banks of gene families developed at the PBIL. The server also provides access to a wide range of sequence analysis methods: similarity search programs, multiple alignments, protein structure prediction and multivariate statistics. An originality of this server is the integration of these two aspects: sequence retrieval and sequence analysis. Indeed, thanks to the introduction of re-usable lists, it is possible to perform treatments on large sets of data. The PBIL server can be reached at: http://pbil.univ-lyon1.fr.


Assuntos
Bases de Dados Genéticas , Conformação Proteica , Análise de Sequência de DNA , Análise de Sequência de Proteína , Análise de Sequência de RNA , Internet , Modelos Moleculares , Ácidos Nucleicos/química , Estrutura Secundária de Proteína , Proteínas/química , Alinhamento de Sequência , Homologia de Sequência , Software , Integração de Sistemas , Interface Usuário-Computador
15.
Stud Health Technol Inform ; 120: 142-52, 2006.
Artigo em Inglês | MEDLINE | ID: mdl-16823131

RESUMO

Biological data are most times published and then become public ones. They, then, do not need to be isolated or encrypted. But, in some cases, these data stemed from patients or are analyzed with, for instance, pharmaceutical or agronomics goals. Also in simple ways , these data, before to become public, have to be kept confidential while researchers haven't been able to publish their work or to register them. So they are a lot of cases where the integrity and the confidentiality of biological data have to be protected against unauthorized accesses. But, as these private data are also large datasets, they need high-throughput computing and huge data storage to processed, such as ones produced by complete genome projects. These requirements are enhanced in the context of a Grid such EGEE, where the computing and storage resources are distributed across a large-scale platform. We have developed a secured distributed service to manage biological data on grid: the EncFile encrypted files management system. We have deployed it on the production platform of the EGEE grid project. Thus we provided grid users with a user-friendly component that doesn't require any user privileges. And we have integrated into a bioinformatics grid portal associated to encrypted representative biological resources: world-famous databases and programs.


Assuntos
Biologia Computacional , Segurança Computacional , Bases de Dados como Assunto/organização & administração , Europa (Continente) , Disseminação de Informação , Linguagens de Programação , Software
16.
Stud Health Technol Inform ; 120: 187-93, 2006.
Artigo em Inglês | MEDLINE | ID: mdl-16823137

RESUMO

Bioinformatics analysis of data produced by complete genome sequencing projects is one of the major challenges of the current years. Integrating up-to-date databanks and relevant algorithms is a clear requirement of such analysis. Grid computing would be a viable solution to distribute data, algorithms, computing and storage resources for Genomics. Providing bioinformaticians with a good interface to grid infrastructure, such as the one provided by the EGEE European project, is also a challenge to take up. The GPS@ web portal, "Grid Protein Sequence Analysis", aims to provide such a user-friendly interface for these grid genomic resources on the EGEE grid.


Assuntos
Biologia Computacional , Internet , Europa (Continente) , Genômica , Interface Usuário-Computador
17.
Proteins ; 52(2): 137-45, 2003 Aug 01.
Artigo em Inglês | MEDLINE | ID: mdl-12833538

RESUMO

An innovative bioinformatic method has been designed and implemented to detect similar three-dimensional (3D) sites in proteins. This approach allows the comparison of protein structures or substructures and detects local spatial similarities: this method is completely independent from the amino acid sequence and from the backbone structure. In contrast to already existing tools, the basis for this method is a representation of the protein structure by a set of stereochemical groups that are defined independently from the notion of amino acid. An efficient heuristic for finding similarities that uses graphs of triangles of chemical groups to represent the protein structures has been developed. The implementation of this heuristic constitutes a software named SuMo (Surfing the Molecules), which allows the dynamic definition of chemical groups, the selection of sites in the proteins, and the management and screening of databases. To show the relevance of this approach, we focused on two extreme examples illustrating convergent and divergent evolution. In two unrelated serine proteases, SuMo detects one common site, which corresponds to the catalytic triad. In the legume lectins family composed of >100 structures that share similar sequences and folds but may have lost their ability to bind a carbohydrate molecule, SuMo discriminates between functional and non-functional lectins with a selectivity of 96%. The time needed for searching a given site in a protein structure is typically 0.1 s on a PIII 800MHz/Linux computer; thus, in further studies, SuMo will be used to screen the PDB.


Assuntos
Biologia Computacional/métodos , Modelos Moleculares , Proteínas/química , Algoritmos , Domínio Catalítico , Quimotripsina/química , Quimotripsina/genética , Evolução Molecular , Fabaceae/química , Lectinas de Plantas/química , Lectinas de Plantas/genética , Conformação Proteica , Reprodutibilidade dos Testes , Subtilisina/química , Subtilisina/genética
18.
J Bioinform Comput Biol ; 1(3): 505-520, 2003 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-15307241

RESUMO

We present an original strategy, that involves a bioinformatic software structure, in order to perform an exhaustive and objective statistical analysis of three-dimensional structures of proteins. We establish the relationship between multiple sequences alignments and various structural features of proteins. We show that amino acids implied in disulfide bonds, salt bridges and hydrophobic interactions have been studied. Furthermore, we point out that the more variable the sequences within a multiple alignment, the more informative the multiple alignment. The results support multiple alignments usefulness for predictions of structural features.


Assuntos
Proteínas/química , Proteínas/genética , Alinhamento de Sequência/estatística & dados numéricos , Sequência de Aminoácidos , Biologia Computacional , Sequência Conservada , Dissulfetos/química , Interações Hidrofóbicas e Hidrofílicas , Conformação Proteica , Estrutura Secundária de Proteína , Sais/química , Software
19.
Appl Bioinformatics ; 3(4): 237-40, 2004.
Artigo em Inglês | MEDLINE | ID: mdl-15702954

RESUMO

UNLABELLED: To date, more than 30 000 hepatitis C virus (HCV) sequences have been deposited in the generalist databases DNA Data Bank of Japan (DDBJ), EMBL Nucleotide Sequence Database (EMBL) and GenBank. The main difficulties with HCV sequences in these databases are their retrieval, annotation and analyses. To help HCV researchers face the increasing needs of HCV sequence analyses, we developed a specialised database of computer-annotated HCV sequences, called HCVDB. HCVDB is re-built every month from an up-to-date EMBL database by an automated process. HCVDB provides key data about the HCV sequences (e.g. genotype, genomic region, protein names and functions, known 3-dimensional structures) and ensures consistency of the annotations, which enables reliable keyword queries. The database is highly integrated with sequence and structure analysis tools and the SRS (LION bioscience) keywords query system. Thus, any user can extract subsets of sequences matching particular criteria or enter their own sequences and analyse them with various bioinformatics programs available on the same server. AVAILABILITY: HCVDB is available from http://hepatitis.ibcp.fr.


Assuntos
Bases de Dados Genéticas , Hepacivirus/química , Hepacivirus/metabolismo , Análise de Sequência/métodos , Interface Usuário-Computador , Proteínas Virais/química , Proteínas Virais/metabolismo , Sequência de Aminoácidos , Sequência de Bases , Mapeamento Cromossômico/métodos , Sistemas de Gerenciamento de Base de Dados , Documentação/métodos , Genoma Viral , Hepacivirus/genética , Armazenamento e Recuperação da Informação/métodos , Dados de Sequência Molecular , Proteínas Virais/genética
20.
Database (Oxford) ; 2014: bau013, 2014.
Artigo em Inglês | MEDLINE | ID: mdl-24608034

RESUMO

BCL2DB (http://bcl2db.ibcp.fr) is a database designed to integrate data on BCL-2 family members and BH3-only proteins. These proteins control the mitochondrial apoptotic pathway and probably many other cellular processes as well. This large protein group is formed by a family of pro-apoptotic and anti-apoptotic homologs that have phylogenetic relationships with BCL-2, and by a collection of evolutionarily and structurally unrelated proteins characterized by the presence of a region of local sequence similarity with BCL-2, termed the BH3 motif. BCL2DB is monthly built, thanks to an automated procedure relying on a set of homemade profile HMMs computed from seed reference sequences representative of the various BCL-2 homologs and BH3-only proteins. The BCL2DB entries integrate data from the Ensembl, Ensembl Genomes, European Nucleotide Archive and Protein Data Bank databases and are enriched with specific information like protein classification into orthology groups and distribution of BH motifs along the sequences. The Web interface allows for easy browsing of the site and fast access to data, as well as sequence analysis with generic and specific tools. BCL2DB provides a helpful and powerful tool to both 'BCL-2-ologists' and researchers working in the various fields of physiopathology. Database URL: http://bcl2db.ibcp.fr.


Assuntos
Bases de Dados de Proteínas , Família Multigênica , Proteínas Proto-Oncogênicas c-bcl-2/metabolismo , Sequência de Aminoácidos , Animais , Humanos , Internet , Anotação de Sequência Molecular , Dados de Sequência Molecular , Proteínas Proto-Oncogênicas c-bcl-2/química , Interface Usuário-Computador
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA