Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 21
Filtrar
Mais filtros








Base de dados
Intervalo de ano de publicação
2.
Meta Gene ; 4: 92-106, 2015 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-25941633

RESUMO

Tafazzin is a mitochondrial phospholipid transacylase, and its mutations cause Barth syndrome (BTHS). Human tafazzin gene produces four distinct alternatively spliced transcripts. To understand the molecular mechanisms of tafazzin deficiency, we performed an atomic resolution analysis of the influence of the BTHS mutations and of alternative splicing on the structure and function of tafazzin. From the three-dimensional (3D) homology modeling of tafazzin, we identified candidate amino acid residues that contribute to cardiolipin binding and to mitochondrial membrane associations that facilitate acyl-transfer reactions. Primate specific exon 5, which is alternatively spliced, is predicted to correspond to an intrinsically unstructured region in the protein. We proposed that this region should change the substrate-binding affinity and/or contribute to primate-specific molecular interactions. Exon 7, another alternatively spliced exon, encodes a region forming a part of the putative substrate-binding cleft, suggesting that the gene products lacking exon 7 will lose their substrate-binding ability. We demonstrate a clear localization of the BTHS mutations at residues responsible for membrane association, substrate binding, and the conformational stability of tafazzin. These findings provide new insights into the function of defective tafazzin and the pathogenesis of BTHS at the level of protein 3D structure and the evolution of alternatively spliced exons in primates.

3.
Bioinformatics ; 28(15): 2076-7, 2012 Aug 01.
Artigo em Inglês | MEDLINE | ID: mdl-22645168

RESUMO

SUMMARY: Alternative Splicing Effects ASsessment Tools (AS-EAST) is an online tool for the functional annotation of putative proteins encoded by transcripts generated by alternative splicing (AS). When provided with a transcript sequence, AS-EAST identifies regions altered by AS events in the putative protein sequence encoded by the transcript. Users can evaluate the predicted function of the putative protein by inspecting whether functional domains are included in the altered regions. Moreover, users can infer the loss of inter-molecular interactions in the protein network according to whether the AS events affect interaction residues observed in the 3D structure of the reference isoform. The information obtained from AS-EAST will help to design experimental analyses for the functional significance of novel splice isoforms. AVAILABILITY: The online tool is freely available at http://as-alps.nagahama-i-bio.ac.jp/ASEAST/. CONTACT: m_shionyu@nagahama-i-bio.ac.jp.


Assuntos
Processamento Alternativo , Proteínas/genética , Software , Sequência de Aminoácidos , Biologia Computacional/métodos , Isoformas de Proteínas/genética , Interface Usuário-Computador
4.
Proteins ; 79(6): 1868-77, 2011 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-21465562

RESUMO

In comparative modeling, the quality of amino acid sequence alignment still constitutes a major bottleneck in the generation of high quality models of protein three-dimensional (3D) structures. Substantial efforts have been made to improve alignment quality by revising the substitution matrix, introducing multiple sequences, replacing dynamic programming with hidden Markov models, and incorporating 3D structure information. Improvements in the gap penalty have not been a major focus, however, following the development of the affine gap penalty and of the secondary structure dependent gap penalty. We revisited the correlation between protein 3D structure and gap location in a large protein 3D structure data set, and found that the frequency of gap locations approximated to an exponential function of the solvent accessibility of the inserted residues. The nonlinearity of the gap frequency as a function of accessibility corresponded well to the relationship between residue mutation pattern and residue accessibility. By introducing this relationship into the gap penalty calculation for pairwise alignment between template and target amino acid sequences, we were able to obtain a sequence alignment much closer to the structural alignment. The quality of the alignments was substantially improved on a pair of sequences with identity in the "twilight zone" between 20 and 40%. The relocation of gaps by our new method made a significant improvement in comparative modeling, exemplified here by the Bacillus subtilis yitF protein. The method was implemented in a computer program, ALAdeGAP (ALignment with Accessibility dependent GAp Penalty), which is available at http://cib.cf.ocha.ac.jp/target_protein/.


Assuntos
Alinhamento de Sequência/métodos , Aminoácidos/química , Bacillus subtilis/química , Proteínas de Bactérias/química , Escherichia coli/química , Modelos Moleculares , Conformação Proteica , Solventes
5.
Plant Cell Physiol ; 50(11): 1865-73, 2009 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-19808808

RESUMO

Transcripts from mitochondrial and chloroplast DNA of land plants often undergo cytidine to uridine conversion-type RNA editing events. RESOPS is a newly built database that specializes in displaying RNA editing sites of land plant organelles on protein three-dimensional (3D) structures to help elucidate the mechanisms of RNA editing for gene expression regulation. RESOPS contains the following information: unedited and edited cDNA sequences with notes for the target nucleotides of RNA editing, conceptual translation from the edited cDNA sequence in pseudo-UniProt format, a list of proteins under the influence of RNA editing, multiple amino acid sequence alignments of edited proteins, the location of amino acid residues coded by codons under the influence of RNA editing in protein 3D structures and the statistics of biased distributions of the edited residues with respect to protein structures. Most of the data processing procedures are automated; hence, it is easy to keep abreast of updated genome and protein 3D structural data. In the RESOPS database, we clarified that the locations of residues switched by RNA editing are significantly biased to protein structural cores. The integration of different types of data in the database also help advance the understanding of RNA editing mechanisms. RESOPS is accessible at http://cib.cf.ocha.ac.jp/RNAEDITING/.


Assuntos
Bases de Dados de Proteínas , Genoma de Planta , Proteínas de Plantas/genética , Edição de RNA , Cloroplastos/genética , DNA de Cloroplastos/genética , DNA Complementar/genética , DNA Mitocondrial/genética , Mitocôndrias/genética , Conformação Proteica , Dobramento de Proteína , RNA de Plantas/genética , Alinhamento de Sequência , Interface Usuário-Computador
6.
Nucleic Acids Res ; 37(Database issue): D305-9, 2009 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-19015123

RESUMO

We have constructed a database, AS-ALPS (alternative splicing-induced alteration of protein structure), which provides information that would be useful for analyzing the effects of alternative splicing (AS) on protein structure, interactions with other bio-molecules and protein interaction networks in human and mouse. Several AS events have been revealed to contribute to the diversification of protein structure, which results in diversification of interaction partners or affinities, which in turn contributes to regulation of bio-molecular networks. Most AS variants, however, are only known at the sequence level. It is important to determine the effects of AS on protein structure and interaction, and to provide candidates for experimental targets that are relevant to network regulation by AS. For this purpose, the three-dimensional (3D) structures of proteins are valuable sources of information; however, these have not been fully exploited in any other AS-related databases. AS-ALPS is the only AS-related database that describes the spatial relationships between protein regions altered by AS ('AS regions') and both the proteins' hydrophobic cores and sites of inter-molecular interactions. This information makes it possible to infer whether protein structural stability and/or protein interaction are affected by each AS event. AS-ALPS can be freely accessed at http://as-alps.nagahama-i-bio.ac.jp and http://genomenetwork.nig.ac.jp/as-alps/.


Assuntos
Processamento Alternativo , Bases de Dados de Proteínas , Isoformas de Proteínas/química , Animais , Códon sem Sentido , Humanos , Interações Hidrofóbicas e Hidrofílicas , Camundongos , Domínios e Motivos de Interação entre Proteínas , Mapeamento de Interação de Proteínas , Isoformas de Proteínas/genética , Isoformas de Proteínas/metabolismo , Estabilidade de RNA
7.
BMC Plant Biol ; 8: 79, 2008 Jul 16.
Artigo em Inglês | MEDLINE | ID: mdl-18631376

RESUMO

BACKGROUND: In plant organelles, specific messenger RNAs (mRNAs) are subjected to conversion editing, a process that often converts the first or second nucleotide of a codon and hence the encoded amino acid. No systematic patterns in converted sites were found on mRNAs, and the converted sites rarely encoded residues located at the active sites of proteins. The role and origin of RNA editing in plant organelles remain to be elucidated. RESULTS: Here we study the relationship between amino acid residues encoded by edited codons and the structural characteristics of these residues within proteins, e.g., in protein-protein interfaces, elements of secondary structure, or protein structural cores. We find that the residues encoded by edited codons are significantly biased toward involvement in helices and protein structural cores. RNA editing can convert codons for hydrophilic to hydrophobic amino acids. Hence, only the edited form of an mRNA can be translated into a polypeptide with helix-preferring and core-forming residues at the appropriate positions, which is often required for a protein to form a functional three-dimensional (3D) structure. CONCLUSION: We have performed a novel analysis of the location of residues affected by RNA editing in proteins in plant organelles. This study documents that RNA editing sites are often found in positions important for 3D structure formation. Without RNA editing, protein folding will not occur properly, thus affecting gene expression. We suggest that RNA editing may have conferring evolutionary advantage by acting as a mechanism to reduce susceptibility to DNA damage by allowing the increase in GC content in DNA while maintaining RNA codons essential to encode residues required for protein folding and activity.


Assuntos
Proteínas de Plantas/genética , Edição de RNA , RNA Mensageiro/genética , RNA de Plantas/genética , Sequência de Aminoácidos , Códon/genética , Bases de Dados de Ácidos Nucleicos , Dados de Sequência Molecular , Organelas/genética , Dobramento de Proteína , Estrutura Secundária de Proteína , Estrutura Terciária de Proteína , Homologia de Sequência de Aminoácidos
8.
Biophysics (Nagoya-shi) ; 3: 1-12, 2007.
Artigo em Inglês | MEDLINE | ID: mdl-27857562

RESUMO

To understand how protein reduces the conformational space to be searched for the native structure, it is crucial to characterize ensembles of conformations on the way of folding processes, in particular ensembles of relatively long-range structures connecting between an extensively unfolded state and a state with a native-like overall chain topology. To analyze such intermediate conformations, we performed multiple unfolding molecular dynamics simulations of barnase at 498K. Some short-range structures such as part of helix and turn were well sustained while most of the secondary structures and the hydrophobic cores were eventually lost, which is consistent with the results by other experimental and computational studies. The most important novel findings were persistence of long-range relatively compact substructures, which was captured by exploiting the concept of module. Module is originally introduced to describe the hierarchical structure of a globular protein in the native state. Modules are conceptually such relatively compact substructures that are resulted from partitioning the native structure of a globular protein completely into several contiguous segments with the least extended conformations. We applied this concept of module to detect a possible hierarchical structure of each snapshot structure in unfolding processes as well. Along with this conceptual extension, such detected relatively compact substructures are named quasi-modules. We found almost perfect persistence of quasi-module boundaries that are positioned close to the native module boundaries throughout the unfolding trajectories. Relatively compact conformations of the quasi-modules seemed to be retained mainly by hydrophobic interactions formed between residues located at both terminal regions within each module. From these results, we propose a hypothesis that hierarchical folding with the early formation of quasi-modules effectively reduces search space for the native structure.

9.
J Struct Funct Genomics ; 7(2): 65-76, 2006 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-17146617

RESUMO

We have been developing FAMSBASE, a protein homology-modeling database of whole ORFs predicted from genome sequences. The latest update of FAMSBASE ( http://daisy.nagahama-i-bio.ac.jp/Famsbase/ ), which is based on the protein three-dimensional (3D) structures released by November 2003, contains modeled 3D structures for 368,724 open reading frames (ORFs) derived from genomes of 276 species, namely 17 archaebacterial, 130 eubacterial, 18 eukaryotic and 111 phage genomes. Those 276 genomes are predicted to have 734,193 ORFs in total and the current FAMSBASE contains protein 3D structure of approximately 50% of the ORF products. However, cases that a modeled 3D structure covers the whole part of an ORF product are rare. When portion of an ORF with 3D structure is compared in three kingdoms of life, in archaebacteria and eubacteria, approximately 60% of the ORFs have modeled 3D structures covering almost the entire amino acid sequences, however, the percentage falls to about 30% in eukaryotes. When annual differences in the number of ORFs with modeled 3D structure are calculated, the fraction of modeled 3D structures of soluble protein for archaebacteria is increased by 5%, and that for eubacteria by 7% in the last 3 years. Assuming that this rate would be maintained and that determination of 3D structures for predicted disordered regions is unattainable, whole soluble protein model structures of prokaryotes without the putative disordered regions will be in hand within 15 years. For eukaryotic proteins, they will be in hand within 25 years. The 3D structures we will have at those times are not the 3D structure of the entire proteins encoded in single ORFs, but the 3D structures of separate structural domains. Measuring or predicting spatial arrangements of structural domains in an ORF will then be a coming issue of structural genomics.


Assuntos
Bases de Dados de Proteínas , Genômica , Modelos Moleculares , Proteínas/química , Proteínas/genética , Proteoma/análise , Homologia Estrutural de Proteína , Sequência de Aminoácidos , Animais , Genoma Arqueal , Genoma Bacteriano , Genoma Viral , Humanos , Imageamento Tridimensional , Fases de Leitura Aberta , Conformação Proteica , Estrutura Secundária de Proteína , Estrutura Terciária de Proteína , Proteínas/fisiologia , Relação Estrutura-Atividade
10.
Nucleic Acids Res ; 34(14): 3917-28, 2006.
Artigo em Inglês | MEDLINE | ID: mdl-16914452

RESUMO

We report the first genome-wide identification and characterization of alternative splicing in human gene transcripts based on analysis of the full-length cDNAs. Applying both manual and computational analyses for 56,419 completely sequenced and precisely annotated full-length cDNAs selected for the H-Invitational human transcriptome annotation meetings, we identified 6877 alternative splicing genes with 18 297 different alternative splicing variants. A total of 37,670 exons were involved in these alternative splicing events. The encoded protein sequences were affected in 6005 of the 6877 genes. Notably, alternative splicing affected protein motifs in 3015 genes, subcellular localizations in 2982 genes and transmembrane domains in 1348 genes. We also identified interesting patterns of alternative splicing, in which two distinct genes seemed to be bridged, nested or having overlapping protein coding sequences (CDSs) of different reading frames (multiple CDS). In these cases, completely unrelated proteins are encoded by a single locus. Genome-wide annotations of alternative splicing, relying on full-length cDNAs, should lay firm groundwork for exploring in detail the diversification of protein function, which is mediated by the fast expanding universe of alternative splicing variants.


Assuntos
Processamento Alternativo , DNA Complementar/química , Genoma Humano , Proteínas/genética , RNA Mensageiro/química , Motivos de Aminoácidos , Sequência de Aminoácidos , Sequência de Bases , Biologia Computacional/métodos , Éxons , Variação Genética , Genômica/métodos , Humanos , Proteínas/química , Proteínas/fisiologia , RNA Mensageiro/metabolismo , Análise de Sequência de DNA
11.
Gene ; 380(2): 63-71, 2006 Oct 01.
Artigo em Inglês | MEDLINE | ID: mdl-16872759

RESUMO

Alternative splicing is a molecular mechanism that produces multiple proteins from a single gene, and is thought to produce variety in proteins translated from a limited number of genes. Here we analyzed how alternative splicing produced variety in protein structure and function, by using human full-length cDNAs on the assumption that all of the alternatively spliced mRNAs were translated to proteins. We found that the length of alternatively spliced amino acid sequences, in most cases, fell into a size shorter than that of average protein domain. We evaluated comprehensively the presumptive three-dimensional structures of the alternatively spliced products to assess the impact of alternative splicing on gene function. We found that more than half of the products encoded proteins which were involved in signal transduction, transcription and translation, and more than half of alternatively spliced regions comprised interaction sites between proteins and their binding partners, including substrates, DNA/RNA, and other proteins. Intriguingly, 67% of the alternatively spliced isoforms showed significant alterations to regions of the protein structural core, which likely resulted in large conformational change. Based on those findings, we speculate that there are a large number of cases that alternative splicing modulates protein networks through significant alteration in protein conformation.


Assuntos
Processamento Alternativo/fisiologia , Proteoma/química , RNA Mensageiro/metabolismo , Transcrição Gênica/fisiologia , Sequência de Aminoácidos , Receptores de Apelina , Biologia Computacional/métodos , DNA Complementar/análise , Proteínas de Ligação ao GTP/metabolismo , Humanos , Modelos Biológicos , Modelos Moleculares , Dados de Sequência Molecular , Ligação Proteica , Conformação Proteica , Mapeamento de Interação de Proteínas/métodos , Isoformas de Proteínas/química , Estrutura Terciária de Proteína , RNA Helicases/química , Receptores Acoplados a Proteínas G/química , Receptores Acoplados a Proteínas G/metabolismo , Transdução de Sinais
12.
Mol Biol Evol ; 23(5): 1085-94, 2006 May.
Artigo em Inglês | MEDLINE | ID: mdl-16520337

RESUMO

The serine/arginine-rich (SR) protein family plays an important role in constitutive and alternative splicing (AS). These proteins regulate AS in a tissue-specific and stress-responsive manner. Pre-mRNAs encoding SR proteins are often alternatively spliced, and these AS events may be important for the regulation of AS events of other pre-mRNAs. In this study, we analyzed AS events of SR proteins in Arabidopsis thaliana and Oryza sativa (rice). We found three sets of AS events conserved between Arabidopsis and rice. These conserved AS events were found in the plant-novel-SR protein, SC35-like (SCL), and two-Zn-knuckles-type 9G8 subfamilies. Each member of these subfamilies has at least one RNA recognition motif (RRM) and at least one intron in the RRM-encoded region. We found that the conserved AS events occurred in these introns and, in each case, the conserved AS events resulted in mature mRNAs encoding proteins with incomplete RRMs. To search for the evolutionary origin of these AS events, we analyzed SR proteins in Physcomitrella patens (moss) in addition to those in Arabidopsis and rice. We found moss homologues of the plant-novel-SR protein, SCL, and the two-Zn-knuckles-type 9G8 subfamilies in silico, and these homologues have long introns at the same location of the conserved AS sites in Arabidopsis and rice. Such long introns are quite specific for alternatively spliced introns concerning the Arabidopsis SR protein genes. The long introns found in the moss SR protein genes strongly suggested that conserved AS events in moss SR protein genes might be similar to those in Arabidopsis and rice. We traced the evolutionary origin of the conserved AS events to 400 MYA, when plants first invaded land. These events are likely important in the regulation of whole AS events and likely contribute to the complicated transcriptome described by AS. The complicated transcriptome created by regulated AS events might have provided plants tolerance against droughts or temperature shifts and given them the ability to live on land.


Assuntos
Processamento Alternativo , Arginina/química , Plantas/metabolismo , Serina/química , Sequência de Aminoácidos , Arabidopsis/metabolismo , Briófitas , Evolução Molecular , Éxons , Íntrons , Dados de Sequência Molecular , Oryza/metabolismo , Filogenia , Estrutura Terciária de Proteína , RNA Mensageiro/metabolismo , Homologia de Sequência de Aminoácidos
13.
Protein Eng Des Sel ; 19(2): 67-75, 2006 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-16403825

RESUMO

Protein structure data in the PDB (Protein Data Bank) were used to construct empirical scores of nucleotide-protein interactions. A simple strategy to evaluate the spatial distribution of protein atoms around the base moieties of nucleotides was applied to categorize adenine, guanine, nicotinamide and flavin nucleotide-binding sites. In addition to the known nucleotide-binding motifs, the empirical scores detected several other features that were shared among proteins with different folds. The empirical scores were also used to predict the binding sites on protein molecules and a comprehensive test of the prediction system was performed. As a result, adenine, guanine, nicotinamide and flavin sites were detected with efficiencies of 31, 29, 32 and 40%, respectively. The predictions were judged to be successful if the predicted base with the best score was located within a 3.0 A r.m.s.d. from the known ligand positions.


Assuntos
Nucleotídeos/metabolismo , Proteínas/química , Proteínas/metabolismo , Motivos de Aminoácidos , Sítios de Ligação , Simulação por Computador , Bases de Dados de Proteínas , Ligantes , Modelos Químicos , Modelos Moleculares , Dobramento de Proteína
14.
Protein J ; 24(5): 315-25, 2005 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-16284729

RESUMO

Guanylyl cyclase C (GC-C) is a single-transmembrane receptor that is specifically activated by endogenous ligands, including guanylin, and the exogenous ligand, heat-stable enterotoxin. Using combined HPLC separation and MS analysis techniques the positions of the disulfide linkages in the extracellular ligand-binding domain (ECD) of GC-C were determined to be between Cys7-Cys94, Cys72-Cys77, Cys101-Cys128 and Cys179-Cys226. Furthermore, a three-dimensional structural model of the ECD was constructed by homology modeling, using the structure of the ECD of GC-A as a template (van den Akker et al., 2000, Nature, 406: 101-104) and the information of the disulfide linkages. Although the GC-C model was similar to the known structure of GC-A, importantly its ligand-binding site appears to be located on the quite different region from that in GC-A.


Assuntos
Guanilato Ciclase/química , Guanilato Ciclase/genética , Receptores de Peptídeos/química , Receptores de Peptídeos/genética , Sequência de Aminoácidos , Sítios de Ligação , Brometo de Cianogênio/farmacologia , Dissulfetos/química , Ligantes , Modelos Moleculares , Dados de Sequência Molecular , Mapeamento de Peptídeos , Estrutura Terciária de Proteína , Receptores de Enterotoxina , Receptores Acoplados a Guanilato Ciclase , Homologia de Sequência de Aminoácidos
15.
Proc Natl Acad Sci U S A ; 101(38): 13927-32, 2004 Sep 21.
Artigo em Inglês | MEDLINE | ID: mdl-15347812

RESUMO

In the cyanobacterium Synechococcus elongatus PCC 7942, KaiA, KaiB, and KaiC are essential proteins for the generation of a circadian rhythm. KaiC is proposed as a negative regulator of the circadian expression of all genes in the genome, and its phosphorylation is regulated positively by KaiA and negatively by KaiB and shows a circadian rhythm in vivo. To study the functions of KaiC phosphorylation in the circadian clock system, we identified two autophosphorylation sites, Ser-431 and Thr-432, by using mass spectrometry (MS). We generated Synechococcus mutants in which these residues were substituted for alanine by using site-directed mutagenesis. Phosphorylation of KaiC was reduced in the single mutants and was completely abolished in the double mutant, indicating that KaiC is also phosphorylated at these sites in vivo. These mutants lost circadian rhythm, indicating that phosphorylation at each of the two sites is essential for the control of the circadian oscillation. Although the nonphosphorylatable mutant KaiC was able to form a hexamer in vitro, it failed to form a clock protein complex with KaiA, KaiB, and SasA in the Synechococcus cells. When nonphosphorylatable KaiC was overexpressed, the kaiBC promoter activity was only transiently repressed. These results suggest that KaiC phosphorylation regulates its transcriptional repression activity by controlling its binding affinity for other clock proteins.


Assuntos
Proteínas de Bactérias/metabolismo , Relógios Biológicos/fisiologia , Ritmo Circadiano/fisiologia , Cianobactérias/fisiologia , Proteínas de Bactérias/genética , Sequência de Bases , Sítios de Ligação , Peptídeos e Proteínas de Sinalização do Ritmo Circadiano , Primers do DNA , Regulação Bacteriana da Expressão Gênica/fisiologia , Fosforilação , Espectrometria de Massas por Ionização por Electrospray
16.
J Biol Chem ; 279(25): 26619-26, 2004 Jun 18.
Artigo em Inglês | MEDLINE | ID: mdl-15078885

RESUMO

The catalytic domain of xylanases belonging to glycoside hydrolase family 10 (GH10) can be divided into 22 modules (M1 to M22; Sato, Y., Niimura, Y., Yura, K., and Go, M. (1999) Gene (Amst.) 238, 93-101). Inspection of the crystal structure of a GH10 xylanase from Streptomyces olivaceoviridis E-86 (SoXyn10A) revealed that the catalytic domain of GH10 xylanases can be dissected into two parts, an N-terminal larger region and C-terminal smaller region, by the substrate binding cleft, corresponding to the module border between M14 and M15. It has been suggested that the topology of the substrate binding clefts of GH10 xylanases are not conserved (Charnock, S. J., Spurway, T. D., Xie, H., Beylot, M. H., Virden, R., Warren, R. A. J., Hazlewood, G. P., and Gilbert, H. J. (1998) J. Biol. Chem. 273, 32187-32199). To facilitate a greater understanding of the structure-function relationship of the substrate binding cleft of GH10 xylanases, a chimeric xylanase between SoXyn10A and Xyn10A from Cellulomonas fimi (CfXyn10A) was constructed, and the topology of the hybrid substrate binding cleft established. At the three-dimensional level, SoXyn10A and CfXyn10A appear to possess 5 subsites, with the amino acid residues comprising subsites -3 to +1 being well conserved, although the +2 subsites are quite different. Biochemical analyses of the chimeric enzyme along with SoXyn10A and CfXyn10A indicated that differences in the structure of subsite +2 influence bond cleavage frequencies and the catalytic efficiency of xylooligosaccharide hydrolysis. The hybrid enzyme constructed in this study displays fascinating biochemistry, with an interesting combination of properties from the parent enzymes, resulting in a low production of xylose.


Assuntos
Cellulomonas/metabolismo , Endo-1,4-beta-Xilanases/química , Streptomyces/enzimologia , Domínio Catalítico , Escherichia coli/metabolismo , Hidrólise , Cinética , Modelos Biológicos , Modelos Moleculares , Ligação Proteica , Conformação Proteica , Estrutura Terciária de Proteína , Proteínas Recombinantes de Fusão/química , Relação Estrutura-Atividade , Especificidade por Substrato , Fatores de Tempo , Difração de Raios X , Xilanos/química
17.
J Biochem ; 135(1): 79-84, 2004 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-14999012

RESUMO

The genomes of more than 100 species have been sequenced, and the biological functions of encoded proteins are now actively being researched. Protein function is based on interactions between proteins and other molecules. One approach to assuming protein function based on genomic sequence is to predict interactions between an encoded protein and other molecules. As a data source for such predictions, knowledge regarding known protein-small molecule interactions needs to be compiled. We have, therefore, surveyed interactions between proteins and other molecules in Protein Data Bank (PDB), the protein three-dimensional (3D) structure database. Among 20,685 entries in PDB (April, 2003), 4,189 types of small molecules were found to interact with proteins. Biologically relevant small molecules most often found in PDB were metal ions, such as calcium, zinc, and magnesium. Sugars and nucleotides were the next most common. These molecules are known to act as cofactors for enzymes and/or stabilizers of proteins. In each case of interactions between a protein and small molecule, we found preferred amino acid residues at the interaction sites. These preferences can be the basis for predicting protein function from genomic sequence and protein 3D structures. The data pertaining to these small molecules were collected in a database named Het-PDB Navi., which is freely available at http://daisy.nagahama-i-bio.ac.jp/golab/hetpdbnavi.html and linked to the official PDB home page.


Assuntos
Bases de Dados de Proteínas , Proteínas/química , Proteínas/metabolismo , Trifosfato de Adenosina/metabolismo , Sítios de Ligação/fisiologia , Bases de Dados de Proteínas/classificação , Fucose/metabolismo , Galactose/metabolismo , Glucosamina/metabolismo , Glucose/metabolismo , Guanosina Trifosfato/metabolismo , Ligantes , Manose/metabolismo , Metais/metabolismo , Estrutura Terciária de Proteína
18.
J Biol Chem ; 278(42): 41205-12, 2003 Oct 17.
Artigo em Inglês | MEDLINE | ID: mdl-12888576

RESUMO

The proteoglycan aggregate is the major structural component of the cartilage matrix, comprising hyaluronan (HA), link protein (LP), and a large chondroitin sulfate (CS) proteoglycan, aggrecan. Here, we found that another member of aggrecan family, versican, biochemically binds to both HA and LP. Functional analyses of recombinant looped domains (subdomains) A, B, and B' of the N-terminal G1 domain revealed that the B-B' segment of versican is adequate for binding to HA and LP, whereas A and B-B' of aggrecan bound to LP and HA, respectively. BIAcore trade mark analyses showed that the A subdomain of versican G1 enhances HA binding but has a negligible effect on LP binding. Overlay sensorgrams demonstrated that versican G1 or its B-B' segment forms a complex with both HA and LP. We generated a molecular model of the B-B' segment, in which a deletion and an insertion of B' and B are critical for stable structure and HA binding. These results provide important insights into the mechanisms of formation of the proteoglycan aggregate and HA binding of molecules containing the link module.


Assuntos
Proteoglicanas de Sulfatos de Condroitina/metabolismo , Ácido Hialurônico/metabolismo , Sequência de Aminoácidos , Animais , Moléculas de Adesão Celular/química , Linhagem Celular , Proteoglicanas de Sulfatos de Condroitina/química , Sulfatos de Condroitina/metabolismo , Vetores Genéticos , Humanos , Receptores de Hialuronatos/química , Ácido Hialurônico/química , Immunoblotting , Cinética , Lectinas Tipo C , Camundongos , Modelos Moleculares , Dados de Sequência Molecular , Testes de Precipitina , Ligação Proteica , Estrutura Terciária de Proteína , Proteínas Recombinantes/metabolismo , Homologia de Sequência de Aminoácidos , Ressonância de Plasmônio de Superfície , Fatores de Tempo , Versicanas
19.
J Biol Chem ; 278(32): 30235-47, 2003 Aug 08.
Artigo em Inglês | MEDLINE | ID: mdl-12761225

RESUMO

Chondroitin sulfate is found in a variety of tissues as proteoglycans and consists of repeating disaccharide units of N-acetylgalactosamine and glucuronic acid residues with sulfate residues at various places. We found a novel human gene (GenBank accession number AB086063) that possesses a sequence homologous with the human chondroitin sulfate glucuronyltransferase gene which we recently cloned and characterized. The full-length open reading frame encodes a typical type II membrane protein comprising 775 amino acids. The protein had a domain containing beta 3-glycosyltransferase motif but lacked a typical beta 4-glycosyltransferase motif, which is the same as chondroitin sulfate glucuronyltransferase, whereas chondroitin synthase had both domains. The putative catalytic domain was expressed in COS-7 cells as a soluble enzyme. Surprisingly, both glucuronyltransferase and N-acetylgalactosaminyltransferase activities were observed when chondroitin, chondroitin sulfate, and their oligosaccharides were used as the acceptor substrates. The reaction products were identified to have the linkage of GlcUA beta 1-3GalNAc and GalNAc beta 1-4GlcUA at the non-reducing terminus of chondroitin for glucuronyltransferase activity and N-acetylgalactosaminyltransferase activity, respectively. Quantitative real time PCR analysis revealed that the transcripts were ubiquitously expressed in various human tissues but highly expressed in the pancreas, ovary, placenta, small intestine, and stomach. These results indicate that this enzyme could synthesize chondroitin sulfate chains as a chondroitin sulfate synthase that has both glucuronyltransferase and N-acetylgalactosaminyltransferase activities. Sequence analysis based on three-dimensional structure revealed the presence of not typical but significant beta 4-glycosyltransferase architecture.


Assuntos
Sulfatos de Condroitina/química , Hexosiltransferases/química , Hexosiltransferases/genética , Motivos de Aminoácidos , Sequência de Aminoácidos , Aminoácidos , Animais , Células COS , Cátions , Bovinos , Divisão Celular , Condroitina/química , Mapeamento Cromossômico , Clonagem Molecular , DNA Complementar/metabolismo , Dissacarídeos/química , Relação Dose-Resposta a Droga , Epitopos , Vetores Genéticos , Ácido Glucurônico/química , Glucuronosiltransferase/metabolismo , Glicosiltransferases/metabolismo , Hexosiltransferases/metabolismo , Humanos , Concentração de Íons de Hidrogênio , Magnésio/farmacologia , Modelos Moleculares , Dados de Sequência Molecular , Monossacarídeos , Oligossacarídeos/química , Fases de Leitura Aberta , Ligação Proteica , Estrutura Terciária de Proteína , Proteínas Recombinantes/metabolismo , Reação em Cadeia da Polimerase Via Transcriptase Reversa , Homologia de Sequência de Aminoácidos , Especificidade por Substrato , Distribuição Tecidual , Difosfato de Uridina/farmacologia
20.
Nucleic Acids Res ; 31(1): 463-8, 2003 Jan 01.
Artigo em Inglês | MEDLINE | ID: mdl-12520053

RESUMO

Enlarged FAMSBASE is a relational database of comparative protein structure models for the whole genome of 41 species, presented in the GTOP database. The models are calculated by Full Automatic Modeling System (FAMS). Enlarged FAMSBASE provides a wide range of query keys, such as name of ORF (open reading frame), ORF keywords, Protein Data Bank (PDB) ID, PDB heterogen atoms and sequence similarity. Heterogen atoms in PDB include cofactors, ligands and other factors that interact with proteins, and are a good starting point for analyzing interactions between proteins and other molecules. The data may also work as a template for drug design. The present number of ORFs with protein 3D models in FAMSBASE is 183 805, and the database includes an average of three models for each ORF. FAMSBASE is available at http://famsbase.bio.nagoya-u.ac.jp/famsbase/.


Assuntos
Bases de Dados de Proteínas , Modelos Moleculares , Homologia Estrutural de Proteína , Animais , Interpretação Estatística de Dados , Genoma , Imageamento Tridimensional , Fases de Leitura Aberta , Proteínas/química , Reprodutibilidade dos Testes
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA