Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 24
Filtrar
1.
Mol Genet Genomics ; 291(5): 1851-69, 2016 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-27290643

RESUMEN

Short Tandem Repeats (STRs) comprise repeats of one to several base pairs. Because of the high mutability due to strand slippage during DNA synthesis, rapid evolutionary change in the number of repeating units directly shapes the range of repeat-number variation according to selection pressure. However, the remaining questions include: Why are STRs causing repeat expansion diseases maintained in the human population; and why are these limited to neurodegenerative diseases? By evaluating the genome-wide selection pressure on STRs using the database we constructed, we identified two different patterns of relationship in repeat-number polymorphisms between DNA and amino-acid sequences, although both patterns are evolutionary consequences of avoiding the formation of harmful long STRs. First, a mixture of degenerate codons is represented in poly-proline (poly-P) repeats. Second, long poly-glutamine (poly-Q) repeats are favored at the protein level; however, at the DNA level, STRs encoding long poly-Qs are frequently divided by synonymous SNPs. Furthermore, significant enrichments of apoptosis and neurodevelopment were biological processes found specifically in genes encoding poly-Qs with repeat polymorphism. This suggests the existence of a specific molecular function for polymorphic and/or long poly-Q stretches. Given that the poly-Qs causing expansion diseases were longer than other poly-Qs, even in healthy subjects, our results indicate that the evolutionary benefits of long and/or polymorphic poly-Q stretches outweigh the risks of long CAG repeats predisposing to pathological hyper-expansions. Molecular pathways in neurodevelopment requiring long and polymorphic poly-Q stretches may provide a clue to understanding why poly-Q expansion diseases are limited to neurodegenerative diseases.


Asunto(s)
Enfermedad/genética , Repeticiones de Microsatélite , Neurogénesis , Evolución Molecular , Genoma Humano , Humanos , Selección Genética , Análisis de Secuencia de ADN , Análisis de Secuencia de Proteína
2.
Nucleic Acids Res ; 41(Database issue): D915-9, 2013 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-23197657

RESUMEN

H-InvDB (http://www.h-invitational.jp/) is a comprehensive human gene database started in 2004. In the latest version, H-InvDB 8.0, a total of 244 709 human complementary DNA was mapped onto the hg19 reference genome and 43 829 gene loci, including nonprotein-coding ones, were identified. Of these loci, 35 631 were identified as potential protein-coding genes, and 22 898 of these were identical to known genes. In our analysis, 19 309 annotated genes were specific to H-InvDB and not found in RefSeq and Ensembl. In fact, 233 genes of the 19 309 turned out to have protein functions in this version of H-InvDB; they were annotated as unknown protein functions in the previous version. Furthermore, 11 genes were identified as known Mendelian disorder genes. It is advantageous that many biologically functional genes are hidden in the H-InvDB unique genes. As large-scale proteomic projects have been conducted to elucidate the functions of all human proteins, we have enhanced the proteomic information with an advanced protein view and new subdatabase of protein complexes (Protein Complex Database with quality index). We propose that H-InvDB is an important resource for finding novel candidate targets for medical care and drug development.


Asunto(s)
Bases de Datos Genéticas , Genes , Transcriptoma , Genoma Humano , Humanos , Internet , Anotación de Secuencia Molecular , Proteínas/genética , Proteómica
3.
J Proteome Res ; 12(1): 62-6, 2013 Jan 04.
Artículo en Inglés | MEDLINE | ID: mdl-23245335

RESUMEN

H-Invitational Database (H-InvDB; http://hinv.jp/ ) is an integrated database of all human genes and transcripts that started in an international collaborative research project for establishing a functional annotation database of human full-length cDNAs. Because H-InvDB contains an abundance of information for human transcripts, including not only well-characterized protein-coding transcripts but also those without experimental evidence at the protein level, this will be a useful information resource for identifying novel and uncharacterized human proteins (so-called missing proteins). By extending predicted protein data in H-InvDB, we developed the H-Inv Extended Protein Database (H-EPD; http://hinv.jp/hinv/h-epd/ ). From now on, we plan to carry out a database-driven proteome research that makes full use of H-EPD to promote discoveries in the current and future C-HPP. Furthermore, we will push forward with the integration of genome, transcriptome, and proteome databases using a unique tool for connecting distributed databases and would like to develop a knowledge discovery system by incorporating data mining tools.


Asunto(s)
ADN Complementario , Perfilación de la Expresión Génica , Proteínas , Proteoma , ADN Complementario/genética , ADN Complementario/metabolismo , Bases de Datos Factuales , Expresión Génica , Genoma Humano , Proyecto Genoma Humano , Humanos , Espectrometría de Masas , Proteínas/clasificación , Proteínas/genética , Proteínas/metabolismo
4.
Nucleic Acids Res ; 39(Database issue): D807-14, 2011 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-21071393

RESUMEN

The Ciona intestinalis protein database (CIPRO) is an integrated protein database for the tunicate species C. intestinalis. The database is unique in two respects: first, because of its phylogenetic position, Ciona is suitable model for understanding vertebrate evolution; and second, the database includes original large-scale transcriptomic and proteomic data. Ciona intestinalis has also been a favorite of developmental biologists. Therefore, large amounts of data exist on its development and morphology, along with a recent genome sequence and gene expression data. The CIPRO database is aimed at collecting those published data as well as providing unique information from unpublished experimental data, such as 3D expression profiling, 2D-PAGE and mass spectrometry-based large-scale analyses at various developmental stages, curated annotation data and various bioinformatic data, to facilitate research in diverse areas, including developmental, comparative and evolutionary biology. For medical and evolutionary research, homologs in humans and major model organisms are intentionally included. The current database is based on a recently developed KH model containing 36,034 unique sequences, but for higher usability it covers 89,683 all known and predicted proteins from all gene models for this species. Of these sequences, more than 10,000 proteins have been manually annotated. Furthermore, to establish a community-supported protein database, these annotations are open to evaluation by users through the CIPRO website. CIPRO 2.5 is freely accessible at http://cipro.ibio.jp/2.5.


Asunto(s)
Ciona intestinalis/metabolismo , Bases de Datos de Proteínas , Proteoma/metabolismo , Secuencia de Aminoácidos , Animales , Ciona intestinalis/genética , Ciona intestinalis/crecimiento & desarrollo , Biología Computacional , Gráficos por Computador , Perfilación de la Expresión Génica , Genómica , Anotación de Secuencia Molecular , Proteoma/química , Proteoma/genética , Proteómica , Integración de Sistemas , Interfaz Usuario-Computador
5.
Nucleic Acids Res ; 39(Database issue): D7-10, 2011 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-21097465

RESUMEN

The present article proposes the adoption of a community-defined, uniform, generic description of the core attributes of biological databases, BioDBCore. The goals of these attributes are to provide a general overview of the database landscape, to encourage consistency and interoperability between resources and to promote the use of semantic and syntactic standards. BioDBCore will make it easier for users to evaluate the scope and relevance of available resources. This new resource will increase the collective impact of the information present in biological databases.


Asunto(s)
Bases de Datos Factuales/normas , Difusión de la Información
6.
Genomics ; 99(1): 1-9, 2012 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-22019378

RESUMEN

Complex diseases result from contributions of multiple genes that act in concert through pathways. Here we present a method to prioritize novel candidates of disease-susceptibility genes depending on the biological similarities to the known disease-related genes. The extent of disease-susceptibility of a gene is prioritized by analyzing seven features of human genes captured in H-InvDB. Taking rheumatoid arthritis (RA) and prostate cancer (PC) as two examples, we evaluated the efficiency of our method. Highly scored genes obtained included TNFSF12 and OSM as candidate disease genes for RA and PC, respectively. Subsequent characterization of these genes based upon an extensive literature survey reinforced the validity of these highly scored genes as possible disease-susceptibility genes. Our approach, Prioritization ANalysis of Disease Association (PANDA), is an efficient and cost-effective method to narrow down a large set of genes into smaller subsets that are most likely to be involved in the disease pathogenesis.


Asunto(s)
Estudios de Asociación Genética/métodos , Predisposición Genética a la Enfermedad/genética , Genómica/métodos , Artritis Reumatoide/genética , Análisis Costo-Beneficio , Citocina TWEAK , Minería de Datos , Estudios de Asociación Genética/economía , Humanos , Masculino , Oncostatina M/genética , Neoplasias de la Próstata/genética , Factores de Necrosis Tumoral/genética
7.
J Man Manip Ther ; 31(5): 368-375, 2023 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-37052492

RESUMEN

BACKGROUND: Selective deep neck flexor muscle activation through craniocervical flexion exercises has been considered to be different from cervical retraction exercises. OBJECTIVE: To compare the immediate analgesic effect of craniocervical flexion versus cervical retraction exercises in individuals with nonacute, directional preference (DP) for cervical retraction or extension. METHODS: A two-arm, assessor-blinded, pretest-posttest randomized experiment was conducted. Participants were randomly assigned to either craniocervical flexion or cervical retraction exercises and those who were confirmed at the post-intervention examination to have a DP for cervical retraction or extension were analyzed. The primary outcome measure was pressure pain thresholds at the C2 and C5-C6 levels. RESULTS: A total of 10 (mean age = 20.6 years) and nine participants (mean age = 19.4 years) undertook craniocervical flexion and retraction exercises, respectively. One-way analysis of variance demonstrated no statistically significant (p > 0.05) interaction effect regardless of the neck level. In the pre-post change percentages, retraction exercises provided greater analgesic effects compared to craniocervical flexion exercises at the C2 (Hedges' g = 0.679) and C5-C6 levels (g = 0.637). CONCLUSION: This study showed a comparable or greater immediate neck analgesic effect from cervical retraction exercises compared to craniocervical flexion exercises in individuals with a DP for cervical retraction or extension.

8.
Res Involv Engagem ; 9(1): 107, 2023 Nov 29.
Artículo en Inglés | MEDLINE | ID: mdl-38031179

RESUMEN

BACKGROUND: Although stakeholder involvement in policymaking is attracting attention in the fields of medicine and healthcare, a practical methodology has not yet been established. Rare-disease policy, specifically research priority setting for the allocation of limited research resources, is an area where evidence generation through stakeholder involvement is expected to be effective. We generated evidence for rare-disease policymaking through stakeholder involvement and explored effective collaboration among stakeholders. METHODS: We constructed a space called 'Evidence-generating Commons', where patients, family members, researchers, and former policymakers can share their knowledge and experiences and engage in continual deliberations on evidence generation. Ten rare diseases were consequently represented. In the 'Commons', 25 consecutive workshops were held predominantly online, from 2019 to 2021. These workshops focused on (1) clarification of difficulties faced by rare-disease patients, (2) development and selection of criteria for priority setting, and (3) priority setting through the application of the criteria. For the first step, an on-site workshop using sticky notes was held. The data were analysed based on KJ method. For the second and third steps, workshops on specific themes were held to build consensus. The workshop agendas and methods were modified based on participants' feedback. RESULTS: The 'Commons' was established with 43 participants, resulting in positive effects such as capacity building, opportunities for interactions, mutual understanding, and empathy among the participants. The difficulties faced by patients with rare diseases were classified into 10 categories. Seven research topics were identified as priority issues to be addressed including 'impediments to daily life', 'financial burden', 'anxiety', and 'burden of hospital visits'. This was performed by synthesising the results of the application of the two criteria that were particularly important to strengthen future research on rare diseases. We also clarified high-priority research topics by using criteria valued more by patients and family members than by researchers and former policymakers, and criteria with specific perspectives. CONCLUSION: We generated evidence for policymaking in the field of rare diseases. This study's insights into stakeholder involvement can enhance evidence-informed policymaking. We engaged in comprehensive discussions with policymakers regarding policy implementation and planned analysis of the participants' experiences in this project.


Stakeholder involvement is significant for effective policymaking in the field of rare diseases. However, practical methods for this involvement have not yet been established. Therefore, we developed the 'Commons project' to generate valuable policymaking information and explore effective ways for stakeholders' collaboration. This article explains the process and results of 25 continuous workshops, held from 2019 to 2021 with 43 participants, including patients, family members, researchers, and former policymakers. The main achievements of the discussion that took place in the 'Commons' included a presentation of the overview of the difficulties faced by patients with rare diseases and formulation of high priority research topics.First, the difficulties faced by patients with rare diseases were grouped into 10 categories. Second, seven research topics were identified as priority issues including 'impediments to daily life', 'financial burden', 'anxiety', and 'burden of hospital visits'. During the project process, positive effects such as capacity building, opportunities for interactions, mutual understanding, and empathy among the participants, were identified. Beyond the context of the field of rare diseases and science of policy, these findings are useful for the future of society, including co-creation among stakeholders and patient and public involvement. Based on this study's results, we have initiated communications with policy stakeholders in the field of rare diseases, with the aim of policy implementation.

9.
Nucleic Acids Res ; 38(Database issue): D626-32, 2010 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-19933760

RESUMEN

We report the extended database and data mining resources newly released in the H-Invitational Database (H-InvDB; http://www.h-invitational.jp/). H-InvDB is a comprehensive annotation resource of human genes and transcripts, and consists of two main views and six sub-databases. The latest release of H-InvDB (release 6.2) provides the annotation for 219,765 human transcripts in 43,159 human gene clusters based on human full-length cDNAs and mRNAs. H-InvDB now provides several new annotation features, such as mapping of microarray probes, new gene models, relation to known ncRNAs and information from the Glycogene database. H-InvDB also provides useful data mining resources-'Navigation search', 'H-InvDB Enrichment Analysis Tool (HEAT)' and web service APIs. 'Navigation search' is an extended search system that enables complicated searches by combining 16 different search options. HEAT is a data mining tool for automatically identifying features specific to a given human gene set. HEAT searches for H-InvDB annotations that are significantly enriched in a user-defined gene set, as compared with the entire H-InvDB representative transcripts. H-InvDB now has web service APIs of SOAP and REST to allow the use of H-InvDB data in programs, providing the users extended data accessibility.


Asunto(s)
Biología Computacional/métodos , Bases de Datos Genéticas , Bases de Datos de Ácidos Nucleicos , Familia de Multigenes , Biología Computacional/tendencias , ADN Complementario/metabolismo , Genoma Humano , Humanos , Almacenamiento y Recuperación de la Información/métodos , Internet , Modelos Genéticos , Análisis de Secuencia por Matrices de Oligonucleótidos , Estructura Terciaria de Proteína , ARN Mensajero/metabolismo , Programas Informáticos , Interfaz Usuario-Computador
10.
Nucleic Acids Res ; 37(Database issue): D810-5, 2009 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-18953038

RESUMEN

Creation of a vast variety of proteins is accomplished by genetic variation and a variety of alternative splicing transcripts. Currently, however, the abundant available data on genetic variation and the transcriptome are stored independently and in a dispersed fashion. In order to provide a research resource regarding the effects of human genetic polymorphism on various transcripts, we developed VarySysDB, a genetic polymorphism database based on 187,156 extensively annotated matured mRNA transcripts from 36,073 loci provided by H-InvDB. VarySysDB offers information encompassing published human genetic polymorphisms for each of these transcripts separately. This allows comparisons of effects derived from a polymorphism on different transcripts. The published information we analyzed includes single nucleotide polymorphisms and deletion-insertion polymorphisms from dbSNP, copy number variations from Database of Genomic Variants, short tandem repeats and single amino acid repeats from H-InvDB and linkage disequilibrium regions from D-HaploDB. The information can be searched and retrieved by features, functions and effects of polymorphisms, as well as by keywords. VarySysDB combines two kinds of viewers, GBrowse and Sequence View, to facilitate understanding of the positional relationship among polymorphisms, genome, transcripts, loci and functional domains. We expect that VarySysDB will yield useful information on polymorphisms affecting gene expression and phenotypes. VarySysDB is available at http://h-invitational.jp/varygene/.


Asunto(s)
Empalme Alternativo , Bases de Datos de Ácidos Nucleicos , Polimorfismo Genético , ARN Mensajero/química , Humanos , Polimorfismo de Nucleótido Simple , Secuencias Repetitivas de Ácidos Nucleicos , Interfaz Usuario-Computador
11.
Res Involv Engagem ; 7(1): 9, 2021 Feb 01.
Artículo en Inglés | MEDLINE | ID: mdl-33526087

RESUMEN

BACKGROUND: The role of patients in medical research is changing, as more emphasis is being placed on patient involvement, and patient reported outcomes are increasingly contributing to clinical decision-making. Information and communication technology provides new opportunities for patients to actively become involved in research. These trends are particularly noticeable in Europe and the US, but less obvious in Japan. The aim of this study was to investigate the practice of active involvement of patients in medical research in Japan by utilizing a digital platform, and to analyze the outcomes to clarify what specific approaches could be put into practice. METHODS: We developed the RUDY JAPAN system, an ongoing rare disease medical research platform, in collaboration with the Rare and Undiagnosed Diseases Study (RUDY) project in the UK. After 2 years of preparation, RUDY JAPAN was launched in December 2017. Skeletal muscle channelopathies were initially selected as target diseases, and hereditary angioedema was subsequently added. Several approaches for active patient involvement were designed through patient-researcher collaboration, namely the Steering Committee, questionnaire development, dynamic consent, and other communication strategies. We analyzed our practices and experiences focusing on how each approach affected and contributed to the research project. RESULTS: RUDY JAPAN has successfully involved patients in this research project in various ways. While not a part of the initial decision-making phase to launch the project, patients have increasingly been involved since then. A high level of patient involvement was achieved through the Steering Committee, a governance body that has made a major contribution to RUDY JAPAN, and the process of the questionnaire development. The creation of the Patient Network Forum, website and newsletter cultivated dialogue between patients and researchers. The registry itself allowed patient participation through data input and control of data usage through dynamic consent. CONCLUSIONS: We believe the initial outcomes demonstrate the feasibility and utility of active patient involvement in Japan. The collaboration realized through RUDY JAPAN was enabled by digital technologies. It allowed busy patients and researchers to find the space to meet and work together for the Steering Committee, questionnaire development and various communication activities. While the practice of active patient involvement in Japan is still in its early stages, this research confirms its viability if the right conditions are in place. (331 words).

12.
Nucleic Acids Res ; 36(Database issue): D787-92, 2008 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-17982176

RESUMEN

Orthologs are genes in different species that evolved from a common ancestral gene by speciation. Currently, with the rapid growth of transcriptome data of various species, more reliable orthology information is prerequisite for further studies. However, detection of orthologs could be erroneous if pairwise distance-based methods, such as reciprocal BLAST searches, are utilized. Thus, as a sub-database of H-InvDB, an integrated database of annotated human genes (http://h-invitational.jp/), we constructed a fully curated database of evolutionary features of human genes, called 'Evola'. In the process of the ortholog detection, computational analysis based on conserved genome synteny and transcript sequence similarity was followed by manual curation by researchers examining phylogenetic trees. In total, 18 968 human genes have orthologs among 11 vertebrates (chimpanzee, mouse, cow, chicken, zebrafish, etc.), either computationally detected or manually curated orthologs. Evola provides amino acid sequence alignments and phylogenetic trees of orthologs and homologs. In 'd(N)/d(S) view', natural selection on genes can be analyzed between human and other species. In 'Locus maps', all transcript variants and their exon/intron structures can be compared among orthologous gene loci. We expect the Evola to serve as a comprehensive and reliable database to be utilized in comparative analyses for obtaining new knowledge about human genes. Evola is available at http://www.h-invitational.jp/evola/.


Asunto(s)
Bases de Datos Genéticas , Genes , Genoma Humano , Filogenia , Animales , Biología Computacional , Genómica , Humanos , Internet , ARN Mensajero/química , Selección Genética , Alineación de Secuencia , Análisis de Secuencia de Proteína , Sintenía
13.
Nucleic Acids Res ; 36(Database issue): D1028-33, 2008 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-18089549

RESUMEN

The Rice Annotation Project Database (RAP-DB) was created to provide the genome sequence assembly of the International Rice Genome Sequencing Project (IRGSP), manually curated annotation of the sequence, and other genomics information that could be useful for comprehensive understanding of the rice biology. Since the last publication of the RAP-DB, the IRGSP genome has been revised and reassembled. In addition, a large number of rice-expressed sequence tags have been released, and functional genomics resources have been produced worldwide. Thus, we have thoroughly updated our genome annotation by manual curation of all the functional descriptions of rice genes. The latest version of the RAP-DB contains a variety of annotation data as follows: clone positions, structures and functions of 31 439 genes validated by cDNAs, RNA genes detected by massively parallel signature sequencing (MPSS) technology and sequence similarity, flanking sequences of mutant lines, transposable elements, etc. Other annotation data such as Gnomon can be displayed along with those of RAP for comparison. We have also developed a new keyword search system to allow the user to access useful information. The RAP-DB is available at: http://rapdb.dna.affrc.go.jp/ and http://rapdb.lab.nig.ac.jp/.


Asunto(s)
Bases de Datos de Ácidos Nucleicos , Genoma de Planta , Oryza/genética , Genes de Plantas , Genómica , Internet , MicroARNs/genética , ARN Interferente Pequeño/genética , Interfaz Usuario-Computador
14.
Nucleic Acids Res ; 36(Database issue): D793-9, 2008 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-18089548

RESUMEN

Here we report the new features and improvements in our latest release of the H-Invitational Database (H-InvDB; http://www.h-invitational.jp/), a comprehensive annotation resource for human genes and transcripts. H-InvDB, originally developed as an integrated database of the human transcriptome based on extensive annotation of large sets of full-length cDNA (FLcDNA) clones, now provides annotation for 120 558 human mRNAs extracted from the International Nucleotide Sequence Databases (INSD), in addition to 54 978 human FLcDNAs, in the latest release H-InvDB_4.6. We mapped those human transcripts onto the human genome sequences (NCBI build 36.1) and determined 34 699 human gene clusters, which could define 34 057 (98.1%) protein-coding and 642 (1.9%) non-protein-coding loci; 858 (2.5%) transcribed loci overlapped with predicted pseudogenes. For all these transcripts and genes, we provide comprehensive annotation including gene structures, gene functions, alternative splicing variants, functional non-protein-coding RNAs, functional domains, predicted sub cellular localizations, metabolic pathways, predictions of protein 3D structure, mapping of SNPs and microsatellite repeat motifs, co-localization with orphan diseases, gene expression profiles, orthologous genes, protein-protein interactions (PPI) and annotation for gene families. The current H-InvDB annotation resources consist of two main views: Transcript view and Locus view and eight sub-databases: the DiseaseInfo Viewer, H-ANGEL, the Clustering Viewer, G-integra, the TOPO Viewer, Evola, the PPI view and the Gene family/group.


Asunto(s)
Bases de Datos Genéticas , Genes , ARN Mensajero/química , Animales , Mapeo Cromosómico , ADN Complementario/química , Humanos , Internet , Proteínas/química , Proteínas/genética , Proteínas/metabolismo , ARN Mensajero/genética , Interfaz Usuario-Computador
15.
Nucleic Acids Res ; 34(Web Server issue): W345-9, 2006 Jul 01.
Artículo en Inglés | MEDLINE | ID: mdl-16845023

RESUMEN

Transcriptome Auto-annotation Conducting Tool (TACT) is a newly developed web-based automated tool for conducting functional annotation of transcripts by the integration of sequence similarity searches and functional motif predictions. We developed the TACT system by integrating two kinds of similarity searches, FASTY and BLASTX, against protein sequence databases, UniProtKB (Swiss-Prot/TrEMBL) and RefSeq, and a unified motif prediction program, InterProScan, into the ORF-prediction pipeline originally designed for the 'H-Invitational' human transcriptome annotation project. This system successively applies these constituent programs to an mRNA sequence in order to predict the most plausible ORF and the function of the protein encoded. In this study, we applied the TACT system to 19 574 non-redundant human transcripts registered in H-InvDB and evaluated its predictive power by the degree of agreement with human-curated functional annotation in H-InvDB. As a result, the TACT system could assign functional description to 12 559 transcripts (64.2%), the remainder being hypothetical proteins. Furthermore, the overall agreement of functional annotation with H-InvDB, including those transcripts annotated as hypothetical proteins, was 83.9% (16 432/19 574). These results show that the TACT system is useful for functional annotation and that the prediction of ORFs and protein functions is highly accurate and close to the results of human curation. TACT is freely available at http://www.jbirc.aist.go.jp/tact/.


Asunto(s)
ARN Mensajero/química , Análisis de Secuencia/métodos , Programas Informáticos , Secuencias de Aminoácidos , Biología Computacional/métodos , ADN Complementario/química , Bases de Datos de Proteínas , Etiquetas de Secuencia Expresada/química , Humanos , Internet , Sistemas de Lectura Abierta , Proteínas/genética , Proteínas/fisiología , Análisis de Secuencia de ADN , Análisis de Secuencia de ARN , Integración de Sistemas , Interfaz Usuario-Computador
16.
PLoS Biol ; 2(6): e162, 2004 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-15103394

RESUMEN

The human genome sequence defines our inherent biological potential; the realization of the biology encoded therein requires knowledge of the function of each gene. Currently, our knowledge in this area is still limited. Several lines of investigation have been used to elucidate the structure and function of the genes in the human genome. Even so, gene prediction remains a difficult task, as the varieties of transcripts of a gene may vary to a great extent. We thus performed an exhaustive integrative characterization of 41,118 full-length cDNAs that capture the gene transcripts as complete functional cassettes, providing an unequivocal report of structural and functional diversity at the gene level. Our international collaboration has validated 21,037 human gene candidates by analysis of high-quality full-length cDNA clones through curation using unified criteria. This led to the identification of 5,155 new gene candidates. It also manifested the most reliable way to control the quality of the cDNA clones. We have developed a human gene database, called the H-Invitational Database (H-InvDB; http://www.h-invitational.jp/). It provides the following: integrative annotation of human genes, description of gene structures, details of novel alternative splicing isoforms, non-protein-coding RNAs, functional domains, subcellular localizations, metabolic pathways, predictions of protein three-dimensional structure, mapping of known single nucleotide polymorphisms (SNPs), identification of polymorphic microsatellite repeats within human genes, and comparative results with mouse full-length cDNAs. The H-InvDB analysis has shown that up to 4% of the human genome sequence (National Center for Biotechnology Information build 34 assembly) may contain misassembled or missing regions. We found that 6.5% of the human gene candidates (1,377 loci) did not have a good protein-coding open reading frame, of which 296 loci are strong candidates for non-protein-coding RNA genes. In addition, among 72,027 uniquely mapped SNPs and insertions/deletions localized within human genes, 13,215 nonsynonymous SNPs, 315 nonsense SNPs, and 452 indels occurred in coding regions. Together with 25 polymorphic microsatellite repeats present in coding regions, they may alter protein structure, causing phenotypic effects or resulting in disease. The H-InvDB platform represents a substantial contribution to resources needed for the exploration of human biology and pathology.


Asunto(s)
Biología Computacional/métodos , ADN Complementario/genética , Bases de Datos Genéticas , Genes/fisiología , Genoma Humano , Empalme Alternativo/genética , Genes/genética , Humanos , Internet , Repeticiones de Microsatélite/genética , Sistemas de Lectura Abierta/genética , Polimorfismo Genético , Polimorfismo de Nucleótido Simple , Estructura Terciaria de Proteína
17.
Biochim Biophys Acta ; 1671(1-3): 44-50, 2004 Mar 17.
Artículo en Inglés | MEDLINE | ID: mdl-15026144

RESUMEN

Shiga toxin (Stx) has an A1-B5 subunit structure, and the A subunit is an RNA N-glycosidase that inhibits cellular protein synthesis. We previously reported that in Caco-2 cells Stx induced cytokines and that the RNA N-glycosidase activity was essential for the cytokine induction. It is known that the binding of the Stx-B subunit to its receptor glycolipid, Gb3, mediates an A subunit-independent signal in some types of cells, but the involvement of this signal in the cytokine induction is unclear. In this study, we investigated whether RNA N-glycosidase itself induces cytokines. IL-8 production was enhanced by Stx, ricin, and modeccin, three toxins that inhibit protein synthesis through an identical RNA N-glycosidase activity, but not by two other types of protein synthesis inhibitors, diphtheria toxin and cycloheximide. The RNA N-glycosidase-type toxins showed a similar induction pattern of cytokine mRNAs. Brefeldin A, a Golgi apparatus inhibitor, completely suppressed the cytokine induction by the toxins. Analysis by using inhibitors of toxin binding and also Stx-B subunit showed that the cytokine-inducing activity was independent of Gb3-mediated signaling. These results indicate that RNA N-glycosidase itself induces the cytokine production and that intracellular transport of toxins through the Golgi apparatus is essential for the activity.


Asunto(s)
Citocinas/metabolismo , Interleucina-8/metabolismo , N-Glicosil Hidrolasas/metabolismo , Lectinas de Plantas/metabolismo , Ricina/metabolismo , Toxina Shiga/metabolismo , Animales , Brefeldino A/metabolismo , Células CACO-2 , Chlorocebus aethiops , Glucolípidos/metabolismo , Humanos , Lactosa/metabolismo , Liposomas/metabolismo , N-Glicosil Hidrolasas/inmunología , Subunidades de Proteína/metabolismo , Inhibidores de la Síntesis de la Proteína/metabolismo , Proteínas Inactivadoras de Ribosomas , Proteínas Inactivadoras de Ribosomas Tipo 2 , Esfingolípidos/metabolismo , Células Vero
18.
Gene ; 364: 99-107, 2005 Dec 30.
Artículo en Inglés | MEDLINE | ID: mdl-16185827

RESUMEN

H-Invitational Database (H-InvDB; ) is a human transcriptome database, containing integrative annotation of 41,118 full-length cDNA clones originated from 21,037 loci. H-InvDB is a product of the H-Invitational project, an international collaboration to systematically and functionally validate human genes by analysis of a unique set of high quality full-length cDNA clones using automatic annotation and human curation under unified criteria. Here, 19,574 proteins encoded by these cDNAs were classified into 11,709 function-known and 7865 function-unknown hypothetical proteins by similarity with protein databases and motif prediction (InterProScan). The proportion of "hypothetical proteins" in H-InvDB was as high as 40.4%. In this study, we thus conducted data-mining in H-InvDB with the aim of assigning advanced functional annotations to those hypothetical proteins. First, by data-mining in the H-InvDB version of GTOP, we identified 337 SCOP domains within 7865 H-Inv hypothetical proteins. Second, by data-mining of predicted subcellular localization by SOSUI and TMHMM in H-InvDB, we found 1032 transmembrane proteins within H-Inv hypothetical proteins. These results clearly demonstrate that structural prediction is effective for functional annotation of proteins with unknown functions. All the data in H-InvDB are shown in two main views, the cDNA view and the Locus view, and five auxiliary databases with web-based viewers; DiseaseInfo Viewer, H-ANGEL, Clustering Viewer, G-integra and TOPO Viewer; the data also are provided as flat files and XML files. The data consists of descriptions of their gene structures, novel alternative splicing isoforms, functional RNAs, functional domains, subcellular localizations, metabolic pathways, predictions of protein 3D structure, mapping of SNPs and microsatellite repeat motifs in relation with orphan diseases, gene expression profiling, and comparisons with mouse full-length cDNAs in the context of molecular evolution. This unique integrative platform for conducting in silico data-mining represents a substantial contribution to resources required for the exploration of human biology and pathology.


Asunto(s)
Bases de Datos de Ácidos Nucleicos , Proteínas/genética , Transcripción Genética , Secuencia de Aminoácidos , Secuencia de Bases , ADN Complementario/genética , Genoma Humano , Humanos , Modelos Genéticos , Datos de Secuencia Molecular , Proteínas/química , Proteínas/metabolismo
20.
mBio ; 3(5): e00204-12, 2012.
Artículo en Inglés | MEDLINE | ID: mdl-22951932

RESUMEN

We sequenced the genome of Theileria orientalis, a tick-borne apicomplexan protozoan parasite of cattle. The focus of this study was a comparative genome analysis of T. orientalis relative to other highly pathogenic Theileria species, T. parva and T. annulata. T. parva and T. annulata induce transformation of infected cells of lymphocyte or macrophage/monocyte lineages; in contrast, T. orientalis does not induce uncontrolled proliferation of infected leukocytes and multiplies predominantly within infected erythrocytes. While synteny across homologous chromosomes of the three Theileria species was found to be well conserved overall, subtelomeric structures were found to differ substantially, as T. orientalis lacks the large tandemly arrayed subtelomere-encoded variable secreted protein-encoding gene family. Moreover, expansion of particular gene families by gene duplication was found in the genomes of the two transforming Theileria species, most notably, the TashAT/TpHN and Tar/Tpr gene families. Gene families that are present only in T. parva and T. annulata and not in T. orientalis, Babesia bovis, or Plasmodium were also identified. Identification of differences between the genome sequences of Theileria species with different abilities to transform and immortalize bovine leukocytes will provide insight into proteins and mechanisms that have evolved to induce and regulate this process. The T. orientalis genome database is available at http://totdb.czc.hokudai.ac.jp/.


Asunto(s)
Genoma de Protozoos , Theileria/genética , Theileria/patogenicidad , Factores de Virulencia/genética , Animales , Bovinos , Proliferación Celular , Leucocitos/parasitología , Sintenía
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA