Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 17 de 17
Filter
Add more filters










Publication year range
1.
Nucleic Acids Res ; 52(D1): D67-D71, 2024 Jan 05.
Article in English | MEDLINE | ID: mdl-37971299

ABSTRACT

The Bioinformation and DNA Data Bank of Japan (DDBJ) Center (https://www.ddbj.nig.ac.jp) provides database archives that cover a wide range of fields in life sciences. As a founding member of the International Nucleotide Sequence Database Collaboration (INSDC), DDBJ accepts and distributes nucleotide sequence data as well as their study and sample information along with the National Center for Biotechnology Information in the United States and the European Bioinformatics Institute (EBI). Besides INSDC databases, the DDBJ Center provides databases for functional genomics (GEA: Genomic Expression Archive), metabolomics (MetaboBank) and human genetic and phenotypic data (JGA: Japanese Genotype-phenotype Archive). These database systems have been built on the National Institute of Genetics (NIG) supercomputer, which is also open for domestic life science researchers to analyze large-scale sequence data. This paper reports recent updates on the archival databases and the services of the DDBJ Center, highlighting the newly redesigned MetaboBank. MetaboBank uses BioProject and BioSample in its metadata description making it suitable for multi-omics large studies. Its collaboration with MetaboLights at EBI brings synergy in locating and reusing public data.


Subject(s)
Databases, Nucleic Acid , Metabolomics , Metadata , Humans , Computational Biology , Genomics , Internet , Japan , Multiomics/methods
2.
Nucleic Acids Res ; 51(D1): D101-D105, 2023 01 06.
Article in English | MEDLINE | ID: mdl-36420889

ABSTRACT

The Bioinformation and DNA Data Bank of Japan (DDBJ) Center (https://www.ddbj.nig.ac.jp) maintains database archives that cover a wide range of fields in life sciences. As a founding member of the International Nucleotide Sequence Database Collaboration (INSDC), our primary mission is to collect and distribute nucleotide sequence data, as well as their study and sample information, in collaboration with the National Center for Biotechnology Information in the United States and the European Bioinformatics Institute. In addition to INSDC resources, the Center operates databases for functional genomics (GEA: Genomic Expression Archive), metabolomics (MetaboBank), and human genetic and phenotypic data (JGA: Japanese Genotype-Phenotype Archive). These databases are built on the supercomputer of the National Institute of Genetics, whose remaining computational capacity is actively utilized by domestic researchers for large-scale biological data analyses. Here, we report our recent updates and the activities of our services.


Subject(s)
Databases, Nucleic Acid , Genomics , Humans , United States , Computational Biology , Computers , Base Sequence , Japan , Internet
3.
Nucleic Acids Res ; 50(D1): D102-D105, 2022 01 07.
Article in English | MEDLINE | ID: mdl-34751405

ABSTRACT

The Bioinformation and DDBJ (DNA Data Bank of Japan) Center (DDBJ Center; https://www.ddbj.nig.ac.jp) operates archival databases that collect nucleotide sequences, study and sample information, and distribute them without access restriction to progress life science research as a member of the International Nucleotide Sequence Database Collaboration (INSDC), in collaboration with the National Center for Biotechnology Information (NCBI) and the European Bioinformatics Institute. Besides the INSDC databases, the DDBJ Center also provides the Genomic Expression Archive for functional genomics data and the Japanese Genotype-phenotype Archive for human data requiring controlled access. Additionally, the DDBJ Center started a new public repository, MetaboBank, for experimental raw data and metadata from metabolomics research in October 2020. In response to the COVID-19 pandemic, the DDBJ Center openly shares SARS-CoV-2 genome sequences in collaboration with Shizuoka Prefecture and Keio University. The operation of DDBJ is based on the National Institute of Genetics (NIG) supercomputer, which is open for large-scale sequence data analysis for life science researchers. This paper reports recent updates on the archival databases and the services of DDBJ.


Subject(s)
Databases, Genetic , Databases, Nucleic Acid , Genome, Microbial , Japan , Metabolomics , SARS-CoV-2/genetics , Transcriptome
4.
Nucleic Acids Res ; 48(D1): D45-D50, 2020 01 08.
Article in English | MEDLINE | ID: mdl-31724722

ABSTRACT

The Bioinformation and DDBJ Center (https://www.ddbj.nig.ac.jp) in the National Institute of Genetics (NIG) maintains a primary nucleotide sequence database as a member of the International Nucleotide Sequence Database Collaboration (INSDC) in partnership with the US National Center for Biotechnology Information and the European Bioinformatics Institute. The NIG operates the NIG supercomputer as a computational basis for the construction of DDBJ databases and as a large-scale computational resource for Japanese biologists and medical researchers. In order to accommodate the rapidly growing amount of deoxyribonucleic acid (DNA) nucleotide sequence data, NIG replaced its supercomputer system, which is designed for big data analysis of genome data, in early 2019. The new system is equipped with 30 PB of DNA data archiving storage; large-scale parallel distributed file systems (13.8 PB in total) and 1.1 PFLOPS computation nodes and graphics processing units (GPUs). Moreover, as a starting point of developing multi-cloud infrastructure of bioinformatics, we have also installed an automatic file transfer system that allows users to prevent data lock-in and to achieve cost/performance balance by exploiting the most suitable environment from among the supercomputer and public clouds for different workloads.


Subject(s)
Computational Biology/methods , Databases, Nucleic Acid , Genomics/methods , Software , Web Browser , Japan , Software Design
5.
Nucleic Acids Res ; 47(D1): D69-D73, 2019 01 08.
Article in English | MEDLINE | ID: mdl-30357349

ABSTRACT

The Genomic Expression Archive (GEA) for functional genomics data from microarray and high-throughput sequencing experiments has been established at the DNA Data Bank of Japan (DDBJ) Center (https://www.ddbj.nig.ac.jp), which is a member of the International Nucleotide Sequence Database Collaboration (INSDC) with the US National Center for Biotechnology Information and the European Bioinformatics Institute. The DDBJ Center collects nucleotide sequence data and associated biological information from researchers and also services the Japanese Genotype-phenotype Archive (JGA) with the National Bioscience Database Center for collecting human data. To automate the submission process, we have implemented the DDBJ BioSample validator which checks submitted records, auto-corrects their format, and issues error messages and warnings if necessary. The DDBJ Center also operates the NIG supercomputer, prepared for analyzing large-scale genome sequences. We now offer a secure platform specifically to handle personal human genomes. This report describes database activities for INSDC and JGA over the past year, the newly launched GEA, submission, retrieval, and analysis services available in our supercomputer system and their recent developments.


Subject(s)
Computational Biology/methods , Databases, Genetic , Gene Expression Regulation , Genomics , Genomics/methods , Humans , Software , User-Computer Interface , Web Browser , Workflow
6.
Nucleic Acids Res ; 46(D1): D30-D35, 2018 01 04.
Article in English | MEDLINE | ID: mdl-29040613

ABSTRACT

The DNA Data Bank of Japan (DDBJ) Center (http://www.ddbj.nig.ac.jp) has been providing public data services for 30 years since 1987. We are collecting nucleotide sequence data and associated biological information from researchers as a member of the International Nucleotide Sequence Database Collaboration (INSDC), in collaboration with the US National Center for Biotechnology Information and the European Bioinformatics Institute. The DDBJ Center also services the Japanese Genotype-phenotype Archive (JGA) with the National Bioscience Database Center to collect genotype and phenotype data of human individuals. Here, we outline our database activities for INSDC and JGA over the past year, and introduce submission, retrieval and analysis services running on our supercomputer system and their recent developments. Furthermore, we highlight our responses to the amended Japanese rules for the protection of personal information and the launch of the DDBJ Group Cloud service for sharing pre-publication data among research groups.


Subject(s)
Databases, Nucleic Acid , Academies and Institutes , Cloud Computing , Computational Biology , Confidentiality/legislation & jurisprudence , Databases, Nucleic Acid/history , Databases, Nucleic Acid/trends , Europe , Genetic Association Studies , History, 20th Century , History, 21st Century , Humans , Information Storage and Retrieval , International Cooperation , Japan , National Library of Medicine (U.S.) , United States
7.
Nucleic Acids Res ; 44(D1): D51-7, 2016 Jan 04.
Article in English | MEDLINE | ID: mdl-26578571

ABSTRACT

The DNA Data Bank of Japan Center (DDBJ Center; http://www.ddbj.nig.ac.jp) maintains and provides public archival, retrieval and analytical services for biological information. The contents of the DDBJ databases are shared with the US National Center for Biotechnology Information (NCBI) and the European Bioinformatics Institute (EBI) within the framework of the International Nucleotide Sequence Database Collaboration (INSDC). Since 2013, the DDBJ Center has been operating the Japanese Genotype-phenotype Archive (JGA) in collaboration with the National Bioscience Database Center (NBDC) in Japan. In addition, the DDBJ Center develops semantic web technologies for data integration and sharing in collaboration with the Database Center for Life Science (DBCLS) in Japan. This paper briefly reports on the activities of the DDBJ Center over the past year including submissions to databases and improvements in our services for data retrieval, analysis, and integration.


Subject(s)
Databases, Nucleic Acid , Sequence Analysis, DNA , Biological Ontologies , Computers , Genotype , Phenotype
8.
Genome Announc ; 3(3)2015 May 21.
Article in English | MEDLINE | ID: mdl-25999558

ABSTRACT

Pseudomonas aeruginosa shows multidrug resistance, which is mainly attributable to its expression of xenobiotic efflux pumps. However, it is unclear how silent pumps are expressed in clinical isolates. Here, we sequenced the complete genome of P. aeruginosa strain 8380, which was isolated from a human gut.

9.
Nucleic Acids Res ; 43(Database issue): D18-22, 2015 Jan.
Article in English | MEDLINE | ID: mdl-25477381

ABSTRACT

The DNA Data Bank of Japan Center (DDBJ Center; http://www.ddbj.nig.ac.jp) maintains and provides public archival, retrieval and analytical services for biological information. Since October 2013, DDBJ Center has operated the Japanese Genotype-phenotype Archive (JGA) in collaboration with our partner institute, the National Bioscience Database Center (NBDC) of the Japan Science and Technology Agency. DDBJ Center provides the JGA database system which securely stores genotype and phenotype data collected from individuals whose consent agreements authorize data release only for specific research use. NBDC has established guidelines and policies for sharing human-derived data and reviews data submission and usage requests from researchers. In addition to the JGA project, DDBJ Center develops Semantic Web technologies for data integration and sharing in collaboration with the Database Center for Life Science. This paper describes the overview of the JGA project, updates to the DDBJ databases, and services for data retrieval, analysis and integration.


Subject(s)
Databases, Nucleic Acid , Genotype , Phenotype , Genetic Association Studies , Humans , Internet , Sequence Analysis, DNA
10.
Nucleic Acids Res ; 42(Database issue): D44-9, 2014 Jan.
Article in English | MEDLINE | ID: mdl-24194602

ABSTRACT

The DNA Data Bank of Japan (DDBJ; http://www.ddbj.nig.ac.jp) maintains and provides archival, retrieval and analytical resources for biological information. This database content is shared with the US National Center for Biotechnology Information (NCBI) and the European Bioinformatics Institute (EBI) within the framework of the International Nucleotide Sequence Database Collaboration (INSDC). DDBJ launched a new nucleotide sequence submission system for receiving traditional nucleotide sequence. We expect that the new submission system will be useful for many submitters to input accurate annotation and reduce the time needed for data input. In addition, DDBJ has started a new service, the Japanese Genotype-phenotype Archive (JGA), with our partner institute, the National Bioscience Database Center (NBDC). JGA permanently archives and shares all types of individual human genetic and phenotypic data. We also introduce improvements in the DDBJ services and databases made during the past year.


Subject(s)
Base Sequence , Databases, Nucleic Acid , Molecular Sequence Annotation , Genomics , Genotype , High-Throughput Nucleotide Sequencing , Humans , Internet , Phenotype
11.
mBio ; 3(5): e00204-12, 2012.
Article in English | MEDLINE | ID: mdl-22951932

ABSTRACT

We sequenced the genome of Theileria orientalis, a tick-borne apicomplexan protozoan parasite of cattle. The focus of this study was a comparative genome analysis of T. orientalis relative to other highly pathogenic Theileria species, T. parva and T. annulata. T. parva and T. annulata induce transformation of infected cells of lymphocyte or macrophage/monocyte lineages; in contrast, T. orientalis does not induce uncontrolled proliferation of infected leukocytes and multiplies predominantly within infected erythrocytes. While synteny across homologous chromosomes of the three Theileria species was found to be well conserved overall, subtelomeric structures were found to differ substantially, as T. orientalis lacks the large tandemly arrayed subtelomere-encoded variable secreted protein-encoding gene family. Moreover, expansion of particular gene families by gene duplication was found in the genomes of the two transforming Theileria species, most notably, the TashAT/TpHN and Tar/Tpr gene families. Gene families that are present only in T. parva and T. annulata and not in T. orientalis, Babesia bovis, or Plasmodium were also identified. Identification of differences between the genome sequences of Theileria species with different abilities to transform and immortalize bovine leukocytes will provide insight into proteins and mechanisms that have evolved to induce and regulate this process. The T. orientalis genome database is available at http://totdb.czc.hokudai.ac.jp/.


Subject(s)
Genome, Protozoan , Theileria/genetics , Theileria/pathogenicity , Virulence Factors/genetics , Animals , Cattle , Cell Proliferation , Leukocytes/parasitology , Synteny
12.
Nucleic Acids Res ; 39(Database issue): D22-7, 2011 Jan.
Article in English | MEDLINE | ID: mdl-21062814

ABSTRACT

The DNA Data Bank of Japan (DDBJ, http://www.ddbj.nig.ac.jp) provides a nucleotide sequence archive database and accompanying database tools for sequence submission, entry retrieval and annotation analysis. The DDBJ collected and released 3,637,446 entries/2,272,231,889 bases between July 2009 and June 2010. A highlight of the released data was archive datasets from next-generation sequencing reads of Japanese rice cultivar, Koshihikari submitted by the National Institute of Agrobiological Sciences. In this period, we started a new archive for quantitative genomics data, the DDBJ Omics aRchive (DOR). The DOR stores quantitative data both from the microarray and high-throughput new sequencing platforms. Moreover, we improved the content of the DDBJ patent sequence, released a new submission tool of the DDBJ Sequence Read Archive (DRA) which archives massive raw sequencing reads, and enhanced a cloud computing-based analytical system from sequencing reads, the DDBJ Read Annotation Pipeline. In this article, we describe these new functions of the DDBJ databases and support tools.


Subject(s)
Databases, Nucleic Acid , Amino Acid Sequence , Databases, Protein , Genomics , Molecular Sequence Annotation , Patents as Topic , Software
13.
DNA Res ; 13(6): 245-54, 2006 Dec 31.
Article in English | MEDLINE | ID: mdl-17166861

ABSTRACT

A large number of complete microorganism genomes has been sequenced and submitted to the public database and then incorporated into our complete genome database, Genome Information Broker (GIB, http://gib.genes.nig.ac.jp/). However, when comparative genomics is carried out, researchers must be aware that there are protein-coding genes not confirmed by homology or motif search and that reliable protein-coding genes are missing. Therefore, we developed a protocol (Gene Trek in Prokaryote Space, GTPS) for finding possible protein-coding genes in bacterial genomes. GTPS assigns a degree of reliability to predicted protein-coding genes. We first systematically applied the protocol to the complete genomes of all 123 bacterial species and strains that were publicly available as of July 2003, and then to those of 183 species and strains available as of September 2004. We found a number of incorrect genes and several new ones in the genome data in question. We also found a way to estimate the total number of orthologous genes in the bacterial world.


Subject(s)
Bacteria/classification , Genes, Bacterial , Genetics, Microbial , Genome, Bacterial , Bacteria/genetics , Computational Biology , DNA, Bacterial/genetics , Database Management Systems , Open Reading Frames , Prokaryotic Cells
14.
OMICS ; 10(2): 105-13, 2006.
Article in English | MEDLINE | ID: mdl-16901214

ABSTRACT

The Third Party Annotation (TPA) project collects and presents high-quality annotation of nucleotide sequence. Annotation is submitted by researchers who have not themselves generated novel nucleotide sequence. In its first few years, the resource has proven to be popular with submitters from a range of biological research areas. Central to the project is the requirement for high-quality data, resulting from experimental and inferred analysis discussed in peer-reviewed publications. The data are divided into two tiers: those with experimental evidence and those with inferential evidence. Standards for TPA are detailed and illustrated with the aid of case studies.


Subject(s)
Databases, Nucleic Acid/standards , Genomics/standards , Animals , Data Collection/standards , Humans
15.
Nucleic Acids Res ; 34(1): 1-9, 2006.
Article in English | MEDLINE | ID: mdl-16397293

ABSTRACT

The goal of this group project has been to coordinate and bring up-to-date information on all genes of Escherichia coli K-12. Annotation of the genome of an organism entails identification of genes, the boundaries of genes in terms of precise start and end sites, and description of the gene products. Known and predicted functions were assigned to each gene product on the basis of experimental evidence or sequence analysis. Since both kinds of evidence are constantly expanding, no annotation is complete at any moment in time. This is a snapshot analysis based on the most recent genome sequences of two E.coli K-12 bacteria. An accurate and up-to-date description of E.coli K-12 genes is of particular importance to the scientific community because experimentally determined properties of its gene products provide fundamental information for annotation of innumerable genes of other organisms. Availability of the complete genome sequence of two K-12 strains allows comparison of their genotypes and mutant status of alleles.


Subject(s)
Escherichia coli K12/genetics , Escherichia coli Proteins/genetics , Genome, Bacterial , Congresses as Topic , Cooperative Behavior , Genomics , Terminology as Topic
16.
Plasmid ; 51(3): 227-37, 2004 May.
Article in English | MEDLINE | ID: mdl-15109829

ABSTRACT

The complete nucleotide sequence of pTT8, a cryptic plasmid from Thermus thermophilus HB8, was determined. pTT8 was 9328bp long and its G+C content was 69%. pTT8 contained eight putative open reading frames, three of which showed extensive similarities to the plasmid addiction proteins PasA and PasB of pTC-F14 and pAM10.6, and the RepA protein of the ColE2-related plasmids, respectively. During the analysis of pTT8-based plasmid pPP442, which had been obtained during a promoter-screening experiment, we occasionally isolated a plasmid with a relatively high-copy-number. This plasmid, pPP442m, contained a 1025 bp fragment derived from the genome of the HB27 host strain immediately upstream of the putative repA gene. Using the ori region of pPP442m, we constructed an expression vector, pTEV131m, with an estimated high-copy-number of 30-40. This plasmid was stably maintained in T. thermophilus HB27 under nonselective conditions for at least 100 generations. Cloning of the alpha-amylase gene of Bacillus stearothermophilus DY-5 into pTEV131m gave more than twofold production of the enzyme compared with pTEV131, the parental plasmid.


Subject(s)
Gene Expression , Genetic Vectors/genetics , Plasmids/genetics , Thermus thermophilus/genetics , Amino Acid Sequence , Base Composition , Base Sequence , Chromosome Mapping , Molecular Sequence Data , Mutation/genetics , Plasmids/isolation & purification , Sequence Analysis, DNA , Sequence Homology , alpha-Amylases
17.
FEMS Microbiol Lett ; 233(2): 315-24, 2004 Apr 15.
Article in English | MEDLINE | ID: mdl-15063502

ABSTRACT

An in vivo disruption-integration vector system for Thermus thermophilus was developed and used for the functional analysis of an evolutionary-related archaeal protein for lysine biosynthesis. In contrast to fungal one, the putative homoaconitase of T. thermophilus consists of two subunits and catalyzes the second and third steps of lysine biosynthesis. ORFs from hyperthermophilic archaeon Pyrococcus horikoshii, PH1726 and PH1724, share a high degree of amino acid identity with the T. thermophilus subunits LysT and LysU, respectively. In the present report, gene encoding the putative small subunit of archaeal homoaconitase, PH1724, was integrated into the lysU locus of T. thermophilus. The archaeal gene was expressed under the control of PslpA promoter and functional analyses were performed. Transformants were able to grow on minimal medium without lysine when PH1724 ORF was integrated, whereas the lysU disruption led to lysine auxotrophy. Chromosomal integration was verified by PCR analysis, and homoaconitase assay showed that the archaeal gene product functions as a small subunit of homoaconitase, possibly by forming a heterodimer with the LysT subunit of T. thermophilus. These results strongly suggest the functional relation of P. horikoshii PH1724 with LysU in the Thermus lysine biosynthetic pathway, together with functional assignment of LysU as small subunit of homoaconitase. In addition, the provided results indicate that archaeal genes products from hyperthermophiles can be studied in a thermophilic eubacterium such as T. thermophilus.


Subject(s)
Hydro-Lyases/genetics , Hydro-Lyases/metabolism , Lysine/biosynthesis , Pyrococcus horikoshii/enzymology , Thermus thermophilus/enzymology , Amino Acid Sequence , Citric Acid Cycle/physiology , Gene Expression Regulation, Archaeal , Gene Expression Regulation, Enzymologic , Hydro-Lyases/chemistry , Leucine/biosynthesis , Molecular Sequence Data , Protein Structure, Tertiary , Pyrococcus horikoshii/genetics , Thermus thermophilus/genetics
SELECTION OF CITATIONS
SEARCH DETAIL
...