Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 21
Filter
1.
Res Sq ; 2024 Jul 02.
Article in English | MEDLINE | ID: mdl-39011113

ABSTRACT

Cerebral small vessel disease (cSVD) is a leading cause of stroke and dementia with no specific mechanism-based treatment. We used Mendelian randomization to combine a unique cerebrospinal fluid (CSF) and plasma pQTL resource with the latest European-ancestry GWAS of MRI-markers of cSVD (white matter hyperintensities, perivascular spaces). We describe a new biological fingerprint of 49 protein-cSVD associations, predominantly in the CSF. We implemented a multipronged follow-up, across fluids, platforms, and ancestries (Europeans and East-Asian), including testing associations of direct plasma protein measurements with MRI-cSVD. We highlight 16 proteins robustly associated in both CSF and plasma, with 24/4 proteins identified in CSF/plasma only. cSVD-proteins were enriched in extracellular matrix and immune response pathways, and in genes enriched in microglia and specific microglial states (integration with single-nucleus RNA sequencing). Immune-related proteins were associated with MRI-cSVD already at age twenty. Half of cSVD-proteins were associated with stroke, dementia, or both, and seven cSVD-proteins are targets for known drugs (used for other indications in directions compatible with beneficial therapeutic effects. This first cSVD proteogenomic signature opens new avenues for biomarker and therapeutic developments.

2.
J Hum Genet ; 2024 Jul 16.
Article in English | MEDLINE | ID: mdl-39014190

ABSTRACT

Genome-wide association studies have enabled the identification of important genetic factors in many trait studies. However, only a fraction of the heritability can be explained by known genetic factors, even in the most common diseases. Genetic loci combinations, or epistatic contributions expressed by combinations of single nucleotide polymorphisms (SNPs), have been argued to be one of the critical factors explaining some of the missing heritability, especially in oligogenic/polygenic diseases. Rheumatoid arthritis (RA) is a complex disease with more than 100 reported SNP associations, as well as various HLA haplotypes and amino acids; however, many associations between RA and inter-chromosomal SNP combinations are unknown. To discover novel associations of epistatic interactions with high odds ratios in RA, we applied the LAMPLINK method, a systematic enumerative procedure for identifying high-order SNP combinations, to a Japanese RA cohort (discovery cohort; 4024 patients with RA and 7731 controls). We validated the identified associations in a different Japanese cohort (validation cohort; 810 RA patients and 6303 controls). In this study, we identified 90 significant genetic associations in the discovery cohort. Among these, 74 (82.2%) associations were replicated in the validation cohort, and eight combinations were inter-chromosomal, all of which comprised rs7765379 or rs35265698 located in the HLA region. These two SNPs exhibited strong correlations with valine at amino acid position 11 in HLA-DRB1 (HLA-DRB1-11-Val). Finally, we discovered that rs9624 showed an association with RA through an epistatic interaction with HLA-DRB1-11-Val. Overall, LAMPLINK showed high reliability for identifying epistatic genetic contributions hidden in complex traits.

3.
Methods Mol Biol ; 2809: 67-75, 2024.
Article in English | MEDLINE | ID: mdl-38907890

ABSTRACT

HLA allele information is essential for a variety of medical applications, such as genomic studies of multifactorial diseases, including immune system and inflammation-related disorders, and donor selection in organ transplantation and regenerative medicine. To obtain this information, an accurate HLA typing method that is applicable for any allele registered in HLA allele databases is needed. Here we describe a method-called HLA-HD-for determining alleles from a current HLA database using next-generation sequencing (NGS) results.


Subject(s)
Alleles , HLA Antigens , High-Throughput Nucleotide Sequencing , Histocompatibility Testing , Humans , High-Throughput Nucleotide Sequencing/methods , Histocompatibility Testing/methods , HLA Antigens/genetics , Databases, Genetic , Software , Computational Biology/methods , Sequence Analysis, DNA/methods
4.
STAR Protoc ; 5(2): 103052, 2024 Jun 21.
Article in English | MEDLINE | ID: mdl-38700977

ABSTRACT

Genetic variations influence the levels of blood metabolites. We present analytical pipelines for assessing genetic influences on human blood metabolites. We describe steps for the normalization of metabolome data, genome-wide association studies, and the identification of metabolite quantitative trait loci (mQTLs). We then detail procedures for functional enrichment analysis of mQTLs. This protocol could be applicable to other quantitative traits, such as clinical measurements or proteome data. For complete details on the use and execution of this protocol, please refer to Iwasaki et al.1.


Subject(s)
Genome-Wide Association Study , Metabolome , Quantitative Trait Loci , Humans , Genome-Wide Association Study/methods , Quantitative Trait Loci/genetics , Metabolome/genetics , Metabolomics/methods , Polymorphism, Single Nucleotide/genetics
5.
Sci Rep ; 13(1): 13926, 2023 08 25.
Article in English | MEDLINE | ID: mdl-37626071

ABSTRACT

Gut-microbiota derived metabolites are important regulators of host biology and metabolism. To understand the impacts of the microbial metabolite 4-cresol sulfate (4-CS) on four chronic diseases [type 2 diabetes mellitus, metabolic syndrome (MetS), non-alcoholic fatty liver disease, and chronic kidney disease (CKD)], we conducted association analyses of plasma 4-CS quantified by liquid chromatography coupled to mass spectrometry (LC-MS) in 3641 participants of the Nagahama study. Our results validated the elevation of 4-CS in CKD and identified a reducing trend in MetS. To delineate the holistic effects of 4-CS, we performed a phenome-wide association analysis (PheWAS) with 937 intermediate biological and behavioral traits. We detected associations between 4-CS and 39 phenotypes related to blood pressure regulation, hepatic and renal functions, hematology, sleep quality, intraocular pressure, ion regulation, ketone and fatty acid metabolisms, disease history and dietary habits. Among them, 19 PheWAS significant traits, including fatty acids and 14 blood pressure indices, were correlated with MetS, suggesting that 4-CS is a potential biomarker for MetS. Consistent associations of this gut microbial-derived metabolite on multiple endophenotypes underlying distinct etiopathogenesis support its role in the overall host health, with prospects of probiotic-based therapeutic solutions in chronic diseases.


Subject(s)
Diabetes Mellitus, Type 2 , Humans , Sulfuric Acid Esters , Phenomics , Endophenotypes
6.
Arthritis Res Ther ; 25(1): 103, 2023 06 12.
Article in English | MEDLINE | ID: mdl-37309008

ABSTRACT

BACKGROUND: Polygenic risk score (PRS) analysis is used to predict disease risk. Although PRS has been shown to have great potential in improving clinical care, PRS accuracy assessment has been mainly focused on European ancestry. This study aimed to develop an accurate genetic risk score for knee osteoarthritis (OA) using a multi-population PRS and leveraging a multi-trait PRS in the Japanese population. METHODS: We calculated PRS using PRS-CS-auto, derived from genome-wide association study (GWAS) summary statistics for knee OA in the Japanese population (same ancestry) and multi-population. We further identified risk factor traits for which PRS could predict knee OA and subsequently developed an integrated PRS based on multi-trait analysis of GWAS (MTAG), including genetically correlated risk traits. PRS performance was evaluated in participants of the Nagahama cohort study who underwent radiographic evaluation of the knees (n = 3,279). PRSs were incorporated into knee OA integrated risk models along with clinical risk factors. RESULTS: A total of 2,852 genotyped individuals were included in the PRS analysis. The PRS based on Japanese knee OA GWAS was not associated with knee OA (p = 0.228). In contrast, PRS based on multi-population knee OA GWAS showed a significant association with knee OA (p = 6.7 × 10-5, odds ratio (OR) per standard deviation = 1.19), whereas PRS based on MTAG of multi-population knee OA, along with risk factor traits such as body mass index GWAS, displayed an even stronger association with knee OA (p = 5.4 × 10-7, OR = 1.24). Incorporating this PRS into traditional risk factors improved the predictive ability of knee OA (area under the curve, 74.4% to 74.7%; p = 0.029). CONCLUSIONS: This study showed that multi-trait PRS based on MTAG, combined with traditional risk factors, and using large sample size multi-population GWAS, significantly improved predictive accuracy for knee OA in the Japanese population, even when the sample size of GWAS of the same ancestry was small. To the best of our knowledge, this is the first study to show a statistically significant association between the PRS and knee OA in a non-European population. TRIAL REGISTRATION: No. C278.


Subject(s)
Genome-Wide Association Study , Osteoarthritis, Knee , Humans , Cohort Studies , Risk Factors , Risk Assessment
7.
Front Immunol ; 14: 1124417, 2023.
Article in English | MEDLINE | ID: mdl-36969256

ABSTRACT

Introduction: To allow the identification of IgG4-related disease (IgG4-RD) from a subclinical phase as it is important to understand the risk of elevated serum IgG4 levels. We planned to evaluate serum IgG4 levels in the participants of the Nagasaki Islands Study (NaIS), a large-scale health checkup cohort study. Methods: This study included 3,240 individuals who participated in the NaIS between 2016 and 2018 and consented to participate in the study. Serum IgG4, IgG, and IgE levels and human leukocyte antigen (HLA) genotyping results of the NaIS subjects as well as lifestyle habits and peripheral blood test results were analyzed. The magnetic bead panel assay (MBA) and the standard nephelometry immunoassay (NIA) were used to measure serum IgG4 levels. The data were evaluated using multivariate analysis to identify lifestyle and genetic factors associated with elevated serum IgG4 levels. Results: Serum IgG4 levels measured with the NIA and MBA showed a tight positive correlation between the two groups (correlation coefficient 0.942). The median age of the participants in the NaIS was 69 years [63-77]. The median serum IgG4 level was 30.2 mg/dL [IQR 12.5-59.8]. Overall, 1019 (32.1%) patients had a history of smoking. When the subjects were stratified into three groups based on the smoking intensity (pack-year), the serum IgG4 level was significantly higher among those with a higher smoking intensity. Accordingly, the multivariate analysis identified a significant relationship between smoking status and serum IgG4 elevation. Conclusion: In this study, smoking was identified as a lifestyle factor correlating positively with elevated serum IgG4 levels.


Subject(s)
Immunoglobulin G , Humans , Aged , Cohort Studies , Risk Factors
8.
iScience ; 26(1): 105738, 2023 Jan 20.
Article in English | MEDLINE | ID: mdl-36582826

ABSTRACT

An increase in ethnic diversity in genetic studies has the potential to provide unprecedented insights into how genetic variations influence human phenotypes. In this study, we conducted a quantitative trait locus (QTL) analysis of 121 metabolites measured using gas chromatography-mass spectrometry with plasma samples from 4,888 Japanese individuals. We found 60 metabolite-gene associations, of which 13 have not been previously reported. Meta-analyses with another Japanese and a European study identified six and two additional unreported loci, respectively. Genetic variants influencing metabolite levels were more enriched in protein-coding regions than in the regulatory regions while being associated with the risk of various diseases. Finally, we identified a signature of strong negative selection for uric acid ( S ˆ  = -1.53, p = 6.2 × 10-18). Our study expanded the knowledge of genetic influences on human blood metabolites, providing valuable insights into their physiological, pathological, and selective properties.

9.
Mod Rheumatol ; 33(2): 373-380, 2023 Mar 02.
Article in English | MEDLINE | ID: mdl-35353896

ABSTRACT

OBJECTIVES: This study aimed to evaluate the changes in knee pain, a dominant cause of physical disability, following the coronavirus disease (COVID-19) pandemic, and to identify factors affecting the changes in knee pain. METHODS: We analysed the pre- and post-COVID-19 longitudinal data set of the Nagahama Study. Knee pain was assessed using the Knee Society Score (KSS). The estimated KSS from the age and sex using regression model in the pre- and post-COVID-19 data set was compared. Factors including the activity score, educational level, and various impacts of COVID-19 were analysed for correlation analyses with changes in KSS. RESULTS: Data collected from 6409 participants showed statistically significant differences in KSS, pre- (mean = 22.0; SD = 4.4) and post-COVID-19 (mean = 19.5; SD = 6.4). Low activity score (p = .008), low educational level (p < .001), and undesirable financial impact (p = .030) were independently associated with knee pain exacerbation. CONCLUSION: The harmful effects of the COVID-19 pandemic on knee pain were suggested. People should be encouraged to engage in physical activities, such as walking, despite the state of emergency. Furthermore, social support for economically disadvantaged groups may improve healthcare access, preventing the acute exacerbations of knee pain.


Subject(s)
Arthroplasty, Replacement, Knee , COVID-19 , Osteoarthritis, Knee , Humans , Pandemics , Arthroplasty, Replacement, Knee/adverse effects , Osteoarthritis, Knee/surgery , Longitudinal Studies , COVID-19/epidemiology , Pain/epidemiology , Pain/etiology
10.
Mod Rheumatol ; 33(3): 608-616, 2023 Apr 13.
Article in English | MEDLINE | ID: mdl-35554562

ABSTRACT

OBJECTIVES: The concept of locomotive syndrome was proposed to highlight older adults who require nursing care services due to the malfunctioning of their locomotive organs. With the coming of a super-ageing society, there is a growing need to understand the relation between systemic chronic diseases and locomotive syndrome. METHODS: We analysed the second-visit dataset of the Nagahama Study. The association analysis was performed to identify the chronic diseases that were risk factors associated with the occurrence and the progression of locomotive syndrome in both the cross-sectional and longitudinal studies. RESULTS: Hypertension, stroke, coronary heart disease, rheumatoid arthritis, chronic renal failure, osteoporosis, anaemia, and gastroesophageal reflux disease were independently correlated with locomotive syndrome through the deterioration of body pain, social activity, and cognitive function in the cross-sectional study. Multiple chronic diseases had additive effects and significantly increased the risk of locomotive syndrome. In the longitudinal study, osteoporosis and kidney disease were significantly correlated with the worsening of the total GLFS-25 score. CONCLUSIONS: Locomotive syndrome coexisted with various systemic chronic diseases, especially cardiovascular diseases. Osteoporosis and kidney disease were significantly correlated with the progression of locomotive dysfunction. The management of various chronic diseases may be useful to prevent locomotive syndrome and vice versa.


Subject(s)
Locomotion , Osteoporosis , Humans , Aged , Cross-Sectional Studies , Longitudinal Studies , Osteoporosis/complications , Osteoporosis/epidemiology , Chronic Disease
11.
Proc Natl Acad Sci U S A ; 118(11)2021 03 16.
Article in English | MEDLINE | ID: mdl-33649182

ABSTRACT

HTLV-1-associated myelopathy (HAM/TSP) is a chronic and progressive inflammatory disease of the central nervous system. The aim of our study was to identify genetic determinants related to the onset of HAM/TSP in the Japanese population. We conducted a genome-wide association study comprising 753 HAM/TSP patients and 899 asymptomatic HTLV-1 carriers. We also performed comprehensive genotyping of HLA-A, -B, -C, -DPB1, -DQB1, and -DRB1 genes using next-generation sequencing technology for 651 HAM/TSP patients and 804 carriers. A strong association was observed in HLA class I (P = 1.54 × 10-9) and class II (P = 1.21 × 10-8) loci with HAM/TSP. Association analysis using HLA genotyping results showed that HLA-C*07:02 (P = 2.61 × 10-5), HLA-B*07:02 (P = 4.97 × 10-10), HLA-DRB1*01:01 (P = 1.15 × 10-9) and HLA-DQB1*05:01 (P = 2.30 × 10-9) were associated with disease risk, while HLA-B*40:06 (P = 3.03 × 10-5), HLA-DRB1*15:01 (P = 1.06 × 10-5) and HLA-DQB1*06:02 (P = 1.78 × 10-6) worked protectively. Logistic regression analysis identified amino acid position 7 in the G-BETA domain of HLA-DRB1 as strongly associated with HAM/TSP (P = 9.52 × 10-10); individuals homozygous for leucine had an associated increased risk of HAM/TSP (odds ratio, 9.57), and proline was protective (odds ratio, 0.65). Both associations were independent of the known risk associated with proviral load. DRB1-GB-7-Leu was not significantly associated with proviral load. We have identified DRB1-GB-7-Leu as a genetic risk factor for HAM/TSP development independent of proviral load. This suggests that the amino acid residue may serve as a specific marker to identify the risk of HAM/TSP even without knowledge of proviral load. In light of its allele frequency worldwide, this biomarker will likely prove useful in HTLV-1 endemic areas across the globe.


Subject(s)
Genome-Wide Association Study , HLA Antigens/genetics , Human T-lymphotropic virus 1/pathogenicity , Paraparesis, Tropical Spastic/genetics , Chromosome Mapping , Human T-lymphotropic virus 1/isolation & purification , Humans , Japan , Polymorphism, Single Nucleotide , Viral Load
12.
Methods Mol Biol ; 2131: 31-38, 2020.
Article in English | MEDLINE | ID: mdl-32162249

ABSTRACT

HLA is essential for various medical applications, such as genomic studies of multifactorial diseases, including immune system and inflammation-related disorders. Therefore, an accurate HLA typing method that is applicable for any allele registered in HLA allele databases is required to deduce scientific evidence related to disorders. Here, we describe a method for determining HLA alleles from next-generation sequencing (NGS) results by using currently available HLA sequence data in public HLA databases and show its application in association analysis.


Subject(s)
HLA Antigens/genetics , Histocompatibility Testing/methods , Algorithms , Alleles , High-Throughput Nucleotide Sequencing , Humans , Logistic Models , Sequence Analysis, DNA
13.
JCI Insight ; 5(2)2020 01 30.
Article in English | MEDLINE | ID: mdl-31855576

ABSTRACT

BACKGROUNDCurrent clinical biomarkers for the programmed cell death 1 (PD-1) blockade therapy are insufficient because they rely only on the tumor properties, such as programmed cell death ligand 1 expression frequency and tumor mutation burden. Identifying reliable, responsive biomarkers based on the host immunity is necessary to improve the predictive values.METHODSWe investigated levels of plasma metabolites and T cell properties, including energy metabolism markers, in the blood of patients with non-small cell lung cancer before and after treatment with nivolumab (n = 55). Predictive values of combination markers statistically selected were evaluated by cross-validation and linear discriminant analysis on discovery and validation cohorts, respectively. Correlation between plasma metabolites and T cell markers was investigated.RESULTSThe 4 metabolites derived from the microbiome (hippuric acid), fatty acid oxidation (butyrylcarnitine), and redox (cystine and glutathione disulfide) provided high response probability (AUC = 0.91). Similarly, a combination of 4 T cell markers, those related to mitochondrial activation (PPARγ coactivator 1 expression and ROS), and the frequencies of CD8+PD-1hi and CD4+ T cells demonstrated even higher prediction value (AUC = 0.96). Among the pool of selected markers, the 4 T cell markers were exclusively selected as the highest predictive combination, probably because of their linkage to the abovementioned metabolite markers. In a prospective validation set (n = 24), these 4 cellular markers showed a high accuracy rate for clinical responses of patients (AUC = 0.92).CONCLUSIONCombination of biomarkers reflecting host immune activity is quite valuable for responder prediction.FUNDINGAMED under grant numbers 18cm0106302h0003, 18gm0710012h0105, and 18lk1403006h0002; the Tang Prize Foundation; and JSPS KAKENHI grant numbers JP16H06149, 17K19593, and 19K17673.


Subject(s)
Antineoplastic Agents, Immunological/pharmacology , Biomarkers, Tumor/immunology , Immunotherapy/methods , Nivolumab/pharmacology , Programmed Cell Death 1 Receptor/drug effects , Programmed Cell Death 1 Receptor/immunology , Adult , Aged , Aged, 80 and over , Antineoplastic Agents, Immunological/therapeutic use , Carcinoma, Non-Small-Cell Lung , Carnitine/analogs & derivatives , Drug Therapy, Combination , Energy Metabolism , Female , Glutathione Disulfide , Hippurates , Humans , Lung Neoplasms , Male , Microbiota , Middle Aged , Nivolumab/therapeutic use , Programmed Cell Death 1 Receptor/metabolism , Prospective Studies
14.
Lancet Rheumatol ; 1(1): e14-e22, 2019 Sep.
Article in English | MEDLINE | ID: mdl-38229354

ABSTRACT

BACKGROUND: IgG4-related disease is a newly recognised immunopathological entity that includes autoimmune pancreatitis, IgG4-related sialadenitis, and IgG4-related kidney disease. To understand the genetic landscape of IgG4-related disease, we did a genome-wide association study. METHODS: We did a genome-wide association study of Japanese individuals, initially screening 857 patients with IgG4-related disease at 50 Japanese research institutions and DNA samples from 2082 healthy control participants from the Nagahama Prospective Genome Cohort for the Comprehensive Human Bioscience. From Oct 27, 2008, to July 22, 2014, we enrolled 835 patients and used data from 1789 healthy participants. Only patients with confirmed diagnosis of IgG4-related disease according to the international diagnostic criteria were included. Genotyping was done with the Infinium HumanOmni5Exome, HumanOmni2.5Exome, or HumanOmni2.5 Illumina arrays, and genomic distributions were compared between case and control samples for 958 440 single nucleotide polymorphisms. The HLA region was extensively analysed using imputation of HLA alleles and aminoacid residues. Fine mapping of the FCGR2B region was also done. Associations between clinical manifestations of disease and the genetic variations identified in these two genes were examined. FINDINGS: We identified the HLA-DRB1 (p=1·1×10-11) and FCGR2B (p=2·0×10-8) regions as susceptibility loci for IgG4-related disease. We also identified crucial aminoacid residues in the ß domain of the peptide-binding groove of HLA-DRB1, in which the seventh aminoacid residue showed the strongest association signal with IgG4-related disease (p=1·7×10-14), as has been reported with other autoimmune diseases. rs1340976 in FCGR2B showed an association with increased FCGR2B expression (p=2·7×10-10) and was in weak linkage disequilibrium with rs1050501, a missense variant of FCGR2B previously associated with systemic lupus erythematosus. Furthermore, rs1340976 was associated with the number of swollen organs at diagnosis (p=0·011) and IgG4 concentration at diagnosis (p=0·035). INTERPRETATION: Two susceptibility loci for IgG4-related disease were identified. Both FCGR2B and HLA loci might have important roles in IgG4-related disease development. Common molecular mechanisms might underlie IgG4-related disease and other immune-related disorders FUNDING: The Japanese Ministry of Health, Labour, and Welfare, the Japanese Agency of Medical Research and Development, and Kyoto University Grant for Top Global University Japan Project.

15.
Methods Mol Biol ; 1802: 225-233, 2018.
Article in English | MEDLINE | ID: mdl-29858813

ABSTRACT

HLA allele information is essential for a variety of medical applications, such as genomic studies of multifactorial diseases, including immune system and inflammation-related disorders, and donor selection in organ transplantation and regenerative medicine. To obtain this information, an accurate HLA typing method that is applicable for any allele registered in HLA allele databases is needed. Here, we describe a method for determining alleles from a current HLA database using next-generation sequencing (NGS) results.


Subject(s)
Alleles , Databases, Genetic , High-Throughput Nucleotide Sequencing/methods , Histocompatibility Testing/methods , Algorithms , Humans , Exome Sequencing
16.
Hum Mutat ; 38(7): 788-797, 2017 07.
Article in English | MEDLINE | ID: mdl-28419628

ABSTRACT

The accurate typing of human leukocyte antigen (HLA) alleles is critical for a variety of medical applications, such as genomic studies of multifactorial diseases, including immune system and inflammation-related disorders, and donor selection in organ transplantation and regenerative medicine. Here, we developed a new algorithm for determining HLA alleles using next-generation sequencing (NGS) results. The method consists of constructing an extensive dictionary of HLA alleles, precise mapping of the NGS reads, and calculating a score based on weighted read counts to select the most suitable pair of alleles. The developed algorithm compares the score of all allele pairs, taking into account variation not only in the domain for antigen presentation (G-DOMAIN), but also outside this domain. Using this method, HLA alleles could be determined with 6-digit precision. We showed that our method was more accurate than other NGS-based methods and revealed limitations of the conventional HLA typing technologies. Furthermore, we determined the complete genomic sequence of an HLA-A-like-pseudogene when we assembled NGS reads that had caused arguable typing, and found its identity with HLA-Y*02:01. The accuracy of the HLA-A allele typing was improved after the HLA-Y*02:01 sequence was included in the HLA allele dictionary.


Subject(s)
Computational Biology/methods , High-Throughput Nucleotide Sequencing/methods , Histocompatibility Testing , Sequence Analysis, DNA/methods , Algorithms , Alleles , Chromosome Mapping , Cloning, Molecular , DNA Primers , Databases, Factual , Exons , Genome, Human , Genomics , HLA Antigens/biosynthesis , HLA Antigens/genetics , Humans , Models, Statistical , Polymerase Chain Reaction , Pseudogenes , Reproducibility of Results
17.
Bioinformatics ; 28(7): 929-37, 2012 Apr 01.
Article in English | MEDLINE | ID: mdl-22332235

ABSTRACT

MOTIVATION: A reconstruction of full-length transcripts observed by next-generation sequencer or tiling arrays is an essential technique to know all phenomena of transcriptomes. Several techniques of the reconstruction have been developed. However, problems of high-level noises and biases still remain and interrupt the reconstruction. A method is required that is robust against noise and bias and correctly reconstructs transcripts regardless of equipment used. RESULTS: We propose a completely new statistical method that reconstructs full-length transcripts and can be applied on both next-generation sequencers and tiling arrays. The method called ARTADE2 analyzes 'positional correlation', meaning correlations of expression values for every combination on genomic positions of multiple transcriptional data. ARTADE2 then reconstructs full-length transcripts using a logistic model based on the positional correlation and the Markov model. ARTADE2 elucidated 17 591 full-length transcripts from 55 transcriptome datasets and showed notable performance compared with other recent prediction methods. Moreover, 1489 novel transcripts were discovered. We experimentally tested 16 novel transcripts, among which 14 were confirmed by reverse transcription-polymerase chain reaction and sequence mapping. The method also showed notable performance for reconstructing of mRNA observed by a next-generation sequencer. Moreover, the positional correlation and factor analysis embedded in ARTADE2 successfully detected regions at which alternative isoforms may exist, and thus are expected to be applied for discovering transcript biomarkers for a wide range of disciplines including preemptive medicine. AVAILABILITY: http://matome.base.riken.jp CONTACT: toyoda@base.riken.jp SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Computational Biology/methods , Gene Expression Profiling/methods , Sequence Analysis, DNA/methods , Transcriptome , Algorithms , Logistic Models , Markov Chains , Protein Isoforms/genetics , RNA, Messenger/genetics
18.
Plant Cell Physiol ; 52(2): 254-64, 2011 Feb.
Article in English | MEDLINE | ID: mdl-21227933

ABSTRACT

Recent advances in technologies for observing high-resolution genomic activities, such as whole-genome tiling arrays and high-throughput sequencers, provide detailed information for understanding genome functions. However, the functions of 50% of known Arabidopsis thaliana genes remain unknown or are annotated only on the basis of static analyses such as protein motifs or similarities. In this paper, we describe dynamic structure-based dynamic expression (DSDE) analysis, which sequentially predicts both structural and functional features of transcripts. We show that DSDE analysis inferred gene functions 12% more precisely than static structure-based dynamic expression (SSDE) analysis or conventional co-expression analysis based on previously determined gene structures of A. thaliana. This result suggests that more precise structural information than the fixed conventional annotated structures is crucial for co-expression analysis in systems biology of transcriptional regulation and dynamics. Our DSDE method, ARabidopsis Tiling-Array-based Detection of Exons version 2 and over-representation analysis (ARTADE2-ORA), precisely predicts each gene structure by combining two statistical analyses: a probe-wise co-expression analysis of multiple transcriptome measurements and a Markov model analysis of genome sequences. ARTADE2-ORA successfully identified the true functions of about 90% of functionally annotated genes, inferred the functions of 98% of functionally unknown genes and predicted 1,489 new gene structures and functions. We developed a database ARTADE2DB that integrates not only the information predicted by ARTADE2-ORA but also annotations and other functional information, such as phenotypes and literature citations, and is expected to contribute to the study of the functional genomics of A. thaliana. URL: http://artade.org.


Subject(s)
Arabidopsis/genetics , Databases, Genetic , Genomics/methods , Exons , Gene Expression Profiling , Genome, Plant , Markov Chains , Models, Statistical , Sequence Analysis, DNA/methods , Structure-Activity Relationship , Systems Biology , User-Computer Interface
19.
Methods Mol Biol ; 639: 141-55, 2010.
Article in English | MEDLINE | ID: mdl-20387044

ABSTRACT

Plants respond and adapt to drought, cold, and high-salinity stresses. Stress-inducible gene products function in the stress response and tolerance in plants. Using cDNA microarrays and oligonucleotide microarrays, stress-inducible genes have been identified in various plant species so far. Recently, tiling array technology has become a powerful tool for the whole-genome transcriptome analysis. We applied the Arabidopsis Affymetrix tiling arrays to study the whole-genome transcriptome under drought, cold, and high-salinity stresses and identified a large number of drought, cold, and high-salinity stress-inducible genes and transcriptional units (TUs).


Subject(s)
Arabidopsis/genetics , Genes, Plant/genetics , Oligonucleotide Array Sequence Analysis/methods , Stress, Physiological/genetics , Biotin/metabolism , Computational Biology , DNA, Complementary/biosynthesis , Droughts , Gene Expression Regulation, Plant , RNA, Complementary/genetics , RNA, Plant/genetics , RNA, Plant/isolation & purification , Staining and Labeling , Transcription, Genetic
20.
Nucleic Acids Res ; 37(Web Server issue): W57-62, 2009 Jul.
Article in English | MEDLINE | ID: mdl-19528066

ABSTRACT

OmicBrowse is a genome browser designed as a scalable system for maintaining numerous genome annotation datasets. It is an open source tool capable of regulating multiple user data access to each dataset to allow multiple users to have their own integrative view of both their unpublished and published datasets, so that the maintenance costs related to supplying each collaborator exclusively with their own private data are significantly reduced. OmicBrowse supports DAS1 imports and exports of annotations to Internet site servers worldwide. We also provide a data-download named OmicDownload server that interactively selects datasets and filters the data on the selected datasets. Our OmicBrowse server has been freely available at http://omicspace.riken.jp/ since its launch in 2003. The OmicBrowse source code is downloadable from http://sourceforge.net/projects/omicbrowse/.


Subject(s)
Computer Graphics , Genomics , Software , Databases, Genetic , Systems Integration , User-Computer Interface
SELECTION OF CITATIONS
SEARCH DETAIL