Search | VHL Regional Portal

1.

Assessing computational predictions of the phenotypic effect of cystathionine-beta-synthase variants.

Kasak, Laura; Bakolitsa, Constantina; Hu, Zhiqiang; Yu, Changhua; Rine, Jasper; Dimster-Denk, Dago F; Pandey, Gaurav; De Baets, Greet; Bromberg, Yana; Cao, Chen; Capriotti, Emidio; Casadio, Rita; Van Durme, Joost; Giollo, Manuel; Karchin, Rachel; Katsonis, Panagiotis; Leonardi, Emanuela; Lichtarge, Olivier; Martelli, Pier Luigi; Masica, David; Mooney, Sean D; Olatubosun, Ayodeji; Radivojac, Predrag; Rousseau, Frederic; Pal, Lipika R; Savojardo, Castrense; Schymkowitz, Joost; Thusberg, Janita; Tosatto, Silvio C E; Vihinen, Mauno; Väliaho, Jouni; Repo, Susanna; Moult, John; Brenner, Steven E; Friedberg, Iddo.

Hum Mutat ; 40(9): 1530-1545, 2019 09.

Article in English | MEDLINE | ID: mdl-31301157

ABSTRACT

Accurate prediction of the impact of genomic variation on phenotype is a major goal of computational biology and an important contributor to personalized medicine. Computational predictions can lead to a better understanding of the mechanisms underlying genetic diseases, including cancer, but their adoption requires thorough and unbiased assessment. Cystathionine-beta-synthase (CBS) is an enzyme that catalyzes the first step of the transsulfuration pathway, from homocysteine to cystathionine, and in which variations are associated with human hyperhomocysteinemia and homocystinuria. We have created a computational challenge under the CAGI framework to evaluate how well different methods can predict the phenotypic effect(s) of CBS single amino acid substitutions using a blinded experimental data set. CAGI participants were asked to predict yeast growth based on the identity of the mutations. The performance of the methods was evaluated using several metrics. The CBS challenge highlighted the difficulty of predicting the phenotype of an ex vivo system in a model organism when classification models were trained on human disease data. We also discuss the variations in difficulty of prediction for known benign and deleterious variants, as well as identify methodological and experimental constraints with lessons to be learned for future challenges.

Subject(s)

Amino Acid Substitution , Computational Biology/methods , Cystathionine beta-Synthase/genetics , Cystathionine/metabolism , Cystathionine beta-Synthase/metabolism , Homocysteine/metabolism , Humans , Phenotype , Precision Medicine

2.

Characterization of all possible single-nucleotide change caused amino acid substitutions in the kinase domain of Bruton tyrosine kinase.

Väliaho, Jouni; Faisal, Imrul; Ortutay, Csaba; Smith, C I Edvard; Vihinen, Mauno.

Hum Mutat ; 36(6): 638-47, 2015 Jun.

Article in English | MEDLINE | ID: mdl-25777788

ABSTRACT

Knowledge about features distinguishing deleterious and neutral variations is crucial for interpretation of novel variants. Bruton tyrosine kinase (BTK) contains the highest number of unique disease-causing variations among the human protein kinases, still it is just 10% of all the possible single-nucleotide substitution-caused amino acid variations (SNAVs). In the BTK kinase domain (BTK-KD) can appear altogether 1,495 SNAVs. We investigated them all with bioinformatic and protein structure analysis methods. Most disease-causing variations affect conserved and buried residues disturbing protein stability. Minority of exposed residues is conserved, but strongly tied to pathogenicity. Sixty-seven percent of variations are predicted to be harmful. In 39% of the residues, all the variants are likely harmful, whereas in 10% of sites, all the substitutions are tolerated. Results indicate the importance of the entire kinase domain, involvement in numerous interactions, and intricate functional regulation by conformational change. These results can be extended to other protein kinases and organisms.

Subject(s)

Amino Acid Substitution , Polymorphism, Single Nucleotide , Protein Interaction Domains and Motifs/genetics , Protein-Tyrosine Kinases/genetics , Agammaglobulinaemia Tyrosine Kinase , Agammaglobulinemia/genetics , Conserved Sequence , Evolution, Molecular , Genes, X-Linked , Humans , Models, Molecular , Protein Conformation , Protein-Tyrosine Kinases/chemistry , Selection, Genetic , Structure-Activity Relationship
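
The notion of single-nucleotide substitution-caused amino acid variations (SNAVs) in the abstract above can be illustrated with a short sketch. It is not the authors' pipeline: it only enumerates, for a coding sequence, the amino acid substitutions reachable by changing one nucleotide in each codon, using the standard genetic code. The function names (`snavs_for_codon`, `count_snavs`) and the choice to exclude synonymous and stop-codon changes are assumptions made for illustration, and the two-codon input in the usage note is a toy sequence, not BTK.

```python
from itertools import product

# Standard genetic code in the conventional compact TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def snavs_for_codon(codon):
    """Amino acids reachable from `codon` by exactly one nucleotide change.

    Assumption for this sketch: synonymous changes and changes to a stop
    codon are excluded, so only amino acid substitutions are counted.
    """
    original = CODON_TABLE[codon]
    reachable = set()
    for pos in range(3):
        for base in BASES:
            if base == codon[pos]:
                continue
            mutant = codon[:pos] + base + codon[pos + 1:]
            aa = CODON_TABLE[mutant]
            if aa != original and aa != "*":
                reachable.add(aa)
    return reachable


def count_snavs(cds):
    """Total number of distinct SNAVs over all complete codons of `cds`."""
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    codons = [cds[i:i + 3] for i in range(0, usable, 3)]
    return sum(len(snavs_for_codon(c)) for c in codons)
```

For example, `snavs_for_codon("TGG")` gives the five substitutions available to a tryptophan codon, and `count_snavs("ATGTGG")` sums the per-codon counts for a toy Met-Trp sequence.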

3.

Structure-function analysis indicates that sumoylation modulates DNA-binding activity of STAT1.

Grönholm, Juha; Vanhatupa, Sari; Ungureanu, Daniela; Väliaho, Jouni; Laitinen, Tuomo; Valjakka, Jarkko; Silvennoinen, Olli.

BMC Biochem ; 13: 20, 2012 Oct 08.

Article in English | MEDLINE | ID: mdl-23043228

ABSTRACT

BACKGROUND: STAT1 is an essential transcription factor for interferon-γ-mediated gene responses. A distinct sumoylation consensus site (ψKxE) 702IKTE705 is localized in the C-terminal region of STAT1, where Lys703 is a target for PIAS-induced SUMO modification. Several studies indicate that sumoylation has an inhibitory role on STAT1-mediated gene expression but the molecular mechanisms are not fully understood. RESULTS: Here, we have performed a structural and functional analysis of sumoylation in STAT1. We show that deconjugation of SUMO by SENP1 enhances the transcriptional activity of STAT1, confirming a negative regulatory effect of sumoylation on STAT1 activity. Inspection of molecular model indicated that consensus site is well exposed to SUMO-conjugation in STAT1 homodimer and that the conjugated SUMO moiety is directed towards DNA, thus able to form a sterical hindrance affecting promoter binding of dimeric STAT1. In addition, oligoprecipitation experiments indicated that sumoylation deficient STAT1 E705Q mutant has higher DNA-binding activity on STAT1 responsive gene promoters than wild-type STAT1. Furthermore, sumoylation deficient STAT1 E705Q mutant displayed enhanced histone H4 acetylation on interferon-γ-responsive promoter compared to wild-type STAT1. CONCLUSIONS: Our results suggest that sumoylation participates in regulation of STAT1 responses by modulating DNA-binding properties of STAT1.

Subject(s)

DNA/metabolism , STAT1 Transcription Factor/metabolism , Acetylation , Amino Acid Sequence , Amino Acid Substitution , Animals , COS Cells , Chlorocebus aethiops , Chromatin Immunoprecipitation , Cysteine Endopeptidases , Dimerization , Endopeptidases/chemistry , Endopeptidases/metabolism , HeLa Cells , Histones/metabolism , Humans , Promoter Regions, Genetic , Protein Structure, Tertiary , Recombinant Proteins/biosynthesis , Recombinant Proteins/chemistry , Recombinant Proteins/genetics , STAT1 Transcription Factor/chemistry , STAT1 Transcription Factor/genetics , Sumoylation
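
The ψKxE consensus site mentioned in the abstract above can be sketched as a simple pattern scan. This is an illustration only, not the method used in the study. The residue set standing in for ψ (`VILMFA`), the function name `find_sumo_sites`, and the `first_residue` numbering parameter are all assumptions, and the flanking residues in the usage note are made up; only the `IKTE` tetrapeptide, its 702-705 numbering, Lys703, and the E705Q substitution come from the abstract.

```python
import re

# Assumption: psi (a large hydrophobic residue) is approximated by this set.
# The lookahead lets overlapping candidate sites be reported as well.
SUMO_MOTIF = re.compile(r"(?=([VILMFA]K.E))")


def find_sumo_sites(sequence, first_residue=1):
    """Return (lysine_position, motif) pairs for psi-K-x-E matches.

    `first_residue` is the residue number assigned to sequence[0], so that
    positions can be reported in the numbering of the full-length protein.
    """
    sites = []
    for match in SUMO_MOTIF.finditer(sequence):
        start = match.start(1)
        # The lysine is the second residue of the four-residue motif.
        sites.append((first_residue + start + 1, match.group(1)))
    return sites
```

With a hypothetical fragment numbered from residue 700, `find_sumo_sites("GSIKTEAA", first_residue=700)` reports the lysine at position 703, while the same fragment carrying the E705Q change (`"GSIKTQAA"`) yields no match.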

4.

PON-P: integrated predictor for pathogenicity of missense variants.

Olatubosun, Ayodeji; Väliaho, Jouni; Härkönen, Jani; Thusberg, Janita; Vihinen, Mauno.

Hum Mutat ; 33(8): 1166-74, 2012 Aug.

Article in English | MEDLINE | ID: mdl-22505138

ABSTRACT

High-throughput sequencing data generation demands the development of methods for interpreting the effects of genomic variants. Numerous computational methods have been developed to assess the impact of variations because experimental methods are unable to cope with both the speed and volume of data generation. To harness the strength of currently available predictors, the Pathogenic-or-Not-Pipeline (PON-P) integrates five predictors to predict the probability that nonsynonymous variations affect protein function and may consequently be disease related. Random forest methodology-based PON-P shows consistently improved performance in cross-validation tests and on independent test sets, providing ternary classification and statistical reliability estimate of results. Applied to missense variants in a melanoma cancer cell line, PON-P predicts variants in 17 genes to affect protein function. Previous studies implicate nine of these genes in the pathogenesis of various forms of cancer. PON-P may thus be used as a first step in screening and prioritizing variants to determine deleterious ones for further experimentation.

Subject(s)

Computational Biology/methods , Databases, Genetic , Genetic Predisposition to Disease/genetics , Humans , Mutation, Missense/genetics

5.

IDR knowledge base for primary immunodeficiencies.

Samarghitean, Crina; Väliaho, Jouni; Vihinen, Mauno.

Immunome Res ; 3: 6, 2007 Mar 29.

Article in English | MEDLINE | ID: mdl-17394641

ABSTRACT

BACKGROUND: The ImmunoDeficiency Resource (IDR) is a knowledge base for the integration of the clinical, biochemical, genetic, genomic, proteomic, structural, and computational data of primary immunodeficiencies. The need for the IDR arises from the lack of structured and systematic information about primary immunodeficiencies on the Internet, and from the lack of a common platform which enables doctors, researchers, students, nurses and patients to find out validated information about these diseases. DESCRIPTION: The IDR knowledge base, first released in 1999, has grown substantially. It contains information for 158 diseases, both from a clinical as well as molecular point of view. The database and the user interface have been reformatted. This new IDR release has a richer and more complete breadth, depth and scope. The service provides the most complete and up-to-date dataset. The IDR has been integrated with several internal and external databases and services. The contents of the IDR are validated and selected for different types of users (doctors, nurses, researchers and students, as well as patients and their families). The search engine has been improved and allows either a detailed or a broad search from a simple user interface. CONCLUSION: The IDR is the first knowledge base specifically designed to capture in a systematic and validated way both clinical and molecular information for primary immunodeficiencies. The service is freely available at http://bioinf.uta.fi/idr and is regularly updated. The IDR facilitates primary immunodeficiencies informatics and helps to parameterise in silico modelling of these diseases. The IDR is useful also as an advanced education tool for medical students, and physicians.

6.

PhenCode: connecting ENCODE data with mutations and phenotype.

Giardine, Belinda; Riemer, Cathy; Hefferon, Tim; Thomas, Daryl; Hsu, Fan; Zielenski, Julian; Sang, Yunhua; Elnitski, Laura; Cutting, Garry; Trumbower, Heather; Kern, Andrew; Kuhn, Robert; Patrinos, George P; Hughes, Jim; Higgs, Doug; Chui, David; Scriver, Charles; Phommarinh, Manyphong; Patnaik, Santosh K; Blumenfeld, Olga; Gottlieb, Bruce; Vihinen, Mauno; Väliaho, Jouni; Kent, Jim; Miller, Webb; Hardison, Ross C.

Hum Mutat ; 28(6): 554-62, 2007 Jun.

Article in English | MEDLINE | ID: mdl-17326095

ABSTRACT

PhenCode (Phenotypes for ENCODE; http://www.bx.psu.edu/phencode) is a collaborative, exploratory project to help understand phenotypes of human mutations in the context of sequence and functional data from genome projects. Currently, it connects human phenotype and clinical data in various locus-specific databases (LSDBs) with data on genome sequences, evolutionary history, and function from the ENCODE project and other resources in the UCSC Genome Browser. Initially, we focused on a few selected LSDBs covering genes encoding alpha- and beta-globins (HBA, HBB), phenylalanine hydroxylase (PAH), blood group antigens (various genes), androgen receptor (AR), cystic fibrosis transmembrane conductance regulator (CFTR), and Bruton's tyrosine kinase (BTK), but we plan to include additional loci of clinical importance, ultimately genomewide. We have also imported variant data and associated OMIM links from Swiss-Prot. Users can find interesting mutations in the UCSC Genome Browser (in a new Locus Variants track) and follow links back to the LSDBs for more detailed information. Alternatively, they can start with queries on mutations or phenotypes at an LSDB and then display the results at the Genome Browser to view complementary information such as functional data (e.g., chromatin modifications and protein binding from the ENCODE consortium), evolutionary constraint, regulatory potential, and/or any other tracks they choose. We present several examples illustrating the power of these connections for exploring phenotypes associated with functional elements, and for identifying genomic data that could help to explain clinical phenotypes.

Subject(s)

Databases, Genetic , Mutation , Phenotype , Agammaglobulinaemia Tyrosine Kinase , Blood Group Antigens/genetics , Cooperative Behavior , Cystic Fibrosis Transmembrane Conductance Regulator/genetics , Databases, Genetic/standards , Genotype , Globins/genetics , Humans , Internet , Phenylalanine Hydroxylase/genetics , Protein-Tyrosine Kinases/genetics , Receptors, Androgen/genetics , Software Design , Systems Integration

7.

BTKbase: the mutation database for X-linked agammaglobulinemia.

Väliaho, Jouni; Smith, C I Edvard; Vihinen, Mauno.

Hum Mutat ; 27(12): 1209-17, 2006 Dec.

Article in English | MEDLINE | ID: mdl-16969761

ABSTRACT

X-linked agammaglobulinemia (XLA) is a hereditary immunodeficiency caused by mutations in the gene encoding Bruton tyrosine kinase (BTK). XLA patients have a decreased number of mature B cells and a lack of all immunoglobulin isotypes, resulting in susceptibility to severe bacterial infections. XLA-causing mutations are collected in a mutation database (BTKbase), which is available at http://bioinf.uta.fi/BTKbase. For each patient the following information is given (when available): the identification of the entry, a plain English description of the mutation followed by a reference, formal characterization of the mutation, and the various parameters from the patient. BTKbase is implemented with the MUTbase program suite, which provides an easy, interactive, and quality controlled submission of information to mutation databases. BTKbase version 8 lists mutation entries of 1,111 patients from 973 unrelated families showing 602 unique molecular events. The localization of the mutations on the gene and protein for BTK can be analyzed by clicking sequences on the web pages. The distribution of the mutations in the five structural domains is approximately proportional to the length of the domains, except for the Tec homology (TH) domain. The most frequently affected sites are CpG dinucleotides. The majority of the missense mutations are structural-disturbing Bruton tyrosine kinase (Btk) folding or decreasing stability. Many of the mutations affect functionally significant, conserved residues. The structural consequences of the mutations in all the domains have been studied based on crystallographic and nuclear magnetic resonance (NMR) structures as well as computer-aided molecular modeling.

Subject(s)

Agammaglobulinemia/genetics , Databases, Genetic , Genetic Diseases, X-Linked/genetics , Mutation , Amino Acid Sequence , Genetic Diseases, X-Linked/classification , Humans , Models, Molecular , Molecular Sequence Data , Structure-Activity Relationship

8.

Immunodeficiency mutation databases (IDbases).

Piirilä, Hilkka; Väliaho, Jouni; Vihinen, Mauno.

Hum Mutat ; 27(12): 1200-8, 2006 Dec.

Article in English | MEDLINE | ID: mdl-17004234

ABSTRACT

Primary immunodeficiencies (IDs) are a heterogenic group of inherited disorders of the immune system. Immunodeficiency patients have increased susceptibility to recurrent and persistent, even life-threatening infections. Mutations in a large number of genes can cause defects in different cellular functions and lead to impaired immune response. To date, approximately 150 IDs and more than 100 affected genes have been identified. ID-related genes are distributed throughout the genome, and diseases can be inherited in an X-linked, an autosomal recessive, or an autosomal dominant way. We have collected ID mutation data into locus-specific patient-related mutation databases, IDbases (http://bioinf.uta.fi/IDbases). Mutations are described at DNA, mRNA, and protein levels with links to reference sequences and reference articles. The mutation data has been collated into entries along with some clinical information. IDbases offer an easy way, e.g., to find recently identified mutations, to reveal genotype-phenotype correlations, and to discover a specific mutation or to examine the most common mutations in a single immunodeficiency related gene. At the moment we have databases for 107 ID genes with 4,140 public patient entries. An exhaustive statistical analysis of mutation data from the IDbases was made. Missense and nonsense mutations are the most common mutation types, and the most common single substitution is a nonsense mutation from tryptophan to a stop codon. Arginine is the most mutated as well as the most abundant mutant amino acid.

Subject(s)

Databases, Genetic , Immunologic Deficiency Syndromes/genetics , Mutation , Amino Acid Sequence , Humans , Molecular Sequence Data , Software

9.

Distribution of immunodeficiency fact files with XML--from Web to WAP.

Väliaho, Jouni; Riikonen, Pentti; Vihinen, Mauno.

BMC Med Inform Decis Mak ; 5: 21, 2005 Jun 26.

Article in English | MEDLINE | ID: mdl-15978138

ABSTRACT

BACKGROUND: Although biomedical information is growing rapidly, it is difficult to find and retrieve validated data especially for rare hereditary diseases. There is an increased need for services capable of integrating and validating information as well as providing it in a logically organized structure. An XML-based language enables creation of open source databases for storage, maintenance and delivery for different platforms. METHODS: Here we present a new data model called fact file and an XML-based specification Inherited Disease Markup Language (IDML), that were developed to facilitate disease information integration, storage and exchange. The data model was applied to primary immunodeficiencies, but it can be used for any hereditary disease. Fact files integrate biomedical, genetic and clinical information related to hereditary diseases. RESULTS: IDML and fact files were used to build a comprehensive Web and WAP accessible knowledge base ImmunoDeficiency Resource (IDR) available at http://bioinf.uta.fi/idr/. A fact file is a user oriented user interface, which serves as a starting point to explore information on hereditary diseases. CONCLUSION: The IDML enables the seamless integration and presentation of genetic and disease information resources in the Internet. IDML can be used to build information services for all kinds of inherited diseases. The open source specification and related programs are available at http://bioinf.uta.fi/idml/.

Subject(s)

Database Management Systems , Databases, Genetic , Genetic Diseases, Inborn , Immunologic Deficiency Syndromes , Internet , Humans , Metabolism, Inborn Errors , Programming Languages , Systems Integration , User-Computer Interface

10.

KinMutBase: a registry of disease-causing mutations in protein kinase domains.

Ortutay, Csaba; Väliaho, Jouni; Stenberg, Kaj; Vihinen, Mauno.

Hum Mutat ; 25(5): 435-42, 2005 May.

Article in English | MEDLINE | ID: mdl-15832311

ABSTRACT

A large number of disease-causing mutations have been identified from several protein kinases. KinMutBase is a comprehensive knowledge base for human disease-related mutations in protein kinase domains (http://bioinf.uta.fi/KinMutBase/). The latest version contains 582 different mutations for 1,790 cases in 1,322 families. KinMutBase entries are described on the DNA, mRNA, and protein level. Numbers for affected patients and families are also provided. KinMutBase has extensive amount of links and cross-references to literature, other databases, and information sources. There are numerous interactive pages about sequences, structures, mutation statistics, and diseases. Detailed statistical study was done on frequencies of different types of mutations both on the DNA and protein level in serine/threonine kinase (PSK) and tyrosine kinase (PTK). Three-dimensional structures indicate clustering of disease-related mutations mainly to conserved subdomains, and substrate and coligand binding amino acids, although mutations appear throughout the sequences. CpG containing codons, especially for arginine, constitute the majority of mutational hotspots. There are certain clear differences in mutation patterns and types between PSKs and PTKs.

Subject(s)

Databases, Nucleic Acid , Databases, Protein , Mutation , Protein Kinases/genetics , Registries , Amino Acid Sequence , Genetic Predisposition to Disease , Humans , Models, Molecular , Molecular Sequence Data , Protein Kinases/chemistry , Protein Structure, Tertiary

11.

Bruton's tyrosine kinase: cell biology, sequence conservation, mutation spectrum, siRNA modifications, and expression profiling.

Lindvall, Jessica M; Blomberg, K Emelie M; Väliaho, Jouni; Vargas, Leonardo; Heinonen, Juhana E; Berglöf, Anna; Mohamed, Abdalla J; Nore, Beston F; Vihinen, Mauno; Smith, C I Edvard.

Immunol Rev ; 203: 200-15, 2005 Feb.

Article in English | MEDLINE | ID: mdl-15661031

ABSTRACT

Bruton's tyrosine kinase (Btk) is encoded by the gene that when mutated causes the primary immunodeficiency disease X-linked agammaglobulinemia (XLA) in humans and X-linked immunodeficiency (Xid) in mice. Btk is a member of the Tec family of protein tyrosine kinases (PTKs) and plays a vital, but diverse, modulatory role in many cellular processes. Mutations affecting Btk block B-lymphocyte development. Btk is conserved among species, and in this review, we present the sequence of the full-length rat Btk and find it to be analogous to the mouse Btk sequence. We have also analyzed the wealth of information compiled in the mutation database for XLA (BTKbase), representing 554 unique molecular events in 823 families and demonstrate that only selected amino acids are sensitive to replacement (P < 0.001). Although genotype-phenotype correlations have not been established in XLA, based on these findings, we hypothesize that this relationship indeed exists. Using short interfering-RNA technology, we have previously generated active constructs downregulating Btk expression. However, application of recently established guidelines to enhance or decrease the activity was not successful, demonstrating the importance of the primary sequence. We also review the outcome of expression profiling, comparing B lymphocytes from XLA-, Xid-, and Btk-knockout (KO) donors to healthy controls. Finally, in spite of a few genes differing in expression between Xid- and Btk-KO mice, in vivo competition between cells expressing either mutation shows that there is no selective survival advantage of cells carrying one genetic defect over the other. We conclusively demonstrate that for the R28C-missense mutant (Xid), there is no biologically relevant residual activity or any dominant negative effect versus other proteins.

Subject(s)

Agammaglobulinemia/genetics , Immunologic Deficiency Syndromes/genetics , Protein-Tyrosine Kinases/chemistry , Protein-Tyrosine Kinases/genetics , Agammaglobulinaemia Tyrosine Kinase , Amino Acid Sequence , Animals , Conserved Sequence , Gene Expression Profiling , Humans , Mice , Molecular Sequence Data , Mutation , Protein-Tyrosine Kinases/metabolism , RNA, Small Interfering/genetics , Rats , Sequence Alignment

12.

Online registry of genetic and clinical immunodeficiency diagnostic laboratories, IDdiagnostics.

Samarghitean, Crina; Väliaho, Jouni; Vihinen, Mauno.

J Clin Immunol ; 24(1): 53-61, 2004 Jan.

Article in English | MEDLINE | ID: mdl-14997034

ABSTRACT

Primary immunodeficiencies (IDs) are caused by inherited genetic defects leading to intrinsic defects in cells of the immune systems. Most IDs are rare diseases and can be difficult to diagnose because similar symptoms characterize several disorders. Mutation detection is the most reliable method in such cases. These tests are not available at most centers and physicians can have difficulties in finding laboratories that could analyze the genetic defects because certain genes are possibly analyzed by just one laboratory. The IDdiagnostics registry has been established to provide information for physicians and other health care professionals. The database at http://bioinf.uta.fi/IDdiagnostics contains currently information for the analysis of defects in 30 ID-related genes. Another part of IDdiagnostics is a database of clinical tests. Laboratories performing these analyses, either gene or clinical tests, are asked to submit their information to the database by using a printed form or electronic submission at http://bioinf.uta.fi/cgi-bin/submit/IDClini.cgi. The clinical test database contains information about tests for clinical data, immune status, and studies of function, antibody response, cell function, enzyme assays, clinical function, and apoptosis assays. Both the services are freely available and regularly updated. The services aim at increasing the awareness of IDs and helping to obtain exact and early diagnosis.

Subject(s)

Databases as Topic , Diagnostic Services , Immunologic Deficiency Syndromes/diagnosis , Internet , Humans , Immunologic Deficiency Syndromes/genetics

13.

IDR: the ImmunoDeficiency Resource.

Väliaho, Jouni; Pusa, Marianne; Ylinen, Tuomo; Vihinen, Mauno.

Nucleic Acids Res ; 30(1): 232-4, 2002 Jan 01.

Article in English | MEDLINE | ID: mdl-11752302

ABSTRACT

The ImmunoDeficiency Resource (IDR), freely available at http://www.uta.fi/imt/bioinfo/idr/, is a comprehensive knowledge base on immunodeficiencies. It is designed for different user groups such as researchers, physicians and nurses as well as patients and their families and the general public. Information on immunodeficiencies is stored as fact files, which are disease- and gene-based information resources. We have developed an inherited disease markup language (IDML) data model, which is designed for storing disease- and gene-specific data in extensible markup language (XML) format. The fact files written by the IDML can be used to present data in different contexts and platforms. All the information in the IDR is validated by expert curators.

Subject(s)

Databases, Factual , Immunologic Deficiency Syndromes , Database Management Systems , Humans , Immunologic Deficiency Syndromes/diagnosis , Immunologic Deficiency Syndromes/immunology , Immunologic Deficiency Syndromes/therapy , Information Storage and Retrieval , Internet , Quality Control
