Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 176
Filter
Add more filters

Country/Region as subject
Publication year range
1.
Blood ; 143(18): 1856-1872, 2024 May 02.
Article in English | MEDLINE | ID: mdl-38427583

ABSTRACT

ABSTRACT: Allogeneic stem cell transplantation (alloSCT) is a curative treatment for hematological malignancies. After HLA-matched alloSCT, antitumor immunity is caused by donor T cells recognizing polymorphic peptides, designated minor histocompatibility antigens (MiHAs), that are presented by HLA on malignant patient cells. However, T cells often target MiHAs on healthy nonhematopoietic tissues of patients, thereby inducing side effects known as graft-versus-host disease. Here, we aimed to identify the dominant repertoire of HLA-I-restricted MiHAs to enable strategies to predict, monitor or modulate immune responses after alloSCT. To systematically identify novel MiHAs by genome-wide association screening, T-cell clones were isolated from 39 transplanted patients and tested for reactivity against 191 Epstein-Barr virus transformed B cell lines of the 1000 Genomes Project. By discovering 81 new MiHAs, we more than doubled the antigen repertoire to 159 MiHAs and demonstrated that, despite many genetic differences between patients and donors, often the same MiHAs are targeted in multiple patients. Furthermore, we showed that one quarter of the antigens are cryptic, that is translated from unconventional open reading frames, for example long noncoding RNAs, showing that these antigen types are relevant targets in natural immune responses. Finally, using single cell RNA-seq data, we analyzed tissue expression of MiHA-encoding genes to explore their potential role in clinical outcome, and characterized 11 new hematopoietic-restricted MiHAs as potential targets for immunotherapy. In conclusion, we expanded the repertoire of HLA-I-restricted MiHAs and identified recurrent, cryptic and hematopoietic-restricted antigens, which are fundamental to predict, follow or manipulate immune responses to improve clinical outcome after alloSCT.


Subject(s)
Hematopoietic Stem Cell Transplantation , Histocompatibility Antigens Class I , Minor Histocompatibility Antigens , Humans , Minor Histocompatibility Antigens/genetics , Minor Histocompatibility Antigens/immunology , Histocompatibility Antigens Class I/immunology , Histocompatibility Antigens Class I/genetics , Hematologic Neoplasms/immunology , Hematologic Neoplasms/therapy , Hematologic Neoplasms/genetics , T-Lymphocytes/immunology , Genome-Wide Association Study , Transplantation, Homologous , Female , Male
2.
BMC Genomics ; 24(1): 305, 2023 Jun 06.
Article in English | MEDLINE | ID: mdl-37280537

ABSTRACT

Our incomplete knowledge of the human transcriptome impairs the detection of disease-causing variants, in particular if they affect transcripts only expressed under certain conditions. These transcripts are often lacking from reference transcript sets, such as Ensembl/GENCODE and RefSeq, and could be relevant for establishing genetic diagnoses. We present SUsPECT (Solving Unsolved Patient Exomes/gEnomes using Custom Transcriptomes), a pipeline based on the Ensembl Variant Effect Predictor (VEP) to predict variant impact on custom transcript sets, such as those generated by long-read RNA-sequencing, for downstream prioritization. Our pipeline predicts the functional consequence and likely deleteriousness scores for missense variants in the context of novel open reading frames predicted from any transcriptome. We demonstrate the utility of SUsPECT by uncovering potential mutational mechanisms of pathogenic variants in ClinVar that are not predicted to be pathogenic using the reference transcript annotation. In further support of SUsPECT's utility, we identified an enrichment of immune-related variants predicted to have a more severe molecular consequence when annotating with a newly generated transcriptome from stimulated immune cells instead of the reference transcriptome. Our pipeline outputs crucial information for further prioritization of potentially disease-causing variants for any disease and will become increasingly useful as more long-read RNA sequencing datasets become available.


Subject(s)
Software , Transcriptome , Humans , Molecular Sequence Annotation , Sequence Analysis, RNA/methods , Exome , High-Throughput Nucleotide Sequencing
3.
Biom J ; 65(1): e2100123, 2023 01.
Article in English | MEDLINE | ID: mdl-35818126

ABSTRACT

Statistical methods to test for effects of single nucleotide polymorphisms (SNPs) on exon inclusion exist but often rely on testing of associations between multiple exon-SNP pairs, with sometimes subsequent summarization of results at the gene level. Such approaches require heavy multiple testing corrections and detect mostly events with large effect sizes. We propose here a test to find spliceQTL (splicing quantitative trait loci) effects that takes all exons and all SNPs into account simultaneously. For any chosen gene, this score-based test looks for an association between the set of exon expressions and the set of SNPs, via a random-effects model framework. It is efficient to compute and can be used if the number of SNPs is larger than the number of samples. In addition, the test is powerful in detecting effects that are relatively small for individual exon-SNP pairs but are observed for many pairs. Furthermore, test results are more often replicated across datasets than pairwise testing results. This makes our test more robust to exon-SNP pair-specific effects, which do not extend to multiple pairs within the same gene. We conclude that the test we propose here offers more power and better replicability in the search for spliceQTL effects.


Subject(s)
Polymorphism, Single Nucleotide , Quantitative Trait Loci , Genome-Wide Association Study/methods
4.
Hum Mutat ; 43(6): 717-733, 2022 06.
Article in English | MEDLINE | ID: mdl-35178824

ABSTRACT

Rare disease patients are more likely to receive a rapid molecular diagnosis nowadays thanks to the wide adoption of next-generation sequencing. However, many cases remain undiagnosed even after exome or genome analysis, because the methods used missed the molecular cause in a known gene, or a novel causative gene could not be identified and/or confirmed. To address these challenges, the RD-Connect Genome-Phenome Analysis Platform (GPAP) facilitates the collation, discovery, sharing, and analysis of standardized genome-phenome data within a collaborative environment. Authorized clinicians and researchers submit pseudonymised phenotypic profiles encoded using the Human Phenotype Ontology, and raw genomic data which is processed through a standardized pipeline. After an optional embargo period, the data are shared with other platform users, with the objective that similar cases in the system and queries from peers may help diagnose the case. Additionally, the platform enables bidirectional discovery of similar cases in other databases from the Matchmaker Exchange network. To facilitate genome-phenome analysis and interpretation by clinical researchers, the RD-Connect GPAP provides a powerful user-friendly interface and leverages tens of information sources. As a result, the resource has already helped diagnose hundreds of rare disease patients and discover new disease causing genes.


Subject(s)
Genomics , Rare Diseases , Exome , Genetic Association Studies , Genomics/methods , Humans , Phenotype , Rare Diseases/diagnosis , Rare Diseases/genetics
5.
BMC Genomics ; 23(1): 546, 2022 Jul 31.
Article in English | MEDLINE | ID: mdl-35907790

ABSTRACT

Population-scale expression profiling studies can provide valuable insights into biological and disease-underlying mechanisms. The availability of phenotypic traits is essential for studying clinical effects. Therefore, missing, incomplete, or inaccurate phenotypic information can make analyses challenging and prevent RNA-seq or other omics data to be reused. A possible solution are predictors that infer clinical or behavioral phenotypic traits from molecular data. While such predictors have been developed based on different omics data types and are being applied in various studies, metabolomics-based surrogates are less commonly used than predictors based on DNA methylation profiles.In this study, we inferred 17 traits, including diabetes status and exposure to lipid medication, using previously trained metabolomic predictors. We evaluated whether these metabolomic surrogates can be used as an alternative to reported information for studying the respective phenotypes using expression profiling data of four population cohorts. For the majority of the 17 traits, the metabolomic surrogates performed similarly to the reported phenotypes in terms of effect sizes, number of significant associations, replication rates, and significantly enriched pathways.The application of metabolomics-derived surrogate outcomes opens new possibilities for reuse of multi-omics data sets. In studies where availability of clinical metadata is limited, missing or incomplete information can be complemented by these surrogates, thereby increasing the size of available data sets. Additionally, the availability of such surrogates could be used to correct for potential biological confounding. In the future, it would be interesting to further investigate the use of molecular predictors across different omics types and cohorts.


Subject(s)
Metabolomics , Phenotype
6.
BMC Med ; 20(1): 395, 2022 11 10.
Article in English | MEDLINE | ID: mdl-36352383

ABSTRACT

BACKGROUND: Myotonic dystrophy type 1 (DM1) is an incurable multisystem disease caused by a CTG-repeat expansion in the DM1 protein kinase (DMPK) gene. The OPTIMISTIC clinical trial demonstrated positive and heterogenous effects of cognitive behavioral therapy (CBT) on the capacity for activity and social participations in DM1 patients. Through a process of reverse engineering, this study aims to identify druggable molecular biomarkers associated with the clinical improvement in the OPTIMISTIC cohort. METHODS: Based on full blood samples collected during OPTIMISTIC, we performed paired mRNA sequencing for 27 patients before and after the CBT intervention. Linear mixed effect models were used to identify biomarkers associated with the disease-causing CTG expansion and the mean clinical improvement across all clinical outcome measures. RESULTS: We identified 608 genes for which their expression was significantly associated with the CTG-repeat expansion, as well as 1176 genes significantly associated with the average clinical response towards the intervention. Remarkably, all 97 genes associated with both returned to more normal levels in patients who benefited the most from CBT. This main finding has been replicated based on an external dataset of mRNA data of DM1 patients and controls, singling these genes out as candidate biomarkers for therapy response. Among these candidate genes were DNAJB12, HDAC5, and TRIM8, each belonging to a protein family that is being studied in the context of neurological disorders or muscular dystrophies. Across the different gene sets, gene pathway enrichment analysis revealed disease-relevant impaired signaling in, among others, insulin-, metabolism-, and immune-related pathways. Furthermore, evidence for shared dysregulations with another neuromuscular disease, Duchenne muscular dystrophy, was found, suggesting a partial overlap in blood-based gene dysregulation. CONCLUSIONS: DM1-relevant disease signatures can be identified on a molecular level in peripheral blood, opening new avenues for drug discovery and therapy efficacy assessments.


Subject(s)
Myotonic Dystrophy , Humans , Carrier Proteins/genetics , Carrier Proteins/metabolism , Gene Expression , HSP40 Heat-Shock Proteins/genetics , HSP40 Heat-Shock Proteins/metabolism , Myotonic Dystrophy/genetics , Myotonic Dystrophy/metabolism , Myotonic Dystrophy/therapy , Nerve Tissue Proteins/genetics , RNA, Messenger/genetics , Trinucleotide Repeat Expansion
7.
Hum Mutat ; 42(7): 799-810, 2021 07.
Article in English | MEDLINE | ID: mdl-33942434

ABSTRACT

Hereditary disorders are frequently caused by genetic variants that affect pre-messenger RNA splicing. Though genetic variants in the canonical splice motifs are almost always disrupting splicing, the pathogenicity of variants in the noncanonical splice sites (NCSS) and deep intronic (DI) regions are difficult to predict. Multiple splice prediction tools have been developed for this purpose, with the latest tools employing deep learning algorithms. We benchmarked established and deep learning splice prediction tools on published gold standard sets of 71 NCSS and 81 DI variants in the ABCA4 gene and 61 NCSS variants in the MYBPC3 gene with functional assessment in midigene and minigene splice assays. The selection of splice prediction tools included CADD, DSSP, GeneSplicer, MaxEntScan, MMSplice, NNSPLICE, SPIDEX, SpliceAI, SpliceRover, and SpliceSiteFinder-like. The best-performing splice prediction tool for the different variants was SpliceRover for ABCA4 NCSS variants, SpliceAI for ABCA4 DI variants, and the Alamut 3/4 consensus approach (GeneSplicer, MaxEntScacn, NNSPLICE and SpliceSiteFinder-like) for NCSS variants in MYBPC3 based on the area under the receiver operator curve. Overall, the performance in a real-time clinical setting is much more modest than reported by the developers of the tools.


Subject(s)
Deep Learning , ATP-Binding Cassette Transporters/genetics , Benchmarking , Humans , Introns/genetics , Mutation , RNA Splice Sites/genetics , RNA Splicing
8.
J Proteome Res ; 20(6): 3353-3364, 2021 06 04.
Article in English | MEDLINE | ID: mdl-33998808

ABSTRACT

Discovery of variant peptides such as a single amino acid variant (SAAV) in shotgun proteomics data is essential for personalized proteomics. Both the resolution of shotgun proteomics methods and the search engines have improved dramatically, allowing for confident identification of SAAV peptides. However, it is not yet known if these methods are truly successful in accurately identifying SAAV peptides without prior genomic information in the search database. We studied this in unprecedented detail by exploiting publicly available long-read RNA sequences and shotgun proteomics data from the gold standard reference cell line NA12878. Searching spectra from this cell line with the state-of-the-art open modification search engine ionbot against carefully curated search databases resulted in 96.7% false-positive SAAVs and an 85% lower true positive rate than searching with peptide search databases that incorporate prior genetic information. While adding genetic variants to the search database remains indispensable for correct peptide identification, inclusion of long-read RNA sequences in the search database contributes only 0.3% new peptide identifications. These findings reveal the differences in SAAV detection that result from various approaches, providing guidance to researchers studying SAAV peptides and developers of peptide spectrum identification tools.


Subject(s)
Proteogenomics , Amino Acids , Databases, Protein , Proteome/genetics , Proteomics , Search Engine
9.
J Biomed Inform ; 122: 103897, 2021 10.
Article in English | MEDLINE | ID: mdl-34454078

ABSTRACT

INTRODUCTION: Existing methods to make data Findable, Accessible, Interoperable, and Reusable (FAIR) are usually carried out in a post hoc manner: after the research project is conducted and data are collected. De-novo FAIRification, on the other hand, incorporates the FAIRification steps in the process of a research project. In medical research, data is often collected and stored via electronic Case Report Forms (eCRFs) in Electronic Data Capture (EDC) systems. By implementing a de novo FAIRification process in such a system, the reusability and, thus, scalability of FAIRification across research projects can be greatly improved. In this study, we developed and implemented a novel method for de novo FAIRification via an EDC system. We evaluated our method by applying it to the Registry of Vascular Anomalies (VASCA). METHODS: Our EDC and research project independent method ensures that eCRF data entered into an EDC system can be transformed into machine-readable, FAIR data using a semantic data model (a canonical representation of the data, based on ontology concepts and semantic web standards) and mappings from the model to questions on the eCRF. The FAIRified data are stored in a triple store and can, together with associated metadata, be accessed and queried through a FAIR Data Point. The method was implemented in Castor EDC, an EDC system, through a data transformation application. The FAIRness of the output of the method, the FAIRified data and metadata, was evaluated using the FAIR Evaluation Services. RESULTS: We successfully applied our FAIRification method to the VASCA registry. Data entered on eCRFs is automatically transformed into machine-readable data and can be accessed and queried using SPARQL queries in the FAIR Data Point. Twenty-one FAIR Evaluator tests pass and one test regarding the metadata persistence policy fails, since this policy is not in place yet. CONCLUSION: In this study, we developed a novel method for de novo FAIRification via an EDC system. Its application in the VASCA registry and the automated FAIR evaluation show that the method can be used to make clinical research data FAIR when they are entered in an eCRF without any intervention from data management and data entry personnel. Due to the generic approach and developed tooling, we believe that our method can be used in other registries and clinical trials as well.


Subject(s)
Biomedical Research , Metadata , Data Management , Electronics , Registries
10.
J Biol Chem ; 294(1): 1-9, 2019 01 04.
Article in English | MEDLINE | ID: mdl-30455357

ABSTRACT

RNA-binding proteins (RBPs) play important roles in the control of gene expression and the coordination of different layers of post-transcriptional regulation. Interactions between certain RBPs and mRNA transcripts are notoriously difficult to predict, as any given protein-RNA interaction may rely not only on RNA sequence, but also on three-dimensional RNA structures, competitive inhibition from other RBPs, and input from cellular signaling pathways. Advanced and high-throughput technologies for the identification of RNA-protein interactions have come to the rescue, but the identification of binding sites and downstream functional effects of RBPs from the resulting data can be challenging. In this review, we discuss statistical inference and machine-learning approaches and tools relevant for the study of RBPs and the analysis of large-scale RNA-protein interaction datasets. This primer is intended for life scientists who are interested in incorporating these tools into their own research. We begin with the demystification of regression models, as used in the analysis of next-generation sequencing data, and progress to a discussion of Hidden Markov Models, which are of particular value in analyzing cross-linking followed by immunoprecipitation data. We then continue with examples of machine learning techniques, such as support vector machines and gradient tree boosting. We close with a brief discussion of current trends in the field, including deep learning architectures.


Subject(s)
Computer Simulation , Models, Chemical , RNA-Binding Proteins/chemistry , Databases, Nucleic Acid , Databases, Protein , RNA/genetics , RNA/metabolism , RNA-Binding Proteins/genetics , RNA-Binding Proteins/metabolism
11.
BMC Biol ; 17(1): 50, 2019 06 24.
Article in English | MEDLINE | ID: mdl-31234833

ABSTRACT

BACKGROUND: Identification of imprinted genes, demonstrating a consistent preference towards the paternal or maternal allelic expression, is important for the understanding of gene expression regulation during embryonic development and of the molecular basis of developmental disorders with a parent-of-origin effect. Combining allelic analysis of RNA-Seq data with phased genotypes in family trios provides a powerful method to detect parent-of-origin biases in gene expression. RESULTS: We report findings in 296 family trios from two large studies: 165 lymphoblastoid cell lines from the 1000 Genomes Project and 131 blood samples from the Genome of the Netherlands (GoNL) participants. Based on parental haplotypes, we identified > 2.8 million transcribed heterozygous SNVs phased for parental origin and developed a robust statistical framework for measuring allelic expression. We identified a total of 45 imprinted genes and one imprinted unannotated transcript, including multiple imprinted transcripts showing incomplete parental expression bias that was located adjacent to strongly imprinted genes. For example, PXDC1, a gene which lies adjacent to the paternally expressed gene FAM50B, shows a 2:1 paternal expression bias. Other imprinted genes had promoter regions that coincide with sites of parentally biased DNA methylation identified in the blood from uniparental disomy (UPD) samples, thus providing independent validation of our results. Using the stranded nature of the RNA-Seq data in lymphoblastoid cell lines, we identified multiple loci with overlapping sense/antisense transcripts, of which one is expressed paternally and the other maternally. Using a sliding window approach, we searched for imprinted expression across the entire genome, identifying a novel imprinted putative lncRNA in 13q21.2. Overall, we identified 7 transcripts showing parental bias in gene expression which were not reported in 4 other recent RNA-Seq studies of imprinting. CONCLUSIONS: Our methods and data provide a robust and high-resolution map of imprinted gene expression in the human genome.


Subject(s)
Alleles , Gene Expression/genetics , Genomic Imprinting/genetics , Haplotypes/genetics , Blood Chemical Analysis , Cell Line , Humans , Sequence Analysis, RNA
12.
Hum Mutat ; 40(10): 1797-1812, 2019 10.
Article in English | MEDLINE | ID: mdl-31231902

ABSTRACT

Phenotype-based filtering and prioritization contribute to the interpretation of genetic variants detected in exome sequencing. However, it is currently unclear how extensive this phenotypic annotation should be. In this study, we compare methods for incorporating phenotype into the interpretation process and assess the extent to which phenotypic annotation aids prioritization of the correct variant. Using a cohort of 29 patients with congenital myasthenic syndromes with causative variants in known or newly discovered disease genes, exome data and the Human Phenotype Ontology (HPO)-coded phenotypic profiles, we show that gene-list filters created from phenotypic annotations perform similarly to curated disease-gene virtual panels. We use Exomiser, a prioritization tool incorporating phenotypic comparisons, to rank candidate variants while varying phenotypic annotation. Analyzing 3,712 combinations, we show that increasing phenotypic annotation improved prioritization of the causative variant, from 62% ranked first on variant alone to 90% with seven HPO annotations. We conclude that any HPO-based phenotypic annotation aids variant discovery and that annotation with over five terms is recommended in our context. Although focused on a constrained cohort, this provides real-world validation of the utility of phenotypic annotation for variant prioritization. Further research is needed to extend this concept to other diseases and more diverse cohorts.


Subject(s)
Computational Biology/methods , Exome Sequencing , Exome , Molecular Sequence Annotation , Neuromuscular Diseases/diagnosis , Neuromuscular Diseases/genetics , Phenotype , Databases, Genetic , Genetic Association Studies , Genetic Predisposition to Disease , Humans , Rare Diseases/diagnosis , Rare Diseases/genetics , Reproducibility of Results
13.
FASEB J ; 32(3): 1579-1590, 2018 03.
Article in English | MEDLINE | ID: mdl-29141996

ABSTRACT

Adult muscles have a vast adaptation capacity, enabling function switches in response to altered conditions. During intensive physical activity, disease, or aging, adult skeletal muscles change and adjust their functions. The competence to adjust varies among muscles. Muscle-specific molecular mechanisms in healthy and normal conditions could designate changes in physiologic and pathologic conditions. We generated deep mRNA-sequencing data in adult fast and slow mouse muscles, and applying paired analysis, we identified that the muscle-specific signatures are composed of half of the muscle transcriptome. The fast muscles showed a more compact gene network that is concordant with homogenous myofiber typing, compared with the pattern in the slow muscle. The muscle-specific mRNA landscape did not correlate with alternative spicing, alternative polyadenylation, or the expression of muscle transcription factor gene networks. However, we found significant correlation between the differentially expressed noncoding RNAs, microRNAs (miRNAs) and long noncoding RNAs (lncRNAs) and their target genes. More than 25% of the genes expressed in a muscle-specific fashion were found to be targets of muscle-specific miRNAs and lncRNAs. We suggest that muscle-specific miRNAs and lncRNAs contribute to the establishment of muscle-specific transcriptomes in adult muscles.-Raz, V., Riaz, M., Tatum, Z., Kielbasa, S. M., 't Hoen, P. A. C. The distinct transcriptomes of slow and fast adult muscles are delineated by noncoding RNAs.


Subject(s)
Gene Regulatory Networks/physiology , MicroRNAs/biosynthesis , Muscle Fibers, Fast-Twitch/metabolism , Muscle Fibers, Slow-Twitch/metabolism , RNA, Long Noncoding/biosynthesis , Transcriptome/physiology , Animals , Male , Mice , MicroRNAs/genetics , RNA, Long Noncoding/genetics , RNA, Messenger/biosynthesis , RNA, Messenger/genetics
14.
Nature ; 501(7468): 506-11, 2013 Sep 26.
Article in English | MEDLINE | ID: mdl-24037378

ABSTRACT

Genome sequencing projects are discovering millions of genetic variants in humans, and interpretation of their functional effects is essential for understanding the genetic basis of variation in human traits. Here we report sequencing and deep analysis of messenger RNA and microRNA from lymphoblastoid cell lines of 462 individuals from the 1000 Genomes Project--the first uniformly processed high-throughput RNA-sequencing data from multiple human populations with high-quality genome sequences. We discover extremely widespread genetic variation affecting the regulation of most genes, with transcript structure and expression level variation being equally common but genetically largely independent. Our characterization of causal regulatory variation sheds light on the cellular mechanisms of regulatory and loss-of-function variation, and allows us to infer putative causal variants for dozens of disease-associated loci. Altogether, this study provides a deep understanding of the cellular mechanisms of transcriptome variation and of the landscape of functional variants in the human genome.


Subject(s)
Genetic Variation/genetics , Genome, Human/genetics , High-Throughput Nucleotide Sequencing , Sequence Analysis, RNA , Transcriptome/genetics , Alleles , Cell Line, Transformed , Exons/genetics , Gene Expression Profiling , Humans , Polymorphism, Single Nucleotide/genetics , Quantitative Trait Loci/genetics , RNA, Messenger/analysis , RNA, Messenger/genetics
15.
Mol Ther ; 26(1): 132-147, 2018 01 03.
Article in English | MEDLINE | ID: mdl-29103911

ABSTRACT

Duchenne muscular dystrophy (DMD) is a severe, progressive muscle wasting disorder caused by reading frame disrupting mutations in the DMD gene. Exon skipping is a therapeutic approach for DMD. It employs antisense oligonucleotides (AONs) to restore the disrupted open reading frame, allowing the production of shorter, but partly functional dystrophin protein as seen in less severely affected Becker muscular dystrophy patients. To be effective, AONs need to be delivered and effectively taken up by the target cells, which can be accomplished by the conjugation of tissue-homing peptides. We performed phage display screens using a cyclic peptide library combined with next generation sequencing analyses to identify candidate muscle-homing peptides. Conjugation of the lead peptide to 2'-O-methyl phosphorothioate AONs enabled a significant, 2-fold increase in delivery and exon skipping in all analyzed skeletal and cardiac muscle of mdx mice and appeared well tolerated. While selected as a muscle-homing peptide, uptake was increased in liver and kidney as well. The homing capacity of the peptide may have been overruled by the natural biodistribution of the AON. Nonetheless, our results suggest that the identified peptide has the potential to facilitate delivery of AONs and perhaps other compounds to skeletal and cardiac muscle.


Subject(s)
Alternative Splicing , Gene Transfer Techniques , Genetic Therapy , Muscular Dystrophy, Duchenne/genetics , Oligonucleotides, Antisense/genetics , Peptides, Cyclic , Amino Acid Sequence , Animals , Disease Models, Animal , Dystrophin/genetics , Exons , Humans , Mice , Mice, Inbred mdx , Muscular Dystrophy, Duchenne/therapy , Oligonucleotides, Antisense/administration & dosage , Oligonucleotides, Antisense/chemistry , Peptide Library , Peptides, Cyclic/chemistry
16.
Cell Mol Life Sci ; 75(20): 3857-3875, 2018 Oct.
Article in English | MEDLINE | ID: mdl-29808415

ABSTRACT

The release and uptake of nano-sized extracellular vesicles (EV) is a highly conserved means of intercellular communication. The molecular composition of EV, and thereby their signaling function to target cells, is regulated by cellular activation and differentiation stimuli. EV are regarded as snapshots of cells and are, therefore, in the limelight as biomarkers for disease. Although research on EV-associated RNA has predominantly focused on microRNAs, the transcriptome of EV consists of multiple classes of small non-coding RNAs with potential gene-regulatory functions. It is not known whether environmental cues imposed on cells induce specific changes in a broad range of EV-associated RNA classes. Here, we investigated whether immune-activating or -suppressing stimuli imposed on primary dendritic cells affected the release of various small non-coding RNAs via EV. The small RNA transcriptomes of highly pure EV populations free from ribonucleoprotein particles were analyzed by RNA sequencing and RT-qPCR. Immune stimulus-specific changes were found in the miRNA, snoRNA, and Y-RNA content of EV from dendritic cells, whereas tRNA and snRNA levels were much less affected. Only part of the changes in EV-RNA content reflected changes in cellular RNA, which urges caution in interpreting EV as snapshots of cells. By comprehensive analysis of RNA obtained from highly purified EV, we demonstrate that multiple RNA classes contribute to genetic messages conveyed via EV. The identification of multiple RNA classes that display cell stimulation-dependent association with EV is the prelude to unraveling the function and biomarker potential of these EV-RNAs.


Subject(s)
Dendritic Cells/metabolism , Extracellular Vesicles/genetics , Transcriptome , Animals , Bone Marrow Cells/cytology , Cells, Cultured , Cholecalciferol/pharmacology , Dendritic Cells/cytology , Dendritic Cells/drug effects , Extracellular Vesicles/metabolism , Fluorescent Dyes/chemistry , Lipopolysaccharides/pharmacology , Mice , Mice, Inbred C57BL , MicroRNAs/metabolism , Microscopy, Electron , Nanoparticles/chemistry , RNA, Small Nucleolar/metabolism , RNA, Small Untranslated/chemistry , RNA, Small Untranslated/isolation & purification , RNA, Small Untranslated/metabolism , RNA, Transfer/metabolism , Sequence Analysis, RNA
17.
BMC Genomics ; 19(1): 90, 2018 01 25.
Article in English | MEDLINE | ID: mdl-29370748

ABSTRACT

BACKGROUND: SNP panels that uniquely identify an individual are useful for genetic and forensic research. Previously recommended SNP panels are based on DNA profiles and mostly contain intragenic SNPs. With the increasing interest in RNA expression profiles, we aimed for establishing a SNP panel for both DNA and RNA-based genotyping. RESULTS: To determine a small set of SNPs with maximally discriminative power, genotype calls were obtained from DNA and blood-derived RNA sequencing data belonging to healthy, geographically dispersed, Dutch individuals. SNPs were selected based on different criteria like genotype call rate, minor allele frequency, Hardy-Weinberg equilibrium and linkage disequilibrium. A panel of 50 SNPs was sufficient to identify an individual uniquely: the probability of identity was 6.9 × 10- 20 when assuming no family relations and 1.2 × 10- 10 when accounting for the presence of full sibs. The ability of the SNP panel to uniquely identify individuals on DNA and RNA level was validated in an independent population dataset. The panel is applicable to individuals from European descent, with slightly lower power in non-Europeans. Whereas most of the genes containing the 50 SNPs are expressed in various tissues, our SNP panel needs optimization for other tissues than blood. CONCLUSIONS: This first DNA/RNA SNP panel will be useful to identify sample mix-ups in biomedical research and for assigning DNA and RNA stains in crime scenes to unique individuals.


Subject(s)
DNA/analysis , Ethnicity/genetics , Genetics, Population , Patient Identification Systems/methods , Polymorphism, Single Nucleotide , RNA/analysis , DNA/genetics , DNA Fingerprinting , Gene Frequency , Genetic Testing , Genotype , High-Throughput Nucleotide Sequencing , Humans , Individuality , Linkage Disequilibrium , RNA/genetics
18.
Trends Genet ; 31(3): 128-39, 2015 Mar.
Article in English | MEDLINE | ID: mdl-25648499

ABSTRACT

The human transcriptome comprises >80,000 protein-coding transcripts and the estimated number of proteins synthesized from these transcripts is in the range of 250,000 to 1 million. These transcripts and proteins are encoded by less than 20,000 genes, suggesting extensive regulation at the transcriptional, post-transcriptional, and translational level. Here we review how RNA sequencing (RNA-seq) technologies have increased our understanding of the mechanisms that give rise to alternative transcripts and their alternative translation. We highlight four different regulatory processes: alternative transcription initiation, alternative splicing, alternative polyadenylation, and alternative translation initiation. We discuss their transcriptome-wide distribution, their impact on protein expression, their biological relevance, and the possible molecular mechanisms leading to their alternative regulation. We conclude with a discussion of the coordination and the interdependence of these four regulatory layers.


Subject(s)
Alternative Splicing , High-Throughput Nucleotide Sequencing/methods , Protein Biosynthesis , RNA, Messenger/genetics , Transcription, Genetic , Transcriptome , Gene Expression Profiling , Humans , Polyadenylation , Promoter Regions, Genetic/genetics , Sequence Analysis, RNA
19.
Am J Physiol Renal Physiol ; 312(4): F806-F817, 2017 04 01.
Article in English | MEDLINE | ID: mdl-28148532

ABSTRACT

Polycystic kidney disease (PKD) is a major cause of end-stage renal disease. The disease mechanisms are not well understood and the pathogenesis toward renal failure remains elusive. In this study, we present the first RNASeq analysis of a Pkd1-mutant mouse model in a combined meta-analysis with other published PKD expression profiles. We introduce the PKD Signature, a set of 1,515 genes that are commonly dysregulated in PKD studies. We show that the signature genes include many known and novel PKD-related genes and functions. Moreover, genes with a role in injury repair, as evidenced by expression data and/or automated literature analysis, were significantly enriched in the PKD Signature, with 35% of the PKD Signature genes being directly implicated in injury repair. NF-κB signaling, epithelial-mesenchymal transition, inflammatory response, hypoxia, and metabolism were among the most prominent injury or repair-related biological processes with a role in the PKD etiology. Novel PKD genes with a role in PKD and in injury were confirmed in another Pkd1-mutant mouse model as well as in animals treated with a nephrotoxic agent. We propose that compounds that can modulate the injury-repair response could be valuable drug candidates for PKD treatment.


Subject(s)
Acute Kidney Injury/genetics , Kidney/metabolism , Polycystic Kidney, Autosomal Dominant/genetics , Regeneration/genetics , Reperfusion Injury/genetics , Transcriptome , Acute Kidney Injury/metabolism , Acute Kidney Injury/pathology , Animals , Data Mining , Databases, Genetic , Disease Models, Animal , Gene Expression Profiling/methods , Gene Expression Regulation , Genetic Association Studies , Genetic Predisposition to Disease , Kidney/drug effects , Kidney/pathology , Mice, Transgenic , Mutation , Phenotype , Polycystic Kidney, Autosomal Dominant/drug therapy , Polycystic Kidney, Autosomal Dominant/metabolism , Polycystic Kidney, Autosomal Dominant/pathology , Regeneration/drug effects , Reperfusion Injury/metabolism , Reperfusion Injury/pathology , Reproducibility of Results , Signal Transduction , TRPP Cation Channels/genetics
20.
Nucleic Acids Res ; 43(12): e80, 2015 Jul 13.
Article in English | MEDLINE | ID: mdl-25800735

ABSTRACT

Alternative splicing is a powerful mechanism present in eukaryotic cells to obtain a wide range of transcripts and protein isoforms from a relatively small number of genes. The mechanisms regulating (alternative) splicing and the paradigm of consecutive splicing have recently been challenged, especially for genes with a large number of introns. RNA-Seq, a powerful technology using deep sequencing in order to determine transcript structure and expression levels, is usually performed on mature mRNA, therefore not allowing detailed analysis of splicing progression. Sequencing pre-mRNA at different stages of splicing potentially provides insight into mRNA maturation. Although the number of tools that analyze total and cytoplasmic RNA in order to elucidate the transcriptome composition is rapidly growing, there are no tools specifically designed for the analysis of nuclear RNA (which contains mixtures of pre- and mature mRNA). We developed dedicated algorithms to investigate the splicing process. In this paper, we present a new classification of RNA-Seq reads based on three major stages of splicing: pre-, intermediate- and post-splicing. Applying this novel classification we demonstrate the possibility to analyze the order of splicing. Furthermore, we uncover the potential to investigate the multi-step nature of splicing, assessing various types of recursive splicing events. We provide the data that gives biological insight into the order of splicing, show that non-sequential splicing of certain introns is reproducible and coinciding in multiple cell lines. We validated our observations with independent experimental technologies and showed the reliability of our method. The pipeline, named SplicePie, is freely available at: https://github.com/pulyakhina/splicing_analysis_pipeline. The example data can be found at: https://barmsijs.lumc.nl/HG/irina/example_data.tar.gz.


Subject(s)
Alternative Splicing , High-Throughput Nucleotide Sequencing/methods , Sequence Analysis, RNA/methods , Software , Algorithms , Cell Line , Humans , Introns
SELECTION OF CITATIONS
SEARCH DETAIL