RESUMO
The detection of circular RNA molecules (circRNAs) is typically based on short-read RNA sequencing data processed using computational tools. Numerous such tools have been developed, but a systematic comparison with orthogonal validation is missing. Here, we set up a circRNA detection tool benchmarking study, in which 16 tools detected more than 315,000 unique circRNAs in three deeply sequenced human cell types. Next, 1,516 predicted circRNAs were validated using three orthogonal methods. Generally, tool-specific precision is high and similar (median of 98.8%, 96.3% and 95.5% for qPCR, RNase R and amplicon sequencing, respectively) whereas the sensitivity and number of predicted circRNAs (ranging from 1,372 to 58,032) are the most significant differentiators. Of note, precision values are lower when evaluating low-abundance circRNAs. We also show that the tools can be used complementarily to increase detection sensitivity. Finally, we offer recommendations for future circRNA detection and validation.
Assuntos
Benchmarking , RNA Circular , Humanos , RNA Circular/genética , RNA/genética , RNA/metabolismo , Análise de Sequência de RNA/métodosRESUMO
The heterogeneous fiber type composition of skeletal muscle makes it challenging to decipher the molecular signaling events driving the health- and performance benefits of exercise. We developed an optimized workflow for transcriptional profiling of individual human muscle fibers before, immediately after, and after 3 h of recovery from high-intensity interval cycling exercise. From a transcriptional point-of-view, we observe that there is no dichotomy in fiber activation, which could refer to a fiber being recruited or nonrecruited. Rather, the activation pattern displays a continuum with a more uniform response within fast versus slow fibers during the recovery from exercise. The transcriptome-wide response immediately after exercise is characterized by some distinct signatures for slow versus fast fibers, although the most exercise-responsive genes are common between the two fiber types. The temporal transcriptional waves further converge the gene signatures of both fiber types toward a more similar profile during the recovery from exercise. Furthermore, a large heterogeneity among all resting and exercised fibers was observed, with the principal drivers being independent of a slow/fast typology. This profound heterogeneity extends to distinct exercise responses of fibers beyond a classification based on myosin heavy chains. Collectively, our single-fiber methodological approach points to a substantial between-fiber diversity in muscle fiber responses to high-intensity interval exercise.NEW & NOTEWORTHY By development of a single-fiber transcriptomics technology, we assessed the transcriptional events in individual human skeletal muscle fibers upon high-intensity exercise. We demonstrate a large variability in transcriptional activation of fibers, with shared and distinct gene signatures for slow and fast fibers. The heterogeneous fiber-specific exercise response extends beyond this traditional slow/fast categorization. These findings expand on our understanding of exercise responses and uncover a profound between-fiber diversity in muscle fiber activation and transcriptional perturbations.
Assuntos
Exercício Físico , Fibras Musculares de Contração Lenta , Transcriptoma , Humanos , Masculino , Adulto , Exercício Físico/fisiologia , Fibras Musculares de Contração Lenta/metabolismo , Fibras Musculares Esqueléticas/metabolismo , Fibras Musculares de Contração Rápida/metabolismo , Treinamento Intervalado de Alta Intensidade/métodos , Perfilação da Expressão Gênica/métodos , Músculo Esquelético/metabolismo , Músculo Esquelético/fisiologia , Adulto JovemRESUMO
Spheroids are three-dimensional cellular models with widespread basic and translational application across academia and industry. However, methodological transparency and guidelines for spheroid research have not yet been established. The MISpheroID Consortium developed a crowdsourcing knowledgebase that assembles the experimental parameters of 3,058 published spheroid-related experiments. Interrogation of this knowledgebase identified heterogeneity in the methodological setup of spheroids. Empirical evaluation and interlaboratory validation of selected variations in spheroid methodology revealed diverse impacts on spheroid metrics. To facilitate interpretation, stimulate transparency and increase awareness, the Consortium defines the MISpheroID string, a minimum set of experimental parameters required to report spheroid research. Thus, MISpheroID combines a valuable resource and a tool for three-dimensional cellular models to mine experimental parameters and to improve reproducibility.
Assuntos
Biomarcadores Tumorais/genética , Proliferação de Células , Bases de Conhecimento , Neoplasias/patologia , Software , Esferoides Celulares/patologia , Microambiente Tumoral , Técnicas de Cultura de Células/métodos , Regulação Neoplásica da Expressão Gênica , Humanos , Neoplasias/classificação , Neoplasias/metabolismo , RNA-Seq , Reprodutibilidade dos Testes , Esferoides Celulares/imunologia , Esferoides Celulares/metabolismo , Células Tumorais CultivadasRESUMO
The potential of RNA-based liquid biopsy is increasingly being recognized in diffuse large B-cell lymphoma (DLBCL), the most common subtype of non-Hodgkin's lymphoma. This study explores the cell-free transcriptome in a humanized DLBCL patient-derived tumor xenograft (PDTX) model. Blood plasma samples (n = 171) derived from a DLBCL PDTX model, including 27 humanized (HIS) PDTX, 8 HIS non-PDTX, and 21 non-HIS PDTX non-obese diabetic (NOD)-scid IL2Rgnull (NSG) mice were collected during humanization, xenografting, treatment, and sacrifice. The mice were treated with either rituximab, cyclophosphamide, doxorubicin, vincristine, and prednisone (R-CHOP), CD20-targeted human IFNα2-based AcTaferon combined with CHOP (huCD20-Fc-AFN-CHOP), or phosphate-buffered saline (PBS). RNA was extracted using the miRNeasy serum/plasma kit and sequenced on the NovaSeq 6000 platform. RNA sequencing data of the formalin-fixed paraffin-embedded (FFPE) tissue and blood plasma samples of the original patient were included. Flow cytometry was performed on immune cells isolated from whole blood, spleen, and bone marrow. Bulk deconvolution was performed using the Tabula Sapiens v1 basis matrix. Both R-CHOP and huCD20-Fc-AFN-CHOP were able to control tumor growth in most mice. Xenograft tumor volume was strongly associated with circulating tumor RNA (ctRNA) concentration (p < 0.001, R = 0.89), as well as with the number of detected human genes (p < 0.001, R = 0.79). Abundance analysis identified tumor-specific biomarkers that were dynamically tracked during tumor growth or treatment. An 8-gene signature demonstrated high accuracy for assessing therapy response (AUC 0.92). The tumoral gene detectability in the ctRNA of the PDTX-derived plasma was associated with RNA abundance levels in the patient's tumor tissue and blood plasma (p < 0.001), confirming that tumoral gene abundance contributes to the cell-free RNA (cfRNA) profile. Decomposing the transcriptome, however, revealed high inter- and intra-mouse variability, which was lower in the HIS PDTX mice, indicating an impact of human engraftment on the stability and profile of cfRNA. Immunochemotherapy resulted in B cell depletion, and tumor clearance was reflected by a decrease in the fraction of human CD45+ cells. Lastly, bulk deconvolution provided complementary biological insights into the composition of the tumor and circulating immune system. In conclusion, the blood plasma-derived transcriptome serves as a biomarker source in a preclinical PDTX model, enables the assessment of biological pathways, and enhances the understanding of cfRNA dynamics.
Assuntos
Protocolos de Quimioterapia Combinada Antineoplásica , Ciclofosfamida , Linfoma Difuso de Grandes Células B , Camundongos Endogâmicos NOD , Camundongos SCID , Prednisona , Rituximab , Transcriptoma , Vincristina , Ensaios Antitumorais Modelo de Xenoenxerto , Linfoma Difuso de Grandes Células B/genética , Linfoma Difuso de Grandes Células B/patologia , Animais , Humanos , Camundongos , Biópsia Líquida/métodos , Rituximab/uso terapêutico , Prednisona/uso terapêutico , Protocolos de Quimioterapia Combinada Antineoplásica/uso terapêutico , Doxorrubicina , Biomarcadores Tumorais/genética , Masculino , Ácidos Nucleicos Livres/genética , Ácidos Nucleicos Livres/sangue , Feminino , Regulação Neoplásica da Expressão GênicaRESUMO
BACKGROUND: Two opposing B cell subsets have been defined based on their cytokine profile: IL-6 producing effector B cells (B-effs) versus IL-10 producing regulatory B cells (B-regs) that respectively positively or negatively regulate immune responses. B-regs are decreased and/or impaired in many autoimmune diseases and inflammatory conditions. Since there is increasing evidence that links B cells and B cell-rich lymphoid follicles to the pathogenesis of COPD, the aim of this study was to investigate the presence and function of B-regs in COPD. METHODS: First, presence of IL-10 producing regulatory B cells in human lung tissue was determined by immunohistochemistry. Secondly, quantification of IL-10 + B-regs and IL-6 + B-effs in peripheral blood mononuclear cells (PBMCs) from healthy controls, smokers without airflow limitation, and COPD patients (GOLD stage I-IV) was performed by flow cytometry. Thirdly, we exposed blood-derived B cells from COPD patients in vitro to cigarette smoke extract (CSE) and quantified IL-10 + B-regs and IL-6 + B-effs. Furthermore, we aimed at restoring the perturbed IL10 production by blocking BAFF. Fourthly, we determined mRNA expression of transcription factors involved in IL-10 production in FACS sorted memory- and naive B cells upon exposure to medium or CSE. RESULTS: The presence of IL-10 producing regulatory B cells in parenchyma and lymphoid follicles in lungs was confirmed by immunohistochemistry. The percentage of IL-10 + B-regs was significantly decreased in blood-derived memory B cell subsets from smokers without airflow limitation and patients with COPD, compared to never smokers. Furthermore, the capacity of B cells to produce IL-10 was reduced upon in vitro exposure to CSE and this could not be restored by BAFF-blockade. Finally, upon CSE exposure, mRNA levels of the transcription factors IRF4 and HIF-1α, were decreased in memory B cells. CONCLUSION: Decreased numbers and impaired function of B-regs in smokers and patients with COPD might contribute to the initiation and progression of the disease.
Assuntos
Linfócitos B Reguladores , Doença Pulmonar Obstrutiva Crônica , Linfócitos B Reguladores/metabolismo , Humanos , Interleucina-10 , Interleucina-6/metabolismo , Doença Pulmonar Obstrutiva Crônica/metabolismo , RNA Mensageiro , Fumantes , Nicotiana , Fatores de TranscriçãoRESUMO
We assess the performance of mRNA capture sequencing to identify fusion transcripts in FFPE tissue of different sarcoma types, followed by RT-qPCR confirmation. To validate our workflow, six positive control tumors with a specific chromosomal rearrangement were analyzed using the TruSight RNA Pan-Cancer Panel. Fusion transcript calling by FusionCatcher confirmed these aberrations and enabled the identification of both fusion gene partners and breakpoints. Next, whole-transcriptome TruSeq RNA Exome sequencing was applied to 17 fusion gene-negative alveolar rhabdomyosarcoma (ARMS) or undifferentiated round cell sarcoma (URCS) tumors, for whom fluorescence in situ hybridization (FISH) did not identify the classical pathognomonic rearrangements. For six patients, a pathognomonic fusion transcript was readily detected, i.e., PAX3-FOXO1 in two ARMS patients, and EWSR1-FLI1, EWSR1-ERG, or EWSR1-NFATC2 in four URCS patients. For the 11 remaining patients, 11 newly identified fusion transcripts were confirmed by RT-qPCR, including COPS3-TOM1L2, NCOA1-DTNB, WWTR1-LINC01986, PLAA-MOB3B, AP1B1-CHEK2, and BRD4-LEUTX fusion transcripts in ARMS patients. Additionally, recurrently detected secondary fusion transcripts in patients diagnosed with EWSR1-NFATC2-positive sarcoma were confirmed (COPS4-TBC1D9, PICALM-SYTL2, SMG6-VPS53, and UBE2F-ALS2). In conclusion, this study shows that mRNA capture sequencing enhances the detection rate of pathognomonic fusions and enables the identification of novel and secondary fusion transcripts in sarcomas.
Assuntos
Sarcoma , Neoplasias de Tecidos Moles , Complexo 1 de Proteínas Adaptadoras/genética , Subunidades beta do Complexo de Proteínas Adaptadoras , Proteínas de Ciclo Celular/genética , Ácido Ditionitrobenzoico , Humanos , Hibridização in Situ Fluorescente , Proteínas Nucleares/genética , Proteínas de Fusão Oncogênica/genética , RNA , RNA Mensageiro/genética , Reação em Cadeia da Polimerase Via Transcriptase Reversa , Sarcoma/diagnóstico , Sarcoma/genética , Sarcoma/patologia , Neoplasias de Tecidos Moles/patologia , Fatores de Transcrição/genéticaRESUMO
While long non-coding RNA (lncRNA) research in the past has primarily focused on the discovery of novel genes, today it has shifted towards functional annotation of this large class of genes. With thousands of lncRNA studies published every year, the current challenge lies in keeping track of which lncRNAs are functionally described. This is further complicated by the fact that lncRNA nomenclature is not straightforward and lncRNA annotation is scattered across different resources with their own quality metrics and definition of a lncRNA. To overcome this issue, large scale curation and annotation is needed. Here, we present the fifth release of the human lncRNA database LNCipedia (https://lncipedia.org). The most notable improvements include manual literature curation of 2482 lncRNA articles and the use of official gene symbols when available. In addition, an improved filtering pipeline results in a higher quality reference lncRNA gene set.
Assuntos
Biologia Computacional/métodos , Bases de Dados de Ácidos Nucleicos , RNA Longo não Codificante/genética , Genômica/métodos , Humanos , Anotação de Sequência Molecular , NavegadorRESUMO
Single cell RNA sequencing methods have been increasingly used to understand cellular heterogeneity. Nevertheless, most of these methods suffer from one or more limitations, such as focusing only on polyadenylated RNA, sequencing of only the 3' end of the transcript, an exuberant fraction of reads mapping to ribosomal RNA, and the unstranded nature of the sequencing data. Here, we developed a novel single cell strand-specific total RNA library preparation method addressing all the aforementioned shortcomings. Our method was validated on a microfluidics system using three different cancer cell lines undergoing a chemical or genetic perturbation and on two other cancer cell lines sorted in microplates. We demonstrate that our total RNA-seq method detects an equal or higher number of genes compared to classic polyA[+] RNA-seq, including novel and non-polyadenylated genes. The obtained RNA expression patterns also recapitulate the expected biological signal. Inherent to total RNA-seq, our method is also able to detect circular RNAs. Taken together, SMARTer single cell total RNA sequencing is very well suited for any single cell sequencing experiment in which transcript level information is needed beyond polyadenylated genes.
Assuntos
Sequenciamento de Nucleotídeos em Larga Escala/métodos , RNA Circular/análise , RNA Mensageiro/análise , RNA Ribossômico/análise , Análise de Célula Única/métodos , Benchmarking , Linhagem Celular Tumoral , Biblioteca Gênica , Humanos , Técnicas Analíticas Microfluídicas , Poli A/genética , Poli A/metabolismo , RNA Circular/genética , RNA Mensageiro/genética , RNA Ribossômico/genética , Análise de Sequência de RNA/estatística & dados numéricosRESUMO
We argue that the field of extracellular vesicle (EV) biology needs more transparent reporting to facilitate interpretation and replication of experiments. To achieve this, we describe EV-TRACK, a crowdsourcing knowledgebase (http://evtrack.org) that centralizes EV biology and methodology with the goal of stimulating authors, reviewers, editors and funders to put experimental guidelines into practice.
Assuntos
Pesquisa Biomédica , Bases de Dados Bibliográficas , Vesículas Extracelulares/fisiologia , InternacionalidadeRESUMO
In microRNA (miRNA) target prediction, typically two levels of information need to be modeled: the number of potential miRNA binding sites present in a target mRNA and the genomic context of each individual site. Single model structures insufficiently cope with this complex training data structure, consisting of feature vectors of unequal length as a consequence of the varying number of miRNA binding sites in different mRNAs. To circumvent this problem, we developed a two-layered, stacked model, in which the influence of binding site context is separately modeled. Using logistic regression and random forests, we applied the stacked model approach to a unique data set of 7990 probed miRNA-mRNA interactions, hereby including the largest number of miRNAs in model training to date. Compared to lower-complexity models, a particular stacked model, named miSTAR (miRNA stacked model target prediction; www.mi-star.org), displays a higher general performance and precision on top scoring predictions. More importantly, our model outperforms published and widely used miRNA target prediction algorithms. Finally, we highlight flaws in cross-validation schemes for evaluation of miRNA target prediction models and adopt a more fair and stringent approach.
Assuntos
Regiões 3' não Traduzidas , MicroRNAs/metabolismo , Modelos Genéticos , Algoritmos , Sítios de Ligação , Humanos , Aprendizado de Máquina , RNA Mensageiro/metabolismo , SoftwareRESUMO
BACKGROUND: Reconstructing transcript models from RNA-sequencing (RNA-seq) data and establishing these as independent transcriptional units can be a challenging task. Current state-of-the-art tools for long non-coding RNA (lncRNA) annotation are mainly based on evolutionary constraints, which may result in false negatives due to the overall limited conservation of lncRNAs. RESULTS: To tackle this problem we have developed the Zipper plot, a novel visualization and analysis method that enables users to simultaneously interrogate thousands of human putative transcription start sites (TSSs) in relation to various features that are indicative for transcriptional activity. These include publicly available CAGE-sequencing, ChIP-sequencing and DNase-sequencing datasets. Our method only requires three tab-separated fields (chromosome, genomic coordinate of the TSS and strand) as input and generates a report that includes a detailed summary table, a Zipper plot and several statistics derived from this plot. CONCLUSION: Using the Zipper plot, we found evidence of transcription for a set of well-characterized lncRNAs and observed that fewer mono-exonic lncRNAs have CAGE peaks overlapping with their TSSs compared to multi-exonic lncRNAs. Using publicly available RNA-seq data, we found more than one hundred cases where junction reads connected protein-coding gene exons with a downstream mono-exonic lncRNA, revealing the need for a careful evaluation of lncRNA 5'-boundaries. Our method is implemented using the statistical programming language R and is freely available as a webtool.
Assuntos
Genômica/métodos , RNA Longo não Codificante/genética , Análise de Sequência de RNA/métodos , Software , Imunoprecipitação da Cromatina , Humanos , Sítio de Iniciação de TranscriçãoRESUMO
BACKGROUND: The universal qPCR data exchange file format RDML is today well accepted by the scientific community, part of the MIQE guidelines and implemented in many qPCR instruments. With the increased use of RDML new challenges emerge. The flexibility of the RDML format resulted in some implementations that did not meet the expectations of the consortium in the level of support or the use of elements. RESULTS: In the current RDML version 1.2 the description of the elements was sharpened. The open source editor RDML-Ninja was released (http://sourceforge.net/projects/qpcr-ninja/). RDML-Ninja allows to visualize, edit and validate RDML files and thus clarifies the use of RDML elements. Furthermore RDML-Ninja serves as reference implementation for RDML and enables migration between RDML versions independent of the instrument software. The database RDMLdb will serve as an online repository for RDML files and facilitate the exchange of RDML data (http://www.rdmldb.org). Authors can upload their RDML files and reference them in publications by the unique identifier provided by RDMLdb. The MIQE guidelines propose a rich set of information required to document each qPCR run. RDML provides the vehicle to store and maintain this information and current development aims at further integration of MIQE requirements into the RDML format. CONCLUSIONS: The editor RDML-Ninja and the database RDMLdb enable scientists to evaluate and exchange qPCR data in the instrument-independent RDML format. We are confident that this infrastructure will build the foundation for standardized qPCR data exchange among scientists, research groups, and during publication.
Assuntos
Redes de Comunicação de Computadores/normas , Bases de Dados Factuais , Reação em Cadeia da Polimerase/métodos , Software , HumanosRESUMO
In biomedical research, high-throughput screening is often applied as it comes with automatization, higher-efficiency, and more and faster results. High-throughput screening experiments encompass drug, drug combination, genetic perturbagen or a combination of genetic and chemical perturbagen screens. These experiments are conducted in real-time assays over time or in an endpoint assay. The data analysis consists of data cleaning and structuring, as well as further data processing and visualisation, which, due to the amount of data, can easily become laborious, time-consuming and error-prone. Therefore, several tools have been developed to aid researchers in this process, but these typically focus on specific experimental set-ups and are unable to process data of several time points and genetic-chemical perturbagen screens. To meet these needs, we developed HTSplotter, a web tool and Python module that performs automatic data analysis and visualization of visualization of eitherendpoint or real-time assays from different high-throughput screening experiments: drug, drug combination, genetic perturbagen and genetic-chemical perturbagen screens. HTSplotter implements an algorithm based on conditional statements to identify experiment types and controls. After appropriate data normalization, including growth rate normalization, HTSplotter executes downstream analyses such as dose-response relationship and drug synergism assessment by the Bliss independence (BI), Zero Interaction Potency (ZIP) and Highest Single Agent (HSA) methods. All results are exported as a text file and plots are saved in a PDF file. The main advantage of HTSplotter over other available tools is the automatic analysis of genetic-chemical perturbagen screens and real-time assays where growth rate and perturbagen effect results are plotted over time. In conclusion, HTSplotter allows for the automatic end-to-end data processing, analysis and visualisation of various high-throughput in vitro cell culture screens, offering major improvements in terms of versatility, efficiency and time over existing tools.
Assuntos
Algoritmos , Pesquisa Biomédica , Bioensaio , Análise de Dados , Combinação de MedicamentosRESUMO
Introduction: Diffuse large B-cell lymphoma (DLBCL) and primary mediastinal B-cell lymphoma (PMBCL) are aggressive histological subtypes of non-Hodgkin's lymphoma. Improved understanding of the underlying molecular pathogenesis has led to new classification and risk stratification tools, including the development of cell-free biomarkers through liquid biopsies. The goal of this study was to investigate cell-free RNA (cfRNA) biomarkers in DLBCL and PMBCL patients. Materials and methods: Blood plasma samples (n=168) and matched diagnostic formalin-fixed paraffin-embedded (FFPE) tissue samples (n=69) of DLBCL patients, PMBCL patients and healthy controls were collected between 2016-2021. Plasma samples were collected at diagnosis, at interim evaluation, after treatment, and in case of refractory or relapsed disease. RNA was extracted from 200 µl plasma using the miRNeasy serum/plasma kit and from FFPE tissue using the miRNeasy FFPE kit. RNA was subsequently sequenced on a NovaSeq 6000 instrument using the SMARTer Stranded Total RNA-seq pico v3 library preparation kit. Results: Higher cfRNA concentrations were demonstrated in lymphoma patients compared to healthy controls. A large number of differentially abundant genes were identified between the cell-free transcriptomes of DLBCL patients, PMBCL patients, and healthy controls. Overlap analyses with matched FFPE samples showed that blood plasma has a unique transcriptomic profile that significantly differs from that of the tumor tissue. As a good concordance between tissue-derived gene expression and the immunohistochemistry Hans algorithm for cell-of-origin (COO) classification was demonstrated in the FFPE samples, but not in the plasma samples, a 64-gene cfRNA classifier was developed that can accurately determine COO in plasma. High plasma levels of a 9-gene signature (BECN1, PRKCB, COPA, TSC22D3, MAP2K3, UQCRHL, PTMAP4, EHD1, NAP1L1 pseudogene) and a 5-gene signature (FTH1P7, PTMAP4, ATF4, FTH1P8, ARMC7) were significantly associated with inferior progression-free and overall survival in DLBCL patients, respectively, independent of the NCCN-IPI score. Conclusion: Total RNA sequencing of blood plasma samples allows the analysis of the cell-free transcriptome in DLBCL and PMBCL patients and demonstrates its unexplored potential in identifying diagnostic, cell-of-origin, and prognostic cfRNA biomarkers.
RESUMO
Uveal melanoma (UM) is the most common primary intraocular malignancy in adults. Owing to a lack of effective treatments, patients with metastatic disease have a median survival time of 6-12 months. We recently demonstrated that the Survival Associated Mitochondrial Melanoma Specific Oncogenic Non-coding RNA (SAMMSON) is essential for UM cell survival and that antisense oligonucleotide (ASO)-mediated silencing of SAMMSON impaired cell viability and tumor growth in vitro and in vivo. By screening a library of 2911 clinical stage compounds, we identified the mammalian target of rapamycin (mTOR) inhibitor GDC-0349 to synergize with SAMMSON inhibition in UM. Mechanistic studies revealed that mTOR inhibition enhanced uptake and reduced lysosomal accumulation of lipid complexed SAMMSON ASOs, improving SAMMSON knockdown and further decreasing UM cell viability. We found mTOR inhibition to also enhance target knockdown in other cancer cell lines as well as normal cells when combined with lipid nanoparticle complexed or encapsulated ASOs or small interfering RNAs (siRNAs). Our results are relevant to nucleic acid treatment in general and highlight the potential of mTOR inhibition to enhance ASO and siRNA-mediated target knockdown.
Assuntos
Melanoma , Oligonucleotídeos Antissenso , Humanos , Oligonucleotídeos Antissenso/genética , Oligonucleotídeos Antissenso/farmacologia , Oligonucleotídeos Antissenso/uso terapêutico , Linhagem Celular Tumoral , Melanoma/tratamento farmacológico , Melanoma/genética , Serina-Treonina Quinases TOR/genética , Serina-Treonina Quinases TOR/metabolismo , RNA Interferente Pequeno/uso terapêuticoRESUMO
Circular RNA (circRNA) is a class of endogenous non-coding RNA characterized by a back-splice junction (BSJ). In general, large-scale circRNA BSJ detection is performed based on RNA sequencing data, followed by the selection and validation of circRNAs of interest using RT-qPCR with circRNA-specific PCR primers. Such a primer pair is convergent and functional on the circRNA template but divergent and non-functional on the linear host gene. Although a few circRNA primer design pipelines have been published, none of them offer large-scale, easy-to-use circRNA primer design. Other limitations are that these tools generally do not take into account assay specificity, secondary structures, and SNPs in the primer annealing regions. Furthermore, these tools are limited to circRNA primer design for humans (no other organisms possible), and no wet-lab validation is demonstrated. Here, we present CIRCprimerXL, a circRNA RT-qPCR assay design pipeline based on the primer design framework primerXL. CIRCprimerXL takes a circRNA BSJ position as input, and designs BSJ-spanning primers using Primer3. The user can choose to use the unspliced or spliced circRNA sequence as template. Prior to primer design, sequence regions with secondary structures and common SNPs are flagged. Next, the primers are filtered based on predicted specificity and the absence of secondary structures of the amplicon to select a suitable primer pair. Our tool is both available as a user-friendly web tool and as a stand-alone pipeline based on Docker and Nextflow, allowing users to run the pipeline on a wide range of computer infrastructures. The CIRCprimerXL Nextflow pipeline can be used to design circRNA primers for any species by providing the appropriate reference genome. The CIRCprimerXL web tool supports circRNA primer design for human, mouse, rat, zebrafish, Xenopus tropicalis, and C. elegans. The design process can easily be scaled up for the qPCR assay design of tens of thousands of circRNAs within a couple of hours. We show how CIRCprimerXL has been successfully used to design qPCR assays for over 15,000 human circRNAs of which 20 were empirically validated. CIRCprimerXL software, documentation, and test data can be found at: https://github.com/OncoRNALab/CIRCprimerXL. CIRCprimerXL is also implemented as a webtool at: https://circprimerxl.cmgg.be.
RESUMO
Distinguishing circular RNA reads from reads derived from the linear host transcript is a challenging task because of sequence overlap. We developed a computational approach, CiLiQuant, that determines the relative circular and linear abundance of transcripts and gene loci using back-splice and unambiguous forward-splice junction reads generated by existing mapping and circular RNA discovery tools.
RESUMO
Molecular phenotyping through shallow 3'-end RNA-sequencing workflows is increasingly applied in the context of large-scale chemical or genetic perturbation screens to study disease biology or support drug discovery. While these workflows enable accurate quantification of the most abundant genes, they are less effective for applications that require expression profiling of low abundant transcripts, like long noncoding RNAs (lncRNAs), or selected gene panels. To tackle these issues, we describe a workflow combining 3'-end library preparation with 3'-end hybrid capture probes and shallow RNA-sequencing for cost-effective, targeted quantification of subsets of (low abundant) genes across hundreds to thousands of samples. To assess the performance of the method, we designed a capture probe set for more than 100 mRNA and lncRNA target genes and applied the workflow to a cohort of 360 samples. When compared to standard 3'-end RNA-sequencing, 3'-end capture sequencing resulted in a more than 200-fold enrichment of target gene abundance while conserving relative intergene and intersample abundances. 3'-end RNA capture sequencing enables accurate targeted gene expression profiling at extremely shallow sequencing depth.
Assuntos
Perfilação da Expressão Gênica , RNA Longo não Codificante , Perfilação da Expressão Gênica/métodos , Biblioteca Gênica , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Humanos , RNA Longo não Codificante/genética , RNA Mensageiro/genética , Análise de Sequência de RNA/métodosRESUMO
In the past decades, the incidence of esophageal adenocarcinoma has increased dramatically in Western populations. Better understanding of disease etiology along with the identification of novel prognostic and predictive biomarkers are urgently needed to improve the dismal survival probabilities. Here, we performed comprehensive RNA (coding and non-coding) profiling in various samples from 17 patients diagnosed with esophageal adenocarcinoma, high-grade dysplastic or non-dysplastic Barrett's esophagus. Per patient, a blood plasma sample, and a healthy and disease esophageal tissue sample were included. In total, this comprehensive dataset consists of 102 sequenced libraries from 51 samples. Based on this data, 119 expression profiles are available for three biotypes, including miRNA (51), mRNA (51) and circRNA (17). This unique resource allows for discovery of novel biomarkers and disease mechanisms, comparison of tissue and liquid biopsy profiles, integration of coding and non-coding RNA patterns, and can serve as a validation dataset in other RNA landscaping studies. Moreover, structural RNA differences can be identified in this dataset, including protein coding mutations, fusion genes, and circular RNAs.
Assuntos
Adenocarcinoma , Esôfago de Barrett , Neoplasias Esofágicas , MicroRNAs , Adenocarcinoma/sangue , Adenocarcinoma/genética , Esôfago de Barrett/sangue , Esôfago de Barrett/genética , Biomarcadores , Progressão da Doença , Neoplasias Esofágicas/sangue , Neoplasias Esofágicas/genética , Humanos , MicroRNAs/genética , Plasma/metabolismoRESUMO
BACKGROUND: Most ovarian cancer patients are diagnosed at an advanced stage and have a high mortality rate. Current screening strategies fail to improve prognosis because markers that are sensitive for early stage disease are lacking. This medical need justifies the search for novel approaches using utero-tubal lavage as a proximal liquid biopsy. METHODS: In this study, we explore the extracellular transcriptome of utero-tubal lavage fluid obtained from 26 ovarian cancer patients and 48 controls using messenger RNA (mRNA) capture and small RNA sequencing. RESULTS: We observed an enrichment of ovarian and fallopian tube specific messenger RNAs in utero-tubal lavage fluid compared to other human biofluids. Over 300 mRNAs and 41 miRNAs were upregulated in ovarian cancer samples compared with controls. Upregulated genes were enriched for genes involved in cell cycle activation and proliferation, hinting at a tumor-derived signal. CONCLUSION: This is a proof-of-principle that mRNA capture sequencing of utero-tubal lavage fluid is technically feasible, and that the extracellular transcriptome of utero-tubal lavage should be further explored in larger cohorts to assess the diagnostic value of the biomarkers identified in this study. IMPACT: Proximal liquid biopsy from the gynecologic tract is a promising source for mRNA and miRNA biomarkers for diagnosis of early-stage ovarian cancer.