Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 56
Filter
Add more filters

Country/Region as subject
Publication year range
1.
Nat Immunol ; 21(10): 1160-1171, 2020 10.
Article in English | MEDLINE | ID: mdl-32747819

ABSTRACT

Autophagy supports both cellular and organismal homeostasis. However, whether autophagy should be inhibited or activated for cancer therapy remains unclear. Deletion of essential autophagy genes increased the sensitivity of mouse mammary carcinoma cells to radiation therapy in vitro and in vivo (in immunocompetent syngeneic hosts). Autophagy-deficient cells secreted increased amounts of type I interferon (IFN), which could be limited by CGAS or STING knockdown, mitochondrial DNA depletion or mitochondrial outer membrane permeabilization blockage via BCL2 overexpression or BAX deletion. In vivo, irradiated autophagy-incompetent mammary tumors elicited robust immunity, leading to improved control of distant nonirradiated lesions via systemic type I IFN signaling. Finally, a genetic signature of autophagy had negative prognostic value in patients with breast cancer, inversely correlating with mitochondrial abundance, type I IFN signaling and effector immunity. As clinically useful autophagy inhibitors are elusive, our findings suggest that mitochondrial outer membrane permeabilization may represent a valid target for boosting radiation therapy immunogenicity in patients with breast cancer.


Subject(s)
Autophagy-Related Protein 5/genetics , Autophagy-Related Protein 7/genetics , Autophagy/genetics , Breast Neoplasms/radiotherapy , DNA, Mitochondrial/genetics , Mammary Neoplasms, Animal/radiotherapy , Mitochondria/metabolism , Adult , Aged , Animals , Breast Neoplasms/genetics , Breast Neoplasms/mortality , Cell Line, Tumor , Cytotoxicity, Immunologic , Female , Humans , Interferon Type I/metabolism , Mammary Neoplasms, Animal/genetics , Mice , Mice, Inbred BALB C , Middle Aged , Prognosis , Radiation Tolerance , Signal Transduction , Survival Analysis
2.
Nat Rev Genet ; 20(12): 724-746, 2019 12.
Article in English | MEDLINE | ID: mdl-31515541

ABSTRACT

The remarkable success of cancer therapies with immune checkpoint blockers is revolutionizing oncology and has sparked intensive basic and translational research into the mechanisms of cancer-immune cell interactions. In parallel, numerous novel cutting-edge technologies for comprehensive molecular and cellular characterization of cancer immunity have been developed, including single-cell sequencing, mass cytometry and multiplexed spatial cellular phenotyping. In order to process, analyse and visualize multidimensional data sets generated by these technologies, computational methods and software tools are required. Here, we review computational tools for interrogating cancer immunity, discuss advantages and limitations of the various methods and provide guidelines to assist in method selection.


Subject(s)
Cell Communication , Computational Biology , High-Throughput Nucleotide Sequencing , Neoplasms , Single-Cell Analysis , Software , Cell Communication/genetics , Cell Communication/immunology , Humans , Neoplasms/genetics , Neoplasms/immunology
3.
J Transl Med ; 22(1): 190, 2024 02 21.
Article in English | MEDLINE | ID: mdl-38383458

ABSTRACT

BACKGROUND: Predictive biomarkers of immune checkpoint inhibitor (ICI) efficacy are currently lacking for non-small cell lung cancer (NSCLC). Here, we describe the results from the Anti-PD-1 Response Prediction DREAM Challenge, a crowdsourced initiative that enabled the assessment of predictive models by using data from two randomized controlled clinical trials (RCTs) of ICIs in first-line metastatic NSCLC. METHODS: Participants developed and trained models using public resources. These were evaluated with data from the CheckMate 026 trial (NCT02041533), according to the model-to-data paradigm to maintain patient confidentiality. The generalizability of the models with the best predictive performance was assessed using data from the CheckMate 227 trial (NCT02477826). Both trials were phase III RCTs with a chemotherapy control arm, which supported the differentiation between predictive and prognostic models. Isolated model containers were evaluated using a bespoke strategy that considered the challenges of handling transcriptome data from clinical trials. RESULTS: A total of 59 teams participated, with 417 models submitted. Multiple predictive models, as opposed to a prognostic model, were generated for predicting overall survival, progression-free survival, and progressive disease status with ICIs. Variables within the models submitted by participants included tumor mutational burden (TMB), programmed death ligand 1 (PD-L1) expression, and gene-expression-based signatures. The best-performing models showed improved predictive power over reference variables, including TMB or PD-L1. CONCLUSIONS: This DREAM Challenge is the first successful attempt to use protected phase III clinical data for a crowdsourced effort towards generating predictive models for ICI clinical outcomes and could serve as a blueprint for similar efforts in other tumor types and disease states, setting a benchmark for future studies aiming to identify biomarkers predictive of ICI efficacy. TRIAL REGISTRATION: CheckMate 026; NCT02041533, registered January 22, 2014. CheckMate 227; NCT02477826, registered June 23, 2015.


Subject(s)
Carcinoma, Non-Small-Cell Lung , Lung Neoplasms , Humans , Carcinoma, Non-Small-Cell Lung/drug therapy , Carcinoma, Non-Small-Cell Lung/genetics , Immune Checkpoint Inhibitors/therapeutic use , Lung Neoplasms/pathology , B7-H1 Antigen , Biomarkers, Tumor
4.
Bioinformatics ; 38(Suppl_2): ii141-ii147, 2022 09 16.
Article in English | MEDLINE | ID: mdl-36124800

ABSTRACT

MOTIVATION: As complex tissues are typically composed of various cell types, deconvolution tools have been developed to computationally infer their cellular composition from bulk RNA sequencing (RNA-seq) data. To comprehensively assess deconvolution performance, gold-standard datasets are indispensable. Gold-standard, experimental techniques like flow cytometry or immunohistochemistry are resource-intensive and cannot be systematically applied to the numerous cell types and tissues profiled with high-throughput transcriptomics. The simulation of 'pseudo-bulk' data, generated by aggregating single-cell RNA-seq expression profiles in pre-defined proportions, offers a scalable and cost-effective alternative. This makes it feasible to create in silico gold standards that allow fine-grained control of cell-type fractions not conceivable in an experimental setup. However, at present, no simulation software for generating pseudo-bulk RNA-seq data exists. RESULTS: We developed SimBu, an R package capable of simulating pseudo-bulk samples based on various simulation scenarios, designed to test specific features of deconvolution methods. A unique feature of SimBu is the modeling of cell-type-specific mRNA bias using experimentally derived or data-driven scaling factors. Here, we show that SimBu can generate realistic pseudo-bulk data, recapitulating the biological and statistical features of real RNA-seq data. Finally, we illustrate the impact of mRNA bias on the evaluation of deconvolution tools and provide recommendations for the selection of suitable methods for estimating mRNA content. SimBu is a user-friendly and flexible tool for simulating realistic pseudo-bulk RNA-seq datasets serving as in silico gold-standard for assessing cell-type deconvolution methods. AVAILABILITY AND IMPLEMENTATION: SimBu is freely available at https://github.com/omnideconv/SimBu as an R package under the GPL-3 license. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Gene Expression Profiling , RNA , Gene Expression Profiling/methods , RNA/genetics , RNA, Messenger , RNA-Seq , Sequence Analysis, RNA/methods
5.
Bioinformatics ; 38(4): 1131-1132, 2022 01 27.
Article in English | MEDLINE | ID: mdl-34788790

ABSTRACT

SUMMARY: Somatic mutations and gene fusions can produce immunogenic neoantigens mediating anticancer immune responses. However, their computational prediction from sequencing data requires complex computational workflows to identify tumor-specific aberrations, derive the resulting peptides, infer patients' Human Leukocyte Antigen types and predict neoepitopes binding to them, together with a set of features underlying their immunogenicity. Here, we present nextNEOpi (nextflow NEOantigen prediction pipeline) a comprehensive and fully automated bioinformatic pipeline to predict tumor neoantigens from raw DNA and RNA sequencing data. In addition, nextNEOpi quantifies neoepitope- and patient-specific features associated with tumor immunogenicity and response to immunotherapy. AVAILABILITY AND IMPLEMENTATION: nextNEOpi source code and documentation are available at https://github.com/icbi-lab/nextNEOpi. CONTACT: dietmar.rieder@i-med.ac.at or francesca.finotello@uibk.ac.at. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Neoplasms , Humans , Neoplasms/genetics , Antigens, Neoplasm/genetics , Peptides/genetics , Sequence Analysis, RNA
6.
Int J Cancer ; 150(4): 688-704, 2022 02 15.
Article in English | MEDLINE | ID: mdl-34716584

ABSTRACT

The surface inhibitory receptor NKG2A forms heterodimers with the invariant CD94 chain and is expressed on a subset of activated CD8 T cells. As antibodies to block NKG2A are currently tested in several efficacy trials for different tumor indications, it is important to characterize the NKG2A+ CD8 T cell population in the context of other inhibitory receptors. Here we used a well-controlled culture system to study the kinetics of inhibitory receptor expression. Naïve mouse CD8 T cells were synchronously and repeatedly activated by artificial antigen presenting cells in the presence of the homeostatic cytokine IL-7. The results revealed NKG2A as a late inhibitory receptor, expressed after repeated cognate antigen stimulations. In contrast, the expression of PD-1, TIGIT and LAG-3 was rapidly induced, hours after first contact and subsequently down regulated during each resting phase. This late, but stable expression kinetics of NKG2A was most similar to that of TIM-3 and CD39. Importantly, single-cell transcriptomics of human tumor-infiltrating lymphocytes (TILs) showed indeed that these receptors were often coexpressed by the same CD8 T cell cluster. Furthermore, NKG2A expression was associated with cell division and was promoted by TGF-ß in vitro, although TGF-ß signaling was not necessary in a mouse tumor model in vivo. In summary, our data show that PD-1 reflects recent TCR triggering, but that NKG2A is induced after repeated antigen stimulations and represents a late inhibitory receptor. Together with TIM-3 and CD39, NKG2A might thus mark actively dividing tumor-specific TILs.


Subject(s)
Immune Checkpoint Proteins/physiology , NK Cell Lectin-Like Receptor Subfamily C/physiology , Animals , Antigens, CD/physiology , CD8-Positive T-Lymphocytes/immunology , Cell Division , Hepatitis A Virus Cellular Receptor 2/physiology , Humans , Lymphocytes, Tumor-Infiltrating/immunology , Mice , Mice, Inbred C57BL , Receptors, Antigen, T-Cell/physiology , Receptors, Immunologic/physiology , Transforming Growth Factor beta/pharmacology , Tumor Microenvironment , Lymphocyte Activation Gene 3 Protein
7.
Nat Rev Genet ; 17(8): 441-58, 2016 07 04.
Article in English | MEDLINE | ID: mdl-27376489

ABSTRACT

Recent breakthroughs in cancer immunotherapy and decreasing costs of high-throughput technologies have sparked intensive research into tumour-immune cell interactions using genomic tools. The wealth of the generated data and the added complexity pose considerable challenges and require computational tools to process, to analyse and to visualize the data. Recently, various tools have been developed and used to mine tumour immunologic and genomic data effectively and to provide novel mechanistic insights. Here, we review computational genomics tools for cancer immunology and provide information on the requirements and functionality in order to assist in the selection of tools and assembly of analytical pipelines.


Subject(s)
Cell Communication/immunology , Chemokines/immunology , Computational Biology/methods , Genomics/methods , Neoplasms/immunology , Neoplasms/metabolism , Animals , Humans , Neoplasms/genetics
8.
Bioinformatics ; 36(7): 2260-2261, 2020 04 01.
Article in English | MEDLINE | ID: mdl-31755900

ABSTRACT

SUMMARY: Gene fusions can generate immunogenic neoantigens that mediate anticancer immune responses. However, their computational prediction from RNA sequencing (RNA-seq) data requires deep bioinformatics expertise to assembly a computational workflow covering the prediction of: fusion transcripts, their translated proteins and peptides, Human Leukocyte Antigen (HLA) types, and peptide-HLA binding affinity. Here, we present NeoFuse, a computational pipeline for the prediction of fusion neoantigens from tumor RNA-seq data. NeoFuse can be applied to cancer patients' RNA-seq data to identify fusion neoantigens that might expand the repertoire of suitable targets for immunotherapy. AVAILABILITY AND IMPLEMENTATION: NeoFuse source code and documentation are available under GPLv3 license at https://icbi.i-med.ac.at/NeoFuse/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Antigens, Neoplasm , RNA , High-Throughput Nucleotide Sequencing , Humans , Sequence Analysis, RNA , Software , Exome Sequencing
9.
Bioinformatics ; 36(18): 4817-4818, 2020 09 15.
Article in English | MEDLINE | ID: mdl-32614448

ABSTRACT

SUMMARY: Advances in single-cell technologies have enabled the investigation of T-cell phenotypes and repertoires at unprecedented resolution and scale. Bioinformatic methods for the efficient analysis of these large-scale datasets are instrumental for advancing our understanding of adaptive immune responses. However, while well-established solutions are accessible for the processing of single-cell transcriptomes, no streamlined pipelines are available for the comprehensive characterization of T-cell receptors. Here, we propose single-cell immune repertoires in Python (Scirpy), a scalable Python toolkit that provides simplified access to the analysis and visualization of immune repertoires from single cells and seamless integration with transcriptomic data. AVAILABILITY AND IMPLEMENTATION: Scirpy source code and documentation are available at https://github.com/icbi-lab/scirpy. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Computational Biology , Software , Documentation , Receptors, Antigen, T-Cell
10.
Brief Bioinform ; 19(4): 679-692, 2018 07 20.
Article in English | MEDLINE | ID: mdl-28025179

ABSTRACT

The human microbiota is a complex ecological community of commensal, symbiotic and pathogenic microorganisms harboured by the human body. Next-generation sequencing (NGS) technologies, in particular targeted amplicon sequencing of the 16S ribosomal RNA gene (16S-seq), are enabling the identification and quantification of human-resident microorganisms at unprecedented resolution, providing novel insights into the role of the microbiota in health and disease. Once microbial abundances are quantified through NGS data analysis, diversity indices provide valuable mathematical tools to describe the ecological complexity of a single sample or to detect species differences between samples. However, diversity is not a determined physical quantity for which a consensus definition and unit of measure have been established, and several diversity indices are currently available. Furthermore, they were originally developed for macroecology and their robustness to the possible bias introduced by sequencing has not been characterized so far. To assist the reader with the selection and interpretation of diversity measures, we review a panel of broadly used indices, describing their mathematical formulations, purposes and properties, and characterize their behaviour and criticalities in dependence of the data features using simulated data as ground truth. In addition, we make available an R package, DiversitySeq, which implements in a unified framework the full panel of diversity indices and a simulator of 16S-seq data, and thus represents a valuable resource for the analysis of diversity from NGS count data and for the benchmarking of computational methods for 16S-seq.


Subject(s)
Bacteria/classification , Bacteria/genetics , Computational Biology/methods , High-Throughput Nucleotide Sequencing/methods , Microbiota , RNA, Ribosomal, 16S/genetics , Bacteria/isolation & purification , DNA, Bacterial/genetics , Humans , Metagenome , Phylogeny
11.
Bioinformatics ; 35(14): i436-i445, 2019 07 15.
Article in English | MEDLINE | ID: mdl-31510660

ABSTRACT

MOTIVATION: The composition and density of immune cells in the tumor microenvironment (TME) profoundly influence tumor progression and success of anti-cancer therapies. Flow cytometry, immunohistochemistry staining or single-cell sequencing are often unavailable such that we rely on computational methods to estimate the immune-cell composition from bulk RNA-sequencing (RNA-seq) data. Various methods have been proposed recently, yet their capabilities and limitations have not been evaluated systematically. A general guideline leading the research community through cell type deconvolution is missing. RESULTS: We developed a systematic approach for benchmarking such computational methods and assessed the accuracy of tools at estimating nine different immune- and stromal cells from bulk RNA-seq samples. We used a single-cell RNA-seq dataset of ∼11 000 cells from the TME to simulate bulk samples of known cell type proportions, and validated the results using independent, publicly available gold-standard estimates. This allowed us to analyze and condense the results of more than a hundred thousand predictions to provide an exhaustive evaluation across seven computational methods over nine cell types and ∼1800 samples from five simulated and real-world datasets. We demonstrate that computational deconvolution performs at high accuracy for well-defined cell-type signatures and propose how fuzzy cell-type signatures can be improved. We suggest that future efforts should be dedicated to refining cell population definitions and finding reliable signatures. AVAILABILITY AND IMPLEMENTATION: A snakemake pipeline to reproduce the benchmark is available at https://github.com/grst/immune_deconvolution_benchmark. An R package allows the community to perform integrated deconvolution using different methods (https://grst.github.io/immunedeconv). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Neoplasms , Transcriptome , Flow Cytometry , Humans , RNA , Sequence Analysis, RNA , Tumor Microenvironment
12.
BMC Bioinformatics ; 19(1): 343, 2018 Sep 29.
Article in English | MEDLINE | ID: mdl-30268091

ABSTRACT

BACKGROUND: Targeted amplicon sequencing of the 16S ribosomal RNA gene is one of the key tools for studying microbial diversity. The accuracy of this approach strongly depends on the choice of primer pairs and, in particular, on the balance between efficiency, specificity and sensitivity in the amplification of the different bacterial 16S sequences contained in a sample. There is thus the need for computational methods to design optimal bacterial 16S primers able to take into account the knowledge provided by the new sequencing technologies. RESULTS: We propose here a computational method for optimizing the choice of primer sets, based on multi-objective optimization, which simultaneously: 1) maximizes efficiency and specificity of target amplification; 2) maximizes the number of different bacterial 16S sequences matched by at least one primer; 3) minimizes the differences in the number of primers matching each bacterial 16S sequence. Our algorithm can be applied to any desired amplicon length without affecting computational performance. The source code of the developed algorithm is released as the mopo16S software tool (Multi-Objective Primer Optimization for 16S experiments) under the GNU General Public License and is available at http://sysbiobig.dei.unipd.it/?q=Software#mopo16S . CONCLUSIONS: Results show that our strategy is able to find better primer pairs than the ones available in the literature according to all three optimization criteria. We also experimentally validated three of the primer pairs identified by our method on multiple bacterial species, belonging to different genera and phyla. Results confirm the predicted efficiency and the ability to maximize the number of different bacterial 16S sequences matched by primers.


Subject(s)
Bacteria/genetics , DNA Primers/standards , Polymerase Chain Reaction/standards , RNA, Bacterial/genetics , RNA, Ribosomal, 16S/genetics , Software , DNA Primers/genetics
13.
Cancer Immunol Immunother ; 67(7): 1031-1040, 2018 07.
Article in English | MEDLINE | ID: mdl-29541787

ABSTRACT

By exerting pro- and anti-tumorigenic actions, tumor-infiltrating immune cells can profoundly influence tumor progression, as well as the success of anti-cancer therapies. Therefore, the quantification of tumor-infiltrating immune cells holds the promise to unveil the multi-faceted role of the immune system in human cancers and its involvement in tumor escape mechanisms and response to therapy. Tumor-infiltrating immune cells can be quantified from RNA sequencing data of human tumors using bioinformatics approaches. In this review, we describe state-of-the-art computational methods for the quantification of immune cells from transcriptomics data and discuss the open challenges that must be addressed to accurately quantify immune infiltrates from RNA sequencing data of human bulk tumors.


Subject(s)
Lymphocytes, Tumor-Infiltrating/immunology , Transcriptome/immunology , Humans
14.
Bioinformatics ; 33(19): 3140-3141, 2017 Oct 01.
Article in English | MEDLINE | ID: mdl-28633385

ABSTRACT

SUMMARY: Recently, a number of powerful computational tools for dissecting tumor-immune cell interactions from next-generation sequencing data have been developed. However, the assembly of analytical pipelines and execution of multi-step workflows are laborious and involve a large number of intermediate steps with many dependencies and parameter settings. Here we present TIminer, an easy-to-use computational pipeline for mining tumor-immune cell interactions from next-generation sequencing data. TIminer enables integrative immunogenomic analyses, including: human leukocyte antigens typing, neoantigen prediction, characterization of immune infiltrates and quantification of tumor immunogenicity. AVAILABILITY AND IMPLEMENTATION: TIminer is freely available at http://icbi.i-med.ac.at/software/timiner/timiner.shtml. CONTACT: zlatko.trajanoski@i-med.ac.at. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
High-Throughput Nucleotide Sequencing , Neoplasms/immunology , Software , Data Mining , Humans , Immunogenetic Phenomena , Immunotherapy , Neoplasms/genetics , Neoplasms/therapy , Workflow
15.
Proc Natl Acad Sci U S A ; 111(38): 13924-9, 2014 Sep 23.
Article in English | MEDLINE | ID: mdl-25201977

ABSTRACT

Genetic variation can modulate gene expression, and thereby phenotypic variation and susceptibility to complex diseases such as type 2 diabetes (T2D). Here we harnessed the potential of DNA and RNA sequencing in human pancreatic islets from 89 deceased donors to identify genes of potential importance in the pathogenesis of T2D. We present a catalog of genetic variants regulating gene expression (eQTL) and exon use (sQTL), including many long noncoding RNAs, which are enriched in known T2D-associated loci. Of 35 eQTL genes, whose expression differed between normoglycemic and hyperglycemic individuals, siRNA of tetraspanin 33 (TSPAN33), 5'-nucleotidase, ecto (NT5E), transmembrane emp24 protein transport domain containing 6 (TMED6), and p21 protein activated kinase 7 (PAK7) in INS1 cells resulted in reduced glucose-stimulated insulin secretion. In addition, we provide a genome-wide catalog of allelic expression imbalance, which is also enriched in known T2D-associated loci. Notably, allelic imbalance in paternally expressed gene 3 (PEG3) was associated with its promoter methylation and T2D status. Finally, RNA editing events were less common in islets than previously suggested in other tissues. Taken together, this study provides new insights into the complexity of gene regulation in human pancreatic islets and better understanding of how genetic variation can influence glucose metabolism.


Subject(s)
Genomics , Glucose , Transcriptome/physiology , 5'-Nucleotidase/biosynthesis , 5'-Nucleotidase/genetics , Cell Line , Diabetes Mellitus, Type 2/genetics , Diabetes Mellitus, Type 2/metabolism , Female , GPI-Linked Proteins/biosynthesis , GPI-Linked Proteins/genetics , Glucose/genetics , Glucose/metabolism , Humans , Islets of Langerhans , Male , RNA Editing/physiology , RNA, Long Noncoding/biosynthesis , RNA, Long Noncoding/genetics , Tetraspanins/biosynthesis , Tetraspanins/genetics , Vesicular Transport Proteins/biosynthesis , Vesicular Transport Proteins/genetics , p21-Activated Kinases/biosynthesis , p21-Activated Kinases/genetics
16.
BMC Genomics ; 16: S2, 2015.
Article in English | MEDLINE | ID: mdl-26046293

ABSTRACT

BACKGROUND: Dynamic expression data, nowadays obtained using high-throughput RNA sequencing, are essential to monitor transient gene expression changes and to study the dynamics of their transcriptional activity in the cell or response to stimuli. Several methods for data selection, clustering and functional analysis are available; however, these steps are usually performed independently, without exploiting and integrating the information derived from each step of the analysis. METHODS: Here we present FunPat, an R package for time series RNA sequencing data that integrates gene selection, clustering and functional annotation into a single framework. FunPat exploits functional annotations by performing for each functional term, e.g. a Gene Ontology term, an integrated selection-clustering analysis to select differentially expressed genes that share, besides annotation, a common dynamic expression profile. RESULTS: FunPat performance was assessed on both simulated and real data. With respect to a stand-alone selection step, the integration of the clustering step is able to improve the recall without altering the false discovery rate. FunPat also shows high precision and recall in detecting the correct temporal expression patterns; in particular, the recall is significantly higher than hierarchical, k-means and a model-based clustering approach specifically designed for RNA sequencing data. Moreover, when biological replicates are missing, FunPat is able to provide reproducible lists of significant genes. The application to real time series expression data shows the ability of FunPat to select differentially expressed genes with high reproducibility, indirectly confirming high precision and recall in gene selection. Moreover, the expression patterns obtained as output allow an easy interpretation of the results. CONCLUSIONS: A novel analysis pipeline was developed to search the main temporal patterns in classes of genes similarly annotated, improving the sensitivity of gene selection by integrating the statistical evidence of differential expression with the information on temporal profiles and the functional annotations. Significant genes are associated to both the most informative functional terms, avoiding redundancy of information, and the most representative temporal patterns, thus improving the readability of the results. FunPat package is provided in R/Bioconductor at link: http://sysbiobig.dei.unipd.it/?q=node/79.


Subject(s)
Computational Biology/methods , Databases, Genetic , RNA/chemistry , User-Computer Interface , Cluster Analysis , High-Throughput Nucleotide Sequencing , Internet , Sequence Analysis, RNA
17.
BMC Bioinformatics ; 15 Suppl 1: S7, 2014.
Article in English | MEDLINE | ID: mdl-24564404

ABSTRACT

BACKGROUND: In the last decade, Next-Generation Sequencing technologies have been extensively applied to quantitative transcriptomics, making RNA sequencing a valuable alternative to microarrays for measuring and comparing gene transcription levels. Although several methods have been proposed to provide an unbiased estimate of transcript abundances through data normalization, all of them are based on an initial count of the total number of reads mapping on each transcript. This procedure, in principle robust to random noise, is actually error-prone if reads are not uniformly distributed along sequences, as happens indeed due to sequencing errors and ambiguity in read mapping. Here we propose a new approach, called maxcounts, to quantify the expression assigned to an exon as the maximum of its per-base counts, and we assess its performance in comparison with the standard approach described above, which considers the total number of reads aligned to an exon. The two measures are compared using multiple data sets and considering several evaluation criteria: independence from gene-specific covariates, such as exon length and GC-content, accuracy and precision in the quantification of true concentrations and robustness of measurements to variations of alignments quality. RESULTS: Both measures show high accuracy and low dependency on GC-content. However, maxcounts expression quantification is less biased towards long exons with respect to the standard approach. Moreover, it shows lower technical variability at low expressions and is more robust to variations in the quality of alignments. CONCLUSIONS: In summary, we confirm that counts computed with the standard approach depend on the length of the feature they are summarized on, and are sensitive to the non-uniform distribution of reads along transcripts. On the opposite, maxcounts are robust to biases due to the non-uniformity distribution of reads and are characterized by a lower technical variability. Hence, we propose maxcounts as an alternative approach for quantitative RNA-sequencing applications.


Subject(s)
High-Throughput Nucleotide Sequencing/methods , RNA/analysis , Sequence Analysis, RNA/methods , Base Composition , Gene Expression Regulation , Humans , RNA/genetics , Sequence Alignment
18.
Brief Bioinform ; 13(3): 269-80, 2012 May.
Article in English | MEDLINE | ID: mdl-22021898

ABSTRACT

Next-generation sequencing technologies have fostered an unprecedented proliferation of high-throughput sequencing projects and a concomitant development of novel algorithms for the assembly of short reads. In this context, an important issue is the need of a careful assessment of the accuracy of the assembly process. Here, we review the efficiency of a panel of assemblers, specifically designed to handle data from GS FLX 454 platform, on three bacterial data sets with different characteristics in terms of reads coverage and repeats content. Our aim is to investigate their strengths and weaknesses in the reconstruction of the reference genomes. In our benchmarking, we assess assemblers' performance, quantifying and characterizing assembly gaps and errors, and evaluating their ability to solve complex genomic regions containing repeats. The final goal of this analysis is to highlight pros and cons of each method, in order to provide the final user with general criteria for the right choice of the appropriate assembly strategy, depending on the specific needs. A further aspect we have explored is the relationship between coverage of a sequencing project and quality of the obtained results. The final outcome suggests that, for a good tradeoff between costs and results, the planned genome coverage of an experiment should not exceed 20-30 ×.


Subject(s)
Algorithms , Genome , Genomics/methods , Animals , Humans , Sequence Analysis, DNA/methods
19.
Int Rev Cell Mol Biol ; 382: 103-143, 2024.
Article in English | MEDLINE | ID: mdl-38225101

ABSTRACT

Methods for in silico deconvolution of bulk transcriptomics can characterize the cellular composition of the tumor microenvironment, quantifying the abundance of cell types associated with patients' prognosis and response to therapy. While first-generation deconvolution methods rely on precomputed, transcriptional signatures of a handful of cell types, second-generation methods can be trained with single-cell data to disentangle more fine-grained cell phenotypes and states. These novel approaches can also be applied to spatial transcriptomic data to reveal the spatial organization of tumors. In this review, we describe state-of-the-art deconvolution methods (first-generation, second-generation, and spatial) which can be used to investigate the tumor microenvironment, discussing their strengths and limitations. We conclude with an outlook on the challenges that need to be overcome to unlock the full potential of next-generation deconvolution for oncology and the life sciences.


Subject(s)
Transcriptome , Tumor Microenvironment , Humans , Gene Expression Profiling , Technology
20.
Bioinform Adv ; 4(1): vbae032, 2024.
Article in English | MEDLINE | ID: mdl-38464974

ABSTRACT

Summary: Transcriptome deconvolution has emerged as a reliable technique to estimate cell-type abundances from bulk RNA sequencing data. Unlike their human equivalents, methods to quantify the cellular composition of complex tissues from murine transcriptomics are sparse and sometimes not easy to use. We extended the immunedeconv R package to facilitate the deconvolution of mouse transcriptomics, enabling the quantification of murine immune-cell types using 13 different methods. Through immunedeconv, we further offer the possibility of tweaking cell signatures used by deconvolution methods, providing custom annotations tailored for specific cell types and tissues. These developments strongly facilitate the study of the immune-cell composition of mouse models and further open new avenues in the investigation of the cellular composition of other tissues and organisms. Availability and implementation: The R package and the documentation are available at https://github.com/omnideconv/immunedeconv.

SELECTION OF CITATIONS
SEARCH DETAIL