RESUMO
Autophagy supports both cellular and organismal homeostasis. However, whether autophagy should be inhibited or activated for cancer therapy remains unclear. Deletion of essential autophagy genes increased the sensitivity of mouse mammary carcinoma cells to radiation therapy in vitro and in vivo (in immunocompetent syngeneic hosts). Autophagy-deficient cells secreted increased amounts of type I interferon (IFN), which could be limited by CGAS or STING knockdown, mitochondrial DNA depletion or mitochondrial outer membrane permeabilization blockage via BCL2 overexpression or BAX deletion. In vivo, irradiated autophagy-incompetent mammary tumors elicited robust immunity, leading to improved control of distant nonirradiated lesions via systemic type I IFN signaling. Finally, a genetic signature of autophagy had negative prognostic value in patients with breast cancer, inversely correlating with mitochondrial abundance, type I IFN signaling and effector immunity. As clinically useful autophagy inhibitors are elusive, our findings suggest that mitochondrial outer membrane permeabilization may represent a valid target for boosting radiation therapy immunogenicity in patients with breast cancer.
Assuntos
Proteína 5 Relacionada à Autofagia/genética , Proteína 7 Relacionada à Autofagia/genética , Autofagia/genética , Neoplasias da Mama/radioterapia , DNA Mitocondrial/genética , Neoplasias Mamárias Animais/radioterapia , Mitocôndrias/metabolismo , Adulto , Idoso , Animais , Neoplasias da Mama/genética , Neoplasias da Mama/mortalidade , Linhagem Celular Tumoral , Citotoxicidade Imunológica , Feminino , Humanos , Interferon Tipo I/metabolismo , Neoplasias Mamárias Animais/genética , Camundongos , Camundongos Endogâmicos BALB C , Pessoa de Meia-Idade , Prognóstico , Tolerância a Radiação , Transdução de Sinais , Análise de SobrevidaRESUMO
The remarkable success of cancer therapies with immune checkpoint blockers is revolutionizing oncology and has sparked intensive basic and translational research into the mechanisms of cancer-immune cell interactions. In parallel, numerous novel cutting-edge technologies for comprehensive molecular and cellular characterization of cancer immunity have been developed, including single-cell sequencing, mass cytometry and multiplexed spatial cellular phenotyping. In order to process, analyse and visualize multidimensional data sets generated by these technologies, computational methods and software tools are required. Here, we review computational tools for interrogating cancer immunity, discuss advantages and limitations of the various methods and provide guidelines to assist in method selection.
Assuntos
Comunicação Celular , Biologia Computacional , Sequenciamento de Nucleotídeos em Larga Escala , Neoplasias , Análise de Célula Única , Software , Comunicação Celular/genética , Comunicação Celular/imunologia , Humanos , Neoplasias/genética , Neoplasias/imunologiaRESUMO
BACKGROUND: Predictive biomarkers of immune checkpoint inhibitor (ICI) efficacy are currently lacking for non-small cell lung cancer (NSCLC). Here, we describe the results from the Anti-PD-1 Response Prediction DREAM Challenge, a crowdsourced initiative that enabled the assessment of predictive models by using data from two randomized controlled clinical trials (RCTs) of ICIs in first-line metastatic NSCLC. METHODS: Participants developed and trained models using public resources. These were evaluated with data from the CheckMate 026 trial (NCT02041533), according to the model-to-data paradigm to maintain patient confidentiality. The generalizability of the models with the best predictive performance was assessed using data from the CheckMate 227 trial (NCT02477826). Both trials were phase III RCTs with a chemotherapy control arm, which supported the differentiation between predictive and prognostic models. Isolated model containers were evaluated using a bespoke strategy that considered the challenges of handling transcriptome data from clinical trials. RESULTS: A total of 59 teams participated, with 417 models submitted. Multiple predictive models, as opposed to a prognostic model, were generated for predicting overall survival, progression-free survival, and progressive disease status with ICIs. Variables within the models submitted by participants included tumor mutational burden (TMB), programmed death ligand 1 (PD-L1) expression, and gene-expression-based signatures. The best-performing models showed improved predictive power over reference variables, including TMB or PD-L1. CONCLUSIONS: This DREAM Challenge is the first successful attempt to use protected phase III clinical data for a crowdsourced effort towards generating predictive models for ICI clinical outcomes and could serve as a blueprint for similar efforts in other tumor types and disease states, setting a benchmark for future studies aiming to identify biomarkers predictive of ICI efficacy. TRIAL REGISTRATION: CheckMate 026; NCT02041533, registered January 22, 2014. CheckMate 227; NCT02477826, registered June 23, 2015.
Assuntos
Carcinoma Pulmonar de Células não Pequenas , Neoplasias Pulmonares , Humanos , Carcinoma Pulmonar de Células não Pequenas/tratamento farmacológico , Carcinoma Pulmonar de Células não Pequenas/genética , Inibidores de Checkpoint Imunológico/uso terapêutico , Neoplasias Pulmonares/patologia , Antígeno B7-H1 , Biomarcadores TumoraisRESUMO
MOTIVATION: As complex tissues are typically composed of various cell types, deconvolution tools have been developed to computationally infer their cellular composition from bulk RNA sequencing (RNA-seq) data. To comprehensively assess deconvolution performance, gold-standard datasets are indispensable. Gold-standard, experimental techniques like flow cytometry or immunohistochemistry are resource-intensive and cannot be systematically applied to the numerous cell types and tissues profiled with high-throughput transcriptomics. The simulation of 'pseudo-bulk' data, generated by aggregating single-cell RNA-seq expression profiles in pre-defined proportions, offers a scalable and cost-effective alternative. This makes it feasible to create in silico gold standards that allow fine-grained control of cell-type fractions not conceivable in an experimental setup. However, at present, no simulation software for generating pseudo-bulk RNA-seq data exists. RESULTS: We developed SimBu, an R package capable of simulating pseudo-bulk samples based on various simulation scenarios, designed to test specific features of deconvolution methods. A unique feature of SimBu is the modeling of cell-type-specific mRNA bias using experimentally derived or data-driven scaling factors. Here, we show that SimBu can generate realistic pseudo-bulk data, recapitulating the biological and statistical features of real RNA-seq data. Finally, we illustrate the impact of mRNA bias on the evaluation of deconvolution tools and provide recommendations for the selection of suitable methods for estimating mRNA content. SimBu is a user-friendly and flexible tool for simulating realistic pseudo-bulk RNA-seq datasets serving as in silico gold-standard for assessing cell-type deconvolution methods. AVAILABILITY AND IMPLEMENTATION: SimBu is freely available at https://github.com/omnideconv/SimBu as an R package under the GPL-3 license. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Assuntos
Perfilação da Expressão Gênica , RNA , Perfilação da Expressão Gênica/métodos , RNA/genética , RNA Mensageiro , RNA-Seq , Análise de Sequência de RNA/métodosRESUMO
SUMMARY: Somatic mutations and gene fusions can produce immunogenic neoantigens mediating anticancer immune responses. However, their computational prediction from sequencing data requires complex computational workflows to identify tumor-specific aberrations, derive the resulting peptides, infer patients' Human Leukocyte Antigen types and predict neoepitopes binding to them, together with a set of features underlying their immunogenicity. Here, we present nextNEOpi (nextflow NEOantigen prediction pipeline) a comprehensive and fully automated bioinformatic pipeline to predict tumor neoantigens from raw DNA and RNA sequencing data. In addition, nextNEOpi quantifies neoepitope- and patient-specific features associated with tumor immunogenicity and response to immunotherapy. AVAILABILITY AND IMPLEMENTATION: nextNEOpi source code and documentation are available at https://github.com/icbi-lab/nextNEOpi. CONTACT: dietmar.rieder@i-med.ac.at or francesca.finotello@uibk.ac.at. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Assuntos
Neoplasias , Humanos , Neoplasias/genética , Antígenos de Neoplasias/genética , Peptídeos/genética , Análise de Sequência de RNARESUMO
The surface inhibitory receptor NKG2A forms heterodimers with the invariant CD94 chain and is expressed on a subset of activated CD8 T cells. As antibodies to block NKG2A are currently tested in several efficacy trials for different tumor indications, it is important to characterize the NKG2A+ CD8 T cell population in the context of other inhibitory receptors. Here we used a well-controlled culture system to study the kinetics of inhibitory receptor expression. Naïve mouse CD8 T cells were synchronously and repeatedly activated by artificial antigen presenting cells in the presence of the homeostatic cytokine IL-7. The results revealed NKG2A as a late inhibitory receptor, expressed after repeated cognate antigen stimulations. In contrast, the expression of PD-1, TIGIT and LAG-3 was rapidly induced, hours after first contact and subsequently down regulated during each resting phase. This late, but stable expression kinetics of NKG2A was most similar to that of TIM-3 and CD39. Importantly, single-cell transcriptomics of human tumor-infiltrating lymphocytes (TILs) showed indeed that these receptors were often coexpressed by the same CD8 T cell cluster. Furthermore, NKG2A expression was associated with cell division and was promoted by TGF-ß in vitro, although TGF-ß signaling was not necessary in a mouse tumor model in vivo. In summary, our data show that PD-1 reflects recent TCR triggering, but that NKG2A is induced after repeated antigen stimulations and represents a late inhibitory receptor. Together with TIM-3 and CD39, NKG2A might thus mark actively dividing tumor-specific TILs.
Assuntos
Proteínas de Checkpoint Imunológico/fisiologia , Subfamília C de Receptores Semelhantes a Lectina de Células NK/fisiologia , Animais , Antígenos CD/fisiologia , Linfócitos T CD8-Positivos/imunologia , Divisão Celular , Receptor Celular 2 do Vírus da Hepatite A/fisiologia , Humanos , Linfócitos do Interstício Tumoral/imunologia , Camundongos , Camundongos Endogâmicos C57BL , Receptores de Antígenos de Linfócitos T/fisiologia , Receptores Imunológicos/fisiologia , Fator de Crescimento Transformador beta/farmacologia , Microambiente Tumoral , Proteína do Gene 3 de Ativação de LinfócitosRESUMO
Recent breakthroughs in cancer immunotherapy and decreasing costs of high-throughput technologies have sparked intensive research into tumour-immune cell interactions using genomic tools. The wealth of the generated data and the added complexity pose considerable challenges and require computational tools to process, to analyse and to visualize the data. Recently, various tools have been developed and used to mine tumour immunologic and genomic data effectively and to provide novel mechanistic insights. Here, we review computational genomics tools for cancer immunology and provide information on the requirements and functionality in order to assist in the selection of tools and assembly of analytical pipelines.
Assuntos
Comunicação Celular/imunologia , Quimiocinas/imunologia , Biologia Computacional/métodos , Genômica/métodos , Neoplasias/imunologia , Neoplasias/metabolismo , Animais , Humanos , Neoplasias/genéticaRESUMO
SUMMARY: Gene fusions can generate immunogenic neoantigens that mediate anticancer immune responses. However, their computational prediction from RNA sequencing (RNA-seq) data requires deep bioinformatics expertise to assembly a computational workflow covering the prediction of: fusion transcripts, their translated proteins and peptides, Human Leukocyte Antigen (HLA) types, and peptide-HLA binding affinity. Here, we present NeoFuse, a computational pipeline for the prediction of fusion neoantigens from tumor RNA-seq data. NeoFuse can be applied to cancer patients' RNA-seq data to identify fusion neoantigens that might expand the repertoire of suitable targets for immunotherapy. AVAILABILITY AND IMPLEMENTATION: NeoFuse source code and documentation are available under GPLv3 license at https://icbi.i-med.ac.at/NeoFuse/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Assuntos
Antígenos de Neoplasias , RNA , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Análise de Sequência de RNA , Software , Sequenciamento do ExomaRESUMO
SUMMARY: Advances in single-cell technologies have enabled the investigation of T-cell phenotypes and repertoires at unprecedented resolution and scale. Bioinformatic methods for the efficient analysis of these large-scale datasets are instrumental for advancing our understanding of adaptive immune responses. However, while well-established solutions are accessible for the processing of single-cell transcriptomes, no streamlined pipelines are available for the comprehensive characterization of T-cell receptors. Here, we propose single-cell immune repertoires in Python (Scirpy), a scalable Python toolkit that provides simplified access to the analysis and visualization of immune repertoires from single cells and seamless integration with transcriptomic data. AVAILABILITY AND IMPLEMENTATION: Scirpy source code and documentation are available at https://github.com/icbi-lab/scirpy. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Assuntos
Biologia Computacional , Software , Documentação , Receptores de Antígenos de Linfócitos TRESUMO
The human microbiota is a complex ecological community of commensal, symbiotic and pathogenic microorganisms harboured by the human body. Next-generation sequencing (NGS) technologies, in particular targeted amplicon sequencing of the 16S ribosomal RNA gene (16S-seq), are enabling the identification and quantification of human-resident microorganisms at unprecedented resolution, providing novel insights into the role of the microbiota in health and disease. Once microbial abundances are quantified through NGS data analysis, diversity indices provide valuable mathematical tools to describe the ecological complexity of a single sample or to detect species differences between samples. However, diversity is not a determined physical quantity for which a consensus definition and unit of measure have been established, and several diversity indices are currently available. Furthermore, they were originally developed for macroecology and their robustness to the possible bias introduced by sequencing has not been characterized so far. To assist the reader with the selection and interpretation of diversity measures, we review a panel of broadly used indices, describing their mathematical formulations, purposes and properties, and characterize their behaviour and criticalities in dependence of the data features using simulated data as ground truth. In addition, we make available an R package, DiversitySeq, which implements in a unified framework the full panel of diversity indices and a simulator of 16S-seq data, and thus represents a valuable resource for the analysis of diversity from NGS count data and for the benchmarking of computational methods for 16S-seq.
Assuntos
Bactérias/classificação , Bactérias/genética , Biologia Computacional/métodos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Microbiota , RNA Ribossômico 16S/genética , Bactérias/isolamento & purificação , DNA Bacteriano/genética , Humanos , Metagenoma , FilogeniaRESUMO
MOTIVATION: The composition and density of immune cells in the tumor microenvironment (TME) profoundly influence tumor progression and success of anti-cancer therapies. Flow cytometry, immunohistochemistry staining or single-cell sequencing are often unavailable such that we rely on computational methods to estimate the immune-cell composition from bulk RNA-sequencing (RNA-seq) data. Various methods have been proposed recently, yet their capabilities and limitations have not been evaluated systematically. A general guideline leading the research community through cell type deconvolution is missing. RESULTS: We developed a systematic approach for benchmarking such computational methods and assessed the accuracy of tools at estimating nine different immune- and stromal cells from bulk RNA-seq samples. We used a single-cell RNA-seq dataset of â¼11 000 cells from the TME to simulate bulk samples of known cell type proportions, and validated the results using independent, publicly available gold-standard estimates. This allowed us to analyze and condense the results of more than a hundred thousand predictions to provide an exhaustive evaluation across seven computational methods over nine cell types and â¼1800 samples from five simulated and real-world datasets. We demonstrate that computational deconvolution performs at high accuracy for well-defined cell-type signatures and propose how fuzzy cell-type signatures can be improved. We suggest that future efforts should be dedicated to refining cell population definitions and finding reliable signatures. AVAILABILITY AND IMPLEMENTATION: A snakemake pipeline to reproduce the benchmark is available at https://github.com/grst/immune_deconvolution_benchmark. An R package allows the community to perform integrated deconvolution using different methods (https://grst.github.io/immunedeconv). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Assuntos
Neoplasias , Transcriptoma , Citometria de Fluxo , Humanos , RNA , Análise de Sequência de RNA , Microambiente TumoralRESUMO
BACKGROUND: Targeted amplicon sequencing of the 16S ribosomal RNA gene is one of the key tools for studying microbial diversity. The accuracy of this approach strongly depends on the choice of primer pairs and, in particular, on the balance between efficiency, specificity and sensitivity in the amplification of the different bacterial 16S sequences contained in a sample. There is thus the need for computational methods to design optimal bacterial 16S primers able to take into account the knowledge provided by the new sequencing technologies. RESULTS: We propose here a computational method for optimizing the choice of primer sets, based on multi-objective optimization, which simultaneously: 1) maximizes efficiency and specificity of target amplification; 2) maximizes the number of different bacterial 16S sequences matched by at least one primer; 3) minimizes the differences in the number of primers matching each bacterial 16S sequence. Our algorithm can be applied to any desired amplicon length without affecting computational performance. The source code of the developed algorithm is released as the mopo16S software tool (Multi-Objective Primer Optimization for 16S experiments) under the GNU General Public License and is available at http://sysbiobig.dei.unipd.it/?q=Software#mopo16S . CONCLUSIONS: Results show that our strategy is able to find better primer pairs than the ones available in the literature according to all three optimization criteria. We also experimentally validated three of the primer pairs identified by our method on multiple bacterial species, belonging to different genera and phyla. Results confirm the predicted efficiency and the ability to maximize the number of different bacterial 16S sequences matched by primers.
Assuntos
Bactérias/genética , Primers do DNA/normas , Reação em Cadeia da Polimerase/normas , RNA Bacteriano/genética , RNA Ribossômico 16S/genética , Software , Primers do DNA/genéticaRESUMO
By exerting pro- and anti-tumorigenic actions, tumor-infiltrating immune cells can profoundly influence tumor progression, as well as the success of anti-cancer therapies. Therefore, the quantification of tumor-infiltrating immune cells holds the promise to unveil the multi-faceted role of the immune system in human cancers and its involvement in tumor escape mechanisms and response to therapy. Tumor-infiltrating immune cells can be quantified from RNA sequencing data of human tumors using bioinformatics approaches. In this review, we describe state-of-the-art computational methods for the quantification of immune cells from transcriptomics data and discuss the open challenges that must be addressed to accurately quantify immune infiltrates from RNA sequencing data of human bulk tumors.
Assuntos
Linfócitos do Interstício Tumoral/imunologia , Transcriptoma/imunologia , HumanosRESUMO
SUMMARY: Recently, a number of powerful computational tools for dissecting tumor-immune cell interactions from next-generation sequencing data have been developed. However, the assembly of analytical pipelines and execution of multi-step workflows are laborious and involve a large number of intermediate steps with many dependencies and parameter settings. Here we present TIminer, an easy-to-use computational pipeline for mining tumor-immune cell interactions from next-generation sequencing data. TIminer enables integrative immunogenomic analyses, including: human leukocyte antigens typing, neoantigen prediction, characterization of immune infiltrates and quantification of tumor immunogenicity. AVAILABILITY AND IMPLEMENTATION: TIminer is freely available at http://icbi.i-med.ac.at/software/timiner/timiner.shtml. CONTACT: zlatko.trajanoski@i-med.ac.at. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Assuntos
Sequenciamento de Nucleotídeos em Larga Escala , Neoplasias/imunologia , Software , Mineração de Dados , Humanos , Fenômenos Imunogenéticos , Imunoterapia , Neoplasias/genética , Neoplasias/terapia , Fluxo de TrabalhoRESUMO
Genetic variation can modulate gene expression, and thereby phenotypic variation and susceptibility to complex diseases such as type 2 diabetes (T2D). Here we harnessed the potential of DNA and RNA sequencing in human pancreatic islets from 89 deceased donors to identify genes of potential importance in the pathogenesis of T2D. We present a catalog of genetic variants regulating gene expression (eQTL) and exon use (sQTL), including many long noncoding RNAs, which are enriched in known T2D-associated loci. Of 35 eQTL genes, whose expression differed between normoglycemic and hyperglycemic individuals, siRNA of tetraspanin 33 (TSPAN33), 5'-nucleotidase, ecto (NT5E), transmembrane emp24 protein transport domain containing 6 (TMED6), and p21 protein activated kinase 7 (PAK7) in INS1 cells resulted in reduced glucose-stimulated insulin secretion. In addition, we provide a genome-wide catalog of allelic expression imbalance, which is also enriched in known T2D-associated loci. Notably, allelic imbalance in paternally expressed gene 3 (PEG3) was associated with its promoter methylation and T2D status. Finally, RNA editing events were less common in islets than previously suggested in other tissues. Taken together, this study provides new insights into the complexity of gene regulation in human pancreatic islets and better understanding of how genetic variation can influence glucose metabolism.
Assuntos
Genômica , Glucose , Transcriptoma/fisiologia , 5'-Nucleotidase/biossíntese , 5'-Nucleotidase/genética , Linhagem Celular , Diabetes Mellitus Tipo 2/genética , Diabetes Mellitus Tipo 2/metabolismo , Feminino , Proteínas Ligadas por GPI/biossíntese , Proteínas Ligadas por GPI/genética , Glucose/genética , Glucose/metabolismo , Humanos , Ilhotas Pancreáticas , Masculino , Edição de RNA/fisiologia , RNA Longo não Codificante/biossíntese , RNA Longo não Codificante/genética , Tetraspaninas/biossíntese , Tetraspaninas/genética , Proteínas de Transporte Vesicular/biossíntese , Proteínas de Transporte Vesicular/genética , Quinases Ativadas por p21/biossíntese , Quinases Ativadas por p21/genéticaRESUMO
BACKGROUND: Dynamic expression data, nowadays obtained using high-throughput RNA sequencing, are essential to monitor transient gene expression changes and to study the dynamics of their transcriptional activity in the cell or response to stimuli. Several methods for data selection, clustering and functional analysis are available; however, these steps are usually performed independently, without exploiting and integrating the information derived from each step of the analysis. METHODS: Here we present FunPat, an R package for time series RNA sequencing data that integrates gene selection, clustering and functional annotation into a single framework. FunPat exploits functional annotations by performing for each functional term, e.g. a Gene Ontology term, an integrated selection-clustering analysis to select differentially expressed genes that share, besides annotation, a common dynamic expression profile. RESULTS: FunPat performance was assessed on both simulated and real data. With respect to a stand-alone selection step, the integration of the clustering step is able to improve the recall without altering the false discovery rate. FunPat also shows high precision and recall in detecting the correct temporal expression patterns; in particular, the recall is significantly higher than hierarchical, k-means and a model-based clustering approach specifically designed for RNA sequencing data. Moreover, when biological replicates are missing, FunPat is able to provide reproducible lists of significant genes. The application to real time series expression data shows the ability of FunPat to select differentially expressed genes with high reproducibility, indirectly confirming high precision and recall in gene selection. Moreover, the expression patterns obtained as output allow an easy interpretation of the results. CONCLUSIONS: A novel analysis pipeline was developed to search the main temporal patterns in classes of genes similarly annotated, improving the sensitivity of gene selection by integrating the statistical evidence of differential expression with the information on temporal profiles and the functional annotations. Significant genes are associated to both the most informative functional terms, avoiding redundancy of information, and the most representative temporal patterns, thus improving the readability of the results. FunPat package is provided in R/Bioconductor at link: http://sysbiobig.dei.unipd.it/?q=node/79.
Assuntos
Biologia Computacional/métodos , Bases de Dados Genéticas , RNA/química , Interface Usuário-Computador , Análise por Conglomerados , Sequenciamento de Nucleotídeos em Larga Escala , Internet , Análise de Sequência de RNARESUMO
BACKGROUND: In the last decade, Next-Generation Sequencing technologies have been extensively applied to quantitative transcriptomics, making RNA sequencing a valuable alternative to microarrays for measuring and comparing gene transcription levels. Although several methods have been proposed to provide an unbiased estimate of transcript abundances through data normalization, all of them are based on an initial count of the total number of reads mapping on each transcript. This procedure, in principle robust to random noise, is actually error-prone if reads are not uniformly distributed along sequences, as happens indeed due to sequencing errors and ambiguity in read mapping. Here we propose a new approach, called maxcounts, to quantify the expression assigned to an exon as the maximum of its per-base counts, and we assess its performance in comparison with the standard approach described above, which considers the total number of reads aligned to an exon. The two measures are compared using multiple data sets and considering several evaluation criteria: independence from gene-specific covariates, such as exon length and GC-content, accuracy and precision in the quantification of true concentrations and robustness of measurements to variations of alignments quality. RESULTS: Both measures show high accuracy and low dependency on GC-content. However, maxcounts expression quantification is less biased towards long exons with respect to the standard approach. Moreover, it shows lower technical variability at low expressions and is more robust to variations in the quality of alignments. CONCLUSIONS: In summary, we confirm that counts computed with the standard approach depend on the length of the feature they are summarized on, and are sensitive to the non-uniform distribution of reads along transcripts. On the opposite, maxcounts are robust to biases due to the non-uniformity distribution of reads and are characterized by a lower technical variability. Hence, we propose maxcounts as an alternative approach for quantitative RNA-sequencing applications.
Assuntos
Sequenciamento de Nucleotídeos em Larga Escala/métodos , RNA/análise , Análise de Sequência de RNA/métodos , Composição de Bases , Regulação da Expressão Gênica , Humanos , RNA/genética , Alinhamento de SequênciaRESUMO
Next-generation sequencing technologies have fostered an unprecedented proliferation of high-throughput sequencing projects and a concomitant development of novel algorithms for the assembly of short reads. In this context, an important issue is the need of a careful assessment of the accuracy of the assembly process. Here, we review the efficiency of a panel of assemblers, specifically designed to handle data from GS FLX 454 platform, on three bacterial data sets with different characteristics in terms of reads coverage and repeats content. Our aim is to investigate their strengths and weaknesses in the reconstruction of the reference genomes. In our benchmarking, we assess assemblers' performance, quantifying and characterizing assembly gaps and errors, and evaluating their ability to solve complex genomic regions containing repeats. The final goal of this analysis is to highlight pros and cons of each method, in order to provide the final user with general criteria for the right choice of the appropriate assembly strategy, depending on the specific needs. A further aspect we have explored is the relationship between coverage of a sequencing project and quality of the obtained results. The final outcome suggests that, for a good tradeoff between costs and results, the planned genome coverage of an experiment should not exceed 20-30 ×.
Assuntos
Algoritmos , Genoma , Genômica/métodos , Animais , Humanos , Análise de Sequência de DNA/métodosRESUMO
Methods for in silico deconvolution of bulk transcriptomics can characterize the cellular composition of the tumor microenvironment, quantifying the abundance of cell types associated with patients' prognosis and response to therapy. While first-generation deconvolution methods rely on precomputed, transcriptional signatures of a handful of cell types, second-generation methods can be trained with single-cell data to disentangle more fine-grained cell phenotypes and states. These novel approaches can also be applied to spatial transcriptomic data to reveal the spatial organization of tumors. In this review, we describe state-of-the-art deconvolution methods (first-generation, second-generation, and spatial) which can be used to investigate the tumor microenvironment, discussing their strengths and limitations. We conclude with an outlook on the challenges that need to be overcome to unlock the full potential of next-generation deconvolution for oncology and the life sciences.
Assuntos
Transcriptoma , Microambiente Tumoral , Humanos , Perfilação da Expressão Gênica , TecnologiaRESUMO
Summary: Transcriptome deconvolution has emerged as a reliable technique to estimate cell-type abundances from bulk RNA sequencing data. Unlike their human equivalents, methods to quantify the cellular composition of complex tissues from murine transcriptomics are sparse and sometimes not easy to use. We extended the immunedeconv R package to facilitate the deconvolution of mouse transcriptomics, enabling the quantification of murine immune-cell types using 13 different methods. Through immunedeconv, we further offer the possibility of tweaking cell signatures used by deconvolution methods, providing custom annotations tailored for specific cell types and tissues. These developments strongly facilitate the study of the immune-cell composition of mouse models and further open new avenues in the investigation of the cellular composition of other tissues and organisms. Availability and implementation: The R package and the documentation are available at https://github.com/omnideconv/immunedeconv.