Results 1 - 20 of 56
1.
Nat Immunol ; 21(10): 1160-1171, 2020 10.
Article in English | MEDLINE | ID: mdl-32747819

ABSTRACT

Autophagy supports both cellular and organismal homeostasis. However, whether autophagy should be inhibited or activated for cancer therapy remains unclear. Deletion of essential autophagy genes increased the sensitivity of mouse mammary carcinoma cells to radiation therapy in vitro and in vivo (in immunocompetent syngeneic hosts). Autophagy-deficient cells secreted increased amounts of type I interferon (IFN), which could be limited by CGAS or STING knockdown, mitochondrial DNA depletion or mitochondrial outer membrane permeabilization blockage via BCL2 overexpression or BAX deletion. In vivo, irradiated autophagy-incompetent mammary tumors elicited robust immunity, leading to improved control of distant nonirradiated lesions via systemic type I IFN signaling. Finally, a genetic signature of autophagy had negative prognostic value in patients with breast cancer, inversely correlating with mitochondrial abundance, type I IFN signaling and effector immunity. As clinically useful autophagy inhibitors are elusive, our findings suggest that mitochondrial outer membrane permeabilization may represent a valid target for boosting radiation therapy immunogenicity in patients with breast cancer.


Subject(s)
Autophagy-Related Protein 5/genetics , Autophagy-Related Protein 7/genetics , Autophagy/genetics , Breast Neoplasms/radiotherapy , DNA, Mitochondrial/genetics , Mammary Neoplasms, Animal/radiotherapy , Mitochondria/metabolism , Adult , Aged , Animals , Breast Neoplasms/genetics , Breast Neoplasms/mortality , Cell Line, Tumor , Cytotoxicity, Immunologic , Female , Humans , Interferon Type I/metabolism , Mammary Neoplasms, Animal/genetics , Mice , Mice, Inbred BALB C , Middle Aged , Prognosis , Radiation Tolerance , Signal Transduction , Survival Analysis
2.
Nat Rev Genet ; 20(12): 724-746, 2019 12.
Article in English | MEDLINE | ID: mdl-31515541

ABSTRACT

The remarkable success of cancer therapies with immune checkpoint blockers is revolutionizing oncology and has sparked intensive basic and translational research into the mechanisms of cancer-immune cell interactions. In parallel, numerous novel cutting-edge technologies for comprehensive molecular and cellular characterization of cancer immunity have been developed, including single-cell sequencing, mass cytometry and multiplexed spatial cellular phenotyping. In order to process, analyse and visualize multidimensional data sets generated by these technologies, computational methods and software tools are required. Here, we review computational tools for interrogating cancer immunity, discuss advantages and limitations of the various methods and provide guidelines to assist in method selection.


Subject(s)
Cell Communication , Computational Biology , High-Throughput Nucleotide Sequencing , Neoplasms , Single-Cell Analysis , Software , Cell Communication/genetics , Cell Communication/immunology , Humans , Neoplasms/genetics , Neoplasms/immunology
3.
J Transl Med ; 22(1): 190, 2024 02 21.
Article in English | MEDLINE | ID: mdl-38383458

ABSTRACT

BACKGROUND: Predictive biomarkers of immune checkpoint inhibitor (ICI) efficacy are currently lacking for non-small cell lung cancer (NSCLC). Here, we describe the results from the Anti-PD-1 Response Prediction DREAM Challenge, a crowdsourced initiative that enabled the assessment of predictive models by using data from two randomized controlled clinical trials (RCTs) of ICIs in first-line metastatic NSCLC. METHODS: Participants developed and trained models using public resources. These were evaluated with data from the CheckMate 026 trial (NCT02041533), according to the model-to-data paradigm to maintain patient confidentiality. The generalizability of the models with the best predictive performance was assessed using data from the CheckMate 227 trial (NCT02477826). Both trials were phase III RCTs with a chemotherapy control arm, which supported the differentiation between predictive and prognostic models. Isolated model containers were evaluated using a bespoke strategy that considered the challenges of handling transcriptome data from clinical trials. RESULTS: A total of 59 teams participated, with 417 models submitted. Multiple predictive models, as opposed to a prognostic model, were generated for predicting overall survival, progression-free survival, and progressive disease status with ICIs. Variables within the models submitted by participants included tumor mutational burden (TMB), programmed death ligand 1 (PD-L1) expression, and gene-expression-based signatures. The best-performing models showed improved predictive power over reference variables, including TMB or PD-L1. CONCLUSIONS: This DREAM Challenge is the first successful attempt to use protected phase III clinical data for a crowdsourced effort towards generating predictive models for ICI clinical outcomes and could serve as a blueprint for similar efforts in other tumor types and disease states, setting a benchmark for future studies aiming to identify biomarkers predictive of ICI efficacy. 
TRIAL REGISTRATION: CheckMate 026; NCT02041533, registered January 22, 2014. CheckMate 227; NCT02477826, registered June 23, 2015.


Subject(s)
Carcinoma, Non-Small-Cell Lung , Lung Neoplasms , Humans , Carcinoma, Non-Small-Cell Lung/drug therapy , Carcinoma, Non-Small-Cell Lung/genetics , Immune Checkpoint Inhibitors/therapeutic use , Lung Neoplasms/pathology , B7-H1 Antigen , Biomarkers, Tumor
4.
Bioinformatics ; 38(Suppl_2): ii141-ii147, 2022 09 16.
Article in English | MEDLINE | ID: mdl-36124800

ABSTRACT

MOTIVATION: As complex tissues are typically composed of various cell types, deconvolution tools have been developed to computationally infer their cellular composition from bulk RNA sequencing (RNA-seq) data. To comprehensively assess deconvolution performance, gold-standard datasets are indispensable. Gold-standard, experimental techniques like flow cytometry or immunohistochemistry are resource-intensive and cannot be systematically applied to the numerous cell types and tissues profiled with high-throughput transcriptomics. The simulation of 'pseudo-bulk' data, generated by aggregating single-cell RNA-seq expression profiles in pre-defined proportions, offers a scalable and cost-effective alternative. This makes it feasible to create in silico gold standards that allow fine-grained control of cell-type fractions not conceivable in an experimental setup. However, at present, no simulation software for generating pseudo-bulk RNA-seq data exists. RESULTS: We developed SimBu, an R package capable of simulating pseudo-bulk samples based on various simulation scenarios, designed to test specific features of deconvolution methods. A unique feature of SimBu is the modeling of cell-type-specific mRNA bias using experimentally derived or data-driven scaling factors. Here, we show that SimBu can generate realistic pseudo-bulk data, recapitulating the biological and statistical features of real RNA-seq data. Finally, we illustrate the impact of mRNA bias on the evaluation of deconvolution tools and provide recommendations for the selection of suitable methods for estimating mRNA content. SimBu is a user-friendly and flexible tool for simulating realistic pseudo-bulk RNA-seq datasets serving as in silico gold-standard for assessing cell-type deconvolution methods. AVAILABILITY AND IMPLEMENTATION: SimBu is freely available at https://github.com/omnideconv/SimBu as an R package under the GPL-3 license. 
SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
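
The pseudo-bulk idea described in this abstract (aggregating single-cell profiles in pre-defined proportions, optionally rescaled by a cell-type-specific mRNA factor) can be sketched in a few lines. This is a minimal illustration in plain Python, not SimBu's R interface; the function name, argument names and toy data are assumptions made for the example.

```python
import random


def simulate_pseudo_bulk(profiles, cell_types, fractions, n_cells,
                         scaling=None, seed=0):
    """Sum sampled single-cell profiles into one pseudo-bulk expression vector.

    profiles   : list of per-cell expression vectors (one list of counts per cell)
    cell_types : cell-type label of each cell in `profiles`
    fractions  : dict, cell type -> desired fraction of the simulated sample
    n_cells    : total number of cells to aggregate
    scaling    : optional dict, cell type -> mRNA scaling factor (default 1.0)
    """
    rng = random.Random(seed)
    scaling = scaling or {}
    n_genes = len(profiles[0])
    bulk = [0.0] * n_genes
    for cell_type, fraction in fractions.items():
        # Indices of all cells annotated with this cell type.
        pool = [i for i, label in enumerate(cell_types) if label == cell_type]
        if not pool:
            raise ValueError(f"no cells of type {cell_type!r}")
        factor = scaling.get(cell_type, 1.0)
        # Draw cells with replacement so any fraction can be reached.
        for _ in range(round(fraction * n_cells)):
            cell = profiles[rng.choice(pool)]
            for gene in range(n_genes):
                bulk[gene] += factor * cell[gene]
    return bulk


# Toy data (hypothetical): two genes, two T cells and two B cells.
profiles = [[10, 0], [10, 0], [0, 5], [0, 5]]
cell_types = ["T", "T", "B", "B"]
fractions = {"T": 0.5, "B": 0.5}

unscaled = simulate_pseudo_bulk(profiles, cell_types, fractions, n_cells=4)
scaled = simulate_pseudo_bulk(profiles, cell_types, fractions, n_cells=4,
                              scaling={"B": 2.0})
```

Because the true fractions are chosen by the caller, the simulated sample can serve as ground truth when scoring a deconvolution method.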


Subject(s)
Gene Expression Profiling , RNA , Gene Expression Profiling/methods , RNA/genetics , RNA, Messenger , RNA-Seq , Sequence Analysis, RNA/methods
5.
Bioinformatics ; 38(4): 1131-1132, 2022 01 27.
Article in English | MEDLINE | ID: mdl-34788790

ABSTRACT

SUMMARY: Somatic mutations and gene fusions can produce immunogenic neoantigens mediating anticancer immune responses. However, their computational prediction from sequencing data requires complex computational workflows to identify tumor-specific aberrations, derive the resulting peptides, infer patients' Human Leukocyte Antigen types and predict neoepitopes binding to them, together with a set of features underlying their immunogenicity. Here, we present nextNEOpi (nextflow NEOantigen prediction pipeline) a comprehensive and fully automated bioinformatic pipeline to predict tumor neoantigens from raw DNA and RNA sequencing data. In addition, nextNEOpi quantifies neoepitope- and patient-specific features associated with tumor immunogenicity and response to immunotherapy. AVAILABILITY AND IMPLEMENTATION: nextNEOpi source code and documentation are available at https://github.com/icbi-lab/nextNEOpi. CONTACT: dietmar.rieder@i-med.ac.at or francesca.finotello@uibk.ac.at. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
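
One step of the workflow summarized above, deriving the candidate peptides that result from a tumor-specific aberration, can be illustrated for the simplest case of a single amino-acid substitution. The sketch below is not taken from nextNEOpi; the function name, the 0-based position convention, the default peptide length of 9 and the toy protein sequence are all assumptions for illustration.

```python
def mutant_peptides(protein, position, new_residue, length=9):
    """Return every `length`-mer of the mutated protein that contains the
    substituted residue.

    protein     : wild-type amino-acid sequence (string)
    position    : 0-based index of the substituted residue
    new_residue : single-letter code of the mutant amino acid
    """
    mutated = protein[:position] + new_residue + protein[position + 1:]
    # A window starting at s covers `position` when s <= position < s + length.
    first = max(0, position - length + 1)
    last = min(position, len(mutated) - length)
    return [mutated[s:s + length] for s in range(first, last + 1)]


# Toy example (hypothetical sequence): substitute M at index 10 with W.
peptides = mutant_peptides("ACDEFGHIKLMNPQRSTVWY", 10, "W")
```

In a full pipeline each candidate peptide would then be passed to an HLA-binding predictor together with the patient's HLA types; that step is outside the scope of this sketch.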


Subject(s)
Neoplasms , Humans , Neoplasms/genetics , Antigens, Neoplasm/genetics , Peptides/genetics , Sequence Analysis, RNA
6.
Int J Cancer ; 150(4): 688-704, 2022 02 15.
Article in English | MEDLINE | ID: mdl-34716584

ABSTRACT

The surface inhibitory receptor NKG2A forms heterodimers with the invariant CD94 chain and is expressed on a subset of activated CD8 T cells. As antibodies to block NKG2A are currently being tested in several efficacy trials for different tumor indications, it is important to characterize the NKG2A+ CD8 T cell population in the context of other inhibitory receptors. Here we used a well-controlled culture system to study the kinetics of inhibitory receptor expression. Naïve mouse CD8 T cells were synchronously and repeatedly activated by artificial antigen-presenting cells in the presence of the homeostatic cytokine IL-7. The results revealed NKG2A as a late inhibitory receptor, expressed after repeated cognate antigen stimulations. In contrast, the expression of PD-1, TIGIT and LAG-3 was rapidly induced, hours after first contact, and subsequently downregulated during each resting phase. The late but stable expression kinetics of NKG2A were most similar to those of TIM-3 and CD39. Importantly, single-cell transcriptomics of human tumor-infiltrating lymphocytes (TILs) indeed showed that these receptors were often coexpressed by the same CD8 T cell cluster. Furthermore, NKG2A expression was associated with cell division and was promoted by TGF-β in vitro, although TGF-β signaling was not necessary in a mouse tumor model in vivo. In summary, our data show that PD-1 reflects recent TCR triggering, but that NKG2A is induced after repeated antigen stimulations and represents a late inhibitory receptor. Together with TIM-3 and CD39, NKG2A might thus mark actively dividing tumor-specific TILs.


Subject(s)
Immune Checkpoint Proteins/physiology , NK Cell Lectin-Like Receptor Subfamily C/physiology , Animals , Antigens, CD/physiology , CD8-Positive T-Lymphocytes/immunology , Cell Division , Hepatitis A Virus Cellular Receptor 2/physiology , Humans , Lymphocytes, Tumor-Infiltrating/immunology , Mice , Mice, Inbred C57BL , Receptors, Antigen, T-Cell/physiology , Receptors, Immunologic/physiology , Transforming Growth Factor beta/pharmacology , Tumor Microenvironment , Lymphocyte Activation Gene 3 Protein
7.
Nat Rev Genet ; 17(8): 441-58, 2016 07 04.
Article in English | MEDLINE | ID: mdl-27376489

ABSTRACT

Recent breakthroughs in cancer immunotherapy and decreasing costs of high-throughput technologies have sparked intensive research into tumour-immune cell interactions using genomic tools. The wealth of the generated data and the added complexity pose considerable challenges and require computational tools to process, to analyse and to visualize the data. Recently, various tools have been developed and used to mine tumour immunologic and genomic data effectively and to provide novel mechanistic insights. Here, we review computational genomics tools for cancer immunology and provide information on the requirements and functionality in order to assist in the selection of tools and assembly of analytical pipelines.


Subject(s)
Cell Communication/immunology , Chemokines/immunology , Computational Biology/methods , Genomics/methods , Neoplasms/immunology , Neoplasms/metabolism , Animals , Humans , Neoplasms/genetics
8.
Bioinformatics ; 36(7): 2260-2261, 2020 04 01.
Article in English | MEDLINE | ID: mdl-31755900

ABSTRACT

SUMMARY: Gene fusions can generate immunogenic neoantigens that mediate anticancer immune responses. However, their computational prediction from RNA sequencing (RNA-seq) data requires deep bioinformatics expertise to assemble a computational workflow covering the prediction of fusion transcripts, their translated proteins and peptides, Human Leukocyte Antigen (HLA) types, and peptide-HLA binding affinity. Here, we present NeoFuse, a computational pipeline for the prediction of fusion neoantigens from tumor RNA-seq data. NeoFuse can be applied to cancer patients' RNA-seq data to identify fusion neoantigens that might expand the repertoire of suitable targets for immunotherapy. AVAILABILITY AND IMPLEMENTATION: NeoFuse source code and documentation are available under GPLv3 license at https://icbi.i-med.ac.at/NeoFuse/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Antigens, Neoplasm , RNA , High-Throughput Nucleotide Sequencing , Humans , Sequence Analysis, RNA , Software , Exome Sequencing
9.
Bioinformatics ; 36(18): 4817-4818, 2020 09 15.
Article in English | MEDLINE | ID: mdl-32614448

ABSTRACT

SUMMARY: Advances in single-cell technologies have enabled the investigation of T-cell phenotypes and repertoires at unprecedented resolution and scale. Bioinformatic methods for the efficient analysis of these large-scale datasets are instrumental for advancing our understanding of adaptive immune responses. However, while well-established solutions are accessible for the processing of single-cell transcriptomes, no streamlined pipelines are available for the comprehensive characterization of T-cell receptors. Here, we propose single-cell immune repertoires in Python (Scirpy), a scalable Python toolkit that provides simplified access to the analysis and visualization of immune repertoires from single cells and seamless integration with transcriptomic data. AVAILABILITY AND IMPLEMENTATION: Scirpy source code and documentation are available at https://github.com/icbi-lab/scirpy. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Computational Biology , Software , Documentation , Receptors, Antigen, T-Cell
10.
Brief Bioinform ; 19(4): 679-692, 2018 07 20.
Article in English | MEDLINE | ID: mdl-28025179

ABSTRACT

The human microbiota is a complex ecological community of commensal, symbiotic and pathogenic microorganisms harboured by the human body. Next-generation sequencing (NGS) technologies, in particular targeted amplicon sequencing of the 16S ribosomal RNA gene (16S-seq), are enabling the identification and quantification of human-resident microorganisms at unprecedented resolution, providing novel insights into the role of the microbiota in health and disease. Once microbial abundances are quantified through NGS data analysis, diversity indices provide valuable mathematical tools to describe the ecological complexity of a single sample or to detect species differences between samples. However, diversity is not a determined physical quantity for which a consensus definition and unit of measure have been established, and several diversity indices are currently available. Furthermore, they were originally developed for macroecology and their robustness to the possible bias introduced by sequencing has not been characterized so far. To assist the reader with the selection and interpretation of diversity measures, we review a panel of broadly used indices, describing their mathematical formulations, purposes and properties, and characterize their behaviour and criticalities in dependence of the data features using simulated data as ground truth. In addition, we make available an R package, DiversitySeq, which implements in a unified framework the full panel of diversity indices and a simulator of 16S-seq data, and thus represents a valuable resource for the analysis of diversity from NGS count data and for the benchmarking of computational methods for 16S-seq.
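
To make the notion of a diversity index concrete, the sketch below computes three widely used within-sample measures from a vector of taxon counts: richness, the Shannon index and the Gini-Simpson index. The abstract does not list which indices the review covers, so this selection is an assumption; the code is plain Python and is not the DiversitySeq R interface.

```python
import math


def richness(counts):
    """Number of taxa observed at least once."""
    return sum(1 for c in counts if c > 0)


def shannon_index(counts):
    """Shannon index H = -sum(p_i * ln p_i) over taxa with non-zero counts."""
    total = sum(counts)
    return -sum((c / total) * math.log(c / total) for c in counts if c > 0)


def gini_simpson_index(counts):
    """Gini-Simpson index 1 - sum(p_i^2): the probability that two reads
    drawn at random (with replacement) belong to different taxa."""
    total = sum(counts)
    return 1.0 - sum((c / total) ** 2 for c in counts)


# Toy count vectors (hypothetical): a perfectly even and a single-taxon sample.
even_sample = [25, 25, 25, 25]
dominated_sample = [100, 0, 0, 0]
```

For the even sample the Shannon index equals ln 4 and the Gini-Simpson index equals 0.75; for the single-taxon sample both are zero. The different scales of these values show why the choice of index matters.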


Subject(s)
Bacteria/classification , Bacteria/genetics , Computational Biology/methods , High-Throughput Nucleotide Sequencing/methods , Microbiota , RNA, Ribosomal, 16S/genetics , Bacteria/isolation & purification , DNA, Bacterial/genetics , Humans , Metagenome , Phylogeny
11.
Bioinformatics ; 35(14): i436-i445, 2019 07 15.
Article in English | MEDLINE | ID: mdl-31510660

ABSTRACT

MOTIVATION: The composition and density of immune cells in the tumor microenvironment (TME) profoundly influence tumor progression and success of anti-cancer therapies. Flow cytometry, immunohistochemistry staining or single-cell sequencing are often unavailable such that we rely on computational methods to estimate the immune-cell composition from bulk RNA-sequencing (RNA-seq) data. Various methods have been proposed recently, yet their capabilities and limitations have not been evaluated systematically. A general guideline leading the research community through cell type deconvolution is missing. RESULTS: We developed a systematic approach for benchmarking such computational methods and assessed the accuracy of tools at estimating nine different immune- and stromal cells from bulk RNA-seq samples. We used a single-cell RNA-seq dataset of ∼11 000 cells from the TME to simulate bulk samples of known cell type proportions, and validated the results using independent, publicly available gold-standard estimates. This allowed us to analyze and condense the results of more than a hundred thousand predictions to provide an exhaustive evaluation across seven computational methods over nine cell types and ∼1800 samples from five simulated and real-world datasets. We demonstrate that computational deconvolution performs at high accuracy for well-defined cell-type signatures and propose how fuzzy cell-type signatures can be improved. We suggest that future efforts should be dedicated to refining cell population definitions and finding reliable signatures. AVAILABILITY AND IMPLEMENTATION: A snakemake pipeline to reproduce the benchmark is available at https://github.com/grst/immune_deconvolution_benchmark. An R package allows the community to perform integrated deconvolution using different methods (https://grst.github.io/immunedeconv). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
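
Benchmarking against simulated bulk samples comes down to comparing estimated cell-type fractions with the known ones, one cell type at a time. The abstract does not name the accuracy metrics used, so the sketch below assumes two common choices, Pearson correlation and root-mean-square error; the function names and toy values are hypothetical and the code is not part of the published snakemake pipeline.

```python
import math


def pearson(x, y):
    """Pearson correlation of two equal-length sequences.
    Undefined (division by zero) if either sequence is constant."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y)


def rmse(x, y):
    """Root-mean-square error between two equal-length sequences."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)) / len(x))


def benchmark(true_fractions, estimated_fractions):
    """Score each cell type across samples.

    Both arguments map cell type -> list of fractions, one per sample.
    """
    return {
        cell_type: {
            "pearson": pearson(true_fractions[cell_type],
                               estimated_fractions[cell_type]),
            "rmse": rmse(true_fractions[cell_type],
                         estimated_fractions[cell_type]),
        }
        for cell_type in true_fractions
    }


# Toy example (hypothetical): estimates are exactly twice the true fractions.
scores = benchmark({"T cell": [0.1, 0.2, 0.3]}, {"T cell": [0.2, 0.4, 0.6]})
```

In this toy case the correlation is perfect while the error is large. A method can therefore rank samples correctly and still misestimate absolute fractions, so it is useful to report both kinds of metric.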


Subject(s)
Neoplasms , Transcriptome , Flow Cytometry , Humans , RNA , Sequence Analysis, RNA , Tumor Microenvironment
12.
BMC Bioinformatics ; 19(1): 343, 2018 Sep 29.
Article in English | MEDLINE | ID: mdl-30268091

ABSTRACT

BACKGROUND: Targeted amplicon sequencing of the 16S ribosomal RNA gene is one of the key tools for studying microbial diversity. The accuracy of this approach strongly depends on the choice of primer pairs and, in particular, on the balance between efficiency, specificity and sensitivity in the amplification of the different bacterial 16S sequences contained in a sample. There is thus the need for computational methods to design optimal bacterial 16S primers able to take into account the knowledge provided by the new sequencing technologies. RESULTS: We propose here a computational method for optimizing the choice of primer sets, based on multi-objective optimization, which simultaneously: 1) maximizes efficiency and specificity of target amplification; 2) maximizes the number of different bacterial 16S sequences matched by at least one primer; 3) minimizes the differences in the number of primers matching each bacterial 16S sequence. Our algorithm can be applied to any desired amplicon length without affecting computational performance. The source code of the developed algorithm is released as the mopo16S software tool (Multi-Objective Primer Optimization for 16S experiments) under the GNU General Public License and is available at http://sysbiobig.dei.unipd.it/?q=Software#mopo16S . CONCLUSIONS: Results show that our strategy is able to find better primer pairs than the ones available in the literature according to all three optimization criteria. We also experimentally validated three of the primer pairs identified by our method on multiple bacterial species, belonging to different genera and phyla. Results confirm the predicted efficiency and the ability to maximize the number of different bacterial 16S sequences matched by primers.
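
With three objectives optimized at once, there is usually no single best primer set but a set of non-dominated trade-offs. The sketch below shows the standard Pareto-dominance filter on candidate primer sets that have already been scored. It is not the mopo16S algorithm, whose search strategy the abstract does not describe; the candidate names and scores are hypothetical, and the third objective is negated so that all three are maximized.

```python
def dominates(a, b):
    """True if score tuple `a` is at least as good as `b` on every objective
    and strictly better on at least one (all objectives are maximized)."""
    return (all(x >= y for x, y in zip(a, b))
            and any(x > y for x, y in zip(a, b)))


def pareto_front(candidates):
    """Return the sorted names of candidates not dominated by any other.

    candidates : dict, name -> tuple of objective scores
    """
    return sorted(
        name
        for name, score in candidates.items()
        if not any(dominates(other, score)
                   for other_name, other in candidates.items()
                   if other_name != name)
    )


# Hypothetical scores: (efficiency, sequences covered, -coverage variability).
candidates = {
    "set_A": (0.9, 100, -5),
    "set_B": (0.8, 90, -6),   # worse than set_A on every objective
    "set_C": (0.7, 120, -4),  # less efficient, but covers more sequences
}
front = pareto_front(candidates)
```

Here set_B is discarded because set_A beats it on all three objectives, while set_A and set_C remain as alternative trade-offs.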


Subject(s)
Bacteria/genetics , DNA Primers/standards , Polymerase Chain Reaction/standards , RNA, Bacterial/genetics , RNA, Ribosomal, 16S/genetics , Software , DNA Primers/genetics
13.
Cancer Immunol Immunother ; 67(7): 1031-1040, 2018 07.
Article in English | MEDLINE | ID: mdl-29541787

ABSTRACT

By exerting pro- and anti-tumorigenic actions, tumor-infiltrating immune cells can profoundly influence tumor progression, as well as the success of anti-cancer therapies. Therefore, the quantification of tumor-infiltrating immune cells holds the promise to unveil the multi-faceted role of the immune system in human cancers and its involvement in tumor escape mechanisms and response to therapy. Tumor-infiltrating immune cells can be quantified from RNA sequencing data of human tumors using bioinformatics approaches. In this review, we describe state-of-the-art computational methods for the quantification of immune cells from transcriptomics data and discuss the open challenges that must be addressed to accurately quantify immune infiltrates from RNA sequencing data of human bulk tumors.


Subject(s)
Lymphocytes, Tumor-Infiltrating/immunology , Transcriptome/immunology , Humans
14.
Bioinformatics ; 33(19): 3140-3141, 2017 Oct 01.
Article in English | MEDLINE | ID: mdl-28633385

ABSTRACT

SUMMARY: Recently, a number of powerful computational tools for dissecting tumor-immune cell interactions from next-generation sequencing data have been developed. However, the assembly of analytical pipelines and execution of multi-step workflows are laborious and involve a large number of intermediate steps with many dependencies and parameter settings. Here we present TIminer, an easy-to-use computational pipeline for mining tumor-immune cell interactions from next-generation sequencing data. TIminer enables integrative immunogenomic analyses, including: human leukocyte antigens typing, neoantigen prediction, characterization of immune infiltrates and quantification of tumor immunogenicity. AVAILABILITY AND IMPLEMENTATION: TIminer is freely available at http://icbi.i-med.ac.at/software/timiner/timiner.shtml. CONTACT: zlatko.trajanoski@i-med.ac.at. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
High-Throughput Nucleotide Sequencing , Neoplasms/immunology , Software , Data Mining , Humans , Immunogenetic Phenomena , Immunotherapy , Neoplasms/genetics , Neoplasms/therapy , Workflow
15.
Proc Natl Acad Sci U S A ; 111(38): 13924-9, 2014 Sep 23.
Article in English | MEDLINE | ID: mdl-25201977

ABSTRACT

Genetic variation can modulate gene expression, and thereby phenotypic variation and susceptibility to complex diseases such as type 2 diabetes (T2D). Here we harnessed the potential of DNA and RNA sequencing in human pancreatic islets from 89 deceased donors to identify genes of potential importance in the pathogenesis of T2D. We present a catalog of genetic variants regulating gene expression (eQTL) and exon use (sQTL), including many long noncoding RNAs, which are enriched in known T2D-associated loci. Of 35 eQTL genes, whose expression differed between normoglycemic and hyperglycemic individuals, siRNA of tetraspanin 33 (TSPAN33), 5'-nucleotidase, ecto (NT5E), transmembrane emp24 protein transport domain containing 6 (TMED6), and p21 protein activated kinase 7 (PAK7) in INS1 cells resulted in reduced glucose-stimulated insulin secretion. In addition, we provide a genome-wide catalog of allelic expression imbalance, which is also enriched in known T2D-associated loci. Notably, allelic imbalance in paternally expressed gene 3 (PEG3) was associated with its promoter methylation and T2D status. Finally, RNA editing events were less common in islets than previously suggested in other tissues. Taken together, this study provides new insights into the complexity of gene regulation in human pancreatic islets and better understanding of how genetic variation can influence glucose metabolism.


Subject(s)
Genomics , Glucose , Transcriptome/physiology , 5'-Nucleotidase/biosynthesis , 5'-Nucleotidase/genetics , Cell Line , Diabetes Mellitus, Type 2/genetics , Diabetes Mellitus, Type 2/metabolism , Female , GPI-Linked Proteins/biosynthesis , GPI-Linked Proteins/genetics , Glucose/genetics , Glucose/metabolism , Humans , Islets of Langerhans , Male , RNA Editing/physiology , RNA, Long Noncoding/biosynthesis , RNA, Long Noncoding/genetics , Tetraspanins/biosynthesis , Tetraspanins/genetics , Vesicular Transport Proteins/biosynthesis , Vesicular Transport Proteins/genetics , p21-Activated Kinases/biosynthesis , p21-Activated Kinases/genetics
16.
BMC Genomics ; 16: S2, 2015.
Article in English | MEDLINE | ID: mdl-26046293

ABSTRACT

BACKGROUND: Dynamic expression data, nowadays obtained using high-throughput RNA sequencing, are essential to monitor transient gene expression changes and to study the dynamics of their transcriptional activity in the cell or response to stimuli. Several methods for data selection, clustering and functional analysis are available; however, these steps are usually performed independently, without exploiting and integrating the information derived from each step of the analysis. METHODS: Here we present FunPat, an R package for time series RNA sequencing data that integrates gene selection, clustering and functional annotation into a single framework. FunPat exploits functional annotations by performing for each functional term, e.g. a Gene Ontology term, an integrated selection-clustering analysis to select differentially expressed genes that share, besides annotation, a common dynamic expression profile. RESULTS: FunPat performance was assessed on both simulated and real data. With respect to a stand-alone selection step, the integration of the clustering step is able to improve the recall without altering the false discovery rate. FunPat also shows high precision and recall in detecting the correct temporal expression patterns; in particular, the recall is significantly higher than hierarchical, k-means and a model-based clustering approach specifically designed for RNA sequencing data. Moreover, when biological replicates are missing, FunPat is able to provide reproducible lists of significant genes. The application to real time series expression data shows the ability of FunPat to select differentially expressed genes with high reproducibility, indirectly confirming high precision and recall in gene selection. Moreover, the expression patterns obtained as output allow an easy interpretation of the results. 
CONCLUSIONS: A novel analysis pipeline was developed to search the main temporal patterns in classes of genes similarly annotated, improving the sensitivity of gene selection by integrating the statistical evidence of differential expression with the information on temporal profiles and the functional annotations. Significant genes are associated to both the most informative functional terms, avoiding redundancy of information, and the most representative temporal patterns, thus improving the readability of the results. FunPat package is provided in R/Bioconductor at link: http://sysbiobig.dei.unipd.it/?q=node/79.


Subject(s)
Computational Biology/methods , Databases, Genetic , RNA/chemistry , User-Computer Interface , Cluster Analysis , High-Throughput Nucleotide Sequencing , Internet , Sequence Analysis, RNA
17.
BMC Bioinformatics ; 15 Suppl 1: S7, 2014.
Article in English | MEDLINE | ID: mdl-24564404

ABSTRACT

BACKGROUND: In the last decade, Next-Generation Sequencing technologies have been extensively applied to quantitative transcriptomics, making RNA sequencing a valuable alternative to microarrays for measuring and comparing gene transcription levels. Although several methods have been proposed to provide an unbiased estimate of transcript abundances through data normalization, all of them are based on an initial count of the total number of reads mapping to each transcript. This procedure, in principle robust to random noise, is actually error-prone if reads are not uniformly distributed along sequences, as indeed happens because of sequencing errors and ambiguity in read mapping. Here we propose a new approach, called maxcounts, which quantifies the expression assigned to an exon as the maximum of its per-base counts, and we assess its performance in comparison with the standard approach described above, which considers the total number of reads aligned to an exon. The two measures are compared using multiple data sets and considering several evaluation criteria: independence from gene-specific covariates, such as exon length and GC-content; accuracy and precision in the quantification of true concentrations; and robustness of measurements to variations in alignment quality. RESULTS: Both measures show high accuracy and low dependency on GC-content. However, maxcounts expression quantification is less biased towards long exons than the standard approach. Moreover, it shows lower technical variability at low expression levels and is more robust to variations in the quality of alignments. CONCLUSIONS: In summary, we confirm that counts computed with the standard approach depend on the length of the feature they are summarized on, and are sensitive to the non-uniform distribution of reads along transcripts. Conversely, maxcounts are robust to biases due to the non-uniform distribution of reads and are characterized by a lower technical variability. Hence, we propose maxcounts as an alternative approach for quantitative RNA-sequencing applications.
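
The difference between the two measures compared in this abstract can be shown on a toy exon: the standard count is the number of reads overlapping the exon, whereas maxcounts is the maximum of the per-base coverage. The sketch below follows that definition from the abstract; the function names, the half-open coordinate convention and the read positions are assumptions, and it is not the authors' implementation.

```python
def per_base_coverage(exon_start, exon_end, reads):
    """Coverage at each base of the exon [exon_start, exon_end).

    reads : list of (start, end) alignments, also half-open intervals
    """
    coverage = [0] * (exon_end - exon_start)
    for read_start, read_end in reads:
        # Clip the read to the exon before counting covered bases.
        for pos in range(max(read_start, exon_start), min(read_end, exon_end)):
            coverage[pos - exon_start] += 1
    return coverage


def total_counts(exon_start, exon_end, reads):
    """Standard measure: number of reads overlapping the exon."""
    return sum(1 for start, end in reads
               if start < exon_end and end > exon_start)


def maxcounts(exon_start, exon_end, reads):
    """maxcounts measure: maximum of the exon's per-base coverage."""
    return max(per_base_coverage(exon_start, exon_end, reads), default=0)


# Toy example (hypothetical): a 10-base exon and four aligned reads.
reads = [(0, 4), (2, 6), (3, 7), (8, 12)]
standard = total_counts(0, 10, reads)
maximum = maxcounts(0, 10, reads)
```

In this example four reads overlap the exon but at most three of them cover the same base. A longer exon can collect more overlapping reads without any increase in peak coverage, which is consistent with the reduced length bias reported for maxcounts.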


Subject(s)
High-Throughput Nucleotide Sequencing/methods , RNA/analysis , Sequence Analysis, RNA/methods , Base Composition , Gene Expression Regulation , Humans , RNA/genetics , Sequence Alignment
18.
Brief Bioinform ; 13(3): 269-80, 2012 May.
Article in English | MEDLINE | ID: mdl-22021898

ABSTRACT

Next-generation sequencing technologies have fostered an unprecedented proliferation of high-throughput sequencing projects and a concomitant development of novel algorithms for the assembly of short reads. In this context, an important issue is the need for a careful assessment of the accuracy of the assembly process. Here, we review the efficiency of a panel of assemblers, specifically designed to handle data from the GS FLX 454 platform, on three bacterial data sets with different characteristics in terms of read coverage and repeat content. Our aim is to investigate their strengths and weaknesses in the reconstruction of the reference genomes. In our benchmarking, we assess assemblers' performance, quantifying and characterizing assembly gaps and errors, and evaluating their ability to resolve complex genomic regions containing repeats. The final goal of this analysis is to highlight the pros and cons of each method, in order to provide the end user with general criteria for choosing the appropriate assembly strategy, depending on the specific needs. A further aspect we have explored is the relationship between the coverage of a sequencing project and the quality of the obtained results. The final outcome suggests that, for a good tradeoff between costs and results, the planned genome coverage of an experiment should not exceed 20-30×.
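
The coverage recommendation at the end of this abstract translates directly into a number of reads once a read length and genome size are fixed, since average coverage is the total number of sequenced bases divided by the genome length. The sketch below applies that relation; the 5 Mb genome and 400 bp mean read length are illustrative assumptions, not values taken from the study.

```python
import math


def expected_coverage(n_reads, read_length, genome_size):
    """Average genome coverage: total sequenced bases / genome length."""
    return n_reads * read_length / genome_size


def reads_needed(target_coverage, read_length, genome_size):
    """Smallest number of reads that reaches the target average coverage."""
    return math.ceil(target_coverage * genome_size / read_length)


# Hypothetical planning example: 25x coverage of a 5 Mb bacterial genome
# with reads of 400 bp on average.
n_reads = reads_needed(25, 400, 5_000_000)
coverage = expected_coverage(n_reads, 400, 5_000_000)
```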


Subject(s)
Algorithms , Genome , Genomics/methods , Animals , Humans , Sequence Analysis, DNA/methods
19.
Int Rev Cell Mol Biol ; 382: 103-143, 2024.
Article in English | MEDLINE | ID: mdl-38225101

ABSTRACT

Methods for in silico deconvolution of bulk transcriptomics can characterize the cellular composition of the tumor microenvironment, quantifying the abundance of cell types associated with patients' prognosis and response to therapy. While first-generation deconvolution methods rely on precomputed, transcriptional signatures of a handful of cell types, second-generation methods can be trained with single-cell data to disentangle more fine-grained cell phenotypes and states. These novel approaches can also be applied to spatial transcriptomic data to reveal the spatial organization of tumors. In this review, we describe state-of-the-art deconvolution methods (first-generation, second-generation, and spatial) which can be used to investigate the tumor microenvironment, discussing their strengths and limitations. We conclude with an outlook on the challenges that need to be overcome to unlock the full potential of next-generation deconvolution for oncology and the life sciences.


Subject(s)
Transcriptome , Tumor Microenvironment , Humans , Gene Expression Profiling , Technology
20.
Bioinform Adv ; 4(1): vbae032, 2024.
Article in English | MEDLINE | ID: mdl-38464974

ABSTRACT

Summary: Transcriptome deconvolution has emerged as a reliable technique to estimate cell-type abundances from bulk RNA sequencing data. Unlike their human equivalents, methods to quantify the cellular composition of complex tissues from murine transcriptomics are sparse and sometimes not easy to use. We extended the immunedeconv R package to facilitate the deconvolution of mouse transcriptomics, enabling the quantification of murine immune-cell types using 13 different methods. Through immunedeconv, we further offer the possibility of tweaking cell signatures used by deconvolution methods, providing custom annotations tailored for specific cell types and tissues. These developments strongly facilitate the study of the immune-cell composition of mouse models and further open new avenues in the investigation of the cellular composition of other tissues and organisms. Availability and implementation: The R package and the documentation are available at https://github.com/omnideconv/immunedeconv.
