RESUMEN
Tumors weakly infiltrated by T lymphocytes poorly respond to immunotherapy. We aimed to unveil malignancy-associated programs regulating T cell entrance, arrest, and activation in the tumor environment. Differential expression of cell adhesion and tissue architecture programs, particularly the presence of the membrane tetraspanin claudin (CLDN)18 as a signature gene, demarcated immune-infiltrated from immune-depleted mouse pancreatic tumors. In human pancreatic ductal adenocarcinoma (PDAC) and non-small cell lung cancer, CLDN18 expression positively correlated with more differentiated histology and favorable prognosis. CLDN18 on the cell surface promoted accrual of cytotoxic T lymphocytes (CTLs), facilitating direct CTL contacts with tumor cells by driving the mobilization of the adhesion protein ALCAM to the lipid rafts of the tumor cell membrane through actin. This process favored the formation of robust immunological synapses (ISs) between CTLs and CLDN18-positive cancer cells, resulting in increased T cell activation. Our data reveal an immune role for CLDN18 in orchestrating T cell infiltration and shaping the tumor immune contexture.
Asunto(s)
Carcinoma Ductal Pancreático , Claudinas , Activación de Linfocitos , Neoplasias Pancreáticas , Linfocitos T Citotóxicos , Animales , Humanos , Ratones , Carcinoma de Pulmón de Células no Pequeñas/inmunología , Carcinoma de Pulmón de Células no Pequeñas/patología , Carcinoma de Pulmón de Células no Pequeñas/metabolismo , Carcinoma Ductal Pancreático/inmunología , Carcinoma Ductal Pancreático/patología , Carcinoma Ductal Pancreático/metabolismo , Línea Celular Tumoral , Claudinas/metabolismo , Claudinas/genética , Regulación Neoplásica de la Expresión Génica/inmunología , Sinapsis Inmunológicas/metabolismo , Sinapsis Inmunológicas/inmunología , Neoplasias Pulmonares/inmunología , Neoplasias Pulmonares/patología , Activación de Linfocitos/inmunología , Linfocitos Infiltrantes de Tumor/inmunología , Microdominios de Membrana/metabolismo , Microdominios de Membrana/inmunología , Ratones Endogámicos C57BL , Neoplasias Pancreáticas/inmunología , Neoplasias Pancreáticas/patología , Linfocitos T Citotóxicos/inmunología , Microambiente Tumoral/inmunologíaRESUMEN
Transcription factors (TFs) are key regulatory proteins that control the transcriptional rate of cells by binding short DNA sequences called transcription factor binding sites (TFBS) or motifs. Identifying and characterizing TFBS is fundamental to understanding the regulatory mechanisms governing the transcriptional state of cells. During the last decades, several experimental methods have been developed to recover DNA sequences containing TFBS. In parallel, computational methods have been proposed to discover and identify TFBS motifs based on these DNA sequences. This is one of the most widely investigated problems in bioinformatics and is referred to as the motif discovery problem. In this manuscript, we review classical and novel experimental and computational methods developed to discover and characterize TFBS motifs in DNA sequences, highlighting their advantages and drawbacks. We also discuss open challenges and future perspectives that could fill the remaining gaps in the field.
Asunto(s)
Algoritmos , Factores de Transcripción , Unión Proteica , Factores de Transcripción/metabolismo , Sitios de Unión , Secuencia de Bases , Biología ComputacionalRESUMEN
MOTIVATION: Cytometry comprises powerful techniques for analyzing the cell heterogeneity of a biological sample by examining the expression of protein markers. These technologies impact especially the field of oncoimmunology, where cell identification is essential to analyze the tumor microenvironment. Several classification tools have been developed for the annotation of cytometry datasets, which include supervised tools that require a training set as a reference (i.e. reference-based) and semisupervised tools based on the manual definition of a marker table. The latter is closer to the traditional annotation of cytometry data based on manual gating. However, they require the manual definition of a marker table that cannot be extracted automatically in a reference-based fashion. Therefore, we are lacking methods that allow both classification approaches while maintaining the high biological interpretability given by the marker table. RESULTS: We present a new tool called GateMeClass (Gate Mining and Classification) which overcomes the limitation of the current methods of classification of cytometry data allowing both semisupervised and supervised annotation based on a marker table that can be defined manually or extracted from an external annotated dataset. We measured the accuracy of GateMeClass for annotating three well-established benchmark mass cytometry datasets and one flow cytometry dataset. The performance of GateMeClass is comparable to reference-based methods and marker table-based techniques, offering greater flexibility and rapid execution times. AVAILABILITY AND IMPLEMENTATION: GateMeClass is implemented in R language and is publicly available at https://github.com/simo1c/GateMeClass.
Asunto(s)
Minería de Datos , Citometría de Flujo , Citometría de Flujo/métodos , Minería de Datos/métodos , Humanos , Programas Informáticos , Algoritmos , Microambiente TumoralRESUMEN
Most cell type-specific genes are regulated by the interaction of enhancers with their promoters. The identification of enhancers is not trivial as enhancers are diverse in their characteristics and dynamic in their interaction partners. We present Esearch3D, a new method that exploits network theory approaches to identify active enhancers. Our work is based on the fact that enhancers act as a source of regulatory information to increase the rate of transcription of their target genes and that the flow of this information is mediated by the folding of chromatin in the three-dimensional (3D) nuclear space between the enhancer and the target gene promoter. Esearch3D reverse engineers this flow of information to calculate the likelihood of enhancer activity in intergenic regions by propagating the transcription levels of genes across 3D genome networks. Regions predicted to have high enhancer activity are shown to be enriched in annotations indicative of enhancer activity. These include: enhancer-associated histone marks, bidirectional CAGE-seq, STARR-seq, P300, RNA polymerase II and expression quantitative trait loci (eQTLs). Esearch3D leverages the relationship between chromatin architecture and transcription, allowing the prediction of active enhancers and an understanding of the complex underpinnings of regulatory networks. The method is available at: https://github.com/InfOmics/Esearch3D and https://doi.org/10.5281/zenodo.7737123.
Asunto(s)
Cromatina , Elementos de Facilitación Genéticos , Programas Informáticos , Cromatina/genética , Expresión Génica , Regiones Promotoras Genéticas , ARN Polimerasa II/genética , ARN Polimerasa II/metabolismoRESUMEN
The joint modeling of genetic data and brain imaging information allows for determining the pathophysiological pathways of neurodegenerative diseases such as Alzheimer's disease (AD). This task has typically been approached using mass-univariate methods that rely on a complete set of Single Nucleotide Polymorphisms (SNPs) to assess their association with selected image-derived phenotypes (IDPs). However, such methods are prone to multiple comparisons bias and, most importantly, fail to account for potential cross-feature interactions, resulting in insufficient detection of significant associations. Ways to overcome these limitations while reducing the number of traits aim at conveying genetic information at the gene level and capturing the integrated genetic effects of a set of genetic variants, rather than looking at each SNP individually. Their associations with brain IDPs are still largely unexplored in the current literature, though they can uncover new potential genetic determinants for brain modulations in the AD continuum. In this work, we explored an explainable multivariate model to analyze the genetic basis of the grey matter modulations, relying on the AD Neuroimaging Initiative (ADNI) phase 3 dataset. Cortical thicknesses and subcortical volumes derived from T1-weighted Magnetic Resonance were considered to describe the imaging phenotypes. At the same time the genetic counterpart was represented by gene variant scores extracted by the Sequence Kernel Association Test (SKAT) filtering model. Moreover, transcriptomic analysis was carried on to assess the expression of the resulting genes in the main brain structures as a form of validation. Results highlighted meaningful genotype-phenotype interactionsas defined by three latent components showing a significant difference in the projection scores between patients and controls. Among the significant associations, the model highlighted EPHX1 and BCAS1 gene variant scores involved in neurodegenerative and myelination processes, hence relevant for AD. In particular, the first was associated with decreased subcortical volumes and the second with decreasedtemporal lobe thickness. Noteworthy, BCAS1 is particularly expressed in the dentate gyrus. Overall, the proposed approach allowed capturing genotype-phenotype interactions in a restricted study cohort that was confirmed by transcriptomic analysis, offering insights into the underlying mechanisms of neurodegeneration in AD in line with previous findings and suggesting new potential disease biomarkers.
Asunto(s)
Enfermedad de Alzheimer , Humanos , Enfermedad de Alzheimer/diagnóstico por imagen , Enfermedad de Alzheimer/genética , Enfermedad de Alzheimer/patología , Neuroimagen/métodos , Imagen por Resonancia Magnética/métodos , Encéfalo/diagnóstico por imagen , Atrofia/patología , Proteínas de NeoplasiasRESUMEN
Given a group of genomes, represented as the sets of genes that belong to them, the discovery of the pangenomic content is based on the search of genetic homology among the genes for clustering them into families. Thus, pangenomic analyses investigate the membership of the families to the given genomes. This approach is referred to as the gene-oriented approach in contrast to other definitions of the problem that takes into account different genomic features. In the past years, several tools have been developed to discover and analyse pangenomic contents. Because of the hardness of the problem, each tool applies a different strategy for discovering the pangenomic content. This results in a differentiation of the performance of each tool that depends on the composition of the input genomes. This review reports the main analysis instruments provided by the current state of the art tools for the discovery of pangenomic contents. Moreover, unlike previous works, the presented study compares pangenomic tools from a methodological perspective, analysing the causes that lead a given methodology to outperform other tools. The analysis is performed by taking into account different bacterial populations, which are synthetically generated by changing evolutionary parameters. The benchmarks used to compare the pangenomic tools, in addition to the computational pipeline developed for this purpose, are available at https://github.com/InfOmics/pangenes-review. Contact: V. Bonnici, R. Giugno Supplementary information: Supplementary data are available at Briefings in Bioinformatics online.
Asunto(s)
Algoritmos , Biología Computacional/métodos , Genoma Bacteriano/genética , Genoma/genética , Genómica/métodos , Bacterias/clasificación , Bacterias/genética , Evolución Biológica , Mycoplasma/clasificación , Mycoplasma/genética , Filogenia , Programas InformáticosRESUMEN
MOTIVATION: Computational tools for pangenomic analysis have gained increasing interest over the past two decades in various applications such as evolutionary studies and vaccine development. Synthetic benchmarks are essential for the systematic evaluation of their performance. Currently, benchmarking tools represent a genome as a set of genetic sequences and fail to simulate the complete information of the genomes, which is essential for evaluating pangenomic detection between fragmented genomes. RESULTS: We present PANPROVA, a benchmark tool to simulate prokaryotic pangenomic evolution by evolving the complete genomic sequence of an ancestral isolate. In this way, the possibility of operating in the preassembly phase is enabled. Gene set variations, sequence variation and horizontal acquisition from a pool of external genomes are the evolutionary features of the tool. AVAILABILITY AND IMPLEMENTATION: PANPROVA is publicly available at https://github.com/InfOmics/PANPROVA. The manuscript explicitelly refers to the github repository. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Asunto(s)
Genoma , Programas Informáticos , Análisis de Secuencia de ADN , Secuenciación de Nucleótidos de Alto Rendimiento , BenchmarkingRESUMEN
Pangenomics was originally defined as the problem of comparing the composition of genes into gene families within a set of bacterial isolates belonging to the same species. The problem requires the calculation of sequence homology among such genes. When combined with metagenomics, namely for human microbiome composition analysis, gene-oriented pangenome detection becomes a promising method to decipher ecosystem functions and population-level evolution. Established computational tools are able to investigate the genetic content of isolates for which a complete genomic sequence is available. However, there is a plethora of incomplete genomes that are available on public resources, which only a few tools may analyze. Incomplete means that the process for reconstructing their genomic sequence is not complete, and only fragments of their sequence are currently available. However, the information contained in these fragments may play an essential role in the analyses. Here, we present PanDelos-frags, a computational tool which exploits and extends previous results in analyzing complete genomes. It provides a new methodology for inferring missing genetic information and thus for managing incomplete genomes. PanDelos-frags outperforms state-of-the-art approaches in reconstructing gene families in synthetic benchmarks and in a real use case of metagenomics. PanDelos-frags is publicly available at https://github.com/InfOmics/PanDelos-frags.
Asunto(s)
Genómica , Microbiota , Humanos , Ecosistema , Genoma , Genómica/métodos , Metagenómica/métodos , Programas Informáticos , Microbiota/genéticaRESUMEN
BACKGROUND: Microglia are the endogenous immune cells of the brain and act as sensors of pathology to maintain brain homeostasis and eliminate potential threats. In Alzheimer's disease (AD), toxic amyloid beta (Aß) accumulates in the brain and forms stiff plaques. In late-onset AD accounting for 95% of all cases, this is thought to be due to reduced clearance of Aß. Human genome-wide association studies and animal models suggest that reduced clearance results from aberrant function of microglia. While the impact of neurochemical pathways on microglia had been broadly studied, mechanical receptors regulating microglial functions remain largely unexplored. METHODS: Here we showed that a mechanotransduction ion channel, PIEZO1, is expressed and functional in human and mouse microglia. We used a small molecule agonist, Yoda1, to study how activation of PIEZO1 affects AD-related functions in human induced pluripotent stem cell (iPSC)-derived microglia-like cells (iMGL) under controlled laboratory experiments. Cell survival, metabolism, phagocytosis and lysosomal activity were assessed using real-time functional assays. To evaluate the effect of activation of PIEZO1 in vivo, 5-month-old 5xFAD male mice were infused daily with Yoda1 for two weeks through intracranial cannulas. Microglial Iba1 expression and Aß pathology were quantified with immunohistochemistry and confocal microscopy. Published human and mouse AD datasets were used for in-depth analysis of PIEZO1 gene expression and related pathways in microglial subpopulations. RESULTS: We show that PIEZO1 orchestrates Aß clearance by enhancing microglial survival, phagocytosis, and lysosomal activity. Aß inhibited PIEZO1-mediated calcium transients, whereas activation of PIEZO1 with a selective agonist, Yoda1, improved microglial phagocytosis resulting in Aß clearance both in human and mouse models of AD. Moreover, PIEZO1 expression was associated with a unique microglial transcriptional phenotype in AD as indicated by assessment of cellular metabolism, and human and mouse single-cell datasets. CONCLUSION: These results indicate that the compromised function of microglia in AD could be improved by controlled activation of PIEZO1 channels resulting in alleviated Aß burden. Pharmacological regulation of these mechanoreceptors in microglia could represent a novel therapeutic paradigm for AD.
Asunto(s)
Enfermedad de Alzheimer , Péptidos beta-Amiloides , Células Madre Pluripotentes Inducidas , Enfermedad de Alzheimer/patología , Péptidos beta-Amiloides/metabolismo , Animales , Modelos Animales de Enfermedad , Estudio de Asociación del Genoma Completo , Humanos , Células Madre Pluripotentes Inducidas/metabolismo , Canales Iónicos/metabolismo , Masculino , Mecanotransducción Celular , Ratones , Ratones Transgénicos , Microglía/metabolismoRESUMEN
Transcription factors (TFs) are proteins that promote or reduce the expression of genes by binding short genomic DNA sequences known as transcription factor binding sites (TFBS). While several tools have been developed to scan for potential occurrences of TFBS in linear DNA sequences or reference genomes, no tool exists to find them in pangenome variation graphs (VGs). VGs are sequence-labelled graphs that can efficiently encode collections of genomes and their variants in a single, compact data structure. Because VGs can losslessly compress large pangenomes, TFBS scanning in VGs can efficiently capture how genomic variation affects the potential binding landscape of TFs in a population of individuals. Here we present GRAFIMO (GRAph-based Finding of Individual Motif Occurrences), a command-line tool for the scanning of known TF DNA motifs represented as Position Weight Matrices (PWMs) in VGs. GRAFIMO extends the standard PWM scanning procedure by considering variations and alternative haplotypes encoded in a VG. Using GRAFIMO on a VG based on individuals from the 1000 Genomes project we recover several potential binding sites that are enhanced, weakened or missed when scanning only the reference genome, and which could constitute individual-specific binding events. GRAFIMO is available as an open-source tool, under the MIT license, at https://github.com/pinellolab/GRAFIMO and https://github.com/InfOmics/GRAFIMO.
Asunto(s)
Variación Genética , Motivos de Nucleótidos , Programas Informáticos , Factores de Transcripción/metabolismo , Secuencia de Bases , Sitios de Unión/genética , Biología Computacional , Gráficos por Computador , Genoma Humano , Genómica , Haplotipos , Humanos , Unión Proteica/genéticaRESUMEN
Olfactory function, orchestrated by the cells of the olfactory mucosa at the rooftop of the nasal cavity, is disturbed early in the pathogenesis of Alzheimer's disease (AD). Biometals including zinc and calcium are known to be important for sense of smell and to be altered in the brains of AD patients. Little is known about elemental homeostasis in the AD patient olfactory mucosa. Here we aimed to assess whether the disease-related alterations to biometal homeostasis observed in the brain are also reflected in the olfactory mucosa. We applied RNA sequencing to discover gene expression changes related to metals in olfactory mucosal cells of cognitively healthy controls, individuals with mild cognitive impairment and AD patients, and performed analysis of the elemental content to determine metal levels. Results demonstrate that the levels of zinc, calcium and sodium are increased in the AD olfactory mucosa concomitantly with alterations to 17 genes related to metal-ion binding or metal-related function of the protein product. A significant elevation in alpha-2-macroglobulin, a known metal-binding biomarker correlated with brain disease burden, was observed on the gene and protein levels in the olfactory mucosa cells of AD patients. These data demonstrate that the olfactory mucosa cells derived from AD patients recapitulate certain impairments of biometal homeostasis observed in the brains of patients.
Asunto(s)
Enfermedad de Alzheimer , Oligoelementos , Enfermedad de Alzheimer/metabolismo , Calcio/metabolismo , Quelantes/metabolismo , Humanos , Mucosa Olfatoria/metabolismo , Oligoelementos/metabolismo , Zinc/metabolismoRESUMEN
BACKGROUND: Graphs are mathematical structures widely used for expressing relationships among elements when representing biomedical and biological information. On top of these representations, several analyses are performed. A common task is the search of one substructure within one graph, called target. The problem is referred to as one-to-one subgraph search, and it is known to be NP-complete. Heuristics and indexing techniques can be applied to facilitate the search. Indexing techniques are also exploited in the context of searching in a collection of target graphs, referred to as one-to-many subgraph problem. Filter-and-verification methods that use indexing approaches provide a fast pruning of target graphs or parts of them that do not contain the query. The expensive verification phase is then performed only on the subset of promising targets. Indexing strategies extract graph features at a sufficient granularity level for performing a powerful filtering step. Features are memorized in data structures allowing an efficient access. Indexing size, querying time and filtering power are key points for the development of efficient subgraph searching solutions. RESULTS: An existing approach, GRAPES, has been shown to have good performance in terms of speed-up for both one-to-one and one-to-many cases. However, it suffers in the size of the built index. For this reason, we propose GRAPES-DD, a modified version of GRAPES in which the indexing structure has been replaced with a Decision Diagram. Decision Diagrams are a broad class of data structures widely used to encode and manipulate functions efficiently. Experiments on biomedical structures and synthetic graphs have confirmed our expectation showing that GRAPES-DD has substantially reduced the memory utilization compared to GRAPES without worsening the searching time. CONCLUSION: The use of Decision Diagrams for searching in biochemical and biological graphs is completely new and potentially promising thanks to their ability to encode compactly sets by exploiting their structure and regularity, and to manipulate entire sets of elements at once, instead of exploring each single element explicitly. Search strategies based on Decision Diagram makes the indexing for biochemical graphs, and not only, more affordable allowing us to potentially deal with huge and ever growing collections of biochemical and biological structures.
Asunto(s)
Vitis , Indización y Redacción de Resúmenes , Algoritmos , Bases de Datos FactualesRESUMEN
MOTIVATION: Clustered regularly interspaced short palindromic repeats (CRISPR) technologies allow for facile genomic modification in a site-specific manner. A key step in this process is the in silico design of single guide RNAs to efficiently and specifically target a site of interest. To this end, it is necessary to enumerate all potential off-target sites within a given genome that could be inadvertently altered by nuclease-mediated cleavage. Currently available software for this task is limited by computational efficiency, variant support or annotation, and assessment of the functional impact of potential off-target effects. RESULTS: To overcome these limitations, we have developed CRISPRitz, a suite of software tools to support the design and analysis of CRISPR/CRISPR-associated (Cas) experiments. Using efficient data structures combined with parallel computation, we offer a rapid, reliable, and exhaustive search mechanism to enumerate a comprehensive list of putative off-target sites. As proof-of-principle, we performed a head-to-head comparison with other available tools on several datasets. This analysis highlighted the unique features and superior computational performance of CRISPRitz including support for genomic searching with DNA/RNA bulges and mismatches of arbitrary size as specified by the user as well as consideration of genetic variants (variant-aware). In addition, graphical reports are offered for coding and non-coding regions that annotate the potential impact of putative off-target sites that lie within regions of functional genomic annotation (e.g. insulator and chromatin accessible sites from the ENCyclopedia Of DNA Elements [ENCODE] project). AVAILABILITY AND IMPLEMENTATION: The software is freely available at: https://github.com/pinellolab/CRISPRitzhttps://github.com/InfOmics/CRISPRitz. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Asunto(s)
Repeticiones Palindrómicas Cortas Agrupadas y Regularmente Espaciadas , Edición Génica , Sistemas CRISPR-Cas , ARN Guía de Kinetoplastida , Programas InformáticosRESUMEN
BACKGROUND: Ischemic stroke is a devastating disease without a cure. The available treatments for ischemic stroke, thrombolysis by tissue plasminogen activator, and thrombectomy are suitable only to a fraction of patients and thus novel therapeutic approaches are urgently needed. The neuroinflammatory responses elicited secondary to the ischemic attack further aggravate the stroke-induced neuronal damage. It has been demonstrated that these responses are regulated at the level of non-coding RNAs, especially miRNAs. METHODS: We utilized lentiviral vectors to overexpress miR-669c in BV2 microglial cells in order to modulate their polarization. To detect whether the modulation of microglial activation by miR-669c provides protection in a mouse model of transient focal ischemic stroke, miR-669c overexpression was driven by a lentiviral vector injected into the striatum prior to induction of ischemic stroke. RESULTS: Here, we demonstrate that miR-669c-3p, a member of chromosome 2 miRNA cluster (C2MC), is induced upon hypoxic and excitotoxic conditions in vitro and in two different in vivo models of stroke. Rather than directly regulating the neuronal survival in vitro, miR-669c is capable of attenuating the microglial proinflammatory activation in vitro and inducing the expression of microglial alternative activation markers arginase 1 (Arg1), chitinase-like 3 (Ym1), and peroxisome proliferator-activated receptor gamma (PPAR-γ). Intracerebral overexpression of miR-669c significantly decreased the ischemia-induced cell death and ameliorated the stroke-induced neurological deficits both at 1 and 3 days post injury (dpi). Albeit miR-669c overexpression failed to alter the overall Iba1 protein immunoreactivity, it significantly elevated Arg1 levels in the ischemic brain and increased colocalization of Arg1 and Iba1. Moreover, miR-669c overexpression under cerebral ischemia influenced several morphological characteristics of Iba1 positive cells. We further demonstrate the myeloid differentiation primary response gene 88 (MyD88) transcript as a direct target for miR-669c-3p in vitro and show reduced levels of MyD88 in miR-669c overexpressing ischemic brains in vivo. CONCLUSIONS: Collectively, our data provide the evidence that miR-669c-3p is protective in a mouse model of ischemic stroke through enhancement of the alternative microglial/macrophage activation and inhibition of MyD88 signaling. Our results accentuate the importance of controlling miRNA-regulated responses for the therapeutic benefit in conditions of stroke and neuroinflammation.
Asunto(s)
Ventrículos Cerebrales/metabolismo , Accidente Cerebrovascular Isquémico/metabolismo , Activación de Macrófagos/fisiología , Macrófagos/metabolismo , MicroARNs/metabolismo , Microglía/metabolismo , Factor 88 de Diferenciación Mieloide/metabolismo , Animales , Células Cultivadas , Modelos Animales de Enfermedad , Accidente Cerebrovascular Isquémico/genética , Ratones , MicroARNs/genética , Neuronas/metabolismo , Transducción de Señal/fisiologíaRESUMEN
miRandola (http://mirandola.iit.cnr.it/) is a database of extracellular non-coding RNAs (ncRNAs) that was initially published in 2012, foreseeing the relevance of ncRNAs as non-invasive biomarkers. An increasing amount of experimental evidence shows that ncRNAs are frequently dysregulated in diseases. Further, ncRNAs have been discovered in different extracellular forms, such as exosomes, which circulate in human body fluids. Thus, miRandola 2017 is an effort to update and collect the accumulating information on extracellular ncRNAs that is spread across scientific publications and different databases. Data are manually curated from 314 articles that describe miRNAs, long non-coding RNAs and circular RNAs. Fourteen organisms are now included in the database, and associations of ncRNAs with 25 drugs, 47 sample types and 197 diseases. miRandola also classifies extracellular RNAs based on their extracellular form: Argonaute2 protein, exosome, microvesicle, microparticle, membrane vesicle, high density lipoprotein and circulating. We also implemented a new web interface to improve the user experience.
Asunto(s)
Bases de Datos Genéticas , Bases del Conocimiento , ARN no Traducido , Biomarcadores , Ácidos Nucleicos Libres de Células , Curaduría de Datos , Humanos , MicroARNs , ARN , ARN Circular , ARN Largo no Codificante , Interfaz Usuario-ComputadorRESUMEN
The 17th International NETTAB workshop was held in Palermo, Italy, on October 16-18, 2017. The special topic for the meeting was "Methods, tools and platforms for Personalised Medicine in the Big Data Era", but the traditional topics of the meeting series were also included in the event. About 40 scientific contributions were presented, including four keynote lectures, five guest lectures, and many oral communications and posters. Also, three tutorials were organised before and after the workshop. Full papers from some of the best works presented in Palermo were submitted for this Supplement of BMC Bioinformatics. Here, we provide an overview of meeting aims and scope. We also shortly introduce selected papers that have been accepted for publication in this Supplement, for a complete presentation of the outcomes of the meeting.
Asunto(s)
Biología Computacional/métodos , Atención a la Salud , Genómica , Humanos , Italia , Neoplasias/genética , Medicina de PrecisiónRESUMEN
Posttranscriptional cross talk and communication between genes mediated by microRNA response element (MREs) yield large regulatory competing endogenous RNA (ceRNA) networks. Their inference may improve the understanding of pathologies and shed new light on biological mechanisms. A variety of RNA: messenger RNA, transcribed pseudogenes, noncoding RNA, circular RNA and proteins related to RNA-induced silencing complex complex interacting with RNA transfer and ribosomal RNA have been experimentally proved to be ceRNAs. We retrace the ceRNA hypothesis of posttranscriptional regulation from its original formulation [Salmena L, Poliseno L, Tay Y, et al. Cell 2011;146:353-8] to the most recent experimental and computational validations. We experimentally analyze the methods in literature [Li J-H, Liu S, Zhou H, et al. Nucleic Acids Res 2013;42:D92-7; Sumazin P, Yang X, Chiu H-S, et al. Cell 2011;147:370-81; Sarver AL, Subramanian S. Bioinformation 2012;8:731-3] comparing them with a general machine learning approach, called ceRNA predIction Algorithm, evaluating the performance in predicting novel MRE-based ceRNAs.
Asunto(s)
Biología Computacional/métodos , Regulación de la Expresión Génica , MicroARNs/genética , ARN Largo no Codificante/genética , ARN Mensajero/genética , ARN/genética , Redes Reguladoras de Genes , Humanos , ARN Circular , Elementos de RespuestaRESUMEN
BACKGROUND: Pan-genome approaches afford the discovery of homology relations in a set of genomes, by determining how some gene families are distributed among a given set of genomes. The retrieval of a complete gene distribution among a class of genomes is an NP-hard problem because computational costs increase with the number of analyzed genomes, in fact, all-against-all gene comparisons are required to completely solve the problem. In presence of phylogenetically distant genomes, due to the variability introduced in gene duplication and transmission, the task of recognizing homologous genes becomes even more difficult. A challenge on this field is that of designing fast and adaptive similarity measures in order to find a suitable pan-genome structure of homology relations. RESULTS: We present PanDelos, a stand alone tool for the discovery of pan-genome contents among phylogenetic distant genomes. The methodology is based on information theory and network analysis. It is parameter-free because thresholds are automatically deduced from the context. PanDelos avoids sequence alignment by introducing a measure based on k-mer multiplicity. The k-mer length is defined according to general arguments rather than empirical considerations. Homology candidate relations are integrated into a global network and groups of homologous genes are extracted by applying a community detection algorithm. CONCLUSIONS: PanDelos outperforms existing approaches, Roary and EDGAR, in terms of running times and quality content discovery. Tests were run on collections of real genomes, previously used in analogous studies, and in synthetic benchmarks that represent fully trusted golden truth. The software is available at https://github.com/GiugnoLab/PanDelos .
Asunto(s)
Diccionarios como Asunto , Genoma Bacteriano , Programas Informáticos , Bacterias/genética , Bases de Datos Genéticas , Duplicación de Gen , Filogenia , Factores de TiempoRESUMEN
BACKGROUND: The analysis of tissue-specific protein interaction networks and their functional enrichment in pathological and normal tissues provides insights on the etiology of diseases. The Pan-cancer proteomic project, in The Cancer Genome Atlas, collects protein expressions in human cancers and it is a reference resource for the functional study of cancers. However, established protocols to infer interaction networks from protein expressions are still missing. RESULTS: We have developed a methodology called Inference Network Based on iRefIndex Analysis (INBIA) to accurately correlate proteomic inferred relations to protein-protein interaction (PPI) networks. INBIA makes use of 14 network inference methods on protein expressions related to 16 cancer types. It uses as reference model the iRefIndex human PPI network. Predictions are validated through non-interacting and tissue specific PPI networks resources. The first, Negatome, takes into account likely non-interacting proteins by combining both structure properties and literature mining. The latter, TissueNet and GIANT, report experimentally verified PPIs in more than 50 human tissues. The reliability of the proposed methodology is assessed by comparing INBIA with PERA, a tool which infers protein interaction networks from Pathway Commons, by both functional and topological analysis. CONCLUSION: Results show that INBIA is a valuable approach to predict proteomic interactions in pathological conditions starting from the current knowledge of human protein interactions.