Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 57
Filtrar
1.
Brief Bioinform ; 25(2)2024 Jan 22.
Artigo em Inglês | MEDLINE | ID: mdl-38436559

RESUMO

A wide range of approaches can be used to detect micro RNA (miRNA)-target gene pairs (mTPs) from expression data, differing in the ways the gene and miRNA expression profiles are calculated, combined and correlated. However, there is no clear consensus on which is the best approach across all datasets. Here, we have implemented multiple strategies and applied them to three distinct rare disease datasets that comprise smallRNA-Seq and RNA-Seq data obtained from the same samples, obtaining mTPs related to the disease pathology. All datasets were preprocessed using a standardized, freely available computational workflow, DEG_workflow. This workflow includes coRmiT, a method to compare multiple strategies for mTP detection. We used it to investigate the overlap of the detected mTPs with predicted and validated mTPs from 11 different databases. Results show that there is no clear best strategy for mTP detection applicable to all situations. We therefore propose the integration of the results of the different strategies by selecting the one with the highest odds ratio for each miRNA, as the optimal way to integrate the results. We applied this selection-integration method to the datasets and showed it to be robust to changes in the predicted and validated mTP databases. Our findings have important implications for miRNA analysis. coRmiT is implemented as part of the ExpHunterSuite Bioconductor package available from https://bioconductor.org/packages/ExpHunterSuite.


Assuntos
MicroRNAs , Consenso , Bases de Dados Factuais , MicroRNAs/genética , Razão de Chances , RNA-Seq
2.
Genes Immun ; 25(2): 124-131, 2024 04.
Artigo em Inglês | MEDLINE | ID: mdl-38396174

RESUMO

Meniere Disease (MD) is a chronic inner ear disorder characterized by vertigo attacks, sensorineural hearing loss, tinnitus, and aural fullness. Extensive evidence supporting the inflammatory etiology of MD has been found, therefore, by using transcriptome analysis, we aim to describe the inflammatory variants of MD. We performed Bulk RNAseq on 45 patients with definite MD and 15 healthy controls. MD patients were classified according to their basal levels of IL-1ß into 2 groups: high and low. Differentially expression analysis was performed using the ExpHunter Suite, and cell type proportion was evaluated using the estimation algorithms xCell, ABIS, and CIBERSORTx. MD patients showed 15 differentially expressed genes (DEG) compared to controls. The top DEGs include IGHG1 (p = 1.64 × 10-6) and IGLV3-21 (p = 6.28 × 10-3), supporting a role in the adaptative immune response. Cytokine profiling defines a subgroup of patients with high levels of IL-1ß with up-regulation of IL6 (p = 7.65 × 10-8) and INHBA (p = 3.39 × 10-7) genes. Transcriptomic data from peripheral blood mononuclear cells support a proinflammatory subgroup of MD patients with high levels of IL6 and an increase in naïve B-cells, and memory CD8+ T cells.


Assuntos
Doença de Meniere , Humanos , Doença de Meniere/metabolismo , Leucócitos Mononucleares/metabolismo , Interleucina-6/metabolismo , Linfócitos T CD8-Positivos/metabolismo , Perfilação da Expressão Gênica
3.
Brief Bioinform ; 23(4)2022 07 18.
Artigo em Inglês | MEDLINE | ID: mdl-35731990

RESUMO

BACKGROUND: Angiogenesis is regulated by multiple genes whose variants can lead to different disorders. Among them, rare diseases are a heterogeneous group of pathologies, most of them genetic, whose information may be of interest to determine the still unknown genetic and molecular causes of other diseases. In this work, we use the information on rare diseases dependent on angiogenesis to investigate the genes that are associated with this biological process and to determine if there are interactions between the genes involved in its deregulation. RESULTS: We propose a systemic approach supported by the use of pathological phenotypes to group diseases by semantic similarity. We grouped 158 angiogenesis-related rare diseases in 18 clusters based on their phenotypes. Of them, 16 clusters had traceable gene connections in a high-quality interaction network. These disease clusters are associated with 130 different genes. We searched for genes associated with angiogenesis througth ClinVar pathogenic variants. Of the seven retrieved genes, our system confirms six of them. Furthermore, it allowed us to identify common affected functions among these disease clusters. AVAILABILITY: https://github.com/ElenaRojano/angio_cluster. CONTACT: seoanezonjic@uma.es and elenarojano@uma.es.


Assuntos
Biologia Computacional , Doenças Raras , Algoritmos , Análise por Conglomerados , Humanos , Fenótipo , Doenças Raras/genética , Semântica
4.
J Med Genet ; 60(4): 406-415, 2023 04.
Artigo em Inglês | MEDLINE | ID: mdl-36243518

RESUMO

BACKGROUND: Schaaf-Yang syndrome (SYS) is caused by truncating mutations in MAGEL2, mapping to the Prader-Willi region (15q11-q13), with an observed phenotype partially overlapping that of Prader-Willi syndrome. MAGEL2 plays a role in retrograde transport and protein recycling regulation. Our aim is to contribute to the characterisation of SYS pathophysiology at clinical, genetic and molecular levels. METHODS: We performed an extensive phenotypic and mutational revision of previously reported patients with SYS. We analysed the secretion levels of amyloid-ß 1-40 peptide (Aß1-40) and performed targeted metabolomic and transcriptomic profiles in fibroblasts of patients with SYS (n=7) compared with controls (n=11). We also transfected cell lines with vectors encoding wild-type (WT) or mutated MAGEL2 to assess stability and subcellular localisation of the truncated protein. RESULTS: Functional studies show significantly decreased levels of secreted Aß1-40 and intracellular glutamine in SYS fibroblasts compared with WT. We also identified 132 differentially expressed genes, including non-coding RNAs (ncRNAs) such as HOTAIR, and many of them related to developmental processes and mitotic mechanisms. The truncated form of MAGEL2 displayed a stability similar to the WT but it was significantly switched to the nucleus, compared with a mainly cytoplasmic distribution of the WT MAGEL2. Based on the updated knowledge, we offer guidelines for the clinical management of patients with SYS. CONCLUSION: A truncated MAGEL2 protein is stable and localises mainly in the nucleus, where it might exert a pathogenic neomorphic effect. Aß1-40 secretion levels and HOTAIR mRNA levels might be promising biomarkers for SYS. Our findings may improve SYS understanding and clinical management.


Assuntos
Síndrome de Prader-Willi , Humanos , Síndrome de Prader-Willi/genética , Fenótipo , Mutação , Proteínas/genética , Biomarcadores
5.
J Biomed Inform ; 144: 104421, 2023 08.
Artigo em Inglês | MEDLINE | ID: mdl-37315831

RESUMO

Angiogenesis is essential for tumor growth and cancer metastasis. Identifying the molecular pathways involved in this process is the first step in the rational design of new therapeutic strategies to improve cancer treatment. In recent years, RNA-seq data analysis has helped to determine the genetic and molecular factors associated with different types of cancer. In this work we performed integrative analysis using RNA-seq data from human umbilical vein endothelial cells (HUVEC) and patients with angiogenesis-dependent diseases to find genes that serve as potential candidates to improve the prognosis of tumor angiogenesis deregulation and understand how this process is orchestrated at the genetic and molecular level. We downloaded four RNA-seq datasets (including cellular models of tumor angiogenesis and ischaemic heart disease) from the Sequence Read Archive. Our integrative analysis includes a first step to determine differentially and co-expressed genes. For this, we used the ExpHunter Suite, an R package that performs differential expression, co-expression and functional analysis of RNA-seq data. We used both differentially and co-expressed genes to explore the human gene interaction network and determine which genes were found in the different datasets that may be key for the angiogenesis deregulation. Finally, we performed drug repositioning analysis to find potential targets related to angiogenesis inhibition. We found that that among the transcriptional alterations identified, SEMA3D and IL33 genes are deregulated in all datasets. Microenvironment remodeling, cell cycle, lipid metabolism and vesicular transport are the main molecular pathways affected. In addition to this, interacting genes are involved in intracellular signaling pathways, especially in immune system and semaphorins, respiratory electron transport and fatty acid metabolism. The methodology presented here can be used for finding common transcriptional alterations in other genetically-based diseases.


Assuntos
Perfilação da Expressão Gênica , Redes Reguladoras de Genes , Humanos , Perfilação da Expressão Gênica/métodos , Células Endoteliais , Transdução de Sinais/genética
6.
PLoS Genet ; 16(10): e1009054, 2020 10.
Artigo em Inglês | MEDLINE | ID: mdl-33001999

RESUMO

Genetic and molecular analysis of rare disease is made difficult by the small numbers of affected patients. Phenotypic comorbidity analysis can help rectify this by combining information from individuals with similar phenotypes and looking for overlap in terms of shared genes and underlying functional systems. However, few studies have combined comorbidity analysis with genomic data. We present a computational approach that connects patient phenotypes based on phenotypic co-occurence and uses genomic information related to the patient mutations to assign genes to the phenotypes, which are used to detect enriched functional systems. These phenotypes are clustered using network analysis to obtain functionally coherent phenotype clusters. We applied the approach to the DECIPHER database, containing phenotypic and genomic information for thousands of patients with heterogeneous rare disorders and copy number variants. Validity was demonstrated through overlap with known diseases, co-mention within the biomedical literature, semantic similarity measures, and patient cluster membership. These connected pairs formed multiple phenotype clusters, showing functional coherence, and mapped to genes and systems involved in similar pathological processes. Examples include claudin genes from the 22q11 genomic region associated with a cluster of phenotypes related to DiGeorge syndrome and genes related to the GO term anterior/posterior pattern specification associated with abnormal development. The clusters generated can help with the diagnosis of rare diseases, by suggesting additional phenotypes for a given patient and potential underlying functional systems. Other tools to find causal genes based on phenotype were also investigated. The approach has been implemented as a workflow, named PhenCo, which can be adapted to any set of patients for which phenomic and genomic data is available. Full details of the analysis, including the clusters formed, their constituent functional systems and underlying genes are given. Code to implement the workflow is available from GitHub.


Assuntos
Comorbidade , Predisposição Genética para Doença , Genômica , Doenças Raras/genética , Variações do Número de Cópias de DNA/genética , Bases de Dados Genéticas , Estudos de Associação Genética , Genoma Humano/genética , Genótipo , Humanos , Mutação/genética , Fenótipo , Doenças Raras/diagnóstico , Doenças Raras/patologia
7.
BMC Bioinformatics ; 23(1): 43, 2022 Jan 15.
Artigo em Inglês | MEDLINE | ID: mdl-35033002

RESUMO

BACKGROUND: Protein function prediction remains a key challenge. Domain composition affects protein function. Here we present DomFun, a Ruby gem that uses associations between protein domains and functions, calculated using multiple indices based on tripartite network analysis. These domain-function associations are combined at the protein level, to generate protein-function predictions. RESULTS: We analysed 16 tripartite networks connecting homologous superfamily and FunFam domains from CATH-Gene3D with functional annotations from the three Gene Ontology (GO) sub-ontologies, KEGG, and Reactome. We validated the results using the CAFA 3 benchmark platform for GO annotation, finding that out of the multiple association metrics and domain datasets tested, Simpson index for FunFam domain-function associations combined with Stouffer's method leads to the best performance in almost all scenarios. We also found that using FunFams led to better performance than superfamilies, and better results were found for GO molecular function compared to GO biological process terms. DomFun performed as well as the highest-performing method in certain CAFA 3 evaluation procedures in terms of [Formula: see text] and [Formula: see text] We also implemented our own benchmark procedure, Pathway Prediction Performance (PPP), which can be used to validate function prediction for additional annotations sources, such as KEGG and Reactome. Using PPP, we found similar results to those found with CAFA 3 for GO, moreover we found good performance for the other annotation sources. As with CAFA 3, Simpson index with Stouffer's method led to the top performance in almost all scenarios. CONCLUSIONS: DomFun shows competitive performance with other methods evaluated in CAFA 3 when predicting proteins function with GO, although results vary depending on the evaluation procedure. Through our own benchmark procedure, PPP, we have shown it can also make accurate predictions for KEGG and Reactome. It performs best when using FunFams, combining Simpson index derived domain-function associations using Stouffer's method. The tool has been implemented so that it can be easily adapted to incorporate other protein features, such as domain data from other sources, amino acid k-mers and motifs. The DomFun Ruby gem is available from https://rubygems.org/gems/DomFun . Code maintained at https://github.com/ElenaRojano/DomFun . Validation procedure scripts can be found at https://github.com/ElenaRojano/DomFun_project .


Assuntos
Biologia Computacional , Proteínas , Bases de Dados de Proteínas , Ontologia Genética , Anotação de Sequência Molecular , Proteínas/genética
8.
Bioinformatics ; 37(8): 1076-1082, 2021 05 23.
Artigo em Inglês | MEDLINE | ID: mdl-33135068

RESUMO

MOTIVATION: Predicting the residues controlling a protein's interaction specificity is important not only to better understand its interactions but also to design mutations aimed at fine-tuning or swapping them as well. RESULTS: In this work, we present a methodology that combines sequence information (in the form of multiple sequence alignments) with interactome information to detect that kind of residues in paralogous families of proteins. The interactome is used to define pairwise similarities of interaction contexts for the proteins in the alignment. The method looks for alignment positions with patterns of amino-acid changes reflecting the similarities/differences in the interaction neighborhoods of the corresponding proteins. We tested this new methodology in a large set of human paralogous families with structurally characterized interactions, and discuss in detail the results for the RasH family. We show that this approach is a better predictor of interfacial residues than both, sequence conservation and an equivalent 'unsupervised' method that does not use interactome information. AVAILABILITY AND IMPLEMENTATION: http://csbg.cnb.csic.es/pazos/Xdet/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Proteínas , Software , Humanos , Proteínas/genética , Alinhamento de Sequência , Análise de Sequência de Proteína
9.
Hum Genet ; 140(3): 457-475, 2021 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-32778951

RESUMO

Copy number variation (CNV) related disorders tend to show complex phenotypic profiles that do not match known diseases. This makes it difficult to ascertain their underlying molecular basis. A potential solution is to compare the affected genomic regions for multiple patients that share a pathological phenotype, looking for commonalities. Here, we present a novel approach to associate phenotypes with functional systems, in terms of GO categories and KEGG and Reactome pathways, based on patient data. The approach uses genomic and phenomic data from the same patients, finding shared genomic regions between patients with similar phenotypes. These regions are mapped to genes to find associated functional systems. We applied the approach to analyse patients in the DECIPHER database with de novo CNVs, finding functional systems associated with most phenotypes, often due to mutations affecting related genes in the same genomic region. Manual inspection of the ten top-scoring phenotypes found multiple FunSys connections supported by the previous studies for seven of them. The workflow also produces reports focussed on the genes and FunSys connected to the different phenotypes, alongside patient-specific reports, which give details of the associated genes and FunSys for each individual in the cohort. These can be run in "confidential" mode, preserving patient confidentiality. The workflow presented here can be used to associate phenotypes with functional systems using data at the level of a whole cohort of patients, identifying important connections that could not be found when considering them individually. The full workflow is available for download, enabling it to be run on any patient cohort for which phenotypic and CNV data are available.


Assuntos
Variações do Número de Cópias de DNA , Predisposição Genética para Doença , Genótipo , Fenótipo , Estudos de Coortes , Bases de Dados Genéticas , Humanos
10.
Brief Bioinform ; 20(5): 1639-1654, 2019 09 27.
Artigo em Inglês | MEDLINE | ID: mdl-29893792

RESUMO

Variants within non-coding genomic regions can greatly affect disease. In recent years, increasing focus has been given to these variants, and how they can alter regulatory elements, such as enhancers, transcription factor binding sites and DNA methylation regions. Such variants can be considered regulatory variants. Concurrently, much effort has been put into establishing international consortia to undertake large projects aimed at discovering regulatory elements in different tissues, cell lines and organisms, and probing the effects of genetic variants on regulation by measuring gene expression. Here, we describe methods and techniques for discovering disease-associated non-coding variants using sequencing technologies. We then explain the computational procedures that can be used for annotating these variants using the information from the aforementioned projects, and prediction of their putative effects, including potential pathogenicity, based on rule-based and machine learning approaches. We provide the details of techniques to validate these predictions, by mapping chromatin-chromatin and chromatin-protein interactions, and introduce Clustered Regularly Interspaced Short Palindromic Repeats-Associated Protein 9 (CRISPR-Cas9) technology, which has already been used in this field and is likely to have a big impact on its future evolution. We also give examples of regulatory variants associated with multiple complex diseases. This review is aimed at bioinformaticians interested in the characterization of regulatory variants, molecular biologists and geneticists interested in understanding more about the nature and potential role of such variants from a functional point of views, and clinicians who may wish to learn about variants in non-coding genomic regions associated with a given disease and find out what to do next to uncover how they impact on the underlying mechanisms.


Assuntos
Repetições Palindrômicas Curtas Agrupadas e Regularmente Espaçadas , Sequências Reguladoras de Ácido Nucleico , Cromatina/metabolismo , Genoma Humano , Humanos , Aprendizado de Máquina , Ligação Proteica
11.
BMC Bioinformatics ; 18(1): 96, 2017 Feb 10.
Artigo em Inglês | MEDLINE | ID: mdl-28183267

RESUMO

BACKGROUND: Loss-of-function phenotypes are widely used to infer gene function using the principle that similar phenotypes are indicative of similar functions. However, converting phenotypic to functional annotations requires careful interpretation of phenotypic descriptions and assessment of phenotypic similarity. Understanding how functions and phenotypes are linked will be crucial for the development of methods for the automatic conversion of gene loss-of-function phenotypes to gene functional annotations. RESULTS: We explored the relation between cellular phenotypes from RNAi-based screens in human cells and gene annotations of cellular functions as provided by the Gene Ontology (GO). Comparing different similarity measures, we found that information content-based measures of phenotypic similarity were the best at capturing gene functional similarity. However, phenotypic similarities did not map to the Gene Ontology organization of gene function but to functions defined as groups of GO terms with shared gene annotations. CONCLUSIONS: Our observations have implications for the use and interpretation of phenotypic similarities as a proxy for gene functions both in RNAi screen data analysis and curation and in the prediction of disease genes.


Assuntos
Biologia Computacional/métodos , Área Sob a Curva , Análise por Conglomerados , Humanos , Fenótipo , Interferência de RNA , Curva ROC
12.
Mol Genet Genomics ; 292(4): 857-869, 2017 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-28386641

RESUMO

Drug resistance remains a major problem in combating malignancies, resulting critical the resistance to paclitaxel used in the treatment of many different cancers. Elucidating the cellular heterogeneity composition of tumours may be relevant to designing more effective treatment strategies on drug resistance. In particular, such heterogeneity correlates with the measurement of gene expression below the population level. However, experimental assays capturing differential response are limited and cannot discern the variation in gene expression specific to different cellular types in tumour populations. These limitations led us to consider a mathematical modelling approach, in which the gene expression of cellular subpopulations is recovered by deconvolution. Mathematically, the deconvolution is a multi-linear regression-based problem. We combined herein data on cellular subpopulation frequency composition with gene expression values from 16 tumour lines (8 resistant and 8 sensitive to paclitaxel treatment) to find genes that are differentially expressed between paclitaxel resistant and paclitaxel sensitive tumour lines in different cellular subpopulations. The results indicate that many genes differentially expressed between paclitaxel resistant and sensitive cancer lines are only detected when considering their heterogeneous cellular composition. Overall, our methodology is thought to keep in mind phenotypic heterogeneity improving our resolution in the identification of biomarkers on resistance to chemo-therapeutic agents.


Assuntos
Antineoplásicos/farmacologia , Resistencia a Medicamentos Antineoplásicos/genética , Regulação Neoplásica da Expressão Gênica , Modelos Teóricos , Paclitaxel/farmacologia , Linhagem Celular Tumoral , Células HCT116 , Células HT29 , Humanos , Células MCF-7 , Transdução de Sinais/genética
13.
BMC Genomics ; 17: 232, 2016 Mar 15.
Artigo em Inglês | MEDLINE | ID: mdl-26980139

RESUMO

BACKGROUND: Network medicine is a promising new discipline that combines systems biology approaches and network science to understand the complexity of pathological phenotypes. Given the growing availability of personalized genomic and phenotypic profiles, network models offer a robust integrative framework for the analysis of "omics" data, allowing the characterization of the molecular aetiology of pathological processes underpinning genetic diseases. METHODS: Here we make use of patient genomic data to exploit different network-based analyses to study genetic and phenotypic relationships between individuals. For this method, we analyzed a dataset of structural variants and phenotypes for 6,564 patients from the DECIPHER database, which encompasses one of the most comprehensive collections of pathogenic Copy Number Variations (CNVs) and their associated ontology-controlled phenotypes. We developed a computational strategy that identifies clusters of patients in a synthetic patient network according to their genetic overlap and phenotype enrichments. RESULTS: Many of these clusters of patients represent new genotype-phenotype associations, suggesting the identification of newly discovered phenotypically enriched loci (indicative of potential novel syndromes) that are currently absent from reference genomic disorder databases such as ClinVar, OMIM or DECIPHER itself. CONCLUSIONS: We provide a high-resolution map of pathogenic phenotypes associated with their respective significant genomic regions and a new powerful tool for diagnosis of currently uncharacterized mutations leading to deleterious phenotypes and syndromes.


Assuntos
Variações do Número de Cópias de DNA , Doenças Genéticas Inatas/genética , Genômica/métodos , Fenótipo , Estudos de Casos e Controles , Bases de Dados Genéticas , Estudos de Associação Genética , Loci Gênicos , Humanos , Mutação
14.
Bioinformatics ; 31(12): 2052-3, 2015 Jun 15.
Artigo em Inglês | MEDLINE | ID: mdl-25667547

RESUMO

MOTIVATION: Most biological processes remain only partially characterized with many components still to be identified. Given that a whole genome can usually not be tested in a functional assay, identifying the genes most likely to be of interest is of critical importance to avoid wasting resources. RESULTS: Given a set of known functionally related genes and using a state-of-the-art approach to data integration and mining, our Functional Lists (FUN-L) method provides a ranked list of candidate genes for testing. Validation of predictions from FUN-L with independent RNAi screens confirms that FUN-L-produced lists are enriched in genes with the expected phenotypes. In this article, we describe a website front end to FUN-L. AVAILABILITY AND IMPLEMENTATION: The website is freely available to use at http://funl.org


Assuntos
Algoritmos , Biologia Computacional/métodos , Mineração de Dados/métodos , Redes Reguladoras de Genes , Interferência de RNA , Software , Humanos , Fenótipo
15.
Amino Acids ; 48(10): 2433-42, 2016 10.
Artigo em Inglês | MEDLINE | ID: mdl-27270572

RESUMO

Almost 90 % of disease-associated genetic variants found using genome wide association studies (GWAS) are located in non-coding regions of the genome. Such variants can affect phenotype by altering important regulatory elements such as promoters, enhancers or repressors, leading to changes in gene expression and consequently disease, such as thyroid cancer and allergic diseases. A number of allergy and atopy related diseases such as asthma and atopic dermatitis are related to histamine receptors; however, these diseases are not fully characterized at the molecular level. Moreover, candidate gene based studies of common variants known as single nucleotide polymorphism (SNPs) located in the coding regions of these receptors have given mixed results. It is important to complement these approaches by identifying and characterising non-coding variants in order to further elucidate the role of these receptors in disease. Here we present an analysis of histamine receptor genes using the tool AnNCR-SNP to characterise variants in non-coding genomic regions. AnNCR-SNP combines bioinformatics and experimental data sets from various sources to predict the effects of genetic variation on gene expression regulation. We find many SNPs located in areas of open chromatin, overlapping with transcription factor binding sites and associated with changes in gene expression in expression quantitative trait loci (eQTL) experiments. Here we present the results as a catalogue of non-coding variation in histamine receptor genes to aid histamine researchers in identifying putative functional SNPs found in GWAS for further validation, and to help select variants for candidate gene studies.


Assuntos
Polimorfismo de Nucleotídeo Único , Característica Quantitativa Herdável , Receptores Histamínicos/genética , Análise de Sequência de DNA/métodos , Software , Estudo de Associação Genômica Ampla , Humanos
16.
BMC Genomics ; 16: 608, 2015 Aug 16.
Artigo em Inglês | MEDLINE | ID: mdl-26275604

RESUMO

BACKGROUND: In complex Metazoans a given gene frequently codes for multiple protein isoforms, through processes such as alternative splicing. Large scale functional annotation of these isoforms is a key challenge for functional genomics. This annotation gap is increasing with the large numbers of multi transcript genes being identified by technologies such as RNASeq. Furthermore attempts to characterise the functions of splicing in an organism are complicated by the difficulty in distinguishing functional isoforms from those produced by splicing errors or transcription noise. Tools to help prioritise candidate isoforms for testing are largely absent. RESULTS: In this study we implement a Time-course Switch (TS) score for ranking isoforms by their likelihood of producing additional functions based on their developmental expression profiles, as reported by modENCODE. The TS score allows us to better investigate functional roles of different isoforms expressed in multi transcript genes. From this analysis, we find that isoforms with high TS scores have sequence feature changes consistent with more deterministic splicing and functional changes and tend to gain domains or whole exons which could carry additional functions. Furthermore these functions appear to be particularly important for essential regulatory roles, establishing functional isoform switching as key for regulatory processes. Based on the TS score we develop a Transcript Annotations Pipeline for Alternative Splicing (TAPAS) that identifies functional neighbourhoods of potentially interesting isoforms. CONCLUSIONS: We have identified a subset of protein isoforms which appear to have high functional significance, particularly in regulation. This has been made possible through the development of novel methods that make use of transcript expression profiles. The methods and analyses we present here represent important first steps in the development of tools to address the near complete lack of isoform specific function annotation. In turn the tools allow us to better characterise the regulatory functions of alternative splicing in more detail.


Assuntos
Processamento Alternativo , Drosophila melanogaster/crescimento & desenvolvimento , Isoformas de Proteínas/metabolismo , Algoritmos , Animais , Biologia Computacional/métodos , Bases de Dados Genéticas , Drosophila melanogaster/genética , Regulação da Expressão Gênica no Desenvolvimento , RNA Mensageiro/metabolismo
17.
Bioinformatics ; 29(16): 1934-7, 2013 Aug 15.
Artigo em Inglês | MEDLINE | ID: mdl-23740740

RESUMO

MOTIVATION: Polypharmacology (the ability of a single drug to affect multiple targets) is a key feature that may explain part of the decreasing success of conventional drug discovery strategies driven by the quest for drugs to act selectively on a single target. Most drug targets are proteins that are composed of domains (their structural and functional building blocks). RESULTS: In this work, we model drug-domain networks to explore the role of protein domains as drug targets and to explain drug polypharmacology in terms of the interactions between drugs and protein domains. We find that drugs are organized around a privileged set of druggable domains. CONCLUSIONS: Protein domains are a good proxy for drug targets, and drug polypharmacology emerges as a consequence of the multi-domain composition of proteins. CONTACT: amoyag@uma.es SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Polifarmacologia , Estrutura Terciária de Proteína/efeitos dos fármacos , Humanos , Filogenia , Proteínas/efeitos dos fármacos , Proteínas/metabolismo
18.
Database (Oxford) ; 20242024 Apr 02.
Artigo em Inglês | MEDLINE | ID: mdl-38564426

RESUMO

The CoMentG resource contains millions of relationships between terms of biomedical interest obtained from the scientific literature. At the core of the system is a methodology for detecting significant co-mentions of concepts in the entire PubMed corpus. That method was applied to nine sets of terms covering the most important classes of biomedical concepts: diseases, symptoms/clinical signs, molecular functions, biological processes, cellular compartments, anatomic parts, cell types, bacteria and chemical compounds. We obtained more than 7 million relationships between more than 74 000 terms, and many types of relationships were not available in any other resource. As the terms were obtained from widely used resources and ontologies, the relationships are given using the standard identifiers provided by them and hence can be linked to other data. A web interface allows users to browse these associations, searching for relationships for a set of terms of interests provided as input, such as between a disease and their associated symptoms, underlying molecular processes or affected tissues. The results are presented in an interactive interface where the user can explore the reported relationships in different ways and follow links to other resources. Database URL: https://csbg.cnb.csic.es/CoMentG/.


Assuntos
Publicações , PubMed , Bases de Dados Factuais
19.
Biochim Biophys Acta Mol Basis Dis ; 1870(5): 167163, 2024 06.
Artigo em Inglês | MEDLINE | ID: mdl-38599261

RESUMO

PMM2-CDG (MIM # 212065), the most common congenital disorder of glycosylation, is caused by the deficiency of phosphomannomutase 2 (PMM2). It is a multisystemic disease of variable severity that particularly affects the nervous system; however, its molecular pathophysiology remains poorly understood. Currently, there is no effective treatment. We performed an RNA-seq based transcriptomic study using patient-derived fibroblasts to gain insight into the mechanisms underlying the clinical symptomatology and to identify druggable targets. Systems biology methods were used to identify cellular pathways potentially affected by PMM2 deficiency, including Senescence, Bone regulation, Cell adhesion and Extracellular Matrix (ECM) and Response to cytokines. Functional validation assays using patients' fibroblasts revealed defects related to cell proliferation, cell cycle, the composition of the ECM and cell migration, and showed a potential role of the inflammatory response in the pathophysiology of the disease. Furthermore, treatment with a previously described pharmacological chaperone reverted the differential expression of some of the dysregulated genes. The results presented from transcriptomic data might serve as a platform for identifying therapeutic targets for PMM2-CDG, as well as for monitoring the effectiveness of therapeutic strategies, including pharmacological candidates and mannose-1-P, drug repurposing.


Assuntos
Defeitos Congênitos da Glicosilação , Fibroblastos , Fosfotransferases (Fosfomutases) , Humanos , Defeitos Congênitos da Glicosilação/genética , Defeitos Congênitos da Glicosilação/patologia , Defeitos Congênitos da Glicosilação/metabolismo , Defeitos Congênitos da Glicosilação/tratamento farmacológico , Fosfotransferases (Fosfomutases)/genética , Fosfotransferases (Fosfomutases)/metabolismo , Fosfotransferases (Fosfomutases)/deficiência , Fibroblastos/metabolismo , Fibroblastos/patologia , Transcriptoma , Perfilação da Expressão Gênica , Proliferação de Células/genética , Proliferação de Células/efeitos dos fármacos , Feminino , Masculino , Movimento Celular/genética , Movimento Celular/efeitos dos fármacos
20.
J Mol Med (Berl) ; 2024 Sep 03.
Artigo em Inglês | MEDLINE | ID: mdl-39225820

RESUMO

Epigenetic alterations play a pivotal role in conditions influenced by environmental factors such as overweight and obesity. Many of these changes are tissue-specific, which entails a problem in its study since obtaining human tissue is a complex and invasive practice. While blood is widely used as a surrogate biomarker, it cannot directly extrapolate the evidence found in blood to tissue. Moreover, the intricacies of metabolic diseases add a new layer of complexity, as obesity leads to significant alterations in adipose tissue, potentially causing associated pathologies that can disrupt existing correlations seen in healthy individuals. Here, our objective was to determine which epigenetic markers exhibit correlations between blood and adipose tissue, regardless of the metabolic status. We collected paired blood and adipose tissue samples from 64 patients with morbidity obesity and non-obese and employed the MethylationEPIC 850 K array for analysis. We found that only a small fraction, specifically 4.3% (corresponding to 34,825 CpG sites), of the sites showed statistically significant correlations (R ≥ 0.6) between blood and adipose tissue. Within this subset, 5327 CpG sites exhibited a strong correlation (R ≥ 0.8) between blood and adipose tissue. Our findings suggest that the majority of epigenetic markers in peripheral blood do not reliably reflect changes occurring in visceral adipose tissues. However, it is important to note that there exists a distinct set of epigenetic markers that can indeed mirror changes in adipose tissue within blood samples. KEY MESSAGES: More than 8% of methylation sites exhibit similarity between blood and adipose tissues, regardless of BMI The correlation percentage between blood and adipose tissue is strongly influenced by gender The principal genes implicated in this correlation are related to metabolism or the immunological system.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA