Your browser doesn't support javascript.
loading
ModEx: A text mining system for extracting mode of regulation of transcription factor-gene regulatory interaction.
Farahmand, Saman; Riley, Todd; Zarringhalam, Kourosh.
Afiliación
  • Farahmand S; Computational Sciences PhD program, University of Massachusetts Boston, Boston, USA; Department of Biology, University of Massachusetts Boston, Boston, USA.
  • Riley T; Department of Biology, University of Massachusetts Boston, Boston, USA.
  • Zarringhalam K; Department of Mathematics, University of Massachusetts Boston, Boston, USA. Electronic address: kourosh.zarringhalam@umb.edu.
J Biomed Inform ; 102: 103353, 2020 02.
Article en En | MEDLINE | ID: mdl-31857203
BACKGROUND: Transcription factors (TFs) are proteins that are fundamental to transcription and regulation of gene expression. Each TF may regulate multiple genes and each gene may be regulated by multiple TFs. TFs can act as either activator or repressor of gene expression. This complex network of interactions between TFs and genes underlies many developmental and biological processes and is implicated in several human diseases such as cancer. Hence deciphering the network of TF-gene interactions with information on mode of regulation (activation vs. repression) is an important step toward understanding the regulatory pathways that underlie complex traits. There are many experimental, computational, and manually curated databases of TF-gene interactions. In particular, high-throughput ChIP-Seq datasets provide a large-scale map or transcriptional regulatory interactions. However, these interactions are not annotated with information on context and mode of regulation. Such information is crucial to gain a global picture of gene regulatory mechanisms and can aid in developing machine learning models for applications such as biomarker discovery, prediction of response to therapy, and precision medicine. METHODS: In this work, we introduce a text-mining system to annotate ChIP-Seq derived interaction with such meta data through mining PubMed articles. We evaluate the performance of our system using gold standard small scale manually curated databases. RESULTS: Our results show that the method is able to accurately extract mode of regulation with F-score 0.77 on TRRUST curated interaction and F-score 0.96 on intersection of TRUSST and ChIP-network. We provide a HTTP REST API for our code to facilitate usage. Availibility: Source code and datasets are available for download on GitHub: https://github.com/samanfrm/modex.
Asunto(s)
Palabras clave

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Factores de Transcripción / Regulación de la Expresión Génica / Minería de Datos Tipo de estudio: Prognostic_studies Límite: Humans Idioma: En Revista: J Biomed Inform Asunto de la revista: INFORMATICA MEDICA Año: 2020 Tipo del documento: Article País de afiliación: Estados Unidos

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Factores de Transcripción / Regulación de la Expresión Génica / Minería de Datos Tipo de estudio: Prognostic_studies Límite: Humans Idioma: En Revista: J Biomed Inform Asunto de la revista: INFORMATICA MEDICA Año: 2020 Tipo del documento: Article País de afiliación: Estados Unidos