Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 24
Filtrar
1.
Brief Bioinform ; 23(5)2022 09 20.
Artículo en Inglés | MEDLINE | ID: mdl-35959979

RESUMEN

The rapid development of genomic high-throughput sequencing has identified a large number of DNA regulatory elements with abundant epigenetics markers, which promotes the rapid accumulation of functional genomic region data. The comprehensively understanding and research of human functional genomic regions is still a relatively urgent work at present. However, the existing analysis tools lack extensive annotation and enrichment analytical abilities for these regions. Here, we designed a novel software, Genomic Region sets Enrichment Analysis Platform (GREAP), which provides comprehensive region annotation and enrichment analysis capabilities. Currently, GREAP supports 85 370 genomic region reference sets, which cover 634 681 107 regions across 11 different data types, including super enhancers, transcription factors, accessible chromatins, etc. GREAP provides widespread annotation and enrichment analysis of genomic regions. To reflect the significance of enrichment analysis, we used the hypergeometric test and also provided a Locus Overlap Analysis. In summary, GREAP is a powerful platform that provides many types of genomic region sets for users and supports genomic region annotations and enrichment analyses. In addition, we developed a customizable genome browser containing >400 000 000 customizable tracks for visualization. The platform is freely available at http://www.liclab.net/Greap/view/index.


Asunto(s)
Genómica , Programas Informáticos , Cromatina , Genoma Humano , Humanos , Anotación de Secuencia Molecular , Factores de Transcripción
2.
Bioinformatics ; 39(1)2023 01 01.
Artículo en Inglés | MEDLINE | ID: mdl-36477791

RESUMEN

MOTIVATION: DNA methylation within gene body and promoters in cancer cells is well documented. An increasing number of studies showed that cytosine-phosphate-guanine (CpG) sites falling within other regulatory elements could also regulate target gene activation, mainly by affecting transcription factors (TFs) binding in human cancers. This led to the urgent need for comprehensively and effectively collecting distinct cis-regulatory elements and TF-binding sites (TFBS) to annotate DNA methylation regulation. RESULTS: We developed a database (CanMethdb, http://meth.liclab.net/CanMethdb/) that focused on the upstream and downstream annotations for CpG-genes in cancers. This included upstream cis-regulatory elements, especially those involving distal regions to genes, and TFBS annotations for the CpGs and downstream functional annotations for the target genes, computed through integrating abundant DNA methylation and gene expression profiles in diverse cancers. Users could inquire CpG-target gene pairs for a cancer type through inputting a genomic region, a CpG, a gene name, or select hypo/hypermethylated CpG sets. The current version of CanMethdb documented a total of 38 986 060 CpG-target gene pairs (with 6 769 130 unique pairs), involving 385 217 CpGs and 18 044 target genes, abundant cis-regulatory elements and TFs for 33 TCGA cancer types. CanMethdb might help biologists perform in-depth studies of target gene regulations based on DNA methylations in cancer. AVAILABILITY AND IMPLEMENTATION: The main program is available at https://github.com/chunquanlipathway/CanMethdb. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Metilación de ADN , Neoplasias , Humanos , Factores de Transcripción/metabolismo , Genoma , Secuencias Reguladoras de Ácidos Nucleicos , Regiones Promotoras Genéticas , Neoplasias/genética , ADN/metabolismo , Islas de CpG
3.
Bioinformatics ; 39(4)2023 04 03.
Artículo en Inglés | MEDLINE | ID: mdl-37018146

RESUMEN

SUMMARY: We developed the eccDB database to integrate available resources for extrachromosomal circular DNA (eccDNA) data. eccDB is a comprehensive repository for storing, browsing, searching, and analyzing eccDNAs from multispecies. The database provides regulatory and epigenetic information on eccDNAs, with a focus on analyzing intrachromosomal and interchromosomal interactions to predict their transcriptional regulatory functions. Moreover, eccDB identifies eccDNAs from unknown DNA sequences and analyzes the functional and evolutionary relationships of eccDNAs among different species. Overall, eccDB offers web-based analytical tools and a comprehensive resource for biologists and clinicians to decipher the molecular regulatory mechanisms of eccDNAs. AVAILABILITY AND IMPLEMENTATION: eccDB is freely available at http://www.xiejjlab.bio/eccDB.


Asunto(s)
Cromatina , ADN Circular , Cromatina/genética , Cromosomas , ADN , Secuencia de Bases
4.
Brief Bioinform ; 22(2): 1929-1939, 2021 03 22.
Artículo en Inglés | MEDLINE | ID: mdl-32047897

RESUMEN

Long noncoding RNAs (lncRNAs) have been proven to play important roles in transcriptional processes and biological functions. With the increasing study of human diseases and biological processes, information in human H3K27ac ChIP-seq, ATAC-seq and DNase-seq datasets is accumulating rapidly, resulting in an urgent need to collect and process data to identify transcriptional regulatory regions of lncRNAs. We therefore developed a comprehensive database for human regulatory information of lncRNAs (TRlnc, http://bio.licpathway.net/TRlnc), which aimed to collect available resources of transcriptional regulatory regions of lncRNAs and to annotate and illustrate their potential roles in the regulation of lncRNAs in a cell type-specific manner. The current version of TRlnc contains 8 683 028 typical enhancers/super-enhancers and 32 348 244 chromatin accessibility regions associated with 91 906 human lncRNAs. These regions are identified from over 900 human H3K27ac ChIP-seq, ATAC-seq and DNase-seq samples. Furthermore, TRlnc provides the detailed genetic and epigenetic annotation information within transcriptional regulatory regions (promoter, enhancer/super-enhancer and chromatin accessibility regions) of lncRNAs, including common SNPs, risk SNPs, eQTLs, linkage disequilibrium SNPs, transcription factors, methylation sites, histone modifications and 3D chromatin interactions. It is anticipated that the use of TRlnc will help users to gain in-depth and useful insights into the transcriptional regulatory mechanisms of lncRNAs.


Asunto(s)
Bases de Datos Genéticas , ARN Largo no Codificante/genética , Secuencias Reguladoras de Ácidos Nucleicos , Transcripción Genética , Inmunoprecipitación de Cromatina , Elementos de Facilitación Genéticos , Humanos , Desequilibrio de Ligamiento , Metilación , Polimorfismo de Nucleótido Simple , Regiones Promotoras Genéticas , Sitios de Carácter Cuantitativo
5.
Nucleic Acids Res ; 49(D1): D1431-D1444, 2021 01 08.
Artículo en Inglés | MEDLINE | ID: mdl-33095866

RESUMEN

With the study of human diseases and biological processes increasing, a large number of non-coding variants have been identified and facilitated. The rapid accumulation of genetic and epigenomic information has resulted in an urgent need to collect and process data to explore the regulation of non-coding variants. Here, we developed a comprehensive variation annotation database for human (VARAdb, http://www.licpathway.net/VARAdb/), which specifically considers non-coding variants. VARAdb provides annotation information for 577,283,813 variations and novel variants, prioritizes variations based on scores using nine annotation categories, and supports pathway downstream analysis. Importantly, VARAdb integrates a large amount of genetic and epigenomic data into five annotation sections, which include 'Variation information', 'Regulatory information', 'Related genes', 'Chromatin accessibility' and 'Chromatin interaction'. The detailed annotation information consists of motif changes, risk SNPs, LD SNPs, eQTLs, clinical variant-drug-gene pairs, sequence conservation, somatic mutations, enhancers, super enhancers, promoters, transcription factors, chromatin states, histone modifications, chromatin accessibility regions and chromatin interactions. This database is a user-friendly interface to query, browse and visualize variations and related annotation information. VARAdb is a useful resource for selecting potential functional variations and interpreting their effects on human diseases and biological processes.


Asunto(s)
Enfermedad de Alzheimer/genética , Bases de Datos Genéticas , Diabetes Mellitus Tipo 2/genética , Variación Genética , Genoma Humano , Sitios de Carácter Cuantitativo , Enfermedad de Alzheimer/metabolismo , Enfermedad de Alzheimer/patología , Cromatina , Ensamble y Desensamble de Cromatina , Diabetes Mellitus Tipo 2/metabolismo , Diabetes Mellitus Tipo 2/patología , Elementos de Facilitación Genéticos , Humanos , Internet , Anotación de Secuencia Molecular , Polimorfismo de Nucleótido Simple , Regiones Promotoras Genéticas , Programas Informáticos
6.
Nucleic Acids Res ; 49(D1): D55-D64, 2021 01 08.
Artículo en Inglés | MEDLINE | ID: mdl-33125076

RESUMEN

Accessible chromatin is a highly informative structural feature for identifying regulatory elements, which provides a large amount of information about transcriptional activity and gene regulatory mechanisms. Human ATAC-seq datasets are accumulating rapidly, prompting an urgent need to comprehensively collect and effectively process these data. We developed a comprehensive human chromatin accessibility database (ATACdb, http://www.licpathway.net/ATACdb), with the aim of providing a large amount of publicly available resources on human chromatin accessibility data, and to annotate and illustrate potential roles in a tissue/cell type-specific manner. The current version of ATACdb documented a total of 52 078 883 regions from over 1400 ATAC-seq samples. These samples have been manually curated from over 2200 chromatin accessibility samples from NCBI GEO/SRA. To make these datasets more accessible to the research community, ATACdb provides a quality assurance process including four quality control (QC) metrics. ATACdb provides detailed (epi)genetic annotations in chromatin accessibility regions, including super-enhancers, typical enhancers, transcription factors (TFs), common single-nucleotide polymorphisms (SNPs), risk SNPs, eQTLs, LD SNPs, methylations, chromatin interactions and TADs. Especially, ATACdb provides accurate inference of TF footprints within chromatin accessibility regions. ATACdb is a powerful platform that provides the most comprehensive accessible chromatin data, QC, TF footprint and various other annotations.


Asunto(s)
Cromatina/genética , Biología Computacional/métodos , Bases de Datos Genéticas , Programas Informáticos , Cromatina/metabolismo , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Anotación de Secuencia Molecular , Análisis de Secuencia de ADN , Diseño de Software , Navegador Web
7.
Brief Bioinform ; 21(4): 1411-1424, 2020 07 15.
Artículo en Inglés | MEDLINE | ID: mdl-31350847

RESUMEN

With the increasing awareness of heterogeneity in cancers, better prediction of cancer prognosis is much needed for more personalized treatment. Recently, extensive efforts have been made to explore the variations in gene expression for better prognosis. However, the prognostic gene signatures predicted by most existing methods have little robustness among different datasets of the same cancer. To improve the robustness of the gene signatures, we propose a novel high-frequency sub-pathways mining approach (HiFreSP), integrating a randomization strategy with gene interaction pathways. We identified a six-gene signature (CCND1, CSF3R, E2F2, JUP, RARA and TCF7) in esophageal squamous cell carcinoma (ESCC) by HiFreSP. This signature displayed a strong ability to predict the clinical outcome of ESCC patients in two independent datasets (log-rank test, P = 0.0045 and 0.0087). To further show the predictive performance of HiFreSP, we applied it to two other cancers: pancreatic adenocarcinoma and breast cancer. The identified signatures show high predictive power in all testing datasets of the two cancers. Furthermore, compared with the two popular prognosis signature predicting methods, the least absolute shrinkage and selection operator penalized Cox proportional hazards model and the random survival forest, HiFreSP showed better predictive accuracy and generalization across all testing datasets of the above three cancers. Lastly, we applied HiFreSP to 8137 patients involving 20 cancer types in the TCGA database and found high-frequency prognosis-associated pathways in many cancers. Taken together, HiFreSP shows higher prognostic capability and greater robustness, and the identified signatures provide clinical guidance for cancer prognosis. HiFreSP is freely available via GitHub: https://github.com/chunquanlipathway/HiFreSP.


Asunto(s)
Perfilación de la Expresión Génica , Neoplasias/genética , Algoritmos , Humanos , Pronóstico
8.
Nucleic Acids Res ; 48(D1): D51-D57, 2020 01 08.
Artículo en Inglés | MEDLINE | ID: mdl-31665430

RESUMEN

Enhancers are a class of cis-regulatory elements that can increase gene transcription by forming loops in intergenic regions, introns and exons. Enhancers, as well as their associated target genes, and transcription factors (TFs) that bind to them, are highly associated with human disease and biological processes. Although some enhancer databases have been published, most only focus on enhancers identified by high-throughput experimental techniques. Therefore, it is highly desirable to construct a comprehensive resource of manually curated enhancers and their related information based on low-throughput experimental evidences. Here, we established a comprehensive manually-curated enhancer database for human and mouse, which provides a resource for experimentally supported enhancers, and to annotate the detailed information of enhancers. The current release of ENdb documents 737 experimentally validated enhancers and their related information, including 384 target genes, 263 TFs, 110 diseases and 153 functions in human and mouse. Moreover, the enhancer-related information was supported by experimental evidences, such as RNAi, in vitro knockdown, western blotting, qRT-PCR, luciferase reporter assay, chromatin conformation capture (3C) and chromosome conformation capture-on-chip (4C) assays. ENdb provides a user-friendly interface to query, browse and visualize the detailed information of enhancers. The database is available at http://www.licpathway.net/ENdb.


Asunto(s)
Biología Computacional/métodos , Bases de Datos Genéticas , Elementos de Facilitación Genéticos , Genómica/métodos , Animales , Humanos , Ratones , Programas Informáticos , Diseño de Software , Interfaz Usuario-Computador , Navegador Web
9.
Brief Bioinform ; 20(6): 2327-2333, 2019 11 27.
Artículo en Inglés | MEDLINE | ID: mdl-30184150

RESUMEN

In recent years, high-throughput genomic technologies like chromatin immunoprecipitation sequencing (ChIp-seq) and transcriptome sequencing (RNA-seq) have been becoming both more refined and less expensive, making them more accessible. Many circular RNAs (circRNAs) that originate from back-spliced exons have been identified in various cell lines across different species. However, the regulatory mechanism for transcription of circRNAs remains unclear. Therefore, there is an urgent need to construct a database detailing the transcriptional regulation of circRNAs. TRCirc (http://www.licpathway.net/TRCirc) provides a resource for efficient retrieval, browsing and visualization of transcriptional regulation information of circRNAs. The current version of TRCirc documents 92 375 circRNAs and 161 transcription factors (TFs) from more than 100 cell types and together represent more than 765 000 TF-circRNA regulatory relationships. Furthermore, TRCirc provides other regulatory information about transcription of circRNAs, including their expression, methylation levels, H3K27ac signals in regulation regions and super-enhancers associated with circRNAs. TRCirc provides a convenient, user-friendly interface to search, browse and visualize detailed information about these circRNAs.


Asunto(s)
Regulación de la Expresión Génica , ARN Circular/genética , Transcripción Genética , Bases de Datos Genéticas , Humanos , Almacenamiento y Recuperación de la Información
10.
Nucleic Acids Res ; 47(D1): D235-D243, 2019 01 08.
Artículo en Inglés | MEDLINE | ID: mdl-30371817

RESUMEN

Super-enhancers are important for controlling and defining the expression of cell-specific genes. With research on human disease and biological processes, human H3K27ac ChIP-seq datasets are accumulating rapidly, creating the urgent need to collect and process these data comprehensively and efficiently. More importantly, many studies showed that super-enhancer-associated single nucleotide polymorphisms (SNPs) and transcription factors (TFs) strongly influence human disease and biological processes. Here, we developed a comprehensive human super-enhancer database (SEdb, http://www.licpathway.net/sedb) that aimed to provide a large number of available resources on human super-enhancers. The database was annotated with potential functions of super-enhancers in the gene regulation. The current version of SEdb documented a total of 331 601 super-enhancers from 542 samples. Especially, unlike existing super-enhancer databases, we manually curated and classified 410 available H3K27ac samples from >2000 ChIP-seq samples from NCBI GEO/SRA. Furthermore, SEdb provides detailed genetic and epigenetic annotation information on super-enhancers. Information includes common SNPs, motif changes, expression quantitative trait locus (eQTL), risk SNPs, transcription factor binding sites (TFBSs), CRISPR/Cas9 target sites and Dnase I hypersensitivity sites (DHSs) for in-depth analyses of super-enhancers. SEdb will help elucidate super-enhancer-related functions and find potential biological effects.


Asunto(s)
Biología Computacional/métodos , Bases de Datos Genéticas , Elementos de Facilitación Genéticos , Genómica/métodos , Humanos , Almacenamiento y Recuperación de la Información , Anotación de Secuencia Molecular , Programas Informáticos , Diseño de Software , Interfaz Usuario-Computador , Navegador Web
11.
Nucleic Acids Res ; 47(W1): W248-W255, 2019 07 02.
Artículo en Inglés | MEDLINE | ID: mdl-31028388

RESUMEN

Super-enhancers (SEs) have prominent roles in biological and pathological processes through their unique transcriptional regulatory capability. To date, several SE databases have been developed by us and others. However, these existing databases do not provide downstream or upstream regulatory analyses of SEs. Pathways, transcription factors (TFs), SEs, and SE-associated genes form complex regulatory networks. Therefore, we designed a novel web server, SEanalysis, which provides comprehensive SE-associated regulatory network analyses. SEanalysis characterizes SE-associated genes, TFs binding to target SEs, and their upstream pathways. The current version of SEanalysis contains more than 330 000 SEs from more than 540 types of cells/tissues, 5042 TF ChIP-seq data generated from these cells/tissues, DNA-binding sequence motifs for ∼700 human TFs and 2880 pathways from 10 databases. SEanalysis supports searching by either SEs, samples, TFs, pathways or genes. The complex regulatory networks formed by these factors can be interactively visualized. In addition, we developed a customizable genome browser containing >6000 customizable tracks for visualization. The server is freely available at http://licpathway.net/SEanalysis.


Asunto(s)
Bases de Datos Genéticas , Elementos de Facilitación Genéticos/genética , Regulación de la Expresión Génica/genética , Redes Reguladoras de Genes/genética , Programas Informáticos , Sitios de Unión/genética , Humanos , Internet , Factores de Transcripción/genética
12.
J Cell Mol Med ; 23(2): 967-984, 2019 02.
Artículo en Inglés | MEDLINE | ID: mdl-30421585

RESUMEN

Competing endogenous RNAs (ceRNAs) represent a novel mechanism of gene regulation that may mediate key subpathway regions and contribute to the altered activities of pathways. However, the classical methods used to identify pathways fail to specifically consider ceRNAs within the pathways and key regions impacted by them. We proposed a powerful strategy named ce-Subpathway for the identification of ceRNA-mediated functional subpathways. It provided an effective level of pathway analysis via integrating ceRNAs, differentially expressed (DE) genes and their key regions within the given pathways. We respectively analysed one pulmonary arterial hypertension (PAH) and one myocardial infarction (MI) data sets and demonstrated that ce-Subpathway could identify many subpathways whose corresponding entire pathways were ignored by those non-ceRNA-mediated pathway identification methods. And these pathways have been well reported to be associated with PAH/MI-related cardiovascular diseases. Further evidence showed reliability of ceRNA interactions and robustness/reproducibility of the ce-Subpathway strategy by several data sets of different cancers, including breast cancer, oesophageal cancer and colon cancer. Survival analysis was finally applied to illustrate the clinical application value of the ceRNA-mediated functional subpathways using another data sets of pancreatic cancer. Comprehensive analyses have shown the power of a joint ceRNAs/DE genes and subpathway strategy based on their topologies.


Asunto(s)
ARN/genética , Transducción de Señal/genética , Regulación Neoplásica de la Expresión Génica/genética , Humanos , Infarto del Miocardio/genética , Neoplasias/genética , Hipertensión Arterial Pulmonar/genética , Reproducibilidad de los Resultados
13.
J Cell Mol Med ; 22(2): 892-903, 2018 02.
Artículo en Inglés | MEDLINE | ID: mdl-29154475

RESUMEN

Cardiac hypertrophy (CH) is a common disease that originates from long-term heart pressure overload and finally leads to heart failure. Recently, long non-coding RNAs (lncRNAs) have attracted attention because they have broad and crucial functions in regulating complex biological processes. Some studies had found that lncRNAs play vital roles in complex cardiovascular diseases. However, the function and mechanism of lncRNAs in CH have not been elucidated. In our study, to investigate the potential roles of lncRNAs in CH, the Cardiac Hypertrophy-associated LncRNAs-Protein coding genes Network (CHLPN) was constructed by integrating gene microarray re-annotation and subpathway enrichment analyses. After performing random walking with restart in CHLPN, we predicted 21 significant risk lncRNAs, of which 7 (Kis2, 1700110K17Rik, Gm17501, E330017L17Rik, C630043F03Rik, Gm9866 and Ube4bos1) formed a close module with their co-expressed protein-coding genes (PCGs). We found that the module might play crucial roles in the development of CH. In particular, 44 PCGs that were co-expressed with six lncRNAs were enriched in CH-related biological processes and pathways. We also found that some lncRNAs participated in the competitive endogenous RNA cross-talk that might be involved in CH. These results indicate that the functional lncRNAs are related to post-transcriptional regulation and could shed light on a new molecular diagnostic target of CH.


Asunto(s)
Cardiomegalia/genética , ARN Largo no Codificante/genética , Animales , Análisis por Conglomerados , Regulación de la Expresión Génica , Redes Reguladoras de Genes , Ratones , ARN Largo no Codificante/metabolismo , ARN Mensajero/genética , ARN Mensajero/metabolismo
14.
J Biomed Inform ; 54: 132-40, 2015 Apr.
Artículo en Inglés | MEDLINE | ID: mdl-25724580

RESUMEN

One of the challenging problems in drug discovery is to identify the novel targets for drugs. Most of the traditional methods for drug targets optimization focused on identifying the particular families of "druggable targets", but ignored their topological properties based on the biological pathways. In this study, we characterized the topological properties of human anticancer drug targets (ADTs) in the context of biological pathways. We found that the ADTs tended to present the following seven topological properties: influence the number of the pathways related to cancer, be localized at the start or end of the pathways, interact with cancer related genes, exhibit higher connectivity, vulnerability, betweenness, and closeness than other genes. We first ranked ADTs based on their topological property values respectively, then fused them into one global-rank using the joint cumulative distribution of an N-dimensional order statistic to optimize human ADTs. We applied the optimization method to 13 anticancer drugs, respectively. Results demonstrated that over 70% of known ADTs were ranked in the top 20%. Furthermore, the performance for mercaptopurine was significant: 6 known targets (ADSL, GMPR2, GMPR, HPRT1, AMPD3, AMPD2) were ranked in the top 15 and other four out of the top 15 (MAT2A, CDKN1A, AREG, JUN) have the potentialities to become new targets for cancer therapy.


Asunto(s)
Antineoplásicos/farmacología , Antineoplásicos/uso terapéutico , Biología Computacional/métodos , Descubrimiento de Drogas/métodos , Neoplasias/tratamiento farmacológico , Neoplasias/mortalidad , Transducción de Señal/efectos de los fármacos , Bases de Datos Factuales , Humanos
15.
Database (Oxford) ; 20222020 Jan 17.
Artículo en Inglés | MEDLINE | ID: mdl-35134148

RESUMEN

Accessible chromatin refers to the active regions of a chromosome that are bound by many transcription factors (TFs). Changes in chromatin accessibility play a critical role in tumorigenesis. With the emergence of novel methods like Assay for Transposase-accessible Chromatin Sequencing, a sequencing method that maps chromatin-accessible regions (CARs) and enables the computational analysis of TF binding at chromatin-accessible sites, the regulatory landscape in cancer can be dissected. Herein, we developed a comprehensive cancer chromatin accessibility database named CATA, which aims to provide available resources of cancer CARs and to annotate their potential roles in the regulation of genes in a cancer type-specific manner. In this version, CATA stores 2 991 163 CARs from 23 cancer types, binding information of 1398 TFs within the CARs, and provides multiple annotations about these regions, including common single nucleotide polymorphisms (SNPs), risk SNPs, copy number variation, somatic mutations, motif changes, expression quantitative trait loci, methylation and CRISPR/Cas9 target loci. Moreover, CATA supports cancer survival analysis of the CAR-associated genes and provides detailed clinical information of the tumor samples. Database URL: CATA is available at http://www.xiejjlab.bio/cata/.

16.
Front Genet ; 11: 590672, 2020.
Artículo en Inglés | MEDLINE | ID: mdl-33569079

RESUMEN

Circular RNAs (circRNAs) are evolutionarily conserved and abundant non-coding RNAs whose functions and regulatory mechanisms remain largely unknown. Here, we identify and characterize an epigenomically distinct group of circRNAs (TAH-circRNAs), which are transcribed to a higher level than their host genes. By integrative analysis of cistromic and transcriptomic data, we find that compared with other circRNAs, TAH-circRNAs are expressed more abundantly and have more transcription factors (TFs) binding sites and lower DNA methylation levels. Concordantly, TAH-circRNAs are enriched in open and active chromatin regions. Importantly, ChIA-PET results showed that 23-52% of transcription start sites (TSSs) of TAH-circRNAs have direct interactions with cis-regulatory regions, strongly suggesting their independent transcriptional regulation from host genes. In addition, we characterize molecular features of super-enhancer-driven circRNAs in cancer biology. Together, this study comprehensively analyzes epigenomic characteristics of circRNAs and identifies a distinct group of TAH-circRNAs that are independently transcribed via enhancers and super-enhancers by TFs. These findings substantially advance our understanding of the regulatory mechanism of circRNAs and may have important implications for future investigations of this class of non-coding RNAs.

17.
Front Genet ; 11: 606940, 2020.
Artículo en Inglés | MEDLINE | ID: mdl-33362865

RESUMEN

BACKGROUND: Pancreatic cancer (PC) remains one of the most lethal cancers. In contrast to the steady increase in survival for most cancers, the 5-year survival remains low for PC patients. METHODS: We describe a new pipeline that can be used to identify prognostic molecular biomarkers by identifying miRNA-mediated subpathways associated with PC. These modules were then further extracted from a comprehensive miRNA-gene network (CMGN). An exhaustive survival analysis was performed to estimate the prognostic value of these modules. RESULTS: We identified 105 miRNA-mediated subpathways associated with PC. Two subpathways within the MAPK signaling and cell cycle pathways were found to be highly related to PC. Of the miRNA-mRNA modules extracted from CMGN, six modules showed good prognostic performance in both independent validated datasets. CONCLUSIONS: Our study provides novel insight into the mechanisms of PC. We inferred that six miRNA-mRNA modules could serve as potential prognostic molecular biomarkers in PC based on the pipeline we proposed.

18.
Mol Omics ; 15(2): 150-163, 2019 04 01.
Artículo en Inglés | MEDLINE | ID: mdl-30916068

RESUMEN

Growing evidence shows that circular RNAs (circRNAs) play important roles in physiological and pathological processes, but our knowledge about the function of circRNAs in diseases is still limited. CircRNA functions are closely related to their expression levels. We developed a probe reannotating program named ReCirc, which is based on sequence alignment between microarray probes and circRNAs, to reannotate circRNAs from non-circRNA microarrays (any microarray that was not designed to profile circRNAs) with microarray probe sequences that were aligned to the body and back-spliced junction sequences of circRNAs to identify circRNAs. Through ReCirc, we obtained 39 818 reannotated probe set-circRNA pairs, which involved 5388 circRNAs, from an Affymetrix human exon array. We evaluated our method by comparing circRNAs obtained by us with golden standard RNase R-resistant (RNase R+) circRNAs, predicted by an RNA-seq-based method find_circ, in the HeLa cell line. The results showed that ReCirc circRNAs, especially those with higher expression level, were partially present in RNase R+ data. In addition to RNA-seq, a circRNA microarray, such as the Agilent-069978 Arraystar Human CircRNA microarray, was also applied to predict and profile circRNAs. Thus, we compared the circRNA profile obtained from ReCirc with that from the circRNA microarray. The results showed that circRNA expression is similar between ReCirc and circRNA microarray in samples from the same tissue. We also evaluated ReCirc, by comparing ReCirc with the find_circ program, in their abilities to compute circRNA expression variation in multiple cell lines and performed molecular verification in the HeLaS3 cell line for those circRNAs that got good performance. As a result, 5 of the 9 randomly selected circRNAs were successfully verified. Functional analysis of identified circRNAs in 4 different cancers indicated that circRNAs may be crucial biomarkers for cancer diagnosis and prognosis. Thus, ReCirc allows us to identify circRNAs from any non-circRNA microarray, and to back-annotate old microarray data from public data sets, which would facilitate re-utilization of the wealth of microarray data sets, to enable the characterization of circRNAs in tissues and cell lines. Here we state that our method is designed only for microarrays and cannot be used for RNAseq data.


Asunto(s)
Bases de Datos de Ácidos Nucleicos , Neoplasias/genética , ARN/genética , Programas Informáticos , Biomarcadores/análisis , Línea Celular , Células HeLa , Humanos , Anotación de Secuencia Molecular , Neoplasias/diagnóstico , Análisis de Secuencia por Matrices de Oligonucleótidos , Pronóstico , ARN/metabolismo , ARN Circular , Alineación de Secuencia
19.
Oncotarget ; 8(31): 51134-51150, 2017 Aug 01.
Artículo en Inglés | MEDLINE | ID: mdl-28881636

RESUMEN

Although systematic studies have identified a host of long non-coding RNAs (lncRNAs) which are involved in breast cancer, the knowledge about the methyla-tion-mediated dysregulation of those lncRNAs remains limited. Here, we integrated multi-omics data to analyze the methylated alteration of lncRNAs in breast invasive carcinoma (BRCA). We found that lncRNAs showed diverse methylation patterns on promoter regions in BRCA. LncRNAs were divided into two categories and four subcategories based on their promoter methylation patterns and expression levels be-tween tumor and normal samples. Through cis-regulatory analysis and gene ontology network, abnormally methylated lncRNAs were identified to be associated with can-cer regulation, proliferation or expression of transcription factors. Competing endog-enous RNA network and functional enrichment analysis of abnormally methylated lncRNAs showed that lncRNAs with different methylation patterns were involved in several hallmarks and KEGG pathways of cancers significantly. Finally, survival analysis based on mRNA modules in networks revealed that lncRNAs silenced by high methylation were associated with prognosis significantly in BRCA. This study enhances the understanding of aberrantly methylated patterns of lncRNAs and pro-vides a novel insight for identifying cancer biomarkers and potential therapeutic tar-gets in breast cancer.

20.
J Hypertens ; 35(4): 798-809, 2017 04.
Artículo en Inglés | MEDLINE | ID: mdl-28079595

RESUMEN

BACKGROUND: Autophagy is a major intracellular degradation and recycling process that maintains cellular homeostasis, which is involved in structural and functional abnormalities of pulmonary vasculature in hypoxic pulmonary arterial hypertension (HPAH). Cyclophilin A (CyPA) is a secreted, oxidative stress-induced factor. Its role in inducing autophagy and augmenting endothelial cell dysfunction has never been explored. METHODS: Lungs from rats exposed to chronic hypoxia were examined for autophagy with electron microscopy, western blotting, and fluorescence microscopy. RESULTS: Activated autophagy was seen in the endothelium of the pulmonary artery from experimental rat models of HPAH and cultured bovine pulmonary arterial endothelial cells under hypoxia. Inhibiting autophagy attenuated the pathological progression of HPAH and repressed endothelial cell migration and angiogenesis. We also showed that CyPA was upregulated and acetylated under hypoxia and led to the abnormal occurrence of autophagy through its interaction with autophagy protein 5 and autophagy protein 7. Moreover, acetylated CyPA was essential for the excessive proliferation, migration, and tube formation networks of pulmonary arterial endothelial cells. CONCLUSION: Our results indicate the crucial role of acetylated CyPA in the abnormal occurrence of autophagy and subsequent pulmonary vascular angiogenesis.


Asunto(s)
Autofagia , Ciclofilina A/metabolismo , Endotelio/fisiopatología , Hipertensión Pulmonar/metabolismo , Neovascularización Patológica/metabolismo , Acetilación , Animales , Proteína 5 Relacionada con la Autofagia/metabolismo , Proteína 7 Relacionada con la Autofagia/metabolismo , Bovinos , Movimiento Celular , Células Cultivadas , Citoprotección , Células Endoteliales/metabolismo , Endotelio/metabolismo , Hipertensión Pulmonar/fisiopatología , Hipoxia/fisiopatología , Masculino , Músculo Liso Vascular/metabolismo , Arteria Pulmonar , Ratas
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA