Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 26
Filtrar
1.
Nucleic Acids Res ; 52(D1): D183-D193, 2024 Jan 05.
Artículo en Inglés | MEDLINE | ID: mdl-37956336

RESUMEN

Transcription factors (TFs), transcription co-factors (TcoFs) and their target genes perform essential functions in diseases and biological processes. KnockTF 2.0 (http://www.licpathway.net/KnockTF/index.html) aims to provide comprehensive gene expression profile datasets before/after T(co)F knockdown/knockout across multiple tissue/cell types of different species. Compared with KnockTF 1.0, KnockTF 2.0 has the following improvements: (i) Newly added T(co)F knockdown/knockout datasets in mice, Arabidopsis thaliana and Zea mays and also an expanded scale of datasets in humans. Currently, KnockTF 2.0 stores 1468 manually curated RNA-seq and microarray datasets associated with 612 TFs and 172 TcoFs disrupted by different knockdown/knockout techniques, which are 2.5 times larger than those of KnockTF 1.0. (ii) Newly added (epi)genetic annotations for T(co)F target genes in humans and mice, such as super-enhancers, common SNPs, methylation sites and chromatin interactions. (iii) Newly embedded and updated search and analysis tools, including T(co)F Enrichment (GSEA), Pathway Downstream Analysis and Search by Target Gene (BLAST). KnockTF 2.0 is a comprehensive update of KnockTF 1.0, which provides more T(co)F knockdown/knockout datasets and (epi)genetic annotations across multiple species than KnockTF 1.0. KnockTF 2.0 facilitates not only the identification of functional T(co)Fs and target genes but also the investigation of their roles in the physiological and pathological processes.


Asunto(s)
Bases de Datos Genéticas , Factores de Transcripción , Transcriptoma , Animales , Humanos , Ratones , Factores de Transcripción/genética , Factores de Transcripción/metabolismo , Internet , Marcación de Gen , Arabidopsis , Zea mays
2.
Nucleic Acids Res ; 52(D1): D285-D292, 2024 Jan 05.
Artículo en Inglés | MEDLINE | ID: mdl-37897340

RESUMEN

Chromatin accessibility profiles at single cell resolution can reveal cell type-specific regulatory programs, help dissect highly specialized cell functions and trace cell origin and evolution. Accurate cell type assignment is critical for effectively gaining biological and pathological insights, but is difficult in scATAC-seq. Hence, by extensively reviewing the literature, we designed scATAC-Ref (https://bio.liclab.net/scATAC-Ref/), a manually curated scATAC-seq database aimed at providing a comprehensive, high-quality source of chromatin accessibility profiles with known cell labels across broad cell types. Currently, scATAC-Ref comprises 1 694 372 cells with known cell labels, across various biological conditions, >400 cell/tissue types and five species. We used uniform system environment and software parameters to perform comprehensive downstream analysis on these chromatin accessibility profiles with known labels, including gene activity score, TF enrichment score, differential chromatin accessibility regions, pathway/GO term enrichment analysis and co-accessibility interactions. The scATAC-Ref also provided a user-friendly interface to query, browse and visualize cell types of interest, thereby providing a valuable resource for exploring epigenetic regulation in different tissues and cell types.


Asunto(s)
Secuenciación de Inmunoprecipitación de Cromatina , Cromatina , Bases de Datos Genéticas , Análisis de la Célula Individual , Cromatina/genética , Epigénesis Genética , Humanos , Animales
3.
Nucleic Acids Res ; 51(D1): D280-D290, 2023 01 06.
Artículo en Inglés | MEDLINE | ID: mdl-36318264

RESUMEN

Super-enhancers (SEs) are cell-specific DNA cis-regulatory elements that can supervise the transcriptional regulation processes of downstream genes. SEdb 2.0 (http://www.licpathway.net/sedb) aims to provide a comprehensive SE resource and annotate their potential roles in gene transcriptions. Compared with SEdb 1.0, we have made the following improvements: (i) Newly added the mouse SEs and expanded the scale of human SEs. SEdb 2.0 contained 1 167 518 SEs from 1739 human H3K27ac chromatin immunoprecipitation sequencing (ChIP-seq) samples and 550 226 SEs from 931 mouse H3K27ac ChIP-seq samples, which was five times that of SEdb 1.0. (ii) Newly added transcription factor binding sites (TFBSs) in SEs identified by TF motifs and TF ChIP-seq data. (iii) Added comprehensive (epi)genetic annotations of SEs, including chromatin accessibility regions, methylation sites, chromatin interaction regions and topologically associating domains (TADs). (iv) Newly embedded and updated search and analysis tools, including 'Search SE by TF-based', 'Differential-Overlapping-SE analysis' and 'SE-based TF-Gene analysis'. (v) Newly provided quality control (QC) metrics for ChIP-seq processing. In summary, SEdb 2.0 is a comprehensive update of SEdb 1.0, which curates more SEs and annotation information than SEdb 1.0. SEdb 2.0 provides a friendly platform for researchers to more comprehensively clarify the important role of SEs in the biological process.


Asunto(s)
Bases de Datos Genéticas , Elementos de Facilitación Genéticos , Animales , Humanos , Ratones , Cromatina/genética , Regulación de la Expresión Génica , Factores de Transcripción/genética , Factores de Transcripción/metabolismo
4.
Nucleic Acids Res ; 51(W1): W520-W527, 2023 07 05.
Artículo en Inglés | MEDLINE | ID: mdl-37194711

RESUMEN

Super-enhancers (SEs) play an essential regulatory role in various biological processes and diseases through their specific interaction with transcription factors (TFs). Here, we present the release of SEanalysis 2.0 (http://licpathway.net/SEanalysis), an updated version of the SEanalysis web server for the comprehensive analyses of transcriptional regulatory networks formed by SEs, pathways, TFs, and genes. The current version added mouse SEs and further expanded the scale of human SEs, documenting 1 167 518 human SEs from 1739 samples and 550 226 mouse SEs from 931 samples. The SE-related samples in SEanalysis 2.0 were more than five times that in version 1.0, which significantly improved the ability of original SE-related network analyses ('pathway downstream analysis', 'upstream regulatory analysis' and 'genomic region annotation') for understanding context-specific gene regulation. Furthermore, we designed two novel analysis models, 'TF regulatory analysis' and 'Sample comparative analysis' for supporting more comprehensive analyses of SE regulatory networks driven by TFs. Further, the risk SNPs were annotated to the SE regions to provide potential SE-related disease/trait information. Hence, we believe that SEanalysis 2.0 has significantly expanded the data and analytical capabilities of SEs, which helps researchers in an in-depth understanding of the regulatory mechanisms of SEs.


Asunto(s)
Elementos de Facilitación Genéticos , Redes Reguladoras de Genes , Programas Informáticos , Factores de Transcripción , Animales , Humanos , Ratones , Regulación de la Expresión Génica , Genómica , Factores de Transcripción/genética , Factores de Transcripción/metabolismo
5.
Brief Bioinform ; 23(5)2022 09 20.
Artículo en Inglés | MEDLINE | ID: mdl-35959979

RESUMEN

The rapid development of genomic high-throughput sequencing has identified a large number of DNA regulatory elements with abundant epigenetics markers, which promotes the rapid accumulation of functional genomic region data. The comprehensively understanding and research of human functional genomic regions is still a relatively urgent work at present. However, the existing analysis tools lack extensive annotation and enrichment analytical abilities for these regions. Here, we designed a novel software, Genomic Region sets Enrichment Analysis Platform (GREAP), which provides comprehensive region annotation and enrichment analysis capabilities. Currently, GREAP supports 85 370 genomic region reference sets, which cover 634 681 107 regions across 11 different data types, including super enhancers, transcription factors, accessible chromatins, etc. GREAP provides widespread annotation and enrichment analysis of genomic regions. To reflect the significance of enrichment analysis, we used the hypergeometric test and also provided a Locus Overlap Analysis. In summary, GREAP is a powerful platform that provides many types of genomic region sets for users and supports genomic region annotations and enrichment analyses. In addition, we developed a customizable genome browser containing >400 000 000 customizable tracks for visualization. The platform is freely available at http://www.liclab.net/Greap/view/index.


Asunto(s)
Genómica , Programas Informáticos , Cromatina , Genoma Humano , Humanos , Anotación de Secuencia Molecular , Factores de Transcripción
6.
Bioinformatics ; 39(1)2023 01 01.
Artículo en Inglés | MEDLINE | ID: mdl-36477791

RESUMEN

MOTIVATION: DNA methylation within gene body and promoters in cancer cells is well documented. An increasing number of studies showed that cytosine-phosphate-guanine (CpG) sites falling within other regulatory elements could also regulate target gene activation, mainly by affecting transcription factors (TFs) binding in human cancers. This led to the urgent need for comprehensively and effectively collecting distinct cis-regulatory elements and TF-binding sites (TFBS) to annotate DNA methylation regulation. RESULTS: We developed a database (CanMethdb, http://meth.liclab.net/CanMethdb/) that focused on the upstream and downstream annotations for CpG-genes in cancers. This included upstream cis-regulatory elements, especially those involving distal regions to genes, and TFBS annotations for the CpGs and downstream functional annotations for the target genes, computed through integrating abundant DNA methylation and gene expression profiles in diverse cancers. Users could inquire CpG-target gene pairs for a cancer type through inputting a genomic region, a CpG, a gene name, or select hypo/hypermethylated CpG sets. The current version of CanMethdb documented a total of 38 986 060 CpG-target gene pairs (with 6 769 130 unique pairs), involving 385 217 CpGs and 18 044 target genes, abundant cis-regulatory elements and TFs for 33 TCGA cancer types. CanMethdb might help biologists perform in-depth studies of target gene regulations based on DNA methylations in cancer. AVAILABILITY AND IMPLEMENTATION: The main program is available at https://github.com/chunquanlipathway/CanMethdb. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Metilación de ADN , Neoplasias , Humanos , Factores de Transcripción/metabolismo , Genoma , Secuencias Reguladoras de Ácidos Nucleicos , Regiones Promotoras Genéticas , Neoplasias/genética , ADN/metabolismo , Islas de CpG
7.
Nucleic Acids Res ; 50(D1): D402-D412, 2022 01 07.
Artículo en Inglés | MEDLINE | ID: mdl-34986601

RESUMEN

Transcription factors (TFs) play key roles in biological processes and are usually used as cell markers. The emerging importance of TFs and related markers in identifying specific cell types in human diseases increases the need for a comprehensive collection of human TFs and related markers sets. Here, we developed the TF-Marker database (TF-Marker, http://bio.liclab.net/TF-Marker/), aiming to provide cell/tissue-specific TFs and related markers for human. By manually curating thousands of published literature, 5905 entries including information about TFs and related markers were classified into five types according to their functions: (i) TF: TFs which regulate expression of the markers; (ii) T Marker: markers which are regulated by the TF; (iii) I Marker: markers which influence the activity of TFs; (iv) TFMarker: TFs which play roles as markers and (v) TF Pmarker: TFs which play roles as potential markers. The 5905 entries of TF-Marker include 1316 TFs, 1092 T Markers, 473 I Markers, 1600 TFMarkers and 1424 TF Pmarkers, involving 383 cell types and 95 tissue types in human. TF-Marker further provides a user-friendly interface to browse, query and visualize the detailed information about TFs and related markers. We believe TF-Marker will become a valuable resource to understand the regulation patterns of different tissues and cells.


Asunto(s)
Bases de Datos Genéticas , Neoplasias/genética , Programas Informáticos , Factores de Transcripción/genética , Transcripción Genética , Huesos/química , Huesos/metabolismo , Encéfalo/metabolismo , Colon/química , Colon/metabolismo , Femenino , Regulación de la Expresión Génica , Marcadores Genéticos , Humanos , Internet , Hígado/química , Hígado/metabolismo , Pulmón/química , Pulmón/metabolismo , Masculino , Glándulas Mamarias Humanas/química , Glándulas Mamarias Humanas/metabolismo , Anotación de Secuencia Molecular , Neoplasias/metabolismo , Neoplasias/patología , Especificidad de Órganos , Próstata/química , Próstata/metabolismo , Factores de Transcripción/clasificación , Factores de Transcripción/metabolismo
8.
Brief Bioinform ; 22(2): 1929-1939, 2021 03 22.
Artículo en Inglés | MEDLINE | ID: mdl-32047897

RESUMEN

Long noncoding RNAs (lncRNAs) have been proven to play important roles in transcriptional processes and biological functions. With the increasing study of human diseases and biological processes, information in human H3K27ac ChIP-seq, ATAC-seq and DNase-seq datasets is accumulating rapidly, resulting in an urgent need to collect and process data to identify transcriptional regulatory regions of lncRNAs. We therefore developed a comprehensive database for human regulatory information of lncRNAs (TRlnc, http://bio.licpathway.net/TRlnc), which aimed to collect available resources of transcriptional regulatory regions of lncRNAs and to annotate and illustrate their potential roles in the regulation of lncRNAs in a cell type-specific manner. The current version of TRlnc contains 8 683 028 typical enhancers/super-enhancers and 32 348 244 chromatin accessibility regions associated with 91 906 human lncRNAs. These regions are identified from over 900 human H3K27ac ChIP-seq, ATAC-seq and DNase-seq samples. Furthermore, TRlnc provides the detailed genetic and epigenetic annotation information within transcriptional regulatory regions (promoter, enhancer/super-enhancer and chromatin accessibility regions) of lncRNAs, including common SNPs, risk SNPs, eQTLs, linkage disequilibrium SNPs, transcription factors, methylation sites, histone modifications and 3D chromatin interactions. It is anticipated that the use of TRlnc will help users to gain in-depth and useful insights into the transcriptional regulatory mechanisms of lncRNAs.


Asunto(s)
Bases de Datos Genéticas , ARN Largo no Codificante/genética , Secuencias Reguladoras de Ácidos Nucleicos , Transcripción Genética , Inmunoprecipitación de Cromatina , Elementos de Facilitación Genéticos , Humanos , Desequilibrio de Ligamiento , Metilación , Polimorfismo de Nucleótido Simple , Regiones Promotoras Genéticas , Sitios de Carácter Cuantitativo
9.
Nucleic Acids Res ; 49(D1): D969-D980, 2021 01 08.
Artículo en Inglés | MEDLINE | ID: mdl-33045741

RESUMEN

Long non-coding RNAs (lncRNAs) have been proven to play important roles in transcriptional processes and various biological functions. Establishing a comprehensive collection of human lncRNA sets is urgent work at present. Using reference lncRNA sets, enrichment analyses will be useful for analyzing lncRNA lists of interest submitted by users. Therefore, we developed a human lncRNA sets database, called LncSEA, which aimed to document a large number of available resources for human lncRNA sets and provide annotation and enrichment analyses for lncRNAs. LncSEA supports >40 000 lncRNA reference sets across 18 categories and 66 sub-categories, and covers over 50 000 lncRNAs. We not only collected lncRNA sets based on downstream regulatory data sources, but also identified a large number of lncRNA sets regulated by upstream transcription factors (TFs) and DNA regulatory elements by integrating TF ChIP-seq, DNase-seq, ATAC-seq and H3K27ac ChIP-seq data. Importantly, LncSEA provides annotation and enrichment analyses of lncRNA sets associated with upstream regulators and downstream targets. In summary, LncSEA is a powerful platform that provides a variety of types of lncRNA sets for users, and supports lncRNA annotations and enrichment analyses. The LncSEA database is freely accessible at http://bio.liclab.net/LncSEA/index.php.


Asunto(s)
Biología Computacional/métodos , Bases de Datos Genéticas , Regulación de la Expresión Génica , ARN Largo no Codificante/genética , Secuencias Reguladoras de Ácidos Nucleicos/genética , Factores de Transcripción/genética , Minería de Datos/métodos , Humanos , Internet , Anotación de Secuencia Molecular/métodos , Análisis de Secuencia de ARN/métodos , Interfaz Usuario-Computador
10.
Brief Bioinform ; 21(4): 1411-1424, 2020 07 15.
Artículo en Inglés | MEDLINE | ID: mdl-31350847

RESUMEN

With the increasing awareness of heterogeneity in cancers, better prediction of cancer prognosis is much needed for more personalized treatment. Recently, extensive efforts have been made to explore the variations in gene expression for better prognosis. However, the prognostic gene signatures predicted by most existing methods have little robustness among different datasets of the same cancer. To improve the robustness of the gene signatures, we propose a novel high-frequency sub-pathways mining approach (HiFreSP), integrating a randomization strategy with gene interaction pathways. We identified a six-gene signature (CCND1, CSF3R, E2F2, JUP, RARA and TCF7) in esophageal squamous cell carcinoma (ESCC) by HiFreSP. This signature displayed a strong ability to predict the clinical outcome of ESCC patients in two independent datasets (log-rank test, P = 0.0045 and 0.0087). To further show the predictive performance of HiFreSP, we applied it to two other cancers: pancreatic adenocarcinoma and breast cancer. The identified signatures show high predictive power in all testing datasets of the two cancers. Furthermore, compared with the two popular prognosis signature predicting methods, the least absolute shrinkage and selection operator penalized Cox proportional hazards model and the random survival forest, HiFreSP showed better predictive accuracy and generalization across all testing datasets of the above three cancers. Lastly, we applied HiFreSP to 8137 patients involving 20 cancer types in the TCGA database and found high-frequency prognosis-associated pathways in many cancers. Taken together, HiFreSP shows higher prognostic capability and greater robustness, and the identified signatures provide clinical guidance for cancer prognosis. HiFreSP is freely available via GitHub: https://github.com/chunquanlipathway/HiFreSP.


Asunto(s)
Perfilación de la Expresión Génica , Neoplasias/genética , Algoritmos , Humanos , Pronóstico
11.
Nucleic Acids Res ; 48(D1): D93-D100, 2020 01 08.
Artículo en Inglés | MEDLINE | ID: mdl-31598675

RESUMEN

Transcription factors (TFs) and their target genes have important functions in human diseases and biological processes. Gene expression profile analysis before and after knockdown or knockout is one of the most important strategies for obtaining target genes of TFs and exploring TF functions. Human gene expression profile datasets with TF knockdown and knockout are accumulating rapidly. Based on the urgent need to comprehensively and effectively collect and process these data, we developed KnockTF (http://www.licpathway.net/KnockTF/index.html), a comprehensive human gene expression profile database of TF knockdown and knockout. KnockTF provides a number of resources for human gene expression profile datasets associated with TF knockdown and knockout and annotates TFs and their target genes in a tissue/cell type-specific manner. The current version of KnockTF has 570 manually curated RNA-seq and microarray datasets associated with 308 TFs disrupted by different knockdown and knockout techniques and across multiple tissue/cell types. KnockTF collects upstream pathway information of TFs and functional annotation results of downstream target genes. It provides details about TFs binding to promoters, super-enhancers and typical enhancers of target genes. KnockTF constructs a TF-differentially expressed gene network and performs network analyses for genes of interest. KnockTF will help elucidate TF-related functions and potential biological effects.


Asunto(s)
Biología Computacional/métodos , Bases de Datos Genéticas , Perfilación de la Expresión Génica/métodos , Técnicas de Silenciamiento del Gen , Programas Informáticos , Factores de Transcripción/genética , Humanos , Anotación de Secuencia Molecular , Factores de Transcripción/metabolismo , Interfaz Usuario-Computador , Navegador Web
12.
Brief Bioinform ; 20(6): 2327-2333, 2019 11 27.
Artículo en Inglés | MEDLINE | ID: mdl-30184150

RESUMEN

In recent years, high-throughput genomic technologies like chromatin immunoprecipitation sequencing (ChIp-seq) and transcriptome sequencing (RNA-seq) have been becoming both more refined and less expensive, making them more accessible. Many circular RNAs (circRNAs) that originate from back-spliced exons have been identified in various cell lines across different species. However, the regulatory mechanism for transcription of circRNAs remains unclear. Therefore, there is an urgent need to construct a database detailing the transcriptional regulation of circRNAs. TRCirc (http://www.licpathway.net/TRCirc) provides a resource for efficient retrieval, browsing and visualization of transcriptional regulation information of circRNAs. The current version of TRCirc documents 92 375 circRNAs and 161 transcription factors (TFs) from more than 100 cell types and together represent more than 765 000 TF-circRNA regulatory relationships. Furthermore, TRCirc provides other regulatory information about transcription of circRNAs, including their expression, methylation levels, H3K27ac signals in regulation regions and super-enhancers associated with circRNAs. TRCirc provides a convenient, user-friendly interface to search, browse and visualize detailed information about these circRNAs.


Asunto(s)
Regulación de la Expresión Génica , ARN Circular/genética , Transcripción Genética , Bases de Datos Genéticas , Humanos , Almacenamiento y Recuperación de la Información
13.
Arterioscler Thromb Vasc Biol ; 40(6): 1464-1478, 2020 06.
Artículo en Inglés | MEDLINE | ID: mdl-32268789

RESUMEN

OBJECTIVE: Despite the current antiatherosclerotic and antithrombotic therapies, the incidence of advanced atherosclerosis-associated clinical events remains high. Whether long noncoding RNAs (lncRNAs) affect the progression of atherosclerosis and whether they are potential targets for the treatment of advanced atherosclerosis are poorly understood. Approach and Results: The progression of atherosclerotic lesions was accompanied by dynamic alterations in lncRNA expression, as revealed by RNA sequencing and quantitative polymerase chain reaction. Among the dynamically changing lncRNAs, we identified a novel lncRNA, lncRNA Associated with the Progression and Intervention of Atherosclerosis (RAPIA), that was highly expressed in advanced atherosclerotic lesions and in macrophages. Inhibition of RAPIA in vivo not only repressed the progression of atherosclerosis but also exerted atheroprotective effects similar to those of atorvastatin on advanced atherosclerotic plaques that had already formed. In vitro assays demonstrated that RAPIA promoted proliferation and reduced apoptosis of macrophages. A molecular sponge interaction between RAPIA and microRNA-183-5p was demonstrated by dual-luciferase reporter and RNA immunoprecipitation assays. Rescue assays indicated that RAPIA functioned at least in part by targeting the microRNA-183-5p/ITGB1 (integrin ß1) pathway in macrophages. In addition, the transcription factor FoxO1 (forkhead box O1) could bind to the RAPIA promoter region and facilitate the expression of RAPIA. CONCLUSIONS: The progression of atherosclerotic lesions was accompanied by dynamic changes in the expression of lncRNAs. Inhibition of the pivotal lncRNA RAPIA may be a novel preventive and therapeutic strategy for advanced atherosclerosis, especially in patients resistant or intolerant to statins.


Asunto(s)
Aterosclerosis/terapia , Expresión Génica , Macrófagos/metabolismo , ARN Largo no Codificante/antagonistas & inhibidores , ARN Largo no Codificante/genética , Animales , Apoptosis/efectos de los fármacos , Aterosclerosis/genética , Aterosclerosis/prevención & control , Atorvastatina/farmacología , Proliferación Celular/efectos de los fármacos , Progresión de la Enfermedad , Proteína Forkhead Box O1/metabolismo , Humanos , Integrina beta1/metabolismo , Macrófagos/química , Macrófagos/patología , Masculino , Ratones , Ratones Endogámicos C57BL , Ratones Noqueados para ApoE , MicroARNs/metabolismo , MicroARNs/farmacología , Regiones Promotoras Genéticas/fisiología , Células RAW 264.7 , ARN Largo no Codificante/fisiología
14.
Nucleic Acids Res ; 47(D1): D235-D243, 2019 01 08.
Artículo en Inglés | MEDLINE | ID: mdl-30371817

RESUMEN

Super-enhancers are important for controlling and defining the expression of cell-specific genes. With research on human disease and biological processes, human H3K27ac ChIP-seq datasets are accumulating rapidly, creating the urgent need to collect and process these data comprehensively and efficiently. More importantly, many studies showed that super-enhancer-associated single nucleotide polymorphisms (SNPs) and transcription factors (TFs) strongly influence human disease and biological processes. Here, we developed a comprehensive human super-enhancer database (SEdb, http://www.licpathway.net/sedb) that aimed to provide a large number of available resources on human super-enhancers. The database was annotated with potential functions of super-enhancers in the gene regulation. The current version of SEdb documented a total of 331 601 super-enhancers from 542 samples. Especially, unlike existing super-enhancer databases, we manually curated and classified 410 available H3K27ac samples from >2000 ChIP-seq samples from NCBI GEO/SRA. Furthermore, SEdb provides detailed genetic and epigenetic annotation information on super-enhancers. Information includes common SNPs, motif changes, expression quantitative trait locus (eQTL), risk SNPs, transcription factor binding sites (TFBSs), CRISPR/Cas9 target sites and Dnase I hypersensitivity sites (DHSs) for in-depth analyses of super-enhancers. SEdb will help elucidate super-enhancer-related functions and find potential biological effects.


Asunto(s)
Biología Computacional/métodos , Bases de Datos Genéticas , Elementos de Facilitación Genéticos , Genómica/métodos , Humanos , Almacenamiento y Recuperación de la Información , Anotación de Secuencia Molecular , Programas Informáticos , Diseño de Software , Interfaz Usuario-Computador , Navegador Web
15.
Nucleic Acids Res ; 47(W1): W248-W255, 2019 07 02.
Artículo en Inglés | MEDLINE | ID: mdl-31028388

RESUMEN

Super-enhancers (SEs) have prominent roles in biological and pathological processes through their unique transcriptional regulatory capability. To date, several SE databases have been developed by us and others. However, these existing databases do not provide downstream or upstream regulatory analyses of SEs. Pathways, transcription factors (TFs), SEs, and SE-associated genes form complex regulatory networks. Therefore, we designed a novel web server, SEanalysis, which provides comprehensive SE-associated regulatory network analyses. SEanalysis characterizes SE-associated genes, TFs binding to target SEs, and their upstream pathways. The current version of SEanalysis contains more than 330 000 SEs from more than 540 types of cells/tissues, 5042 TF ChIP-seq data generated from these cells/tissues, DNA-binding sequence motifs for ∼700 human TFs and 2880 pathways from 10 databases. SEanalysis supports searching by either SEs, samples, TFs, pathways or genes. The complex regulatory networks formed by these factors can be interactively visualized. In addition, we developed a customizable genome browser containing >6000 customizable tracks for visualization. The server is freely available at http://licpathway.net/SEanalysis.


Asunto(s)
Bases de Datos Genéticas , Elementos de Facilitación Genéticos/genética , Regulación de la Expresión Génica/genética , Redes Reguladoras de Genes/genética , Programas Informáticos , Sitios de Unión/genética , Humanos , Internet , Factores de Transcripción/genética
16.
Curr Med Imaging ; 2024 Mar 15.
Artículo en Inglés | MEDLINE | ID: mdl-38494941

RESUMEN

BACKGROUND: Coronary Heart Disease (CHD) is one of the most common types of cardiovascular disease, and Heart Failure (HF) is an important factor in its progression. We aimed to evaluate the diagnostic value and predictors of multiparametric Cardiac Magnetic Resonance (CMR) in CHD patients with HF. METHODS: The study retrospectively included 145 CHD patients who were classified into CHD (HF+) (n = 91) and CHD (HF-) (n = 54) groups according to whether HF occurred. CMR assessed LV function, myocardial strain and T1 mapping. Multivariate linear regression analyses were performed to identify predictors of LV dysfunction, myocardial fibrosis, and LV remodeling. RESULTS: CHD (HF+) group had impaired strain, with increased native T1, ECV, and LVM index. The impaired strain was associated with LVM index (p < 0.05), where native T1 and ECV were affected by log-transformed amino-terminal pro-B-type natriuretic peptide (NT-proBNP) levels. ROC analysis showed the combination of global circumferential strain (GCS), native T1, and LVM had a higher diagnostic value for the occurrence of HF in CHD patients.

Meanwhile, log-transformed NT-proBNP was an independent determinant of impaired strain, increased LVM index, native T1 and ECV. CONCLUSION: HF has harmful effects on LV systolic function in patients with CHD. In CHD (HF+) group, LV dysfunction is strongly correlated with the degree of LV remodeling and myocardial fibrosis. The combination of the three is more valuable in diagnosing HF than conventional indicators.

17.
Comput Struct Biotechnol J ; 23: 1877-1885, 2024 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-38707542

RESUMEN

Transcription factors (TFs) are major contributors to gene transcription, especially in controlling cell-specific gene expression and disease occurrence and development. Uncovering the relationship between TFs and their target genes is critical to understanding the mechanism of action of TFs. With the development of high-throughput sequencing techniques, a large amount of TF-related data has accumulated, which can be used to identify their target genes. In this study, we developed TFTG (Transcription Factor and Target Genes) database (http://tf.liclab.net/TFTG), which aimed to provide a large number of available human TF-target gene resources by multiple strategies, besides performing a comprehensive functional and epigenetic annotations and regulatory analyses of TFs. We identified extensive available TF-target genes by collecting and processing TF-associated ChIP-seq datasets, perturbation RNA-seq datasets and motifs. We also obtained experimentally confirmed relationships between TF and target genes from available resources. Overall, the target genes of TFs were obtained through integrating the relevant data of various TFs as well as fourteen identification strategies. Meanwhile, TFTG was embedded with user-friendly search, analysis, browsing, downloading and visualization functions. TFTG is designed to be a convenient resource for exploring human TF-target gene regulations, which will be useful for most users in the TF and gene expression regulation research.

18.
Comput Struct Biotechnol J ; 23: 2746-2753, 2024 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-39050785

RESUMEN

The advent of single cell transposase-accessible chromatin sequencing (scATAC-seq) technology enables us to explore the genomic characteristics and chromatin accessibility of blood cells at the single-cell level. To fully make sense of the roles and regulatory complexities of blood cells, it is critical to collect and analyze these rapidly accumulating scATAC-seq datasets at a system level. Here, we present scBlood (https://bio.liclab.net/scBlood/), a comprehensive single-cell accessible chromatin database of blood cells. The current version of scBlood catalogs 770,907 blood cells and 452,247 non-blood cells from ∼400 high-quality scATAC-seq samples covering 30 tissues and 21 disease types. All data hosted on scBlood have undergone preprocessing from raw fastq files and multiple standards of quality control. Furthermore, we conducted comprehensive downstream analyses, including multi-sample integration analysis, cell clustering and annotation, differential chromatin accessibility analysis, functional enrichment analysis, co-accessibility analysis, gene activity score calculation, and transcription factor (TF) enrichment analysis. In summary, scBlood provides a user-friendly interface for searching, browsing, analyzing, visualizing, and downloading scATAC-seq data of interest. This platform facilitates insights into the functions and regulatory mechanisms of blood cells, as well as their involvement in blood-related diseases.

19.
Mol Ther Nucleic Acids ; 33: 655-667, 2023 Sep 12.
Artículo en Inglés | MEDLINE | ID: mdl-37637211

RESUMEN

Cis-regulatory elements are important molecular switches in controlling gene expression and are regarded as determinant hubs in the transcriptional regulatory network. Collection and processing of large-scale cis-regulatory data are urgent to decipher the potential mechanisms of cardiovascular diseases from a cis-regulatory element aspect. Here, we developed a novel web server, Cis-Cardio, which aims to document a large number of available cardiovascular-related cis-regulatory data and to provide analysis for unveiling the comprehensive mechanisms at a cis-regulation level. The current version of Cis-Cardio catalogs a total of 45,382,361 genomic regions from 1,013 human and mouse epigenetic datasets, including ATAC-seq, DNase-seq, Histone ChIP-seq, TF/TcoF ChIP-seq, RNA polymerase ChIP-seq, and Cohesin ChIP-seq. Importantly, Cis-Cardio provides six analysis tools, including region overlap analysis, element upstream/downstream analysis, transcription regulator enrichment analysis, variant interpretation, and protein-protein interaction-based co-regulatory analysis. Additionally, Cis-Cardio provides detailed and abundant (epi-) genetic annotations in cis-regulatory regions, such as super-enhancers, enhancers, transcription factor binding sites (TFBSs), methylation sites, common SNPs, risk SNPs, expression quantitative trait loci (eQTLs), motifs, DNase I hypersensitive sites (DHSs), and 3D chromatin interactions. In summary, Cis-Cardio is a valuable resource for elucidating and analyzing regulatory cues of cardiovascular-specific cis-regulatory elements. The platform is freely available at http://www.licpathway.net/Cis-Cardio/index.html.

20.
Mol Ther Nucleic Acids ; 32: 385-401, 2023 Jun 13.
Artículo en Inglés | MEDLINE | ID: mdl-37131406

RESUMEN

A core transcription regulatory circuitry (CRC) is an interconnected self-regulatory circuitry that is formed by a group of core transcription factors (TFs). These core TFs collectively regulate gene expression by binding not only to their own super enhancers (SEs) but also to the SEs of one another. For most human tissue/cell types, a global view of CRCs and core TFs has not been generated. Here, we identified numerous CRCs using two identification methods and detailed the landscape of the CRCs driven by SEs in large cell/tissue samples. The comprehensive biological analyses, including sequence conservation, CRC activity and genome binding affinity were conducted for common TFs, moderate TFs, and specific TFs, which exhibit different biological features. The local module located from the common CRC network highlighted the essential functions and prognostic performance. The tissue-specific CRC network was highly related to cell identity. Core TFs in tissue-specific CRC networks exhibited disease markers, and had regulatory potential for cancer immunotherapy. Moreover, a user-friendly resource named CRCdb (http://www.licpathway.net/crcdb/index.html) was developed, which contained the detailed information of CRCs and core TFs used in this study, as well as other interesting results, such as the most representative CRC, frequency of TFs, and indegree/outdegree of TFs.

SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA