Search | VHL Regional Portal (BVS)

1.

DMLS: an automated pipeline to extract the Drosophila modular transcription regulators and targets from massive literature articles.

Yang, Tzu-Hsien; Yu, Yu-Huai; Wu, Sheng-Hang; Chang, Fang-Yuan; Tsai, Hsiu-Chun; Yang, Ya-Chiao.

Database (Oxford) ; 2024: 0, 2024 Jun 20.

Article in English | MEDLINE | ID: mdl-38900628

ABSTRACT

Transcription regulation in multicellular species is mediated by modular transcription factor (TF) binding site combinations termed cis-regulatory modules (CRMs). Such CRM-mediated transcription regulation determines the gene expression patterns during development. Biologists frequently investigate how CRM-mediated transcription regulation affects gene expression. However, the knowledge of the target genes and regulatory TFs participating in the CRMs under study is mostly fragmentary throughout the literature. Researchers must invest substantial human effort to search through the articles deposited in biomedical literature databases to obtain this information. Although several novel text-mining systems are now available for literature triaging, these tools do not specifically focus on CRM-related literature prescreening and therefore fail to correctly extract the information of the CRM target genes and regulatory TFs from the literature. For this reason, we constructed a supportive auto-literature prescreener called Drosophila Modular transcription-regulation Literature Screener (DMLS) that achieves the following: (i) prescreens articles describing experiments on modular transcription regulation, (ii) identifies the described target genes and TFs of the CRMs under study for each modular transcription-regulation-describing article and (iii) features an automated and extendable pipeline to perform the task. We demonstrated that the final performance of DMLS in extracting the described target gene and regulatory TF lists of CRMs under study for given articles achieved test macro area under the ROC curve (auROC) = 89.7% and area under the precision-recall curve (auPRC) = 77.6%, outperforming the intuitive gene name-occurrence-counting method by at least 19.9% in auROC and 30.5% in auPRC. The web service and the command line versions of DMLS are available at https://cobis.bme.ncku.edu.tw/DMLS/ and https://github.com/cobisLab/DMLS/, respectively.
Database Tool URL: https://cobis.bme.ncku.edu.tw/DMLS/.
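
The abstract above compares DMLS against an "intuitive gene name-occurrence-counting method" without describing it further. The following is a minimal sketch of what such a baseline might look like, assuming it simply ranks candidate genes by how often their names appear in the article text. The function names, the matching rule, and the example sentence are illustrative assumptions, not the authors' implementation.

```python
import re
from collections import Counter


def count_gene_mentions(text, gene_names):
    """Count whole-token, case-insensitive mentions of each gene name."""
    counts = Counter()
    for name in gene_names:
        # Lookarounds stop "eve" from matching inside a longer token.
        pattern = r"(?<![A-Za-z0-9])" + re.escape(name) + r"(?![A-Za-z0-9])"
        counts[name] = len(re.findall(pattern, text, flags=re.IGNORECASE))
    return counts


def rank_candidates(text, gene_names):
    """Rank genes by mention count (descending); ties are broken by name."""
    counts = count_gene_mentions(text, gene_names)
    return sorted(gene_names, key=lambda g: (-counts[g], g))


# Hypothetical article snippet and candidate list, for illustration only.
example_text = (
    "The eve stripe 2 enhancer is activated by bicoid and hunchback. "
    "Kruppel and giant repress eve outside the stripe, and eve expression "
    "is lost when bicoid is removed."
)
example_genes = ["eve", "bicoid", "hunchback", "Kruppel", "giant"]
example_counts = count_gene_mentions(example_text, example_genes)
example_ranking = rank_candidates(example_text, example_genes)
```

A counting baseline of this kind cannot tell whether a frequently mentioned gene is the CRM target, a regulatory TF, or merely background, which is one plausible reason a learned extractor can outperform it.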

Subject(s)

Data Mining; Transcription Factors; Animals; Transcription Factors/genetics; Transcription Factors/metabolism; Data Mining/methods; Drosophila/genetics; Drosophila melanogaster/genetics; Databases, Genetic; Gene Expression Regulation; Drosophila Proteins/genetics; Drosophila Proteins/metabolism

2.

BAPCP: A comprehensive and user-friendly web tool for identifying biomarkers from protein microarray technologies.

Yang, Tzu-Hsien; Syu, Guan-Da; Chen, Chien-Sheng; Chen, Guan-Ru; Jhong, Song-En; Lin, Po-Heng; Lin, Pei-Chun; Wang, Yun-Cih; Shah, Pramod; Tseng, Yan-Yuan; Wu, Wei-Sheng.

Comput Methods Programs Biomed ; 254: 108260, 2024 Sep.

Article in English | MEDLINE | ID: mdl-38878357

ABSTRACT

BACKGROUND AND OBJECTIVE: Proteome microarrays are one of the popular high-throughput screening methods for large-scale investigation of protein interactions in cells. These interactions can be measured on protein chips when coupled with fluorescence-labeled probes, helping indicate potential biomarkers or discover drugs. Several computational tools were developed to help analyze the protein chip results. However, existing tools fail to provide a user-friendly interface for biologists and present only one or two data analysis methods suitable for limited experimental designs, restricting the use cases. METHODS: In order to facilitate the biomarker examination using protein chips, we implemented a user-friendly and comprehensive web tool called BAPCP (Biomarker Analysis tool for Protein Chip Platforms) in this research to deal with diverse chip data distributions. RESULTS: BAPCP is well integrated with standard chip result files and includes 7 data normalization methods and 7 custom-designed quality control/differential analysis filters for biomarker extraction among experiment groups. Moreover, it can handle cost-efficient chip designs that repeat several blocks/samples within one single slide. Using experiments of the human coronavirus (HCoV) protein microarray and the E. coli proteome chip that helps study the immune response of Kawasaki disease as examples, we demonstrated that BAPCP can accelerate the time-consuming week-long manual biomarker identification process to merely 3 min. CONCLUSIONS: The developed BAPCP tool provides substantial analysis support for protein interaction studies and conforms to the necessity of expanding computer usage and exchanging information in bioscience and medicine. The web service of BAPCP is available at https://cosbi.ee.ncku.edu.tw/BAPCP/.

Subject(s)

Biomarkers; Protein Array Analysis; Software; Biomarkers/metabolism; Humans; Internet; Proteome; User-Computer Interface; Escherichia coli; Proteomics/methods; Computational Biology

3.

Associations of diabetes status and glucose measures with outcomes after endovascular therapy in patients with acute ischemic stroke: an analysis of the nationwide TREAT-AIS registry.

Hsieh, Meng-Tsang; Hsieh, Cheng-Yang; Yang, Tzu-Hsien; Sung, Sheng-Feng; Hsieh, Yi-Chen; Lee, Chung-Wei; Lin, Chun-Jen; Chen, Yu-Wei; Lin, Kuan-Hung; Sung, Pi-Shan; Tang, Chih-Wei; Chu, Hai-Jui; Tsai, Kun-Chang; Chou, Chao-Liang; Lin, Ching-Huang; Wei, Cheng-Yu; Chen, Te-Yuan; Yan, Shang-Yih; Chen, Po-Lin; Hsiao, Chen-Yu; Chan, Lung; Huang, Yen-Chu; Liu, Hon-Man; Tang, Sung-Chun; Lee, I-Hui; Lien, Li-Ming; Chiou, Hung-Yi; Lee, Jiunn-Tay; Jeng, Jiann-Shing.

Front Neurol ; 15: 1351150, 2024.

Article in English | MEDLINE | ID: mdl-38813247

ABSTRACT

Background: Hyperglycemia affects the outcomes of endovascular therapy (EVT) for acute ischemic stroke (AIS). This study compares the predictive ability of diabetes status and glucose measures on EVT outcomes using nationwide registry data. Methods: The study included 1,097 AIS patients who underwent EVT from the Taiwan Registry of Endovascular Thrombectomy for Acute Ischemic Stroke. The variables analyzed included diabetes status, admission glucose, glycated hemoglobin (HbA1c), admission glucose-to-HbA1c ratio (GAR), and outcomes such as 90-day poor functional outcome (modified Rankin Scale score ≥ 2) and symptomatic intracranial hemorrhage (SICH). Multivariable analyses investigated the independent effects of diabetes status and glucose measures on outcomes. A receiver operating characteristic (ROC) analysis was performed to compare their predictive abilities. Results: The multivariable analysis showed that individuals with known diabetes had a higher likelihood of poor functional outcomes (odds ratios [ORs] 2.10 to 2.58) and SICH (ORs 3.28 to 4.30) compared to those without diabetes. Higher quartiles of admission glucose and GAR were associated with poor functional outcomes and SICH. Higher quartiles of HbA1c were significantly associated with poor functional outcomes. However, patients in the second HbA1c quartile (5.6-5.8%) showed a non-significant tendency toward good functional outcomes compared to those in the lowest quartile (<5.6%). The ROC analysis indicated that diabetes status and admission glucose had higher predictive abilities for poor functional outcomes, while admission glucose and GAR were better predictors for SICH. Conclusion: In AIS patients undergoing EVT, diabetes status, admission glucose, and GAR were associated with 90-day poor functional outcomes and SICH. Admission glucose was likely the most suitable glucose measure for predicting outcomes after EVT.
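
The glucose-to-HbA1c ratio (GAR) and the quartile grouping mentioned in this abstract are simple arithmetic. The sketch below shows one way to compute them, assuming glucose is given in mg/dL and HbA1c in percent (the abstract does not state the units). The cohort values are invented for illustration and are not registry data.

```python
import statistics


def glucose_to_hba1c_ratio(admission_glucose, hba1c_percent):
    """GAR = admission glucose divided by HbA1c (units are an assumption)."""
    if hba1c_percent <= 0:
        raise ValueError("HbA1c must be positive")
    return admission_glucose / hba1c_percent


def quartile_of(value, cut_points):
    """Return the quartile (1-4) of a value given three ascending cut points."""
    return 1 + sum(value > cut for cut in cut_points)


# Hypothetical (glucose mg/dL, HbA1c %) pairs, for illustration only.
toy_cohort = [(98, 5.4), (110, 5.6), (126, 5.7), (140, 6.1),
              (155, 6.0), (180, 7.2), (210, 6.5), (260, 8.9)]
toy_gars = [glucose_to_hba1c_ratio(g, h) for g, h in toy_cohort]
# Three cut points that split the toy GAR values into four groups.
toy_cut_points = statistics.quantiles(toy_gars, n=4)
toy_quartiles = [quartile_of(gar, toy_cut_points) for gar in toy_gars]
```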

4.

DEBFold: Computational Identification of RNA Secondary Structures for Sequences across Structural Families Using Deep Learning.

Yang, Tzu-Hsien.

J Chem Inf Model ; 64(9): 3756-3766, 2024 May 13.

Article in English | MEDLINE | ID: mdl-38648189

ABSTRACT

It is now known that RNAs play more active roles in cellular pathways beyond simply serving as transcription templates. These biological mechanisms might be mediated by higher RNA stereo conformations, triggering the need to understand RNA secondary structures first. However, experimental protocols for solving RNA structures are unavailable for large-scale investigation due to their high costs and time-consuming nature. Various computational tools were thus developed to predict the RNA secondary structures from sequences. Recently, deep networks have been investigated to help predict RNA structures directly from their sequences. However, existing deep-learning-based tools are more or less suffering from model overfitting due to their complicated problem formulation and defective model training processes, limiting their applications across sequences from different structural families. In this research, we designed a two-stage RNA structure prediction strategy called DEBFold (deep ensemble boosting and folding) based on convolution encoding/decoding and self-attention mechanisms to enhance the existing thermodynamic structure models. Moreover, the model training process followed rigorous steps to achieve an acceptable prediction generalization. On the family-wise reserved test sets and the PDB-derived test set, DEBFold achieves better structure prediction performance over traditional tools and existing deep-learning methods. In summary, we obtained a cutting-edge deep-learning-based structure prediction tool with supreme across-family generalization performance. The DEBFold tool can be accessed at https://cobis.bme.ncku.edu.tw/DEBFold/.

Subject(s)

Computational Biology; Deep Learning; Nucleic Acid Conformation; RNA; RNA/chemistry; Computational Biology/methods; Models, Molecular; Thermodynamics; Base Sequence

5.

Magnetic resonance imaging-based deep learning imaging biomarker for predicting functional outcomes after acute ischemic stroke.

Yang, Tzu-Hsien; Su, Ying-Ying; Tsai, Chia-Ling; Lin, Kai-Hsuan; Lin, Wei-Yang; Sung, Sheng-Feng.

Eur J Radiol ; 174: 111405, 2024 May.

Article in English | MEDLINE | ID: mdl-38447430

ABSTRACT

PURPOSE: Clinical risk scores are essential for predicting outcomes in stroke patients. The advancements in deep learning (DL) techniques provide opportunities to develop prediction applications using magnetic resonance (MR) images. We aimed to develop an MR-based DL imaging biomarker for predicting outcomes in acute ischemic stroke (AIS) and evaluate its additional benefit to current risk scores. METHOD: This study included 3338 AIS patients. We trained a DL model using deep neural network architectures on MR images and radiomics to predict poor functional outcomes at three months post-stroke. The DL model generated a DL score, which served as the DL imaging biomarker. We compared the predictive performance of this biomarker to five risk scores on a holdout test set. Additionally, we assessed whether incorporating the imaging biomarker into the risk scores improved the predictive performance. RESULTS: The DL imaging biomarker achieved an area under the receiver operating characteristic curve (AUC) of 0.788. The AUCs of the five studied risk scores were 0.789, 0.793, 0.804, 0.810, and 0.826, respectively. The imaging biomarker's predictive performance was comparable to four of the risk scores but inferior to one (p = 0.038). Adding the imaging biomarker to the risk scores improved the AUCs (p-values) to 0.831 (0.003), 0.825 (0.001), 0.834 (0.003), 0.836 (0.003), and 0.839 (0.177), respectively. The net reclassification improvement and integrated discrimination improvement indices also showed significant improvements (all p < 0.001). CONCLUSIONS: Using DL techniques to create an MR-based imaging biomarker is feasible and enhances the predictive ability of current risk scores.
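
Several abstracts in this listing report an area under the ROC curve (AUC or auROC). As a reminder of what that number measures, here is a minimal pure-Python sketch that computes it as the probability that a randomly chosen positive case receives a higher score than a randomly chosen negative case, with ties counted as one half. The function name and the toy scores are illustrative assumptions and are unrelated to the study data.

```python
def roc_auc(labels, scores):
    """Pairwise (Mann-Whitney) estimate of the area under the ROC curve."""
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("need at least one positive and one negative label")
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0   # positive ranked above negative
            elif p == n:
                wins += 0.5   # ties count as half
    return wins / (len(positives) * len(negatives))


# Toy labels (1 = poor outcome) and model scores, for illustration only.
toy_labels = [0, 0, 1, 1]
toy_scores = [0.1, 0.4, 0.35, 0.8]
toy_auc = roc_auc(toy_labels, toy_scores)
```

The quadratic loop is fine for small examples; for large cohorts a rank-based formulation or an established library routine would be the usual choice.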

Subject(s)

Brain Ischemia; Deep Learning; Ischemic Stroke; Stroke; Humans; Brain Ischemia/diagnostic imaging; Stroke/diagnostic imaging; Magnetic Resonance Imaging; Biomarkers; Retrospective Studies

6.

Identifying Human miRNA Target Sites via Learning the Interaction Patterns between miRNA and mRNA Segments.

Yang, Tzu-Hsien; Chen, Jhih-Cheng; Lee, Yuan-Han; Lu, Shang-Yi; Wu, Sheng-Hang; Chang, Fang-Yuan; Huang, Yan-Cheng; Lee, Mei-Hsien; Tseng, Yan-Yuan; Wu, Wei-Sheng.

J Chem Inf Model ; 64(7): 2445-2453, 2024 Apr 08.

Article in English | MEDLINE | ID: mdl-37903033

ABSTRACT

miRNAs (microRNAs) target specific mRNA (messenger RNA) sites to regulate their translation expression. Although miRNA targeting can rely on seed region base pairing, animal miRNAs, including human miRNAs, typically cooperate with several cofactors, leading to various noncanonical pairing rules. Therefore, identifying the binding sites of animal miRNAs remains challenging. Because experiments for mapping miRNA targets are costly, computational methods are preferred for extracting potential miRNA-mRNA fragment binding pairs first. However, existing prediction tools can have significant false positives due to the prevalent noncanonical miRNA binding behaviors and the information-biased training negative sets that were used while constructing these tools. To overcome these obstacles, we first prepared an information-balanced miRNA binding pair ground-truth data set. A miRNA-mRNA interaction-aware model was then designed to help identify miRNA binding events. On the test set, our model (auROC = 94.4%) outperformed existing models by at least 2.8% in auROC. Furthermore, we showed that this model can suggest potential binding patterns for miRNA-mRNA sequence interacting pairs. Finally, we made the prepared data sets and the designed model available at http://cosbi2.ee.ncku.edu.tw/mirna_binding/download.

Subject(s)

MicroRNAs; Animals; Humans; MicroRNAs/metabolism; RNA, Messenger/genetics; RNA, Messenger/metabolism; Algorithms; Computational Biology/methods

7.

RDDL: A systematic ensemble pipeline tool that streamlines balancing training schemes to reduce the effects of data imbalance in rare-disease-related deep-learning applications.

Yang, Tzu-Hsien; Liao, Zhan-Yi; Yu, Yu-Huai; Hsia, Min.

Comput Biol Chem ; 106: 107929, 2023 Oct.

Article in English | MEDLINE | ID: mdl-37517206

ABSTRACT

Identifying low-prevalence diseases, or rare diseases, in their early stages is key to disease treatment in the medical field. Deep learning techniques now provide promising tools for this purpose. Nevertheless, the low prevalence of rare diseases complicates the proper application of deep networks for disease identification because of the severe class-imbalance issue. In the past decades, several balancing methods have been studied to handle the data-imbalance issue. However, it has been verified that none of these methods guarantees superior performance over the others. This performance variation creates the need for a systematic pipeline with a comprehensive software tool for enhancing deep-learning applications in rare disease identification. We reviewed the existing balancing schemes and summarized a systematic deep ensemble pipeline with a constructed tool called RDDL for handling the data imbalance issue. Through two real case studies, we showed that rare disease identification could be boosted with this systematic RDDL pipeline tool by lessening the data imbalance problem during model training. The RDDL pipeline tool is available at https://github.com/cobisLab/RDDL/.
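
The abstract does not list which balancing schemes RDDL combines. As one generic illustration of the class-imbalance problem it describes, the sketch below computes inverse-frequency class weights and uses them in a weighted error rate. This is a common textbook scheme chosen here as an assumption; it is not taken from the RDDL tool.

```python
from collections import Counter


def inverse_frequency_weights(labels):
    """Weight each class by n_samples / (n_classes * class_count)."""
    counts = Counter(labels)
    n_samples = len(labels)
    n_classes = len(counts)
    return {cls: n_samples / (n_classes * count) for cls, count in counts.items()}


def weighted_error(labels, predictions, weights):
    """Error rate in which each sample counts with its class weight."""
    total = sum(weights[y] for y in labels)
    wrong = sum(weights[y] for y, p in zip(labels, predictions) if y != p)
    return wrong / total


# Toy data: 95 common-class samples and 5 rare-disease samples.
toy_labels = [0] * 95 + [1] * 5
toy_weights = inverse_frequency_weights(toy_labels)
# A classifier that always predicts the common class.
always_common = [0] * len(toy_labels)
plain_error = sum(y != p for y, p in zip(toy_labels, always_common)) / len(toy_labels)
balanced_error = weighted_error(toy_labels, always_common, toy_weights)
```

On this toy data the always-common classifier looks good under the plain error rate (5% wrong) but scores 50% under the weighted one, which shows why unbalanced training can hide a model that never detects the rare class.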

Subject(s)

Deep Learning; Humans; Rare Diseases; Software

8.

Mechanical Behaviors of Microwave-Assisted Pyrolysis Recycled Carbon Fiber-Reinforced Concrete with Early-Strength Cement.

Li, Yeou-Fong; Li, Jie-You; Syu, Jin-Yuan; Yang, Tzu-Hsien; Chang, Shu-Mei; Shen, Ming-Yuan.

Materials (Basel) ; 16(4), 2023 Feb 10.

Article in English | MEDLINE | ID: mdl-36837136

ABSTRACT

This study aimed to investigate the mechanical performance of early-strength carbon fiber-reinforced concrete (ECFRC) by incorporating original carbon fiber (OCF), recycled carbon fiber (RCF), and sizing-removed carbon fiber (SCF). Compressive, flexural, and splitting tensile strength were tested under three fiber-to-cement weight ratios (5‱, 10‱, and 15‱). The RCF was produced from waste bicycle parts made of carbon fiber-reinforced polymer (CFRP) through microwave-assisted pyrolysis (MAP). The sizing-removed fiber was obtained through a heat-treatment method applied to the OCF. The results of scanning electron microscopy (SEM) analysis with energy dispersive X-ray spectrometry (EDS) indicated the successful removal of sizing and impurities from the surface of the RCF and SCF. The mechanical test results showed that ECFRC with a 10‱ fiber-to-cement weight ratio of carbon fiber had the greatest improvement in its mechanical strengths. Moreover, the ECFRC with 10‱ RCF exhibited higher compressive, flexural, and splitting tensile strength than that of benchmark specimen by 14.2%, 56.5%, and 22.5%, respectively. The ECFRC specimens with a 10‱ fiber-to-cement weight ratio were used to analyze their impact resistance under various impact energies in the impact test. At 50 joules of impact energy, the impact number of the ECFRC with SCF was over 23 times that of the benchmark specimen (early-strength concrete without fiber) and was also greater than that of ECFRC with OCF and RCF.

9.

Precisely Closed Reduction of Nasal Bone Fracture Assisted With Plain Film Measurements Under the Picture Archiving and Communication System.

Yang, Tzu-Hsien; Fang, Chien-Liang; Tsai, Chong-Bin; Chen, Ming-Shan; Changchien, Chih-Hsuan; Yang, Hsin-Yi; Fang, Kai-Jan.

Ear Nose Throat J ; 102(8): NP413, 2023 Aug.

Article in English | MEDLINE | ID: mdl-34006146

ABSTRACT

OBJECTIVES: To prevent aesthetic and functional deformities, precisely closed reduction is crucial in the management of nasal fractures. Plain film radiography (PF), ultrasonography (USG), and computed tomography can help confirm the diagnosis and classification of fractures and assist in performing closed reduction. However, no study in the literature reports on precisely closed reduction assisted with PF measurements under the picture archiving and communication system (PACS). METHODS: We retrospectively evaluated 153 patients with nasal bone fracture between January 2013 and December 2017. Surgeons conducted precisely closed reduction assisted with PF measurement of the distance between the fracture site and nasal tip under PACS on 34 patients (group A). The other 119 patients underwent reduction guided by the surgeon's experience (group B). RESULTS: No significant differences in age, gender, Arbeitsgemeinschaft für Osteosynthesefragen (AO) classification, and reduction outcome were observed between group A and group B (P > .05). The operative time of group A was significantly shorter (12.50 ± 4.64 minutes) than that of group B (23.78 ± 11.20 minutes; P < .001). After adjusting for age, gender, and AO classification, the operative time of patients in group A was 10.46 minutes shorter than that of patients in group B (P < .001). In addition, the severity of nasal bone fracture (AO classification, β = 3.37, P = .002) was positively associated with the operative time. CONCLUSIONS: In this study, closed reduction in nasal bone fracture assisted with PF measurements under PACS was performed precisely, thereby effectively decreasing operative time and the occurrence of complications. This procedure requires neither the use of new instruments or C-arm nor USG or navigation experience. Moreover, reduction can be easily performed using this method, which requires a short operative time, helps achieve good reduction, involves less radiation exposure, and is cost-effective.

Subject(s)

Closed Fracture Reduction; Fractures, Bone; Nasal Bone; Nasal Bone/diagnostic imaging; Nasal Bone/injuries; Nasal Bone/surgery; Humans; Fractures, Bone/diagnostic imaging; Fractures, Bone/surgery; Radiology Information Systems; Retrospective Studies; Male; Female; Adult; Operative Time; Treatment Outcome

10.

CFA: An explainable deep learning model for annotating the transcriptional roles of cis-regulatory modules based on epigenetic codes.

Yang, Tzu-Hsien; Yu, Yu-Huai; Wu, Sheng-Hang; Zhang, Fang-Yuan.

Comput Biol Med ; 152: 106375, 2023 Jan.

Article in English | MEDLINE | ID: mdl-36502693

ABSTRACT

Metazoa gene expression is controlled by modular DNA segments called cis-regulatory modules (CRMs). CRMs can convey promoter/enhancer/insulator roles, generating additional regulation layers in transcription. Experiments for understanding CRM roles are low-throughput and costly. Large-scale CRM function investigation still depends on computational methods. However, existing in silico tools only recognize enhancers or promoters exclusively, thus accumulating errors when considering CRM promoter/enhancer/insulator roles altogether. Currently, no algorithm can concurrently consider these CRM roles. In this research, we developed the CRM Function Annotator (CFA) model. CFA provides complete CRM transcriptional role labeling based on epigenetic profiling interpretation. We demonstrated that CFA achieves high performance (test macro auROC/auPRC = 94.1%/90.3%) and outperforms existing tools in promoter/enhancer/insulator identification. CFA is also inspected to recognize explainable epigenetic codes consistent with previous findings when labeling CRM roles. By considering the higher-order combinations of the epigenetic codes, CFA significantly reduces false-positive rates in CRM transcriptional role annotation. CFA is available at https://github.com/cobisLab/CFA/.

Subject(s)

Deep Learning; Promoter Regions, Genetic/genetics; Epigenesis, Genetic/genetics

11.

YMLA: A comparative platform to carry out functional enrichment analysis for multiple gene lists in yeast.

Yang, Tzu-Hsien; Hsu, Chia-Wei; Wang, Yan-Xiang; Yu, Chien-Hung; Rathod, Jagat; Tseng, Yan-Yuan; Wu, Wei-Sheng.

Comput Biol Med ; 151(Pt B): 106314, 2022 Dec.

Article in English | MEDLINE | ID: mdl-36455295

ABSTRACT

Comparative analysis among multiple gene lists on their functional features is now a routine task due to the advancement of high-throughput experiments. Several enrichment analysis tools were developed in the past. However, these tools mainly focus on one gene list and contain only gene ontology or interaction features. Worse, comparative investigation and customized feature set reanalysis are still unavailable. Therefore, we constructed the YMLA (Yeast Multiple List Analyzer) platform in this research. YMLA includes 39 yeast features and facilitates comparative analysis among multiple gene lists via tabular views, heatmaps, and network plots. Moreover, the customized feature set reanalysis function was implemented in YMLA to help form mechanism hypotheses based on a selected enriched feature subset. We demonstrated the biological applicability of YMLA via example lists consisting of genes with top/bottom translation efficiency values. The analysis results provided by YMLA reveal novel facts consistent with previous experiments. YMLA is available at https://cosbi7.ee.ncku.edu.tw/YMLA/.

Subject(s)

Saccharomyces cerevisiae; Software; Saccharomyces cerevisiae/genetics

12.

YTLR: Extracting yeast transcription factor-gene associations from the literature using automated literature readers.

Yang, Tzu-Hsien; Wang, Chung-Yu; Tsai, Hsiu-Chun; Yang, Ya-Chiao; Liu, Cheng-Tse.

Comput Struct Biotechnol J ; 20: 4636-4644, 2022.

Article in English | MEDLINE | ID: mdl-36090812

ABSTRACT

Cells adapt to environmental stresses mainly via transcription reprogramming. Correct transcription control is mediated by the interactions between transcription factors (TF) and their target genes. These TF-gene associations can be probed by chromatin immunoprecipitation techniques and knockout experiments, revealing TF binding (TFB) and regulatory (TFR) evidence, respectively. Nevertheless, most evidence is still fragmentary in the literature and requires tremendous human resources to curate. We developed the first pipeline called YTLR (Yeast Transcription-regulation Literature Reader) to automate TF-gene relation extraction from the literature. YTLR first identifies articles with TFB and TFR information. Then TF-gene binding pairs are extracted from the TFB articles, and TF-gene regulatory associations are recognized from the TFR papers. On gathered test sets, YTLR achieves an AUC value of 98.8% in identifying articles with TFB evidence and AUC = 83.4% in extracting the detailed TF-gene binding pairs. And similarly, YTLR also obtains an AUC value of 98.2% in identifying TFR articles and AUC = 80.4% in extracting the detailed TF-gene regulatory associations. Furthermore, YTLR outperforms previous methods in both tasks. To facilitate researchers in extracting TF-gene transcriptional relations from large-scale queried articles, an automated and easy-to-use software tool based on the YTLR pipeline is constructed. In summary, YTLR aims to provide easier literature pre-screening for curators and help researchers gather yeast TF-gene transcriptional relation conclusions from articles in a high-throughput fashion. The YTLR pipeline software tool can be downloaded at https://github.com/cobisLab/YTLR/.

13.

SSRTool: A web tool for evaluating RNA secondary structure predictions based on species-specific functional interpretability.

Yang, Tzu-Hsien; Lin, Yu-Cian; Hsia, Min; Liao, Zhan-Yi.

Comput Struct Biotechnol J ; 20: 2473-2483, 2022.

Article in English | MEDLINE | ID: mdl-35664227

ABSTRACT

RNA secondary structures can carry out essential cellular functions alone or interact with one another to form the hierarchical tertiary structures. Experimental structure identification approaches can show the in vitro structures of RNA molecules. However, they usually have limits in the resolution and are costly. In silico structure prediction tools are thus primarily relied on for pre-experiment analysis. Various structure prediction models have been developed over the decades. Since these tools are usually used before knowing the actual RNA structures, evaluating and ranking the pile of secondary structure predictions of a given sequence is essential in computational analysis. In this research, we implemented a web service called SSRTool (RNA Secondary Structure prediction Ranking Tool) to assist in the ranking and evaluation of the generated predicted structures of a given sequence. Based on the computed species-specific interpretability significance in four common RNA structure-function aspects, SSRTool provides three functions along with visualization interfaces: (1) Rank user-generated predictions. (2) Provide an automated streamline of structure prediction and ranking for a given sequence. (3) Infer the functional aspects of a given structure. We demonstrated the applicability of SSRTool via real case studies and reported similar trends between computed species-specific rankings and the corresponding prediction F1 values. The SSRTool web service is available online at https://cobisHSS0.im.nuk.edu.tw/SSRTool/, http://cosbi3.ee.ncku.edu.tw/SSRTool/, or the redirecting site https://github.com/cobisLab/SSRTool/.

14.

KDmarkers: A biomarker database for investigating epigenetic methylation and gene expression levels in Kawasaki disease.

Wu, Wei-Sheng; Yang, Tzu-Hsien; Chen, Kuang-Den; Lin, Po-Heng; Chen, Guan-Ru; Kuo, Ho-Chang.

Comput Struct Biotechnol J ; 20: 1295-1305, 2022.

Article in English | MEDLINE | ID: mdl-35356542

ABSTRACT

Kawasaki disease (KD) is a form of acute systemic vasculitis that primarily affects children and has become the most common cause of acquired heart disease. While the etiopathogenesis of KD remains unknown, the diagnostic criteria of KD have been well established. Nevertheless, the diagnosis of KD is currently based on subjective clinical symptoms, and no molecular biomarker is yet available. We have previously performed and combined methylation array (Illumina HumanMethylation450 BeadChip) and transcriptome array (Affymetrix GeneChip Human Transcriptome Array 2.0) to identify genes that are differentially methylated/expressed in KD patients compared with control subjects. We have found that decreased methylation levels combined with elevated gene expression can indicate genes (e.g., toll-like receptors and CD177) involved in the disease mechanisms of KD. In this study, we constructed a database called KDmarkers to allow researchers to access these valuable potential KD biomarkers identified via methylation array and transcriptome array. KDmarkers provides three search modes. First, users can search genes differentially methylated and/or differentially expressed in KD patients compared with control subjects. Second, users can check the KD patient groups in which a given gene is differentially methylated and/or differentially expressed. Third, users can explore the DNA methylation levels and gene expression levels in all samples (KD patients and controls) for a particular gene of interest. We further demonstrated that the results in KDmarkers are strongly associated with KD immune responses. All analysis results can be downloaded for downstream experimental designs. KDmarkers is available online at https://cosbi.ee.ncku.edu.tw/KDmarkers/.

15.

regCNN: identifying Drosophila genome-wide cis-regulatory modules via integrating the local patterns in epigenetic marks and transcription factor binding motifs.

Yang, Tzu-Hsien; Yang, Ya-Chiao; Tu, Kai-Chi.

Comput Struct Biotechnol J ; 20: 296-308, 2022.

Article in English | MEDLINE | ID: mdl-35035784

ABSTRACT

Transcription regulation in metazoa is controlled by the binding events of transcription factors (TFs) or regulatory proteins on specific modular DNA regulatory sequences called cis-regulatory modules (CRMs). Understanding the distributions of CRMs on a genomic scale is essential for constructing the metazoan transcriptional regulatory networks that help diagnose genetic disorders. While traditional reporter-assay CRM identification approaches can provide an in-depth understanding of the functions of some CRMs, these methods are usually cost-inefficient and low-throughput. It is generally believed that by integrating diverse genomic data, reliable CRM predictions can be made. Hence, researchers often first resort to computational algorithms for genome-wide CRM screening before specific experiments. However, existing in silico methods for searching potential CRMs are restricted by low sensitivity, poor prediction accuracy, or high computation time from TFBS composition combinatorial complexity. To overcome these obstacles, we designed a novel CRM identification pipeline called regCNN by considering the base-by-base local patterns in TF binding motifs and epigenetic profiles. On the test set, regCNN shows an accuracy/auROC of 84.5%/92.5% in CRM identification. By further considering local patterns in epigenetic profiles and TF binding motifs, it achieves a 4.7% (92.5%-87.8%) improvement in the auROC value over the average value-based pure multi-layer perceptron model. We also demonstrated that regCNN outperforms all currently available tools by at least 11.3% in auROC values. Finally, regCNN is verified to be robust against its resizing window hyperparameter in dealing with the variable lengths of CRMs. The model of regCNN can be downloaded at http://cobisHSS0.im.nuk.edu.tw/regCNN/.

16.

An Aggregation Method to Identify the RNA Meta-Stable Secondary Structure and its Functionally Interpretable Structure Ensemble.

Yang, Tzu-Hsien.

IEEE/ACM Trans Comput Biol Bioinform ; 19(1): 75-86, 2022.

Article in English | MEDLINE | ID: mdl-34014829

ABSTRACT

RNA can provide vital cellular functions through its secondary or tertiary structure. Due to the low-throughput nature of experimental approaches, studies on RNA structures mainly resort to computational methods. However, existing tools fail to consider RNA structure ensembles and do not provide ways to derive functional hypotheses for new predictions. In this research, a novel method was proposed to identify the functionally interpretable structure ensemble of a given RNA sequence and, based on the ensemble, provide the meta-stable structure, i.e., the most frequently observed functional RNA cellular conformation. In the prediction of meta-stable structures, the proposed method outperformed existing tools on a yeast test set. The inferred functional aspects were then manually checked and demonstrated a micro-averaged F1 value of 0.92. Further, a biological example of the yeast ASH1-E1 element was discussed to show that these functional aspects can also suggest testable hypotheses. The proposed method was then verified to be applicable to other species through a human test set. Finally, the proposed method was shown to resist sequence length-dependent performance deterioration.
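For readers unfamiliar with the metric quoted in this abstract, the following is a minimal sketch of how a micro-averaged F1 value is computed for multi-label annotations. It is not the authors' evaluation code; the function name `micro_f1` and the example labels are hypothetical and chosen only for illustration.

```python
def micro_f1(true_labels, predicted_labels):
    """Micro-averaged F1: pool true/false positives and false negatives
    over all items before computing precision and recall.

    Both arguments are sequences of label collections, one per item.
    (Hypothetical helper, not taken from the cited paper.)
    """
    tp = fp = fn = 0
    for truth, pred in zip(true_labels, predicted_labels):
        truth, pred = set(truth), set(pred)
        tp += len(truth & pred)   # labels predicted and correct
        fp += len(pred - truth)   # labels predicted but wrong
        fn += len(truth - pred)   # labels missed
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Illustrative labels only (assumed, not from the paper).
truth = [{"localization", "stability"}, {"translation"}]
preds = [{"localization"}, {"translation", "splicing"}]
score = micro_f1(truth, preds)
```

Because counts are pooled across items, items with many labels weigh more than items with few, which is the main difference from macro-averaging.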

Subject(s)

Algorithms , RNA , Computational Biology , Humans , Nucleic Acid Conformation , Protein Structure, Secondary , RNA/genetics

17.

Identifying piRNA targets on mRNAs in C. elegans using a deep multi-head attention network.

Yang, Tzu-Hsien; Shiue, Sheng-Cian; Chen, Kuan-Yu; Tseng, Yan-Yuan; Wu, Wei-Sheng.

BMC Bioinformatics ; 22(1): 503, 2021 Oct 16.

Article in English | MEDLINE | ID: mdl-34656087

ABSTRACT

BACKGROUND: Piwi-interacting RNAs (piRNAs) are small non-coding RNAs (ncRNAs) that silence genomic transposable elements. Researchers have found that piRNAs also regulate various endogenous transcripts. However, there is no systematic understanding of piRNA binding patterns or of how piRNAs target genes. While various prediction methods have been developed for similar ncRNAs (e.g., miRNAs), piRNAs have distinctive characteristics and require their own computational model for binding target prediction. RESULTS: Recently, transcriptome-wide piRNA binding events in C. elegans were probed by PRG-1 CLASH experiments. Based on the probed piRNA-messenger RNA (mRNA) binding pairs, we devised the first deep learning architecture based on multi-head attention to computationally identify piRNA target sites on mRNAs. In the devised deep network, the given piRNA and mRNA segment sequences are first one-hot encoded and then undergo a combined operation of convolution and squeezing-extraction to unravel motif patterns. We incorporate a novel multi-head attention sub-network to extract the hidden piRNA binding rules that can simulate the biological piRNA target recognition process. Finally, the true piRNA-mRNA binding pairs are identified by a deep fully connected sub-network. Our model obtains a high discriminatory power, with an AUC of 93.3% on an independent test set, and successfully extracts the verified binding pattern of a synthetic piRNA. These results demonstrate that the devised model achieves high prediction performance and suggests testable potential biological piRNA binding rules. CONCLUSIONS: In this research, we developed the first deep learning method to identify piRNA target sites on C. elegans mRNAs. The developed method is demonstrated to be highly accurate and can provide biological insights into piRNA-mRNA binding patterns. The piRNA binding target identification network can be downloaded from http://cosbi2.ee.ncku.edu.tw/data_download/piRNA_mRNA_binding .
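The one-hot encoding step mentioned in this abstract can be illustrated with a minimal sketch. The base ordering `ACGU`, the mapping of `T` to `U`, the all-zero row for unknown bases, and the function name `one_hot_encode` are assumptions made for this example; the abstract does not specify the authors' exact encoding.

```python
# Assumed base ordering; the cited paper's ordering is not given in the abstract.
RNA_ALPHABET = "ACGU"


def one_hot_encode(sequence):
    """Return one row per nucleotide, with a single 1 in the column of
    the matching base. DNA-style 'T' is treated as 'U'; any other
    character (e.g. 'N') yields an all-zero row.
    (Hypothetical helper for illustration only.)
    """
    rows = []
    for base in sequence.upper().replace("T", "U"):
        rows.append([1 if base == letter else 0 for letter in RNA_ALPHABET])
    return rows


# Example with a made-up 4-nt segment.
encoded = one_hot_encode("acgu")
```

A matrix of this shape (sequence length by 4) is the usual input to a convolution layer that scans for motif patterns.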

Subject(s)

Caenorhabditis elegans Proteins , MicroRNAs , Animals , Argonaute Proteins , Caenorhabditis elegans/genetics , Caenorhabditis elegans Proteins/genetics , DNA Transposable Elements , RNA, Messenger/genetics , RNA, Small Interfering/genetics

18.

Cancer DEIso: An integrative analysis platform for investigating differentially expressed gene-level and isoform-level human cancer markers.

Yang, Tzu-Hsien; Chiang, Yu-Hsuan; Shiue, Sheng-Cian; Lin, Po-Heng; Yang, Ya-Chiao; Tu, Kai-Chi; Tseng, Yan-Yuan; Tseng, Joseph T; Wu, Wei-Sheng.

Comput Struct Biotechnol J ; 19: 5149-5159, 2021.

Article in English | MEDLINE | ID: mdl-34589189

ABSTRACT

Transcript isoforms regulated by alternative splicing can substantially impact carcinogenesis, so cancer studies need clues about both differential gene expression and aberrant isoform distributions. The Cancer Genome Atlas (TCGA) project was launched in 2008 to collect cancer-related genome mutation raw data from the population. While many repositories have tried to add insights into the TCGA raw data, no existing database provides comprehensive gene-level and isoform-level cancer stage marker investigation together with survival analysis. We constructed Cancer DEIso to facilitate in-depth analyses for both gene-level and isoform-level human cancer studies. Patient RNA-seq data, sample sheets, patient clinical data, and human genome datasets were collected and processed in Cancer DEIso. Four functions to search differentially expressed genes/isoforms between cancer stages were implemented: (i) search potential gene/isoform markers for a specified cancer type and two of its stages; (ii) search potentially induced cancer types and stages for a gene/isoform; (iii) expression survival analysis of a given gene/isoform for a given cancer; (iv) visualization of gene/isoform expression comparisons across stages. As an example, we demonstrate that Cancer DEIso can indicate potential colorectal cancer isoform diagnostic markers that are not easily detected when only gene-level expression is considered. Cancer DEIso is available at http://cosbi4.ee.ncku.edu.tw/DEIso/.

19.

Role of the Inflammatory Response of RAW 264.7 Cells in the Metastasis of Novel Cancer Stem-Like Cells.

Kuo, Chan-Yen; Yang, Tzu-Hsien; Tsai, Pei-Fang; Yu, Chun-Hsien.

Medicina (Kaunas) ; 57(8), 2021 Jul 30.

Article in English | MEDLINE | ID: mdl-34440983

ABSTRACT

Background and objectives: Tumor progression and the immune response are intricately linked. Additionally, the presence of macrophages in the microenvironment is essential for carcinogenesis, but the regulation of the polarization of M1- and M2-like macrophages and their role in metastasis remain unclear. Based on previous studies, both reactive oxygen species (ROS) and the endoplasmic reticulum (ER) are emerging as key players in macrophage polarization. While it is known that cancers alter macrophage inflammatory responses to promote tumor progression, there is limited knowledge regarding how they affect the macrophage-dependent innate host defense. Materials and methods: We measured ROS levels, chemotactic ability, and the expression of M1-/M2-like macrophage markers in RAW 264.7 cells in the presence of T2- and T2C-conditioned medium. Results: ROS levels were decreased in RAW 264.7 cells cultured with T2C-conditioned medium, while chemotactic ability was improved. We also found that the M2-like macrophages were characterized by an elongated shape in RAW 264.7 cells cultured in T2C-conditioned medium, which had increased CD206 expression but decreased expression of CD86 and inducible nitric oxide synthase. Suppression of ER stress shifted polarized M1-like macrophages toward an M2-like phenotype in RAW 264.7 cells cultured in T2C-conditioned medium. Conclusions: Taken together, we conclude that the polarization of macrophages is associated with the alteration of cell shape, ROS accumulation, and ER stress.

Subject(s)

Macrophage Activation , Neoplasms , Animals , Macrophages , Mice , RAW 264.7 Cells , Reactive Oxygen Species , Tumor Microenvironment

20.

Human IRES Atlas: an integrative platform for studying IRES-driven translational regulation in humans.

Yang, Tzu-Hsien; Wang, Chung-Yu; Tsai, Hsiu-Chun; Liu, Cheng-Tse.

Database (Oxford) ; 2021, 2021 May 03.

Article in English | MEDLINE | ID: mdl-33942874

ABSTRACT

It is now known that cap-independent translation initiation facilitated by internal ribosome entry sites (IRESs) is vital in selective cellular protein synthesis under stress and different physiological conditions. However, three problems make it hard to understand transcriptome-wide cellular IRES-mediated translation initiation mechanisms: (i) complex interplay between IRESs and other translation initiation-related information, (ii) reliability issue of in silico cellular IRES investigation and (iii) labor-intensive in vivo IRES identification. In this research, we constructed the Human IRES Atlas database for a comprehensive understanding of cellular IRESs in humans. First, currently available and suitable IRES prediction tools (IRESfinder, PatSearch and IRESpy) were used to obtain transcriptome-wide human IRESs. Then, we collected eight genres of translation initiation-related features to help study the potential molecular mechanisms of each of the putative IRESs. Three functional tests (conservation, structural RNA-protein scores and conditional translation efficiency) were devised to evaluate the functionality of the identified putative IRESs. Moreover, an easy-to-use interface and an IRES-translation initiation interaction map for each gene transcript were implemented to help understand the interactions between IRESs and translation initiation-related features. Researchers can easily search/browse an IRES of interest using the web interface and deduce testable mechanism hypotheses of human IRES-driven translation initiation based on the integrated results. In summary, Human IRES Atlas integrates putative IRES elements and translation initiation-related experiments for better usage of these data and deduction of mechanism hypotheses. Database URL: http://cobishss0.im.nuk.edu.tw/Human_IRES_Atlas/.

Subject(s)

Internal Ribosome Entry Sites , Humans , Internal Ribosome Entry Sites/genetics , RNA , RNA, Viral , Reproducibility of Results
