Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 9 de 9
Filtrar
Más filtros










Base de datos
Intervalo de año de publicación
1.
Bioinformatics ; 40(6)2024 Jun 03.
Artículo en Inglés | MEDLINE | ID: mdl-38837395

RESUMEN

MOTIVATION: Tissue context and molecular profiling are commonly used measures in understanding normal development and disease pathology. In recent years, the development of spatial molecular profiling technologies (e.g. spatial resolved transcriptomics) has enabled the exploration of quantitative links between tissue morphology and gene expression. However, these technologies remain expensive and time-consuming, with subsequent analyses necessitating high-throughput pathological annotations. On the other hand, existing computational tools are limited to predicting only a few dozen to several hundred genes, and the majority of the methods are designed for bulk RNA-seq. RESULTS: In this context, we propose HE2Gene, the first multi-task learning-based method capable of predicting tens of thousands of spot-level gene expressions along with pathological annotations from H&E-stained images. Experimental results demonstrate that HE2Gene is comparable to state-of-the-art methods and generalizes well on an external dataset without the need for re-training. Moreover, HE2Gene preserves the annotated spatial domains and has the potential to identify biomarkers. This capability facilitates cancer diagnosis and broadens its applicability to investigate gene-disease associations. AVAILABILITY AND IMPLEMENTATION: The source code and data information has been deposited at https://github.com/Microbiods/HE2Gene.


Asunto(s)
Transcriptoma , Humanos , Perfilación de la Expresión Génica/métodos , Biología Computacional/métodos , Aprendizaje Automático , ARN/metabolismo
2.
Comput Biol Med ; 175: 108487, 2024 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-38653064

RESUMEN

Drug repurposing is promising in multiple scenarios, such as emerging viral outbreak controls and cost reductions of drug discovery. Traditional graph-based drug repurposing methods are limited to fast, large-scale virtual screens, as they constrain the counts for drugs and targets and fail to predict novel viruses or drugs. Moreover, though deep learning has been proposed for drug repurposing, only a few methods have been used, including a group of pre-trained deep learning models for embedding generation and transfer learning. Hence, we propose DeepSeq2Drug to tackle the shortcomings of previous methods. We leverage multi-modal embeddings and an ensemble strategy to complement the numbers of drugs and viruses and to guarantee the novel prediction. This framework (including the expanded version) involves four modal types: six NLP models, four CV models, four graph models, and two sequence models. In detail, we first make a pipeline and calculate the predictive performance of each pair of viral and drug embeddings. Then, we select the best embedding pairs and apply an ensemble strategy to conduct anti-viral drug repurposing. To validate the effect of the proposed ensemble model, a monkeypox virus (MPV) case study is conducted to reflect the potential predictive capability. This framework could be a benchmark method for further pre-trained deep learning optimization and anti-viral drug repurposing tasks. We also build software further to make the proposed model easier to reuse. The code and software are freely available at http://deepseq2drug.cs.cityu.edu.hk.


Asunto(s)
Antivirales , Aprendizaje Profundo , Reposicionamiento de Medicamentos , Reposicionamiento de Medicamentos/métodos , Antivirales/farmacología , Antivirales/uso terapéutico , Humanos , Programas Informáticos , Benchmarking
3.
Neurol Ther ; 13(3): 727-737, 2024 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-38619804

RESUMEN

INTRODUCTION: Previous studies have reported controversial relationships between circulating vascular endothelial growth factors (VEGF) and ischemic stroke (IS). This study aims to demonstrate the causal effect between VEGF and IS using Mendelian randomization (MR). METHODS: Summary statistics data from two large-scale genome-wide association studies (GWAS) for 16,112 patients with measured VEGF levels and 40,585 patients with IS were downloaded from public databases and included in this study. A published calculator was adopted for MR power calculation. The primary outcome was any ischemic stroke, and the secondary outcomes were large-artery stroke, cardioembolic stroke, and small-vessel stroke. We used the inverse variance-weighted (IVW) method for primary analysis, supplemented by MR-Egger regression and the weighted median method. RESULTS: Nine SNPs were included to represent serum VEGF levels. The IVW method revealed no strong causal association between VEGF and any ischemic stroke (odds ratio [OR] 1.01, 95% CI 0.99-1.04, p = 0.39), cardioembolic stroke (OR 1.04, 95% CI 0.97-1.12, p = 0.28), large-artery stroke (OR 1.02, 95% CI 0.95-1.09, p = 0.62), and small-vessel stroke (OR 0.98, 95% CI 0.91-1.04, p = 0.46). These findings remained robust in sensitivity analyses. MR-Egger regression suggested no horizontal pleiotropy. CONCLUSIONS: This Mendelian randomization study found no relationship between genetically predisposed serum VEGF levels and risks of IS or its subtypes.

4.
iScience ; 27(4): 109352, 2024 Apr 19.
Artículo en Inglés | MEDLINE | ID: mdl-38510148

RESUMEN

Gene regulatory networks (GRNs) involve complex and multi-layer regulatory interactions between regulators and their target genes. Precise knowledge of GRNs is important in understanding cellular processes and molecular functions. Recent breakthroughs in single-cell sequencing technology made it possible to infer GRNs at single-cell level. Existing methods, however, are limited by expensive computations, and sometimes simplistic assumptions. To overcome these obstacles, we propose scGREAT, a framework to infer GRN using gene embeddings and transformer from single-cell transcriptomics. scGREAT starts by constructing gene expression and gene biotext dictionaries from scRNA-seq data and gene text information. The representation of TF gene pairs is learned through optimizing embedding space by transformer-based engine. Results illustrated scGREAT outperformed other contemporary methods on benchmarks. Besides, gene representations from scGREAT provide valuable gene regulation insights, and external validation on spatial transcriptomics illuminated the mechanism behind scGREAT annotation. Moreover, scGREAT identified several TF target regulations corroborated in studies.

5.
iScience ; 26(11): 108197, 2023 Nov 17.
Artículo en Inglés | MEDLINE | ID: mdl-37965148

RESUMEN

By soaking microRNAs (miRNAs), long non-coding RNAs (lncRNAs) have the potential to regulate gene expression. Few methods have been created based on this mechanism to anticipate the lncRNA-gene relationship prediction. Hence, we present lncRNA-Top to forecast potential lncRNA-gene regulation relationships. Specifically, we constructed controlled deep-learning methods using 12417 lncRNAs and 16127 genes. We have provided retrospective and innovative views among negative sampling, random seeds, cross-validation, metrics, and independent datasets. The AUC, AUPR, and our defined precision@k were leveraged to evaluate performance. In-depth case studies demonstrate that 47 out of 100 projected top unknown pairings were recorded in publications, supporting the predictive power. Our additional software can annotate the scores with target candidates. The lncRNA-Top will be a helpful tool to uncover prospective lncRNA targets and better comprehend the regulatory processes of lncRNAs.

6.
Adv Sci (Weinh) ; 10(11): e2204113, 2023 04.
Artículo en Inglés | MEDLINE | ID: mdl-36762572

RESUMEN

The single-cell RNA sequencing (scRNA-seq) quantifies the gene expression of individual cells, while the bulk RNA sequencing (bulk RNA-seq) characterizes the mixed transcriptome of cells. The inference of drug sensitivities for individual cells can provide new insights to understand the mechanism of anti-cancer response heterogeneity and drug resistance at the cellular resolution. However, pharmacogenomic information related to their corresponding scRNA-Seq is often limited. Therefore, a transfer learning model is proposed to infer the drug sensitivities at single-cell level. This framework learns bulk transcriptome profiles and pharmacogenomics information from population cell lines in a large public dataset and transfers the knowledge to infer drug efficacy of individual cells. The results suggest that it is suitable to learn knowledge from pre-clinical cell lines to infer pre-existing cell subpopulations with different drug sensitivities prior to drug exposure. In addition, the model offers a new perspective on drug combinations. It is observed that drug-resistant subpopulation can be sensitive to other drugs (e.g., a subset of JHU006 is Vorinostat-resistant while Gefitinib-sensitive); such finding corroborates the previously reported drug combination (Gefitinib + Vorinostat) strategy in several cancer types. The identified drug sensitivity biomarkers reveal insights into the tumor heterogeneity and treatment at cellular resolution.


Asunto(s)
Transcriptoma , RNA-Seq/métodos , Gefitinib , Vorinostat , Transcriptoma/genética , Análisis de Secuencia de ARN/métodos
7.
IEEE J Biomed Health Inform ; 26(8): 4335-4344, 2022 08.
Artículo en Inglés | MEDLINE | ID: mdl-35471879

RESUMEN

Targeted therapy for one for a set of genes has made it possible to apply precision medicine for different patients due to the existence of tumor heterogeneity. However, how to regulate those genes are still problematic. One of the natural regulators of genes is microRNAs. Thus, a better understanding of the miRNA-gene interaction mechanism might contribute to future diagnosis, prevention, and cancer therapy. The interactions between microRNA and genes play an essential role in molecular genetics. The in-vivo experiments validating the relationships between them are time-consuming, money-costly, and labor-intensive. With the development of high-throughput technology, we dealt with tons of biological data. However, extracting features from tremendous raw data and making a mathematical model is still a challenging topic. Machine learning and deep learning algorithms have become powerful tools in dealing with biological data. Inspired by this, in this paper, we propose a model that combines features/embedding extraction methods, deep learning algorithms, and a voting system. We leverage doc2vec to generate sequential embedding from molecular sequences. The role2vec, GCN, and GMM for geometrical embedding were generated from the complex network from similarity and pair-wise datasets. For the deep learning algorithms, we leveraged LSTM and Bi-LSTM according to different embedding and features. Finally, we adopted a voting system to balance results from different data sources. The results have shown that our voting system could achieve a higher AUC than the existing benchmark. The case studies demonstrate that our model could reveal potential relationships between miRNAs and genes. The source code, features, and predictive results can be downloaded at https://github.com/Xshelton/SRG-vote.


Asunto(s)
Algoritmos , MicroARNs , Biología Computacional/métodos , Humanos , Aprendizaje Automático , MicroARNs/genética , Política , Programas Informáticos
8.
IEEE J Biomed Health Inform ; 26(8): 4303-4313, 2022 08.
Artículo en Inglés | MEDLINE | ID: mdl-35439152

RESUMEN

Exploring the prognostic classification and biomarkers in Head and Neck Squamous Carcinoma (HNSC) is of great clinical significance. We hybridized three prominent strategies to comprehensively characterize the molecular features of HNSC. We constructed a 15-gene signature to predict patients' death risk with an average AUC of 0.744 for 1-, 3-, and 5-year on TCGA-HNSC training set, and average AUCs of 0.636, 0.584, 0.755 in GSE65858, GSE-112026, CPTAC-HNSCC datasets, respectively. By combined with NMF clustering and consensus clustering of fraction of tumor immune cell infiltration (ICI) in the tumor microenvironment (TME), we captured a more refined biological characteristics of HNSC, and observed a prognosis heterogeneity in high tumor immunity patients. By matching tumor subset-specific expression signatures to drug-induced cell line expression profiles from large-scale pharmacogenomic databases in the OCTAD workspace, we identified a group of HNSC patients featured with poor prognosis and demonstrated that the individuals in this group are likely to receive increased drug sensitivity to reverse differentially expressed disease signature genes. This trend is especially highlighted among those with higher death risk and tumour immunity.


Asunto(s)
Perfilación de la Expresión Génica , Neoplasias de Cabeza y Cuello , Biomarcadores de Tumor/genética , Neoplasias de Cabeza y Cuello/tratamiento farmacológico , Neoplasias de Cabeza y Cuello/genética , Humanos , Pronóstico , Carcinoma de Células Escamosas de Cabeza y Cuello/genética , Transcriptoma , Resultado del Tratamiento , Microambiente Tumoral/genética
9.
Brief Bioinform ; 22(2): 2032-2042, 2021 03 22.
Artículo en Inglés | MEDLINE | ID: mdl-32181478

RESUMEN

MOTIVATION: MircroRNAs (miRNAs) regulate target genes and are responsible for lethal diseases such as cancers. Accurately recognizing and identifying miRNA and gene pairs could be helpful in deciphering the mechanism by which miRNA affects and regulates the development of cancers. Embedding methods and deep learning methods have shown their excellent performance in traditional classification tasks in many scenarios. But not so many attempts have adapted and merged these two methods into miRNA-gene relationship prediction. Hence, we proposed a novel computational framework. We first generated representational features for miRNAs and genes using both sequence and geometrical information and then leveraged a deep learning method for the associations' prediction. RESULTS: We used long short-term memory (LSTM) to predict potential relationships and proved that our method outperformed other state-of-the-art methods. Results showed that our framework SG-LSTM got an area under curve of 0.94 and was superior to other methods. In the case study, we predicted the top 10 miRNA-gene relationships and recommended the top 10 potential genes for hsa-miR-335-5p for SG-LSTM-core. We also tested our model using a larger dataset, from which 14 668 698 miRNA-gene pairs were predicted. The top 10 unknown pairs were also listed. AVAILABILITY: Our work can be download in https://github.com/Xshelton/SG_LSTM. CONTACT: luojiawei@hnu.edu.cn. SUPPLEMENTARY INFORMATION: Supplementary data are available at Briefings in Bioinformatics online.


Asunto(s)
Biología Computacional/métodos , Epistasis Genética , MicroARNs/genética , Algoritmos , Conjuntos de Datos como Asunto , Aprendizaje Profundo , Humanos
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...