Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 78
Filtrar
1.
IEEE J Biomed Health Inform ; 28(2): 1110-1121, 2024 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-38055359

RESUMO

Accumulating evidence indicates that microRNAs (miRNAs) can control and coordinate various biological processes. Consequently, abnormal expressions of miRNAs have been linked to various complex diseases. Recognizable proof of miRNA-disease associations (MDAs) will contribute to the diagnosis and treatment of human diseases. Nevertheless, traditional experimental verification of MDAs is laborious and limited to small-scale. Therefore, it is necessary to develop reliable and effective computational methods to predict novel MDAs. In this work, a multi-kernel graph attention deep autoencoder (MGADAE) method is proposed to predict potential MDAs. In detail, MGADAE first employs the multiple kernel learning (MKL) algorithm to construct an integrated miRNA similarity and disease similarity, providing more biological information for further feature learning. Second, MGADAE combines the known MDAs, disease similarity, and miRNA similarity into a heterogeneous network, then learns the representations of miRNAs and diseases through graph convolution operation. After that, an attention mechanism is introduced into MGADAE to integrate the representations from multiple graph convolutional network (GCN) layers. Lastly, the integrated representations of miRNAs and diseases are input into the bilinear decoder to obtain the final predicted association scores. Corresponding experiments prove that the proposed method outperforms existing advanced approaches in MDA prediction. Furthermore, case studies related to two human cancers provide further confirmation of the reliability of MGADAE in practice.


Assuntos
MicroRNAs , Neoplasias , Humanos , MicroRNAs/genética , Reprodutibilidade dos Testes , Biologia Computacional/métodos , Neoplasias/genética , Algoritmos
2.
IEEE J Biomed Health Inform ; 27(10): 5187-5198, 2023 10.
Artigo em Inglês | MEDLINE | ID: mdl-37498764

RESUMO

Advances in omics technology have enriched the understanding of the biological mechanisms of diseases, which has provided a new approach for cancer research. Multi-omics data contain different levels of cancer information, and comprehensive analysis of them has attracted wide attention. However, limited by the dimensionality of matrix models, traditional methods cannot fully use the key high-dimensional global structure of multi-omics data. Moreover, besides global information, local features within each omics are also critical. It is necessary to consider the potential local information together with the high-dimensional global information, ensuring that the shared and complementary features of the omics data are comprehensively observed. In view of the above, this article proposes a new tensor integrative framework called the strong complementarity tensor decomposition model (BioSTD) for cancer multi-omics data. It is used to identify cancer subtype specific genes and cluster subtype samples. Different from the matrix framework, BioSTD utilizes multi-view tensors to coordinate each omics to maximize high-dimensional spatial relationships, which jointly considers the different characteristics of different omics data. Meanwhile, we propose the concept of strong complementarity constraint applicable to omics data and introduce it into BioSTD. Strong complementarity is used to explore the potential local information, which can enhance the separability of different subtypes, allowing consistency and complementarity in the omics data to be fully represented. Experimental results on real cancer datasets show that our model outperforms other advanced models, which confirms its validity.


Assuntos
Neoplasias , Humanos , Neoplasias/genética , Multiômica
3.
J Comput Biol ; 30(8): 889-899, 2023 08.
Artigo em Inglês | MEDLINE | ID: mdl-37471239

RESUMO

The analysis of cancer data from multi-omics can effectively promote cancer research. The main focus of this article is to cluster cancer samples and identify feature genes to reveal the correlation between cancers and genes, with the primary approach being the analysis of multi-view cancer omics data. Our proposed solution, the Multi-View Enhanced Tensor Nuclear Norm and Local Constraint (MVET-LC) model, aims to utilize the consistency and complementarity of omics data to support biological research. The model is designed to maximize the utilization of multi-view data and incorporates a nuclear norm and local constraint to achieve this goal. The first step involves introducing the concept of enhanced partial sum of tensor nuclear norm, which significantly enhances the flexibility of the tensor nuclear norm. After that, we incorporate total variation regularization into the MVET-LC model to further augment its performance. It enables MVET-LC to make use of the relationship between tensor data structures and sparse data while paying attention to the feature details of the tensor data. To tackle the iterative optimization problem of MVET-LC, the alternating direction method of multipliers is utilized. Through experimental validation, it is demonstrated that our proposed model outperforms other comparison models.


Assuntos
Algoritmos , Neoplasias , Humanos , Neoplasias/genética , Análise por Conglomerados
4.
BMC Genomics ; 24(1): 426, 2023 Jul 29.
Artigo em Inglês | MEDLINE | ID: mdl-37516822

RESUMO

Comprehensive analysis of multiple data sets can identify potential driver genes for various cancers. In recent years, driver gene discovery based on massive mutation data and gene interaction networks has attracted increasing attention, but there is still a need to explore combining functional and structural information of genes in protein interaction networks to identify driver genes. Therefore, we propose a network embedding framework combining functional and structural information to identify driver genes. Firstly, we combine the mutation data and gene interaction networks to construct mutation integration network using network propagation algorithm. Secondly, the struc2vec model is used for extracting gene features from the mutation integration network, which contains both gene's functional and structural information. Finally, machine learning algorithms are utilized to identify the driver genes. Compared with the previous four excellent methods, our method can find gene pairs that are distant from each other through structural similarities and has better performance in identifying driver genes for 12 cancers in the cancer genome atlas. At the same time, we also conduct a comparative analysis of three gene interaction networks, three gene standard sets, and five machine learning algorithms. Our framework provides a new perspective for feature selection to identify novel driver genes.


Assuntos
Algoritmos , Redes Reguladoras de Genes , Estudos de Associação Genética , Aprendizado de Máquina , Mapeamento de Interação de Proteínas
5.
BMC Genomics ; 24(1): 279, 2023 May 25.
Artigo em Inglês | MEDLINE | ID: mdl-37226081

RESUMO

BACKGROUND: Piwi-interacting RNAs (piRNAs) have been proven to be closely associated with human diseases. The identification of the potential associations between piRNA and disease is of great significance for complex diseases. Traditional "wet experiment" is time-consuming and high-priced, predicting the piRNA-disease associations by computational methods is of great significance. METHODS: In this paper, a method based on the embedding transformation graph convolution network is proposed to predict the piRNA-disease associations, named ETGPDA. Specifically, a heterogeneous network is constructed based on the similarity information of piRNA and disease, as well as the known piRNA-disease associations, which is applied to extract low-dimensional embeddings of piRNA and disease based on graph convolutional network with an attention mechanism. Furthermore, the embedding transformation module is developed for the problem of embedding space inconsistency, which is lightweighter, stronger learning ability and higher accuracy. Finally, the piRNA-disease association score is calculated by the similarity of the piRNA and disease embedding. RESULTS: Evaluated by fivefold cross-validation, the AUC of ETGPDA achieves 0.9603, which is better than the other five selected computational models. The case studies based on Head and neck squamous cell carcinoma and Alzheimer's disease further prove the superior performance of ETGPDA. CONCLUSIONS: Hence, the ETGPDA is an effective method for predicting the hidden piRNA-disease associations.


Assuntos
Doença de Alzheimer , Neoplasias de Cabeça e Pescoço , Humanos , RNA de Interação com Piwi , Doença de Alzheimer/genética , Aprendizagem , Projetos de Pesquisa
6.
BMC Bioinformatics ; 24(1): 13, 2023 Jan 09.
Artigo em Inglês | MEDLINE | ID: mdl-36624376

RESUMO

BACKGROUND: Constructing molecular interaction networks from microarray data and then identifying disease module biomarkers can provide insight into the underlying pathogenic mechanisms of non-small cell lung cancer. A promising approach for identifying disease modules in the network is community detection. RESULTS: In order to identify disease modules from gene co-expression networks, a community detection method is proposed based on multi-objective optimization genetic algorithm with decomposition. The method is named DM-MOGA and possesses two highlights. First, the boundary correction strategy is designed for the modules obtained in the process of local module detection and pre-simplification. Second, during the evolution, we introduce Davies-Bouldin index and clustering coefficient as fitness functions which are improved and migrated to weighted networks. In order to identify modules that are more relevant to diseases, the above strategies are designed to consider the network topology of genes and the strength of connections with other genes at the same time. Experimental results of different gene expression datasets of non-small cell lung cancer demonstrate that the core modules obtained by DM-MOGA are more effective than those obtained by several other advanced module identification methods. CONCLUSIONS: The proposed method identifies disease-relevant modules by optimizing two novel fitness functions to simultaneously consider the local topology of each gene and its connection strength with other genes. The association of the identified core modules with lung cancer has been confirmed by pathway and gene ontology enrichment analysis.


Assuntos
Carcinoma Pulmonar de Células não Pequenas , Neoplasias Pulmonares , Humanos , Carcinoma Pulmonar de Células não Pequenas/genética , Neoplasias Pulmonares/genética , Redes Reguladoras de Genes , Análise em Microsséries , Algoritmos , Perfilação da Expressão Gênica/métodos
7.
IEEE/ACM Trans Comput Biol Bioinform ; 20(3): 1774-1782, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-36251902

RESUMO

With the development of bioinformatics, the important role played by lncRNAs in various intractable diseases has aroused the interest of many experts. In recent studies, researchers have found that several human diseases are related to lncRANs. Moreover, it is very difficult and expensive to explore the unknown lncRNA-disease associations (LDAs), so only a few associations have been confirmed. It is vital to find a more accurate and effective method to identify potential LDAs. In this study, a method of collaborative matrix factorization based on correntropy (LDCMFC) is proposed for the identification of potential LDAs. To improve the robustness of the algorithm, the traditional minimization of the Euclidean distance is replaced with the maximized correntropy. In addition, the weighted K nearest known neighbor (WKNKN) method is used to rebuild the adjacency matrix. Finally, the performance of LDCMFC is tested by 5-fold cross-validation. Compared with other traditional methods, LDACMFC obtains a higher AUC of 0.8628. In different types of studies of three important cancer cases, most of the potentially relevant lncRNAs derived from the experiments have been validated in the databases. The final result shows that LDCMFC is a feasible method to predict LDAs.


Assuntos
RNA Longo não Codificante , Humanos , RNA Longo não Codificante/genética , Algoritmos , Biologia Computacional/métodos , Bases de Dados Factuais , Análise por Conglomerados
8.
Exp Biol Med (Maywood) ; 248(3): 232-241, 2023 02.
Artigo em Inglês | MEDLINE | ID: mdl-36573462

RESUMO

Cancer is one of the major contributors to human mortality and has a serious influence on human survival and health. In biomedical research, the identification of cancer driver genes (cancer drivers for short) is an important task; cancer drivers can promote the progression and generation of cancer. To identify cancer drivers, many methods have been developed. These computational models only identify coding cancer drivers; however, non-coding drivers likewise play significant roles in the progression of cancer. Hence, we propose a Network-based Method for identifying cancer Driver Genes based on node Control Centrality (NMDGCC), which can identify coding and non-coding cancer driver genes. The process of NMDGCC for identifying driver genes mainly includes the following two steps. In the first step, we construct a gene interaction network by using mRNAs and miRNAs expression data in the cancer state. In the second step, the control centrality of the node is used to identify cancer drivers in the constructed network. We use the breast cancer dataset from The Cancer Genome Atlas (TCGA) to verify the effectiveness of NMDGCC. Compared with the existing methods of cancer driver genes identification, NMDGCC has a better performance. NMDGCC also identifies 295 miRNAs as non-coding cancer drivers, of which 158 are related to tumorigenesis of BRCA. We also apply NMDGCC to identify driver genes related to the different breast cancer subtypes. The result shows that NMDGCC detects many cancer drivers of specific cancer subtypes.


Assuntos
Neoplasias da Mama , MicroRNAs , Humanos , Feminino , Oncogenes , Neoplasias da Mama/genética , MicroRNAs/genética , Carcinogênese/genética , Transformação Celular Neoplásica
9.
Medicine (Baltimore) ; 101(36): e30417, 2022 Sep 09.
Artigo em Inglês | MEDLINE | ID: mdl-36086762

RESUMO

BACKGROUND: The incidence of threatened abortion (TA) is increasing due to poor diet and living habits, which brings great pressure to pregnant women and their families. Huangqin-Baizhu herb pair recorded in ancient books of traditional Chinese medicine has been widely used in the treatment of TA with remarkable effect. In this study, we will use the network pharmacology method to predict the target and mechanism of Huangqin-Baizhu herb pair. METHODS: Traditional Chinese Medicine Systems Pharmacology Database and Analysis Platform database was used to screen the active components of Huangqin-Baizhu herb pair. Pubchem and Swiss Target Prediction databases were used to predict the action targets. Genecards, OMIM, and Drugbank databases were used to predict the related targets of TA. The intersection of drug target and disease target was selected and the intersection genes were uploaded to STRING database to construct protein-protein interaction network and conduct module analysis. Metascape database was used for Gene Ontology and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analysis, which was imported into Cytoscape software to construct component-pathway-gene network and finally verified by molecular docking. Ethical approval and informed consent of patients are not required because the data used in this study is publicly available and does not involve individual patient data or privacy. RESULTS: The main active components of the herb pair are baicalein, flavanone, and norwogonin, etc. The main targets are AKT1, VEGFA, STAT3, MAPK1, SRC, etc. Cluster module analysis shows that the targets are related to cell metabolism, immune regulation and hormone level regulation. There were 2073, 3169, and 161 KEGG pathways involved in the biological processes, cell components, and molecular functions of Gene Ontology analysis, respectively. The main KEGG pathways involved in the intervention were HIF1 signaling pathway, PI3K-Akt signaling pathway, and Rap1 signaling pathway. Molecular docking showed that the main active components of the herb pair were well combined with the key targets. CONCLUSIONS: In this study, 42 active components, 152 potential targets and 11 key targets of Huangqin-Baizhu herb pair for the treatment of TA were revealed, participating in multiple signaling pathways such as PI3K-Akt, providing a theoretical basis for further experimental research.


Assuntos
Ameaça de Aborto , Medicamentos de Ervas Chinesas , Ameaça de Aborto/tratamento farmacológico , Medicamentos de Ervas Chinesas/farmacologia , Medicamentos de Ervas Chinesas/uso terapêutico , Feminino , Humanos , Simulação de Acoplamento Molecular , Fosfatidilinositol 3-Quinases , Gravidez , Proteínas Proto-Oncogênicas c-akt , Scutellaria baicalensis
10.
Int J Mol Sci ; 23(17)2022 Aug 30.
Artigo em Inglês | MEDLINE | ID: mdl-36077236

RESUMO

Compared to single-drug therapy, drug combinations have shown great potential in cancer treatment. Most of the current methods employ genomic data and chemical information to construct drug-cancer cell line features, but there is still a need to explore methods to combine topological information in the protein interaction network (PPI). Therefore, we propose a network-embedding-based prediction model, NEXGB, which integrates the corresponding protein modules of drug-cancer cell lines with PPI network information. NEXGB extracts the topological features of each protein node in a PPI network by struc2vec. Then, we combine the topological features with the target protein information of drug-cancer cell lines, to generate drug features and cancer cell line features, and utilize extreme gradient boosting (XGBoost) to predict the synergistic relationship between drug combinations and cancer cell lines. We apply our model on two recently developed datasets, the Oncology-Screen dataset (Oncology-Screen) and the large drug combination dataset (DrugCombDB). The experimental results show that NEXGB outperforms five current methods, and it effectively improves the predictive power in discovering relationships between drug combinations and cancer cell lines. This further demonstrates that the network information is valid for detecting combination therapies for cancer and other complex diseases.


Assuntos
Protocolos de Quimioterapia Combinada Antineoplásica , Neoplasias , Protocolos de Quimioterapia Combinada Antineoplásica/uso terapêutico , Combinação de Medicamentos , Genômica , Humanos , Neoplasias/tratamento farmacológico , Neoplasias/genética , Mapas de Interação de Proteínas , Proteínas/uso terapêutico
11.
G3 (Bethesda) ; 12(11)2022 11 04.
Artigo em Inglês | MEDLINE | ID: mdl-36124952

RESUMO

Tumor stratification plays an important role in cancer diagnosis and individualized treatment. Recent developments in high-throughput sequencing technologies have produced huge amounts of multi-omics data, making it possible to stratify cancer types using multiple molecular datasets. We introduce a Network Embedding method for tumor Stratification by integrating Multi-omics data. Network Embedding method for tumor Stratification by integrating Multi-omics pregroup the samples, integrate the gene features and somatic mutation corresponding to cancer types within each group to construct patient features, and then integrate all groups to obtain comprehensive patient information. The gene features contain network topology information, because it is extracted by integrating deoxyribonucleic acid methylation, messenger ribonucleic acid expression data, and protein-protein interactions through network embedding method. On the one hand, a supervised learning method Light Gradient Boosting Machine is used to classify cancer types based on patient features. When compared with other 3 methods, Network Embedding method for tumor Stratification by integrating Multi-omics has the highest AUC in most cancer types. The average AUC for stratifying cancer types is 0.91, indicating that the patient features extracted by Network Embedding method for tumor Stratification by integrating Multi-omics are effective for tumor stratification. On the other hand, an unsupervised clustering algorithm Density-Based Spatial Clustering of Applications with Noise is utilized to divide single cancer subtypes. The vast majority of the subtypes identified by Network Embedding method for tumor Stratification by integrating Multi-omics are significantly associated with patient survival.


Assuntos
Neoplasias , Humanos , Neoplasias/diagnóstico , Neoplasias/genética , Análise por Conglomerados , Algoritmos , Sequenciamento de Nucleotídeos em Larga Escala
12.
Medicine (Baltimore) ; 101(25): e29434, 2022 Jun 24.
Artigo em Inglês | MEDLINE | ID: mdl-35758378

RESUMO

BACKGROUND: In recent years, clinical studies have found that there is a close relationship between osteoporosis and polycystic ovary syndrome. However, there are few literature on the pathogenesis of osteoporosis and polycystic ovary syndrome. In order to clarify their common pathogenic mechanism and provide potential targets for drugs to regulate them at the same time, bioinformatics methods are used to explore, so as to provide a new direction for the study of the relationship between diseases in the future. METHODS: To screen the targets of osteoporosis and polycystic ovary syndrome by Genecards, Online Mendelian Inheritance in Man databases and Therapeutic Target Database to take the intersection of the two mappings and upload the intersection targets to the STRING database to construct protein-protein interaction network; to screen the core targets by degree value and import them to Metascape database for Gene Ontology and Kyoto Encyclopedia of Genes and Genomes pathway analysis; and finally, to construct the visualization network of core targets and pathways by Cytoscape software. Ethical approval and informed consent of patients are not required because the data used in this study is publicly available and does not involve individual patient data or privacy. RESULTS: The core targets of polycystic ovary syndrome and osteoporosis were insulin gene, insulin-like growth factor 1, CTNNB1, serine/threonine kinase 1, signal transducer and activator of transcription 3, LEP, etc. The biological processes involved include the regulation of protein phosphorylation, cell proliferation and differentiation, hormone endocrine, reproductive system and skeletal system. The related pathways were concentrated in Foxo signaling pathway, HTLV-I infection, PI3K-AKT signaling pathway, MAPK signaling pathway and AGE-RAGE signaling pathway in diabetic complications. CONCLUSIONS: There is a close relationship between osteoporosis and polycystic ovary syndrome in terms of target and molecular mechanism. This study used bioinformatics to clarify their targets and mechanisms, providing potential targets for drugs to regulate both diseases simultaneously and providing new directions to explore the relationship between the diseases.


Assuntos
Medicamentos de Ervas Chinesas , Osteoporose , Síndrome do Ovário Policístico , Biologia Computacional , Medicamentos de Ervas Chinesas/uso terapêutico , Feminino , Humanos , Osteoporose/tratamento farmacológico , Osteoporose/genética , Fosfatidilinositol 3-Quinases/metabolismo , Síndrome do Ovário Policístico/complicações , Síndrome do Ovário Policístico/tratamento farmacológico , Síndrome do Ovário Policístico/genética
13.
J Bioinform Comput Biol ; 20(2): 2250002, 2022 04.
Artigo em Inglês | MEDLINE | ID: mdl-35191362

RESUMO

Tensor Robust Principal Component Analysis (TRPCA) has achieved promising results in the analysis of genomics data. However, the TRPCA model under the existing tensor singular value decomposition ([Formula: see text]-SVD) framework insufficiently extracts the potential low-rank structure of the data, resulting in suboptimal restored components. Simultaneously, the tensor nuclear norm (TNN) defined based on [Formula: see text]-SVD uses the same standard to handle various singular values. TNN ignores the difference of singular values, leading to the failure of the main information that needs to be well preserved. To preserve the heterogeneous structure in the low-rank information, we propose a novel TNN and extend it to the TRPCA model. Potential low-rank space may contain important information. We learn the low-rank structural information from the core tensor. The singular value space contains the association information between genes and cancers. The [Formula: see text]-shrinkage generalized threshold function is utilized to preserve the low-rank properties of larger singular values. The optimization problem is solved by the alternating direction method of the multiplier (ADMM) algorithm. Clustering and feature selection experiments are performed on the TCGA data set. The experimental results show that the proposed model is more promising than other state-of-the-art tensor decomposition methods.


Assuntos
Algoritmos , Neoplasias , Análise por Conglomerados , Genômica , Humanos , Neoplasias/genética , Análise de Componente Principal
14.
IEEE J Biomed Health Inform ; 26(7): 3578-3589, 2022 07.
Artigo em Inglês | MEDLINE | ID: mdl-35157604

RESUMO

Cancer genome data generally consists of multiple views from different sources. These views provide different levels of information about gene activity, as well as more comprehensive cancer information. The low-rank representation (LRR) method, as a powerful subspace clustering method, has been extended and applied in cancer data research. Although the multi-view learning methods based on low rank representation have achieved good results in cancer multi-omics analysis because they fully consider the consistency and complementarity between views, these methods have some shortcomings in mining the potential local geometry of data. In view of this, this paper proposes a new method named Multi-view Random-walk Graph regularization Low-Rank Representation (MRGLRR) to comprehensively analyze multi-view genomics data. This method uses multi-view model to find the common centroid of view. By constructing a joint affinity matrix to learn the low-rank subspace representation of multiple sets of data, the hidden information of each view is fully obtained. In addition, this method introduces random walk graph regularization constraint to obtain more accurate similarity between samples. Different from the traditional graph regularization constraint, after constructing the KNN graph, we use the random walk algorithm to obtain the weight matrix. The random walk algorithm can retain more local geometric information and better learn the topological structure of the data. What's more, a feature gene selection strategy suitable for multi-view model is proposed to find more differentially expressed genes with research value. Experimental results show that our method is better than other representative methods in terms of clustering and feature gene selection for cancer multi-omics data.


Assuntos
Algoritmos , Neoplasias , Análise por Conglomerados , Genômica , Humanos , Neoplasias/genética , Caminhada
15.
Interdiscip Sci ; 14(1): 22-33, 2022 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-34115312

RESUMO

In recent years, clustering analysis of cancer genomics data has gained widespread attention. However, limited by the dimensions of the matrix, the traditional methods cannot fully mine the underlying geometric structure information in the data. Besides, noise and outliers inevitably exist in the data. To solve the above two problems, we come up with a new method which uses tensor to represent cancer omics data and applies hypergraph to save the geometric structure information in original data. This model is called hypergraph regularized tensor robust principal component analysis (HTRPCA). The data processed by HTRPCA becomes two parts, one of which is a low-rank component that contains pure underlying structure information between samples, and the other is some sparse interference points. So we can use the low-rank component for clustering. This model can retain complex geometric information between more sample points due to the addition of the hypergraph regularization. Through clustering, we can demonstrate the effectiveness of HTRPCA, and the experimental results on TCGA datasets demonstrate that HTRPCA precedes other advanced methods. This paper proposes a new method of using tensors to represent cancer omics data and introduces hypergraph items to save the geometric structure information of the original data. At the same time, the model decomposes the original tensor into low-order tensors and sparse tensors. The low-rank tensor was used to cluster cancer samples to verify the effectiveness of the method.


Assuntos
Algoritmos , Neoplasias , Análise por Conglomerados , Genômica , Humanos , Neoplasias/genética , Análise de Componente Principal
16.
IEEE/ACM Trans Comput Biol Bioinform ; 19(4): 2420-2430, 2022.
Artigo em Inglês | MEDLINE | ID: mdl-33690124

RESUMO

Extracting genes involved in cancer lesions from gene expression data is critical for cancer research and drug development. The method of feature selection has attracted much attention in the field of bioinformatics. Principal Component Analysis (PCA) is a widely used method for learning low-dimensional representation. Some variants of PCA have been proposed to improve the robustness and sparsity of the algorithm. However, the existing methods ignore the high-order relationships between data. In this paper, a new model named Robust Principal Component Analysis via Hypergraph Regularization (HRPCA) is proposed. In detail, HRPCA utilizes L2,1-norm to reduce the effect of outliers and make data sufficiently row-sparse. And the hypergraph regularization is introduced to consider the complex relationship among data. Important information hidden in the data are mined, and this method ensures the accuracy of the resulting data relationship information. Extensive experiments on multi-view biological data demonstrate that the feasible and effective of the proposed approach.


Assuntos
Biologia Computacional , Neoplasias , Algoritmos , Análise por Conglomerados , Biologia Computacional/métodos , Humanos , Neoplasias/genética , Neoplasias/metabolismo , Análise de Componente Principal
17.
Front Genet ; 12: 678642, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-34367241

RESUMO

Pancreatic cancer (PC) is a highly fatal disease, yet its causes remain unclear. Comprehensive analysis of different types of PC genetic data plays a crucial role in understanding its pathogenic mechanisms. Currently, non-negative matrix factorization (NMF)-based methods are widely used for genetic data analysis. Nevertheless, it is a challenge for them to integrate and decompose different types of genetic data simultaneously. In this paper, a non-NMF network analysis method, NMFNA, is proposed, which introduces a graph-regularized constraint to the NMF, for identifying modules and characteristic genes from two-type PC data of methylation (ME) and copy number variation (CNV). Firstly, three PC networks, i.e., ME network, CNV network, and ME-CNV network, are constructed using the Pearson correlation coefficient (PCC). Then, modules are detected from these three PC networks effectively due to the introduced graph-regularized constraint, which is the highlight of the NMFNA. Finally, both gene ontology (GO) and pathway enrichment analyses are performed, and characteristic genes are detected by the multimeasure score, to deeply understand biological functions of PC core modules. Experimental results demonstrated that the NMFNA facilitates the integration and decomposition of two types of PC data simultaneously and can further serve as an alternative method for detecting modules and characteristic genes from multiple genetic data of complex diseases.

18.
Shanghai Kou Qiang Yi Xue ; 30(2): 206-209, 2021 Apr.
Artigo em Chinês | MEDLINE | ID: mdl-34109364

RESUMO

PURPOSE: To evaluate the sedative effect of dexmedetomidine in awake intubation and its influence on swallowing function. METHODS: Fifty patients with awake intubation in oral and maxillofacial surgery were randomly divided into two groups: dexmedetomidine(DEX) group and midazolam+fentanyl(MF) group. 15 min before intubation, patients in DEX group were intravenously given 50 mL dexmedetomidine(1.0 µg/kg), and others in MF group were intravenously given 50 mL normal saline respectively. 5 min before intubation, 10 mL normal saline was given to DEX group, 0.02 mg/kg midazolam and 2.0 µg/kg fentanyl were given to MF group. HR, MAP, RR, SpO2, Ramsay sedation score and swallowing time were measured at different time points (before induction-T0, before intubation-T1 and after intubation-T2). SPSS 20.0 software package was used for data analysis. RESULTS: There was no significant difference in HR, RR, MAP, SpO2 and swallowing time between the two groups at T0 time point(P>0.05). Compared with MF group, HR significantly decreased and swallowing time significantly shortened(P<0.05). RR, MAP, SpO2 and Ramsay sedation score had no significant difference (P>0.05) in DEX group at T1 time point. Compared with MF group, HR significantly decreased and Ramsay sedation score significantly increased(P<0.05); RR, MAP and SpO2 had no significant difference (P>0.05) in DEX group at T2 time point. Compared with T0 time, HR significantly decreased and swallowing time significantly prolonged (P<0.05); RR, MAP and SpO2 had no significant difference(P>0.05) in DEX group at T1 time point. Compared with T1 time, Ramsay sedation score decreased with significant difference(P<0.05) at T2 time point. CONCLUSIONS: Dexmedetomidine can provide good sedative effect for patients with awaking intubation without obvious inhibitory effect on swallowing reflex of patients, improving the safety of intubation.


Assuntos
Dexmedetomidina , Cirurgia Bucal , Deglutição , Humanos , Hipnóticos e Sedativos , Intubação Intratraqueal
19.
BMC Bioinformatics ; 22(Suppl 3): 241, 2021 May 12.
Artigo em Inglês | MEDLINE | ID: mdl-33980147

RESUMO

BACKGROUND: In the development of science and technology, there are increasing evidences that there are some associations between lncRNAs and human diseases. Therefore, finding these associations between them will have a huge impact on our treatment and prevention of some diseases. However, the process of finding the associations between them is very difficult and requires a lot of time and effort. Therefore, it is particularly important to find some good methods for predicting lncRNA-disease associations (LDAs). RESULTS: In this paper, we propose a method based on dual sparse collaborative matrix factorization (DSCMF) to predict LDAs. The DSCMF method is improved on the traditional collaborative matrix factorization method. To increase the sparsity, the L2,1-norm is added in our method. At the same time, Gaussian interaction profile kernel is added to our method, which increase the network similarity between lncRNA and disease. Finally, the AUC value obtained by the experiment is used to evaluate the quality of our method, and the AUC value is obtained by the ten-fold cross-validation method. CONCLUSIONS: The AUC value obtained by the DSCMF method is 0.8523. At the end of the paper, simulation experiment is carried out, and the experimental results of prostate cancer, breast cancer, ovarian cancer and colorectal cancer are analyzed in detail. The DSCMF method is expected to bring some help to lncRNA-disease associations research. The code can access the https://github.com/Ming-0113/DSCMF website.


Assuntos
Neoplasias da Mama , Neoplasias da Próstata , RNA Longo não Codificante , Algoritmos , Simulação por Computador , Humanos , Masculino , Neoplasias da Próstata/genética , RNA Longo não Codificante/genética
20.
Front Genet ; 12: 621317, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-33708239

RESUMO

The dimensionality reduction method accompanied by different norm constraints plays an important role in mining useful information from large-scale gene expression data. In this article, a novel method named Lp-norm and L2,1-norm constrained graph Laplacian principal component analysis (PL21GPCA) based on traditional principal component analysis (PCA) is proposed for robust tumor sample clustering and gene network module discovery. Three aspects are highlighted in the PL21GPCA method. First, to degrade the high sensitivity to outliers and noise, the non-convex proximal Lp-norm (0 < p < 1)constraint is applied on the loss function. Second, to enhance the sparsity of gene expression in cancer samples, the L2,1-norm constraint is used on one of the regularization terms. Third, to retain the geometric structure of the data, we introduce the graph Laplacian regularization item to the PL21GPCA optimization model. Extensive experiments on five gene expression datasets, including one benchmark dataset, two single-cancer datasets from The Cancer Genome Atlas (TCGA), and two integrated datasets of multiple cancers from TCGA, are performed to validate the effectiveness of our method. The experimental results demonstrate that the PL21GPCA method performs better than many other methods in terms of tumor sample clustering. Additionally, this method is used to discover the gene network modules for the purpose of finding key genes that may be associated with some cancers.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA