RESUMO
The high dimensionality and noise challenges in genomic data make it difficult for traditional clustering methods. Existing multi-kernel clustering methods aim to improve the quality of the affinity matrix by learning a set of base kernels, thereby enhancing clustering performance. However, directly learning from the original base kernels presents challenges in handling errors and redundancies when dealing with high-dimensional data, and there is still a lack of feasible multi-kernel fusion strategies. To address these issues, we propose a Multi-Kernel Clustering method with Tensor fusion on Grassmann manifolds, called MKCTM. Specifically, we maximize the clustering consensus among base kernels by imposing tensor low-rank constraints to eliminate noise and redundancy. Unlike traditional kernel fusion approaches, our method fuses learned base kernels on the Grassmann manifold, resulting in a final consensus matrix for clustering. We integrate tensor learning and fusion processes into a unified optimization model and propose an effective iterative optimization algorithm for solving it. Experimental results on ten datasets, comparing against 12 popular baseline clustering methods, confirm the superiority of our approach. Our code is available at https://github.com/foureverfei/MKCTM.git.
Assuntos
Algoritmos , Genômica , Genômica/métodos , Análise por Conglomerados , Humanos , SoftwareRESUMO
Hashing technology has exhibited great cross-modal retrieval potential due to its appealing retrieval efficiency and storage effectiveness. Most current supervised cross-modal retrieval methods heavily rely on accurate semantic supervision, which is intractable for annotations with ever-growing sample sizes. By comparison, the existing unsupervised methods rely on accurate sample similarity preservation strategies with intensive computational costs to compensate for the lack of semantic guidance, which causes these methods to lose the power to bridge the semantic gap. Furthermore, both kinds of approaches need to search for the nearest samples among all samples in a large search space, whose process is laborious. To address these issues, this paper proposes an unsupervised dual deep hashing (UDDH) method with semantic-index and content-code for cross-modal retrieval. Deep hashing networks are utilized to extract deep features and jointly encode the dual hashing codes in a collaborative manner with a common semantic index and modality content codes to simultaneously bridge the semantic and heterogeneous gaps for cross-modal retrieval. The dual deep hashing architecture, comprising the head code on semantic index and tail codes on modality content, enhances the efficiency for cross-modal retrieval. A query sample only needs to search for the retrieved samples with the same semantic index, thus greatly shrinking the search space and achieving superior retrieval efficiency. UDDH integrates the learning processes of deep feature extraction, binary optimization, common semantic index, and modality content code within a unified model, allowing for collaborative optimization to enhance the overall performance. Extensive experiments are conducted to demonstrate the retrieval superiority of the proposed approach over the state-of-the-art baselines.
RESUMO
MOTIVATION: Neoantigens, derived from somatic mutations in cancer cells, can elicit anti-tumor immune responses when presented to autologous T cells by human leukocyte antigen. Identifying immunogenic neoantigens is crucial for cancer immunotherapy development. However, the accuracy of current bioinformatic methods remains unsatisfactory. Surface and structural features of peptide-HLA class I (pHLA-I) complexes offer valuable insight into the immunogenicity of neoantigens. RESULTS: We present NeoaPred, a deep-learning framework for neoantigen prediction. NeoaPred accurately constructs pHLA-I complex structures, with 82.37% of the predicted structures showing an RMSD of < 1 Å. Using these structures, NeoaPred integrates differences in surface, structural, and atom group features between the mutant peptide and its wild-type counterpart to predict a foreignness score. This foreignness score is an effective factor for neoantigen prediction, achieving an AUROC (Area Under the Receiver Operating Characteristic Curve) of 0.81 and an AUPRC (Area Under the Precision-Recall Curve) of 0.54 in the test set, outperforming existing methods. AVAILABILITY AND IMPLEMENTATION: The source code is released under an Apache v2.0 license and is available at the GitHub repository (https://github.com/Dulab2020/NeoaPred).
Assuntos
Antígenos de Neoplasias , Aprendizado Profundo , Peptídeos , Humanos , Antígenos de Neoplasias/imunologia , Antígenos de Neoplasias/química , Peptídeos/química , Peptídeos/imunologia , Antígenos HLA/imunologia , Antígenos HLA/química , Biologia Computacional/métodos , Antígenos de Histocompatibilidade Classe I/química , Antígenos de Histocompatibilidade Classe I/imunologia , Software , Neoplasias/imunologiaRESUMO
BACKGROUND: Determining the proper iliosacral screw orientation in a dysmorphic S1 sacral segment using a C-arm is difficult, and pelvic computed tomography (CT) is often necessary for the preoperative planning. On the preoperative pelvic axial CT section, the intended screw trajectory can be delineated intraosseously along the axis of the oblique osseous corridor. An inherently accurate orientation would be determined by 2 factors: (1) the trajectory is in the pelvic transverse plane, and (2) it is oriented relative to the coronal plane at a patient-specific angle, which should be measured preoperatively. Based on the above reasoning, we aimed to simplify and verify the orientation. METHODS: After establishing the starting point on the sacral lateral view, we tested a method of simplifying the guidewire orientation: placing the guidewire in the pelvic transverse plane and then manipulating it to be angled relative to the coronal plane at the preoperatively measured patient-specific angle. The guidewire orientation should then be reproducibly accurate on the pelvic outlet and inlet views. The feasibility and safety of our method were verified through computer-simulated virtual surgical procedures in 95 dysmorphic sacra and clinical surgical procedures in 12 patients. The primary outcome parameters were the guidewire orientation and screw placement accuracy. RESULTS: Using our method, the S1 guidewire orientation was reproducibly accurate on the pelvic outlet and inlet views in all of the virtual and clinical surgical procedures. Ninety-five virtual S1 screws (1 screw in each left hemipelvis) were placed intraosseously in the pelvic transverse plane. Fourteen unilateral S1 screws were placed intraosseously in the pelvic transverse plane in the 12 patients (2 patients had double screws) without iatrogenic injuries. CONCLUSIONS: The guidewire orientation can be simplified by placing the guidewire in the pelvic transverse plane and replicating the preoperatively measured patient-specific angle between the guidewire and the coronal plane. After establishing the starting point on the sacral lateral view, our simplified manipulation yields a reproducibly accurate orientation on the pelvic outlet and inlet views. LEVEL OF EVIDENCE: Therapeutic Level III. See Instructions for Authors for a complete description of levels of evidence.
RESUMO
Increasing evidence has indicated that RNA-binding proteins (RBPs) play an essential role in mediating alternative splicing (AS) events during epithelial-mesenchymal transition (EMT). However, due to the substantial cost and complexity of biological experiments, how AS events are regulated and influenced remains largely unknown. Thus, it is important to construct effective models for inferring hidden RBP-AS event associations during EMT process. In this paper, a novel and efficient model was developed to identify AS event-related candidate RBPs based on Adaptive Graph-based Multi-Label learning (AGML). In particular, we propose to adaptively learn a new affinity graph to capture the intrinsic structure of data for both RBPs and AS events. Multi-view similarity matrices are employed for maintaining the intrinsic structure and guiding the adaptive graph learning. We then simultaneously update the RBP and AS event associations that are predicted from both spaces by applying multi-label learning. The experimental results have shown that our AGML achieved AUC values of 0.9521 and 0.9873 by 5-fold and leave-one-out cross-validations, respectively, indicating the superiority and effectiveness of our proposed model. Furthermore, AGML can serve as an efficient and reliable tool for uncovering novel AS events-associated RBPs and is applicable for predicting the associations between other biological entities. The source code of AGML is available at https://github.com/yushanqiu/AGML.
RESUMO
Mild cognitive impairment (MCI) represents an early stage of Alzheimer's disease (AD), characterized by subtle clinical symptoms that pose challenges for accurate diagnosis. The quest for the identification of MCI individuals has highlighted the importance of comprehending the underlying mechanisms of disease causation. Integrated analysis of brain imaging and genomics offers a promising avenue for predicting MCI risk before clinical symptom onset. However, most existing methods face challenges in: 1) mining the brain network-specific topological structure and addressing the single nucleotide polymorphisms (SNPs)-related noise contamination and 2) extracting the discriminative properties of brain imaging genomics, resulting in limited accuracy for MCI diagnosis. To this end, a modality-aware discriminative fusion network (MA-DFN) is proposed to integrate the complementary information from brain imaging genomics to diagnose MCI. Specifically, we first design two modality-specific feature extraction modules: the graph convolutional network with edge-augmented self-attention module (GCN-EASA) and the deep adversarial denoising autoencoder module (DAD-AE), to capture the topological structure of brain networks and the intrinsic distribution of SNPs. Subsequently, a discriminative-enhanced fusion network with correlation regularization module (DFN-CorrReg) is employed to enhance inter-modal consistency and between-class discrimination in brain imaging and genomics. Compared to other state-of-the-art approaches, MA-DFN not only exhibits superior performance in stratifying cognitive normal (CN) and MCI individuals but also identifies disease-related brain regions and risk SNPs locus, which hold potential as putative biomarkers for MCI diagnosis.
RESUMO
Traditional clustering methods rely on pairwise affinity to divide samples into different subgroups. However, high-dimensional small-sample (HDLSS) data are affected by the concentration effects, rendering traditional pairwise metrics unable to accurately describe relationships between samples, leading to suboptimal clustering results. This article advances the proposition of employing high-order affinities to characterize multiple sample relationships as a strategic means to circumnavigate the concentration effects. We establish a nexus between different order affinities by constructing specialized decomposable high-order affinities, thereby formulating a uniform mathematical framework. Building upon this insight, a novel clustering method named uniform tensor clustering (UTC) is proposed, which learns a consensus low-dimensional embedding for clustering by the synergistic exploitation of multiple-order affinities. Extensive experiments on synthetic and real-world datasets demonstrate two findings: 1) high-order affinities are better suited for characterizing sample relationships in complex data and 2) reasonable use of different order affinities can enhance clustering effectiveness, especially in handling high-dimensional data.
RESUMO
Tensor spectral clustering (TSC) is a recently proposed approach to robustly group data into underlying clusters. Unlike the traditional spectral clustering (SC), which merely uses pairwise similarities of data in an affinity matrix, TSC aims at exploring their multiwise similarities in an affinity tensor to achieve better performance. However, the performance of TSC highly relies on the design of multiwise similarities, and it remains unclear especially for high-dimension-low-sample-size (HDLSS) data. To this end, this article has proposed a discriminating TSC (DTSC) for HDLSS data. Specifically, DTSC uses the proposed discriminating affinity tensor that encodes the pair-to-pair similarities, which are particularly constructed by the anchor-based distance. HDLSS asymptotic analysis shows that the proposed affinity tensor can explicitly differentiate samples from different clusters when the feature dimension is large. This theoretical property allows DTSC to improve the clustering performance on HDLSS data. Experimental results on synthetic and benchmark datasets demonstrate the effectiveness and robustness of the proposed method in comparison to several baseline methods.
RESUMO
Recently, single-image SVBRDF capture is formulated as a regression problem, which uses a network to infer four SVBRDF maps from a flash-lit image. However, the accuracy is still not satisfactory since previous approaches usually adopt endto-end inference strategies. To mitigate the challenge, we propose "auxiliary renderings" as the intermediate regression targets, through which we divide the original end-to-end regression task into several easier sub-tasks, thus achieving better inference accuracy. Our contributions are threefold. First, we design three (or two pairs of) auxiliary renderings and summarize the motivations behind the designs. By our design, the auxiliary images are bumpiness-flattened or highlight-removed, containing disentangled visual cues about the final SVBRDF maps and can be easily transformed to the final maps. Second, to help estimate the auxiliary targets from the input image, we propose two mask images including a bumpiness mask and a highlight mask. Our method thus first infers mask images, then with the help of the mask images infers auxiliary renderings, and finally transforms the auxiliary images to SVBRDF maps. Third, we propose backbone UNets to infer mask images, and gated deformable UNets for estimating auxiliary targets. Thanks to the well designed networks and intermediate images, our method outputs better SVBRDF maps than previous approaches, validated by the extensive comparisonal and ablation experiments.
RESUMO
Emerging research indicates that the degenerative biomarkers associated with Alzheimer's disease (AD) exhibit a non-random distribution within the cerebral cortex, instead following the structural brain network. The alterations in brain networks occur much earlier than the onset of clinical symptoms, thereby affecting the progression of brain disease. In this context, the utilization of computational methods to ascertain the propagation patterns of neuropathological events would contribute to the comprehension of the pathophysiological mechanism involved in the evolution of AD. Despite the encouraging findings achieved by existing graph-based deep learning approaches in analyzing irregular graph data, their applications in identifying the spreading pathway of neuropathology are limited due to two disadvantages. They include (1) lack of a common brain network as an unbiased reference basis for group comparison, and (2) lack of an appropriate mechanism for the identification of propagation patterns. To this end, we propose a proof-of-concept harmonic wavelet neural network (HWNN) to predict the early stage of AD and localize disease-related significant wavelets, which can be used to characterize the spreading pathways of neuropathological events across the brain network. The extensive experiments constructed on both synthetic and real datasets demonstrate that our proposed method achieves superior performance in classification accuracy and statistical power of identifying propagation patterns, compared with other representative approaches.
RESUMO
Histopathology image-based survival prediction aims to provide a precise assessment of cancer prognosis and can inform personalized treatment decision-making in order to improve patient outcomes. However, existing methods cannot automatically model the complex correlations between numerous morphologically diverse patches in each whole slide image (WSI), thereby preventing them from achieving a more profound understanding and inference of the patient status. To address this, here we propose a novel deep learning framework, termed dual-stream multi-dependency graph neural network (DM-GNN), to enable precise cancer patient survival analysis. Specifically, DM-GNN is structured with the feature updating and global analysis branches to better model each WSI as two graphs based on morphological affinity and global co-activating dependencies. As these two dependencies depict each WSI from distinct but complementary perspectives, the two designed branches of DM-GNN can jointly achieve the multi-view modeling of complex correlations between the patches. Moreover, DM-GNN is also capable of boosting the utilization of dependency information during graph construction by introducing the affinity-guided attention recalibration module as the readout function. This novel module offers increased robustness against feature perturbation, thereby ensuring more reliable and stable predictions. Extensive benchmarking experiments on five TCGA datasets demonstrate that DM-GNN outperforms other state-of-the-art methods and offers interpretable prediction insights based on the morphological depiction of high-attention patches. Overall, DM-GNN represents a powerful and auxiliary tool for personalized cancer prognosis from histopathology images and has great potential to assist clinicians in making personalized treatment decisions and improving patient outcomes.
Assuntos
Redes Neurais de Computação , Humanos , Análise de Sobrevida , Aprendizado Profundo , Neoplasias/diagnóstico por imagem , Neoplasias/mortalidade , Interpretação de Imagem Assistida por Computador/métodos , PrognósticoRESUMO
3D neural rendering enables photo-realistic reconstruction of a specific scene by encoding discontinuous inputs into a neural representation. Despite the remarkable rendering results, the storage of network parameters is not transmission-friendly and not extendable to metaverse applications. In this paper, we propose an invertible neural rendering approach that enables generating an interactive 3D model from a single image (i.e., 3D Snapshot). Our idea is to distill a pre-trained neural rendering model (e.g., NeRF) into a visualizable image form that can then be easily inverted back to a neural network. To this end, we first present a neural image distillation method to optimize three neural planes for representing the original neural rendering model. However, this representation is noisy and visually meaningless. We thus propose a dynamic invertible neural network to embed this noisy representation into a plausible image representation of the scene. We demonstrate promising reconstruction quality quantitatively and qualitatively, by comparing to the original neural rendering model, as well as video-based invertible methods. On the other hand, our method can store dozens of NeRFs with a compact restoration network (5MB), and embedding each 3D scene takes up only 160KB of storage. More importantly, our approach is the first solution that allows embedding a neural rendering model into image representations, which enables applications like creating an interactive 3D model from a printed image in the metaverse.
RESUMO
The adenosine A3 receptor (A3AR), a key member of the G protein-coupled receptor family, is a promising therapeutic target for inflammatory and cancerous conditions. The selective A3AR agonists, CF101 and CF102, are clinically significant, yet their recognition mechanisms remained elusive. Here we report the cryogenic electron microscopy structures of the full-length human A3AR bound to CF101 and CF102 with heterotrimeric Gi protein in complex at 3.3-3.2 Å resolution. These agonists reside in the orthosteric pocket, forming conserved interactions via their adenine moieties, while their 3-iodobenzyl groups exhibit distinct orientations. Functional assays reveal the critical role of extracellular loop 3 in A3AR's ligand selectivity and receptor activation. Key mutations, including His3.37, Ser5.42, and Ser6.52, in a unique sub-pocket of A3AR, significantly impact receptor activation. Comparative analysis with the inactive A2AAR structure highlights a conserved receptor activation mechanism. Our findings provide comprehensive insights into the molecular recognition and signaling of A3AR, paving the way for designing subtype-selective adenosine receptor ligands.
Assuntos
Receptor A3 de Adenosina , Transdução de Sinais , Humanos , Receptor A3 de Adenosina/metabolismo , Microscopia CrioeletrônicaRESUMO
Graph-based multi-view clustering encodes multi-view data into sample affinities to find consensus representation, effectively overcoming heterogeneity across different views. However, traditional affinity measures tend to collapse as the feature dimension expands, posing challenges in estimating a unified alignment that reveals both cross-view and inner relationships. To tackle this challenge, we propose to achieve multi-view uniform clustering via consensus representation co-regularization. First, the sample affinities are encoded by both popular dyadic affinity and recent high-order affinities to comprehensively characterize spatial distributions of the HDLSS data. Second, a fused consensus representation is learned through aligning the multi-view low-dimensional representation by co-regularization. The learning of the fused representation is modeled by a high-order eigenvalue problem within manifold space to preserve the intrinsic connections and complementary correlations of original data. A numerical scheme via manifold minimization is designed to solve the high-order eigenvalue problem efficaciously. Experiments on eight HDLSS datasets demonstrate the effectiveness of our proposed method in comparison with the recent thirteen benchmark methods.
RESUMO
BACKGROUND: Obstructive sleep apnea (OSA) is a common sleep breathing disorder that is often accompanied by changes in structural connectivity (SC) and functional connectivity (FC). However, the current understanding of the interaction between SC and FC in OSA is still limited. METHODS: The aim of this study is to integrate complementary neuroimaging modalities into a unified framework using multi-layer network analysis methods and to reveal their complex interrelationships. We introduce a new graph metric called SC-FC bandwidth, which measures the throughput of SC mediating FC in a multi-layer network. The bandwidth differences between two groups are evaluated using the network-based statistics (NBS) method. Additionally, we traced and analyzed the SC pathways corresponding to the abnormal bandwidth. RESULTS: In both the healthy control and patients with OSA, the majority offunctionally synchronized nodes were connected via SC paths of length 2. With the NBS method, we observed significantly lower bandwidth between the right Posterior cingulate gyrus and right Cuneus, bilateral Middle frontal gyrus, bilateral Gyrus rectus in OSA patients. By tracing the high-proportion SC pathways, it was found that OSA patients typically exhibit a decrease in direct SC-FC, SC-FC triangles, and SC-FC quads intra- and inter-networks. CONCLUSION: Complex interrelationship changes have been observed between the SC and FC in patients with OSA, which might leads to abnormal information transmission and communication in the brain network.
Assuntos
Imageamento por Ressonância Magnética , Apneia Obstrutiva do Sono , Humanos , Imageamento por Ressonância Magnética/métodos , Apneia Obstrutiva do Sono/diagnóstico por imagem , Encéfalo/diagnóstico por imagem , Giro do Cíngulo , Mapeamento EncefálicoRESUMO
Tensor spectral clustering (TSC) is an emerging approach that explores multi-wise similarities to boost learning. However, two key challenges have yet to be well addressed in the existing TSC methods: (1) The construction and storage of high-order affinity tensors to encode the multi-wise similarities are memory-intensive and hampers their applicability, and (2) they mostly employ a two-stage approach that integrates multiple affinity tensors of different orders to learn a consensus tensor spectral embedding, thus often leading to a suboptimal clustering result. To this end, this paper proposes a tensor spectral clustering network (TSC-Net) to achieve one-stage learning of a consensus tensor spectral embedding, while reducing the memory cost. TSC-Net employs a deep neural network that learns to map the input samples to the consensus tensor spectral embedding, guided by a TSC objective with multiple affinity tensors. It uses stochastic optimization to calculate a small part of the affinity tensors, thereby avoiding loading the whole affinity tensors for computation, thus significantly reducing the memory cost. Through using an ensemble of multiple affinity tensors, the TSC can dramatically improve clustering performance. Empirical studies on benchmark datasets demonstrate that TSC-Net outperforms the recent baseline methods.
RESUMO
Cancer is one of the most challenging health problems worldwide. Accurate cancer survival prediction is vital for clinical decision making. Many deep learning methods have been proposed to understand the association between patients' genomic features and survival time. In most cases, the gene expression matrix is fed directly to the deep learning model. However, this approach completely ignores the interactions between biomolecules, and the resulting models can only learn the expression levels of genes to predict patient survival. In essence, the interaction between biomolecules is the key to determining the direction and function of biological processes. Proteins are the building blocks and principal undertakings of life activities, and as such, their complex interaction network is potentially informative for deep learning methods. Therefore, a more reliable approach is to have the neural network learn both gene expression data and protein interaction networks. We propose a new computational approach, termed CRESCENT, which is a protein-protein interaction (PPI) prior knowledge graph-based convolutional neural network (GCN) to improve cancer survival prediction. CRESCENT relies on the gene expression networks rather than gene expression levels to predict patient survival. The performance of CRESCENT is evaluated on a large-scale pan-cancer dataset consisting of 5991 patients from 16 different types of cancers. Extensive benchmarking experiments demonstrate that our proposed method is competitive in terms of the evaluation metric of the time-dependent concordance index( Ctd) when compared with several existing state-of-the-art approaches. Experiments also show that incorporating the network structure between genomic features effectively improves cancer survival prediction.
Assuntos
Neoplasias , Mapas de Interação de Proteínas , Humanos , Mapas de Interação de Proteínas/genética , Algoritmos , Redes Neurais de Computação , Genômica , Neoplasias/genéticaRESUMO
BACKGROUND: Sacral dysmorphism is not uncommon and complicates S1 iliosacral screw placement partially because of the difficulty of determining the starting point accurately on the sacral lateral view. We propose a method of specifying the starting point. METHODS: The starting point for the S1 iliosacral screw into the dysmorphic sacrum was specifically set at a point where the ossification of the S1/S2 intervertebral disc (OSID) intersected the posterior vertebral cortical line (PVCL) on the sacral lateral view, followed by guidewire manipulation and screw placement on the pelvic outlet and inlet views. Computer-simulated virtual surgical procedures based on pelvic computed tomography (CT) data on 95 dysmorphic sacra were performed to determine whether the starting point was below the iliac cortical density (ICD) and in the S1 oblique osseous corridor and to evaluate the accuracy of screw placement (with 1 screw being used, in the left hemipelvis). Surgical procedures on 17 patients were performed to verify the visibility of the OSID and PVCL, to check the location of the starting point relative to the ICD, and to validate the screw placement safety as demonstrated with postoperative CT scans. RESULTS: In the virtual surgical procedures, the starting point was consistently below the ICD and in the oblique osseous corridor in all patients and all screws were Grade 1. In the clinical surgical procedures, the OSID and PVCL were consistently visible and the starting point was always below the ICD in all patients; overall, 21 S1 iliosacral screws were placed in these 17 patients without malpositioning or iatrogenic injury. CONCLUSIONS: On the lateral view of the dysmorphic sacrum, the OSID and PVCL are visible and intersect at a point that is consistently below the ICD and in the oblique osseous corridor, and thus they can be used to identify the starting point. LEVEL OF EVIDENCE: Therapeutic Level III . See Instructions for Authors for a complete description of levels of evidence.