Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 44
Filtrar
Mais filtros








Base de dados
Intervalo de ano de publicação
1.
Cancer Res ; 2024 Mar 27.
Artigo em Inglês | MEDLINE | ID: mdl-38536129

RESUMO

T cells recognize tumor antigens and initiate an anti-cancer immune response in the very early stages of tumor development, and the antigen specificity of T cells is determined by the T cell receptor (TCR). Therefore, monitoring changes in the TCR repertoire in peripheral blood may offer a strategy to detect various cancers at a relatively early stages. Here, we developed the deep learning framework iCanTCR to identify cancer patients based on the TCR repertoire. The iCanTCR framework uses TCRß sequences from an individual as an input and outputs the predicted cancer probability. The model was trained on over 2000 publicly available TCR repertoires from eleven types of cancer and healthy controls. Analysis of several additional publicly available datasets validated the ability of iCanTCR to distinguish cancer patients from non-cancer individuals and demonstrated the capability of iCanTCR for the accurate classification of multiple cancers. Importantly, iCanTCR precisely identified individuals with early-stage cancer with an area under the curve (AUC) of 86%. Altogether, this work provides a liquid biopsy approach to capture immune signals from peripheral blood for non-invasive cancer diagnosis.

2.
Mol Ther Nucleic Acids ; 35(1): 102129, 2024 Mar 12.
Artigo em Inglês | MEDLINE | ID: mdl-38370981

RESUMO

Circulating tumor cells (CTCs) that undergo epithelial-to-mesenchymal transition (EMT) can provide valuable information regarding metastasis and potential therapies. However, current studies on the EMT overlook alternative splicing. Here, we used single-cell full-length transcriptome data and mRNA sequencing of CTCs to identify stage-specific alternative splicing of partial EMT and mesenchymal states during pancreatic cancer metastasis. We classified definitive tumor and normal epithelial cells via genetic aberrations and demonstrated dynamic changes in the epithelial-mesenchymal continuum in both epithelial cancer cells and CTCs. We provide the landscape of alternative splicing in CTCs at different stages of EMT, uncovering cell-type-specific splicing patterns and splicing events in cell surface proteins suitable for therapies. We show that MBNL1 governs cell fate through alternative splicing independently of changes in gene expression and affects the splicing pattern during EMT. We found a high frequency of events that contained multiple premature termination codons and were enriched with C and G nucleotides in close proximity, which influence the likelihood of stop codon readthrough and expand the range of potential therapeutic targets. Our study provides insights into the EMT transcriptome's dynamic changes and identifies potential diagnostic and therapeutic targets in pancreatic cancer.

3.
Comput Biol Med ; 169: 107812, 2024 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-38091725

RESUMO

Unexpected side effects may accompany the research stage and post-marketing of drugs. These accidents lead to drug development failure and even endanger patients' health. Thus, it is essential to recognize the unknown drug-side effects. Most existing methods in silico find the answer from the association network or similarity network of drugs while ignoring the drug-intrinsic attributes. The limitation is that they can only handle drugs in the maturation stage. To be suitable for early drug-side effect screening, we conceive a multi-structural deep learning framework, MSDSE, which synthetically considers the multi-scale features derived from the drug. MSDSE can jointly learn SMILES sequence-based word embedding, substructure-based molecular fingerprint, and chemical structure-based graph embedding. In the preprocessing stage of MSDSE, we project all features to the abstract space with the same dimension. MSDSE builds a bi-level channel strategy, including a convolutional neural network module with an Inception structure and a multi-head Self-Attention module, to learn and integrate multi-modal features from local to global perspectives. Finally, MSDSE regards the prediction of drug-side effects as pair-wise learning and outputs the pair-wise probability of drug-side effects through the inner product operation. MSDSE is evaluated and analyzed on benchmark datasets and performs optimally compared to other baseline models. We also set up the ablation study to explain the rationality of the feature approach and model structure. Moreover, we select model partial prediction results for the case study to reveal actual capability. The original data are available at http://github.com/yuliyi/MSDSE.


Assuntos
Benchmarking , Desenvolvimento de Medicamentos , Humanos , Redes Neurais de Computação , Probabilidade
4.
Front Immunol ; 14: 1267755, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-38094296

RESUMO

N4-acetylcytidine (ac4C) is a modification of cytidine at the nitrogen-4 position, playing a significant role in the translation process of mRNA. However, the precise mechanism and details of how ac4C modifies translated mRNA remain unclear. Since identifying ac4C sites using conventional experimental methods is both labor-intensive and time-consuming, there is an urgent need for a method that can promptly recognize ac4C sites. In this paper, we propose a comprehensive ensemble learning model, the Stacking-based heterogeneous integrated ac4C model, engineered explicitly to identify ac4C sites. This innovative model integrates three distinct feature extraction methodologies: Kmer, electron-ion interaction pseudo-potential values (PseEIIP), and pseudo-K-tuple nucleotide composition (PseKNC). The model also incorporates the robust Cluster Centroids algorithm to enhance its performance in dealing with imbalanced data and alleviate underfitting issues. Our independent testing experiments indicate that our proposed model improves the Mcc by 15.61% and the ROC by 5.97% compared to existing models. To test our model's adaptability, we also utilized a balanced dataset assembled by the authors of iRNA-ac4C. Our model showed an increase in Sn of 4.1%, an increase in Acc of nearly 1%, and ROC improvement of 0.35% on this balanced dataset. The code for our model is freely accessible at https://github.com/louliliang/ST-ac4C.git, allowing users to quickly build their model without dealing with complicated mathematical equations.


Assuntos
Citidina , Nucleotídeos , RNA Mensageiro/genética , Citidina/genética , Algoritmos
5.
Math Biosci Eng ; 20(11): 19133-19151, 2023 Oct 13.
Artigo em Inglês | MEDLINE | ID: mdl-38052593

RESUMO

Malignancies such as bladder urothelial carcinoma, colon adenocarcinoma, liver hepatocellular carcinoma, lung adenocarcinoma and prostate adenocarcinoma significantly impact men's well-being. Accurate cancer classification is vital in determining treatment strategies and improving patient prognosis. This study introduced an innovative method that utilizes gene selection from high-dimensional datasets to enhance the performance of the male tumor classification algorithm. The method assesses the reliability of DNA methylation data to distinguish the five most prevalent types of male cancers from normal tissues by employing DNA methylation 450K data obtained from The Cancer Genome Atlas (TCGA) database. First, the chi-square test is used for dimensionality reduction and second, L1 penalized logistic regression is used for feature selection. Furthermore, the stacking ensemble learning technique was employed to integrate seven common multiclassification models. Experimental results demonstrated that the ensemble learning model utilizing multiple classification models outperformed any base classification model. The proposed ensemble model achieved an astonishing overall accuracy (ACC) of 99.2% in independent testing data. Moreover, it may present novel ideas and pathways for the early detection and treatment of future diseases.


Assuntos
Adenocarcinoma , Carcinoma Hepatocelular , Carcinoma de Células de Transição , Neoplasias do Colo , Neoplasias Hepáticas , Neoplasias Pulmonares , Neoplasias da Bexiga Urinária , Humanos , Masculino , Metilação de DNA , Adenocarcinoma/genética , Carcinoma de Células de Transição/genética , Reprodutibilidade dos Testes , Neoplasias da Bexiga Urinária/genética , Neoplasias do Colo/genética , Carcinoma Hepatocelular/diagnóstico , Carcinoma Hepatocelular/genética , Neoplasias Pulmonares/genética , Neoplasias Hepáticas/diagnóstico , Neoplasias Hepáticas/genética
6.
Bioinformatics ; 39(10)2023 10 03.
Artigo em Inglês | MEDLINE | ID: mdl-37740953

RESUMO

MOTIVATION: Cell-cell interactions (CCIs) play critical roles in many biological processes such as cellular differentiation, tissue homeostasis, and immune response. With the rapid development of high throughput single-cell RNA sequencing (scRNA-seq) technologies, it is of high importance to identify CCIs from the ever-increasing scRNA-seq data. However, limited by the algorithmic constraints, current computational methods based on statistical strategies ignore some key latent information contained in scRNA-seq data with high sparsity and heterogeneity. RESULTS: Here, we developed a deep learning framework named DeepCCI to identify meaningful CCIs from scRNA-seq data. Applications of DeepCCI to a wide range of publicly available datasets from diverse technologies and platforms demonstrate its ability to predict significant CCIs accurately and effectively. Powered by the flexible and easy-to-use software, DeepCCI can provide the one-stop solution to discover meaningful intercellular interactions and build CCI networks from scRNA-seq data. AVAILABILITY AND IMPLEMENTATION: The source code of DeepCCI is available online at https://github.com/JiangBioLab/DeepCCI.


Assuntos
Aprendizado Profundo , Perfilação da Expressão Gênica , Análise de Sequência de RNA , Análise de Célula Única , Software , Análise por Conglomerados
7.
Brief Bioinform ; 24(4)2023 07 20.
Artigo em Inglês | MEDLINE | ID: mdl-37225419

RESUMO

Single-cell RNA sequencing (scRNA-seq) detects whole transcriptome signals for large amounts of individual cells and is powerful for determining cell-to-cell differences and investigating the functional characteristics of various cell types. scRNA-seq datasets are usually sparse and highly noisy. Many steps in the scRNA-seq analysis workflow, including reasonable gene selection, cell clustering and annotation, as well as discovering the underlying biological mechanisms from such datasets, are difficult. In this study, we proposed an scRNA-seq analysis method based on the latent Dirichlet allocation (LDA) model. The LDA model estimates a series of latent variables, i.e. putative functions (PFs), from the input raw cell-gene data. Thus, we incorporated the 'cell-function-gene' three-layer framework into scRNA-seq analysis, as this framework is capable of discovering latent and complex gene expression patterns via a built-in model approach and obtaining biologically meaningful results through a data-driven functional interpretation process. We compared our method with four classic methods on seven benchmark scRNA-seq datasets. The LDA-based method performed best in the cell clustering test in terms of both accuracy and purity. By analysing three complex public datasets, we demonstrated that our method could distinguish cell types with multiple levels of functional specialization, and precisely reconstruct cell development trajectories. Moreover, the LDA-based method accurately identified the representative PFs and the representative genes for the cell types/cell stages, enabling data-driven cell cluster annotation and functional interpretation. According to the literature, most of the previously reported marker/functionally relevant genes were recognized.


Assuntos
Perfilação da Expressão Gênica , Análise de Célula Única , Perfilação da Expressão Gênica/métodos , Análise de Sequência de RNA/métodos , Análise de Célula Única/métodos , Transcriptoma , Análise por Conglomerados , Algoritmos
8.
Heliyon ; 9(4): e15096, 2023 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-37095983

RESUMO

The mortality rate from cervical cancer (CESC), a malignant tumor that affects women, has increased significantly globally in recent years. The discovery of biomarkers points to a direction for the diagnosis of cervical cancer with the advancement of bioinformatics technology. The goal of this study was to look for potential biomarkers for the diagnosis and prognosis of CESC using the GEO and TCGA databases. Because of the high dimension and small sample size of the omic data, or the use of biomarkers generated from a single omic data, the diagnosis of cervical cancer may be inaccurate and unreliable. The purpose of this study was to search the GEO and TCGA databases for potential biomarkers for the diagnosis and prognosis of CESC. We begin by downloading CESC (GSE30760) DNA methylation data from GEO, then perform differential analysis on the downloaded methylation data and screen out the differential genes. Then, using estimation algorithms, we score immune cells and stromal cells in the tumor microenvironment and perform survival analysis on the gene expression profile data and the most recent clinical data of CESC from TCGA. Then, using the 'limma' package and Venn plot in R language to perform differential analysis of genes and screen out overlapping genes, these overlapping genes were then subjected to GO and KEGG functional enrichment analysis. The differential genes screened by the GEO methylation data and the differential genes screened by the TCGA gene expression data were intersected to screen out the common differential genes. A protein-protein interaction (PPI) network of gene expression data was then created in order to discover important genes. The PPI network's key genes were crossed with previously identified common differential genes to further validate them. The Kaplan-Meier curve was then used to determine the prognostic importance of the key genes. Survival analysis has shown that CD3E and CD80 are important for the identification of cervical cancer and can be considered as potential biomarkers for cervical cancer.

9.
Mol Ther Nucleic Acids ; 32: 189-202, 2023 Jun 13.
Artigo em Inglês | MEDLINE | ID: mdl-37096165

RESUMO

Tumor-infiltrating T cells are essential players in tumor immunotherapy. Great progress has been achieved in the investigation of T cell heterogeneity. However, little is well known about the shared characteristics of tumor-infiltrating T cells across cancers. In this study, we conduct a pan-cancer analysis of 349,799 T cells across 15 cancers. The results show that the same T cell types had similar expression patterns regulated by specific transcription factor (TF) regulons across cancers. Multiple T cell type transition paths were consistent in cancers. We found that TF regulons associated with CD8+ T cells transitioned to terminally differentiated effector memory (Temra) or exhausted (Tex) states were associated with patient clinical classification. We also observed universal activated cell-cell interaction pathways of tumor-infiltrating T cells in all cancers, some of which specifically mediated crosstalk in certain cell types. Moreover, consistent characteristics of TCRs in the aspect of variable and joining region genes were found across cancers. Overall, our study reveals common features of tumor-infiltrating T cells in different cancers and suggests future avenues for rational, targeted immunotherapies.

10.
Int J Mol Sci ; 24(5)2023 Feb 24.
Artigo em Inglês | MEDLINE | ID: mdl-36901929

RESUMO

A norm in modern medicine is to prescribe polypharmacy to treat disease. The core concern with the co-administration of drugs is that it may produce adverse drug-drug interaction (DDI), which can cause unexpected bodily injury. Therefore, it is essential to identify potential DDI. Most existing methods in silico only judge whether two drugs interact, ignoring the importance of interaction events to study the mechanism implied in combination drugs. In this work, we propose a deep learning framework named MSEDDI that comprehensively considers multi-scale embedding representations of the drug for predicting drug-drug interaction events. In MSEDDI, we design three-channel networks to process biomedical network-based knowledge graph embedding, SMILES sequence-based notation embedding, and molecular graph-based chemical structure embedding, respectively. Finally, we fuse three heterogeneous features from channel outputs through a self-attention mechanism and feed them to the linear layer predictor. In the experimental section, we evaluate the performance of all methods on two different prediction tasks on two datasets. The results show that MSEDDI outperforms other state-of-the-art baselines. Moreover, we also reveal the stable performance of our model in a broader sample set via case studies.


Assuntos
Bases de Conhecimento , Polimedicação , Humanos , Interações Medicamentosas
11.
Int J Mol Sci ; 23(24)2022 Dec 07.
Artigo em Inglês | MEDLINE | ID: mdl-36555143

RESUMO

N6-methyladenosine (m6A) is the most abundant within eukaryotic messenger RNA modification, which plays an essential regulatory role in the control of cellular functions and gene expression. However, it remains an outstanding challenge to detect mRNA m6A transcriptome-wide at base resolution via experimental approaches, which are generally time-consuming and expensive. Developing computational methods is a good strategy for accurate in silico detection of m6A modification sites from the large amount of RNA sequence data. Unfortunately, the existing computational models are usually only for m6A site prediction in a single species, without considering the tissue level of species, while most of them are constructed based on low-confidence level data generated by an m6A antibody immunoprecipitation (IP)-based sequencing method, thereby restricting reliability and generalizability of proposed models. Here, we review recent advances in computational prediction of m6A sites and construct a new computational approach named im6APred using ensemble deep learning to accurately identify m6A sites based on high-confidence level data in multiple tissues of mammals. Our model im6APred builds upon a comprehensive evaluation of multiple classification methods, including four traditional classification algorithms and three deep learning methods and their ensembles. The optimal base-classifier combinations are then chosen by five-fold cross-validation test to achieve an effective stacked model. Our model im6APred can produce the area under the receiver operating characteristic curve (AUROC) in the range of 0.82-0.91 on independent tests, indicating that our model has the ability to learn general methylation rules on RNA bases and generalize to m6A transcriptome-wide identification. Moreover, AUROCs in the range of 0.77-0.96 were achieved using cross-species/tissues validation on the benchmark dataset, demonstrating differences in predictive performance at the tissue level and the need for constructing tissue-specific models for m6A site prediction.


Assuntos
Aprendizado Profundo , Animais , Reprodutibilidade dos Testes , RNA/metabolismo , Adenosina/genética , Adenosina/metabolismo , Mamíferos/metabolismo , Biologia Computacional/métodos
12.
Int J Mol Sci ; 23(19)2022 Sep 20.
Artigo em Inglês | MEDLINE | ID: mdl-36232325

RESUMO

N6,2'-O-dimethyladenosine (m6Am) is a post-transcriptional modification that may be associated with regulatory roles in the control of cellular functions. Therefore, it is crucial to accurately identify transcriptome-wide m6Am sites to understand underlying m6Am-dependent mRNA regulation mechanisms and biological functions. Here, we used three sequence-based feature-encoding schemes, including one-hot, nucleotide chemical property (NCP), and nucleotide density (ND), to represent RNA sequence samples. Additionally, we proposed an ensemble deep learning framework, named DLm6Am, to identify m6Am sites. DLm6Am consists of three similar base classifiers, each of which contains a multi-head attention module, an embedding module with two parallel deep learning sub-modules, a convolutional neural network (CNN) and a Bi-directional long short-term memory (BiLSTM), and a prediction module. To demonstrate the superior performance of our model's architecture, we compared multiple model frameworks with our method by analyzing the training data and independent testing data. Additionally, we compared our model with the existing state-of-the-art computational methods, m6AmPred and MultiRM. The accuracy (ACC) for the DLm6Am model was improved by 6.45% and 8.42% compared to that of m6AmPred and MultiRM on independent testing data, respectively, while the area under receiver operating characteristic curve (AUROC) for the DLm6Am model was increased by 4.28% and 5.75%, respectively. All the results indicate that DLm6Am achieved the best prediction performance in terms of ACC, Matthews correlation coefficient (MCC), AUROC, and the area under precision and recall curves (AUPR). To further assess the generalization performance of our proposed model, we implemented chromosome-level leave-out cross-validation, and found that the obtained AUROC values were greater than 0.83, indicating that our proposed method is robust and can accurately predict m6Am sites.


Assuntos
Algoritmos , Aprendizado Profundo , Sequência de Bases , Nucleotídeos , RNA Mensageiro/genética
13.
Nucleic Acids Res ; 50(22): e131, 2022 12 09.
Artigo em Inglês | MEDLINE | ID: mdl-36250636

RESUMO

Recent advances in spatial transcriptomics (ST) have brought unprecedented opportunities to understand tissue organization and function in spatial context. However, it is still challenging to precisely dissect spatial domains with similar gene expression and histology in situ. Here, we present DeepST, an accurate and universal deep learning framework to identify spatial domains, which performs better than the existing state-of-the-art methods on benchmarking datasets of the human dorsolateral prefrontal cortex. Further testing on a breast cancer ST dataset, we showed that DeepST can dissect spatial domains in cancer tissue at a finer scale. Moreover, DeepST can achieve not only effective batch integration of ST data generated from multiple batches or different technologies, but also expandable capabilities for processing other spatial omics data. Together, our results demonstrate that DeepST has the exceptional capacity for identifying spatial domains, making it a desirable tool to gain novel insights from ST studies.


Assuntos
Aprendizado Profundo , Perfilação da Expressão Gênica , Humanos , Benchmarking , Perfilação da Expressão Gênica/métodos , Transcriptoma
14.
Genomics ; 114(6): 110486, 2022 11.
Artigo em Inglês | MEDLINE | ID: mdl-36126833

RESUMO

DNA methylation is an important epigenetics, which occurs in the early stages of tumor formation. And it also is of great significance to find the relationship between DNA methylation and cancer. This paper proposes a novel model, iCancer-Pred, to identify cancer and classify its types further. The datasets of DNA methylation information of 7 cancer types have been collected from The Cancer Genome Atlas (TCGA). The coefficient of variation firstly is used to reduce the number of features, and then the elastic network is applied to select important features. Finally, a fully connected neural network is constructed with these selected features. In predicting seven types of cancers, iCancer-Pred has achieved an overall accuracy of over 97% accuracy with 5-fold cross-validation. For the convenience of the application, a user-friendly web server: http://bioinfo.jcu.edu.cn/cancer or http://121.36.221.79/cancer/ is available. And the source codes are freely available for download at https://github.com/Huerhu/iCancer-Pred.


Assuntos
Metilação de DNA , Neoplasias , Humanos , Epigenômica , Neoplasias/genética
15.
Brief Bioinform ; 23(5)2022 09 20.
Artigo em Inglês | MEDLINE | ID: mdl-35870203

RESUMO

The rapid development of single-cel+l RNA sequencing (scRNA-seq) technology provides unprecedented opportunities for exploring biological phenomena at the single-cell level. The discovery of cell types is one of the major applications for researchers to explore the heterogeneity of cells. Some computational methods have been proposed to solve the problem of scRNA-seq data clustering. However, the unavoidable technical noise and notorious dropouts also reduce the accuracy of clustering methods. Here, we propose the cauchy-based bounded constraint low-rank representation (CBLRR), which is a low-rank representation-based method by introducing cauchy loss function (CLF) and bounded nuclear norm regulation, aiming to alleviate the above issue. Specifically, as an effective loss function, the CLF is proven to enhance the robustness of the identification of cell types. Then, we adopt the bounded constraint to ensure the entry values of single-cell data within the restricted interval. Finally, the performance of CBLRR is evaluated on 15 scRNA-seq datasets, and compared with other state-of-the-art methods. The experimental results demonstrate that CBLRR performs accurately and robustly on clustering scRNA-seq data. Furthermore, CBLRR is an effective tool to cluster cells, and provides great potential for downstream analysis of single-cell data. The source code of CBLRR is available online at https://github.com/Ginnay/CBLRR.


Assuntos
Análise de Célula Única , Software , Algoritmos , Análise por Conglomerados , Perfilação da Expressão Gênica/métodos , RNA-Seq , Análise de Sequência de RNA/métodos , Análise de Célula Única/métodos
16.
Comput Biol Med ; 145: 105509, 2022 06.
Artigo em Inglês | MEDLINE | ID: mdl-35421792

RESUMO

Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), causing an outbreak of coronavirus disease 2019 (COVID-19), is a major threat to public health worldwide. Previous studies have shown that the spike protein of SARS-CoV-2 determines viral infectivity and major antigenicity. However, the spike protein has been undergoing various mutations, which bring a great challenge to the prevention and treatment of COVID-19. Here we present the MutCov, a pipeline for evaluating the effect of mutations in spike protein on infectivity and antigenicity of SARS-CoV-2 by calculating the binding free energy between spike protein and angiotensin-converting enzyme 2 (ACE2) or neutralizing monoclonal antibody (mAb). The predicted infectivity and antigenicity were highly consistent with biologically experimental results, and demonstrated that the MutCov achieved good prediction performance. In conclusion, the MutCov is of high importance for systematically evaluating the effect of novel mutations and improving the prevention and treatment of COVID-19. The source code and installation instruction of MutCov are freely available at http://jianglab.org.cn/MutCov.


Assuntos
COVID-19 , SARS-CoV-2 , Anticorpos Neutralizantes , COVID-19/genética , Humanos , Mutação , Ligação Proteica , SARS-CoV-2/genética , Glicoproteína da Espícula de Coronavírus/genética
17.
Front Immunol ; 13: 814239, 2022.
Artigo em Inglês | MEDLINE | ID: mdl-35250991

RESUMO

Immune system plays important roles in the pathogenesis of Parkinson's disease (PD). However, the role of B cells in this complex disease are still not fully understood. B cells produce antibodies but can also regulate immune responses. In order to decode the relative contribution of peripheral B cell subtypes to the etiology of PD, we performed single cell RNA and BCR sequencing for 10,466 B cells from 8 PD patients and 6 age-matched healthy controls. We observed significant increased memory B cells and significant decreased naïve B cells in PD patients compared to healthy controls. Notably, we also discovered increased IgG and IgA isotypes and more frequent class switch recombination events in PD patients. Moreover, we identified preferential V and J gene segments of B cell receptors in PD patients as the evidence of convergent selection in PD. Finally, we found a marked clonal expanded memory B cell population in PD patients, up-regulating both MHC II genes (HLA-DRB5, HLA-DQA2 and HLA-DPB1) and transcription factor activator protein 1 (AP-1), suggesting that the antigen presentation capacity of B cells was enhanced and B cells were activated in PD patients. Overall, this study conducted a comprehensive analysis of peripheral B cell characteristics of PD patients, which provided novel insights into the humoral immune response in the pathogenesis of PD.


Assuntos
Doença de Parkinson , Apresentação de Antígeno , Linfócitos B , Humanos , RNA , Receptores de Antígenos de Linfócitos B/genética
18.
Brief Bioinform ; 23(2)2022 03 10.
Artigo em Inglês | MEDLINE | ID: mdl-35279714

RESUMO

Messenger RNA (mRNA) vaccines have shown great potential for anti-tumor therapy due to the advantages in safety, efficacy and industrial production. However, it remains a challenge to identify suitable cancer neoantigens that can be targeted for mRNA vaccines. Abnormal alternative splicing occurs in a variety of tumors, which may result in the translation of abnormal transcripts into tumor-specific proteins. High-throughput technologies make it possible for systematic characterization of alternative splicing as a source of suitable target neoantigens for mRNA vaccine development. Here, we summarized difficulties and challenges for identifying alternative splicing-derived cancer neoantigens from RNA-seq data and proposed a conceptual framework for designing personalized mRNA vaccines based on alternative splicing-derived cancer neoantigens. In addition, several points were presented to spark further discussion toward improving the identification of alternative splicing-derived cancer neoantigens.


Assuntos
Processamento Alternativo , Neoplasias , Antígenos de Neoplasias/genética , Humanos , Imunoterapia , Neoplasias/genética , Neoplasias/terapia , RNA Mensageiro/genética , Vacinas Sintéticas , Vacinas de mRNA
19.
Front Cell Dev Biol ; 9: 697035, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-34414185

RESUMO

Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), causing an outbreak of coronavirus disease 2019 (COVID-19), has been undergoing various mutations. The analysis of the structural and energetic effects of mutations on protein-protein interactions between the receptor binding domain (RBD) of SARS-CoV-2 and angiotensin converting enzyme 2 (ACE2) or neutralizing monoclonal antibodies will be beneficial for epidemic surveillance, diagnosis, and optimization of neutralizing agents. According to the molecular dynamics simulation, a key mutation N439K in the SARS-CoV-2 RBD region created a new salt bridge with Glu329 of hACE2, which resulted in greater electrostatic complementarity, and created a weak salt bridge with Asp442 of RBD. Furthermore, the N439K-mutated RBD bound hACE2 with a higher affinity than wild-type, which may lead to more infectious. In addition, the N439K-mutated RBD was markedly resistant to the SARS-CoV-2 neutralizing antibody REGN10987, which may lead to the failure of neutralization. The results show consistent with the previous experimental conclusion and clarify the structural mechanism under affinity changes. Our methods will offer guidance on the assessment of the infection efficiency and antigenicity effect of continuing mutations in SARS-CoV-2.

20.
Brief Bioinform ; 22(6)2021 11 05.
Artigo em Inglês | MEDLINE | ID: mdl-34415016

RESUMO

Accurate prediction of immunogenic peptide recognized by T cell receptor (TCR) can greatly benefit vaccine development and cancer immunotherapy. However, identifying immunogenic peptides accurately is still a huge challenge. Most of the antigen peptides predicted in silico fail to elicit immune responses in vivo without considering TCR as a key factor. This inevitably causes costly and time-consuming experimental validation test for predicted antigens. Therefore, it is necessary to develop novel computational methods for precisely and effectively predicting immunogenic peptide recognized by TCR. Here, we described DLpTCR, a multimodal ensemble deep learning framework for predicting the likelihood of interaction between single/paired chain(s) of TCR and peptide presented by major histocompatibility complex molecules. To investigate the generality and robustness of the proposed model, COVID-19 data and IEDB data were constructed for independent evaluation. The DLpTCR model exhibited high predictive power with area under the curve up to 0.91 on COVID-19 data while predicting the interaction between peptide and single TCR chain. Additionally, the DLpTCR model achieved the overall accuracy of 81.03% on IEDB data while predicting the interaction between peptide and paired TCR chains. The results demonstrate that DLpTCR has the ability to learn general interaction rules and generalize to antigen peptide recognition by TCR. A user-friendly webserver is available at http://jianglab.org.cn/DLpTCR/. Additionally, a stand-alone software package that can be downloaded from https://github.com/jiangBiolab/DLpTCR.


Assuntos
Tratamento Farmacológico da COVID-19 , Epitopos/imunologia , Peptídeos/imunologia , Receptores de Antígenos de Linfócitos T/imunologia , SARS-CoV-2/imunologia , Sequência de Aminoácidos/genética , COVID-19/genética , COVID-19/imunologia , COVID-19/virologia , Simulação por Computador , Aprendizado Profundo , Epitopos/genética , Humanos , Peptídeos/genética , Peptídeos/uso terapêutico , Ligação Proteica/genética , Receptores de Antígenos de Linfócitos T/genética , SARS-CoV-2/genética , SARS-CoV-2/patogenicidade , Software
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA