Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 6.981
Filtrar
Mais filtros

Intervalo de ano de publicação
1.
Mol Cell ; 82(2): 285-303, 2022 01 20.
Artigo em Inglês | MEDLINE | ID: mdl-35063097

RESUMO

Combining diverse experimental structural and interactomic methods allows for the construction of comprehensible molecular encyclopedias of biological systems. Typically, this involves merging several independent approaches that provide complementary structural and functional information from multiple perspectives and at different resolution ranges. A particularly potent combination lies in coupling structural information from cryoelectron microscopy or tomography (cryo-EM or cryo-ET) with interactomic and structural information from mass spectrometry (MS)-based structural proteomics. Cryo-EM/ET allows for sub-nanometer visualization of biological specimens in purified and near-native states, while MS provides bioanalytical information for proteins and protein complexes without introducing additional labels. Here we highlight recent achievements in protein structure and interactome determination using cryo-EM/ET that benefit from additional MS analysis. We also give our perspective on how combining cryo-EM/ET and MS will continue bridging gaps between molecular and cellular studies by capturing and describing 3D snapshots of proteomes and interactomes.


Assuntos
Microscopia Crioeletrônica , Tomografia com Microscopia Eletrônica , Espectrometria de Massas , Proteoma , Proteômica , Animais , Humanos , Modelos Moleculares , Mapas de Interação de Proteínas , Transdução de Sinais
2.
Proc Natl Acad Sci U S A ; 121(2): e2314030121, 2024 Jan 09.
Artigo em Inglês | MEDLINE | ID: mdl-38165933

RESUMO

Multiplex, digital nucleic acid detections have important biomedical applications, but the multiplexity of existing methods is predominantly achieved using fluorescent dyes or probes, making the detection complicated and costly. Here, we present the StratoLAMP for label-free, multiplex digital loop-mediated isothermal amplification based on visual stratification of the precipitate byproduct. The StratoLAMP designates two sets of primers with different concentrations to achieve different precipitate yields when amplifying different nucleic acid targets. In the detection, deep learning image analysis is used to stratify the precipitate within each droplet and determine the encapsulated targets for nucleic acid quantification. We investigated the effect of the amplification reagents and process on the precipitate generation and optimized the assay conditions. We then implemented a deep-learning image analysis pipeline for droplet detection, achieving an overall accuracy of 94.3%. In the application, the StratoLAMP successfully achieved the simultaneous quantification of two nucleic acid targets with high accuracy. By eliminating the need for fluorescence, StratoLAMP represents a unique concept toward label-free, multiplex nucleic acid assays and an analytical tool with great cost-effectiveness.


Assuntos
Técnicas de Amplificação de Ácido Nucleico , Ácidos Nucleicos , Técnicas de Amplificação de Ácido Nucleico/métodos , Técnicas de Diagnóstico Molecular/métodos , Primers do DNA , Sensibilidade e Especificidade
3.
Proc Natl Acad Sci U S A ; 121(13): e2321825121, 2024 Mar 26.
Artigo em Inglês | MEDLINE | ID: mdl-38498716

RESUMO

Label-free optical imaging of single-nanometer-scale matter is extremely important for a variety of biomedical, physical, and chemical investigations. One central challenge is that the background intensity is much stronger than the intensity of the scattering light from single nano-objects. Here, we propose an optical module comprising cascaded momentum-space polarization filters that can perform vector field modulation to block most of the background field and result in an almost black background; in contrast, only a small proportion of the scattering field is blocked, leading to obvious imaging contrast enhancement. This module can be installed in various optical microscopies to realize a black-field microscopy. Various single nano-objects with dimensions smaller than 20 nm appear distinctly in the black-field images. The chemical reactions occurring on single nanocrystals with edge lengths of approximately 10 nm are in situ real-time monitored by using the black-field microscopy. This label-free black-field microscopy is highly promising for a wide range of future multidisciplinary science applications.

4.
Proc Natl Acad Sci U S A ; 121(16): e2400203121, 2024 Apr 16.
Artigo em Inglês | MEDLINE | ID: mdl-38598338

RESUMO

Viral outbreaks can cause widespread disruption, creating the need for diagnostic tools that provide high performance and sample versatility at the point of use with moderate complexity. Current gold standards such as PCR and rapid antigen tests fall short in one or more of these aspects. Here, we report a label-free and amplification-free nanopore sensor platform that overcomes these challenges via direct detection and quantification of viral RNA in clinical samples from a variety of biological fluids. The assay uses an optofluidic chip that combines optical waveguides with a fluidic channel and integrates a solid-state nanopore for sensing of individual biomolecules upon translocation through the pore. High specificity and low limit of detection are ensured by capturing RNA targets on microbeads and collecting them by optical trapping at the nanopore location where targets are released and rapidly detected. We use this device for longitudinal studies of the viral load progression for Zika and Severe Acute Respiratory Syndrome Coronavirus-2 (SARS-CoV-2) infections in marmoset and baboon animal models, respectively. The up to million-fold trapping-based target concentration enhancement enables amplification-free RNA quantification across the clinically relevant concentration range down to the assay limit of RT-qPCR as well as cases in which PCR failed. The assay operates across all relevant biofluids, including semen, urine, and whole blood for Zika and nasopharyngeal and throat swab, rectal swab, and bronchoalveolar lavage for SARS-CoV-2. The versatility, performance, simplicity, and potential for full microfluidic integration of the amplification-free nanopore assay points toward a unique approach to molecular diagnostics for nucleic acids, proteins, and other targets.


Assuntos
Nanoporos , Infecção por Zika virus , Zika virus , Animais , RNA Viral/genética , RNA Viral/metabolismo , SARS-CoV-2/genética , SARS-CoV-2/metabolismo , Primatas/genética , Zika virus/genética , Sensibilidade e Especificidade , Técnicas de Amplificação de Ácido Nucleico
5.
Proc Natl Acad Sci U S A ; 121(28): e2315043121, 2024 Jul 09.
Artigo em Inglês | MEDLINE | ID: mdl-38968128

RESUMO

Only 30% of embryos from in vitro fertilized oocytes successfully implant and develop to term, leading to repeated transfer cycles. To reduce time-to-pregnancy and stress for patients, there is a need for a diagnostic tool to better select embryos and oocytes based on their physiology. The current standard employs brightfield imaging, which provides limited physiological information. Here, we introduce METAPHOR: Metabolic Evaluation through Phasor-based Hyperspectral Imaging and Organelle Recognition. This non-invasive, label-free imaging method combines two-photon illumination and AI to deliver the metabolic profile of embryos and oocytes based on intrinsic autofluorescence signals. We used it to classify i) mouse blastocysts cultured under standard conditions or with depletion of selected metabolites (glucose, pyruvate, lactate); and ii) oocytes from young and old mouse females, or in vitro-aged oocytes. The imaging process was safe for blastocysts and oocytes. The METAPHOR classification of control vs. metabolites-depleted embryos reached an area under the ROC curve (AUC) of 93.7%, compared to 51% achieved for human grading using brightfield imaging. The binary classification of young vs. old/in vitro-aged oocytes and their blastulation prediction using METAPHOR reached an AUC of 96.2% and 82.2%, respectively. Finally, organelle recognition and segmentation based on the flavin adenine dinucleotide signal revealed that quantification of mitochondria size and distribution can be used as a biomarker to classify oocytes and embryos. The performance and safety of the method highlight the accuracy of noninvasive metabolic imaging as a complementary approach to evaluate oocytes and embryos based on their physiology.


Assuntos
Blastocisto , Oócitos , Animais , Blastocisto/metabolismo , Camundongos , Oócitos/metabolismo , Feminino , Organelas/metabolismo , Imagem Óptica/métodos
6.
Brief Bioinform ; 25(4)2024 May 23.
Artigo em Inglês | MEDLINE | ID: mdl-38935069

RESUMO

MOTIVATION: In the past decade, single-cell RNA sequencing (scRNA-seq) has emerged as a pivotal method for transcriptomic profiling in biomedical research. Precise cell-type identification is crucial for subsequent analysis of single-cell data. And the integration and refinement of annotated data are essential for building comprehensive databases. However, prevailing annotation techniques often overlook the hierarchical organization of cell types, resulting in inconsistent annotations. Meanwhile, most existing integration approaches fail to integrate datasets with different annotation depths and none of them can enhance the labels of outdated data with lower annotation resolutions using more intricately annotated datasets or novel biological findings. RESULTS: Here, we introduce scPLAN, a hierarchical computational framework designed for scRNA-seq data analysis. scPLAN excels in annotating unlabeled scRNA-seq data using a reference dataset structured along a hierarchical cell-type tree. It identifies potential novel cell types in a systematic, layer-by-layer manner. Additionally, scPLAN effectively integrates annotated scRNA-seq datasets with varying levels of annotation depth, ensuring consistent refinement of cell-type labels across datasets with lower resolutions. Through extensive annotation and novel cell detection experiments, scPLAN has demonstrated its efficacy. Two case studies have been conducted to showcase how scPLAN integrates datasets with diverse cell-type label resolutions and refine their cell-type labels. AVAILABILITY: https://github.com/michaelGuo1204/scPLAN.


Assuntos
Biologia Computacional , Perfilação da Expressão Gênica , Análise de Célula Única , Análise de Célula Única/métodos , Perfilação da Expressão Gênica/métodos , Biologia Computacional/métodos , Humanos , Software , Transcriptoma , Análise de Sequência de RNA/métodos , RNA-Seq/métodos , Anotação de Sequência Molecular/métodos
7.
Brief Bioinform ; 25(3)2024 Mar 27.
Artigo em Inglês | MEDLINE | ID: mdl-38581422

RESUMO

Reliable cell type annotations are crucial for investigating cellular heterogeneity in single-cell omics data. Although various computational approaches have been proposed for single-cell RNA sequencing (scRNA-seq) annotation, high-quality cell labels are still lacking in single-cell sequencing assay for transposase-accessible chromatin (scATAC-seq) data, because of extreme sparsity and inconsistent chromatin accessibility between datasets. Here, we present a novel automated cell annotation method that transfers cell type information from a well-labeled scRNA-seq reference to an unlabeled scATAC-seq target, via a parallel graph neural network, in a semi-supervised manner. Unlike existing methods that utilize only gene expression or gene activity features, HyGAnno leverages genome-wide accessibility peak features to facilitate the training process. In addition, HyGAnno reconstructs a reference-target cell graph to detect cells with low prediction reliability, according to their specific graph connectivity patterns. HyGAnno was assessed across various datasets, showcasing its strengths in precise cell annotation, generating interpretable cell embeddings, robustness to noisy reference data and adaptability to tumor tissues.


Assuntos
Cromatina , Redes Neurais de Computação , Reprodutibilidade dos Testes
8.
Mol Cell Proteomics ; 23(1): 100694, 2024 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-38097181

RESUMO

Multiplex proteomics using isobaric labeling tags has emerged as a powerful tool for the simultaneous relative quantification of peptides and proteins across multiple experimental conditions. However, the quantitative accuracy of the approach is largely compromised by ion interference, a phenomenon that causes fold changes to appear compressed. The degree of compression is generally unknown, and the contributing factors are poorly understood. In this study, we thoroughly characterized ion interference at the MS2 level using a defined two-proteome experimental system with known ground-truth. We discovered remarkably poor agreement between the apparent precursor purity in the isolation window and the actual level of observed reporter ion interference in MS2 scans-a discrepancy that we found resolved by considering cofragmentation of peptide ions hidden within the spectral "noise" of the MS1 isolation window. To address this issue, we developed a regression modeling strategy to accurately predict reporter ion interference in any dataset. Finally, we demonstrate the utility of our procedure for improved fold change estimation and unbiased PTM site-to-protein normalization. All computational tools and code required to apply this method to any MS2 TMT dataset are documented and freely available.


Assuntos
Peptídeos , Proteômica , Proteômica/métodos , Proteoma/metabolismo , Íons
9.
Mol Cell Proteomics ; 23(2): 100716, 2024 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-38219859

RESUMO

Previous work has shown that inhibition of abundant myeloid azurophil granule-associated serine proteases (ELANE [neutrophil elastase], PRTN3 [protease 3], and CTSG [Cathepsin G]) is required to stabilize some proteins in myeloid cells. We therefore hypothesized that effective inhibition of these proteases may be necessary for quantitative proteomic analysis of samples containing myeloid cells. To test this hypothesis, we thawed viably preserved acute myeloid leukemia cells from cryovials in the presence or the absence of diisopropyl fluorophosphate (DFP), a cell-permeable and irreversible serine protease inhibitor. Global proteomic analysis was performed, using label-free and isobaric peptide-labeling quantitation. The presence of DFP resulted in an increase of tryptic peptides (14-57%) and proteins (9-31%). In the absence of DFP, 11 to 31% of peptide intensity came from nontryptic peptides; 52 to 75% had cleavage specificity consistent with activities of ELANE-PRTN3. Treatment with DFP reduced the intensity of nontryptic peptides to 4-8% of the total. ELANE inhibition was 95%, based on diisopropyl phosphate modification of active site serine residue. Overall, the relative abundance of 20% of proteins was significantly altered by DFP treatment. These results suggest that active myeloid serine proteases, released during sample processing, can skew quantitative proteomic measurements. Finally, significant ELANE activity was also detected in Clinical Proteomics Tumor Analysis Consortium datasets of solid tumors (many of which have known myeloid infiltration). In the pancreatic cancer dataset, the median percentage of nontryptic intensity detected across patient samples was 34%, with many patient samples having more than half of their detected peptide intensity from nontryptic cleavage events consistent with ELANE-PRTN3 cleavage specificity. Our study suggests that in vitro cleavage of proteins by myeloid serine proteases may be relevant for proteomic studies of any tumor that contains infiltrating myeloid cells.


Assuntos
Leucemia Mieloide Aguda , Proteoma , Humanos , Proteômica , Endopeptidases/metabolismo , Serina Proteases , Peptídeos/química
10.
RNA ; 29(10): 1575-1590, 2023 10.
Artigo em Inglês | MEDLINE | ID: mdl-37460153

RESUMO

Current methods for detecting unlabeled antisense oligonucleotide (ASO) drugs rely on immunohistochemistry (IHC) and/or conjugated molecules, which lack sufficient sensitivity, specificity, and resolution to fully investigate their biodistribution. Our aim was to demonstrate the qualitative and quantitative distribution of unlabeled bepirovirsen, a clinical stage ASO, in livers and kidneys of dosed mice using novel staining and imaging technologies at subcellular resolution. ASOs were detected in formalin-fixed paraffin-embedded (FFPE) and frozen tissues using an automated chromogenic in situ hybridization (ISH) assay: miRNAscope. This was then combined with immunohistochemical detection of cell lineage markers. ASO distribution in hepatocytes versus nonparenchymal cell lineages was quantified using HALO AI image analysis. To complement this, hyperspectral coherent anti-Stokes Raman scattering (HS-CARS) imaging microscopy was used to specifically detect the unique cellular Raman spectral signatures following ASO treatment. Bepirovirsen was localized primarily in nonparenchymal liver cells and proximal renal tubules. Codetection of ASO with distinct cell lineage markers of liver and kidney populations aided target cell identity facilitating quantification. Positive liver signal was quantified using HALO AI, with 12.9% of the ASO localized to the hepatocytes and 87.1% in nonparenchymal cells. HS-CARS imaging specifically detected ASO fingerprints based on the unique vibrational signatures following unlabeled ASO treatment in a totally nonperturbative manner at subcellular resolution. Together, these novel detection and imaging modalities represent a significant increase in our ability to detect unlabeled ASOs in tissues, demonstrating improved levels of specificity and resolution. These methods help us understand their underlying mechanisms of action and ultimately improve the therapeutic potential of these important drugs for treating globally significant human diseases.


Assuntos
Fígado , Oligonucleotídeos Antissenso , Camundongos , Humanos , Animais , Oligonucleotídeos Antissenso/genética , Oligonucleotídeos Antissenso/metabolismo , Distribuição Tecidual , Fígado/diagnóstico por imagem , Fígado/metabolismo , Hibridização In Situ , Coloração e Rotulagem
11.
Brief Bioinform ; 24(4)2023 07 20.
Artigo em Inglês | MEDLINE | ID: mdl-37317617

RESUMO

Human prescription drug labeling contains a summary of the essential scientific information needed for the safe and effective use of the drug and includes the Prescribing Information, FDA-approved patient labeling (Medication Guides, Patient Package Inserts and/or Instructions for Use), and/or carton and container labeling. Drug labeling contains critical information about drug products, such as pharmacokinetics and adverse events. Automatic information extraction from drug labels may facilitate finding the adverse reaction of the drugs or finding the interaction of one drug with another drug. Natural language processing (NLP) techniques, especially recently developed Bidirectional Encoder Representations from Transformers (BERT), have exhibited exceptional merits in text-based information extraction. A common paradigm in training BERT is to pretrain the model on large unlabeled generic language corpora, so that the model learns the distribution of the words in the language, and then fine-tune on a downstream task. In this paper, first, we show the uniqueness of language used in drug labels, which therefore cannot be optimally handled by other BERT models. Then, we present the developed PharmBERT, which is a BERT model specifically pretrained on the drug labels (publicly available at Hugging Face). We demonstrate that our model outperforms the vanilla BERT, ClinicalBERT and BioBERT in multiple NLP tasks in the drug label domain. Moreover, how the domain-specific pretraining has contributed to the superior performance of PharmBERT is demonstrated by analyzing different layers of PharmBERT, and more insight into how it understands different linguistic aspects of the data is gained.


Assuntos
Rotulagem de Medicamentos , Armazenamento e Recuperação da Informação , Humanos , Aprendizagem , Processamento de Linguagem Natural
12.
Brief Bioinform ; 24(3)2023 05 19.
Artigo em Inglês | MEDLINE | ID: mdl-36964722

RESUMO

Protein function prediction is an essential task in bioinformatics which benefits disease mechanism elucidation and drug target discovery. Due to the explosive growth of proteins in sequence databases and the diversity of their functions, it remains challenging to fast and accurately predict protein functions from sequences alone. Although many methods have integrated protein structures, biological networks or literature information to improve performance, these extra features are often unavailable for most proteins. Here, we propose SPROF-GO, a Sequence-based alignment-free PROtein Function predictor, which leverages a pretrained language model to efficiently extract informative sequence embeddings and employs self-attention pooling to focus on important residues. The prediction is further advanced by exploiting the homology information and accounting for the overlapping communities of proteins with related functions through the label diffusion algorithm. SPROF-GO was shown to surpass state-of-the-art sequence-based and even network-based approaches by more than 14.5, 27.3 and 10.1% in area under the precision-recall curve on the three sub-ontology test sets, respectively. Our method was also demonstrated to generalize well on non-homologous proteins and unseen species. Finally, visualization based on the attention mechanism indicated that SPROF-GO is able to capture sequence domains useful for function prediction. The datasets, source codes and trained models of SPROF-GO are available at https://github.com/biomed-AI/SPROF-GO. The SPROF-GO web server is freely available at http://bio-web1.nscc-gz.cn/app/sprof-go.


Assuntos
Proteínas , Software , Proteínas/metabolismo , Algoritmos , Biologia Computacional/métodos , Ontologia Genética
13.
Brief Bioinform ; 24(1)2023 01 19.
Artigo em Inglês | MEDLINE | ID: mdl-36592062

RESUMO

Recent studies have revealed that long noncoding RNAs (lncRNAs) are closely linked to several human diseases, providing new opportunities for their use in detection and therapy. Many graph propagation and similarity fusion approaches can be used for predicting potential lncRNA-disease associations. However, existing similarity fusion approaches suffer from noise and self-similarity loss in the fusion process. To address these problems, a new prediction approach, termed SSMF-BLNP, based on organically combining selective similarity matrix fusion (SSMF) and bidirectional linear neighborhood label propagation (BLNP), is proposed in this paper to predict lncRNA-disease associations. In SSMF, self-similarity networks of lncRNAs and diseases are obtained by selective preprocessing and nonlinear iterative fusion. The fusion process assigns weights to each initial similarity network and introduces a unit matrix that can reduce noise and compensate for the loss of self-similarity. In BLNP, the initial lncRNA-disease associations are employed in both lncRNA and disease directions as label information for linear neighborhood label propagation. The propagation was then performed on the self-similarity network obtained from SSMF to derive the scoring matrix for predicting the relationships between lncRNAs and diseases. Experimental results showed that SSMF-BLNP performed better than seven other state of-the-art approaches. Furthermore, a case study demonstrated up to 100% and 80% accuracy in 10 lncRNAs associated with hepatocellular carcinoma and 10 lncRNAs associated with renal cell carcinoma, respectively. The source code and datasets used in this paper are available at: https://github.com/RuiBingo/SSMF-BLNP.


Assuntos
RNA Longo não Codificante , Humanos , Algoritmos , Biologia Computacional/métodos , RNA Longo não Codificante/genética , Software , Carcinoma Hepatocelular/genética , Carcinoma de Células Renais/genética , Neoplasias Hepáticas/genética , Neoplasias Renais/genética
14.
Brief Bioinform ; 24(1)2023 01 19.
Artigo em Inglês | MEDLINE | ID: mdl-36403090

RESUMO

The label-free quantification (LFQ) has emerged as an exceptional technique in proteomics owing to its broad proteome coverage, great dynamic ranges and enhanced analytical reproducibility. Due to the extreme difficulty lying in an in-depth quantification, the LFQ chains incorporating a variety of transformation, pretreatment and imputation methods are required and constructed. However, it remains challenging to determine the well-performing chain, owing to its strong dependence on the studied data and the diverse possibility of integrated chains. In this study, an R package EVALFQ was therefore constructed to enable a performance evaluation on >3000 LFQ chains. This package is unique in (a) automatically evaluating the performance using multiple criteria, (b) exploring the quantification accuracy based on spiking proteins and (c) discovering the well-performing chains by comprehensive assessment. All in all, because of its superiority in assessing from multiple perspectives and scanning among over 3000 chains, this package is expected to attract broad interests from the fields of proteomic quantification. The package is available at https://github.com/idrblab/EVALFQ.


Assuntos
Proteoma , Proteômica , Proteoma/metabolismo , Proteômica/métodos , Reprodutibilidade dos Testes
15.
Brief Bioinform ; 24(6)2023 09 22.
Artigo em Inglês | MEDLINE | ID: mdl-37930024

RESUMO

Development of robust and effective strategies for synthesizing new compounds, drug targeting and constructing GEnome-scale Metabolic models (GEMs) requires a deep understanding of the underlying biological processes. A critical step in achieving this goal is accurately identifying the categories of pathways in which a compound participated. However, current machine learning-based methods often overlook the multifaceted nature of compounds, resulting in inaccurate pathway predictions. Therefore, we present a novel framework on Multi-View Multi-Label Learning for Metabolic Pathway Inference, hereby named MVML-MPI. First, MVML-MPI learns the distinct compound representations in parallel with corresponding compound encoders to fully extract features. Subsequently, we propose an attention-based mechanism that offers a fusion module to complement these multi-view representations. As a result, MVML-MPI accurately represents and effectively captures the complex relationship between compounds and metabolic pathways and distinguishes itself from current machine learning-based methods. In experiments conducted on the Kyoto Encyclopedia of Genes and Genomes pathways dataset, MVML-MPI outperformed state-of-the-art methods, demonstrating the superiority of MVML-MPI and its potential to utilize the field of metabolic pathway design, which can aid in optimizing drug-like compounds and facilitating the development of GEMs. The code and data underlying this article are freely available at https://github.com/guofei-tju/MVML-MPI. Contact:  jtang@cse.sc.edu, guofei@csu.edu.com or wuxi_dyj@csj.uestc.edu.cn.


Assuntos
Aprendizado de Máquina , Redes e Vias Metabólicas
16.
Brief Bioinform ; 25(1)2023 11 22.
Artigo em Inglês | MEDLINE | ID: mdl-38066710

RESUMO

Post-translational modification (PTM) occurs after a protein is translated from ribonucleic acid. It is an important living creature life phenomenon because it is implicated in almost all cellular processes. Identification of PTM sites from a given protein sequence is a hot topic in bioinformatics. Lots of computational methods have been proposed, and they provide good performance. However, most previous methods can only tackle one PTM type. Few methods consider multiple PTM types. In this study, a multi-label classification model, named RMTLysPTM, was developed to recognize four types of lysine (K) PTM sites, including acetylation, crotonylation, methylation and succinylation. The surrounding sites of a lysine site were selected to constitute a peptide segment, representing the lysine at the center. Deep analysis was conducted to count the distribution of 2-residues with fixed location across the four types of lysine PTM sites. By aggregating the distribution information of 2-residues in one peptide segment, the peptide segment was encoded by informative features. Furthermore, a prediction engine that can precisely capture the traits of the above representations was designed to recognize the types of lysine PTM sites. The cross-validation results on two datasets (Qiu and CPLM training datasets) suggested that the model had extremely high performance and RMTLysPTM had strong generalization ability by testing it on protein Q16778 and CPLM testing datasets. The model was found to be generally superior to all previous models and those using popular methods and features. A web server was set up for RMTLysPTM, and it can be accessed at http://119.3.127.138/.


Assuntos
Lisina , Proteínas , Lisina/metabolismo , Proteínas/química , Processamento de Proteína Pós-Traducional , Sequência de Aminoácidos , Peptídeos/metabolismo
17.
Brief Bioinform ; 24(3)2023 05 19.
Artigo em Inglês | MEDLINE | ID: mdl-37139561

RESUMO

With the development of chromosome conformation capture technique, the study of spatial conformation of a genome based on Hi-C technique has made a quantum leap. Previous studies reveal that genomes are folded into hierarchy of three-dimensional (3D) structures associated with topologically associating domains (TADs), and detecting TAD boundaries is of great significance in the chromosome-level analysis of 3D genome architecture. In this paper, we propose a novel TAD identification method, LPAD, which first extracts node correlations from global interactions of chromosomes based on the random walk with restart and then builds an undirected graph from Hi-C contact matrix. Next, LPAD designs a label propagation-based approach to discover communities and generates TADs. Experimental results verify the effectiveness and quality of TAD detections compared with existing methods. Furthermore, experimental evaluation of chromatin immunoprecipitation sequencing data shows that LPAD performs high enrichment of histone modifications remarkably nearby the TAD boundaries, and these results demonstrate LPAD's advantages on TAD identification accuracy.


Assuntos
Cromossomos , Genoma , Cromossomos/genética , Código das Histonas , Conformação Molecular
18.
Bioinformatics ; 2024 Aug 12.
Artigo em Inglês | MEDLINE | ID: mdl-39133151

RESUMO

MOTIVATION: The asymmetrical distribution of expressed mRNAs tightly controls the precise synthesis of proteins within human cells. This non-uniform distribution, a cornerstone of developmental biology, plays a pivotal role in numerous cellular processes. To advance our comprehension of gene regulatory networks, it is essential to develop computational tools for accurately identifying the subcellular localizations of mRNAs. However, considering multi-localization phenomena remains limited in existing approaches, with none considering the influence of RNA's secondary structure. RESULTS: In this study, we propose Allocator, a multi-view parallel deep learning framework that seamlessly integrates the RNA sequence-level and structure-level information, enhancing the prediction of mRNA multi-localization. The Allocator models equip four efficient feature extractors, each designed to handle different inputs. Two are tailored for sequence-based inputs, incorporating multilayer perceptron and multi-head self-attention mechanisms. The other two are specialized in processing structure-based inputs, employing graph neural networks. Benchmarking results underscore Allocator's superiority over state-of-the-art methods, showcasing its strength in revealing intricate localization associations. AVAILABILITY AND IMPLEMENTATION: The webserver of Allocator is available at http://Allocator.unimelb-biotools.cloud.edu.au; the source code and datasets are available on GitHub (https://github.com/lifuyi774/Allocator) and Zenodo (https://doi.org/10.5281/zenodo.13235798). SUPPLEMENTARY INFORMATION: Available at Bioinformatics online.

19.
FASEB J ; 38(5): e23536, 2024 Mar 15.
Artigo em Inglês | MEDLINE | ID: mdl-38470360

RESUMO

Rituximab, the first monoclonal antibody approved for the treatment of lymphoma, eventually became one of the most popular and versatile drugs ever in terms of clinical application and revenue. Since its patent expiration, and consequently, the loss of exclusivity of the original biologic, its repurposing as an off-label drug has increased dramatically, propelled by the development and commercialization of its many biosimilars. Currently, rituximab is prescribed worldwide to treat a vast range of autoimmune diseases mediated by B cells. Here, we present a comprehensive overview of rituximab repurposing in 115 autoimmune diseases across 17 medical specialties, sourced from over 1530 publications. Our work highlights the extent of its off-label use and clinical benefits, underlining the success of rituximab repurposing for both common and orphan immune-related diseases. We discuss the scientific mechanism associated with its clinical efficacy and provide additional indications for which rituximab could be investigated. Our study presents rituximab as a flagship example of drug repurposing owing to its central role in targeting cluster of differentiate 20 positive (CD20) B cells in 115 autoimmune diseases.


Assuntos
Doenças Autoimunes , Medicamentos Biossimilares , Humanos , Rituximab/uso terapêutico , Medicamentos Biossimilares/uso terapêutico , Reposicionamento de Medicamentos , Uso Off-Label , Doenças Autoimunes/tratamento farmacológico , Doenças Raras
20.
Mol Cell Proteomics ; 22(7): 100581, 2023 07.
Artigo em Inglês | MEDLINE | ID: mdl-37225017

RESUMO

Recent advances in mass spectrometry-based proteomics enable the acquisition of increasingly large datasets within relatively short times, which exposes bottlenecks in the bioinformatics pipeline. Although peptide identification is already scalable, most label-free quantification (LFQ) algorithms scale quadratic or cubic with the sample numbers, which may even preclude the analysis of large-scale data. Here we introduce directLFQ, a ratio-based approach for sample normalization and the calculation of protein intensities. It estimates quantities via aligning samples and ion traces by shifting them on top of each other in logarithmic space. Importantly, directLFQ scales linearly with the number of samples, allowing analyses of large studies to finish in minutes instead of days or months. We quantify 10,000 proteomes in 10 min and 100,000 proteomes in less than 2 h, a 1000-fold faster than some implementations of the popular LFQ algorithm MaxLFQ. In-depth characterization of directLFQ reveals excellent normalization properties and benchmark results, comparing favorably to MaxLFQ for both data-dependent acquisition and data-independent acquisition. In addition, directLFQ provides normalized peptide intensity estimates for peptide-level comparisons. It is an important part of an overall quantitative proteomic pipeline that also needs to include high sensitive statistical analysis leading to proteoform resolution. Available as an open-source Python package and a graphical user interface with a one-click installer, it can be used in the AlphaPept ecosystem as well as downstream of most common computational proteomics pipelines.


Assuntos
Proteoma , Proteômica , Proteoma/análise , Proteômica/métodos , Ecossistema , Peptídeos/análise , Espectrometria de Massas/métodos , Software
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA