Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 137
Filtrar
Más filtros

Bases de datos
País/Región como asunto
Tipo del documento
Intervalo de año de publicación
1.
Brief Bioinform ; 25(2)2024 Jan 22.
Artículo en Inglés | MEDLINE | ID: mdl-38305456

RESUMEN

Protein structure prediction is a longstanding issue crucial for identifying new drug targets and providing a mechanistic understanding of protein functions. To enhance the progress in this field, a spectrum of computational methodologies has been cultivated. AlphaFold2 has exhibited exceptional precision in predicting wild-type protein structures, with performance exceeding that of other methods. However, predicting the structures of missense mutant proteins using AlphaFold2 remains challenging due to the intricate and substantial structural alterations caused by minor sequence variations in the mutant proteins. Molecular dynamics (MD) has been validated for precisely capturing changes in amino acid interactions attributed to protein mutations. Therefore, for the first time, a strategy entitled 'MoDAFold' was proposed to improve the accuracy and reliability of missense mutant protein structure prediction by combining AlphaFold2 with MD. Multiple case studies have confirmed the superior performance of MoDAFold compared to other methods, particularly AlphaFold2.


Asunto(s)
Aminoácidos , Simulación de Dinámica Molecular , Proteínas Mutantes , Reproducibilidad de los Resultados , Mutación , Conformación Proteica
2.
Nucleic Acids Res ; 52(W1): W272-W279, 2024 Jul 05.
Artículo en Inglés | MEDLINE | ID: mdl-38738624

RESUMEN

Protein scaffolds with small size, high stability and low immunogenicity show important applications in the field of protein engineering and design. However, no relevant computational platform has been reported yet to mining such scaffolds with the desired properties from massive protein structures in human body. Here, we developed PROSCA, a structure-based online platform dedicated to explore the space of the entire human proteome, and to discovery new privileged protein scaffolds with potential engineering value that have never been noticed. PROSCA accepts structure of protein as an input, which can be subsequently aligned with a certain class of protein structures (e.g. the human proteome either from experientially resolved or AlphaFold2 predicted structures, and the human proteins belonging to specific families or domains), and outputs humanized protein scaffolds which are structurally similar with the input protein as well as other related important information such as families, sequences, structures and expression level in human tissues. Through PROSCA, the user can also get excellent experience in visualizations of protein structures and expression overviews, and download the figures and tables of results which can be customized according to the user's needs. Along with the advanced protein engineering and selection technologies, PROSCA will facilitate the rational design of new functional proteins with privileged scaffolds. PROSCA is freely available at https://idrblab.org/prosca/.


Asunto(s)
Ingeniería de Proteínas , Programas Informáticos , Humanos , Ingeniería de Proteínas/métodos , Proteínas/química , Proteínas/genética , Proteoma , Modelos Moleculares , Internet , Conformación Proteica
3.
PLoS Pathog ; 19(1): e1011128, 2023 01.
Artículo en Inglés | MEDLINE | ID: mdl-36689483

RESUMEN

Coronavirus disease 2019 is a respiratory infectious disease caused by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). Evidence on the pathogenesis of SARS-CoV-2 is accumulating rapidly. In addition to structural proteins such as Spike and Envelope, the functional roles of non-structural and accessory proteins in regulating viral life cycle and host immune responses remain to be understood. Here, we show that open reading frame 8 (ORF8) acts as messenger for inter-cellular communication between alveolar epithelial cells and macrophages during SARS-CoV-2 infection. Mechanistically, ORF8 is a secretory protein that can be secreted by infected epithelial cells via both conventional and unconventional secretory pathways. Conventionally secreted ORF8 is glycosylated and loses the ability to recognize interleukin 17 receptor A of macrophages, possibly due to the steric hindrance imposed by N-glycosylation at Asn78. However, unconventionally secreted ORF8 does not undergo glycosylation without experiencing the ER-Golgi trafficking, thereby activating the downstream NF-κB signaling pathway and facilitating a burst of cytokine release. Furthermore, we show that ORF8 deletion in SARS-CoV-2 attenuates inflammation and yields less lung lesions in hamsters. Our data collectively highlights a role of ORF8 protein in the development of cytokine storms during SARS-CoV-2 infection.


Asunto(s)
COVID-19 , Síndrome de Liberación de Citoquinas , SARS-CoV-2 , Proteínas Virales , Humanos , COVID-19/patología , Síndrome de Liberación de Citoquinas/patología , Inflamación , Sistemas de Lectura Abierta , SARS-CoV-2/fisiología , Proteínas Virales/metabolismo
4.
Mol Ther ; 2024 Jun 15.
Artículo en Inglés | MEDLINE | ID: mdl-38879754

RESUMEN

Despite the remarkable success of chimeric antigen receptor (CAR) T therapy in hematological malignancies, its efficacy in solid tumors remains limited. Cytokine-engineered CAR T cells offer a promising avenue, yet their clinical translation is hindered by the risks associated with constitutive cytokine expression. In this proof-of-concept study, we leverage the endogenous interferon (IFN)-γ promoter for transgenic interleukin (IL)-15 expression. We demonstrate that IFN-γ expression is tightly regulated by T cell receptor signaling. By introducing an internal ribosome entry site IL15 into the 3' UTR of the IFN-γ gene via homology directed repair-mediated knock-in, we confirm that IL-15 expression can co-express with IFN-γ in an antigen stimulation-dependent manner. Importantly, the insertion of transgenes does not compromise endogenous IFN-γ expression. In vitro and in vivo data demonstrate that IL-15 driven by the IFN-γ promoter dramatically improves CAR T cells' antitumor activity, suggesting the effectiveness of IL-15 expression. Last, as a part of our efforts toward clinical translation, we have developed an innovative two-gene knock-in approach. This approach enables the simultaneous integration of CAR and IL-15 genes into TRAC and IFN-γ gene loci using a single AAV vector. CAR T cells engineered to express IL-15 using this approach demonstrate enhanced antitumor efficacy. Overall, our study underscores the feasibility of utilizing endogenous promoters for transgenic cytokines expression in CAR T cells.

5.
J Chem Inf Model ; 64(10): 4059-4070, 2024 May 27.
Artículo en Inglés | MEDLINE | ID: mdl-38739718

RESUMEN

Central nervous system (CNS) drugs have had a significant impact on treating a wide range of neurodegenerative and psychiatric disorders. In recent years, deep learning-based generative models have shown great potential for accelerating drug discovery and improving efficacy. However, specific applications of these techniques in CNS drug discovery have not been widely reported. In this study, we developed the CNSMolGen model, which uses a framework of bidirectional recurrent neural networks (Bi-RNNs) for de novo molecular design of CNS drugs. Results showed that the pretrained model was able to generate more than 90% of completely new molecular structures, which possessed the properties of CNS drug molecules and were synthesizable. In addition, transfer learning was performed on small data sets with specific biological activities to evaluate the potential application of the model for CNS drug optimization. Here, we used drugs against the classical CNS disease target serotonin transporter (SERT) as a fine-tuned data set and generated a focused database against the target protein. The potential biological activities of the generated molecules were verified by using the physics-based induced-fit docking study. The success of this model demonstrates its potential in CNS drug design and optimization, which provides a new impetus for future CNS drug development.


Asunto(s)
Fármacos del Sistema Nervioso Central , Diseño de Fármacos , Redes Neurales de la Computación , Fármacos del Sistema Nervioso Central/farmacología , Fármacos del Sistema Nervioso Central/química , Simulación del Acoplamiento Molecular , Humanos , Proteínas de Transporte de Serotonina en la Membrana Plasmática/metabolismo , Proteínas de Transporte de Serotonina en la Membrana Plasmática/química
6.
J Chem Inf Model ; 64(5): 1433-1455, 2024 Mar 11.
Artículo en Inglés | MEDLINE | ID: mdl-38294194

RESUMEN

Solute carrier transporters (SLCs) are a class of important transmembrane proteins that are involved in the transportation of diverse solute ions and small molecules into cells. There are approximately 450 SLCs within the human body, and more than a quarter of them are emerging as attractive therapeutic targets for multiple complex diseases, e.g., depression, cancer, and diabetes. However, only 44 unique transporters (∼9.8% of the SLC superfamily) with 3D structures and specific binding sites have been reported. To design innovative and effective drugs targeting diverse SLCs, there are a number of obstacles that need to be overcome. However, computational chemistry, including physics-based molecular modeling and machine learning- and deep learning-based artificial intelligence (AI), provides an alternative and complementary way to the classical drug discovery approach. Here, we present a comprehensive overview on recent advances and existing challenges of the computational techniques in structure-based drug design of SLCs from three main aspects: (i) characterizing multiple conformations of the proteins during the functional process of transportation, (ii) identifying druggability sites especially the cryptic allosteric ones on the transporters for substrates and drugs binding, and (iii) discovering diverse small molecules or synthetic protein binders targeting the binding sites. This work is expected to provide guidelines for a deep understanding of the structure and function of the SLC superfamily to facilitate rational design of novel modulators of the transporters with the aid of state-of-the-art computational chemistry technologies including artificial intelligence.


Asunto(s)
Inteligencia Artificial , Química Computacional , Humanos , Proteínas de Transporte de Membrana/química , Diseño de Fármacos , Descubrimiento de Drogas/métodos
7.
J Chem Inf Model ; 64(4): 1319-1330, 2024 Feb 26.
Artículo en Inglés | MEDLINE | ID: mdl-38346323

RESUMEN

Traditional Chinese medicine (TCM) has been extensively employed for the treatment of coronavirus disease 2019 (COVID-19) caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). However, there is demand for discovering more SARS-CoV-2 Mpro inhibitors with diverse scaffolds to optimize anti-SARS-CoV-2 lead compounds. In this study, comprehensive in silico and in vitro assays were utilized to determine the potential inhibitors from TCM compounds against SARS-CoV-2 Mpro, which is an important therapeutic target for SARS-CoV-2. The ensemble docking analysis of 18263 TCM compounds against 15 SARS-CoV-2 Mpro conformations identified 19 TCM compounds as promising candidates. Further in vitro testing validated three compounds as inhibitors of SARS-CoV-2 Mpro and showed IC50 values of 4.64 ± 0.11, 7.56 ± 0.78, and 11.16 ± 0.26 µM, with EC50 values of 12.25 ± 1.68, 15.58 ± 0.77, and 29.32 ± 1.25 µM, respectively. Molecular dynamics (MD) simulations indicated that the three complexes remained stable over the last 100 ns of production run. An analysis of the binding mode revealed that the active compounds occupy different subsites (S1, S2, S3, and S4) of the active site of SARS-CoV-2 Mpro via specific poses through noncovalent interactions with key amino acids (e.g., HIS 41, ASN 142, GLY 143, MET 165, GLU 166, or GLN 189). Overall, this study provides evidence indicating that the three natural products obtained from TCM could be further used for anti-COVID-19 research, justifying the investigation of Chinese herbal medicinal ingredients as bioactive constituents for therapeutic targets.


Asunto(s)
COVID-19 , Proteasas 3C de Coronavirus , Humanos , SARS-CoV-2/metabolismo , Medicina Tradicional China , Simulación del Acoplamiento Molecular , Simulación de Dinámica Molecular , Inhibidores de Proteasas/química
8.
Pathol Int ; 74(4): 210-221, 2024 Apr.
Artículo en Inglés | MEDLINE | ID: mdl-38411359

RESUMEN

The importance of mitochondrial dysfunction and oxidative stress has been indicated in the progression of heart failure (HF). The molecular mechanisms, however, remain to be fully elucidated. This study aimed to explore the role and underlying mechanism of secreted frizzled-related protein 4 (SFRP4) in these two events in HF. Mice with HF were developed using transverse aortic constriction, and hematoxylin-eosin staining, MASSON staining, and Terminal deoxynucleotidyl transferase (TdT)-mediated 2'-Deoxyuridine 5'- Triphosphate nick end labeling (TUNEL assays) were conducted to detect morphological damage in the myocardial tissues of mice. HL-1 mouse cardiomyocytes were induced with isoproterenol (ISO), and cell viability and apoptosis were examined using cell counting kit-8 and TUNEL assays. SFRP4 and Jumonji domain-containing protein 2A (JMJD2A) were highly expressed in myocardial tissues. Suppression of SFRP4 alleviated apoptosis and fibrosis in myocardial tissues of mice. In addition, the extent of mitochondrial dysfunction and oxidative stress in damaged myocardial tissues and HL-1 cells was mitigated by SFRP4 inhibition as well. JMJD2A catalyzed demethylation modification of the SFRP4 promoter, thus promoting SFRP4 transcription in the development of HF. JMJD2A is responsible for SFRP4 transcription activation in the failing hearts of mice. Blockade of JMJD2A or SFRP4 might be a novel therapy effective in mitigating HF progression.


Asunto(s)
Insuficiencia Cardíaca , Enfermedades Mitocondriales , Animales , Ratones , Apoptosis/fisiología , Insuficiencia Cardíaca/genética , Estrés Oxidativo , Regiones Promotoras Genéticas , Activación Transcripcional
9.
Nucleic Acids Res ; 50(D1): D560-D570, 2022 01 07.
Artículo en Inglés | MEDLINE | ID: mdl-34664670

RESUMEN

The success of protein engineering and design has extensively expanded the protein space, which presents a promising strategy for creating next-generation proteins of diverse functions. Among these proteins, the synthetic binding proteins (SBPs) are smaller, more stable, less immunogenic, and better of tissue penetration than others, which make the SBP-related data attracting extensive interest from worldwide scientists. However, no database has been developed to systematically provide the valuable information of SBPs yet. In this study, a database named 'Synthetic Binding Proteins for Research, Diagnosis, and Therapy (SYNBIP)' was thus introduced. This database is unique in (a) comprehensively describing thousands of SBPs from the perspectives of scaffolds, biophysical & functional properties, etc.; (b) panoramically illustrating the binding targets & the broad application of each SBP and (c) enabling a similarity search against the sequences of all SBPs and their binding targets. Since SBP is a human-made protein that has not been found in nature, the discovery of novel SBPs relied heavily on experimental protein engineering and could be greatly facilitated by in-silico studies (such as AI and computational modeling). Thus, the data provided in SYNBIP could lay a solid foundation for the future development of novel SBPs. The SYNBIP is accessible without login requirement at both official (https://idrblab.org/synbip/) and mirror (http://synbip.idrblab.net/) sites.


Asunto(s)
Proteínas Bacterianas/clasificación , Proteínas Portadoras/genética , Bases de Datos de Proteínas , Proteínas/clasificación , Proteínas Bacterianas/química , Proteínas Portadoras/clasificación , Simulación por Computador , Humanos , Conformación Proteica , Ingeniería de Proteínas/tendencias , Proteínas/química
10.
EMBO Rep ; 22(9): e52252, 2021 09 06.
Artículo en Inglés | MEDLINE | ID: mdl-34288348

RESUMEN

Tuberculosis (TB) is an infectious disease caused by Mycobacterium tuberculosis (Mtb) that places a heavy strain on public health. Host susceptibility to Mtb is modulated by macrophages, which regulate the balance between cell apoptosis and necrosis. However, the role of molecular switches that modulate apoptosis and necrosis during Mtb infection remains unclear. Here, we show that Mtb-susceptible mice and TB patients have relatively low miR-342-3p expression, while mice with miR-342-3p overexpression are more resistant to Mtb. We demonstrate that the miR-342-3p/SOCS6 axis regulates anti-Mtb immunity by increasing the production of inflammatory cytokines and chemokines. Most importantly, the miR-342-3p/SOCS6 axis participates in the switching between Mtb-induced apoptosis and necrosis through A20-mediated K48-linked ubiquitination and RIPK3 degradation. Our findings reveal several strategies by which the host innate immune system controls intracellular Mtb growth via the miRNA-mRNA network and pave the way for host-directed therapies targeting these pathways.


Asunto(s)
MicroARNs , Mycobacterium tuberculosis , Tuberculosis , Animales , Muerte Celular , Humanos , Inflamación/genética , Ratones , MicroARNs/genética , Mycobacterium tuberculosis/genética , Proteínas Supresoras de la Señalización de Citocinas , Tuberculosis/genética
11.
J Chem Inf Model ; 63(14): 4458-4467, 2023 07 24.
Artículo en Inglés | MEDLINE | ID: mdl-37410882

RESUMEN

Human dopamine transporter (hDAT) regulates the reuptake of extracellular dopamine (DA) and is an essential therapeutic target for central nervous system (CNS) diseases. The allosteric modulation of hDAT has been identified for decades. However, the molecular mechanism underlying the transportation is still elusive, which hinders the rational design of allosteric modulators against hDAT. Here, a systematic structure-based method was performed to explore allosteric sites on hDAT in inward-open (IO) conformation and to screen compounds with allosteric affinity. First, the model of the hDAT structure was constructed based on the recently reported Cryo-EM structure of the human serotonin transporter (hSERT) and Gaussian-accelerated molecular dynamics (GaMD) simulation was further utilized for the identification of intermediate energetic stable states of the transporter. Then, with the potential druggable allosteric site on hDAT in IO conformation, virtual screening of seven enamine chemical libraries (∼440,000 compounds) was processed, resulting in 10 compounds being purchased for in vitro assay and with Z1078601926 discovered to allosterically inhibit hDAT (IC50 = 0.527 [0.284; 0.988] µM) when nomifensine was introduced as an orthosteric ligand. Finally, the synergistic effect underlying the allosteric inhibition of hDAT by Z1078601926 and nomifensine was explored using additional GaMD simulation and postbinding free energy analysis. The hit compound discovered in this work not only provides a good starting point for lead optimization but also demonstrates the usability of the method for the structure-based discovery of novel allosteric modulators of other therapeutic targets.


Asunto(s)
Proteínas de Transporte de Dopamina a través de la Membrana Plasmática , Nomifensina , Humanos , Proteínas de Transporte de Dopamina a través de la Membrana Plasmática/química , Simulación de Dinámica Molecular , Sitio Alostérico , Ligandos
12.
Nucleic Acids Res ; 49(D1): D715-D722, 2021 01 08.
Artículo en Inglés | MEDLINE | ID: mdl-33045729

RESUMEN

Besides the environmental factors having tremendous impacts on the composition of microbial community, the host factors have recently gained extensive attentions on their roles in shaping human microbiota. There are two major types of host factors: host genetic factors (HGFs) and host immune factors (HIFs). These factors of each type are essential for defining the chemical and physical landscapes inhabited by microbiota, and the collective consideration of both types have great implication to serve comprehensive health management. However, no database was available to provide the comprehensive factors of both types. Herein, a database entitled 'Host Genetic and Immune Factors Shaping Human Microbiota (GIMICA)' was constructed. Based on the 4257 microbes confirmed to inhabit nine sites of human body, 2851 HGFs (1368 single nucleotide polymorphisms (SNPs), 186 copy number variations (CNVs), and 1297 non-coding ribonucleic acids (RNAs)) modulating the expression of 370 microbes were collected, and 549 HIFs (126 lymphocytes and phagocytes, 387 immune proteins, and 36 immune pathways) regulating the abundance of 455 microbes were also provided. All in all, GIMICA enabled the collective consideration not only between different types of host factor but also between the host and environmental ones, which is freely accessible without login requirement at: https://idrblab.org/gimica/.


Asunto(s)
Factores Inmunológicos/genética , Microbiota/genética , Programas Informáticos , Humanos , Almacenamiento y Recuperación de la Información , Estándares de Referencia
13.
Brief Bioinform ; 21(3): 1023-1037, 2020 05 21.
Artículo en Inglés | MEDLINE | ID: mdl-31323688

RESUMEN

The pathogenesis of multiple sclerosis (MS) is significantly regulated by long noncoding RNAs (lncRNAs), the expression of which is substantially influenced by a number of MS-associated risk single nucleotide polymorphisms (SNPs). It is thus hypothesized that the dysregulation of lncRNA induced by genomic variants may be one of the key molecular mechanisms for the pathology of MS. However, due to the lack of sufficient data on lncRNA expression and SNP genotypes of the same MS patients, such molecular mechanisms underlying the pathology of MS remain elusive. In this study, a bioinformatics strategy was applied to obtain lncRNA expression and SNP genotype data simultaneously from 142 samples (51 MS patients and 91 controls) based on RNA-seq data, and an expression quantitative trait loci (eQTL) analysis was conducted. In total, 2383 differentially expressed lncRNAs were identified as specifically expressing in brain-related tissues, and 517 of them were affected by SNPs. Then, the functional characterization, secondary structure changes and tissue and disease specificity of the cis-eQTL SNPs and lncRNA were assessed. The cis-eQTL SNPs were substantially and specifically enriched in neurological disease and intergenic region, and the secondary structure was altered in 17.6% of all lncRNAs in MS. Finally, the weighted gene coexpression network and gene set enrichment analyses were used to investigate how the influence of SNPs on lncRNAs contributed to the pathogenesis of MS. As a result, the regulation of lncRNAs by SNPs was found to mainly influence the antigen processing/presentation and mitogen-activated protein kinases (MAPK) signaling pathway in MS. These results revealed the effectiveness of the strategy proposed in this study and give insight into the mechanism (SNP-mediated modulation of lncRNAs) underlying the pathology of MS.


Asunto(s)
Esclerosis Múltiple/genética , Sitios de Carácter Cuantitativo , ARN Largo no Codificante/genética , Análisis de Secuencia de ARN/métodos , Perfilación de la Expresión Génica , Genotipo , Humanos , Sistema de Señalización de MAP Quinasas , Conformación de Ácido Nucleico , Polimorfismo de Nucleótido Simple , ARN Largo no Codificante/química , Transducción de Señal
14.
Brief Bioinform ; 21(6): 2142-2152, 2020 12 01.
Artículo en Inglés | MEDLINE | ID: mdl-31776543

RESUMEN

Unwanted experimental/biological variation and technical error are frequently encountered in current metabolomics, which requires the employment of normalization methods for removing undesired data fluctuations. To ensure the 'thorough' removal of unwanted variations, the collective consideration of multiple criteria ('intragroup variation', 'marker stability' and 'classification capability') was essential. However, due to the limited number of available normalization methods, it is extremely challenging to discover the appropriate one that can meet all these criteria. Herein, a novel approach was proposed to discover the normalization strategies that are consistently well performing (CWP) under all criteria. Based on various benchmarks, all normalization methods popular in current metabolomics were 'first' discovered to be non-CWP. 'Then', 21 new strategies that combined the 'sample'-based method with the 'metabolite'-based one were found to be CWP. 'Finally', a variety of currently available methods (such as cubic splines, range scaling, level scaling, EigenMS, cyclic loess and mean) were identified to be CWP when combining with other normalization. In conclusion, this study not only discovered several strategies that performed consistently well under all criteria, but also proposed a novel approach that could ensure the identification of CWP strategies for future biological problems.


Asunto(s)
Biología Computacional , Metabolómica , Proyectos de Investigación
15.
Brief Bioinform ; 21(4): 1437-1447, 2020 07 15.
Artículo en Inglés | MEDLINE | ID: mdl-31504150

RESUMEN

Functional annotation of protein sequence with high accuracy has become one of the most important issues in modern biomedical studies, and computational approaches of significantly accelerated analysis process and enhanced accuracy are greatly desired. Although a variety of methods have been developed to elevate protein annotation accuracy, their ability in controlling false annotation rates remains either limited or not systematically evaluated. In this study, a protein encoding strategy, together with a deep learning algorithm, was proposed to control the false discovery rate in protein function annotation, and its performances were systematically compared with that of the traditional similarity-based and de novo approaches. Based on a comprehensive assessment from multiple perspectives, the proposed strategy and algorithm were found to perform better in both prediction stability and annotation accuracy compared with other de novo methods. Moreover, an in-depth assessment revealed that it possessed an improved capacity of controlling the false discovery rate compared with traditional methods. All in all, this study not only provided a comprehensive analysis on the performances of the newly proposed strategy but also provided a tool for the researcher in the fields of protein function annotation.


Asunto(s)
Aprendizaje Profundo , Proteínas/química , Algoritmos , Redes Neurales de la Computación
16.
Brief Bioinform ; 21(5): 1825-1836, 2020 09 25.
Artículo en Inglés | MEDLINE | ID: mdl-31860715

RESUMEN

The type IV bacterial secretion system (SS) is reported to be one of the most ubiquitous SSs in nature and can induce serious conditions by secreting type IV SS effectors (T4SEs) into the host cells. Recent studies mainly focus on annotating new T4SE from the huge amount of sequencing data, and various computational tools are therefore developed to accelerate T4SE annotation. However, these tools are reported as heavily dependent on the selected methods and their annotation performance need to be further enhanced. Herein, a convolution neural network (CNN) technique was used to annotate T4SEs by integrating multiple protein encoding strategies. First, the annotation accuracies of nine encoding strategies integrated with CNN were assessed and compared with that of the popular T4SE annotation tools based on independent benchmark. Second, false discovery rates of various models were systematically evaluated by (1) scanning the genome of Legionella pneumophila subsp. ATCC 33152 and (2) predicting the real-world non-T4SEs validated using published experiments. Based on the above analyses, the encoding strategies, (a) position-specific scoring matrix (PSSM), (b) protein secondary structure & solvent accessibility (PSSSA) and (c) one-hot encoding scheme (Onehot), were identified as well-performing when integrated with CNN. Finally, a novel strategy that collectively considers the three well-performing models (CNN-PSSM, CNN-PSSSA and CNN-Onehot) was proposed, and a new tool (CNN-T4SE, https://idrblab.org/cnnt4se/) was constructed to facilitate T4SE annotation. All in all, this study conducted a comprehensive analysis on the performance of a collection of encoding strategies when integrated with CNN, which could facilitate the suppression of T4SS in infection and limit the spread of antimicrobial resistance.


Asunto(s)
Redes Neurales de la Computación , Sistemas de Secreción Tipo IV , Algoritmos , Posición Específica de Matrices de Puntuación
17.
Brief Bioinform ; 21(2): 621-636, 2020 03 23.
Artículo en Inglés | MEDLINE | ID: mdl-30649171

RESUMEN

Label-free quantification (LFQ) with a specific and sequentially integrated workflow of acquisition technique, quantification tool and processing method has emerged as the popular technique employed in metaproteomic research to provide a comprehensive landscape of the adaptive response of microbes to external stimuli and their interactions with other organisms or host cells. The performance of a specific LFQ workflow is highly dependent on the studied data. Hence, it is essential to discover the most appropriate one for a specific data set. However, it is challenging to perform such discovery due to the large number of possible workflows and the multifaceted nature of the evaluation criteria. Herein, a web server ANPELA (https://idrblab.org/anpela/) was developed and validated as the first tool enabling performance assessment of whole LFQ workflow (collective assessment by five well-established criteria with distinct underlying theories), and it enabled the identification of the optimal LFQ workflow(s) by a comprehensive performance ranking. ANPELA not only automatically detects the diverse formats of data generated by all quantification tools but also provides the most complete set of processing methods among the available web servers and stand-alone tools. Systematic validation using metaproteomic benchmarks revealed ANPELA's capabilities in 1 discovering well-performing workflow(s), (2) enabling assessment from multiple perspectives and (3) validating LFQ accuracy using spiked proteins. ANPELA has a unique ability to evaluate the performance of whole LFQ workflow and enables the discovery of the optimal LFQs by the comprehensive performance ranking of all 560 workflows. Therefore, it has great potential for applications in metaproteomic and other studies requiring LFQ techniques, as many features are shared among proteomic studies.


Asunto(s)
Proteínas/química , Proteómica/métodos , Flujo de Trabajo , Internet , Reproducibilidad de los Resultados
18.
Brief Bioinform ; 21(3): 1058-1068, 2020 05 21.
Artículo en Inglés | MEDLINE | ID: mdl-31157371

RESUMEN

The etiology of schizophrenia (SCZ) is regarded as one of the most fundamental puzzles in current medical research, and its diagnosis is limited by the lack of objective molecular criteria. Although plenty of studies were conducted, SCZ gene signatures identified by these independent studies are found highly inconsistent. As one of the most important factors contributing to this inconsistency, the feature selection methods used currently do not fully consider the reproducibility among the signatures discovered from different datasets. Therefore, it is crucial to develop new bioinformatics tools of novel strategy for ensuring a stable discovery of gene signature for SCZ. In this study, a novel feature selection strategy (1) integrating repeated random sampling with consensus scoring and (2) evaluating the consistency of gene rank among different datasets was constructed. By systematically assessing the identified SCZ signature comprising 135 differentially expressed genes, this newly constructed strategy demonstrated significantly enhanced stability and better differentiating ability compared with the feature selection methods popular in current SCZ research. Based on a first-ever assessment on methods' reproducibility cross-validated by independent datasets from three representative studies, the new strategy stood out among the popular methods by showing superior stability and differentiating ability. Finally, 2 novel and 17 previously reported transcription factors were identified and showed great potential in revealing the etiology of SCZ. In sum, the SCZ signature identified in this study would provide valuable clues for discovering diagnostic molecules and potential targets for SCZ.


Asunto(s)
Esquizofrenia/genética , Transcriptoma , Biología Computacional/métodos , Conjuntos de Datos como Asunto , Regulación de la Expresión Génica , Humanos , Reproducibilidad de los Resultados
19.
Brief Bioinform ; 21(4): 1378-1390, 2020 07 15.
Artículo en Inglés | MEDLINE | ID: mdl-31197323

RESUMEN

Microbial community (MC) has great impact on mediating complex disease indications, biogeochemical cycling and agricultural productivities, which makes metaproteomics powerful technique for quantifying diverse and dynamic composition of proteins or peptides. The key role of biostatistical strategies in MC study is reported to be underestimated, especially the appropriate application of feature selection method (FSM) is largely ignored. Although extensive efforts have been devoted to assessing the performance of FSMs, previous studies focused only on their classification accuracy without considering their ability to correctly and comprehensively identify the spiked proteins. In this study, the performances of 14 FSMs were comprehensively assessed based on two key criteria (both sample classification and spiked protein discovery) using a variety of metaproteomics benchmarks. First, the classification accuracies of those 14 FSMs were evaluated. Then, their abilities in identifying the proteins of different spiked concentrations were assessed. Finally, seven FSMs (FC, LMEB, OPLS-DA, PLS-DA, SAM, SVM-RFE and T-Test) were identified as performing consistently superior or good under both criteria with the PLS-DA performing consistently superior. In summary, this study served as comprehensive analysis on the performances of current FSMs and could provide a valuable guideline for researchers in metaproteomics.


Asunto(s)
Proteómica/métodos , Biomarcadores/metabolismo , Análisis por Conglomerados , Proteínas/metabolismo
20.
Stem Cells ; 39(8): 1025-1032, 2021 08.
Artículo en Inglés | MEDLINE | ID: mdl-33657255

RESUMEN

Spinal cord injury (SCI) typically results in long-lasting functional deficits, largely due to primary and secondary white matter damage at the site of injury. The transplantation of neural stem cells (NSCs) has shown promise for re-establishing communications between separated regions of the spinal cord through the insertion of new neurons between the injured axons and target neurons. However, the inhibitory microenvironment that develops after SCI often causes endogenous and transplanted NSCs to differentiate into glial cells rather than neurons. Functional biomaterials have been shown to mitigate the effects of the adverse SCI microenvironment and promote the neuronal differentiation of NSCs. A clear understanding of the mechanisms of neuronal differentiation within the injury-induced microenvironment would likely allow for the development of treatment strategies designed to promote the innate ability of NSCs to differentiate into neurons. The increased differentiation of neurons may contribute to relay formation, facilitating functional recovery after SCI. In this review, we summarize current strategies used to enhance the neuronal differentiation of NSCs through the reconstruction of the SCI microenvironment and to improve the intrinsic neuronal differentiation abilities of NSCs, which is significant for SCI repair.


Asunto(s)
Células-Madre Neurales , Traumatismos de la Médula Espinal , Trasplante de Células Madre , Diferenciación Celular , Humanos , Células-Madre Neurales/trasplante , Neuroglía/patología , Neuronas/patología , Médula Espinal , Traumatismos de la Médula Espinal/patología , Traumatismos de la Médula Espinal/terapia
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA