Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 299
Filtrar
1.
RNA ; 30(3): 189-199, 2024 Feb 16.
Artigo em Inglês | MEDLINE | ID: mdl-38164624

RESUMO

Aptamers have emerged as research hotspots of the next generation due to excellent performance benefits and application potentials in pharmacology, medicine, and analytical chemistry. Despite the numerous aptamer investigations, the lack of comprehensive data integration has hindered the development of computational methods for aptamers and the reuse of aptamers. A public access database named AptaDB, derived from experimentally validated data manually collected from the literature, was hence developed, integrating comprehensive aptamer-related data, which include six key components: (i) experimentally validated aptamer-target interaction information, (ii) aptamer property information, (iii) structure information of aptamer, (iv) target information, (v) experimental activity information, and (vi) algorithmically calculated similar aptamers. AptaDB currently contains 1350 experimentally validated aptamer-target interactions, 1230 binding affinity constants, 1293 aptamer sequences, and more. Compared to other aptamer databases, it contains twice the number of entries found in available databases. The collection and integration of the above information categories is unique among available aptamer databases and provides a user-friendly interface. AptaDB will also be continuously updated as aptamer research evolves. We expect that AptaDB will become a powerful source for aptamer rational design and a valuable tool for aptamer screening in the future. For access to AptaDB, please visit http://lmmd.ecust.edu.cn/aptadb/.


Assuntos
Aptâmeros de Nucleotídeos , Oligonucleotídeos , Bases de Dados Factuais , Aptâmeros de Nucleotídeos/química , Técnica de Seleção de Aptâmeros
2.
Nucleic Acids Res ; 2024 Apr 22.
Artigo em Inglês | MEDLINE | ID: mdl-38647076

RESUMO

Absorption, distribution, metabolism, excretion and toxicity (ADMET) properties play a crucial role in drug discovery and chemical safety assessment. Built on the achievements of admetSAR and its successor, admetSAR2.0, this paper introduced the new version of the series, admetSAR3.0, as a comprehensive platform for chemical ADMET assessment, including search, prediction and optimization modules. In the search module, admetSAR3.0 hosted over 370 000 high-quality experimental ADMET data for 104 652 unique compounds, and supplemented chemical structure similarity search function to facilitate read-across. In the prediction module, we introduced comprehensive ADMET endpoints and two new sections for environmental and cosmetic risk assessments, empowering admetSAR3.0 to provide prediction for 119 endpoints, more than double numbers compared to the previous version. Furthermore, the advanced multi-task graph neural network framework offered robust and reliable support for ADMET prediction. In particular, a module named ADMETopt was added to automatically optimize the ADMET properties of query molecules through transformation rules or scaffold hopping. Finally, admetSAR3.0 provides user-friendly interfaces for multiple types of input data, such as SMILES string, chemical structure and batch molecule file, and supports various output types, including digital, chart displays and file downloads. In summary, admetSAR3.0 is anticipated to be a valuable and powerful tool in drug discovery and chemical safety assessment at http://lmmd.ecust.edu.cn/admetsar3/.

3.
Brief Bioinform ; 24(1)2023 01 19.
Artigo em Inglês | MEDLINE | ID: mdl-36537081

RESUMO

Qualitative or quantitative prediction models of structure-activity relationships based on graph neural networks (GNNs) are prevalent in drug discovery applications and commonly have excellently predictive power. However, the network information flows of GNNs are highly complex and accompanied by poor interpretability. Unfortunately, there are relatively less studies on GNN attributions, and their developments in drug research are still at the early stages. In this work, we adopted several advanced attribution techniques for different GNN frameworks and applied them to explain multiple drug molecule property prediction tasks, enabling the identification and visualization of vital chemical information in the networks. Additionally, we evaluated them quantitatively with attribution metrics such as accuracy, sparsity, fidelity and infidelity, stability and sensitivity; discussed their applicability and limitations; and provided an open-source benchmark platform for researchers. The results showed that all attribution techniques were effective, while those directly related to the predicted labels, such as integrated gradient, preferred to have better attribution performance. These attribution techniques we have implemented could be directly used for the vast majority of chemical GNN interpretation tasks.


Assuntos
Benchmarking , Descoberta de Drogas , Humanos , Redes Neurais de Computação , Pesquisadores , Relação Estrutura-Atividade
4.
J Am Chem Soc ; 146(5): 3427-3437, 2024 Feb 07.
Artigo em Inglês | MEDLINE | ID: mdl-38243892

RESUMO

Despite half a century's advance in the field of transition-metal-catalyzed asymmetric alkene hydrogenation, the enantioselective hydrogenation of purely alkyl-substituted 1,1-dialkylethenes has remained an unmet challenge. Herein, we describe a chiral PCNOx-pincer iridium complex for asymmetric transfer hydrogenation of this alkene class with ethanol, furnishing all-alkyl-substituted tertiary stereocenters. High levels of enantioselectivity can be achieved in the reactions of substrates with secondary/primary and primary/primary alkyl combinations. The catalyst is further applied to the redox isomerization of disubstituted alkenols, producing a tertiary stereocenter remote to the resulting carbonyl group. Mechanistic studies reveal a dihydride species, (PCNOx)Ir(H)2, as the catalytically active intermediate, which can decay to a dimeric species (κ3-PCNOx)IrH(µ-H)2IrH(κ2-PCNOx) via a ligand-remetalation pathway. The catalyst deactivation under the hydrogenation conditions with H2 is much faster than that under the transfer hydrogenation conditions with EtOH, which explains why the (PCNOx)Ir catalyst is effective for the transfer hydrogenation but ineffective for the hydrogenation. The suppression of di-to-trisubstituted alkene isomerization by regioselective 1,2-insertion is partly responsible for the success of this system, underscoring the critical role played by the pincer ligand in enantioselective transfer hydrogenation of 1,1-dialkylethenes. Moreover, computational studies elucidate the significant influence of the London dispersion interaction between the ligand and the substrate on enantioselectivity control, as illustrated by the complete reversal of stereochemistry through cyclohexyl-to-cyclopropyl group substitution in the alkene substrates.

5.
Small ; 20(1): e2305287, 2024 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-37653592

RESUMO

Poor water stability and single luminous color are the major drawbacks of the most phosphors reported. Therefore, it is important to realize multicolor luminescence in a phosphor with single host and single activator as well as moisture resistance. LaF3 :Pr3+ @SiO2 yolk-shell nanospheres are facilely obtained by a designing new technology of a simple and cost-effective electrospray ionization combined with a dicrucible fluorating technique without using protective gas. In addition, tunable photoluminescence, especially white-light emission, is successfully obtained in LaF3 :Pr3+ @SiO2 yolk-shell nanospheres by adjusting Pr3+ ion concentrations, and the luminescence mechanism of Pr3+ ion is advanced. Compared with the counterpart LaF3 :Pr3+ nanospheres, the water stability of LaF3 :Pr3+ @SiO2 yolk-shell nanospheres is improved by 15% after immersion in water for 72 h, and the fluorescence intensity can be maintained at 86% of the initial intensity. Furthermore, by treating the yolk-shell nanospheres with hydrofluoric acid, it is not only demonstrated that the shell-layer is SiO2 but also core-LaF3 :Pr3+ nanospheres are obtained. Particularly, only fluorination procedure among the halogenation can produce such special yolk-shell nanospheres, the formation mechanism of yolk-shell nanospheres is proposed detailedly based on the sound experiments and a corresponding new technology is built. These findings broaden practical applications of LaF3 :Pr3+ @SiO2 yolk-shell nanospheres.

6.
Small ; 20(16): e2308603, 2024 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-38009482

RESUMO

The shuttle effect of lithium polysulfides (LiPSs) severely hinders the development and commercialization of lithium-sulfur batteries, and the design of high-conductive carbon fiber-host material has become a key solution to suppress the shuttle effect. In this work, a unique Co/CoN-carbon nanocages@TiO2-carbon nanotubes structure (NC@TiO2-CNTs) is constructed using an electrospinning and nitriding process. Lithium-sulfur batteries using NC@TiO2-CNTs as cathode host materials exhibit high sulfur utilization (1527 mAh g-1 at 0.2 C) and can still maintain a discharge capacity of 663 mAh g-1 at a high current density of 5 C, and the capacity loss is only 0.056% per cycle during 500 cycles at 1 C. It is worth noting that even under extreme conditions (sulfur-loading = 90%, surface-loading = 5.0 mg cm-2 (S), and E/S = 6.63 µL mg-1), the lithium-sulfur batteries can still provide a reversible capacity of 4 mAh cm-2. Throughdensity functional theory calculations, it has been found that the Co/CoN heterostructures can adsorb and catalyze LiPSs conversion effectively. Simultaneously, the TiO2 can adsorb LiPSs and transfer Li+ selectively, achieving dual confinement for the shuttle effect of LiPSs (nanocages and nanotubes). The new findings provide a new performance enhancement strategy for the commercialization of lithium-sulfur batteries.

7.
Brief Bioinform ; 23(2)2022 03 10.
Artigo em Inglês | MEDLINE | ID: mdl-35039845

RESUMO

Identification of adverse drug events (ADEs) is crucial to reduce human health risks and improve drug safety assessment. With an increasing number of biological and medical data, computational methods such as network-based methods were proposed for ADE prediction with high efficiency and low cost. However, previous network-based methods rely on the topological information of known drug-ADE networks, and hence cannot make predictions for novel compounds without any known ADE. In this study, we introduced chemical substructures to bridge the gap between the drug-ADE network and novel compounds, and developed a novel network-based method named ADENet, which can predict potential ADEs for not only drugs within the drug-ADE network, but also novel compounds outside the network. To show the performance of ADENet, we collected drug-ADE associations from a comprehensive database named MetaADEDB and constructed a series of network-based prediction models. These models obtained high area under the receiver operating characteristic curve values ranging from 0.871 to 0.947 in 10-fold cross-validation. The best model further showed high performance in external validation, which outperformed a previous network-based and a recent deep learning-based method. Using several approved drugs as case studies, we found that 32-54% of the predicted ADEs can be validated by the literature, indicating the practical value of ADENet. Moreover, ADENet is freely available at our web server named NetInfer (http://lmmd.ecust.edu.cn/netinfer). In summary, our method would provide a promising tool for ADE prediction and drug safety assessment in drug discovery and development.


Assuntos
Efeitos Colaterais e Reações Adversas Relacionados a Medicamentos , Bases de Dados Factuais , Descoberta de Drogas , Humanos , Curva ROC , Projetos de Pesquisa
8.
Brief Bioinform ; 23(5)2022 09 20.
Artigo em Inglês | MEDLINE | ID: mdl-35915052

RESUMO

Antibiotic combination is a promising strategy to extend the lifetime of antibiotics and thereby combat antimicrobial resistance. However, screening for new antibiotic combinations is both time-consuming and labor-intensive. In recent years, an increasing number of researchers have used computational models to predict effective antibiotic combinations. In this review, we summarized existing computational models for antibiotic combinations and discussed the limitations and challenges of these models in detail. In addition, we also collected and summarized available data resources and tools for antibiotic combinations. This study aims to help computational biologists design more accurate and interpretable computational models.


Assuntos
Antibacterianos , Biologia Computacional , Antibacterianos/uso terapêutico , Simulação por Computador , Bases de Dados Factuais , Sinergismo Farmacológico
9.
Brief Bioinform ; 23(5)2022 09 20.
Artigo em Inglês | MEDLINE | ID: mdl-35998896

RESUMO

Nuclear receptors (NRs) are ligand-activated transcription factors, which constitute one of the most important targets for drug discovery. Current computational strategies mainly focus on a single target, and the transfer of learned knowledge among NRs was not considered yet. Herein we proposed a novel computational framework named NR-Profiler for prediction of potential NR modulators with high affinity and specificity. First, we built a comprehensive NR data set including 42 684 interactions to connect 42 NRs and 31 033 compounds. Then, we used multi-task deep neural network and multi-task graph convolutional neural network architectures to construct multi-task multi-classification models. To improve the predictive capability and robustness, we built a consensus model with an area under the receiver operating characteristic curve (AUC) = 0.883. Compared with conventional machine learning and structure-based approaches, the consensus model showed better performance in external validation. Using this consensus model, we demonstrated the practical value of NR-Profiler in virtual screening for NRs. In addition, we designed a selectivity score to quantitatively measure the specificity of NR modulators. Finally, we developed a freely available standalone software for users to make profiling predictions for their compounds of interest. In summary, our NR-Profiler provides a useful tool for NR-profiling prediction and is expected to facilitate NR-based drug discovery.


Assuntos
Aprendizado Profundo , Receptores Artificiais , Receptores dos Hormônios Gastrointestinais , Receptores de Imunoglobulina Polimérica , Receptor do Fator Ativador de Células B , Proteína Semelhante a Receptor de Calcitonina , Receptor gp130 de Citocina , Antagonistas dos Receptores H2 da Histamina , Ligantes , Antagonistas dos Receptores de Neurocinina-1 , Proteínas Proto-Oncogênicas c-met , Receptor de Glutamato Metabotrópico 5 , Proteínas Tirosina Fosfatases Classe 2 Semelhantes a Receptores , Receptores de Hidrocarboneto Arílico , Receptores de Calcitriol , Receptores Citoplasmáticos e Nucleares , Receptores Muscarínicos
10.
Chem Res Toxicol ; 37(6): 894-909, 2024 Jun 17.
Artigo em Inglês | MEDLINE | ID: mdl-38753056

RESUMO

Skin sensitization is increasingly becoming a significant concern in the development of drugs and cosmetics due to consumer safety and occupational health problems. In silico methods have emerged as alternatives to traditional in vivo animal testing due to ethical and economic considerations. In this study, machine learning methods were used to build quantitative structure-activity relationship (QSAR) models on five skin sensitization data sets (GPMT, LLNA, DPRA, KeratinoSens, and h-CLAT), achieving effective predictive accuracies (correct classification rates of 0.688-0.764 on test sets). To address the complex mechanisms of human skin sensitization, the Dempster-Shafer theory was applied to merge multiple QSAR models, resulting in an evidence-based integrated decision model. Various evidence combinations and combination rules were explored, with the self-defined Q3 rule showing superior balance. The combination of evidence such as GPMT and KeratinoSens and h-CLAT achieved a correct classification rate (CCR) of 0.880 and coverage of 0.893 while maintaining the competitiveness of other combinations. Additionally, the Shapley additive explanations (SHAP) method was used to interpret important features and substructures related to skin sensitization. A comparative analysis of an external human test set demonstrated the superior performance of the proposed method. Finally, to enhance accessibility, the workflow was implemented into a user-friendly software named HSkinSensDS.


Assuntos
Aprendizado de Máquina , Relação Quantitativa Estrutura-Atividade , Pele , Humanos , Pele/efeitos dos fármacos , Simulação por Computador
11.
Chem Res Toxicol ; 37(2): 361-373, 2024 02 19.
Artigo em Inglês | MEDLINE | ID: mdl-38294881

RESUMO

Skin Corrosion/Irritation (Corr./Irrit.) has long been a health hazard in the Globally Harmonized System (GHS). Several in silico models have been built to predict Skin Corr./Irrit. as an alternative to the increasingly restricted animal testing. However, current studies are limited by data amount/quality and model availability. To address these issues, we compiled a traceable consensus GHS data set comprising 731 Corr., 1283 Irrit., and 1205 negative (Neg.) samples from 6 governmental databases and 2 external data sets. Then, a series of binary classifiers were developed with five machine learning (ML) algorithms and six molecular representations. For 10-fold cross-validation, the best Corr. vs Neg. classifier achieved an Area Under the Receiver Operating Characteristic Curve (AUC) of 97.1%, while the best Irrit. vs Neg. classifier achieved an AUC of 84.7%. Compared with existing in silico tools on external validation, our Attentive FP classifiers showed the highest metrics on Corr. vs Neg. and the second highest accuracy on Irrit. vs Neg. The SHapley Additive exPlanation approach was further applied to figure out important molecular features, and the attention weights were visualized to perform interpretable prediction. Structural alerts associated with Skin Corr./Irrit. were also identified. The interpretable Attentive FP classifiers were integrated into the software AttentiveSkin at https://github.com/BeeBeeWong/AttentiveSkin. The conventional ML classifiers are also provided on our platform admetSAR at http://lmmd.ecust.edu.cn/admetsar2/. Considering the data deficiency and the limited model availability of Skin Corr./Irrit., we believe that our data set and models could facilitate chemical safety assessment and relevant studies.


Assuntos
Algoritmos , Pele , Animais , Corrosão , Software , Aprendizado de Máquina
12.
Chem Res Toxicol ; 37(3): 513-524, 2024 03 18.
Artigo em Inglês | MEDLINE | ID: mdl-38380652

RESUMO

The research on acute dermal toxicity has consistently been a crucial component in assessing the potential risks of human exposure to active ingredients in pesticides and related plant protection products. However, it is difficult to directly identify the acute dermal toxicity of potential compounds through animal experiments alone. In our study, we separately integrated 1735 experimental data based on rabbits and 1679 experimental data based on rats to construct acute dermal toxicity prediction models using machine learning and deep learning algorithms. The best models for the two animal species achieved AUC values of 78.0 and 82.0%, respectively, on 10-fold cross-validation. Additionally, we employed SARpy to extract structural alerts, and in conjunction with Shapley additive explanation and attentive FP heatmap, we identified important features and structural fragments associated with acute dermal toxicity. This approach offers valuable insights for the detection of positive compounds. Moreover, a standalone software tool was developed to make acute dermal toxicity prediction easier. In summary, our research would provide an effective tool for acute dermal toxicity evaluation of pesticides, cosmetics, and drug safety assessment.


Assuntos
Cosméticos , Praguicidas , Humanos , Ratos , Coelhos , Animais , Testes de Toxicidade , Cosméticos/química
13.
J Chem Inf Model ; 64(8): 3451-3464, 2024 Apr 22.
Artigo em Inglês | MEDLINE | ID: mdl-38593186

RESUMO

Cytochrome P450 3A4 (CYP3A4) is one of the most important drug-metabolizing enzymes in the human body and is well known for its complicated, atypical kinetic characteristics. The existence of multiple ligand-binding sites in CYP3A4 has been widely recognized as being capable of interfering with the active pocket through allosteric effects. The identification of ligand-binding sites other than the canonical active site above the heme is especially important for understanding the atypical kinetic characteristics of CYP3A4 and the intriguing association between the ligand and the receptor. In this study, we first employed mixed-solvent molecular dynamics (MixMD) simulations coupled with the online computational predictive tools to explore potential ligand-binding sites in CYP3A4. The MixMD approach demonstrates better performance in dealing with the receptor flexibility compared with other computational tools. From the sites identified by MixMD, we then picked out multiple sites for further exploration using ensemble docking and conventional molecular dynamics (cMD) simulations. Our results indicate that three extra sites are suitable for ligand binding in CYP3A4, including one experimentally confirmed site and two novel sites.


Assuntos
Citocromo P-450 CYP3A , Simulação de Dinâmica Molecular , Solventes , Citocromo P-450 CYP3A/química , Citocromo P-450 CYP3A/metabolismo , Ligantes , Sítios de Ligação , Solventes/química , Humanos , Simulação de Acoplamento Molecular , Ligação Proteica , Conformação Proteica
14.
J Chem Inf Model ; 64(1): 57-75, 2024 Jan 08.
Artigo em Inglês | MEDLINE | ID: mdl-38150548

RESUMO

Drug discovery is time-consuming, expensive, and predominantly follows the "one drug → one target → one disease" paradigm. With the rapid development of systems biology and network pharmacology, a novel drug discovery paradigm, "multidrug → multitarget → multidisease", has emerged. This new holistic paradigm of drug discovery aligns well with the essence of networks, leading to the emergence of network-based methods in the field of drug discovery. In this Perspective, we initially introduce the concept and data sources of networks and highlight classical methodologies employed in network-based methods. Subsequently, we focus on the practical applications of network-based methods across various areas of drug discovery, such as target prediction, virtual screening, prediction of drug therapeutic effects or adverse drug events, and elucidation of molecular mechanisms. In addition, we provide representative web servers for researchers to use network-based methods in specific applications. Finally, we discuss several challenges of network-based methods and the directions for future development. In a word, network-based methods could serve as powerful tools to accelerate drug discovery.


Assuntos
Descoberta de Drogas , Efeitos Colaterais e Reações Adversas Relacionados a Medicamentos , Humanos , Descoberta de Drogas/métodos , Biologia de Sistemas/métodos
15.
Acta Pharmacol Sin ; 2024 Jun 20.
Artigo em Inglês | MEDLINE | ID: mdl-38902503

RESUMO

Identification of compounds to modulate NADPH metabolism is crucial for understanding complex diseases and developing effective therapies. However, the complex nature of NADPH metabolism poses challenges in achieving this goal. In this study, we proposed a novel strategy named NADPHnet to predict key proteins and drug-target interactions related to NADPH metabolism via network-based methods. Different from traditional approaches only focusing on one single protein, NADPHnet could screen compounds to modulate NADPH metabolism from a comprehensive view. Specifically, NADPHnet identified key proteins involved in regulation of NADPH metabolism using network-based methods, and characterized the impact of natural products on NADPH metabolism using a combined score, NADPH-Score. NADPHnet demonstrated a broader applicability domain and improved accuracy in the external validation set. This approach was further employed along with molecular docking to identify 27 compounds from a natural product library, 6 of which exhibited concentration-dependent changes of cellular NADPH level within 100 µM, with Oxyberberine showing promising effects even at 10 µM. Mechanistic and pathological analyses of Oxyberberine suggest potential novel mechanisms to affect diabetes and cancer. Overall, NADPHnet offers a promising method for prediction of NADPH metabolism modulation and advances drug discovery for complex diseases.

16.
Can J Microbiol ; 70(3): 70-85, 2024 Mar 01.
Artigo em Inglês | MEDLINE | ID: mdl-38096505

RESUMO

The grasslands in North China are rich in fungal resources. However, the knowledge of the structure and function of fungal communities and the role of microbial communities in vegetation restoration and succession are limited. Thus, we used an Illumina HiSeq PE250 high-throughput sequencing platform to study the changing characteristics of soil fungal communities in degraded grasslands, which were categorized as non-degraded (ND), lightly degraded, moderately degraded, and severely degraded (SD). Moreover, a correlation analysis between soil physical and chemical properties and fungal communities was completed. The results showed that the number of plant species, vegetation coverage, aboveground biomass, and diversity index decreased significantly with increasing degradation, and there were significant differences in the physical and chemical properties of the soil among the different degraded grasslands. The dominant fungal phyla in the degraded grassland were as follows: Ascomycota, 44.88%-65.03%; Basidiomycota, 12.68%-29.91%; and unclassified, 5.51%-16.91%. The dominant fungi were as follows: Mortierella, 6.50%-11.41%; Chaetomium, 6.71%-11.58%; others, 25.95%-36.14%; and unclassified, 25.56%-53.0%. There were significant differences in the microbial Shannon-Wiener and Chao1 indices between the ND and degraded meadows, and the composition and diversity of the soil fungal community differed significantly as the meadows continued to deteriorate. The results showed that pH was the most critical factor affecting soil microbial and fungal communities in SD grasslands, whereas soil microbial and fungal communities in ND grasslands were mainly affected by water content and other environmental factors.


Assuntos
Microbiota , Micobioma , Pradaria , China , Solo
17.
J Appl Toxicol ; 2024 Mar 27.
Artigo em Inglês | MEDLINE | ID: mdl-38544296

RESUMO

Cytochrome P450 (CYP) enzymes are involved in the metabolism of approximately 75% of marketed drugs. Inhibition of the major drug-metabolizing P450s could alter drug metabolism and lead to undesirable drug-drug interactions. Therefore, it is of great significance to explore the inhibition of P450s in drug discovery. Currently, machine learning including deep learning algorithms has been widely used for constructing in silico models for the prediction of P450 inhibition. These models exhibited varying predictive performance depending on the use of machine learning algorithms and molecular representations. This leads to the difficulty in the selection of appropriate models for practical use. In this study, we systematically evaluated the conventional machine learning and deep learning models for three major P450 enzymes, CYP3A4, CYP2D6, and CYP2C9 from several perspectives, such as algorithms, molecular representation, and data partitioning strategies. Our results showed that the XGBoost and CatBoost algorithms coupled with the combined fingerprint/physicochemical descriptor features exhibited the best performance with Area Under Curve (AUC)  of 0.92, while the deep learning models were generally inferior to the conventional machine learning models (average AUC reached 0.89) on the same test sets. We also found that data volume and sampling strategy had a minor effect on model performance. We anticipate that these results are helpful for the selection of molecular representations and machine learning/deep learning algorithms in the P450 model construction and the future model development of P450 inhibition.

18.
J Appl Toxicol ; 44(6): 892-907, 2024 06.
Artigo em Inglês | MEDLINE | ID: mdl-38329145

RESUMO

The accurate identification of chemicals with ocular toxicity is of paramount importance in health hazard assessment. In contemporary chemical toxicology, there is a growing emphasis on refining, reducing, and replacing animal testing in safety evaluations. Therefore, the development of robust computational tools is crucial for regulatory applications. The performance of predictive models is heavily reliant on the quality and quantity of data. In this investigation, we amalgamated the most extensive dataset (4901 compounds) sourced from governmental GHS-compliant databases and literature to develop binary classification models of chemical ocular toxicity. We employed 12 molecular representations in conjunction with six machine learning algorithms and two deep learning algorithms to create a series of binary classification models. The findings indicated that the deep learning method GCN outperformed the machine learning models in cross-validation, achieving an impressive AUC of 0.915. However, the top-performing machine learning model (RF-Descriptor) demonstrated excellent performance with an AUC of 0.869 on the test set and was therefore selected as the best model. To enhance model interpretability, we conducted the SHAP method and attention weights analysis. The two approaches offered visual depictions of the relevance of key descriptors and substructures in predicting ocular toxicity of chemicals. Thus, we successfully struck a delicate balance between data quality and model interpretability, rendering our model valuable for predicting and comprehending potential ocular-toxic compounds in the early stages of drug discovery.


Assuntos
Simulação por Computador , Aprendizado Profundo , Aprendizado de Máquina , Humanos , Olho/efeitos dos fármacos , Bases de Dados Factuais , Animais , Algoritmos
19.
BMC Biol ; 21(1): 161, 2023 07 21.
Artigo em Inglês | MEDLINE | ID: mdl-37480118

RESUMO

BACKGROUND: Pennisetum giganteum (AABB, 2n = 4x = 28) is a C4 plant in the genus Pennisetum with origin in Africa but currently also grown in Asia and America. It is a crucial forage and potential energy grass with significant advantages in yield, stress resistance, and environmental adaptation. However, the mechanisms underlying these advantageous traits remain largely unexplored. Here, we present a high-quality genome assembly of the allotetraploid P. giganteum aiming at providing insights into biomass accumulation. RESULTS: Our assembly has a genome size 2.03 Gb and contig N50 of 88.47 Mb that was further divided into A and B subgenomes. Genome evolution analysis revealed the evolutionary relationships across the Panicoideae subfamily lineages and identified numerous genome rearrangements that had occurred in P. giganteum. Comparative genomic analysis showed functional differentiation between the subgenomes. Transcriptome analysis found no subgenome dominance at the overall gene expression level; however, differentially expressed homoeologous genes and homoeolog-specific expressed genes between the two subgenomes were identified, suggesting that complementary effects between the A and B subgenomes contributed to biomass accumulation of P. giganteum. Besides, C4 photosynthesis-related genes were significantly expanded in P. giganteum and their sequences and expression patterns were highly conserved between the two subgenomes, implying that both subgenomes contributed greatly and almost equally to the highly efficient C4 photosynthesis in P. giganteum. We also identified key candidate genes in the C4 photosynthesis pathway that showed sustained high expression across all developmental stages of P. giganteum. CONCLUSIONS: Our study provides important genomic resources for elucidating the genetic basis of advantageous traits in polyploid species, and facilitates further functional genomics research and genetic improvement of P. giganteum.


Assuntos
Pennisetum , Pennisetum/genética , Biomassa , Genoma de Planta , Poliploidia , Perfilação da Expressão Gênica
20.
BMC Bioinformatics ; 24(1): 452, 2023 Nov 30.
Artigo em Inglês | MEDLINE | ID: mdl-38036960

RESUMO

BACKGROUND: The identification of essential proteins is of great significance in biology and pathology. However, protein-protein interaction (PPI) data obtained through high-throughput technology include a high number of false positives. To overcome this limitation, numerous computational algorithms based on biological characteristics and topological features have been proposed to identify essential proteins. RESULTS: In this paper, we propose a novel method named SESN for identifying essential proteins. It is a seed expansion method based on PPI sub-networks and multiple biological characteristics. Firstly, SESN utilizes gene expression data to construct PPI sub-networks. Secondly, seed expansion is performed simultaneously in each sub-network, and the expansion process is based on the topological features of predicted essential proteins. Thirdly, the error correction mechanism is based on multiple biological characteristics and the entire PPI network. Finally, SESN analyzes the impact of each biological characteristic, including protein complex, gene expression data, GO annotations, and subcellular localization, and adopts the biological data with the best experimental results. The output of SESN is a set of predicted essential proteins. CONCLUSIONS: The analysis of each component of SESN indicates the effectiveness of all components. We conduct comparison experiments using three datasets from two species, and the experimental results demonstrate that SESN achieves superior performance compared to other methods.


Assuntos
Biologia Computacional , Mapeamento de Interação de Proteínas , Mapeamento de Interação de Proteínas/métodos , Biologia Computacional/métodos , Mapas de Interação de Proteínas , Proteínas/metabolismo , Algoritmos
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA