Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 229
Filtrar
1.
BMC Bioinformatics ; 25(1): 11, 2024 Jan 04.
Artigo em Inglês | MEDLINE | ID: mdl-38177985

RESUMO

BACKGROUND: Machine learning (ML) has a rich history in structural bioinformatics, and modern approaches, such as deep learning, are revolutionizing our knowledge of the subtle relationships between biomolecular sequence, structure, function, dynamics and evolution. As with any advance that rests upon statistical learning approaches, the recent progress in biomolecular sciences is enabled by the availability of vast volumes of sufficiently-variable data. To be useful, such data must be well-structured, machine-readable, intelligible and manipulable. These and related requirements pose challenges that become especially acute at the computational scales typical in ML. Furthermore, in structural bioinformatics such data generally relate to protein three-dimensional (3D) structures, which are inherently more complex than sequence-based data. A significant and recurring challenge concerns the creation of large, high-quality, openly-accessible datasets that can be used for specific training and benchmarking tasks in ML pipelines for predictive modeling projects, along with reproducible splits for training and testing. RESULTS: Here, we report 'Prop3D', a platform that allows for the creation, sharing and extensible reuse of libraries of protein domains, featurized with biophysical and evolutionary properties that can range from detailed, atomically-resolved physicochemical quantities (e.g., electrostatics) to coarser, residue-level features (e.g., phylogenetic conservation). As a community resource, we also supply a 'Prop3D-20sf' protein dataset, obtained by applying our approach to CATH . We have developed and deployed the Prop3D framework, both in the cloud and on local HPC resources, to systematically and reproducibly create comprehensive datasets via the Highly Scalable Data Service ( HSDS ). Our datasets are freely accessible via a public HSDS instance, or they can be used with accompanying Python wrappers for popular ML frameworks. CONCLUSION: Prop3D and its associated Prop3D-20sf dataset can be of broad utility in at least three ways. Firstly, the Prop3D workflow code can be customized and deployed on various cloud-based compute platforms, with scalability achieved largely by saving the results to distributed HDF5 files via HSDS . Secondly, the linked Prop3D-20sf dataset provides a hand-crafted, already-featurized dataset of protein domains for 20 highly-populated CATH families; importantly, provision of this pre-computed resource can aid the more efficient development (and reproducible deployment) of ML pipelines. Thirdly, Prop3D-20sf's construction explicitly takes into account (in creating datasets and data-splits) the enigma of 'data leakage', stemming from the evolutionary relationships between proteins.


Assuntos
Biologia Computacional , Proteínas , Humanos , Filogenia , Biologia Computacional/métodos , Fluxo de Trabalho , Aprendizado de Máquina
2.
PLoS Comput Biol ; 19(12): e1011652, 2023 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-38060459

RESUMO

Information is the cornerstone of research, from experimental (meta)data and computational processes to complex inventories of reagents and equipment. These 10 simple rules discuss best practices for leveraging laboratory information management systems to transform this large information load into useful scientific findings.

3.
PLoS Comput Biol ; 19(12): e1011698, 2023 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-38127691
4.
ACS Med Chem Lett ; 14(11): 1503-1508, 2023 Nov 09.
Artigo em Inglês | MEDLINE | ID: mdl-37974950

RESUMO

ATP-competitive kinase inhibitors form hydrogen bond interactions with the kinase hinge region at the adenine binding site. Thus, it is crucial to explore hinge-ligand recognition as part of a rational drug design strategy. Here, harnessing known ligand-bound kinase structures and experimental assay resources, we first created a kinase structure-assay database (KSAD) containing 2705 nM ligand-bound kinase complexes. Then, using KSAD, we systematically investigate hinge-ligand binding patterns using interaction fingerprints, thereby delineating 15 different hydrogen-bond interaction modes. We believe these results will be valuable for de novo drug design and/or scaffold hopping of kinase-targeted drugs.

5.
ACS Pharmacol Transl Sci ; 6(8): 1182-1191, 2023 Aug 11.
Artigo em Inglês | MEDLINE | ID: mdl-37588756

RESUMO

Macrocyclic kinase inhibitors (MKIs) are gaining attention due to their favorable selectivity and potential to overcome drug resistance, yet they remain challenging to design because of their novel structures. To facilitate the design and discovery of MKIs, we investigate MKI rational design starting from initial acyclic compounds by performing microsecond-scale atomistic simulations for multiple MKIs, constructing an MKI database, and analyzing MKIs using hierarchical cluster analysis. Our studies demonstrate that the binding modes of MKIs are like those of their corresponding acyclic counterparts against the same kinase targets. Importantly, within the respective binding sites, the MKI scaffolds retain the same conformations as their corresponding acyclic counterparts, demonstrating the rigidity of scaffolds before and after molecular cyclization. The MKI database includes 641 nanomole-level MKIs from 56 human kinases elucidating the features of rigid scaffolds and the core structures of MKIs. Collectively these results and resources can facilitate MKI development.

6.
PLoS Biol ; 21(7): e3002204, 2023 07.
Artigo em Inglês | MEDLINE | ID: mdl-37478129

RESUMO

Research data is optimized when it can be freely accessed and reused. To maximize research equity, transparency, and reproducibility, policymakers should take concrete steps to ensure that research software is openly accessible and reusable.


Assuntos
Políticas , Software , Reprodutibilidade dos Testes
7.
PLoS Comput Biol ; 19(3): e1010911, 2023 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-36862619
8.
J Chem Inf Model ; 63(4): 1362-1370, 2023 02 27.
Artigo em Inglês | MEDLINE | ID: mdl-36780612

RESUMO

KRAS, a common human oncogene, has been recognized as a critical drug target in treating multiple cancers. After four decades of effort, one allosteric KRAS drug (Sotorasib) has been approved, inspiring more KRAS-targeted drug research. Here, we provide the features of KRAS binding pockets and ligand-binding characteristics of KRAS complexes using a structural systems pharmacology approach. Three distinct binding sites (conserved nucleotide-binding site, shallow Switch-I/II pocket, and allosteric Switch-II/α3 pocket) are characterized. Ligand-binding features are determined based on encoded KRAS-inhibitor interaction fingerprints. Finally, the flexibility of the three distinct binding sites to accommodate different potential ligands, based on MD simulation, is discussed. Collectively, these findings are intended to facilitate rational KRAS drug design.


Assuntos
Neoplasias , Proteínas Proto-Oncogênicas p21(ras) , Humanos , Ligantes , Sítios de Ligação , Desenho de Fármacos , Neoplasias/tratamento farmacológico , Mutação
9.
PLoS Comput Biol ; 19(1): e1010851, 2023 01.
Artigo em Inglês | MEDLINE | ID: mdl-36652496

RESUMO

Systematically discovering protein-ligand interactions across the entire human and pathogen genomes is critical in chemical genomics, protein function prediction, drug discovery, and many other areas. However, more than 90% of gene families remain "dark"-i.e., their small-molecule ligands are undiscovered due to experimental limitations or human/historical biases. Existing computational approaches typically fail when the dark protein differs from those with known ligands. To address this challenge, we have developed a deep learning framework, called PortalCG, which consists of four novel components: (i) a 3-dimensional ligand binding site enhanced sequence pre-training strategy to encode the evolutionary links between ligand-binding sites across gene families; (ii) an end-to-end pretraining-fine-tuning strategy to reduce the impact of inaccuracy of predicted structures on function predictions by recognizing the sequence-structure-function paradigm; (iii) a new out-of-cluster meta-learning algorithm that extracts and accumulates information learned from predicting ligands of distinct gene families (meta-data) and applies the meta-data to a dark gene family; and (iv) a stress model selection step, using different gene families in the test data from those in the training and development data sets to facilitate model deployment in a real-world scenario. In extensive and rigorous benchmark experiments, PortalCG considerably outperformed state-of-the-art techniques of machine learning and protein-ligand docking when applied to dark gene families, and demonstrated its generalization power for target identifications and compound screenings under out-of-distribution (OOD) scenarios. Furthermore, in an external validation for the multi-target compound screening, the performance of PortalCG surpassed the rational design from medicinal chemists. Our results also suggest that a differentiable sequence-structure-function deep learning framework, where protein structural information serves as an intermediate layer, could be superior to conventional methodology where predicted protein structures were used for the compound screening. We applied PortalCG to two case studies to exemplify its potential in drug discovery: designing selective dual-antagonists of dopamine receptors for the treatment of opioid use disorder (OUD), and illuminating the understudied human genome for target diseases that do not yet have effective and safe therapeutics. Our results suggested that PortalCG is a viable solution to the OOD problem in exploring understudied regions of protein functional space.


Assuntos
Algoritmos , Proteínas , Humanos , Ligantes , Proteínas/química , Sítios de Ligação , Aprendizado de Máquina , Ligação Proteica
10.
PLoS Biol ; 20(12): e3001901, 2022 12.
Artigo em Inglês | MEDLINE | ID: mdl-36508416

RESUMO

Does reductionism, in the era of machine learning and now interpretable AI, facilitate or hinder scientific insight? The protein ribbon diagram, as a means of visual reductionism, is a case in point.


Assuntos
Aprendizado de Máquina , Sinapses
11.
Pharmaceuticals (Basel) ; 15(11)2022 Oct 26.
Artigo em Inglês | MEDLINE | ID: mdl-36355497

RESUMO

Kinase-targeted drug discovery for cancer therapy has advanced significantly in the last three decades. Currently, diverse kinase inhibitors or degraders have been reported, such as allosteric inhibitors, covalent inhibitors, macrocyclic inhibitors, and PROTAC degraders. Out of these, covalent kinase inhibitors (CKIs) have been attracting attention due to their enhanced selectivity and exceptionally strong affinity. Eight covalent kinase drugs have been FDA-approved thus far. Here, we review current developments in CKIs. We explore the characteristics of the CKIs: the features of nucleophilic amino acids and the preferences of electrophilic warheads. We provide systematic insights into privileged warheads for repurposing to other kinase targets. Finally, we discuss trends in CKI development across the whole proteome.

13.
PLoS Comput Biol ; 18(8): e1010395, 2022 08.
Artigo em Inglês | MEDLINE | ID: mdl-36006874

RESUMO

Special sessions are important parts of scientific meetings and conferences: They gather together researchers and students interested in a specific topic and can strongly contribute to the success of the conference itself. Moreover, they can be the first step for trainees and students to the organization of a scientific event. Organizing a special session, however, can be uneasy for beginners and students. Here, we provide ten simple rules to follow to organize a special session at a scientific conference.


Assuntos
Pesquisadores , Estudantes , Humanos
14.
Drug Discov Today ; 27(10): 103319, 2022 10.
Artigo em Inglês | MEDLINE | ID: mdl-35850431

RESUMO

Determining protein-ligand interaction characteristics and mechanisms is crucial to the drug discovery process. Here, we review recent progress and successful applications of a systematic protein-ligand interaction fingerprint (IFP) approach for investigating proteome-wide protein-ligand interactions for drug development. Specifically, we review the use of this IFP approach for revealing polypharmacology across the kinome, predicting promising targets from which to design allosteric inhibitors and covalent kinase inhibitors, uncovering the binding mechanisms of drugs of interest, and demonstrating resistant mechanisms of specific drugs. Together, we demonstrate that the IFP strategy is efficient and practical for drug design research for protein kinases as targets and is extensible to other protein families.


Assuntos
Polifarmacologia , Proteoma , Descoberta de Drogas , Ligantes , Inibidores de Proteínas Quinases/farmacologia , Proteínas Quinases/metabolismo
16.
PLoS Comput Biol ; 18(6): e1010133, 2022 06.
Artigo em Inglês | MEDLINE | ID: mdl-35679224
17.
PLoS Comput Biol ; 18(6): e1010130, 2022 06.
Artigo em Inglês | MEDLINE | ID: mdl-35737640

RESUMO

Communication is a fundamental part of scientific development and methodology. With the advancement of the internet and social networks, communication has become rapid and sometimes overwhelming, especially in science. It is important to provide scientists with useful, effective, and dynamic tools to establish and build a fluid communication framework that allows for scientific advancement. Therefore, in this article, we present advice and recommendations that can help promote and improve science communication while respecting an adequate balance in the degree of commitment toward collaborative work. We have developed 10 rules shown in increasing order of commitment that are grouped into 3 key categories: (1) speak (based on active participation); (2) join (based on joining scientific groups); and (3) assess (based on the analysis and retrospective consideration of the weaknesses and strengths). We include examples and resources that provide actionable strategies for involvement and engagement with science communication, from basic steps to more advanced, introspective, and long-term commitments. Overall, we aim to help spread science from within and encourage and engage scientists to become involved in science communication effectively and dynamically.


Assuntos
Comunicação , Rede Social , Estudos Retrospectivos
19.
Vaccines (Basel) ; 10(3)2022 Mar 08.
Artigo em Inglês | MEDLINE | ID: mdl-35335040

RESUMO

Background: The COVID-19 pandemic is being battled via the largest vaccination campaign in history, with more than eight billion doses administered thus far. Therefore, discussions about potentially adverse reactions, and broader safety concerns, are critical. The U.S. Vaccination Adverse Event Reporting System (VAERS) has recorded vaccination side effects for over 30 years. About 580,000 events have been filed for COVID-19 thus far, primarily for the Johnson & Johnson (New Jersey, USA), Pfizer/BioNTech (Mainz, Germany), and Moderna (Cambridge, USA) vaccines. Methods: Using available databases, we evaluated these three vaccines in terms of the occurrence of four generally-noticed adverse reactions­namely, cerebral venous sinus thrombosis, Guillain−Barré syndrome (a severe paralytic neuropathy), myocarditis, and pericarditis. Our statistical analysis also included a calculation of odds ratios (ORs) based on total vaccination numbers, accounting for incidence rates in the general population. Results: ORs for a number of adverse events and patient groups were (largely) increased, most notably for the occurrence of cerebral venous sinus thrombosis after vaccination with the Johnson & Johnson vaccine. The overall population OR of 10 increases to 12.5 when limited to women, and further yet (to 14.4) among women below age 50 yrs. In addition, elevated risks were found (i) for Guillain−Barré syndrome (OR of 11.6) and (ii) for myocarditis/pericarditis (ORs of 5.3/4.1, respectively) among young men (<25 yrs) vaccinated with the Pfizer/BioNTech vaccine. Conclusions: Any conclusions from such a retrospective, real-world data analysis must be drawn cautiously, and should be confirmed by prospective double-blinded clinical trials. In addition, we emphasize that the adverse events reported here are not specific side effects of COVID vaccines, and the significant, well-established benefits of COVID-19 vaccination outweigh the potential complications surveyed here.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...