Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 45
Filtrar
Mais filtros

Base de dados
País/Região como assunto
Tipo de documento
Intervalo de ano de publicação
1.
Cell ; 184(15): 4073-4089.e17, 2021 07 22.
Artigo em Inglês | MEDLINE | ID: mdl-34214469

RESUMO

Cellular processes arise from the dynamic organization of proteins in networks of physical interactions. Mapping the interactome has therefore been a central objective of high-throughput biology. However, the dynamics of protein interactions across physiological contexts remain poorly understood. Here, we develop a quantitative proteomic approach combining protein correlation profiling with stable isotope labeling of mammals (PCP-SILAM) to map the interactomes of seven mouse tissues. The resulting maps provide a proteome-scale survey of interactome rewiring across mammalian tissues, revealing more than 125,000 unique interactions at a quality comparable to the highest-quality human screens. We identify systematic suppression of cross-talk between the evolutionarily ancient housekeeping interactome and younger, tissue-specific modules. Rewired proteins are tightly regulated by multiple cellular mechanisms and are implicated in disease. Our study opens up new avenues to uncover regulatory mechanisms that shape in vivo interactome responses to physiological and pathophysiological stimuli in mammalian systems.


Assuntos
Especificidade de Órgãos , Mapeamento de Interação de Proteínas , Animais , Marcação por Isótopo , Masculino , Mamíferos , Camundongos Endogâmicos C57BL , Reprodutibilidade dos Testes
2.
Nature ; 2024 Jun 19.
Artigo em Inglês | MEDLINE | ID: mdl-38898272

RESUMO

Here, we introduce the Tabulae Paralytica-a compilation of four atlases of spinal cord injury (SCI) comprising a single-nucleus transcriptome atlas of half a million cells, a multiome atlas pairing transcriptomic and epigenomic measurements within the same nuclei, and two spatial transcriptomic atlases of the injured spinal cord spanning four spatial and temporal dimensions. We integrated these atlases into a common framework to dissect the molecular logic that governs the responses to injury within the spinal cord1. The Tabulae Paralytica uncovered new biological principles that dictate the consequences of SCI, including conserved and divergent neuronal responses to injury; the priming of specific neuronal subpopulations to upregulate circuit-reorganizing programs after injury; an inverse relationship between neuronal stress responses and the activation of circuit reorganization programs; the necessity of re-establishing a tripartite neuroprotective barrier between immune-privileged and extra-neural environments after SCI and a failure to form this barrier in old mice. We leveraged the Tabulae Paralytica to develop a rejuvenative gene therapy that re-established this tripartite barrier, and restored the natural recovery of walking after paralysis in old mice. The Tabulae Paralytica provides a window into the pathobiology of SCI, while establishing a framework for integrating multimodal, genome-scale measurements in four dimensions to study biology and medicine.

3.
Nature ; 611(7936): 540-547, 2022 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-36352232

RESUMO

A spinal cord injury interrupts pathways from the brain and brainstem that project to the lumbar spinal cord, leading to paralysis. Here we show that spatiotemporal epidural electrical stimulation (EES) of the lumbar spinal cord1-3 applied during neurorehabilitation4,5 (EESREHAB) restored walking in nine individuals with chronic spinal cord injury. This recovery involved a reduction in neuronal activity in the lumbar spinal cord of humans during walking. We hypothesized that this unexpected reduction reflects activity-dependent selection of specific neuronal subpopulations that become essential for a patient to walk after spinal cord injury. To identify these putative neurons, we modelled the technological and therapeutic features underlying EESREHAB in mice. We applied single-nucleus RNA sequencing6-9 and spatial transcriptomics10,11 to the spinal cords of these mice to chart a spatially resolved molecular atlas of recovery from paralysis. We then employed cell type12,13 and spatial prioritization to identify the neurons involved in the recovery of walking. A single population of excitatory interneurons nested within intermediate laminae emerged. Although these neurons are not required for walking before spinal cord injury, we demonstrate that they are essential for the recovery of walking with EES following spinal cord injury. Augmenting the activity of these neurons phenocopied the recovery of walking enabled by EESREHAB, whereas ablating them prevented the recovery of walking that occurs spontaneously after moderate spinal cord injury. We thus identified a recovery-organizing neuronal subpopulation that is necessary and sufficient to regain walking after paralysis. Moreover, our methodology establishes a framework for using molecular cartography to identify the neurons that produce complex behaviours.


Assuntos
Neurônios , Paralisia , Traumatismos da Medula Espinal , Medula Espinal , Caminhada , Animais , Humanos , Camundongos , Neurônios/fisiologia , Paralisia/genética , Paralisia/fisiopatologia , Paralisia/terapia , Medula Espinal/citologia , Medula Espinal/fisiologia , Medula Espinal/fisiopatologia , Traumatismos da Medula Espinal/genética , Traumatismos da Medula Espinal/fisiopatologia , Traumatismos da Medula Espinal/terapia , Caminhada/fisiologia , Estimulação Elétrica , Região Lombossacral/inervação , Reabilitação Neurológica , Análise de Sequência de RNA , Perfilação da Expressão Gênica
4.
Nat Methods ; 18(7): 806-815, 2021 07.
Artigo em Inglês | MEDLINE | ID: mdl-34211188

RESUMO

Co-fractionation mass spectrometry (CF-MS) has emerged as a powerful technique for interactome mapping. However, there is little consensus on optimal strategies for the design of CF-MS experiments or their computational analysis. Here, we reanalyzed a total of 206 CF-MS experiments to generate a uniformly processed resource containing over 11 million measurements of protein abundance. We used this resource to benchmark experimental designs for CF-MS studies and systematically optimize computational approaches to network inference. We then applied this optimized methodology to reconstruct a draft-quality human interactome by CF-MS and predict over 700,000 protein-protein interactions across 27 eukaryotic species or clades. Our work defines new resources to illuminate proteome organization over evolutionary timescales and establishes best practices for the design and analysis of CF-MS studies.


Assuntos
Espectrometria de Massas/métodos , Mapeamento de Interação de Proteínas/métodos , Proteoma/análise , Animais , Evolução Biológica , Fracionamento Químico , Bases de Dados de Proteínas , Humanos , Camundongos , Plasmodium berghei/química , Plasmodium berghei/metabolismo , Proteômica/métodos
5.
Anal Chem ; 95(47): 17300-17310, 2023 11 28.
Artigo em Inglês | MEDLINE | ID: mdl-37966487

RESUMO

Over the last two decades, hundreds of new psychoactive substances (NPSs), also known as "designer drugs", have emerged on the illicit drug market. The toxic and potentially fatal effects of these compounds oblige laboratories around the world to screen for NPS in seized materials and biological samples, commonly using high-resolution mass spectrometry. However, unambiguous identification of a NPS by mass spectrometry requires comparison to data from analytical reference materials, acquired on the same instrument. The sheer number of NPSs that are available on the illicit market, and the pace at which new compounds are introduced, means that forensic laboratories must make difficult decisions about which reference materials to acquire. Here, we asked whether retrospective suspect screening of population-scale mass spectrometry data could provide a data-driven platform to prioritize emerging NPSs for assay development. We curated a suspect database of precursor and diagnostic fragment ion masses for 83 emerging NPSs and used this database to retrospectively screen mass spectrometry data from 12,727 urine drug screens from one Canadian province. We developed integrative computational strategies to prioritize the most reliable identifications and tracked the frequency of these identifications over a 3 year study period between August 2019 and August 2022. The resulting data were used to guide the acquisition of new reference materials, which were in turn used to validate a subset of the retrospective identifications. Last, we took advantage of matching clinical reports for all 12,727 samples to systematically benchmark the accuracy of our retrospective data analysis approach. Our work opens up new avenues to enable the rapid detection of emerging illicit drugs through large-scale reanalysis of mass spectrometry data.


Assuntos
Drogas Ilícitas , Psicotrópicos , Estudos Retrospectivos , Psicotrópicos/análise , Canadá , Espectrometria de Massas/métodos , Drogas Ilícitas/análise , Detecção do Abuso de Substâncias/métodos
6.
Anal Chem ; 95(50): 18326-18334, 2023 12 19.
Artigo em Inglês | MEDLINE | ID: mdl-38048435

RESUMO

The market for illicit drugs has been reshaped by the emergence of more than 1100 new psychoactive substances (NPS) over the past decade, posing a major challenge to the forensic and toxicological laboratories tasked with detecting and identifying them. Tandem mass spectrometry (MS/MS) is the primary method used to screen for NPS within seized materials or biological samples. The most contemporary workflows necessitate labor-intensive and expensive MS/MS reference standards, which may not be available for recently emerged NPS on the illicit market. Here, we present NPS-MS, a deep learning method capable of accurately predicting the MS/MS spectra of known and hypothesized NPS from their chemical structures alone. NPS-MS is trained by transfer learning from a generic MS/MS prediction model on a large data set of MS/MS spectra. We show that this approach enables a more accurate identification of NPS from experimentally acquired MS/MS spectra than any existing method. We demonstrate the application of NPS-MS to identify a novel derivative of phencyclidine (PCP) within an unknown powder seized in Denmark without the use of any reference standards. We anticipate that NPS-MS will allow forensic laboratories to identify more rapidly both known and newly emerging NPS. NPS-MS is available as a web server at https://nps-ms.ca/, which provides MS/MS spectra prediction capabilities for given NPS compounds. Additionally, it offers MS/MS spectra identification against a vast database comprising approximately 8.7 million predicted NPS compounds from DarkNPS and 24.5 million predicted ESI-QToF-MS/MS spectra for these compounds.


Assuntos
Aprendizado Profundo , Drogas Ilícitas , Espectrometria de Massas em Tandem/métodos , Psicotrópicos/análise , Drogas Ilícitas/análise , Espectrometria de Massas por Ionização por Electrospray
7.
Mol Cell Proteomics ; 20: 100002, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-33592499

RESUMO

Biological functions emerge from complex and dynamic networks of protein-protein interactions. Because these protein-protein interaction networks, or interactomes, represent pairwise connections within a hierarchically organized system, it is often useful to identify higher-order associations embedded within them, such as multimember protein complexes. Graph-based clustering techniques are widely used to accomplish this goal, and dozens of field-specific and general clustering algorithms exist. However, interactomes can be prone to errors, especially when inferred from high-throughput biochemical assays. Therefore, robustness to network-level noise is an important criterion. Here, we tested the robustness of a range of graph-based clustering algorithms in the presence of noise, including algorithms common across domains and those specific to protein networks. Strikingly, we found that all of the clustering algorithms tested here markedly amplified network-level noise. Randomly rewiring only 1% of network edges yielded more than a 50% change in clustering results. Moreover, we found the impact of network noise on individual clusters was not uniform: some clusters were consistently robust to injected noise, whereas others were not. Therefore we developed the clust.perturb R package and Shiny web application to measure the reproducibility of clusters by randomly perturbing the network. We show that clust.perturb results are predictive of real-world cluster stability: poorly reproducible clusters as identified by clust.perturb are significantly less likely to be reclustered across experiments. We conclude that graph-based clustering amplifies noise in protein interaction networks, but quantifying the robustness of a cluster to network noise can separate stable protein complexes from spurious associations.


Assuntos
Mapas de Interação de Proteínas , Algoritmos , Análise por Conglomerados , Humanos , Reprodutibilidade dos Testes , Software
8.
Mol Cell Proteomics ; 20: 100096, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-34129941

RESUMO

Despite the emergence of promising therapeutic approaches in preclinical studies, the failure of large-scale clinical trials leaves clinicians without effective treatments for acute spinal cord injury (SCI). These trials are hindered by their reliance on detailed neurological examinations to establish outcomes, which inflate the time and resources required for completion. Moreover, therapeutic development takes place in animal models whose relevance to human injury remains unclear. Here, we address these challenges through targeted proteomic analyses of cerebrospinal fluid and serum samples from 111 patients with acute SCI and, in parallel, a large animal (porcine) model of SCI. We develop protein biomarkers of injury severity and recovery, including a prognostic model of neurological improvement at 6 months with an area under the receiver operating characteristic curve of 0.91, and validate these in an independent cohort. Through cross-species proteomic analyses, we dissect evolutionarily conserved and divergent aspects of the SCI response and establish the cerebrospinal fluid abundance of glial fibrillary acidic protein as a biochemical outcome measure in both humans and pigs. Our work opens up new avenues to catalyze translation by facilitating the evaluation of novel SCI therapies, while also providing a resource from which to direct future preclinical efforts.


Assuntos
Proteína Glial Fibrilar Ácida/sangue , Proteína Glial Fibrilar Ácida/líquido cefalorraquidiano , Traumatismos da Medula Espinal/sangue , Traumatismos da Medula Espinal/líquido cefalorraquidiano , Animais , Feminino , Humanos , Proteômica , Medula Espinal/patologia , Traumatismos da Medula Espinal/patologia , Suínos
9.
Proc Natl Acad Sci U S A ; 117(1): 371-380, 2020 01 07.
Artigo em Inglês | MEDLINE | ID: mdl-31871149

RESUMO

Microbial natural products represent a rich resource of evolved chemistry that forms the basis for the majority of pharmacotherapeutics. Ribosomally synthesized and posttranslationally modified peptides (RiPPs) are a particularly interesting class of natural products noted for their unique mode of biosynthesis and biological activities. Analyses of sequenced microbial genomes have revealed an enormous number of biosynthetic loci encoding RiPPs but whose products remain cryptic. In parallel, analyses of bacterial metabolomes typically assign chemical structures to only a minority of detected metabolites. Aligning these 2 disparate sources of data could provide a comprehensive strategy for natural product discovery. Here we present DeepRiPP, an integrated genomic and metabolomic platform that employs machine learning to automate the selective discovery and isolation of novel RiPPs. DeepRiPP includes 3 modules. The first, NLPPrecursor, identifies RiPPs independent of genomic context and neighboring biosynthetic genes. The second module, BARLEY, prioritizes loci that encode novel compounds, while the third, CLAMS, automates the isolation of their corresponding products from complex bacterial extracts. DeepRiPP pinpoints target metabolites using large-scale comparative metabolomics analysis across a database of 10,498 extracts generated from 463 strains. We apply the DeepRiPP platform to expand the landscape of novel RiPPs encoded within sequenced genomes and to discover 3 novel RiPPs, whose structures are exactly as predicted by our platform. By building on advances in machine learning technologies, DeepRiPP integrates genomic and metabolomic data to guide the isolation of novel RiPPs in an automated manner.


Assuntos
Proteínas de Bactérias/isolamento & purificação , Produtos Biológicos/isolamento & purificação , Descoberta de Drogas/métodos , Peptídeos/isolamento & purificação , Software , Bactérias/genética , Bactérias/metabolismo , Proteínas de Bactérias/biossíntese , Proteínas de Bactérias/genética , Produtos Biológicos/metabolismo , Genômica/métodos , Aprendizado de Máquina , Metabolômica/métodos , Biossíntese Peptídica/genética , Peptídeos/genética , Peptídeos/metabolismo , Processamento de Proteína Pós-Traducional , Ribossomos/metabolismo
10.
Nat Methods ; 16(5): 381-386, 2019 05.
Artigo em Inglês | MEDLINE | ID: mdl-30962620

RESUMO

Single-cell transcriptomics provides an opportunity to characterize cell-type-specific transcriptional networks, intercellular signaling pathways and cellular diversity with unprecedented resolution by profiling thousands of cells in a single experiment. However, owing to the unique statistical properties of scRNA-seq data, the optimal measures of association for identifying gene-gene and cell-cell relationships from single-cell transcriptomics remain unclear. Here, we conducted a large-scale evaluation of 17 measures of association for their ability to reconstruct cellular networks, cluster cells of the same type and link cell-type-specific transcriptional programs to disease. Measures of proportionality were consistently among the best-performing methods across datasets and tasks. Our analysis provides data-driven guidance for gene and cell network analysis in single-cell transcriptomics.


Assuntos
Biologia Computacional/métodos , Perfilação da Expressão Gênica/métodos , Redes Reguladoras de Genes/genética , Análise de Célula Única/métodos , Animais , Doenças Cardiovasculares/genética , Linhagem Celular , Doenças do Sistema Nervoso Central/genética , Análise por Conglomerados , Humanos , Camundongos , Reprodutibilidade dos Testes , Análise de Sequência de RNA/métodos , Software
11.
Bioinformatics ; 37(17): 2775-2777, 2021 Sep 09.
Artigo em Inglês | MEDLINE | ID: mdl-33471077

RESUMO

SUMMARY: We present PrInCE, an R/Bioconductor package that employs a machine-learning approach to infer protein-protein interaction networks from co-fractionation mass spectrometry (CF-MS) data. Previously distributed as a collection of Matlab scripts, our ground-up rewrite of this software package in an open-source language dramatically improves runtime and memory requirements. We describe several new features in the R implementation, including a test for the detection of co-eluting protein complexes and a method for differential network analysis. PrInCE is extensively documented and fully compatible with Bioconductor classes, ensuring it can fit seamlessly into existing proteomics workflows. AVAILABILITY AND IMPLEMENTATION: PrInCE is available from Bioconductor (https://www.bioconductor.org/packages/devel/bioc/html/PrInCE.html). Source code is freely available from GitHub under the MIT license (https://github.com/fosterlab/PrInCE). Support is provided via the GitHub issues tracker (https://github.com/fosterlab/PrInCE/issues). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

12.
PLoS Comput Biol ; 14(10): e1006474, 2018 10.
Artigo em Inglês | MEDLINE | ID: mdl-30332399

RESUMO

Elucidating the complete network of protein-protein interactions, or interactome, is a fundamental goal of the post-genomic era, yet existing interactome maps are far from complete. To increase the throughput and resolution of interactome mapping, methods for protein-protein interaction discovery by co-migration have been introduced. However, accurate identification of interacting protein pairs within the resulting large-scale proteomic datasets is challenging. Consequently, most computational pipelines for co-migration data analysis incorporate external genomic datasets to distinguish interacting from non-interacting protein pairs. The effect of this procedure on interactome mapping is poorly understood. Here, we conduct a rigorous analysis of genomic data integration for interactome recovery across a large number of co-migration datasets, spanning diverse experimental and computational methods. We find that genomic data integration leads to an increase in the functional coherence of the resulting interactome maps, but this comes at the expense of a decrease in power to discover novel interactions. Importantly, putative novel interactions predicted by genomic data integration are no more likely to later be experimentally discovered than those predicted from co-migration data alone. Our results reveal a widespread and unappreciated limitation in a methodology that has been widely used to map the interactome of humans and model organisms.


Assuntos
Bases de Dados Genéticas , Genômica/métodos , Mapeamento de Interação de Proteínas/métodos , Animais , Humanos , Camundongos
13.
Nucleic Acids Res ; 45(W1): W49-W54, 2017 07 03.
Artigo em Inglês | MEDLINE | ID: mdl-28460067

RESUMO

Microbial natural products represent a rich resource of pharmaceutically and industrially important compounds. Genome sequencing has revealed that the majority of natural products remain undiscovered, and computational methods to connect biosynthetic gene clusters to their corresponding natural products therefore have the potential to revitalize natural product discovery. Previously, we described PRediction Informatics for Secondary Metabolomes (PRISM), a combinatorial approach to chemical structure prediction for genetically encoded nonribosomal peptides and type I and II polyketides. Here, we present a ground-up rewrite of the PRISM structure prediction algorithm to derive prediction of natural products arising from non-modular biosynthetic paradigms. Within this new version, PRISM 3, natural product scaffolds are modeled as chemical graphs, permitting structure prediction for aminocoumarins, antimetabolites, bisindoles and phosphonate natural products, and building upon the addition of ribosomally synthesized and post-translationally modified peptides. Further, with the addition of cluster detection for 11 new cluster types, PRISM 3 expands to detect 22 distinct natural product cluster types. Other major modifications to PRISM include improved sequence input and ORF detection, user-friendliness and output. Distribution of PRISM 3 over a 300-core server grid improves the speed and capacity of the web application. PRISM 3 is available at http://magarveylab.ca/prism/.


Assuntos
Produtos Biológicos/química , Genoma Microbiano , Software , Algoritmos , Vias Biossintéticas/genética , Internet , Metaboloma/genética , Metabolismo Secundário/genética
14.
Proc Natl Acad Sci U S A ; 113(42): E6343-E6351, 2016 10 18.
Artigo em Inglês | MEDLINE | ID: mdl-27698135

RESUMO

Microbial natural products are an evolved resource of bioactive small molecules, which form the foundation of many modern therapeutic regimes. Ribosomally synthesized and posttranslationally modified peptides (RiPPs) represent a class of natural products which have attracted extensive interest for their diverse chemical structures and potent biological activities. Genome sequencing has revealed that the vast majority of genetically encoded natural products remain unknown. Many bioinformatic resources have therefore been developed to predict the chemical structures of natural products, particularly nonribosomal peptides and polyketides, from sequence data. However, the diversity and complexity of RiPPs have challenged systematic investigation of RiPP diversity, and consequently the vast majority of genetically encoded RiPPs remain chemical "dark matter." Here, we introduce an algorithm to catalog RiPP biosynthetic gene clusters and chart genetically encoded RiPP chemical space. A global analysis of 65,421 prokaryotic genomes revealed 30,261 RiPP clusters, encoding 2,231 unique products. We further leverage the structure predictions generated by our algorithm to facilitate the genome-guided discovery of a molecule from a rare family of RiPPs. Our results provide the systematic investigation of RiPP genetic and chemical space, revealing the widespread distribution of RiPP biosynthesis throughout the prokaryotic tree of life, and provide a platform for the targeted discovery of RiPPs based on genome sequencing.


Assuntos
Produtos Biológicos , Biologia Computacional/métodos , Genômica , Biossíntese de Proteínas/genética , Ribossomos/metabolismo , Algoritmos , Análise por Conglomerados , Genômica/métodos , Cadeias de Markov , Peptídeos/genética , Peptídeos/metabolismo , Células Procarióticas/fisiologia , Processamento de Proteína Pós-Traducional , Reprodutibilidade dos Testes
15.
BMC Genomics ; 19(1): 758, 2018 Oct 19.
Artigo em Inglês | MEDLINE | ID: mdl-30340458

RESUMO

BACKGROUND: Databases of literature-curated protein-protein interactions (PPIs) are often used to interpret high-throughput interactome mapping studies and estimate error rates. These databases combine interactions across thousands of published studies and experimental techniques. Because the tendency for two proteins to interact depends on the local conditions, this heterogeneity of conditions means that only a subset of database PPIs are interacting during any given experiment. A typical use of these databases as gold standards in interactome mapping projects, however, assumes that PPIs included in the database are indeed interacting under the experimental conditions of the study. RESULTS: Using raw data from 20 co-fractionation experiments and six published interactomes, we demonstrate that this assumption is often false, with up to 55% of purported gold standard interactions showing no evidence of interaction, on average. We identify a subset of CORUM database complexes that do show consistent evidence of interaction in co-fractionation studies, and we use this subset as gold standards to dramatically improve interactome mapping as judged by the number of predicted interactions at a given error rate. CONCLUSIONS: We recommend using this CORUM subset as the gold standard set in future co-fractionation studies. More generally, we recommend using the subset of literature-curated PPIs that are specific to the experimental context whenever possible.


Assuntos
Biologia Computacional/métodos , Bases de Dados de Proteínas , Mapeamento de Interação de Proteínas/métodos
16.
BMC Genomics ; 19(1): 45, 2018 01 15.
Artigo em Inglês | MEDLINE | ID: mdl-29334896

RESUMO

BACKGROUND: Among naturally occurring small molecules, tRNA-derived cyclodipeptides are a class that have attracted attention for their diverse and desirable biological activities. However, no tools are available to link cyclodipeptide synthases identified within prokaryotic genome sequences to their chemical products. Consequently, it is unclear how many genetically encoded cyclodipeptides represent novel products, and which producing organisms should be targeted for discovery. RESULTS: We developed a pipeline for identification and classification of cyclodipeptide biosynthetic gene clusters and prediction of aminoacyl-tRNA substrates and complete chemical structures. We leveraged this tool to conduct a global analysis of tRNA-derived cyclodipeptide biosynthesis in 93,107 prokaryotic genomes, and compared predicted cyclodipeptides to known cyclodipeptide synthase products and all known chemically characterized cyclodipeptides. By integrating predicted chemical structures and gene cluster architectures, we created a unified map of known and unknown genetically encoded cyclodipeptides. CONCLUSIONS: Our analysis suggests that sizeable regions of the chemical space encoded within sequenced prokaryotic genomes remain unexplored. Our map of the landscape of genetically encoded cyclodipeptides provides candidates for targeted discovery of novel compounds. The integration of our pipeline into a user-friendly web application provides a resource for further discovery of cyclodipeptides in newly sequenced prokaryotic genomes.


Assuntos
Bactérias/genética , Dipeptídeos/biossíntese , Peptídeos Cíclicos/biossíntese , RNA de Transferência/metabolismo , Algoritmos , Genômica , Fases de Leitura Aberta
17.
Nat Chem Biol ; 12(12): 1007-1014, 2016 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-27694801

RESUMO

Polyketides (PKs) and nonribosomal peptides (NRPs) are profoundly important natural products, forming the foundations of many therapeutic regimes. Decades of research have revealed over 11,000 PK and NRP structures, and genome sequencing is uncovering new PK and NRP gene clusters at an unprecedented rate. However, only ∼10% of PK and NRPs are currently associated with gene clusters, and it is unclear how many of these orphan gene clusters encode previously isolated molecules. Therefore, to efficiently guide the discovery of new molecules, we must first systematically de-orphan emergent gene clusters from genomes. Here we provide to our knowledge the first comprehensive retro-biosynthetic program, generalized retro-biosynthetic assembly prediction engine (GRAPE), for PK and NRP families and introduce a computational pipeline, global alignment for natural products cheminformatics (GARLIC), to uncover how observed biosynthetic gene clusters relate to known molecules, leading to the identification of gene clusters that encode new molecules.


Assuntos
Família Multigênica , Biossíntese de Peptídeos Independentes de Ácido Nucleico , Peptídeos/metabolismo , Policetídeos/metabolismo , Algoritmos , Família Multigênica/genética , Biossíntese de Peptídeos Independentes de Ácido Nucleico/genética , Peptídeos/química , Peptídeos/genética , Policetídeos/química
18.
Nat Chem Biol ; 12(4): 233-9, 2016 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-26829473

RESUMO

Antibiotics are essential for numerous medical procedures, including the treatment of bacterial infections, but their widespread use has led to the accumulation of resistance, prompting calls for the discovery of antibacterial agents with new targets. A majority of clinically approved antibacterial scaffolds are derived from microbial natural products, but these valuable molecules are not well annotated or organized, limiting the efficacy of modern informatic analyses. Here, we provide a comprehensive resource defining the targets, chemical origins and families of the natural antibacterial collective through a retrobiosynthetic algorithm. From this we also detail the directed mining of biosynthetic scaffolds and resistance determinants to reveal structures with a high likelihood of having previously unknown modes of action. Implementing this pipeline led to investigations of the telomycin family of natural products from Streptomyces canus, revealing that these bactericidal molecules possess a new antibacterial mode of action dependent on the bacterial phospholipid cardiolipin.


Assuntos
Antibacterianos/farmacologia , Produtos Biológicos/farmacologia , Cardiolipinas/biossíntese , Bactérias Gram-Positivas/efeitos dos fármacos , Peptídeos/farmacologia , Streptomyces/metabolismo , Antibacterianos/biossíntese , Antibacterianos/isolamento & purificação , Produtos Biológicos/isolamento & purificação , Vias Biossintéticas , Cardiolipinas/genética , Contagem de Colônia Microbiana , Bases de Dados Genéticas , Farmacorresistência Bacteriana/efeitos dos fármacos , Farmacorresistência Bacteriana/genética , Bactérias Gram-Positivas/genética , Bactérias Gram-Positivas/crescimento & desenvolvimento , Bactérias Gram-Positivas/metabolismo , Testes de Sensibilidade Microbiana , Família Multigênica , Peptídeos/genética , Peptídeos/isolamento & purificação , Navegador
19.
Med Educ ; 52(5): 536-545, 2018 05.
Artigo em Inglês | MEDLINE | ID: mdl-29532953

RESUMO

CONTEXT: MD/PhD programmes provide structured paths for physician-scientist training. However, considerable proportions of graduates of these programmes do not pursue careers in research consistent with their training. OBJECTIVES: We sought to identify factors associated with sustained involvement in research after completion of all postgraduate training. METHODS: Anonymised data from a national survey of Canadian MD/PhD programme graduates who had completed all physician-scientist training (n = 70) were analysed. Multivariable logistic regression was used to measure the associations between characteristics of graduates and five indicators of sustained research involvement following postgraduate training: (i) protected research time in the current appointment; (ii) percentage of time dedicated to research; (iii) planned future involvement in research; (iv) role as a principal investigator on a recent funded project, and (v) receipt of funding from a federal granting agency since graduation. RESULTS: The majority of graduates were significantly involved in research on the basis of at least one outcome. Completion of a research fellowship, number of first-authored or co-authored manuscripts published during MD/PhD training, and duration of MD/PhD training were positively associated with continued research involvement. Completion of a Masters degree prior to MD/PhD training, female gender, debt greater than CAD$50 000 at completion of training, and pursuit of a clinical specialty other than internal medicine, paediatrics, neurology, pathology and the surgical specialties were negatively associated with sustained research involvement. CONCLUSIONS: Most MD/PhD programme graduates remain significantly involved in research, but this involvement often does not correspond to traditional physician-scientist roles, in which a majority of time is dedicated to research. To minimise loss of investment in physician-scientist training, MD/PhD programmes should prioritise research productivity during training and the pursuit of additional research training during residency, and policymakers should establish stable sources of funding to reduce debt among graduates. Our data suggest further study is warranted to identify interventions to reduce attrition among female MD/PhD programme graduates.


Assuntos
Pesquisa Biomédica/educação , Escolha da Profissão , Educação de Pós-Graduação em Medicina , Internato e Residência/estatística & dados numéricos , Médicos/estatística & dados numéricos , Apoio ao Desenvolvimento de Recursos Humanos/estatística & dados numéricos , Canadá , Feminino , Humanos , Masculino
20.
BMC Bioinformatics ; 18(1): 457, 2017 Oct 23.
Artigo em Inglês | MEDLINE | ID: mdl-29061110

RESUMO

BACKGROUND: An organism's protein interactome, or complete network of protein-protein interactions, defines the protein complexes that drive cellular processes. Techniques for studying protein complexes have traditionally applied targeted strategies such as yeast two-hybrid or affinity purification-mass spectrometry to assess protein interactions. However, given the vast number of protein complexes, more scalable methods are necessary to accelerate interaction discovery and to construct whole interactomes. We recently developed a complementary technique based on the use of protein correlation profiling (PCP) and stable isotope labeling in amino acids in cell culture (SILAC) to assess chromatographic co-elution as evidence of interacting proteins. Importantly, PCP-SILAC is also capable of measuring protein interactions simultaneously under multiple biological conditions, allowing the detection of treatment-specific changes to an interactome. Given the uniqueness and high dimensionality of co-elution data, new tools are needed to compare protein elution profiles, control false discovery rates, and construct an accurate interactome. RESULTS: Here we describe a freely available bioinformatics pipeline, PrInCE, for the analysis of co-elution data. PrInCE is a modular, open-source library that is computationally inexpensive, able to use label and label-free data, and capable of detecting tens of thousands of protein-protein interactions. Using a machine learning approach, PrInCE offers greatly reduced run time, more predicted interactions at the same stringency, prediction of protein complexes, and greater ease of use over previous bioinformatics tools for co-elution data. PrInCE is implemented in Matlab (version R2017a). Source code and standalone executable programs for Windows and Mac OSX are available at https://github.com/fosterlab/PrInCE , where usage instructions can be found. An example dataset and output are also provided for testing purposes. CONCLUSIONS: PrInCE is the first fast and easy-to-use data analysis pipeline that predicts interactomes and protein complexes from co-elution data. PrInCE allows researchers without bioinformatics expertise to analyze high-throughput co-elution datasets.


Assuntos
Algoritmos , Mapeamento de Interação de Proteínas/métodos , Bases de Dados de Proteínas , Humanos , Anotação de Sequência Molecular , Proteômica , Reprodutibilidade dos Testes , Saccharomyces cerevisiae/metabolismo
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA