RESUMO
Enhancers bind transcription factors, chromatin regulators, and non-coding transcripts to modulate the expression of target genes. Here, we report 3D genome structures of single mouse ES cells as they are induced to exit pluripotency and transition through a formative stage prior to undergoing neuroectodermal differentiation. We find that there is a remarkable reorganization of 3D genome structure where inter-chromosomal intermingling increases dramatically in the formative state. This intermingling is associated with the formation of a large number of multiway hubs that bring together enhancers and promoters with similar chromatin states from typically 5-8 distant chromosomal sites that are often separated by many Mb from each other. In the formative state, genes important for pluripotency exit establish contacts with emerging enhancers within these multiway hubs, suggesting that the structural changes we have observed may play an important role in modulating transcription and establishing new cell identities.
Assuntos
Células-Tronco Embrionárias Murinas , Sequências Reguladoras de Ácido Nucleico , Camundongos , Animais , Células-Tronco Embrionárias Murinas/metabolismo , Células-Tronco Embrionárias/metabolismo , Fatores de Transcrição/genética , Fatores de Transcrição/metabolismo , Cromatina/genética , Cromatina/metabolismo , Elementos Facilitadores GenéticosRESUMO
General approaches for designing sequence-specific peptide-binding proteins would have wide utility in proteomics and synthetic biology. However, designing peptide-binding proteins is challenging, as most peptides do not have defined structures in isolation, and hydrogen bonds must be made to the buried polar groups in the peptide backbone1-3. Here, inspired by natural and re-engineered protein-peptide systems4-11, we set out to design proteins made out of repeating units that bind peptides with repeating sequences, with a one-to-one correspondence between the repeat units of the protein and those of the peptide. We use geometric hashing to identify protein backbones and peptide-docking arrangements that are compatible with bidentate hydrogen bonds between the side chains of the protein and the peptide backbone12. The remainder of the protein sequence is then optimized for folding and peptide binding. We design repeat proteins to bind to six different tripeptide-repeat sequences in polyproline II conformations. The proteins are hyperstable and bind to four to six tandem repeats of their tripeptide targets with nanomolar to picomolar affinities in vitro and in living cells. Crystal structures reveal repeating interactions between protein and peptide interactions as designed, including ladders of hydrogen bonds from protein side chains to peptide backbones. By redesigning the binding interfaces of individual repeat units, specificity can be achieved for non-repeating peptide sequences and for disordered regions of native proteins.
Assuntos
Peptídeos , Engenharia de Proteínas , Proteínas , Sequência de Aminoácidos , Modelos Moleculares , Peptídeos/química , Peptídeos/metabolismo , Proteínas/química , Proteínas/metabolismo , Engenharia de Proteínas/métodos , Ligação de Hidrogênio , Ligação Proteica , Dobramento de Proteína , Conformação ProteicaRESUMO
During morphogenesis, large-scale changes of tissue primordia are coordinated across an embryo. In Drosophila, several tissue primordia and embryonic regions are bordered or encircled by supracellular actomyosin cables, junctional actomyosin enrichments networked between many neighbouring cells. We show that the single Drosophila Alp/Enigma-family protein Zasp52, which is most prominently found in Z-discs of muscles, is a component of many supracellular actomyosin structures during embryogenesis, including the ventral midline and the boundary of the salivary gland placode. We reveal that Zasp52 contains within its central coiled-coil region a type of actin-binding motif usually found in CapZbeta proteins, and this domain displays actin-binding activity. Using endogenously-tagged lines, we identify that Zasp52 interacts with junctional components, including APC2, Polychaetoid and Sidekick, and actomyosin regulators. Analysis of zasp52 mutant embryos reveals that the severity of the embryonic defects observed scales inversely with the amount of functional protein left. Large tissue deformations occur where actomyosin cables are found during embryogenesis, and in vivo and in silico analyses suggest a model whereby supracellular Zasp52-containing cables aid to insulate morphogenetic changes from one another.
Assuntos
Actomiosina , Proteínas de Drosophila , Animais , Actomiosina/metabolismo , Actinas/metabolismo , Drosophila melanogaster/metabolismo , Proteínas de Drosophila/metabolismo , Drosophila/metabolismo , Sarcômeros/metabolismo , Morfogênese/genéticaRESUMO
The human genome encodes approximately 20,000 proteins, many still uncharacterised. It has become clear that scientific research tends to focus on well-studied proteins, leading to a concern that poorly understood genes are unjustifiably neglected. To address this, we have developed a publicly available and customisable "Unknome database" that ranks proteins based on how little is known about them. We applied RNA interference (RNAi) in Drosophila to 260 unknown genes that are conserved between flies and humans. Knockdown of some genes resulted in loss of viability, and functional screening of the rest revealed hits for fertility, development, locomotion, protein quality control, and resilience to stress. CRISPR/Cas9 gene disruption validated a component of Notch signalling and 2 genes contributing to male fertility. Our work illustrates the importance of poorly understood genes, provides a resource to accelerate future research, and highlights a need to support database curation to ensure that misannotation does not erode our awareness of our own ignorance.
Assuntos
Drosophila , Fertilidade , Animais , Masculino , Humanos , Drosophila/genética , Interferência de RNA , Fertilidade/genéticaRESUMO
The various membranes of eukaryotic cells differ in composition, but it is at present unclear if this results in differences in physical properties. The sequences of transmembrane domains (TMDs) of integral membrane proteins should reflect the physical properties of the bilayers in which they reside. We used large datasets from both fungi and vertebrates to perform a comprehensive comparison of the TMDs of proteins from different organelles. We find that TMDs are not generic but have organelle-specific properties with a dichotomy in TMD length between the early and late parts of the secretory pathway. In addition, TMDs from post-ER organelles show striking asymmetries in amino acid compositions across the bilayer that is linked to residue size and varies between organelles. The pervasive presence of organelle-specific features among the TMDs of a particular organelle has implications for TMD prediction, regulation of protein activity by location, and sorting of proteins and lipids in the secretory pathway.
Assuntos
Proteínas de Membrana/química , Estrutura Terciária de Proteína , Homologia Estrutural de Proteína , Animais , Membrana Celular/metabolismo , Complexo de Golgi/metabolismo , Humanos , Interações Hidrofóbicas e Hidrofílicas , Proteínas de Membrana/metabolismo , Organelas/metabolismoRESUMO
The folding of genomic DNA from the beads-on-a-string-like structure of nucleosomes into higher-order assemblies is crucially linked to nuclear processes. Here we calculate 3D structures of entire mammalian genomes using data from a new chromosome conformation capture procedure that allows us to first image and then process single cells. The technique enables genome folding to be examined at a scale of less than 100 kb, and chromosome structures to be validated. The structures of individual topological-associated domains and loops vary substantially from cell to cell. By contrast, A and B compartments, lamina-associated domains and active enhancers and promoters are organized in a consistent way on a genome-wide basis in every cell, suggesting that they could drive chromosome and genome folding. By studying genes regulated by pluripotency factor and nucleosome remodelling deacetylase (NuRD), we illustrate how the determination of single-cell genome structure provides a new approach for investigating biological processes.
Assuntos
Montagem e Desmontagem da Cromatina , Genoma , Imagem Molecular/métodos , Nucleossomos/química , Análise de Célula Única/métodos , Animais , Fator de Ligação a CCCTC , Proteínas de Ciclo Celular/metabolismo , Montagem e Desmontagem da Cromatina/genética , Proteínas Cromossômicas não Histona/metabolismo , Cromossomos de Mamíferos/química , Cromossomos de Mamíferos/genética , Cromossomos de Mamíferos/metabolismo , DNA/química , DNA/genética , DNA/metabolismo , Elementos Facilitadores Genéticos , Fase G1 , Regulação da Expressão Gênica , Redes Reguladoras de Genes , Genoma/genética , Haploidia , Complexo Mi-2 de Remodelação de Nucleossomo e Desacetilase/metabolismo , Camundongos , Modelos Moleculares , Conformação Molecular , Imagem Molecular/normas , Células-Tronco Embrionárias Murinas/citologia , Células-Tronco Embrionárias Murinas/metabolismo , Nucleossomos/genética , Nucleossomos/metabolismo , Regiões Promotoras Genéticas , Proteínas Repressoras/metabolismo , Reprodutibilidade dos Testes , Análise de Célula Única/normas , CoesinasRESUMO
Protein aggregation is a hallmark of diverse neurodegenerative diseases. Multiple lines of evidence have revealed that protein aggregates can penetrate inside cells and spread like prions. How such aggregates enter cells remains elusive. Through a focused siRNA screen targeting genes involved in membrane trafficking, we discovered that mutant SOD1 aggregates, like viruses, exploit cofilin-1 to remodel cortical actin and enter cells. Upstream of cofilin-1, signalling from the RHO GTPase and the ROCK1 and LIMK1 kinases controls cofilin-1 activity to remodel actin and modulate aggregate entry. In the spinal cord of symptomatic SOD1G93A transgenic mice, cofilin-1 phosphorylation is increased and actin dynamics altered. Importantly, the RHO to cofilin-1 signalling pathway also modulates entry of tau and α-synuclein aggregates. Our results identify a common host cell signalling pathway that diverse protein aggregates exploit to remodel actin and enter cells.
Assuntos
Cofilina 1/metabolismo , Agregados Proteicos , Proteínas rho de Ligação ao GTP/metabolismo , Actinas/metabolismo , Animais , Linhagem Celular , Quinases Lim/metabolismo , Camundongos Transgênicos , RNA Interferente Pequeno/genética , Proteínas Recombinantes/metabolismo , Transdução de Sinais , Medula Espinal/metabolismo , Superóxido Dismutase-1/genética , Superóxido Dismutase-1/metabolismo , alfa-Sinucleína/genética , alfa-Sinucleína/metabolismo , Quinases Associadas a rho/metabolismo , Proteínas tau/genética , Proteínas tau/metabolismoRESUMO
The order of enzymatic activity across Golgi cisternae is essential for complex molecule biosynthesis. However, an inability to separate Golgi cisternae has meant that the cisternal distribution of most resident proteins, and their underlying localization mechanisms, are unknown. Here, we exploit differences in surface charge of intact cisternae to perform separation of early to late Golgi subcompartments. We determine protein and glycan abundance profiles across the Golgi; over 390 resident proteins are identified, including 136 new additions, with over 180 cisternal assignments. These assignments provide a means to better understand the functional roles of Golgi proteins and how they operate sequentially. Protein and glycan distributions are validated in vivo using high-resolution microscopy. Results reveal distinct functional compartmentalization among resident Golgi proteins. Analysis of transmembrane proteins shows several sequence-based characteristics relating to pI, hydrophobicity, Ser abundance, and Phe bilayer asymmetry that change across the Golgi. Overall, our results suggest that a continuum of transmembrane features, rather than discrete rules, guide proteins to earlier or later locations within the Golgi stack.
Assuntos
Complexo de Golgi/metabolismo , Proteínas de Plantas/química , Proteínas de Plantas/metabolismo , Complexo de Golgi/ultraestrutura , Interações Hidrofóbicas e Hidrofílicas , Membranas Intracelulares , Proteínas de Membrana/química , Proteínas de Membrana/metabolismo , Polissacarídeos/química , Polissacarídeos/metabolismo , ProteomaRESUMO
Sex promotes the recombination and reassortment of genetic material and is prevalent across eukaryotes, although our knowledge of the molecular details of sexual inheritance is scant in several major lineages. In social amoebae, sex involves a promiscuous mixing of cytoplasm before zygotes consume the majority of cells, but for technical reasons, sexual progeny have been difficult to obtain and study. We report here genome-wide characterization of meiotic progeny in Dictyostelium discoideum We find that recombination occurs at high frequency in pairwise crosses between all three mating types, despite the absence of the Spo11 enzyme that is normally required to initiate crossover formation. Fusions of more than two gametes to form transient syncytia lead to frequent triparental inheritance, with haploid meiotic progeny bearing recombined nuclear haplotypes from two parents and the mitochondrial genome from a third. Cells that do not contribute genetically to the Dictyostelium zygote nucleus thereby have a stake in the next haploid generation. D. discoideum mitochondrial genomes are polymorphic, and our findings raise the possibility that some of this variation might be a result of sexual selection on genes that can promote the spread of individual organelle genomes during sex. This kind of self-interested mitochondrial behavior may have had important consequences during eukaryogenesis and the initial evolution of sex.
RESUMO
Cyanobacteria are complex prokaryotes, incorporating a Gram-negative cell wall and internal thylakoid membranes (TMs). However, localization of proteins within cyanobacterial cells is poorly understood. Using subcellular fractionation and quantitative proteomics, we produced an extensive subcellular proteome map of an entire cyanobacterial cell, identifying â¼67% of proteins in Synechocystis sp. PCC 6803, â¼1000 more than previous studies. Assigned to six specific subcellular regions were 1,712 proteins. Proteins involved in energy conversion localized to TMs. The majority of transporters, with the exception of a TM-localized copper importer, resided in the plasma membrane (PM). Most metabolic enzymes were soluble, although numerous pathways terminated in the TM (notably those involved in peptidoglycan monomer, NADP+, heme, lipid, and carotenoid biosynthesis) or PM (specifically, those catalyzing lipopolysaccharide, molybdopterin, FAD, and phylloquinol biosynthesis). We also identified the proteins involved in the TM and PM electron transport chains. The majority of ribosomal proteins and enzymes synthesizing the storage compound polyhydroxybuyrate formed distinct clusters within the data, suggesting similar subcellular distributions to one another, as expected for proteins operating within multicomponent structures. Moreover, heterogeneity within membrane regions was observed, indicating further cellular complexity. Cyanobacterial TM protein localization was conserved in Arabidopsis (Arabidopsis thaliana) chloroplasts, suggesting similar proteome organization in more developed photosynthetic organisms. Successful application of this technique in Synechocystis suggests it could be applied to mapping the proteomes of other cyanobacteria and single-celled organisms. The organization of the cyanobacterial cell revealed here substantially aids our understanding of these environmentally and biotechnologically important organisms.
Assuntos
Compartimento Celular , Proteoma/metabolismo , Proteômica , Synechocystis/citologia , Synechocystis/metabolismo , Arabidopsis/metabolismo , Proteínas de Bactérias/metabolismo , Fracionamento Celular , Membrana Celular/metabolismo , Parede Celular/metabolismo , Cloroplastos/metabolismo , Cloroplastos/ultraestrutura , Redes e Vias Metabólicas , Análise de Componente Principal , Subunidades Ribossômicas/metabolismo , Synechocystis/ultraestruturaRESUMO
Large-scale chromosome structure and spatial nuclear arrangement have been linked to control of gene expression and DNA replication and repair. Genomic techniques based on chromosome conformation capture (3C) assess contacts for millions of loci simultaneously, but do so by averaging chromosome conformations from millions of nuclei. Here we introduce single-cell Hi-C, combined with genome-wide statistical analysis and structural modelling of single-copy X chromosomes, to show that individual chromosomes maintain domain organization at the megabase scale, but show variable cell-to-cell chromosome structures at larger scales. Despite this structural stochasticity, localization of active gene domains to boundaries of chromosome territories is a hallmark of chromosomal conformation. Single-cell Hi-C data bridge current gaps between genomics and microscopy studies of chromosomes, demonstrating how modular organization underlies dynamic chromosome structure, and how this structure is probabilistically linked with genome activity patterns.
Assuntos
Cromossomos/química , Técnicas Genéticas , Modelos Moleculares , Animais , Núcleo Celular/genética , Cromatina/química , Cromossomos/genética , Masculino , Camundongos , Conformação Molecular , Análise de Célula Única , Cromossomo X/química , Cromossomo X/genéticaRESUMO
Measuring changes in protein or organelle abundance in the cell is an essential, but challenging aspect of cell biology. Frequently-used methods for determining organelle abundance typically rely on detection of a very few marker proteins, so are unsatisfactory. In silico estimates of protein abundances from publicly available protein spectra can provide useful standard abundance values but contain only data from tissue proteomes, and are not coupled to organelle localization data. A new protein abundance score, the normalized protein abundance scale (NPAS), expands on the number of scored proteins and the scoring accuracy of lower-abundance proteins in Arabidopsis. NPAS was combined with subcellular protein localization data, facilitating quantitative estimations of organelle abundance during routine experimental procedures. A suite of targeted proteomics markers for subcellular compartment markers was developed, enabling independent verification of in silico estimates for relative organelle abundance. Estimation of relative organelle abundance was found to be reproducible and consistent over a range of tissues and growth conditions. In silico abundance estimations and localization data have been combined into an online tool, multiple marker abundance profiling, available in the SUBA4 toolbox (http://suba.live).
Assuntos
Proteínas de Arabidopsis/metabolismo , Arabidopsis/metabolismo , Proteoma , Proteômica , Biomarcadores/metabolismo , Organelas/metabolismo , Transporte ProteicoRESUMO
Cyanobacteria are intricately organized, incorporating an array of internal thylakoid membranes, the site of photosynthesis, into cells no larger than other bacteria. They also synthesize C15-C19 alkanes and alkenes, which results in substantial production of hydrocarbons in the environment. All sequenced cyanobacteria encode hydrocarbon biosynthesis pathways, suggesting an important, undefined physiological role for these compounds. Here, we demonstrate that hydrocarbon-deficient mutants of Synechococcus sp. PCC 7002 and Synechocystis sp. PCC 6803 exhibit significant phenotypic differences from wild type, including enlarged cell size, reduced growth, and increased division defects. Photosynthetic rates were similar between strains, although a minor reduction in energy transfer between the soluble light harvesting phycobilisome complex and membrane-bound photosystems was observed. Hydrocarbons were shown to accumulate in thylakoid and cytoplasmic membranes. Modeling of membranes suggests these compounds aggregate in the center of the lipid bilayer, potentially promoting membrane flexibility and facilitating curvature. In vivo measurements confirmed that Synechococcus sp. PCC 7002 mutants lacking hydrocarbons exhibit reduced thylakoid membrane curvature compared to wild type. We propose that hydrocarbons may have a role in inducing the flexibility in membranes required for optimal cell division, size, and growth, and efficient association of soluble and membrane bound proteins. The recent identification of C15-C17 alkanes and alkenes in microalgal species suggests hydrocarbons may serve a similar function in a broad range of photosynthetic organisms.
Assuntos
Divisão Celular/efeitos dos fármacos , Hidrocarbonetos/farmacologia , Synechocystis/citologia , Synechocystis/crescimento & desenvolvimento , Vias Biossintéticas/efeitos dos fármacos , Proliferação de Células/efeitos dos fármacos , Membranas Intracelulares/efeitos dos fármacos , Membranas Intracelulares/metabolismo , Bicamadas Lipídicas/metabolismo , Mutação/genética , Fotossíntese/efeitos dos fármacos , Synechocystis/efeitos dos fármacos , Synechocystis/metabolismo , Tilacoides/efeitos dos fármacos , Tilacoides/metabolismoRESUMO
CcpNmr Analysis provides a streamlined pipeline for both NMR chemical shift assignment and structure determination of biological macromolecules. In addition, it encompasses tools to analyse the many additional experiments that make NMR such a pivotal technique for research into complex biological questions. This report describes how CcpNmr Analysis can seamlessly link together all of the tasks in the NMR structure-determination process. It details each of the stages from generating NMR restraints [distance, dihedral, hydrogen bonds and residual dipolar couplings (RDCs)], exporting these to and subsequently re-importing them from structure-calculation software (such as the programs CYANA or ARIA) and analysing and validating the results obtained from the structure calculation to, ultimately, the streamlined deposition of the completed assignments and the refined ensemble of structures into the PDBe repository. Until recently, such solution-structure determination by NMR has been quite a laborious task, requiring multiple stages and programs. However, with the new enhancements to CcpNmr Analysis described here, this process is now much more intuitive and efficient and less error-prone.
Assuntos
Ressonância Magnética Nuclear Biomolecular/métodos , Ligação de Hidrogênio , Estrutura MolecularRESUMO
The Golgi-localized golgins golgin-97 and golgin-245 capture transport vesicles arriving from endosomes via the protein TBC1D23. The amino-terminal domain of TBC1D23 binds to the golgins, and the carboxyl-terminal domain of TBC1D23 captures the vesicles, but how it recognizes specific vesicles was unclear. A search for binding partners of the carboxyl-terminal domain unexpectedly revealed direct binding to carboxypeptidase D and syntaxin-16, known cargo proteins of the captured vesicles. Binding is via a threonine-leucine-tyrosine (TLY) sequence present in both proteins next to an acidic cluster. A crystal structure reveals how this acidic TLY motif binds to TBC1D23. An acidic TLY motif is also present in the tails of other endosome-to-Golgi cargo, and these also bind TBC1D23. Structure-guided mutations in the carboxyl-terminal domain that disrupt motif binding in vitro also block vesicle capture in vivo. Thus, TBC1D23 attached to golgin-97 and golgin-245 captures vesicles by a previously undescribed mechanism: the recognition of a motif shared by cargo proteins carried by the vesicle.
Assuntos
Complexo de Golgi , Proteínas de Membrana , Proteínas da Matriz do Complexo de Golgi/metabolismo , Proteínas de Membrana/metabolismo , Complexo de Golgi/metabolismo , Transporte Biológico , Endossomos/metabolismo , Ligação ProteicaRESUMO
The Golgi apparatus is the central organelle in the secretory pathway and plays key roles in glycosylation, protein sorting, and secretion in plants. Enzymes involved in the biosynthesis of complex polysaccharides, glycoproteins, and glycolipids are located in this organelle, but the majority of them remain uncharacterized. Here, we studied the Arabidopsis (Arabidopsis thaliana) membrane proteome with a focus on the Golgi apparatus using localization of organelle proteins by isotope tagging. By applying multivariate data analysis to a combined data set of two new and two previously published localization of organelle proteins by isotope tagging experiments, we identified the subcellular localization of 1,110 proteins with high confidence. These include 197 Golgi apparatus proteins, 79 of which have not been localized previously by a high-confidence method, as well as the localization of 304 endoplasmic reticulum and 208 plasma membrane proteins. Comparison of the hydrophobic domains of the localized proteins showed that the single-span transmembrane domains have unique properties in each organelle. Many of the novel Golgi-localized proteins belong to uncharacterized protein families. Structure-based homology analysis identified 12 putative Golgi glycosyltransferase (GT) families that have no functionally characterized members and, therefore, are not yet assigned to a Carbohydrate-Active Enzymes database GT family. The substantial numbers of these putative GTs lead us to estimate that the true number of plant Golgi GTs might be one-third above those currently annotated. Other newly identified proteins are likely to be involved in the transport and interconversion of nucleotide sugar substrates as well as polysaccharide and protein modification.
Assuntos
Arabidopsis/enzimologia , Glicosiltransferases/metabolismo , Complexo de Golgi/enzimologia , Proteínas de Membrana/metabolismo , Proteoma/análise , Proteínas de Arabidopsis/metabolismo , Membrana Celular/metabolismo , Parede Celular/metabolismo , Bases de Dados de Proteínas , Retículo Endoplasmático/metabolismo , Interações Hidrofóbicas e Hidrofílicas , Marcação por Isótopo/métodos , Análise Multivariada , Análise de Componente Principal , Estrutura Terciária de Proteína , Proteoma/metabolismo , Proteômica/métodosRESUMO
Detailed understanding of the 3D structure of chromatin is a key ingredient to investigate a variety of processes inside the cell. Since direct methods to experimentally ascertain these structures lack the desired spatial fidelity, computational inference methods based on single cell Hi-C data have gained significant interest. Here, we develop a progressive simulation protocol to iteratively improve the resolution of predicted interphase structures by maximum-likelihood association of ambiguous Hi-C contacts using lower-resolution predictions. Compared to state-of-the-art methods, our procedure is not limited to haploid cell data and allows us to reach a resolution of up to 5,000 base pairs per bead. High resolution chromatin models grant access to a multitude of structural phenomena. Exemplarily, we verify the formation of chromosome territories and holes near aggregated chromocenters as well as the inversion of the CpG content for rod photoreceptor cells.
RESUMO
We present a suite of programs, named CING for Common Interface for NMR Structure Generation that provides for a residue-based, integrated validation of the structural NMR ensemble in conjunction with the experimental restraints and other input data. External validation programs and new internal validation routines compare the NMR-derived models with empirical data, measured chemical shifts, distance- and dihedral restraints and the results are visualized in a dynamic Web 2.0 report. A red-orange-green score is used for residues and restraints to direct the user to those critiques that warrant further investigation. Overall green scores below ~20 % accompanied by red scores over ~50 % are strongly indicative of poorly modelled structures. The publically accessible, secure iCing webserver ( https://nmr.le.ac.uk ) allows individual users to upload the NMR data and run a CING validation analysis.
Assuntos
Ressonância Magnética Nuclear Biomolecular/métodos , Proteínas/química , Software , Modelos Moleculares , Conformação Proteica , Reprodutibilidade dos Testes , Interface Usuário-ComputadorRESUMO
SUMMARY: We present here the freely available Metabolomics Project resource specifically designed to work under the CcpNmr Analysis program produced by CCPN (Collaborative Computing Project for NMR) (Vranken et al., 2005, The CCPN data model for NMR spectroscopy: development of a software pipeline. Proteins, 59, 687-696). The project consists of a database of assigned 1D and 2D spectra of many common metabolites. The project aims to help the user to analyze and assign 1D and 2D NMR spectra of unknown metabolite mixtures. Spectra of unknown mixtures can be easily superimposed and compared with the database spectra, thus facilitating their assignment and identification. AVAILABILITY: The CCPN Metabolomics Project, together with an annotated example dataset, is freely available via: http://www.ccpn.ac.uk/metabolomics/.
Assuntos
Espectroscopia de Ressonância Magnética/métodos , Metabolômica/métodos , Software , Biologia Computacional/métodos , Bases de Dados FactuaisRESUMO
Solid-state magic-angle-spinning (MAS) NMR of proteins has undergone many rapid methodological developments in recent years, enabling detailed studies of protein structure, function and dynamics. Software development, however, has not kept pace with these advances and data analysis is mostly performed using tools developed for solution NMR which do not directly address solid-state specific issues. Here we present additions to the CcpNmr Analysis software package which enable easier identification of spinning side bands, straightforward analysis of double quantum spectra, automatic consideration of non-uniform labelling schemes, as well as extension of other existing features to the needs of solid-state MAS data. To underpin this, we have updated and extended the CCPN data model and experiment descriptions to include transfer types and nomenclature appropriate for solid-state NMR experiments, as well as a set of experiment prototypes covering the experiments commonly employed by solid-sate MAS protein NMR spectroscopists. This work not only improves solid-state MAS NMR data analysis but provides a platform for anyone who uses the CCPN data model for programming, data transfer, or data archival involving solid-state MAS NMR data.