ABSTRACT
In late 2019, a virus subsequently named severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) emerged in China and led to a worldwide pandemic of the disease termed coronavirus disease 2019. The global health threat posed by this pandemic led to an extremely rapid and robust mobilization of the scientific and medical communities as evidenced by the publication of more than 10,000 peer-reviewed articles and thousands of preprints in the first year of the pandemic alone. With the publication of the initial genome sequence of SARS-CoV-2, the proteomics community immediately joined this effort publishing, to date, more than 100 peer-reviewed proteomics studies and submitting many more preprints to preprint servers. In this review, we focus on peer-reviewed articles published on the proteome, glycoproteome, and glycome of SARS-CoV-2. At a basic level, proteomic studies provide valuable information on quantitative aspects of viral infection course; information on the identities, sites, and microheterogeneity of post-translational modifications; and, information on protein-protein interactions. At a biological systems level, these studies elucidate host cell and tissue responses, characterize antibodies and other immune system factors in infection, suggest biomarkers that may be useful for diagnosis and disease-course monitoring, and help in the development or repurposing of potential therapeutics. Here, we summarize results from selected early studies to provide a perspective on the current rapidly evolving literature.
Subjects
COVID-19/metabolism , Proteomics/methods , SARS-CoV-2/pathogenicity , Viral Proteins/metabolism , Angiotensin-Converting Enzyme 2/metabolism , Biomarkers/analysis , Biomarkers/metabolism , COVID-19/diagnosis , COVID-19/virology , Host-Pathogen Interactions/physiology , Humans , Protein Processing, Post-Translational , SARS-CoV-2/metabolism , Virus Internalization
ABSTRACT
The O-GlcNAc transferase (OGT) modifies nuclear and cytoplasmic proteins with β-N-acetylglucosamine (O-GlcNAc). With thousands of O-GlcNAc-modified proteins but only one OGT encoded in the mammalian genome, a prevailing question is how OGT selects its substrates. Prior work has indicated that the tetratricopeptide repeat (TPR) domain of OGT is involved in substrate selection. Furthermore, several variants of OGT causal for X-linked intellectual disability (XLID) occur in the TPR domain. Therefore, we adapted the BioID labeling method to identify interactors of a TPR-BirA* fusion protein in HeLa cells. We identified 115 interactors representing known and novel O-GlcNAc-modified proteins and OGT interactors (raw data deposited in MassIVE, Dataset ID MSV000085626). The interactors are enriched in known OGT processes (e.g., chromatin remodeling) as well as processes in which OGT has yet to be implicated (e.g., pre-mRNA processing). Importantly, the identified TPR interactors are linked to several disease states but most notably are enriched in pathologies featuring intellectual disability that may underlie the mechanism by which mutations in OGT lead to XLID. This interactome for the TPR domain of OGT serves as a jumping-off point for future research exploring the role of OGT, the TPR domain, and its protein interactors in multiple cellular processes and disease mechanisms, including intellectual disability.
Subjects
Intellectual Disability , Tetratricopeptide Repeat , Animals , HeLa Cells , Humans , Intellectual Disability/genetics , N-Acetylglucosaminyltransferases/genetics
ABSTRACT
Heme is an iron-containing cofactor essential for multiple cellular processes and fundamental activities such as oxygen transport. To better understand the means by which heme synthesis is regulated during erythropoiesis, affinity purification coupled with mass spectrometry (MS) was performed to identify putative protein partners interacting with ferrochelatase (FECH), the terminal enzyme in the heme biosynthetic pathway. Both progesterone receptor membrane component 1 (PGRMC1) and progesterone receptor membrane component 2 (PGRMC2) were identified in these experiments. These interactions were validated by reciprocal affinity purification followed by MS analysis and immunoblotting. The interaction between PGRMC1 and FECH was confirmed in vitro and in HEK 293T cells, a non-erythroid cell line. When cells that are recognized models for erythroid differentiation were treated with a small molecule inhibitor of PGRMC1, AG-205, there was an observed decrease in the level of hemoglobinization relative to that of untreated cells. In vitro heme transfer experiments showed that purified PGRMC1 was able to donate heme to apo-cytochrome b5. In the presence of PGRMC1, in vitro measured FECH activity decreased in a dose-dependent manner. Interactions between FECH and PGRMC1 were strongest for the conformation of FECH associated with product release, suggesting that PGRMC1 may regulate FECH activity by controlling heme release. Overall, the data illustrate a role for PGRMC1 in regulating heme synthesis via interactions with FECH and suggest that PGRMC1 may be a heme chaperone or sensor.
Subjects
Ferrochelatase/metabolism , Membrane Proteins/physiology , Receptors, Progesterone/physiology , Animals , Cell Line , Humans , Mice
ABSTRACT
Metal ion cofactors afford proteins virtually unlimited catalytic potential, enable electron transfer reactions and have a great impact on protein stability. Consequently, metalloproteins have key roles in most biological processes, including respiration (iron and copper), photosynthesis (manganese) and drug metabolism (iron). Yet, predicting from genome sequence the numbers and types of metal an organism assimilates from its environment or uses in its metalloproteome is currently impossible because metal coordination sites are diverse and poorly recognized. We present here a robust, metal-based approach to determine all metals an organism assimilates and identify its metalloproteins on a genome-wide scale. This shifts the focus from classical protein-based purification to metal-based identification and purification by liquid chromatography, high-throughput tandem mass spectrometry (HT-MS/MS) and inductively coupled plasma mass spectrometry (ICP-MS) to characterize cytoplasmic metalloproteins from an exemplary microorganism (Pyrococcus furiosus). Of 343 metal peaks in chromatography fractions, 158 did not match any predicted metalloprotein. Unassigned peaks included metals known to be used (cobalt, iron, nickel, tungsten and zinc; 83 peaks) plus metals the organism was not thought to assimilate (lead, manganese, molybdenum, uranium and vanadium; 75 peaks). Purification of eight of 158 unexpected metal peaks yielded four novel nickel- and molybdenum-containing proteins, whereas four purified proteins contained sub-stoichiometric amounts of misincorporated lead and uranium. Analyses of two additional microorganisms (Escherichia coli and Sulfolobus solfataricus) revealed species-specific assimilation of yet more unexpected metals. Metalloproteomes are therefore much more extensive and diverse than previously recognized, and promise to provide key insights for cell biology, microbial growth and toxicity mechanisms.
Subjects
Bacterial Proteins/analysis , Metalloproteins/analysis , Metalloproteins/chemistry , Metals/analysis , Proteome/analysis , Pyrococcus furiosus/chemistry , Bacterial Proteins/chemistry , Chromatography, Liquid , Escherichia coli/chemistry , Metals/chemistry , Metals/metabolism , Proteome/chemistry , Proteomics , Pyrococcus furiosus/metabolism , Sulfolobus solfataricus/chemistry , Tandem Mass Spectrometry
ABSTRACT
The mammalian O-mannosylation pathway for protein post-translational modification is intricately involved in modulating cell-matrix interactions in the musculature and nervous system. Defects in enzymes of this biosynthetic pathway are causative for multiple forms of congenital muscular dystrophy. The application of advanced genetic and biochemical technologies has resulted in remarkable progress in this field over the past few years, culminating with the publication of three landmark papers in 2013 alone. In this review, we will highlight recent progress focusing on the dramatic expansion of the set of genes known to be involved in O-mannosylation and disease processes, the concurrent acceleration of the rate of O-mannosylation pathway protein functional assignments, the tremendous increase in the number of proteins now known to be modified by O-mannosylation, and the recent progress in protein O-mannose glycan quantification and site assignment. Also, we attempt to highlight key outstanding questions raised by this abundance of new information.
Subjects
Mannose/metabolism , Mannosyltransferases/metabolism , Polysaccharides/metabolism , Protein Processing, Post-Translational/physiology , Animals , Carbohydrate Conformation , Glycosylation , Humans , Mannose/chemistry , Mannosyltransferases/chemistry , Polysaccharides/chemistry
ABSTRACT
In this paper, we extend our framework for constructing low-dimensional dynamical system models of large-scale neuronal networks of mammalian primary visual cortex. Our dimensional reduction procedure consists of performing a suitable linear change of variables and then systematically truncating the new set of equations. The extended framework includes modeling the effect of neglected modes as a stochastic process. By parametrizing and including stochasticity in one of two ways we show that we can improve the systems-level characterization of our dimensionally reduced neuronal network model. We examined orientation selectivity maps calculated from the firing rate distribution of large-scale simulations and stochastic dimensionally reduced models and found that by using stochastic processes to model the neglected modes, we were able to better reproduce the mean and variance of firing rates in the original large-scale simulations while still accurately predicting the orientation preference distribution.
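As a rough illustration of the idea of replacing neglected modes with a stochastic term, the following minimal sketch integrates a single retained mode with linear decay plus Gaussian noise using the Euler-Maruyama scheme. It is not the authors' model: the scalar dynamics, the function name `simulate_retained_mode`, and all parameter values are assumptions chosen only to show the mechanism.

```python
import math
import random


def simulate_retained_mode(x0, decay, sigma, dt, steps, seed=0):
    """Euler-Maruyama integration of dx = -decay * x dt + sigma dW.

    The noise term stands in for the truncated (neglected) modes.
    All names and parameters here are hypothetical.
    """
    rng = random.Random(seed)  # seeded for reproducibility
    x = x0
    path = [x0]
    for _ in range(steps):
        noise = sigma * math.sqrt(dt) * rng.gauss(0.0, 1.0)
        x = x + (-decay * x) * dt + noise
        path.append(x)
    return path


# sigma = 0 recovers the purely truncated (deterministic) model
deterministic = simulate_retained_mode(1.0, decay=2.0, sigma=0.0, dt=0.01, steps=100)
# sigma > 0 adds variability attributed to the neglected modes
noisy = simulate_retained_mode(1.0, decay=2.0, sigma=0.5, dt=0.01, steps=100)
```

Comparing the mean and variance of `noisy` against those of `deterministic` gives a toy analogue of the comparison between stochastic and purely truncated reduced models described in the abstract.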
Subjects
Models, Neurological , Nerve Net/physiology , Neurons/physiology , Noise , Stochastic Processes , Visual Cortex/physiology , Animals , Computer Simulation , Humans , Visual Cortex/cytology
ABSTRACT
α-Dystroglycan (α-DG) is uniquely modified on O-mannose sites by a repeating disaccharide (-Xylα1,3-GlcAβ1,3-)n termed matriglycan, which is a receptor for laminin-G domain-containing proteins and employed by old-world arenaviruses for infection. Using chemoenzymatically synthesized matriglycans printed as a microarray, we demonstrate length-dependent binding to Laminin, Lassa virus GP1, and the clinically important antibody IIH6. Utilizing an enzymatic engineering approach, an N-linked glycoprotein was converted into a IIH6-positive Laminin-binding glycoprotein. Engineering of the surface of cells deficient for either α-DG or O-mannosylation with matriglycans of sufficient length recovers infection with a Lassa-pseudovirus. Finally, free matriglycan in a dose- and length-dependent manner inhibits viral infection of wild-type cells. These results indicate that matriglycan alone is necessary and sufficient for IIH6 staining, Laminin and LASV GP1 binding, and Lassa-pseudovirus infection and support a model in which it is a tunable receptor for which increasing chain length enhances ligand-binding capacity.
Subjects
Dystroglycans , Laminin , Dystroglycans/metabolism , Glycoproteins/metabolism , Laminin/metabolism , Lassa virus/metabolism , Polysaccharides/metabolism
ABSTRACT
BACKGROUND: Metal-containing proteins comprise a diverse and sizable category within the proteomes of organisms, ranging from proteins that use metals to catalyze reactions to proteins in which metals play key structural roles. Unfortunately, reliably predicting that a protein will contain a specific metal from its amino acid sequence is not currently possible. We recently developed a generally-applicable experimental technique for finding metalloproteins on a genome-wide scale. Applying this metal-directed protein purification approach (ICP-MS and MS/MS based) to the prototypical microbe Pyrococcus furiosus conclusively demonstrated the extent and diversity of the uncharacterized portion of microbial metalloproteomes since a majority of the observed metal peaks could not be assigned to known or predicted metalloproteins. However, even using this technique, it is not technically feasible to purify to homogeneity all metalloproteins in an organism. In order to address these limitations and complement the metal-directed protein purification, we developed a computational infrastructure and statistical methodology to aid in the pursuit and identification of novel metalloproteins. RESULTS: We demonstrate that our methodology enables predictions of metal-protein interactions using an experimental data set derived from a chromatography fractionation experiment in which 870 proteins and 10 metals were measured over 2,589 fractions. For each of the 10 metals, cobalt, iron, manganese, molybdenum, nickel, lead, tungsten, uranium, vanadium, and zinc, clusters of proteins frequently occurring in metal peaks (of a specific metal) within the fractionation space were defined. This resulted in predictions that there are from 5 undiscovered vanadium- to 13 undiscovered cobalt-containing proteins in Pyrococcus furiosus. 
Molybdenum and nickel were chosen for additional assessment producing lists of genes predicted to encode metalloproteins or metalloprotein subunits, 22 for nickel including seven from known nickel-proteins, and 20 for molybdenum including two from known molybdo-proteins. The uncharacterized proteins are prime candidates for metal-based purification or recombinant approaches to validate these predictions. CONCLUSIONS: We conclude that the largely uncharacterized extent of native metalloproteomes can be revealed through analysis of the co-occurrence of metals and proteins across a fractionation space. This can significantly impact our understanding of metallobiochemistry, disease mechanisms, and metal toxicity, with implications for bioremediation, medicine and other fields.
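The co-occurrence idea can be sketched in a few lines. The example below is a simplified count, not the statistical methodology of the paper: it marks fractions whose metal signal exceeds a threshold as peak fractions and counts how often each protein was detected in them. The function name, the threshold rule, and the toy identifiers and values are all assumptions.

```python
from collections import Counter


def proteins_in_metal_peaks(metal_profile, protein_fractions, threshold):
    """Count, per protein, the metal-peak fractions in which it was detected.

    metal_profile: fraction id -> metal signal (hypothetical units)
    protein_fractions: protein id -> set of fraction ids where it was found
    threshold: signal above which a fraction counts as a peak (assumption)
    """
    peak_fractions = {f for f, signal in metal_profile.items() if signal > threshold}
    counts = Counter()
    for protein, fractions in protein_fractions.items():
        overlap = len(fractions & peak_fractions)
        if overlap:
            counts[protein] = overlap
    return counts


# Toy data: made-up fraction ids, signals, and protein names
nickel = {"F1": 0.1, "F2": 5.0, "F3": 4.2, "F4": 0.0}
detected = {"prot_A": {"F2", "F3"}, "prot_B": {"F1"}, "prot_C": {"F3", "F4"}}
ranking = proteins_in_metal_peaks(nickel, detected, threshold=1.0).most_common()
```

Proteins near the top of `ranking` would be the first candidates for follow-up; a real analysis would also need to account for proteins that are simply abundant across many fractions.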
Subjects
Computational Biology/methods , Metalloproteins/analysis , Proteome/analysis , Tandem Mass Spectrometry , Amino Acid Sequence , Bacterial Proteins/analysis , Bacterial Proteins/chemistry , Bacterial Proteins/isolation & purification , Databases, Protein , Electronic Data Processing/methods , Metalloproteins/chemistry , Metalloproteins/isolation & purification , Metals/analysis , Metals/chemistry , Metals/metabolism , Molybdenum/chemistry , Nickel/chemistry , Protein Interaction Domains and Motifs , Pyrococcus furiosus/metabolism
ABSTRACT
Virtually all cellular processes are carried out by dynamic molecular assemblies or multiprotein complexes, the compositions of which are largely undefined. They cannot be predicted solely from bioinformatics analyses nor are there well-defined techniques currently available to unequivocally identify protein complexes (PCs). To address this issue, we attempted to directly determine the identity of PCs from native microbial biomass using Pyrococcus furiosus, a hyperthermophilic archaeon that grows optimally at 100 °C, as the model organism. Novel PCs were identified by large-scale fractionation of the native proteome using non-denaturing, sequential column chromatography under anaerobic, reducing conditions. A total of 967 distinct P. furiosus proteins were identified by mass spectrometry (nano LC-ESI-MS/MS), representing approximately 80% of the cytoplasmic proteins. Based on the co-fractionation of proteins that are encoded by adjacent genes on the chromosome, 106 potential heteromeric PCs containing 243 proteins were identified, only 20 of which were known or expected. In addition to those of unknown function, novel and uncharacterized PCs were identified that are proposed to be involved in the metabolism of amino acids (10), carbohydrates (four), lipids (two), vitamins and metals (three), and DNA and RNA (nine). A further 30 potential PCs were classified as tentative, and the remaining potential PCs (13) were classified as weakly interacting. Some major advantages of native biomass fractionation for PC identification are that it provides a road map for the (partial) purification of native forms of novel and uncharacterized PCs, and the results can be utilized for the recombinant production of low abundance PCs to provide enough material for detailed structural and biochemical analyses.
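The criterion of adjacent genes whose products co-fractionate can be illustrated with a small sketch. This is an assumed simplification, not the published procedure: it walks along the gene order and groups neighbouring genes whose proteins share at least `min_shared` fractions. The function name, the sharing rule, and the gene and fraction labels are hypothetical.

```python
def candidate_complexes(gene_order, elution, min_shared=1):
    """Group runs of chromosomally adjacent genes whose proteins co-fractionate.

    gene_order: list of gene ids in chromosomal order
    elution: gene id -> set of fraction ids in which its protein was detected
    min_shared: fractions two neighbours must share to be linked (assumption)
    """
    if not gene_order:
        return []
    groups = []
    current = [gene_order[0]]
    for prev, gene in zip(gene_order, gene_order[1:]):
        shared = elution.get(prev, set()) & elution.get(gene, set())
        if len(shared) >= min_shared:
            current.append(gene)  # extend the current run
        else:
            if len(current) > 1:
                groups.append(current)  # keep only runs of two or more genes
            current = [gene]
    if len(current) > 1:
        groups.append(current)
    return groups


# Toy data: made-up gene ids and fraction numbers
gene_order = ["gene1", "gene2", "gene3", "gene4", "gene5"]
elution = {
    "gene1": {1, 2},
    "gene2": {2, 3},
    "gene3": {7},
    "gene4": {7, 8},
    "gene5": {1},
}
complexes = candidate_complexes(gene_order, elution)
```

Raising `min_shared` makes the grouping stricter, loosely mirroring the way the abstract separates well-supported complexes from tentative or weakly interacting ones.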
Subjects
Archaeal Proteins/analysis , Chemical Fractionation/methods , Multiprotein Complexes/analysis , Proteome/analysis , Pyrococcus furiosus/metabolism , Amino Acids/metabolism , Archaeal Proteins/isolation & purification , Cytoplasm/metabolism , Protein Denaturation , Protein Multimerization
ABSTRACT
The canonical O-mannosylation pathway in humans is essential for the functional glycosylation of α-dystroglycan. Disruption of this post-translational modification pathway leads to congenital muscular dystrophies. The first committed step in the construction of a functional matriglycan structure involves the post-translational modification of α-dystroglycan. This is essential for binding extracellular matrix proteins and arenaviruses, and is catalyzed by β-1,4-N-acetylglucosaminyltransferase 2 (POMGNT2). While another glycosyl transferase, β-1,4-N-acetylglucosaminyltransferase 1 (POMGNT1), has been shown to be promiscuous in extending O-mannosylated sites, POMGNT2 has been shown to display significant primary amino-acid selectivity near the site of O-mannosylation. Moreover, several single point mutations in POMGNT2 have been identified in patients with assorted dystroglycanopathies such as Walker-Warburg syndrome and limb girdle muscular dystrophy. To gain insight into POMGNT2 function in humans, the enzyme was expressed as a soluble, secreted fusion protein by transient infection of HEK293 suspension cultures. Here, crystal structures of POMGNT2 (amino-acid residues 25-580) with and without UDP bound are reported. Consistent with a novel fold and a unique domain organization, no molecular-replacement model was available and phases were obtained through crystallization of a selenomethionine variant of the enzyme in the same space group. Tetragonal (space group P42₁2; unit-cell parameters a = b = 129.8, c = 81.6 Å, α = β = γ = 90°) crystals with UDP bound diffracted to 1.98 Å resolution and contained a single monomer in the asymmetric unit. Orthorhombic (space group P2₁2₁2₁; unit-cell parameters a = 142.3, b = 153.9, c = 187.4 Å, α = β = γ = 90°) crystals were also obtained; they diffracted to 2.57 Å resolution and contained four monomers with differential glycosylation patterns and conformations.
These structures provide the first rational basis for an explanation of the loss-of-function mutations and offer significant insights into the mechanics of this important human enzyme.
Subjects
Dystroglycans/metabolism , Glycosyltransferases/chemistry , Muscular Dystrophies/metabolism , Binding Sites , Glycosylation , HEK293 Cells , Humans , Protein Binding
ABSTRACT
The SARS-CoV-2 betacoronavirus uses its highly glycosylated trimeric Spike protein to bind to the cell surface receptor angiotensin converting enzyme 2 (ACE2) glycoprotein and facilitate host cell entry. We utilized glycomics-informed glycoproteomics to characterize site-specific microheterogeneity of glycosylation for a recombinant trimer Spike mimetic immunogen and for a soluble version of human ACE2. We combined this information with bioinformatics analyses of natural variants and with existing 3D structures of both glycoproteins to generate molecular dynamics simulations of each glycoprotein both alone and interacting with one another. Our results highlight roles for glycans in sterically masking polypeptide epitopes and directly modulating Spike-ACE2 interactions. Furthermore, our results illustrate the impact of viral evolution and divergence on Spike glycosylation, as well as the influence of natural variants on ACE2 receptor glycosylation. Taken together, these data can facilitate immunogen design to achieve antibody neutralization and inform therapeutic strategies to inhibit viral infection.
Subjects
Betacoronavirus/metabolism , Coronavirus Infections/enzymology , Coronavirus Infections/virology , Peptidyl-Dipeptidase A/metabolism , Pneumonia, Viral/enzymology , Pneumonia, Viral/virology , Spike Glycoprotein, Coronavirus/metabolism , Angiotensin-Converting Enzyme 2 , COVID-19 , Glycosylation , HEK293 Cells , Humans , Molecular Dynamics Simulation , Pandemics , Peptidyl-Dipeptidase A/chemistry , Protein Domains , Protein Interaction Domains and Motifs , Receptors, Virus/chemistry , Receptors, Virus/metabolism , SARS-CoV-2 , Spike Glycoprotein, Coronavirus/chemistry , Virus Internalization
ABSTRACT
The current COVID-19 pandemic is caused by the SARS-CoV-2 betacoronavirus, which utilizes its highly glycosylated trimeric Spike protein to bind to the cell surface receptor ACE2 glycoprotein and facilitate host cell entry. We utilized glycomics-informed glycoproteomics to characterize site-specific microheterogeneity of glycosylation for a recombinant trimer Spike mimetic immunogen and for a soluble version of human ACE2. We combined this information with bioinformatic analyses of natural variants and with existing 3D-structures of both glycoproteins to generate molecular dynamics simulations of each glycoprotein alone and interacting with one another. Our results highlight roles for glycans in sterically masking polypeptide epitopes and directly modulating Spike-ACE2 interactions. Furthermore, our results illustrate the impact of viral evolution and divergence on Spike glycosylation, as well as the influence of natural variants on ACE2 receptor glycosylation that, taken together, can facilitate immunogen design to achieve antibody neutralization and inform therapeutic strategies to inhibit viral infection.
ABSTRACT
The common goal for structural genomic centers and consortiums is to decipher as quickly as possible the three-dimensional structures for a multitude of recombinant proteins derived from known genomic sequences. Since X-ray crystallography is the foremost method to acquire atomic resolution for macromolecules, the limiting step is obtaining protein crystals that can be useful for structure determination. High-throughput methods have been developed in recent years to clone, express, purify, crystallize and determine the three-dimensional structure of a protein gene product rapidly using automated devices, commercialized kits and consolidated protocols. However, the average number of protein structures obtained for most structural genomic groups has been very low compared to the total number of proteins purified. As more entire genomic sequences are obtained for different organisms from the three kingdoms of life, only the proteins that can be crystallized and whose structures can be obtained easily are studied. Consequently, an astonishing number of genomic proteins remain unexamined. In the era of high-throughput processes, traditional methods in molecular biology, protein chemistry and crystallization are eclipsed by automation and pipeline practices. The necessity for high-rate production of protein crystals and structures has prevented the usage of more intellectual strategies and creative approaches in experimental executions. Fundamental principles and personal experiences in protein chemistry and crystallization are minimally exploited only to obtain "low-hanging fruit" protein structures. We review the practical aspects of today's high-throughput manipulations and discuss the challenges in fast-paced protein crystallization and tools for crystallography. Structural genomic pipelines can be improved with information gained from low-throughput tactics that may help us reach the higher-bearing fruits.
Examples of recent developments in this area are reported from the efforts of the Southeast Collaboratory for Structural Genomics (SECSG).
Subjects
Crystallization/instrumentation , Crystallization/methods , Crystallography, X-Ray/instrumentation , Crystallography, X-Ray/methods , Proteins/chemistry , Proteins/ultrastructure , Multiprotein Complexes/analysis , Multiprotein Complexes/chemistry , Multiprotein Complexes/ultrastructure , Proteins/analysis , Systems Integration
ABSTRACT
Multiple glycosyltransferases are essential for the proper modification of alpha-dystroglycan, as mutations in the encoding genes cause congenital/limb-girdle muscular dystrophies. Here we elucidate further the structure of an O-mannose-initiated glycan on alpha-dystroglycan that is required to generate its extracellular matrix-binding polysaccharide. This functional glycan contains a novel ribitol structure that links a phosphotrisaccharide to xylose. ISPD is a CDP-ribitol (ribose) pyrophosphorylase that generates the reduced sugar nucleotide for the insertion of ribitol in a phosphodiester linkage to the glycoprotein. TMEM5 is a UDP-xylosyl transferase that elaborates the structure. We demonstrate in a zebrafish model as well as in a human patient that defects in TMEM5 result in muscular dystrophy in combination with abnormal brain development. Thus, we propose a novel structure (a ribitol in a phosphodiester linkage) for the moiety on which TMEM5, B4GAT1, and LARGE act to generate the functional receptor for ECM proteins having LG domains.
Subjects
Dystroglycans/chemistry , Dystroglycans/metabolism , Extracellular Matrix/metabolism , Membrane Proteins/metabolism , Polysaccharides/analysis , Animals , Humans , Mannose/analysis , Nucleotidyltransferases/metabolism , Pentosyltransferases , Protein Binding , Ribitol/analysis , Zebrafish
ABSTRACT
Recent studies demonstrated that mutations in B3GNT1, an enzyme proposed to be involved in poly-N-acetyllactosamine synthesis, were causal for congenital muscular dystrophy with hypoglycosylation of α-dystroglycan (secondary dystroglycanopathies). Since defects in the O-mannosylation protein glycosylation pathway are primarily responsible for dystroglycanopathies and with no established O-mannose-initiated structures containing a β3-linked GlcNAc known, we biochemically interrogated this human enzyme. Here we report this enzyme is not a β-1,3-N-acetylglucosaminyltransferase with catalytic activity towards β-galactose but rather a β-1,4-glucuronyltransferase, designated B4GAT1, towards both α- and β-anomers of xylose. The dual-activity LARGE enzyme is capable of extending products of B4GAT1 and we provide experimental evidence that B4GAT1 is the priming enzyme for LARGE. Our results further define the functional O-mannosylated glycan structure and indicate that B4GAT1 is involved in the initiation of the LARGE-dependent repeating disaccharide that is necessary for extracellular matrix protein binding to O-mannosylated α-dystroglycan that is lacking in secondary dystroglycanopathies.
Subjects
Dystroglycans/metabolism , N-Acetylglucosaminyltransferases/metabolism , Amino Acid Sequence , Biocatalysis , Disaccharides/metabolism , Glycosylation , HEK293 Cells , Humans , Kinetics , Models, Biological , Molecular Sequence Data , N-Acetylglucosaminyltransferases/chemistry , Pentosyltransferases/metabolism , Solubility , Stereoisomerism , Substrate Specificity , Trisaccharides/metabolism , Uridine Diphosphate Glucuronic Acid/metabolism , Xylose/chemistry , Xylose/metabolism , UDP Xylose-Protein Xylosyltransferase
ABSTRACT
The determination of protein structures on a genomic scale requires both computing capacity and efficiency increases at many stages along the complex process. By combining bioinformatics workflow-management techniques, cluster-based computing and popular crystallographic structure-determination software packages, an efficient and powerful new tool for structural biology/genomics has been developed. Using the workflow manager and a simple web interface, the researcher can, in a few easy steps, set up hundreds of structure-determination jobs, each using a slightly different set of program input parameters, thus efficiently screening parameter space for the optimal input-parameter combination, i.e. a set of parameters that leads to a successful structure determination. Upon completion, results from the programs are harvested, analyzed, sorted based on success and presented to the user via the web interface. This approach has been applied with success in more than 30 cases. Examples of successful structure determinations based on single-wavelength anomalous scattering (SAS) are described and include cases where the 'rational' crystallographer-based selection of input-parameter values had failed.
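The screening loop described above (one job per parameter combination, then harvest and sort) can be sketched as follows. This is only an illustration under stated assumptions: the parameter names `resolution_cutoff` and `solvent_fraction`, the scoring function `toy_job`, and the serial execution are hypothetical stand-ins for real crystallographic programs running on a cluster.

```python
from itertools import product


def screen_parameter_space(grid, run_job):
    """Run one job per parameter combination; return results sorted best first.

    grid: parameter name -> list of candidate values
    run_job: callable taking a parameter dict and returning a numeric score
    """
    names = sorted(grid)
    results = []
    for values in product(*(grid[name] for name in names)):
        params = dict(zip(names, values))
        results.append((run_job(params), params))
    # Harvest step: highest score first
    results.sort(key=lambda item: item[0], reverse=True)
    return results


def toy_job(params):
    # Stand-in for a structure-determination run; the score is made up
    return -abs(params["resolution_cutoff"] - 2.5) - abs(params["solvent_fraction"] - 0.5)


grid = {
    "resolution_cutoff": [2.0, 2.5, 3.0],
    "solvent_fraction": [0.4, 0.5, 0.6],
}
ranked = screen_parameter_space(grid, toy_job)
best_score, best_params = ranked[0]
```

In practice each call to `run_job` would be submitted to a cluster scheduler rather than run in a loop, but the harvesting and ranking step would look much the same.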
Subjects
Proteins/chemistry , Computational Biology , Crystallization , Crystallography, X-Ray , Data Interpretation, Statistical , Protein Conformation , Scattering, Radiation , Software , Structure-Activity Relationship
ABSTRACT
Using a high degree of automation, the crystallography core at the Southeast Collaboratory for Structural Genomics (SECSG) has developed a high-throughput protein-to-structure pipeline. Various robots and automation procedures have been adopted and integrated into a pipeline that is capable of screening 40 proteins for crystallization and solving four protein structures per week. This pipeline is composed of three major units: crystallization, structure determination/validation and crystallomics. Coupled with the protein-production cores at SECSG, the protein-to-structure pipeline provides a two-tiered approach for protein production at SECSG. In tier 1, all protein samples supplied by the protein-production cores pass through the pipeline using standard crystallization screening and optimization procedures. The protein targets that failed to yield diffraction-quality crystals (resolution better than 3.0 Å) become tier 2 or salvaging targets. The goal of tier 2 target salvaging, carried out by the crystallomics core, is to produce the target proteins with increased purity and homogeneity, which would render them more likely to yield well diffracting crystals. This is performed by alternative purification procedures and/or the introduction of chemical modifications to the proteins (such as tag removal, methylation, surface mutagenesis, selenomethionine labelling etc.). Details of the various procedures in the pipeline for protein crystallization, target salvaging, data collection/processing and high-throughput structure determination/validation, as well as some examples, are described.