Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 112
Filtrar
Mais filtros

Base de dados
País/Região como assunto
Tipo de documento
Intervalo de ano de publicação
1.
Nucleic Acids Res ; 51(D1): D466-D478, 2023 01 06.
Artigo em Inglês | MEDLINE | ID: mdl-36300618

RESUMO

Proteins often act through oligomeric interactions with other proteins. X-ray crystallography and cryo-electron microscopy provide detailed information on the structures of biological assemblies, defined as the most likely biologically relevant structures derived from experimental data. In crystal structures, the most relevant assembly may be ambiguously determined, since multiple assemblies observed in the crystal lattice may be plausible. It is estimated that 10-15% of PDB entries may have incorrect or ambiguous assembly annotations. Accurate assemblies are required for understanding functional data and training of deep learning methods for predicting assembly structures. As with any other kind of biological data, replication via multiple independent experiments provides important validation for the determination of biological assembly structures. Here we present the Protein Common Assembly Database (ProtCAD), which presents clusters of protein assembly structures observed in independent structure determinations of homologous proteins in the Protein Data Bank (PDB). ProtCAD is searchable by PDB entry, UniProt identifiers, or Pfam domain designations and provides downloads of coordinate files, PyMol scripts, and publicly available assembly annotations for each cluster of assemblies. About 60% of PDB entries contain assemblies in clusters of at least 2 independent experiments. All clusters and coordinates are available on ProtCAD web site (http://dunbrack2.fccc.edu/protcad).


Assuntos
Bases de Dados de Proteínas , Complexos Multiproteicos , Proteínas , Microscopia Crioeletrônica , Cristalografia por Raios X , Proteínas/química , Complexos Multiproteicos/química
2.
J Biol Chem ; 299(8): 104965, 2023 08.
Artigo em Inglês | MEDLINE | ID: mdl-37356718

RESUMO

Janus Kinase-1 (JAK1) plays key roles during neurodevelopment and following neuronal injury, while activatory JAK1 mutations are linked to leukemia. In mice, Jak1 genetic deletion results in perinatal lethality, suggesting non-redundant roles and/or regulation of JAK1 for which other JAKs cannot compensate. Proteomic studies reveal that JAK1 is more likely palmitoylated compared to other JAKs, implicating palmitoylation as a possible JAK1-specific regulatory mechanism. However, the importance of palmitoylation for JAK1 signaling has not been addressed. Here, we report that JAK1 is palmitoylated in transfected HEK293T cells and endogenously in cultured Dorsal Root Ganglion (DRG) neurons. We further use comprehensive screening in transfected non-neuronal cells and shRNA-mediated knockdown in DRG neurons to identify the related enzymes ZDHHC3 and ZDHHC7 as dominant protein acyltransferases (PATs) for JAK1. Surprisingly, we found palmitoylation minimally affects JAK1 localization in neurons, but is critical for JAK1's kinase activity in cells and even in vitro. We propose this requirement is likely because palmitoylation facilitates transphosphorylation of key sites in JAK1's activation loop, a possibility consistent with structural models of JAK1. Importantly, we demonstrate a leukemia-associated JAK1 mutation overrides the palmitoylation-dependence of JAK1 activity, potentially explaining why this mutation is oncogenic. Finally, we show that JAK1 palmitoylation is important for neuropoietic cytokine-dependent signaling and neuronal survival and that combined Zdhhc3/7 loss phenocopies loss of palmitoyl-JAK1. These findings provide new insights into the control of JAK signaling in both physiological and pathological contexts.


Assuntos
Citocinas , Lipoilação , Neurônios , Transdução de Sinais , Animais , Feminino , Humanos , Camundongos , Gravidez , Citocinas/metabolismo , Gânglios Espinais/metabolismo , Células HEK293 , Janus Quinase 1/genética , Janus Quinase 1/metabolismo , Neurônios/citologia , Neurônios/metabolismo , Proteômica , Sobrevivência Celular
3.
Nucleic Acids Res ; 50(D1): D654-D664, 2022 01 07.
Artigo em Inglês | MEDLINE | ID: mdl-34643709

RESUMO

The active form of kinases is shared across different family members, as are several commonly observed inactive forms. We previously performed a clustering of the conformation of the activation loop of all protein kinase structures in the Protein Data Bank (PDB) into eight classes based on the dihedral angles that place the Phe side chain of the DFG motif at the N-terminus of the activation loop. Our clusters are strongly associated with the placement of the activation loop, the C-helix, and other structural elements of kinases. We present Kincore, a web resource providing access to our conformational assignments for kinase structures in the PDB. While other available databases provide conformational states or drug type but not both, KinCore includes the conformational state and the inhibitor type (Type 1, 1.5, 2, 3, allosteric) for each kinase chain. The user can query and browse the database using these attributes or determine the conformational labels of a kinase structure using the web server or a standalone program. The database and labeled structure files can be downloaded from the server. Kincore will help in understanding the conformational dynamics of these proteins and guide development of inhibitors targeting specific states. Kincore is available at http://dunbrack.fccc.edu/kincore.


Assuntos
Bases de Dados de Proteínas , Inibidores de Proteínas Quinases/classificação , Proteínas Quinases/classificação , Software , Conformação Proteica , Inibidores de Proteínas Quinases/química , Proteínas Quinases/química
4.
Proteomics ; 23(17): e2200323, 2023 09.
Artigo em Inglês | MEDLINE | ID: mdl-37365936

RESUMO

Reliably scoring and ranking candidate models of protein complexes and assigning their oligomeric state from the structure of the crystal lattice represent outstanding challenges. A community-wide effort was launched to tackle these challenges. The latest resources on protein complexes and interfaces were exploited to derive a benchmark dataset consisting of 1677 homodimer protein crystal structures, including a balanced mix of physiological and non-physiological complexes. The non-physiological complexes in the benchmark were selected to bury a similar or larger interface area than their physiological counterparts, making it more difficult for scoring functions to differentiate between them. Next, 252 functions for scoring protein-protein interfaces previously developed by 13 groups were collected and evaluated for their ability to discriminate between physiological and non-physiological complexes. A simple consensus score generated using the best performing score of each of the 13 groups, and a cross-validated Random Forest (RF) classifier were created. Both approaches showed excellent performance, with an area under the Receiver Operating Characteristic (ROC) curve of 0.93 and 0.94, respectively, outperforming individual scores developed by different groups. Additionally, AlphaFold2 engines recalled the physiological dimers with significantly higher accuracy than the non-physiological set, lending support to the reliability of our benchmark dataset annotations. Optimizing the combined power of interface scoring functions and evaluating it on challenging benchmark datasets appears to be a promising strategy.


Assuntos
Proteínas , Reprodutibilidade dos Testes , Proteínas/metabolismo , Ligação Proteica
5.
BMC Genomics ; 24(1): 212, 2023 Apr 24.
Artigo em Inglês | MEDLINE | ID: mdl-37095444

RESUMO

BACKGROUND: Early-onset renal cell carcinoma (eoRCC) is typically associated with pathogenic germline variants (PGVs) in RCC familial syndrome genes. However, most eoRCC patients lack PGVs in familial RCC genes and their genetic risk remains undefined. METHODS: Here, we analyzed biospecimens from 22 eoRCC patients that were seen at our institution for genetic counseling and tested negative for PGVs in RCC familial syndrome genes. RESULTS: Analysis of whole-exome sequencing (WES) data found enrichment of candidate pathogenic germline variants in DNA repair and replication genes, including multiple DNA polymerases. Induction of DNA damage in peripheral blood monocytes (PBMCs) significantly elevated numbers of [Formula: see text]H2AX foci, a marker of double-stranded breaks, in PBMCs from eoRCC patients versus PBMCs from matched cancer-free controls. Knockdown of candidate variant genes in Caki RCC cells increased [Formula: see text]H2AX foci. Immortalized patient-derived B cell lines bearing the candidate variants in DNA polymerase genes (POLD1, POLH, POLE, POLK) had DNA replication defects compared to control cells. Renal tumors carrying these DNA polymerase variants were microsatellite stable but had a high mutational burden. Direct biochemical analysis of the variant Pol δ and Pol η polymerases revealed defective enzymatic activities. CONCLUSIONS: Together, these results suggest that constitutional defects in DNA repair underlie a subset of eoRCC cases. Screening patient lymphocytes to identify these defects may provide insight into mechanisms of carcinogenesis in a subset of genetically undefined eoRCCs. Evaluation of DNA repair defects may also provide insight into the cancer initiation mechanisms for subsets of eoRCCs and lay the foundation for targeting DNA repair vulnerabilities in eoRCC.


Assuntos
Carcinoma de Células Renais , Neoplasias Renais , Humanos , Predisposição Genética para Doença , Replicação do DNA , Mutação em Linhagem Germinativa , Células Germinativas
6.
Proc Natl Acad Sci U S A ; 116(14): 6818-6827, 2019 04 02.
Artigo em Inglês | MEDLINE | ID: mdl-30867294

RESUMO

Targeting protein kinases is an important strategy for intervention in cancer. Inhibitors are directed at the active conformation or a variety of inactive conformations. While attempts have been made to classify these conformations, a structurally rigorous catalog of states has not been achieved. The kinase activation loop is crucial for catalysis and begins with the conserved DFGmotif. This motif is observed in two major classes of conformations, DFGin-a set of active and inactive conformations where the Phe residue is in contact with the C-helix of the N-terminal lobe-and DFGout-an inactive form where Phe occupies the ATP site exposing the C-helix pocket. We have developed a clustering of kinase conformations based on the location of the Phe side chain (DFGin, DFGout, and DFGinter or intermediate) and the backbone dihedral angles of the sequence X-D-F, where X is the residue before the DFGmotif, and the DFG-Phe side-chain rotamer, utilizing a density-based clustering algorithm. We have identified eight distinct conformations and labeled them based on the Ramachandran regions (A, alpha; B, beta; L, left) of the XDF motif and the Phe rotamer (minus, plus, trans). Our clustering divides the DFGin group into six clusters including BLAminus, which contains active structures, and two common inactive forms, BLBplus and ABAminus. DFGout structures are predominantly in the BBAminus conformation, which is essentially required for binding type II inhibitors. The inactive conformations have specific features that make them unable to bind ATP, magnesium, and/or substrates. Our structurally intuitive nomenclature will aid in understanding the conformational dynamics of kinases and structure-based development of kinase drugs.


Assuntos
Algoritmos , Modelos Moleculares , Proteínas Quinases , Domínio Catalítico , Inibidores de Proteínas Quinases/química , Proteínas Quinases/química , Proteínas Quinases/genética , Estrutura Secundária de Proteína , Especificidade por Substrato , Terminologia como Assunto
7.
Proc Natl Acad Sci U S A ; 115(8): 1889-1894, 2018 02 20.
Artigo em Inglês | MEDLINE | ID: mdl-29432160

RESUMO

Unlike αß-T lineage cells, where the role of ligand in intrathymic selection is well established, the role of ligand in the development of γδ-T cells remains controversial. Here we provide evidence for the role of a bona fide selecting ligand in shaping the γδ-T cell-receptor (TCR) repertoire. Reactivity of the γδ-TCR with the major histocompatibility complex (MHC) Class Ib ligands, H2-T10/22, is critically dependent upon the EGYEL motif in the complementarity determining region 3 (CDR3) of TCRδ. In the absence of H2-T10/22 ligand, the commitment of H2-T10/22 reactive γδ-T cells to the γδ fate is diminished, and the specification of those γδ committed cells to the IFN-γ or interleukin-17 effector fate is altered. Furthermore, those cells that do adopt the γδ fate and mature exhibit a profound alteration in the γδTCR repertoire, including depletion of the EGYEL motif and reductions in both CDR3δ length and charge. Taken together, these data suggest that ligand plays an important role in shaping the TCR repertoire of γδ-T cells.


Assuntos
Receptores de Antígenos de Linfócitos T gama-delta/metabolismo , Subpopulações de Linfócitos T/fisiologia , Animais , Linhagem da Célula , Ligantes , Camundongos , Ligação Proteica , Receptores de Antígenos de Linfócitos T gama-delta/genética
8.
PLoS Comput Biol ; 15(3): e1006844, 2019 03.
Artigo em Inglês | MEDLINE | ID: mdl-30845191

RESUMO

Protein loops connect regular secondary structures and contain 4-residue beta turns which represent 63% of the residues in loops. The commonly used classification of beta turns (Type I, I', II, II', VIa1, VIa2, VIb, and VIII) was developed in the 1970s and 1980s from analysis of a small number of proteins of average resolution, and represents only two thirds of beta turns observed in proteins (with a generic class Type IV representing the rest). We present a new clustering of beta-turn conformations from a set of 13,030 turns from 1074 ultra-high resolution protein structures (≤1.2 Å). Our clustering is derived from applying the DBSCAN and k-medoids algorithms to this data set with a metric commonly used in directional statistics applied to the set of dihedral angles from the second and third residues of each turn. We define 18 turn types compared to the 8 classical turn types in common use. We propose a new 2-letter nomenclature for all 18 beta-turn types using Ramachandran region names for the two central residues (e.g., 'A' and 'D' for alpha regions on the left side of the Ramachandran map and 'a' and 'd' for equivalent regions on the right-hand side; classical Type I turns are 'AD' turns and Type I' turns are 'ad'). We identify 11 new types of beta turn, 5 of which are sub-types of classical beta-turn types. Up-to-date statistics, probability densities of conformations, and sequence profiles of beta turns in loops were collected and analyzed. A library of turn types, BetaTurnLib18, and cross-platform software, BetaTurnTool18, which identifies turns in an input protein structure, are freely available and redistributable from dunbrack.fccc.edu/betaturn and github.com/sh-maxim/BetaTurn18. Given the ubiquitous nature of beta turns, this comprehensive study updates understanding of beta turns and should also provide useful tools for protein structure determination, refinement, and prediction programs.


Assuntos
Proteínas/química , Terminologia como Assunto , Algoritmos , Sequência de Aminoácidos , Aminoácidos/química , Análise por Conglomerados , Conformação Proteica , Reprodutibilidade dos Testes
9.
J Biol Chem ; 293(36): 13921-13931, 2018 09 07.
Artigo em Inglês | MEDLINE | ID: mdl-30030379

RESUMO

Mutations in the cystathionine ß-synthase (CBS) gene are the cause of classical homocystinuria, the most common inborn error in sulfur metabolism. The p.G307S mutation is the most frequent cause of CBS deficiency in Ireland, which has the highest prevalence of CBS deficiency in Europe. Individuals homozygous for this mutation tend to be severely affected and are pyridoxine nonresponsive, but the molecular basis for the strong effects of this mutation is unclear. Here, we characterized a transgenic mouse model lacking endogenous Cbs and expressing human p.G307S CBS protein from a zinc-inducible metallothionein promoter (Tg-G307S Cbs-/-). Unlike mice expressing other mutant CBS alleles, the Tg-G307S transgene could not efficiently rescue neonatal lethality of Cbs-/- in a C57BL/6J background. In a C3H/HeJ background, zinc-induced Tg-G307S Cbs-/- mice expressed high levels of p.G307S CBS in the liver, and this protein variant forms multimers, similarly to mice expressing WT human CBS. However, the p.G307S enzyme had no detectable residual activity. Moreover, treating mice with proteasome inhibitors failed to significantly increase CBS-specific activity. These findings indicated that the G307S substitution likely affects catalytic function as opposed to causing a folding defect. Using molecular dynamics simulation techniques, we found that the G307S substitution likely impairs catalytic function by limiting the ability of the tyrosine at position 308 to assume the proper conformational state(s) required for the formation of the pyridoxal-cystathionine intermediate. These results indicate that the p.G307S CBS is stable but enzymatically inert and therefore unlikely to respond to chaperone-based therapy.


Assuntos
Cistationina beta-Sintase/genética , Mutação , Substituição de Aminoácidos , Animais , Catálise , Cistationina beta-Sintase/química , Cistationina beta-Sintase/metabolismo , Homocistinúria/genética , Humanos , Camundongos , Camundongos Endogâmicos C3H , Camundongos Endogâmicos C57BL , Camundongos Transgênicos , Inibidores de Proteassoma/farmacologia , Conformação Proteica , Estabilidade Proteica , Piridoxina/farmacologia
10.
BMC Med Genet ; 20(1): 125, 2019 07 15.
Artigo em Inglês | MEDLINE | ID: mdl-31307431

RESUMO

BACKGROUND: Alpha 1 Antitrypsin (AAT) is a key serum proteinase inhibitor encoded by SERPINA1. Sequence variants of the gene can cause Alpha 1 Antitrypsin Deficiency (AATD), a condition associated with lung and liver disease. The majority of AATD cases are caused by the 'Z' and 'S' variants - single-nucleotide variations (SNVs) that result in amino acid substitutions of E342K and E264V. However, SERPINA1 is highly polymorphic, with numerous potentially clinically relevant variants reported. Novel variants continue to be discovered, and without reports of pathogenicity, it can be difficult for clinicians to determine the best course of treatment. METHODS: We assessed the utility of next-generation sequencing (NGS) and predictive computational analysis to guide the diagnosis of patients suspected of having AATD. Blood samples on serum separator cards were submitted to the DNA1 Advanced Screening Program (Biocerna LLC, Fulton, Maryland, USA) by physicians whose patients were suspected of having AATD. Laboratory analyses included quantification of serum AAT levels, qualitative analysis by isoelectric focusing, and targeted genotyping and NGS of the SERPINA1 gene. Molecular modeling software UCSF Chimera (University College of San Francisco, CA) was used to visualize the positions of amino acid changes as a result of rare/novel SNVs. Predictive software was used to assess the potential pathogenicity of these variants; methods included a support vector machine (SVM) program, PolyPhen-2 (Harvard University, Cambridge, MA), and FoldX (Centre for Genomic Regulation, Barcelona, Spain). RESULTS: Samples from 23 patients were analyzed; 21 rare/novel sequence variants were identified by NGS, including splice variants (n = 2), base pair deletions (n = 1), stop codon insertions (n = 2), and SNVs (n = 16). Computational modeling of protein structures caused by the novel SNVs showed that 8 were probably deleterious, and two were possibly deleterious. For the majority of probably/possibly deleterious SNVs (I50N, P289S, M385T, M221T, D341V, V210E, P369H, V333M and A142D), the mechanism is probably via disruption of the packed hydrophobic core of AAT. Several deleterious variants occurred in combination with more common deficiency alleles, resulting in very low AAT levels. CONCLUSIONS: NGS and computational modeling are useful tools that can facilitate earlier, more precise diagnosis, and consideration for AAT therapy in AATD.


Assuntos
Variação Genética , Modelos Moleculares , Deficiência de alfa 1-Antitripsina/genética , alfa 1-Antitripsina/química , alfa 1-Antitripsina/genética , Adulto , Idoso , Idoso de 80 Anos ou mais , Substituição de Aminoácidos , Feminino , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Masculino , Pessoa de Meia-Idade , Pennsylvania , Conformação Proteica em alfa-Hélice , Splicing de RNA , Análise de Sequência de Proteína , Virulência/genética , alfa 1-Antitripsina/sangue , Deficiência de alfa 1-Antitripsina/diagnóstico
11.
PLoS Comput Biol ; 14(4): e1006112, 2018 04.
Artigo em Inglês | MEDLINE | ID: mdl-29702641

RESUMO

A structural-bioinformatics-based computational methodology and framework have been developed for the design of antibodies to targets of interest. RosettaAntibodyDesign (RAbD) samples the diverse sequence, structure, and binding space of an antibody to an antigen in highly customizable protocols for the design of antibodies in a broad range of applications. The program samples antibody sequences and structures by grafting structures from a widely accepted set of the canonical clusters of CDRs (North et al., J. Mol. Biol., 406:228-256, 2011). It then performs sequence design according to amino acid sequence profiles of each cluster, and samples CDR backbones using a flexible-backbone design protocol incorporating cluster-based CDR constraints. Starting from an existing experimental or computationally modeled antigen-antibody structure, RAbD can be used to redesign a single CDR or multiple CDRs with loops of different length, conformation, and sequence. We rigorously benchmarked RAbD on a set of 60 diverse antibody-antigen complexes, using two design strategies-optimizing total Rosetta energy and optimizing interface energy alone. We utilized two novel metrics for measuring success in computational protein design. The design risk ratio (DRR) is equal to the frequency of recovery of native CDR lengths and clusters divided by the frequency of sampling of those features during the Monte Carlo design procedure. Ratios greater than 1.0 indicate that the design process is picking out the native more frequently than expected from their sampled rate. We achieved DRRs for the non-H3 CDRs of between 2.4 and 4.0. The antigen risk ratio (ARR) is the ratio of frequencies of the native amino acid types, CDR lengths, and clusters in the output decoys for simulations performed in the presence and absence of the antigen. For CDRs, we achieved cluster ARRs as high as 2.5 for L1 and 1.5 for H2. For sequence design simulations without CDR grafting, the overall recovery for the native amino acid types for residues that contact the antigen in the native structures was 72% in simulations performed in the presence of the antigen and 48% in simulations performed without the antigen, for an ARR of 1.5. For the non-contacting residues, the ARR was 1.08. This shows that the sequence profiles are able to maintain the amino acid types of these conserved, buried sites, while recovery of the exposed, contacting residues requires the presence of the antigen-antibody interface. We tested RAbD experimentally on both a lambda and kappa antibody-antigen complex, successfully improving their affinities 10 to 50 fold by replacing individual CDRs of the native antibody with new CDR lengths and clusters.


Assuntos
Anticorpos/química , Software , Sequência de Aminoácidos , Animais , Anticorpos/genética , Anticorpos/imunologia , Complexo Antígeno-Anticorpo/química , Complexo Antígeno-Anticorpo/genética , Complexo Antígeno-Anticorpo/imunologia , Regiões Determinantes de Complementaridade , Biologia Computacional , Simulação por Computador , Evolução Molecular Direcionada , Desenho de Fármacos , Humanos , Modelos Moleculares , Método de Monte Carlo , Conformação Proteica , Engenharia de Proteínas/métodos , Engenharia de Proteínas/estatística & dados numéricos
13.
Hum Mutat ; 38(9): 1123-1131, 2017 09.
Artigo em Inglês | MEDLINE | ID: mdl-28370845

RESUMO

The Critical Assessment of Genome Interpretation (CAGI) is a global community experiment to objectively assess computational methods for predicting phenotypic impacts of genomic variation. One of the 2015-2016 competitions focused on predicting the influence of mutations on the allosteric regulation of human liver pyruvate kinase. More than 30 different researchers accessed the challenge data. However, only four groups accepted the challenge. Features used for predictions ranged from evolutionary constraints, mutant site locations relative to active and effector binding sites, and computational docking outputs. Despite the range of expertise and strategies used by predictors, the best predictions were marginally greater than random for modified allostery resulting from mutations. In contrast, several groups successfully predicted which mutations severely reduced enzymatic activity. Nonetheless, poor predictions of allostery stands in stark contrast to the impression left by more than 700 PubMed entries identified using the identifiers "computational + allosteric." This contrast highlights a specialized need for new computational tools and utilization of benchmarks that focus on allosteric regulation.


Assuntos
Benchmarking/métodos , Piruvato Quinase/química , Piruvato Quinase/genética , Regulação Alostérica , Sítio Alostérico , Biologia Computacional/métodos , Bases de Dados Genéticas , Frutosedifosfatos/metabolismo , Humanos , Modelos Moleculares , Mutação , Piruvato Quinase/metabolismo
14.
Nucleic Acids Res ; 43(Database issue): D432-8, 2015 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-25392411

RESUMO

Classification of the structures of the complementarity determining regions (CDRs) of antibodies is critically important for antibody structure prediction and computational design. We have previously performed a clustering of antibody CDR conformations and defined a systematic nomenclature consisting of the CDR, length and an integer starting from the largest to the smallest cluster in the data set (e.g. L1-11-1). We present PyIgClassify (for Python-based immunoglobulin classification; available at http://dunbrack2.fccc.edu/pyigclassify/), a database and web server that provides access to assignments of all CDR structures in the PDB to our classification system. The database includes assignments to the IMGT germline V regions for heavy and light chains for several species. For humanized antibodies, the assignment of the frameworks is to human germlines and the CDRs to the germlines of mice or other species sources. The database can be searched by PDB entry, cluster identifier and IMGT germline group (e.g. human IGHV1). The entire database is downloadable so that users may filter the data as needed for antibody structure analysis, prediction and design.


Assuntos
Regiões Determinantes de Complementaridade/química , Bases de Dados de Proteínas , Animais , Regiões Determinantes de Complementaridade/classificação , Humanos , Cadeias Pesadas de Imunoglobulinas/química , Cadeias Leves de Imunoglobulina/química , Internet , Camundongos
15.
Proteins ; 84 Suppl 1: 260-81, 2016 09.
Artigo em Inglês | MEDLINE | ID: mdl-27081793

RESUMO

CASP11 (the 11th Meeting on the Critical Assessment of Protein Structure Prediction) ran a blind experiment in the refinement of protein structure predictions, the fourth such experiment since CASP8. As with the previous experiments, the predictors were provided with one starting structure from the server models of each of a selected set of template-based modeling targets and asked to refine the coordinates of the starting structure toward native. We assessed the refined structures with the Z-scores of the standard CASP measures, which compare the model-target similarities of the models from all the predictors. Furthermore, we assessed the refined structures with "relative measures," which compare the improvement in accuracy of each model with respect to the starting structure. The latter provides an assessment of the extent to which each predictor group is able to improve the starting structures toward native. We utilized heat maps to display improvements in the Calpha-Calpha distance matrix for each model. The heat maps labeled with each element of secondary structure helped us to identify regions of refinement toward native in each model. Most positively scoring models show modest improvements in multiple regions of the structure, while in some models we were able to identify significant repositioning of N/C-terminal segments and internal elements of secondary structure. The best groups were able to improve more than 70% of the targets from the starting models, and by an average of 3-5% in the standard CASP measures. Proteins 2016; 84(Suppl 1):260-281. © 2016 Wiley Periodicals, Inc.


Assuntos
Benchmarking , Biologia Computacional/estatística & dados numéricos , Modelos Moleculares , Modelos Estatísticos , Proteínas/química , Software , Algoritmos , Motivos de Aminoácidos , Biologia Computacional/métodos , Simulação por Computador , Bases de Dados de Proteínas , Humanos , Internet , Conformação Proteica em alfa-Hélice , Conformação Proteica em Folha beta , Dobramento de Proteína , Domínios e Motivos de Interação entre Proteínas , Estrutura Terciária de Proteína , Homologia Estrutural de Proteína , Termodinâmica
16.
Proteins ; 84 Suppl 1: 200-20, 2016 09.
Artigo em Inglês | MEDLINE | ID: mdl-27081927

RESUMO

We present the assessment of predictions submitted in the template-based modeling (TBM) category of CASP11 (Critical Assessment of Protein Structure Prediction). Model quality was judged on the basis of global and local measures of accuracy on all atoms including side chains. The top groups on 39 human-server targets based on model 1 predictions were LEER, Zhang, LEE, MULTICOM, and Zhang-Server. The top groups on 81 targets by server groups based on model 1 predictions were Zhang-Server, nns, BAKER-ROSETTASERVER, QUARK, and myprotein-me. In CASP11, the best models for most targets were equal to or better than the best template available in the Protein Data Bank, even for targets with poor templates. The overall performance in CASP11 is similar to the performance of predictors in CASP10 with slightly better performance on the hardest targets. For most targets, assessment measures exhibited bimodal probability density distributions. Multi-dimensional scaling of an RMSD matrix for each target typically revealed a single cluster with models similar to the target structure, with a mode in the GDT-TS density between 40 and 90, and a wide distribution of models highly divergent from each other and from the experimental structure, with density mode at a GDT-TS value of ∼20. The models in this peak in the density were either compact models with entirely the wrong fold, or highly non-compact models. The results argue for a density-driven approach in future CASP TBM assessments that accounts for the bimodal nature of these distributions instead of Z scores, which assume a unimodal, Gaussian distribution. Proteins 2016; 84(Suppl 1):200-220. © 2016 Wiley Periodicals, Inc.


Assuntos
Biologia Computacional/estatística & dados numéricos , Modelos Moleculares , Modelos Estatísticos , Proteínas/química , Software , Algoritmos , Animais , Archaea/química , Bactérias/química , Biologia Computacional/métodos , Simulação por Computador , Bases de Dados de Proteínas , Drosophila melanogaster/química , Humanos , Internet , Dobramento de Proteína , Domínios e Motivos de Interação entre Proteínas , Estrutura Secundária de Proteína , Homologia Estrutural de Proteína , Vírus/química
17.
Proteins ; 84 Suppl 1: 370-91, 2016 09.
Artigo em Inglês | MEDLINE | ID: mdl-27181425

RESUMO

In CASP11, the organizers sought to bring the biological inferences from predicted structures to the fore. To accomplish this, we assessed the models for their ability to perform quantifiable tasks related to biological function. First, for 10 targets that were probable homodimers, we measured the accuracy of docking the models into homodimers as a function of GDT-TS of the monomers, which produced characteristic L-shaped plots. At low GDT-TS, none of the models could be docked correctly as homodimers. Above GDT-TS of ∼60%, some models formed correct homodimers in one of the largest docked clusters, while many other models at the same values of GDT-TS did not. Docking was more successful when many of the templates shared the same homodimer. Second, we docked a ligand from an experimental structure into each of the models of one of the targets. Docking to the models with two different programs produced poor ligand RMSDs with the experimental structure. Measures that evaluated similarity of contacts were reasonable for some of the models, although there was not a significant correlation with model accuracy. Finally, we assessed whether models would be useful in predicting the phenotypes of missense mutations in three human targets by comparing features calculated from the models with those calculated from the experimental structures. The models were successful in reproducing accessible surface areas but there was little correlation of model accuracy with calculation of FoldX evaluation of the change in free energy between the wild-type and the mutant. Proteins 2016; 84(Suppl 1):370-391. © 2016 Wiley Periodicals, Inc.


Assuntos
Amidoidrolases/química , Proteínas Quinases Dependentes de AMP Cíclico/química , Proteína gp120 do Envelope de HIV/química , Fator de Crescimento de Hepatócito/química , Modelos Estatísticos , Simulação de Acoplamento Molecular , Proteínas Proto-Oncogênicas/química , Amidoidrolases/genética , Amidoidrolases/metabolismo , Sítios de Ligação , Biologia Computacional/métodos , Proteínas Quinases Dependentes de AMP Cíclico/genética , Proteínas Quinases Dependentes de AMP Cíclico/metabolismo , Proteínas Ligadas por GPI/química , Proteínas Ligadas por GPI/genética , Proteínas Ligadas por GPI/metabolismo , Proteína gp120 do Envelope de HIV/genética , Proteína gp120 do Envelope de HIV/metabolismo , Fator de Crescimento de Hepatócito/genética , Fator de Crescimento de Hepatócito/metabolismo , Humanos , Ligantes , Mutação de Sentido Incorreto , Fenótipo , Ligação Proteica , Domínios Proteicos , Dobramento de Proteína , Multimerização Proteica , Estrutura Secundária de Proteína , Estrutura Terciária de Proteína , Proteínas Proto-Oncogênicas/genética , Proteínas Proto-Oncogênicas/metabolismo , Homologia de Sequência de Aminoácidos , Relação Estrutura-Atividade , Termodinâmica
18.
Proteins ; 84 Suppl 1: 20-33, 2016 09.
Artigo em Inglês | MEDLINE | ID: mdl-26756794

RESUMO

Protein target structures for the Critical Assessment of Structure Prediction round 11 (CASP11) and CASP ROLL were split into domains and classified into categories suitable for assessment of template-based modeling (TBM) and free modeling (FM) based on their evolutionary relatedness to existing structures classified by the Evolutionary Classification of Protein Domains (ECOD) database. First, target structures were divided into domain-based evaluation units. Target splits were based on the domain organization of available templates as well as the performance of servers on whole targets compared to split target domains. Second, evaluation units were classified into TBM and FM categories using a combination of measures that evaluate prediction quality and template detectability. Generally, target domains with sequence-related templates and good server prediction performance were classified as TBM, whereas targets without sequence-identifiable templates and low server performance were classified as FM. As in previous CASP experiments, the boundaries for classification were blurred due to the presence of significant insertions and deteriorations in the targets with respect to homologous templates, as well as the presence of templates with partial coverage of new folds. The FM category included 45 target domains, which represents an unprecedented number of difficult CASP targets provided for modeling. Proteins 2016; 84(Suppl 1):20-33. © 2016 Wiley Periodicals, Inc.


Assuntos
Biologia Computacional/estatística & dados numéricos , Modelos Moleculares , Modelos Estatísticos , Proteínas/química , Software , Animais , Bacteriófagos/química , Biologia Computacional/métodos , Gráficos por Computador , Bases de Dados de Proteínas , Humanos , Cooperação Internacional , Dobramento de Proteína , Domínios e Motivos de Interação entre Proteínas , Multimerização Proteica , Estrutura Secundária de Proteína , Proteínas/classificação , Homologia de Sequência de Aminoácidos
19.
J Natl Compr Canc Netw ; 13(8): 947-52, 2015 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-26285240

RESUMO

Next-generation sequencing of primary and metachronous metastatic cancer lesions may impact patient care. We present a case of relapsed metastatic breast cancer with a dominant pulmonary lesion originally identified as lung adenocarcinoma. A 72-year-old, never-smoker woman with a protracted cough was found to have a large lung mass and regional lymphadenopathy on a chest CT. Lung mass biopsy showed adenocarcinoma with focal TTF-1 (thyroid transcription factor 1) positivity, favoring a lung primary. In addition to stereotactic brain radiation for cerebral metastases, she was started on carboplatin/pemetrexed. As part of the workup, the tumor was analyzed by a 50-gene targeted mutation panel, which detected 3 somatic mutations: ERBB2 (HER2) D769H activating missense mutation, TP53 Y126 inactivating truncating mutation, and SMARCB1 R374Q missense mutation. Of note, the patient had a history of stage IIA triple-negative grade 3 invasive ductal carcinoma of the left breast 1.5 years ago and received neoadjuvant chemotherapy and adjuvant radiation, and underwent a lumpectomy. Further analysis of her primary breast tumor showed a mutational profile identical to that of the lung tumor. Fluorescence in situ hybridization revealed HER2 amplification in the lung tumor, with a HER2/CEP17 ratio of 3.9. The patient was diagnosed with recurrent HER2-positive metastatic breast carcinoma with a coexisting ERBB2 (HER2) activating mutation. Chemotherapy was adjusted to include dual HER2-targeted therapy containing trastuzumab and pertuzumab, resulting in an ongoing partial response. This case demonstrates that a unique genetic mutational profile can clarify whether a tumor represents a metastatic lesion or new malignancy when conventional morphological and immunohistochemical methods are indeterminate, and can directly impact treatment decisions.


Assuntos
Neoplasias da Mama/genética , Amplificação de Genes , Mutação , Receptor ErbB-2/genética , Idoso , Protocolos de Quimioterapia Combinada Antineoplásica/uso terapêutico , Biópsia , Neoplasias Encefálicas/diagnóstico , Neoplasias Encefálicas/secundário , Neoplasias Encefálicas/terapia , Neoplasias da Mama/diagnóstico , Neoplasias da Mama/tratamento farmacológico , Feminino , Testes Genéticos , Humanos , Neoplasias Pulmonares/diagnóstico , Neoplasias Pulmonares/terapia , Estadiamento de Neoplasias , Segunda Neoplasia Primária , Radiografia Torácica , Radiocirurgia , Tomografia Computadorizada por Raios X , Resultado do Tratamento
20.
Clin Trials ; 12(6): 618-26, 2015 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-26152834

RESUMO

BACKGROUND/AIMS: Diverse samples in clinical trials can make findings more generalizable. We sought to characterize the prevalence of clinical trials in the United States that required English fluency for participants to enroll in the trial. METHODS: We randomly chose over 10,000 clinical trial protocols registered with ClinicalTrials.gov and examined the inclusion and exclusion criteria of the trials. We compared the relationship of clinical trial characteristics with English fluency inclusion requirements. We merged the ClinicalTrials.gov data with US Census and American Community Survey data to investigate the association of English-language restrictions with ZIP-code-level demographic characteristics of participating institutions. We used Chi-squared tests, t-tests, and logistic regression models for analyses. RESULTS: English fluency requirements have been increasing over time, from 1.7% of trials having such requirements before 2000 to 9.0% after 2010 (p < 0.001 from Chi-squared test). Industry-sponsored trials had low rates of English fluency requirements (1.8%), while behavioral trials had high rates (28.4%). Trials opening in the Northeast of the United States had the highest regional English requirement rates (10.7%), while trials opening in more than one region had the lowest (3.3%, p<0.001). Since 1995, trials opening in ZIP codes with larger Hispanic populations were less likely to have English fluency requirements (odds ratio=0.92 for each 10% increase in proportion of Hispanics, 95% confidence interval=0.86-0.98, p=0.013). Trials opening in ZIP codes with more residents self-identifying as Black/African American (odds ratio=1.87, 95% confidence interval=1.36-2.58, p<0.001 for restricted cubic spline term) or Asian (odds ratio=1.16 for linear term, 95% confidence interval=1.07-1.25, p<0.001) were more likely to have English fluency requirements. ZIP codes with higher poverty rates had trials with more English-language restrictions (odds ratio=1.06 for a 10% poverty rate increase, 95% confidence interval=1.001-1.11, p=0.045). There was a statistically significant interaction between year and intervention type, such that the increase in English fluency requirements was more common for some interventions than for others. CONCLUSION: The proportion of clinical trials registered with ClinicalTrials.gov that have English fluency requirements for study inclusion has been increasing over time. English-language restrictions are associated with a number of characteristics, including the demographic characteristics of communities in which the sponsoring institutions are located.


Assuntos
Ensaios Clínicos como Assunto , Definição da Elegibilidade , Idioma , Seleção de Pacientes , Bases de Dados Factuais , Feminino , Humanos , Alfabetização , Masculino , Projetos de Pesquisa , Estados Unidos
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA