Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 6 de 6
Filtrar
1.
Proteins ; 85(10): 1944-1956, 2017 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-28688107

RESUMEN

NMR chemical shifts can be computed from molecular dynamics (MD) simulations using a template matching approach and a library of conformers containing chemical shifts generated from ab initio quantum calculations. This approach has potential utility for evaluating the force fields that underlie these simulations. Imperfections in force fields generate flawed atomic coordinates. Chemical shifts obtained from flawed coordinates have errors that can be traced back to these imperfections. We use this approach to evaluate a series of AMBER force fields that have been refined over the course of two decades (ff94, ff96, ff99SB, ff14SB, ff14ipq, and ff15ipq). For each force field a series of MD simulations are carried out for eight model proteins. The calculated chemical shifts for the 1 H, 15 N, and 13 Ca atoms are compared with experimental values. Initial evaluations are based on root mean squared (RMS) errors at the protein level. These results are further refined based on secondary structure and the types of atoms involved in nonbonded interactions. The best chemical shift for identifying force field differences is the shift associated with peptide protons. Examination of the model proteins on a residue by residue basis reveals that force field performance is highly dependent on residue position. Examination of the time course of nonbonded interactions at these sites provides explanations for chemical shift differences at the atomic coordinate level. Results show that the newer ff14ipq and ff15ipq force fields developed with the implicitly polarized charge method perform better than the older force fields.


Asunto(s)
Péptidos/química , Conformación Proteica , Proteínas/química , Simulación de Dinámica Molecular , Resonancia Magnética Nuclear Biomolecular , Estructura Secundaria de Proteína , Teoría Cuántica , Electricidad Estática
2.
Proc Natl Acad Sci U S A ; 111(3): 1114-9, 2014 Jan 21.
Artículo en Inglés | MEDLINE | ID: mdl-24395800

RESUMEN

The underpinnings of STAT3 hyperphosphorylation resulting in enhanced signaling and cancer progression are incompletely understood. Loss-of-function mutations of enzymes that dephosphorylate STAT3, such as receptor protein tyrosine phosphatases, which are encoded by the PTPR gene family, represent a plausible mechanism of STAT3 hyperactivation. We analyzed whole exome sequencing (n = 374) and reverse-phase protein array data (n = 212) from head and neck squamous cell carcinomas (HNSCCs). PTPR mutations are most common and are associated with significantly increased phospho-STAT3 expression in HNSCC tumors. Expression of receptor-like protein tyrosine phosphatase T (PTPRT) mutant proteins induces STAT3 phosphorylation and cell survival, consistent with a "driver" phenotype. Computational modeling reveals functional consequences of PTPRT mutations on phospho-tyrosine-substrate interactions. A high mutation rate (30%) of PTPRs was found in HNSCC and 14 other solid tumors, suggesting that PTPR alterations, in particular PTPRT mutations, may define a subset of patients where STAT3 pathway inhibitors hold particular promise as effective therapeutic agents.


Asunto(s)
Regulación Neoplásica de la Expresión Génica , Neoplasias de Cabeza y Cuello/metabolismo , Mutación , Proteínas Tirosina Fosfatasas Clase 2 Similares a Receptores/genética , Factor de Transcripción STAT3/metabolismo , Secuencia de Aminoácidos , Línea Celular Tumoral , Supervivencia Celular , Simulación por Computador , Células HEK293 , Humanos , Inmunohistoquímica , Modelos Moleculares , Datos de Secuencia Molecular , Fosforilación , Estructura Terciaria de Proteína , Proteoma , Proteínas Tirosina Fosfatasas Clase 2 Similares a Receptores/metabolismo , Transfección
3.
BMC Bioinformatics ; 9: 72, 2008 Jan 30.
Artículo en Inglés | MEDLINE | ID: mdl-18234090

RESUMEN

BACKGROUND: A new algorithm has been developed for generating conservation profiles that reflect the evolutionary history of the subfamily associated with a query sequence. It is based on n-gram patterns (NP{n,m}) which are sets of n residues and m wildcards in windows of size n+m. The generation of conservation profiles is treated as a signal-to-noise problem where the signal is the count of n-gram patterns in target sequences that are similar to the query sequence and the noise is the count over all target sequences. The signal is differentiated from the noise by applying singular value decomposition to sets of target sequences rank ordered by similarity with respect to the query. RESULTS: The new algorithm was used to construct 4,248 profiles from 120 randomly selected Pfam-A families. These were compared to profiles generated from multiple alignments using the consensus approach. The two profiles were similar whenever the subfamily associated with the query sequence was well represented in the multiple alignment. It was possible to construct subfamily specific conservation profiles using the new algorithm for subfamilies with as few as five members. The speed of the new algorithm was comparable to the multiple alignment approach. CONCLUSION: Subfamily specific conservation profiles can be generated by the new algorithm without aprioi knowledge of family relationships or domain architecture. This is useful when the subfamily contains multiple domains with different levels of representation in protein databases. It may also be applicable when the subfamily sample size is too small for the multiple alignment approach.


Asunto(s)
Algoritmos , Secuencia Conservada , Reconocimiento de Normas Patrones Automatizadas/métodos , Proteínas/química , Alineación de Secuencia/métodos , Análisis de Secuencia de Proteína/métodos , Secuencia de Aminoácidos , Datos de Secuencia Molecular
4.
Proteins ; 68(4): 830-8, 2007 Sep 01.
Artículo en Inglés | MEDLINE | ID: mdl-17523186

RESUMEN

An n-gram pattern (NP{n,m}) in a protein sequence is a set of n residues and m wildcards in a window of size n+m. Each window of n+m amino acids is associated with a collection of NP{n,m} patterns based on the combinatorics of n+m objects taken m at a time. NP{n,m} patterns that are shared between sequences reflect evolutionary relationships. Recently the authors developed an alignment-independent protein classification algorithm based on shared NP{4,2} patterns that compared favorably to PSI-BLAST. Theoretically, NP{4,2} patterns should also reflect secondary structure propensity since they contain all possible n-grams for 1 < or = n < or = 4 and a window of 6 residues is wide enough to capture periodicities in the 2 < or = n < or = 5 range. This sparked interest in differentiating the information content in NP{4,2} patterns related to evolution from the content related to local propensity. The probability of alpha-, beta-, and coil components was determined for every NP{4,2} pattern over all the chains in the Protein Data Bank (PDB). An algorithm exclusively based on the Z-values of these distributions was developed, which accurately predicted 71-76% of alpha-helical segments and 62-67% of beta-sheets in rigorous jackknife tests. This provided evidence for the strong correlation between NP{4,2} patterns and secondary structure. By grouping PDB chains into subsets with increasing levels of sequence identity, it was also possible to separate the evolutionary and local propensity contributions to the classification process. The results showed that information derived from evolutionary relationships was more important for beta-sheet prediction than alpha-helix prediction.


Asunto(s)
Estructura Secundaria de Proteína , Proteínas/química , Secuencia de Aminoácidos , Aminoácidos/análisis , Aminoácidos/química , Bases de Datos de Proteínas , Modelos Moleculares , Datos de Secuencia Molecular , Nanotecnología , Conformación Proteica , Proteínas/clasificación
5.
Comput Theor Chem ; 1099: 152-166, 2017 Jan 01.
Artículo en Inglés | MEDLINE | ID: mdl-29109930

RESUMEN

Accurate chemical shifts for the atoms in molecular mechanics (MD) trajectories can be obtained from quantum mechanical (QM) calculations that depend solely on the coordinates of the atoms in the localized regions surrounding atoms of interest. If these coordinates are correct and the sample size is adequate, the ensemble average of these chemical shifts should be equal to the chemical shifts obtained from NMR spectroscopy. If this is not the case, the coordinates must be incorrect. We have utilized this fact to quantify the errors associated with the backbone atoms in MD simulations of proteins. A library of regional conformers containing 169,499 members was constructed from 6 model proteins. The chemical shifts associated with the backbone atoms in each of these conformers was obtained from QM calculations using density functional theory at the B3LYP level with a 6-311+G(2d,p) basis set. Chemical shifts were assigned to each backbone atom in each MD simulation frame using a template matching approach. The ensemble average of these chemical shifts was compared to chemical shifts from NMR spectroscopy. A large systematic error was identified that affected the 1H atoms of the peptide bonds involved in hydrogen bonding with water molecules or peptide backbone atoms. This error was highly sensitive to changes in electrostatic parameters. Smaller errors affecting the 13Ca and 15N atoms were also detected. We believe these errors could be useful as metrics for comparing the force-fields and parameter sets used in MD simulation because they are directly tied to errors in atomic coordinates.

6.
Appl Bioinformatics ; 3(2-3): 137-48, 2004.
Artículo en Inglés | MEDLINE | ID: mdl-15693739

RESUMEN

Annotation of the rapidly accumulating body of sequence data relies heavily on the detection of remote homologues and functional motifs in protein families. The most popular methods rely on sequence alignment. These include programs that use a scoring matrix to compare the probability of a potential alignment with random chance and programs that use curated multiple alignments to train profile hidden Markov models (HMMs). Related approaches depend on bootstrapping multiple alignments from a single sequence. However, alignment-based programs have limitations. They make the assumption that contiguity is conserved between homologous segments, which may not be true in genetic recombination or horizontal transfer. Alignments also become ambiguous when sequence similarity drops below 40%. This has kindled interest in classification methods that do not rely on alignment. An approach to classification without alignment based on the distribution of contiguous sequences of four amino acids (4-grams) was developed. Interest in 4-grams stemmed from the observation that almost all theoretically possible 4-grams (20(4)) occur in natural sequences and the majority of 4-grams are uniformly distributed. This implies that the probability of finding identical 4-grams by random chance in unrelated sequences is low. A Bayesian probabilistic model was developed to test this hypothesis. For each protein family in Pfam-A and PIR-PSD, a feature vector called a probe was constructed from the set of 4-grams that best characterised the family. In rigorous jackknife tests, unknown sequences from Pfam-A and PIR-PSD were compared with the probes for each family. A classification result was deemed a true positive if the probe match with the highest probability was in first place in a rank-ordered list. This was achieved in 70% of cases. Analysis of false positives suggested that the precision might approach 85% if selected families were clustered into subsets. Case studies indicated that the 4-grams in common between an unknown and the best matching probe correlated with functional motifs from PRINTS. The results showed that remote homologues and functional motifs could be identified from an analysis of 4-gram patterns.


Asunto(s)
Algoritmos , Reconocimiento de Normas Patrones Automatizadas/métodos , Proteínas/química , Proteínas/clasificación , Análisis de Secuencia de Proteína/métodos , Secuencias de Aminoácidos , Secuencia de Aminoácidos , Secuencia Conservada , Modelos Químicos , Modelos Moleculares , Modelos Estadísticos , Datos de Secuencia Molecular , Proteínas/análisis , Alineación de Secuencia , Homología de Secuencia de Aminoácido
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA