Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 20
Filtrar
1.
Genes Dev ; 27(7): 749-66, 2013 Apr 01.
Artículo en Inglés | MEDLINE | ID: mdl-23592795

RESUMEN

SFMBT1 (Scm [Sex comb on midleg] with four MBT [malignant brain tumor] domains 1) is a poorly characterized mammalian MBT domain-containing protein homologous to Drosophila SFMBT, a Polycomb group protein involved in epigenetic regulation of gene expression. Here, we show that SFMBT1 regulates transcription in somatic cells and during spermatogenesis through the formation of a stable complex with LSD1 and CoREST. When bound to its gene targets, SFMBT1 recruits its associated proteins and causes chromatin compaction and transcriptional repression. SFMBT1, LSD1, and CoREST share a large fraction of target genes, including those encoding replication-dependent histones. Simultaneous occupancy of histone genes by SFMBT1, LSD1, and CoREST is regulated during the cell cycle and correlates with the loss of RNA polymerase II at these promoters during G2, M, and G1. The interplay between the repressive SFMBT1-LSD1-CoREST complex and RNA polymerase II contributes to the timely transcriptional regulation of histone genes in human cells. SFMBT1, LSD1, and CoREST also form a stable complex in germ cells, and their chromatin binding activity is regulated during spermatogenesis.


Asunto(s)
Cromatina/metabolismo , Regulación de la Expresión Génica , Histona Demetilasas/metabolismo , Histonas , Proteínas Represoras/metabolismo , Animales , Cromatina/genética , Proteínas Co-Represoras , Genoma , Células HEK293 , Células HeLa , Histona Demetilasas/genética , Histonas/genética , Histonas/metabolismo , Humanos , Masculino , Ratones , Ratones Endogámicos C57BL , Proteínas del Tejido Nervioso/metabolismo , Oxidorreductasas N-Desmetilantes/metabolismo , Unión Proteica , Estabilidad Proteica , Transporte de Proteínas , Proteínas Represoras/genética , Células Sf9 , Espermatogénesis/genética , Testículo/metabolismo
2.
Mol Cell ; 45(3): 344-56, 2012 Feb 10.
Artículo en Inglés | MEDLINE | ID: mdl-22325352

RESUMEN

The heterogeneous nature of mammalian PRC1 complexes has hindered our understanding of their biological functions. Here, we present a comprehensive proteomic and genomic analysis that uncovered six major groups of PRC1 complexes, each containing a distinct PCGF subunit, a RING1A/B ubiquitin ligase, and a unique set of associated polypeptides. These PRC1 complexes differ in their genomic localization, and only a small subset colocalize with H3K27me3. Further biochemical dissection revealed that the six PCGF-RING1A/B combinations form multiple complexes through association with RYBP or its homolog YAF2, which prevents the incorporation of other canonical PRC1 subunits, such as CBX, PHC, and SCM. Although both RYBP/YAF2- and CBX/PHC/SCM-containing complexes compact chromatin, only RYBP stimulates the activity of RING1B toward H2AK119ub1, suggesting a central role in PRC1 function. Knockdown of RYBP in embryonic stem cells compromised their ability to form embryoid bodies, likely because of defects in cell proliferation and maintenance of H2AK119ub1 levels.


Asunto(s)
Péptidos y Proteínas de Señalización Intracelular/metabolismo , Complejos Multiproteicos/metabolismo , Proteínas Represoras/metabolismo , Proteínas Represoras/fisiología , Diferenciación Celular , Proliferación Celular , Proteínas Cromosómicas no Histona/genética , Proteínas Cromosómicas no Histona/metabolismo , Proteínas de Unión al ADN/genética , Proteínas de Unión al ADN/metabolismo , Cuerpos Embrioides/metabolismo , Expresión Génica , Células HEK293 , Histonas/metabolismo , Humanos , Péptidos y Proteínas de Señalización Intracelular/genética , Complejos Multiproteicos/genética , Proteínas Musculares/genética , Proteínas Musculares/metabolismo , Nucleosomas/metabolismo , Nucleosomas/ultraestructura , Complejo Represivo Polycomb 1 , Proteínas del Grupo Polycomb , Regiones Promotoras Genéticas , Unión Proteica , Proteómica , Proteínas Represoras/genética , Homología de Secuencia de Aminoácido , Ubiquitina-Proteína Ligasas/genética , Ubiquitina-Proteína Ligasas/metabolismo
3.
Mol Cell ; 47(6): 873-85, 2012 Sep 28.
Artículo en Inglés | MEDLINE | ID: mdl-22864115

RESUMEN

Class switch recombination (CSR) has the potential to generate genomic instability in B cells as activation-induced cytidine deaminase (AID), which mediates this process, is known to target many sites outside Igh. Nonetheless we do not fully understand what factors influence AID targeting genome-wide. Given that errors in CSR can lead to dangerous, oncogenic chromosomal translocations it is important to identify the elements that determine which genes are at risk of being "hit" and could be involved in aberrant rearrangements. Here we have investigated the influence of nuclear organization in determining "off-target" activity and the choice of fusion partners. Our studies indicate that the vast majority of known AID-mediated Igh translocation partners are found in chromosomal domains that contact this locus during class switching. Further, these interaction domains can be used to identify other genes that are hit by AID.


Asunto(s)
Linfocitos B/citología , Citidina Desaminasa/metabolismo , Genes de las Cadenas Pesadas de las Inmunoglobulinas , Cambio de Clase de Inmunoglobulina , Translocación Genética , Animales , Linfocitos B/metabolismo , Citidina Desaminasa/genética , Inestabilidad Genómica , Ratones , Ratones Endogámicos C57BL , Datos de Secuencia Molecular , Hipermutación Somática de Inmunoglobulina
4.
Nat Methods ; 11(5): 559-65, 2014 May.
Artículo en Inglés | MEDLINE | ID: mdl-24658142

RESUMEN

Promoters and enhancers establish precise gene transcription patterns. The development of functional approaches for their identification in mammalian cells has been complicated by the size of these genomes. Here we report a high-throughput functional assay for directly identifying active promoter and enhancer elements called FIREWACh (Functional Identification of Regulatory Elements Within Accessible Chromatin), which we used to simultaneously assess over 80,000 DNA fragments derived from nucleosome-free regions within the chromatin of embryonic stem cells (ESCs) and identify 6,364 active regulatory elements. Many of these represent newly discovered ESC-specific enhancers, showing enriched binding-site motifs for ESC-specific transcription factors including SOX2, POU5F1 (OCT4) and KLF4. The application of FIREWACh to additional cultured cell types will facilitate functional annotation of the genome and expand our view of transcriptional network dynamics.


Asunto(s)
Elementos de Facilitación Genéticos , Regulación de la Expresión Génica , Secuenciación de Nucleótidos de Alto Rendimiento , Regiones Promotoras Genéticas , Animales , Sitios de Unión , Cromatina/química , Biología Computacional , Desoxirribonucleasa I/metabolismo , Células Madre Embrionarias/citología , Citometría de Flujo , Biblioteca de Genes , Genes Reporteros , Técnicas Genéticas , Genoma , Proteínas Fluorescentes Verdes/metabolismo , Factor 4 Similar a Kruppel , Lentivirus/genética , Lentivirus/metabolismo , Luciferasas/metabolismo , Ratones , Plásmidos/metabolismo , Transcripción Genética , Transgenes
5.
Proc Natl Acad Sci U S A ; 111(4): 1253-8, 2014 Jan 28.
Artículo en Inglés | MEDLINE | ID: mdl-24474744

RESUMEN

In a broad range of classification and decision-making problems, one is given the advice or predictions of several classifiers, of unknown reliability, over multiple questions or queries. This scenario is different from the standard supervised setting, where each classifier's accuracy can be assessed using available labeled data, and raises two questions: Given only the predictions of several classifiers over a large set of unlabeled test data, is it possible to (i) reliably rank them and (ii) construct a metaclassifier more accurate than most classifiers in the ensemble? Here we present a spectral approach to address these questions. First, assuming conditional independence between classifiers, we show that the off-diagonal entries of their covariance matrix correspond to a rank-one matrix. Moreover, the classifiers can be ranked using the leading eigenvector of this covariance matrix, because its entries are proportional to their balanced accuracies. Second, via a linear approximation to the maximum likelihood estimator, we derive the Spectral Meta-Learner (SML), an unsupervised ensemble classifier whose weights are equal to these eigenvector entries. On both simulated and real data, SML typically achieves a higher accuracy than most classifiers in the ensemble and can provide a better starting point than majority voting for estimating the maximum likelihood solution. Furthermore, SML is robust to the presence of small malicious groups of classifiers designed to veer the ensemble prediction away from the (unknown) ground truth.


Asunto(s)
Funciones de Verosimilitud , Modelos Teóricos
6.
BMC Bioinformatics ; 17 Suppl 5: 206, 2016 Jun 06.
Artículo en Inglés | MEDLINE | ID: mdl-27295177

RESUMEN

BACKGROUND: Peak calling is a fundamental step in the analysis of data generated by ChIP-seq or similar techniques to acquire epigenetics information. Current peak callers are often hard to parameterise and may therefore be difficult to use for non-bioinformaticians. In this paper, we present the ChIP-seq analysis tool available in CLC Genomics Workbench and CLC Genomics Server (version 7.5 and up), a user-friendly peak-caller designed to be not specific to a particular *-seq protocol. RESULTS: We illustrate the advantages of a shape-based approach and describe the algorithmic principles underlying the implementation. Thanks to the generality of the idea and the fact the algorithm is able to learn the peak shape from the data, the implementation requires only minimal user input, while still being applicable to a range of *-seq protocols. Using independently validated benchmark datasets, we compare our implementation to other state-of-the-art algorithms explicitly designed to analyse ChIP-seq data and provide an evaluation in terms of receiver-operator characteristic (ROC) plots. In order to show the applicability of the method to similar *-seq protocols, we also investigate algorithmic performances on DNase-seq data. CONCLUSIONS: The results show that CLC shape-based peak caller ranks well among popular state-of-the-art peak callers while providing flexibility and ease-of-use.


Asunto(s)
Algoritmos , Genómica/métodos , Área Bajo la Curva , Inmunoprecipitación de Cromatina , Bases de Datos Genéticas , Humanos , Internet , Curva ROC , Análisis de Secuencia de ADN , Factores de Transcripción/genética , Factores de Transcripción/metabolismo , Interfaz Usuario-Computador
7.
Stem Cells ; 33(2): 378-91, 2015 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-25335464

RESUMEN

Both pluripotent embryonic stem cells (ESCs), established from preimplantation murine blastocysts, and epiblast stem cells (EpiSCs), established from postimplantation embryos, can self-renew in culture or differentiate into each of the primary germ layers. While the core transcription factors (TFs) OCT4, SOX2, and NANOG are expressed in both cell types, the gene expression profiles and other features suggest that ESCs and EpiSCs reflect distinct developmental maturation stages of the epiblast in vivo. Accordingly, "naïve" or "ground state" ESCs resemble cells of the inner cell mass, whereas "primed" EpiSCs resemble cells of the postimplantation egg cylinder. To gain insight into the relationship between naïve and primed pluripotent cells, and of each of these pluripotent states to that of nonpluripotent cells, we have used FAIRE-seq to generate a comparative atlas of the accessible chromatin regions within ESCs, EpiSCs, multipotent neural stem cells, and mouse embryonic fibroblasts. We find a distinction between the accessible chromatin patterns of pluripotent and somatic cells that is consistent with the highly related phenotype of ESCs and EpiSCs. However, by defining cell-specific and shared regions of open chromatin, and integrating these data with published gene expression and ChIP analyses, we also illustrate unique features of the chromatin of naïve and primed cells. Functional studies suggest that multiple stage-specific enhancers regulate ESC- or EpiSC-specific gene expression, and implicate auxiliary TFs as important modulators for stage-specific activation by the core TFs. Together these observations provide insights into the chromatin structure dynamics accompanying transitions between these pluripotent states.


Asunto(s)
Blastocisto/metabolismo , Ensamble y Desensamble de Cromatina/fisiología , Cromatina/metabolismo , Células Madre Embrionarias/metabolismo , Células Madre Pluripotentes/metabolismo , Factores de Transcripción/metabolismo , Animales , Blastocisto/citología , Línea Celular , Cromatina/genética , Células Madre Embrionarias/citología , Ratones , Células Madre Multipotentes/citología , Células Madre Multipotentes/metabolismo , Células-Madre Neurales/citología , Células-Madre Neurales/metabolismo , Células Madre Pluripotentes/citología , Factores de Transcripción/genética
8.
Nucleic Acids Res ; 41(17): e165, 2013 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-23892400

RESUMEN

Revealing the clonal composition of a single tumor is essential for identifying cell subpopulations with metastatic potential in primary tumors or with resistance to therapies in metastatic tumors. Sequencing technologies provide only an overview of the aggregate of numerous cells. Computational approaches to de-mix a collective signal composed of the aberrations of a mixed cell population of a tumor sample into its individual components are not available. We propose an evolutionary framework for deconvolving data from a single genome-wide experiment to infer the composition, abundance and evolutionary paths of the underlying cell subpopulations of a tumor. We have developed an algorithm (TrAp) for solving this mixture problem. In silico analyses show that TrAp correctly deconvolves mixed subpopulations when the number of subpopulations and the measurement errors are moderate. We demonstrate the applicability of the method using tumor karyotypes and somatic hypermutation data sets. We applied TrAp to Exome-Seq experiment of a renal cell carcinoma tumor sample and compared the mutational profile of the inferred subpopulations to the mutational profiles of single cells of the same tumor. Finally, we deconvolve sequencing data from eight acute myeloid leukemia patients and three distinct metastases of one melanoma patient to exhibit the evolutionary relationships of their subpopulations.


Asunto(s)
Algoritmos , Evolución Clonal , Neoplasias/genética , Alelos , Biopsia , Carcinoma de Células Renales/genética , Carcinoma de Células Renales/patología , Aberraciones Cromosómicas , Humanos , Cariotipificación , Neoplasias Renales/genética , Neoplasias Renales/patología , Leucemia Mieloide Aguda/genética , Melanoma/genética , Mutación , Metástasis de la Neoplasia
9.
Nucleic Acids Res ; 41(16): e161, 2013 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-23873955

RESUMEN

Researchers generating new genome-wide data in an exploratory sequencing study can gain biological insights by comparing their data with well-annotated data sets possessing similar genomic patterns. Data compression techniques are needed for efficient comparisons of a new genomic experiment with large repositories of publicly available profiles. Furthermore, data representations that allow comparisons of genomic signals from different platforms and across species enhance our ability to leverage these large repositories. Here, we present a signal processing approach that characterizes protein-chromatin interaction patterns at length scales of several kilobases. This allows us to efficiently compare numerous chromatin-immunoprecipitation sequencing (ChIP-seq) data sets consisting of many types of DNA-binding proteins collected from a variety of cells, conditions and organisms. Importantly, these interaction patterns broadly reflect the biological properties of the binding events. To generate these profiles, termed Arpeggio profiles, we applied harmonic deconvolution techniques to the autocorrelation profiles of the ChIP-seq signals. We used 806 publicly available ChIP-seq experiments and showed that Arpeggio profiles with similar spectral densities shared biological properties. Arpeggio profiles of ChIP-seq data sets revealed characteristics that are not easily detected by standard peak finders. They also allowed us to relate sequencing data sets from different genomes, experimental platforms and protocols. Arpeggio is freely available at http://sourceforge.net/p/arpeggio/wiki/Home/.


Asunto(s)
Inmunoprecipitación de Cromatina , Cromatina/metabolismo , Proteínas de Unión al ADN/metabolismo , Compresión de Datos/métodos , Animales , Cromatina/química , Proteínas de Unión al ADN/química , Histonas/metabolismo , Humanos , Ratones , Análisis de Secuencia de ADN , Factores de Transcripción/metabolismo
10.
J Immunol ; 188(12): 6084-92, 2012 Jun 15.
Artículo en Inglés | MEDLINE | ID: mdl-22581861

RESUMEN

Ag receptor diversity involves the introduction of DNA double-stranded breaks during lymphocyte development. To ensure fidelity, cleavage is confined to the G(0)-G(1) phase of the cell cycle. One established mechanism of regulation is through periodic degradation of the RAG2 recombinase protein. However, there are additional levels of protection. In this paper, we show that cyclical changes in the IL-7R signaling pathway functionally segregate pro-B cells according to cell cycle status. In consequence, the level of a downstream effector of IL-7 signaling, phospho-STAT5, is inversely correlated with cell cycle expression of Rag, a key gene involved in recombination. Higher levels of phopho-STAT5 in S-G(2) correlate with decreased Rag expression and Rag relocalization to pericentromeric heterochromatin. These cyclical changes in transcription and locus repositioning are ablated upon transformation with v-Abl, which renders STAT5 constitutively active across the cell cycle. We propose that this activity of the IL-7R/STAT5 pathway plays a critical protective role in development, complementing regulation of RAG2 at the protein level, to ensure that recombination does not occur during replication. Our data, suggesting that pro-B cells are not a single homogeneous population, explain inconsistencies in the role of IL-7 signaling in regulating Igh recombination.


Asunto(s)
Subgrupos de Linfocitos B/inmunología , Ciclo Celular/inmunología , Interleucina-7/inmunología , Células Precursoras de Linfocitos B/inmunología , Animales , Subgrupos de Linfocitos B/citología , Subgrupos de Linfocitos B/metabolismo , Ciclo Celular/genética , Diferenciación Celular/genética , Diferenciación Celular/inmunología , Citometría de Flujo , Reordenamiento Génico de Cadena Pesada de Linfocito B/genética , Reordenamiento Génico de Cadena Pesada de Linfocito B/inmunología , Genes RAG-1 , Proteínas de Homeodominio/biosíntesis , Proteínas de Homeodominio/genética , Proteínas de Homeodominio/inmunología , Hibridación Fluorescente in Situ , Interleucina-7/metabolismo , Ratones , Microscopía Confocal , Análisis de Secuencia por Matrices de Oligonucleótidos , Células Precursoras de Linfocitos B/citología , Células Precursoras de Linfocitos B/metabolismo , Reacción en Cadena de la Polimerasa de Transcriptasa Inversa , Factor de Transcripción STAT5/inmunología , Factor de Transcripción STAT5/metabolismo , Transducción de Señal/inmunología , Transcripción Genética
11.
Nucleic Acids Res ; 40(9): e70, 2012 May.
Artículo en Inglés | MEDLINE | ID: mdl-22307239

RESUMEN

Numerous algorithms have been developed to analyze ChIP-Seq data. However, the complexity of analyzing diverse patterns of ChIP-Seq signals, especially for epigenetic marks, still calls for the development of new algorithms and objective comparisons of existing methods. We developed Qeseq, an algorithm to detect regions of increased ChIP read density relative to background. Qeseq employs critical novel elements, such as iterative recalibration and neighbor joining of reads to identify enriched regions of any length. To objectively assess its performance relative to other 14 ChIP-Seq peak finders, we designed a novel protocol based on Validation Discriminant Analysis (VDA) to optimally select validation sites and generated two validation datasets, which are the most comprehensive to date for algorithmic benchmarking of key epigenetic marks. In addition, we systematically explored a total of 315 diverse parameter configurations from these algorithms and found that typically optimal parameters in one dataset do not generalize to other datasets. Nevertheless, default parameters show the most stable performance, suggesting that they should be used. This study also provides a reproducible and generalizable methodology for unbiased comparative analysis of high-throughput sequencing tools that can facilitate future algorithmic development.


Asunto(s)
Algoritmos , Inmunoprecipitación de Cromatina , Cromatina/metabolismo , Epigénesis Genética , Análisis de Secuencia de ADN , Animales , Artefactos , Secuenciación de Nucleótidos de Alto Rendimiento , Histonas/metabolismo , Ratones
12.
Nat Biotechnol ; 2024 Apr 05.
Artículo en Inglés | MEDLINE | ID: mdl-38580861

RESUMEN

Single-cell RNA sequencing has been widely used to investigate cell state transitions and gene dynamics of biological processes. Current strategies to infer the sequential dynamics of genes in a process typically rely on constructing cell pseudotime through cell trajectory inference. However, the presence of concurrent gene processes in the same group of cells and technical noise can obscure the true progression of the processes studied. To address this challenge, we present GeneTrajectory, an approach that identifies trajectories of genes rather than trajectories of cells. Specifically, optimal transport distances are calculated between gene distributions across the cell-cell graph to extract gene programs and define their gene pseudotemporal order. Here we demonstrate that GeneTrajectory accurately extracts progressive gene dynamics in myeloid lineage maturation. Moreover, we show that GeneTrajectory deconvolves key gene programs underlying mouse skin hair follicle dermal condensate differentiation that could not be resolved by cell trajectory approaches. GeneTrajectory facilitates the discovery of gene programs that control the changes and activities of biological processes.

13.
bioRxiv ; 2023 Nov 13.
Artículo en Inglés | MEDLINE | ID: mdl-38014159

RESUMEN

Accurate cell marker identification in single-cell RNA-seq data is crucial for understanding cellular diversity and function. An ideal marker is highly specific in identifying cells that are similar in terms of function and state. Current marker identification methods, commonly based on clustering and differential expression, capture general cell-type markers but often miss markers for subtypes or functional cell subsets, with their performance largely dependent on clustering quality. Moreover, cluster-independent approaches tend to favor genes that lack the specificity required to characterize regions within the transcriptomic space at multiple scales. Here we introduce Localized Marker Detector (LMD), a novel tool to identify "localized genes" - genes with expression profiles specific to certain groups of highly similar cells - thereby characterizing cellular diversity in a multi-resolution and fine-grained manner. LMD's strategy involves building a cell-cell affinity graph, diffusing the gene expression value across the cell graph, and assigning a score to each gene based on its diffusion dynamics. We show that LMD exhibits superior accuracy in recovering known cell-type markers in the Tabula Muris bone marrow dataset relative to other methods for marker identification. Notably, markers favored by LMD exhibit localized expression, whereas markers prioritized by other clustering-free algorithms are often dispersed in the transcriptomic space. We further group the markers suggested by LMD into functional gene modules to improve the separation of cell types and subtypes in a more fine-grained manner. These modules also identify other sources of variation, such as cell cycle status. In conclusion, LMD is a novel algorithm that can identify fine-grained markers for cell subtypes or functional states without relying on clustering or differential expression analysis. LMD exploits the complex interactions among cells and reveals cellular diversity at high resolution.

14.
Yale J Biol Med ; 85(3): 347-61, 2012 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-23012583

RESUMEN

The heterogeneity of tumor samples is a major challenge in the analysis of high-throughput profiling of tumor biopsies and cell lines. The measured aggregate signals of multigenerational progenies often represent an average of several tumor subclones with varying genomic aberrations and different gene expression levels. The goal of the present study was to integrate copy number analyses from SNP-arrays and karyotyping, gene expression profiling, and pathway analyses to detect heterogeneity, identify driver mutations, and explore possible mechanisms of tumor evolution. We showed the heterogeneity of the studied samples, characterized the global copy number alteration profiles, and identified genes whose copy number status and expression levels were aberrant. In particular, we identified a recurrent association between two BRAF(V600E) and BRAF(V600K) mutations and changes in DKK1 gene expression levels, which might indicate an association between the BRAF and WNT pathways. These findings show that the integrated approaches used in the present study can robustly address the challenging issue of tumor heterogeneity in high-throughput profiling.


Asunto(s)
Variaciones en el Número de Copia de ADN , Perfilación de la Expresión Génica/métodos , Melanoma/genética , Línea Celular Tumoral , Mapeo Cromosómico , Cromosomas Humanos/genética , Evolución Molecular , Regulación Neoplásica de la Expresión Génica , Genes Relacionados con las Neoplasias , Humanos , Péptidos y Proteínas de Señalización Intercelular/genética , Péptidos y Proteínas de Señalización Intercelular/metabolismo , Cariotipificación , Melanoma/metabolismo , Mutación , Análisis de Secuencia por Matrices de Oligonucleótidos/métodos , Polimorfismo de Nucleótido Simple , Proteínas Proto-Oncogénicas B-raf/genética , Proteínas Proto-Oncogénicas B-raf/metabolismo
15.
J Comput Aided Mol Des ; 24(5): 423-31, 2010 May.
Artículo en Inglés | MEDLINE | ID: mdl-20407802

RESUMEN

Norovirus strains are known to cause recurring epidemics of winter vomiting disease. The crystal structure of the capsid protein of VA387, a representative of the clinically important GII.4 genocluster, was recently solved in complex with histo-blood group A- and B-trisaccharides. However, the VA387 strain is known to bind also to other natural carbohydrates for which detailed structural information of the complexes is not available. In this study we have computationally explored the fit of the VA387 with a set of naturally occurring carbohydrate ligands containing a terminal alpha1,2-linked fucose. MD simulations both with explicit and implicit solvent models indicate that type 1 and 3 extensions of the ABO-determinant including ALe(b) and BLe(b) pentasaccharides can be well accommodated in the site. Scoring with Glide XP indicates that the downstream extensions of the ABO-determinants give an increase in binding strength, although the alpha1,2-linked fucose is the single strongest interacting residue. An error was discovered in the geometry of the GalNAc-Gal moiety of the published crystal structure of the A-trisaccharide/VA387 complex. The present modeling of the complexes with histo-blood group A-active structures shows some contacts which provide insight into mutational data, explaining the involvement of I389 and Q331. Our results can be applicable in structure-based design of adhesion inhibitors of noroviruses.


Asunto(s)
Sistema del Grupo Sanguíneo ABO/química , Sistema del Grupo Sanguíneo ABO/metabolismo , Proteínas de la Cápside/química , Proteínas de la Cápside/metabolismo , Simulación por Computador , Norovirus/química , Norovirus/metabolismo , Oligosacáridos/química , Oligosacáridos/metabolismo , Sitios de Unión , Secuencia de Carbohidratos , Humanos , Técnicas In Vitro , Ligandos , Modelos Moleculares , Simulación de Dinámica Molecular , Datos de Secuencia Molecular , Norovirus/patogenicidad , Unión Proteica , Programas Informáticos
16.
J Comput Aided Mol Des ; 24(12): 1009-21, 2010 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-20976527

RESUMEN

The identification of glycan epitopes such as the histo-blood group ABH determinants as docking sites for bacterial/viral infections and signals in growth regulation fuels the interest to develop non-hydrolysable mimetics for therapeutic applications. Inevitably, the required substitution of the linkage oxygen atom will alter the derivative's topology. Our study addresses the question of the impact of substitution of oxygen by selenium. In order to characterize spatial parameters and flexibility of selenoglycosides, we first performed ab initio calculations on model compounds to refine the MM4 force field. The following application of the resulting MM4R version appears to reduce the difference to ab initio data when compared to using the MM4 estimator. Systematic conformational searches on the derivatives of histo-blood group ABH antigens revealed increased flexibility with acquisition of additional low-energy conformer(s), akin to the behavior of S-glycosides. Docking analysis using the Glide program for eight test cases indicated potential for bioactivity, giving further experimental investigation a clear direction to testing Se-glycosides as lectin ligands.


Asunto(s)
Antígenos de Grupos Sanguíneos/química , Diseño de Fármacos , Glicósidos/química , Compuestos de Organoselenio/química , Trisacáridos/química , Sistema del Grupo Sanguíneo ABO , Biomimética , Humanos , Lectinas , Modelos Moleculares , Conformación Molecular , Selenio/química
17.
J Comput Aided Mol Des ; 23(12): 845-52, 2009 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-19757090

RESUMEN

Histo-blood group ABH antigens serve as recognition sites for infectious microorganisms and tissue lectins in intercellular communication, e.g. in tumor progression. Thus, they are of interest as a starting point for drug design. In this respect, potent non-hydrolysable derivatives such as thioglycosides are of special interest. As prerequisite to enable estimations of ligand properties relative to their natural counterparts, conformational properties of the thioglycosidic derivatives of ABH trisaccharides and their disaccharide units were calculated using systematic and filtered systematic searches with the MM4 force field. Parameters for the glycosidic torsions of thioglycosides were independently derived from ab initio calculations. The resulting energy deviations required a reparameterization of MM4 to a new parameter set called MM4R. The data sets obtained using MM4R reveal that the thioglycosides have somewhat increased levels of flexibility about the major low-energy conformations shared with the corresponding O-glycosides. In the trisaccharides, the thiosubstitution of the Gal[NAc]alpha1-3Gal linkage leads to a preference for a conformation which is the secondary minimum of the natural counterparts. This conformation also generates contacts between the N-acetyl group and the fucose moiety in the blood group A derivative. Calculations further indicate that thiosubstitution of only the Fucalpha1-2Gal linkage does not affect the conformational preferences compared to the natural trisaccharide. Thiosubstitution of both linkages in the trisaccharide results in increased flexibility but the favored conformation of the natural trisaccharides is preferred. The study suggests that thioglycoside derivatives of ABH antigens could have pharmaceutical interest as ligands of lectins and other carbohydrate-binding proteins.


Asunto(s)
Antígenos de Grupos Sanguíneos/química , Tioglicósidos/química , Biomimética , Disacáridos/química , Diseño de Fármacos , Modelos Moleculares , Conformación Molecular , Trisacáridos/química
18.
Carbohydr Res ; 340(5): 1019-24, 2005 Apr 11.
Artículo en Inglés | MEDLINE | ID: mdl-15780266

RESUMEN

We present a computational conformational analysis of the exopolysaccharide of Burkholderia cepacia, which is believed to play a role in colonization and persistence of B. cepacia in the lungs of cystic fibrosis patients. The repeating unit of the exopolysaccharide is a heptasaccharide with three branches, which cause significant steric restraints. Conformational searches using glygal, an in-house developed software using genetic algorithm search methods, were performed on fragments as well as on the complete repeating unit with wrap-over residues. The force field used for the calculations was MM3(96). The search showed four favored conformations for an isolated repeating unit. However, for a sequence of several repeating units, the calculations indicate a single, well-defined linear conformation.


Asunto(s)
Burkholderia cepacia/química , Polisacáridos Bacterianos/química , Algoritmos , Conformación de Carbohidratos , Secuencia de Carbohidratos , Datos de Secuencia Molecular , Programas Informáticos
19.
Carbohydr Res ; 340(5): 1059-64, 2005 Apr 11.
Artículo en Inglés | MEDLINE | ID: mdl-15780270

RESUMEN

We have implemented a system called glygal that can perform conformational searches on oligosaccharides using several different genetic algorithm (GA) search methods. The searches are performed in the torsion angle conformational space, considering both the primary glycosidic linkages as well as the pendant groups (C-5-C-6 and hydroxyl groups) where energy calculations are performed using the MM3(96) force field. The system includes a graphical user interface for setting calculation parameters and incorporates a 3D molecular viewer. The system was tested using dozens of structures and we present two case studies for two previously investigated O-specific oligosaccharides of the Shigella dysenteriae type 2 and 4. The results obtained using glygal show a significant reduction in the number of structures that need to be sampled in order to find the best conformation, as compared to filtered systematic search.


Asunto(s)
Oligosacáridos/química , Algoritmos , Conformación de Carbohidratos , Antígenos O/química , Shigella dysenteriae/química , Programas Informáticos
20.
PLoS One ; 6(10): e26074, 2011.
Artículo en Inglés | MEDLINE | ID: mdl-22046256

RESUMEN

The multitude of bioinformatics algorithms designed for performing a particular computational task presents end-users with the problem of selecting the most appropriate computational tool for analyzing their biological data. The choice of the best available method is often based on expensive experimental validation of the results. We propose an approach to design validation sets for method comparison and performance assessment that are effective in terms of cost and discrimination power.Validation Discriminant Analysis (VDA) is a method for designing a minimal validation dataset to allow reliable comparisons between the performances of different algorithms. Implementation of our VDA approach achieves this reduction by selecting predictions that maximize the minimum Hamming distance between algorithmic predictions in the validation set. We show that VDA can be used to correctly rank algorithms according to their performances. These results are further supported by simulations and by realistic algorithmic comparisons in silico.VDA is a novel, cost-efficient method for minimizing the number of validation experiments necessary for reliable performance estimation and fair comparison between algorithms.Our VDA software is available at http://sourceforge.net/projects/klugerlab/files/VDA/


Asunto(s)
Algoritmos , Biología Computacional/métodos , Validación de Programas de Computación , Estudios de Validación como Asunto , Análisis Discriminante , Métodos
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA