Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 14 de 14
Filter
1.
Mol Cell ; 81(14): 2975-2988.e6, 2021 07 15.
Article in English | MEDLINE | ID: mdl-34157308

ABSTRACT

The heterogeneous nature of eukaryotic replication kinetics and the low efficiency of individual initiation sites make mapping the location and timing of replication initiation in human cells difficult. To address this challenge, we have developed optical replication mapping (ORM), a high-throughput single-molecule approach, and used it to map early-initiation events in human cells. The single-molecule nature of our data and a total of >2,500-fold coverage of the human genome on 27 million fibers averaging ∼300 kb in length allow us to identify initiation sites and their firing probability with high confidence. We find that the distribution of human replication initiation is consistent with inefficient, stochastic activation of heterogeneously distributed potential initiation complexes enriched in accessible chromatin. These observations are consistent with stochastic models of initiation-timing regulation and suggest that stochastic regulation of replication kinetics is a fundamental feature of eukaryotic replication, conserved from yeast to humans.


Subject(s)
DNA Replication/genetics , Eukaryotic Cells/physiology , Genome, Human/genetics , Cell Line, Tumor , Chromatin/genetics , DNA Replication Timing/genetics , Genome, Fungal/genetics , Genome-Wide Association Study/methods , HeLa Cells , Humans , Replication Origin/genetics , Saccharomyces cerevisiae/genetics , Transcription Initiation Site/physiology
2.
Bioinformatics ; 36(22-23): 5377-5385, 2021 Apr 01.
Article in English | MEDLINE | ID: mdl-33355667

ABSTRACT

MOTIVATION: The binding of T-cell receptors (TCRs) to their target peptide MHC (pMHC) ligands initializes the cell-mediated immune response. In autoimmune diseases such as multiple sclerosis, the TCR erroneously recognizes self-peptides as foreign and activates an immune response against healthy cells. Such responses can be triggered by cross-recognition of the autoreactive TCR with foreign peptides. Hence, it would be desirable to identify such foreign-antigen triggers to provide a mechanistic understanding of autoimmune diseases. However, the large sequence space of foreign antigens presents an obstacle in the identification of cross-reactive peptides. RESULTS: Here, we present an in silico modeling and scoring method which exploits the structural properties of TCR-pMHC complexes to predict the binding of cross-reactive peptides. We analyzed three mouse TCRs and one human TCR isolated from a patient with multiple sclerosis. Cross-reactive peptides for these TCRs were previously identified via yeast display coupled with deep sequencing, providing a robust dataset for evaluating our method. Modeling query peptides in their associated TCR-pMHC crystal structures, our method accurately selected the top binding peptides from sets containing more than a hundred thousand unique peptides. AVAILABILITY AND IMPLEMENTATION: Analyses were performed using custom Python and R scripts available at https://github.com/weng-lab/antigen-predict. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

3.
Proteins ; 88(8): 1050-1054, 2020 08.
Article in English | MEDLINE | ID: mdl-31994784

ABSTRACT

We report docking performance on the six targets of Critical Assessment of PRedicted Interactions (CAPRI) rounds 39-45 that involved heteromeric protein-protein interactions and had the solved structures released since the rounds were held. Our general strategy involved protein-protein docking using ZDOCK, reranking using IRAD, and structural refinement using Rosetta. In addition, we made extensive use of experimental data to guide our docking runs. All the experimental information at the amino-acid level proved correct. However, for two targets, we also used protein-complex structures as templates for modeling interfaces. These resulted in incorrect predictions, presumably due to the low sequence identity between the targets and templates. Albeit a small number of targets, the performance described here compared somewhat less favorably with our previous CAPRI reports, which may be due to the CAPRI targets being increasingly challenging.


Subject(s)
Molecular Docking Simulation , Peptides/chemistry , Proteins/chemistry , Software , Amino Acid Sequence , Binding Sites , Humans , Ligands , Peptides/metabolism , Protein Binding , Protein Conformation, alpha-Helical , Protein Conformation, beta-Strand , Protein Interaction Domains and Motifs , Protein Interaction Mapping , Protein Multimerization , Proteins/metabolism , Research Design , Structural Homology, Protein
4.
J Immunol ; 199(7): 2203-2213, 2017 10 01.
Article in English | MEDLINE | ID: mdl-28923982

ABSTRACT

T cell specificity emerges from a myriad of processes, ranging from the biological pathways that control T cell signaling to the structural and physical mechanisms that influence how TCRs bind peptides and MHC proteins. Of these processes, the binding specificity of the TCR is a key component. However, TCR specificity is enigmatic: TCRs are at once specific but also cross-reactive. Although long appreciated, this duality continues to puzzle immunologists and has implications for the development of TCR-based therapeutics. In this review, we discuss TCR specificity, emphasizing results that have emerged from structural and physical studies of TCR binding. We show how the TCR specificity/cross-reactivity duality can be rationalized from structural and biophysical principles. There is excellent agreement between predictions from these principles and classic predictions about the scope of TCR cross-reactivity. We demonstrate how these same principles can also explain amino acid preferences in immunogenic epitopes and highlight opportunities for structural considerations in predictive immunology.


Subject(s)
Peptides/immunology , Receptors, Antigen, T-Cell/immunology , T-Cell Antigen Receptor Specificity , Cell Membrane/metabolism , Cross Reactions , Epitopes, T-Lymphocyte/chemistry , Epitopes, T-Lymphocyte/immunology , Epitopes, T-Lymphocyte/metabolism , Humans , Peptides/chemistry , Peptides/metabolism , Protein Binding , Receptors, Antigen, T-Cell/genetics , Receptors, Antigen, T-Cell/metabolism
5.
Genome Res ; 25(12): 1886-92, 2015 Dec.
Article in English | MEDLINE | ID: mdl-26359232

ABSTRACT

Replication timing is a crucial aspect of genome regulation that is strongly correlated with chromatin structure, gene expression, DNA repair, and genome evolution. Replication timing is determined by the timing of replication origin firing, which involves activation of MCM helicase complexes loaded at replication origins. Nonetheless, how the timing of such origin firing is regulated remains mysterious. Here, we show that the number of MCMs loaded at origins regulates replication timing. We show for the first time in vivo that multiple MCMs are loaded at origins. Because early origins have more MCMs loaded, they are, on average, more likely to fire early in S phase. Our results provide a mechanistic explanation for the observed heterogeneity in origin firing and help to explain how defined replication timing profiles emerge from stochastic origin firing. These results establish a framework in which further mechanistic studies on replication timing, such as the strong effect of heterochromatin, can be pursued.


Subject(s)
DNA Replication Timing , DNA Replication , Minichromosome Maintenance Proteins/metabolism , Replication Origin , Cell Cycle/genetics , Chromatin Immunoprecipitation , High-Throughput Nucleotide Sequencing , Protein Binding , Saccharomyces cerevisiae/genetics , Saccharomyces cerevisiae/metabolism
6.
Proteins ; 85(3): 408-416, 2017 03.
Article in English | MEDLINE | ID: mdl-27718275

ABSTRACT

We report the performance of our protein-protein docking pipeline, including the ZDOCK rigid-body docking algorithm, on 19 targets in CAPRI rounds 28-34. Following the docking step, we reranked the ZDOCK predictions using the IRAD scoring function, pruned redundant predictions, performed energy landscape analysis, and utilized our interface prediction approach RCF. In addition, we applied constraints to the search space based on biological information that we culled from the literature, which increased the chance of making a correct prediction. For all but two targets we were able to find and apply biological information and we found the information to be highly accurate, indicating that effective incorporation of biological information is an important component for protein-protein docking. Proteins 2017; 85:408-416. © 2016 Wiley Periodicals, Inc.


Subject(s)
Algorithms , Computational Biology/methods , Molecular Docking Simulation/methods , Proteins/chemistry , Benchmarking , Binding Sites , Cluster Analysis , Databases, Protein , Protein Binding , Protein Conformation , Protein Interaction Mapping , Research Design , Software , Structural Homology, Protein , Thermodynamics
7.
Proteins ; 85(5): 908-916, 2017 05.
Article in English | MEDLINE | ID: mdl-28160322

ABSTRACT

The ATLAS (Altered TCR Ligand Affinities and Structures) database (https://zlab.umassmed.edu/atlas/web/) is a manually curated repository containing the binding affinities for wild-type and mutant T cell receptors (TCRs) and their antigens, peptides presented by the major histocompatibility complex (pMHC). The database links experimentally measured binding affinities with the corresponding three dimensional (3D) structures for TCR-pMHC complexes. The user can browse and search affinities, structures, and experimental details for TCRs, peptides, and MHCs of interest. We expect this database to facilitate the development of next-generation protein design algorithms targeting TCR-pMHC interactions. ATLAS can be easily parsed using modeling software that builds protein structures for training and testing. As an example, we provide structural models for all mutant TCRs in ATLAS, built using the Rosetta program. Utilizing these structures, we report a correlation of 0.63 between experimentally measured changes in binding energies and our predicted changes. Proteins 2017; 85:908-916. © 2016 Wiley Periodicals, Inc.


Subject(s)
Databases, Factual , Histocompatibility Antigens Class I/chemistry , Mutation , Peptides/chemistry , Receptors, Antigen, T-Cell/chemistry , Software , Binding Sites , Histocompatibility Antigens Class I/genetics , Histocompatibility Antigens Class I/immunology , Humans , Internet , Ligands , Models, Molecular , Peptides/immunology , Protein Binding , Protein Conformation , Receptors, Antigen, T-Cell/genetics , Receptors, Antigen, T-Cell/immunology , Thermodynamics
8.
Proteins ; 84 Suppl 1: 323-48, 2016 09.
Article in English | MEDLINE | ID: mdl-27122118

ABSTRACT

We present the results for CAPRI Round 30, the first joint CASP-CAPRI experiment, which brought together experts from the protein structure prediction and protein-protein docking communities. The Round comprised 25 targets from amongst those submitted for the CASP11 prediction experiment of 2014. The targets included mostly homodimers, a few homotetramers, and two heterodimers, and comprised protein chains that could readily be modeled using templates from the Protein Data Bank. On average 24 CAPRI groups and 7 CASP groups submitted docking predictions for each target, and 12 CAPRI groups per target participated in the CAPRI scoring experiment. In total more than 9500 models were assessed against the 3D structures of the corresponding target complexes. Results show that the prediction of homodimer assemblies by homology modeling techniques and docking calculations is quite successful for targets featuring large enough subunit interfaces to represent stable associations. Targets with ambiguous or inaccurate oligomeric state assignments, often featuring crystal contact-sized interfaces, represented a confounding factor. For those, a much poorer prediction performance was achieved, while nonetheless often providing helpful clues on the correct oligomeric state of the protein. The prediction performance was very poor for genuine tetrameric targets, where the inaccuracy of the homology-built subunit models and the smaller pair-wise interfaces severely limited the ability to derive the correct assembly mode. Our analysis also shows that docking procedures tend to perform better than standard homology modeling techniques and that highly accurate models of the protein components are not always required to identify their association modes with acceptable accuracy. Proteins 2016; 84(Suppl 1):323-348. © 2016 Wiley Periodicals, Inc.


Subject(s)
Computational Biology/statistics & numerical data , Models, Statistical , Molecular Docking Simulation , Molecular Dynamics Simulation , Proteins/chemistry , Software , Algorithms , Amino Acid Motifs , Bacteria/chemistry , Binding Sites , Computational Biology/methods , Humans , International Cooperation , Internet , Protein Binding , Protein Conformation, alpha-Helical , Protein Conformation, beta-Strand , Protein Folding , Protein Interaction Domains and Motifs , Protein Multimerization , Protein Structure, Tertiary , Sequence Homology, Amino Acid , Thermodynamics
10.
Nat Genet ; 53(3): 367-378, 2021 03.
Article in English | MEDLINE | ID: mdl-33574602

ABSTRACT

Nuclear compartmentalization of active and inactive chromatin is thought to occur through microphase separation mediated by interactions between loci of similar type. The nature and dynamics of these interactions are not known. We developed liquid chromatin Hi-C to map the stability of associations between loci. Before fixation and Hi-C, chromosomes are fragmented, which removes strong polymeric constraint, enabling detection of intrinsic locus-locus interaction stabilities. Compartmentalization is stable when fragments are larger than 10-25 kb. Fragmentation of chromatin into pieces smaller than 6 kb leads to gradual loss of genome organization. Lamin-associated domains are most stable, whereas interactions for speckle- and polycomb-associated loci are more dynamic. Cohesin-mediated loops dissolve after fragmentation. Liquid chromatin Hi-C provides a genome-wide view of chromosome interaction dynamics.


Subject(s)
Chromatin/chemistry , Chromatin/metabolism , Chromosomes, Human/chemistry , Cell Compartmentation , Cell Cycle Proteins/metabolism , Cell Nucleus/chemistry , Cell Nucleus/genetics , Chromatin/genetics , Chromatin Assembly and Disassembly , Chromosomal Proteins, Non-Histone/metabolism , Chromosomes, Human/metabolism , Half-Life , Humans , K562 Cells , Kinetics , Cohesins
11.
Genome Med ; 12(1): 19, 2020 02 19.
Article in English | MEDLINE | ID: mdl-32075678

ABSTRACT

BACKGROUND: Midbrain dopaminergic neurons (MDN) represent 0.0005% of the brain's neuronal population and mediate cognition, food intake, and metabolism. MDN are also posited to underlay the neurobiological dysfunction of schizophrenia (SCZ), a severe neuropsychiatric disorder that is characterized by psychosis as well as multifactorial medical co-morbidities, including metabolic disease, contributing to markedly increased morbidity and mortality. Paradoxically, however, the genetic risk sequences of psychosis and traits associated with metabolic disease, such as body mass, show very limited overlap. METHODS: We investigated the genomic interaction of SCZ with medical conditions and traits, including body mass index (BMI), by exploring the MDN's "spatial genome," including chromosomal contact landscapes as a critical layer of cell type-specific epigenomic regulation. Low-input Hi-C protocols were applied to 5-10 × 103 dopaminergic and other cell-specific nuclei collected by fluorescence-activated nuclei sorting from the adult human midbrain. RESULTS: The Hi-C-reconstructed MDN spatial genome revealed 11 "Euclidean hot spots" of clustered chromatin domains harboring risk sequences for SCZ and elevated BMI. Inter- and intra-chromosomal contacts interconnecting SCZ and BMI risk sequences showed massive enrichment for brain-specific expression quantitative trait loci (eQTL), with gene ontologies, regulatory motifs and proteomic interactions related to adipogenesis and lipid regulation, dopaminergic neurogenesis and neuronal connectivity, and reward- and addiction-related pathways. CONCLUSIONS: We uncovered shared nuclear topographies of cognitive and metabolic risk variants. More broadly, our PsychENCODE sponsored Hi-C study offers a novel genomic approach for the study of psychiatric and medical co-morbidities constrained by limited overlap of their respective genetic risk architectures on the linear genome.


Subject(s)
Dopaminergic Neurons/metabolism , Polymorphism, Genetic , Quantitative Trait Loci , Schizophrenia/genetics , Adipogenesis , Animals , Body Mass Index , Chromosomes/genetics , Cognition , Humans , Lipid Metabolism , Mesencephalon/cytology , Mesencephalon/metabolism , Mice , Mice, Inbred C57BL , Neurogenesis , Schizophrenia/metabolism , Schizophrenia/pathology
12.
Curr Opin Neurobiol ; 59: 112-119, 2019 12.
Article in English | MEDLINE | ID: mdl-31255842

ABSTRACT

The 'non-linear' genome, or the spatial proximity of non-contiguous sequences, emerges as an important regulatory layer for genome organization and function, including transcriptional regulation. Here, we review recent genome-scale chromosome conformation mappings ('Hi-C') in developing and adult human and mouse brain. Neural differentiation is associated with widespread remodeling of the chromosomal contact map, reflecting dynamic changes in cell-type-specific gene expression programs, with a massive (estimated 20-50%) net loss of chromosomal contacts that is specific for the neuronal lineage. Hi-C datasets provided an unexpected link between locus-specific abnormal expansion of repeat sequences positioned at the boundaries of self-associating topological chromatin domains, and monogenic neurodevelopmental and neurodegenerative disease. Furthermore, integrative cell-type-specific Hi-C and transcriptomic analysis uncovered an expanded genomic risk space for sequences conferring liability for schizophrenia and other cognitive disease. We predict that spatial genome exploration will deliver radically new insights into the brain nucleome in health and disease.


Subject(s)
Neurodegenerative Diseases , Animals , Chromatin , Cognition , Genome , Genomics , Humans
13.
Science ; 362(6420)2018 12 14.
Article in English | MEDLINE | ID: mdl-30545851

ABSTRACT

To explore the developmental reorganization of the three-dimensional genome of the brain in the context of neuropsychiatric disease, we monitored chromosomal conformations in differentiating neural progenitor cells. Neuronal and glial differentiation was associated with widespread developmental remodeling of the chromosomal contact map and included interactions anchored in common variant sequences that confer heritable risk for schizophrenia. We describe cell type-specific chromosomal connectomes composed of schizophrenia risk variants and their distal targets, which altogether show enrichment for genes that regulate neuronal connectivity and chromatin remodeling, and evidence for coordinated transcriptional regulation and proteomic interaction of the participating genes. Developmentally regulated chromosomal conformation changes at schizophrenia-relevant sequences disproportionally occurred in neurons, highlighting the existence of cell type-specific disease risk vulnerabilities in spatial genome organization.


Subject(s)
Chromosomes, Human/chemistry , Connectome , Epigenesis, Genetic , Gene Expression Regulation, Developmental , Genetic Predisposition to Disease , Neural Stem Cells/cytology , Neurogenesis/genetics , Schizophrenia/genetics , Brain/growth & development , Brain/metabolism , Cells, Cultured , Chromatin/chemistry , Chromatin Assembly and Disassembly , Genome, Human , Genome-Wide Association Study , Humans , Male , Neural Stem Cells/metabolism , Neuroglia/cytology , Neurons/cytology , Neurons/metabolism , Nucleic Acid Conformation , Protein Interaction Maps/genetics , Proteomics , Risk , Transcription, Genetic , Transcriptome
14.
Microarrays (Basel) ; 4(3): 339-69, 2015 Aug 12.
Article in English | MEDLINE | ID: mdl-27600228

ABSTRACT

DNA copy number aberrations (CNAs) are of biological and medical interest because they help identify regulatory mechanisms underlying tumor initiation and evolution. Identification of tumor-driving CNAs (driver CNAs) however remains a challenging task, because they are frequently hidden by CNAs that are the product of random events that take place during tumor evolution. Experimental detection of CNAs is commonly accomplished through array comparative genomic hybridization (aCGH) assays followed by supervised and/or unsupervised statistical methods that combine the segmented profiles of all patients to identify driver CNAs. Here, we extend a previously-presented supervised algorithm for the identification of CNAs that is based on a topological representation of the data. Our method associates a two-dimensional (2D) point cloud with each aCGH profile and generates a sequence of simplicial complexes, mathematical objects that generalize the concept of a graph. This representation of the data permits segmenting the data at different resolutions and identifying CNAs by interrogating the topological properties of these simplicial complexes. We tested our approach on a published dataset with the goal of identifying specific breast cancer CNAs associated with specific molecular subtypes. Identification of CNAs associated with each subtype was performed by analyzing each subtype separately from the others and by taking the rest of the subtypes as the control. Our results found a new amplification in 11q at the location of the progesterone receptor in the Luminal A subtype. Aberrations in the Luminal B subtype were found only upon removal of the basal-like subtype from the control set. Under those conditions, all regions found in the original publication, except for 17q, were confirmed; all aberrations, except those in chromosome arms 8q and 12q were confirmed in the basal-like subtype. These two chromosome arms, however, were detected only upon removal of three patients with exceedingly large copy number values. More importantly, we detected 10 and 21 additional regions in the Luminal B and basal-like subtypes, respectively. Most of the additional regions were either validated on an independent dataset and/or using GISTIC. Furthermore, we found three new CNAs in the basal-like subtype: a combination of gains and losses in 1p, a gain in 2p and a loss in 14q. Based on these results, we suggest that topological approaches that incorporate multiresolution analyses and that interrogate topological properties of the data can help in the identification of copy number changes in cancer.

SELECTION OF CITATIONS
SEARCH DETAIL