RESUMO
Thousands of interactions assemble proteins into modules that impart spatial and functional organization to the cellular proteome. Through affinity-purification mass spectrometry, we have created two proteome-scale, cell-line-specific interaction networks. The first, BioPlex 3.0, results from affinity purification of 10,128 human proteins-half the proteome-in 293T cells and includes 118,162 interactions among 14,586 proteins. The second results from 5,522 immunoprecipitations in HCT116 cells. These networks model the interactome whose structure encodes protein function, localization, and complex membership. Comparison across cell lines validates thousands of interactions and reveals extensive customization. Whereas shared interactions reside in core complexes and involve essential proteins, cell-specific interactions link these complexes, "rewiring" subnetworks within each cell's interactome. Interactions covary among proteins of shared function as the proteome remodels to produce each cell's phenotype. Viewable interactively online through BioPlexExplorer, these networks define principles of proteome organization and enable unknown protein characterization.
Assuntos
Mapeamento de Interação de Proteínas/métodos , Mapas de Interação de Proteínas/genética , Proteoma/genética , Biologia Computacional/métodos , Células HCT116/metabolismo , Células HEK293/metabolismo , Humanos , Espectrometria de Massas/métodos , Mapas de Interação de Proteínas/fisiologia , Proteoma/metabolismo , Proteômica/métodosRESUMO
Protein interactions form a network whose structure drives cellular function and whose organization informs biological inquiry. Using high-throughput affinity-purification mass spectrometry, we identify interacting partners for 2,594 human proteins in HEK293T cells. The resulting network (BioPlex) contains 23,744 interactions among 7,668 proteins with 86% previously undocumented. BioPlex accurately depicts known complexes, attaining 80%-100% coverage for most CORUM complexes. The network readily subdivides into communities that correspond to complexes or clusters of functionally related proteins. More generally, network architecture reflects cellular localization, biological process, and molecular function, enabling functional characterization of thousands of proteins. Network structure also reveals associations among thousands of protein domains, suggesting a basis for examining structurally related proteins. Finally, BioPlex, in combination with other approaches, can be used to reveal interactions of biological or clinical significance. For example, mutations in the membrane protein VAPB implicated in familial amyotrophic lateral sclerosis perturb a defined community of interactors.
Assuntos
Mapas de Interação de Proteínas , Proteômica/métodos , Esclerose Lateral Amiotrófica/genética , Humanos , Espectrometria de Massas , Mapeamento de Interação de Proteínas , Proteínas/química , Proteínas/isolamento & purificação , Proteínas/metabolismoRESUMO
Just as reference genome sequences revolutionized human genetics, reference maps of interactome networks will be critical to fully understand genotype-phenotype relationships. Here, we describe a systematic map of ?14,000 high-quality human binary protein-protein interactions. At equal quality, this map is ?30% larger than what is available from small-scale studies published in the literature in the last few decades. While currently available information is highly biased and only covers a relatively small portion of the proteome, our systematic map appears strikingly more homogeneous, revealing a "broader" human interactome network than currently appreciated. The map also uncovers significant interconnectivity between known and candidate cancer gene products, providing unbiased evidence for an expanded functional cancer landscape, while demonstrating how high-quality interactome models will help "connect the dots" of the genomic revolution.
Assuntos
Mapas de Interação de Proteínas , Proteoma/metabolismo , Animais , Bases de Dados de Proteínas , Estudo de Associação Genômica Ampla , Humanos , Camundongos , Neoplasias/metabolismoRESUMO
The physiology of a cell can be viewed as the product of thousands of proteins acting in concert to shape the cellular response. Coordination is achieved in part through networks of protein-protein interactions that assemble functionally related proteins into complexes, organelles, and signal transduction pathways. Understanding the architecture of the human proteome has the potential to inform cellular, structural, and evolutionary mechanisms and is critical to elucidating how genome variation contributes to disease. Here we present BioPlex 2.0 (Biophysical Interactions of ORFeome-derived complexes), which uses robust affinity purification-mass spectrometry methodology to elucidate protein interaction networks and co-complexes nucleated by more than 25% of protein-coding genes from the human genome, and constitutes, to our knowledge, the largest such network so far. With more than 56,000 candidate interactions, BioPlex 2.0 contains more than 29,000 previously unknown co-associations and provides functional insights into hundreds of poorly characterized proteins while enhancing network-based analyses of domain associations, subcellular localization, and co-complex formation. Unsupervised Markov clustering of interacting proteins identified more than 1,300 protein communities representing diverse cellular activities. Genes essential for cell fitness are enriched within 53 communities representing central cellular functions. Moreover, we identified 442 communities associated with more than 2,000 disease annotations, placing numerous candidate disease genes into a cellular framework. BioPlex 2.0 exceeds previous experimentally derived interaction networks in depth and breadth, and will be a valuable resource for exploring the biology of incompletely characterized proteins and for elucidating larger-scale patterns of proteome organization.
Assuntos
Bases de Dados de Proteínas , Doença , Mapeamento de Interação de Proteínas , Mapas de Interação de Proteínas , Proteoma/metabolismo , Fenômenos Fisiológicos Celulares/genética , Genoma Humano , Humanos , Espaço Intracelular/metabolismo , Cadeias de Markov , Espectrometria de Massas , Anotação de Sequência Molecular , Fases de Leitura Aberta , Proteoma/análise , Proteoma/química , Proteoma/genéticaRESUMO
In cellular systems, biophysical interactions between macromolecules underlie a complex web of functional interactions. How biophysical and functional networks are coordinated, whether all biophysical interactions correspond to functional interactions, and how such biophysical-versus-functional network coordination is shaped by evolutionary forces are all largely unanswered questions. Here, we investigate these questions using an "inter-interactome" approach. We systematically probed the yeast and human proteomes for interactions between proteins from these two species and functionally characterized the resulting inter-interactome network. After a billion years of evolutionary divergence, the yeast and human proteomes are still capable of forming a biophysical network with properties that resemble those of intra-species networks. Although substantially reduced relative to intra-species networks, the levels of functional overlap in the yeast-human inter-interactome network uncover significant remnants of co-functionality widely preserved in the two proteomes beyond human-yeast homologs. Our data support evolutionary selection against biophysical interactions between proteins with little or no co-functionality. Such non-functional interactions, however, represent a reservoir from which nascent functional interactions may arise.
Assuntos
Proteínas Fúngicas/metabolismo , Mapeamento de Interação de Proteínas/métodos , Proteoma/metabolismo , Biologia Computacional/métodos , Bases de Dados de Proteínas , Evolução Molecular , HumanosRESUMO
Next-generation sequencing has not been applied to protein-protein interactome network mapping so far because the association between the members of each interacting pair would not be maintained in en masse sequencing. We describe a massively parallel interactome-mapping pipeline, Stitch-seq, that combines PCR stitching with next-generation sequencing and used it to generate a new human interactome dataset. Stitch-seq is applicable to various interaction assays and should help expand interactome network mapping.
Assuntos
Bases de Dados de Proteínas/estatística & dados numéricos , Mapeamento de Interação de Proteínas/estatística & dados numéricos , Análise de Sequência de DNA/estatística & dados numéricos , Humanos , Fases de Leitura Aberta , Reação em Cadeia da Polimerase , Técnicas do Sistema de Duplo-HíbridoRESUMO
To provide accurate biological hypotheses and elucidate global properties of cellular networks, systematic identification of protein-protein interactions must meet high quality standards.We present an expanded C. elegans protein-protein interaction network, or 'interactome' map, derived from testing a matrix of approximately 10,000 x approximately 10,000 proteins using a highly specific, high-throughput yeast two-hybrid system. Through a new empirical quality control framework, we show that the resulting data set (Worm Interactome 2007, or WI-2007) was similar in quality to low-throughput data curated from the literature. We filtered previous interaction data sets and integrated them with WI-2007 to generate a high-confidence consolidated map (Worm Interactome version 8, or WI8). This work allowed us to estimate the size of the worm interactome at approximately 116,000 interactions. Comparison with other types of functional genomic data shows the complementarity of distinct experimental approaches in predicting different functional relationships between genes or proteins
Assuntos
Proteínas de Caenorhabditis elegans/análise , Proteínas de Caenorhabditis elegans/metabolismo , Caenorhabditis elegans/metabolismo , Mapeamento de Interação de Proteínas/métodos , Animais , Caenorhabditis elegans/genética , Proteínas de Caenorhabditis elegans/genética , Linhagem Celular , Humanos , Ligação Proteica , SoftwareRESUMO
Several attempts have been made to systematically map protein-protein interaction, or 'interactome', networks. However, it remains difficult to assess the quality and coverage of existing data sets. Here we describe a framework that uses an empirically-based approach to rigorously dissect quality parameters of currently available human interactome maps. Our results indicate that high-throughput yeast two-hybrid (HT-Y2H) interactions for human proteins are more precise than literature-curated interactions supported by a single publication, suggesting that HT-Y2H is suitable to map a significant portion of the human interactome. We estimate that the human interactome contains approximately 130,000 binary interactions, most of which remain to be mapped. Similar to estimates of DNA sequence data quality and genome size early in the Human Genome Project, estimates of protein interaction data quality and interactome size are crucial to establish the magnitude of the task of comprehensive human interactome mapping and to elucidate a path toward this goal.
Assuntos
Mapeamento de Interação de Proteínas/métodos , Proteínas/análise , Proteínas/metabolismo , Bases de Dados de Proteínas , Humanos , Ligação Proteica , Proteínas/genética , Sensibilidade e EspecificidadeRESUMO
Plants generate effective responses to infection by recognizing both conserved and variable pathogen-encoded molecules. Pathogens deploy virulence effector proteins into host cells, where they interact physically with host proteins to modulate defense. We generated an interaction network of plant-pathogen effectors from two pathogens spanning the eukaryote-eubacteria divergence, three classes of Arabidopsis immune system proteins, and ~8000 other Arabidopsis proteins. We noted convergence of effectors onto highly interconnected host proteins and indirect, rather than direct, connections between effectors and plant immune receptors. We demonstrated plant immune system functions for 15 of 17 tested host proteins that interact with effectors from both pathogens. Thus, pathogens from different kingdoms deploy independently evolved virulence proteins that interact with a limited set of highly connected cellular hubs to facilitate their diverse life-cycle strategies.
Assuntos
Arabidopsis/imunologia , Arabidopsis/metabolismo , Interações Hospedeiro-Patógeno , Doenças das Plantas/imunologia , Imunidade Vegetal , Receptores Imunológicos/metabolismo , Fatores de Virulência/metabolismo , Arabidopsis/genética , Arabidopsis/microbiologia , Proteínas de Bactérias/metabolismo , Evolução Molecular , Genes de Plantas , Imunidade Inata , Oomicetos/patogenicidade , Mapeamento de Interação de Proteínas , Pseudomonas syringae/patogenicidadeRESUMO
Current yeast interactome network maps contain several hundred molecular complexes with limited and somewhat controversial representation of direct binary interactions. We carried out a comparative quality assessment of current yeast interactome data sets, demonstrating that high-throughput yeast two-hybrid (Y2H) screening provides high-quality binary interaction information. Because a large fraction of the yeast binary interactome remains to be mapped, we developed an empirically controlled mapping framework to produce a "second-generation" high-quality, high-throughput Y2H data set covering approximately 20% of all yeast binary interactions. Both Y2H and affinity purification followed by mass spectrometry (AP/MS) data are of equally high quality but of a fundamentally different and complementary nature, resulting in networks with different topological and biological properties. Compared to co-complex interactome models, this binary map is enriched for transient signaling interactions and intercomplex connections with a highly significant clustering between essential proteins. Rather than correlating with essentiality, protein connectivity correlates with genetic pleiotropy.