Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 16 de 16
Filter
1.
Nat Methods ; 8(6): 478-80, 2011 Jun.
Article in English | MEDLINE | ID: mdl-21516116

ABSTRACT

Next-generation sequencing has not been applied to protein-protein interactome network mapping so far because the association between the members of each interacting pair would not be maintained in en masse sequencing. We describe a massively parallel interactome-mapping pipeline, Stitch-seq, that combines PCR stitching with next-generation sequencing and used it to generate a new human interactome dataset. Stitch-seq is applicable to various interaction assays and should help expand interactome network mapping.


Subject(s)
Databases, Protein/statistics & numerical data , Protein Interaction Mapping/statistics & numerical data , Sequence Analysis, DNA/statistics & numerical data , Humans , Open Reading Frames , Polymerase Chain Reaction , Two-Hybrid System Techniques
2.
Retrovirology ; 9: 26, 2012 Mar 29.
Article in English | MEDLINE | ID: mdl-22458338

ABSTRACT

BACKGROUND: Human T-cell leukemia virus type 1 (HTLV-1) and type 2 both target T lymphocytes, yet induce radically different phenotypic outcomes. HTLV-1 is a causative agent of Adult T-cell leukemia (ATL), whereas HTLV-2, highly similar to HTLV-1, causes no known overt disease. HTLV gene products are engaged in a dynamic struggle of activating and antagonistic interactions with host cells. Investigations focused on one or a few genes have identified several human factors interacting with HTLV viral proteins. Most of the available interaction data concern the highly investigated HTLV-1 Tax protein. Identifying shared and distinct host-pathogen protein interaction profiles for these two viruses would enlighten how they exploit distinctive or common strategies to subvert cellular pathways toward disease progression. RESULTS: We employ a scalable methodology for the systematic mapping and comparison of pathogen-host protein interactions that includes stringent yeast two-hybrid screening and systematic retest, as well as two independent validations through an additional protein interaction detection method and a functional transactivation assay. The final data set contained 166 interactions between 10 viral proteins and 122 human proteins. Among the 166 interactions identified, 87 and 79 involved HTLV-1 and HTLV-2 -encoded proteins, respectively. Targets for HTLV-1 and HTLV-2 proteins implicate a diverse set of cellular processes including the ubiquitin-proteasome system, the apoptosis, different cancer pathways and the Notch signaling pathway. CONCLUSIONS: This study constitutes a first pass, with homogeneous data, at comparative analysis of host targets for HTLV-1 and -2 retroviruses, complements currently existing data for formulation of systems biology models of retroviral induced diseases and presents new insights on biological pathways involved in retroviral infection.


Subject(s)
Host-Pathogen Interactions , Human T-lymphotropic virus 1/immunology , Human T-lymphotropic virus 1/pathogenicity , Human T-lymphotropic virus 2/immunology , Human T-lymphotropic virus 2/pathogenicity , T-Lymphocytes/immunology , T-Lymphocytes/virology , Humans , Systems Biology/methods , Two-Hybrid System Techniques
3.
Nat Methods ; 6(1): 83-90, 2009 Jan.
Article in English | MEDLINE | ID: mdl-19060904

ABSTRACT

Several attempts have been made to systematically map protein-protein interaction, or 'interactome', networks. However, it remains difficult to assess the quality and coverage of existing data sets. Here we describe a framework that uses an empirically-based approach to rigorously dissect quality parameters of currently available human interactome maps. Our results indicate that high-throughput yeast two-hybrid (HT-Y2H) interactions for human proteins are more precise than literature-curated interactions supported by a single publication, suggesting that HT-Y2H is suitable to map a significant portion of the human interactome. We estimate that the human interactome contains approximately 130,000 binary interactions, most of which remain to be mapped. Similar to estimates of DNA sequence data quality and genome size early in the Human Genome Project, estimates of protein interaction data quality and interactome size are crucial to establish the magnitude of the task of comprehensive human interactome mapping and to elucidate a path toward this goal.


Subject(s)
Protein Interaction Mapping/methods , Proteins/analysis , Proteins/metabolism , Databases, Protein , Humans , Protein Binding , Proteins/genetics , Sensitivity and Specificity
4.
Nat Methods ; 6(1): 47-54, 2009 Jan.
Article in English | MEDLINE | ID: mdl-19123269

ABSTRACT

To provide accurate biological hypotheses and elucidate global properties of cellular networks, systematic identification of protein-protein interactions must meet high quality standards.We present an expanded C. elegans protein-protein interaction network, or 'interactome' map, derived from testing a matrix of approximately 10,000 x approximately 10,000 proteins using a highly specific, high-throughput yeast two-hybrid system. Through a new empirical quality control framework, we show that the resulting data set (Worm Interactome 2007, or WI-2007) was similar in quality to low-throughput data curated from the literature. We filtered previous interaction data sets and integrated them with WI-2007 to generate a high-confidence consolidated map (Worm Interactome version 8, or WI8). This work allowed us to estimate the size of the worm interactome at approximately 116,000 interactions. Comparison with other types of functional genomic data shows the complementarity of distinct experimental approaches in predicting different functional relationships between genes or proteins


Subject(s)
Caenorhabditis elegans Proteins/analysis , Caenorhabditis elegans Proteins/metabolism , Caenorhabditis elegans/metabolism , Protein Interaction Mapping/methods , Animals , Caenorhabditis elegans/genetics , Caenorhabditis elegans Proteins/genetics , Cell Line , Humans , Protein Binding , Software
5.
Nature ; 437(7062): 1173-8, 2005 Oct 20.
Article in English | MEDLINE | ID: mdl-16189514

ABSTRACT

Systematic mapping of protein-protein interactions, or 'interactome' mapping, was initiated in model organisms, starting with defined biological processes and then expanding to the scale of the proteome. Although far from complete, such maps have revealed global topological and dynamic features of interactome networks that relate to known biological properties, suggesting that a human interactome map will provide insight into development and disease mechanisms at a systems level. Here we describe an initial version of a proteome-scale map of human binary protein-protein interactions. Using a stringent, high-throughput yeast two-hybrid system, we tested pairwise interactions among the products of approximately 8,100 currently available Gateway-cloned open reading frames and detected approximately 2,800 interactions. This data set, called CCSB-HI1, has a verification rate of approximately 78% as revealed by an independent co-affinity purification assay, and correlates significantly with other biological attributes. The CCSB-HI1 data set increases by approximately 70% the set of available binary interactions within the tested space and reveals more than 300 new connections to over 100 disease-associated proteins. This work represents an important step towards a systematic and comprehensive human interactome project.


Subject(s)
Proteome/metabolism , Cloning, Molecular , Humans , Open Reading Frames/genetics , Protein Binding , Proteome/genetics , RNA/genetics , RNA/metabolism , Saccharomyces cerevisiae/genetics , Two-Hybrid System Techniques
6.
Biotechniques ; 35(3): 510-6, 518, 2003 Sep.
Article in English | MEDLINE | ID: mdl-14513556

ABSTRACT

The normalization and subtraction of highly expressed cDNAs from relatively large tissues before cloning dramatically enhanced the gene discovery by sequencing for the mouse full-length cDNA encyclopedia, but these methods have not been suitable for limited RNA materials. To normalize and subtract full-length cDNA libraries derived from limited quantities of total RNA, here we report a method to subtract plasmid libraries excised from size-unbiased amplified lambda phage cDNA libraries that avoids heavily biasing steps such as PCR and plasmid library amplification. The proportion of full-length cDNAs and the gene discovery rate are high, and library diversity can be validated by in silico randomization.


Subject(s)
Gene Expression Profiling/methods , Gene Library , Sequence Alignment/methods , Sequence Analysis, DNA/methods , Transcription, Genetic/genetics , Reproducibility of Results , Sensitivity and Specificity
7.
Genome Res ; 19(7): 1262-9, 2009 Jul.
Article in English | MEDLINE | ID: mdl-19447967

ABSTRACT

"Smart-pooling," in which test reagents are multiplexed in a highly redundant manner, is a promising strategy for achieving high efficiency, sensitivity, and specificity in systems-level projects. However, previous applications relied on low redundancy designs that do not leverage the full potential of smart-pooling, and more powerful theoretical constructions, such as the Shifted Transversal Design (STD), lack experimental validation. Here we evaluate STD smart-pooling in yeast two-hybrid (Y2H) interactome mapping. We employed two STD designs and two established methods to perform ORFeome-wide Y2H screens with 12 baits. We found that STD pooling achieves similar levels of sensitivity and specificity as one-on-one array-based Y2H, while the costs and workloads are divided by three. The screening-sequencing approach is the most cost- and labor-efficient, yet STD identifies about twofold more interactions. Screening-sequencing remains an appropriate method for quickly producing low-coverage interactomes, while STD pooling appears as the method of choice for obtaining maps with higher coverage.


Subject(s)
Caenorhabditis elegans/genetics , Gene Expression Profiling , Protein Interaction Mapping , Sequence Analysis, DNA/methods , Two-Hybrid System Techniques , Animals , Oligonucleotide Array Sequence Analysis , Open Reading Frames/genetics , Sensitivity and Specificity , Software
8.
Science ; 322(5898): 104-10, 2008 Oct 03.
Article in English | MEDLINE | ID: mdl-18719252

ABSTRACT

Current yeast interactome network maps contain several hundred molecular complexes with limited and somewhat controversial representation of direct binary interactions. We carried out a comparative quality assessment of current yeast interactome data sets, demonstrating that high-throughput yeast two-hybrid (Y2H) screening provides high-quality binary interaction information. Because a large fraction of the yeast binary interactome remains to be mapped, we developed an empirically controlled mapping framework to produce a "second-generation" high-quality, high-throughput Y2H data set covering approximately 20% of all yeast binary interactions. Both Y2H and affinity purification followed by mass spectrometry (AP/MS) data are of equally high quality but of a fundamentally different and complementary nature, resulting in networks with different topological and biological properties. Compared to co-complex interactome models, this binary map is enriched for transient signaling interactions and intercomplex connections with a highly significant clustering between essential proteins. Rather than correlating with essentiality, protein connectivity correlates with genetic pleiotropy.


Subject(s)
Protein Interaction Mapping , Saccharomyces cerevisiae Proteins/metabolism , Saccharomyces cerevisiae/metabolism , Computational Biology , Gene Regulatory Networks , Mass Spectrometry , Metabolic Networks and Pathways , Protein Array Analysis , Protein Binding , Protein Interaction Mapping/methods , Protein Interaction Mapping/standards , Proteome/metabolism , Proteomics , Saccharomyces cerevisiae/genetics , Saccharomyces cerevisiae Proteins/genetics , Saccharomyces cerevisiae Proteins/isolation & purification , Signal Transduction , Transcription Factors/metabolism , Two-Hybrid System Techniques
9.
Proc Natl Acad Sci U S A ; 104(18): 7606-11, 2007 May 01.
Article in English | MEDLINE | ID: mdl-17446270

ABSTRACT

A comprehensive mapping of interactions among Epstein-Barr virus (EBV) proteins and interactions of EBV proteins with human proteins should provide specific hypotheses and a broad perspective on EBV strategies for replication and persistence. Interactions of EBV proteins with each other and with human proteins were assessed by using a stringent high-throughput yeast two-hybrid system. Overall, 43 interactions between EBV proteins and 173 interactions between EBV and human proteins were identified. EBV-EBV and EBV-human protein interaction, or "interactome" maps provided a framework for hypotheses of protein function. For example, LF2, an EBV protein of unknown function interacted with the EBV immediate early R transactivator (Rta) and was found to inhibit Rta transactivation. From a broader perspective, EBV genes can be divided into two evolutionary classes, "core" genes, which are conserved across all herpesviruses and subfamily specific, or "noncore" genes. Our EBV-EBV interactome map is enriched for interactions among proteins in the same evolutionary class. Furthermore, human proteins targeted by EBV proteins were enriched for highly connected or "hub" proteins and for proteins with relatively short paths to all other proteins in the human interactome network. Targeting of hubs might be an efficient mechanism for EBV reorganization of cellular processes.


Subject(s)
Epstein-Barr Virus Infections/metabolism , Herpesvirus 4, Human/physiology , Proteins/metabolism , Humans , Promoter Regions, Genetic/genetics , Time Factors
10.
Cell ; 129(6): 1065-79, 2007 Jun 15.
Article in English | MEDLINE | ID: mdl-17574021

ABSTRACT

The karyotypic chaos exhibited by human epithelial cancers complicates efforts to identify mutations critical for malignant transformation. Here we integrate complementary genomic approaches to identify human oncogenes. We show that activation of the ERK and phosphatidylinositol 3-kinase (PI3K) signaling pathways cooperate to transform human cells. Using a library of activated kinases, we identify several kinases that replace PI3K signaling and render cells tumorigenic. Whole genome structural analyses reveal that one of these kinases, IKBKE (IKKepsilon), is amplified and overexpressed in breast cancer cell lines and patient-derived tumors. Suppression of IKKepsilon expression in breast cancer cell lines that harbor IKBKE amplifications induces cell death. IKKepsilon activates the nuclear factor-kappaB (NF-kappaB) pathway in both cell lines and breast cancers. These observations suggest a mechanism for NF-kappaB activation in breast cancer, implicate the NF-kappaB pathway as a downstream mediator of PI3K, and provide a framework for integrated genomic approaches in oncogene discovery.


Subject(s)
Breast Neoplasms/genetics , Breast Neoplasms/metabolism , Gene Expression Regulation, Neoplastic , Genomics , I-kappa B Kinase/genetics , Alleles , Cell Line , Cell Transformation, Neoplastic , Extracellular Signal-Regulated MAP Kinases/metabolism , Gene Library , Genome , Humans , Models, Biological , NF-kappa B/metabolism , Phosphatidylinositol 3-Kinases/metabolism , Signal Transduction
11.
PLoS One ; 2(11): e1235, 2007 Nov 28.
Article in English | MEDLINE | ID: mdl-18043742

ABSTRACT

Rice (Oryza sativa L.) is a model organism for the functional genomics of monocotyledonous plants since the genome size is considerably smaller than those of other monocotyledonous plants. Although highly accurate genome sequences of indica and japonica rice are available, additional resources such as full-length complementary DNA (FL-cDNA) sequences are also indispensable for comprehensive analyses of gene structure and function. We cross-referenced 28.5K individual loci in the rice genome defined by mapping of 578K FL-cDNA clones with the 56K loci predicted in the TIGR genome assembly. Based on the annotation status and the presence of corresponding cDNA clones, genes were classified into 23K annotated expressed (AE) genes, 33K annotated non-expressed (ANE) genes, and 5.5K non-annotated expressed (NAE) genes. We developed a 60mer oligo-array for analysis of gene expression from each locus. Analysis of gene structures and expression levels revealed that the general features of gene structure and expression of NAE and ANE genes were considerably different from those of AE genes. The results also suggested that the cloning efficiency of rice FL-cDNA is associated with the transcription activity of the corresponding genetic locus, although other factors may also have an effect. Comparison of the coverage of FL-cDNA among gene families suggested that FL-cDNA from genes encoding rice- or eukaryote-specific domains, and those involved in regulatory functions were difficult to produce in bacterial cells. Collectively, these results indicate that rice genes can be divided into distinct groups based on transcription activity and gene structure, and that the coverage bias of FL-cDNA clones exists due to the incompatibility of certain eukaryotic genes in bacteria.


Subject(s)
DNA, Complementary/genetics , DNA, Plant/genetics , Gene Expression Profiling , Oryza/genetics , Chromosome Mapping , Exons , Introns
12.
Proc Natl Acad Sci U S A ; 103(41): 15212-7, 2006 Oct 10.
Article in English | MEDLINE | ID: mdl-17018643

ABSTRACT

Songbirds have one of the most accessible neural systems for the study of brain mechanisms of behavior. However, neuroethological studies in songbirds have been limited by the lack of high-throughput molecular resources and gene-manipulation tools. To overcome these limitations, we constructed 21 regular, normalized, and subtracted full-length cDNA libraries from brains of zebra finches in 57 developmental and behavioral conditions in an attempt to clone as much of the brain transcriptome as possible. From these libraries, approximately 14,000 transcripts were isolated, representing an estimated 4,738 genes. With the cDNAs, we created a hierarchically organized transcriptome database and a large-scale songbird brain cDNA microarray. We used the arrays to reveal a set of 33 genes that are regulated in forebrain vocal nuclei by singing behavior. These genes clustered into four anatomical and six temporal expression patterns. Their functions spanned a large range of cellular and molecular categories, from signal transduction, trafficking, and structural, to synaptically released molecules. With the full-length cDNAs and a lentiviral vector system, we were able to overexpress, in vocal nuclei, proteins of representative singing-regulated genes in the absence of singing. This publicly accessible resource http://songbirdtranscriptome.net can now be used to study molecular neuroethological mechanisms of behavior.


Subject(s)
Behavior, Animal/physiology , Ethology , Finches/genetics , Gene Expression Regulation/physiology , Nervous System Physiological Phenomena , Animals , Chickens , Female , Finches/physiology , Gene Expression Profiling , Humans , Male , Molecular Sequence Data , Vocalization, Animal/physiology
13.
Genome Res ; 14(10B): 2162-8, 2004 Oct.
Article in English | MEDLINE | ID: mdl-15489339

ABSTRACT

The recently completed Caenorhabditis elegans genome sequence allows application of high-throughput (HT) approaches for phenotypic analyses using RNA interference (RNAi). As large phenotypic data sets become available, "phenoclustering" strategies can be used to begin understanding the complex molecular networks involved in development and other biological processes. The current HT-RNAi resources represent a great asset for phenotypic profiling but are limited by lack of flexibility. For instance, existing resources do not take advantage of the latest improvements in RNAi technology, such as inducible hairpin RNAi. Here we show that a C. elegans ORFeome resource, generated with the Gateway cloning system, can be used as a starting point to generate alternative HT-RNAi resources with enhanced flexibility. The versatility inherent to the Gateway system suggests that additional HT-RNAi libraries can now be readily generated to perform gene knockdowns under various conditions, increasing the possibilities for phenome mapping in C. elegans.


Subject(s)
Caenorhabditis elegans/genetics , Genome , Open Reading Frames/physiology , Phenotype , RNA Interference , RNA, Helminth/genetics , Animals , Gene Expression Regulation/genetics , Gene Library , Genotype
14.
Genome Res ; 14(10B): 2128-35, 2004 Oct.
Article in English | MEDLINE | ID: mdl-15489335

ABSTRACT

The advent of systems biology necessitates the cloning of nearly entire sets of protein-encoding open reading frames (ORFs), or ORFeomes, to allow functional studies of the corresponding proteomes. Here, we describe the generation of a first version of the human ORFeome using a newly improved Gateway recombinational cloning approach. Using the Mammalian Gene Collection (MGC) resource as a starting point, we report the successful cloning of 8076 human ORFs, representing at least 7263 human genes, as mini-pools of PCR-amplified products. These were assembled into the human ORFeome version 1.1 (hORFeome v1.1) collection. After assessing the overall quality of this version, we describe the use of hORFeome v1.1 for heterologous protein expression in two different expression systems at proteome scale. The hORFeome v1.1 represents a central resource for the cloning of large sets of human ORFs in various settings for functional proteomics of many types, and will serve as the foundation for subsequent improved versions of the human ORFeome.


Subject(s)
Cloning, Molecular , Genomics/methods , Open Reading Frames/genetics , Open Reading Frames/physiology , Proteomics , Gene Expression , Genetic Vectors , Humans , Recombinant Proteins/genetics , Recombinant Proteins/isolation & purification , Recombinant Proteins/metabolism
15.
Genome Res ; 13(6B): 1273-89, 2003 Jun.
Article in English | MEDLINE | ID: mdl-12819125

ABSTRACT

We report the construction of the mouse full-length cDNA encyclopedia,the most extensive view of a complex transcriptome,on the basis of preparing and sequencing 246 libraries. Before cloning,cDNAs were enriched in full-length by Cap-Trapper,and in most cases,aggressively subtracted/normalized. We have produced 1,442,236 successful 3'-end sequences clustered into 171,144 groups, from which 60,770 clones were fully sequenced cDNAs annotated in the FANTOM-2 annotation. We have also produced 547,149 5' end reads,which clustered into 124,258 groups. Altogether, these cDNAs were further grouped in 70,000 transcriptional units (TU),which represent the best coverage of a transcriptome so far. By monitoring the extent of normalization/subtraction, we define the tentative equivalent coverage (TEC),which was estimated to be equivalent to >12,000,000 ESTs derived from standard libraries. High coverage explains discrepancies between the very large numbers of clusters (and TUs) of this project,which also include non-protein-coding RNAs,and the lower gene number estimation of genome annotations. Altogether,5'-end clusters identify regions that are potential promoters for 8637 known genes and 5'-end clusters suggest the presence of almost 63,000 transcriptional starting points. An estimate of the frequency of polyadenylation signals suggests that at least half of the singletons in the EST set represent real mRNAs. Clones accounting for about half of the predicted TUs await further sequencing. The continued high-discovery rate suggests that the task of transcriptome discovery is not yet complete.


Subject(s)
Cloning, Molecular , DNA, Complementary/genetics , Genome , Transcription, Genetic/genetics , Animals , Cloning, Molecular/methods , Cluster Analysis , Databases, Genetic/statistics & numerical data , Expressed Sequence Tags , Gene Expression Profiling/methods , Gene Expression Profiling/statistics & numerical data , Gene Library , Genes/genetics , Genes/physiology , Mice , Mice, Inbred C57BL , Organ Specificity/genetics , Polyadenylation/genetics , RNA Caps/genetics , Sequence Analysis, DNA/methods , Sequence Analysis, DNA/statistics & numerical data
16.
Science ; 303(5657): 540-3, 2004 Jan 23.
Article in English | MEDLINE | ID: mdl-14704431

ABSTRACT

To initiate studies on how protein-protein interaction (or "interactome") networks relate to multicellular functions, we have mapped a large fraction of the Caenorhabditis elegans interactome network. Starting with a subset of metazoan-specific proteins, more than 4000 interactions were identified from high-throughput, yeast two-hybrid (HT=Y2H) screens. Independent coaffinity purification assays experimentally validated the overall quality of this Y2H data set. Together with already described Y2H interactions and interologs predicted in silico, the current version of the Worm Interactome (WI5) map contains approximately 5500 interactions. Topological and biological features of this interactome network, as well as its integration with phenome and transcriptome data sets, lead to numerous biological hypotheses.


Subject(s)
Caenorhabditis elegans Proteins/metabolism , Caenorhabditis elegans/metabolism , Proteome/metabolism , Animals , Caenorhabditis elegans/genetics , Caenorhabditis elegans Proteins/genetics , Computational Biology , Evolution, Molecular , Genes, Helminth , Genomics , Open Reading Frames , Phenotype , Protein Binding , Transcription, Genetic , Two-Hybrid System Techniques
SELECTION OF CITATIONS
SEARCH DETAIL