Search | VHL Search Portal

1.

Plasma proteomic associations with genetics and health in the UK Biobank.

Sun, Benjamin B; Chiou, Joshua; Traylor, Matthew; Benner, Christian; Hsu, Yi-Hsiang; Richardson, Tom G; Surendran, Praveen; Mahajan, Anubha; Robins, Chloe; Vasquez-Grinnell, Steven G; Hou, Liping; Kvikstad, Erika M; Burren, Oliver S; Davitte, Jonathan; Ferber, Kyle L; Gillies, Christopher E; Hedman, Åsa K; Hu, Sile; Lin, Tinchi; Mikkilineni, Rajesh; Pendergrass, Rion K; Pickering, Corran; Prins, Bram; Baird, Denis; Chen, Chia-Yen; Ward, Lucas D; Deaton, Aimee M; Welsh, Samantha; Willis, Carissa M; Lehner, Nick; Arnold, Matthias; Wörheide, Maria A; Suhre, Karsten; Kastenmüller, Gabi; Sethi, Anurag; Cule, Madeleine; Raj, Anil; Burkitt-Gray, Lucy; Melamud, Eugene; Black, Mary Helen; Fauman, Eric B; Howson, Joanna M M; Kang, Hyun Min; McCarthy, Mark I; Nioi, Paul; Petrovski, Slavé; Scott, Robert A; Smith, Erin N; Szalma, Sándor; Waterworth, Dawn M.

Nature ; 622(7982): 329-338, 2023 Oct.

Article in English | MEDLINE | ID: mdl-37794186

ABSTRACT

The Pharma Proteomics Project is a precompetitive biopharmaceutical consortium characterizing the plasma proteomic profiles of 54,219 UK Biobank participants. Here we provide a detailed summary of this initiative, including technical and biological validations, insights into proteomic disease signatures, and prediction modelling for various demographic and health indicators. We present comprehensive protein quantitative trait locus (pQTL) mapping of 2,923 proteins that identifies 14,287 primary genetic associations, of which 81% are previously undescribed, alongside ancestry-specific pQTL mapping in non-European individuals. The study provides an updated characterization of the genetic architecture of the plasma proteome, contextualized with projected pQTL discovery rates as sample sizes and proteomic assay coverages increase over time. We offer extensive insights into trans pQTLs across multiple biological domains, highlight genetic influences on ligand-receptor interactions and pathway perturbations across a diverse collection of cytokines and complement networks, and illustrate long-range epistatic effects of ABO blood group and FUT2 secretor status on proteins with gastrointestinal tissue-enriched expression. We demonstrate the utility of these data for drug discovery by extending the genetic proxied effects of protein targets, such as PCSK9, on additional endpoints, and disentangle specific genes and proteins perturbed at loci associated with COVID-19 susceptibility. This public-private partnership provides the scientific community with an open-access proteomics resource of considerable breadth and depth to help to elucidate the biological mechanisms underlying proteo-genomic discoveries and accelerate the development of biomarkers, predictive models and therapeutics1.

Subject(s)

Biological Specimen Banks , Blood Proteins , Databases, Factual , Genomics , Health , Proteome , Proteomics , Humans , ABO Blood-Group System/genetics , Blood Proteins/analysis , Blood Proteins/genetics , COVID-19/genetics , Drug Discovery , Epistasis, Genetic , Fucosyltransferases/metabolism , Genetic Predisposition to Disease , Plasma/chemistry , Proprotein Convertase 9/metabolism , Proteome/analysis , Proteome/genetics , Public-Private Sector Partnerships , Quantitative Trait Loci , United Kingdom , Galactoside 2-alpha-L-fucosyltransferase

2.

Supervised enhancer prediction with epigenetic pattern recognition and targeted validation.

Sethi, Anurag; Gu, Mengting; Gumusgoz, Emrah; Chan, Landon; Yan, Koon-Kiu; Rozowsky, Joel; Barozzi, Iros; Afzal, Veena; Akiyama, Jennifer A; Plajzer-Frick, Ingrid; Yan, Chengfei; Novak, Catherine S; Kato, Momoe; Garvin, Tyler H; Pham, Quan; Harrington, Anne; Mannion, Brandon J; Lee, Elizabeth A; Fukuda-Yuzawa, Yoko; Visel, Axel; Dickel, Diane E; Yip, Kevin Y; Sutton, Richard; Pennacchio, Len A; Gerstein, Mark.

Nat Methods ; 17(8): 807-814, 2020 08.

Article in English | MEDLINE | ID: mdl-32737473

ABSTRACT

Enhancers are important non-coding elements, but they have traditionally been hard to characterize experimentally. The development of massively parallel assays allows the characterization of large numbers of enhancers for the first time. Here, we developed a framework using Drosophila STARR-seq to create shape-matching filters based on meta-profiles of epigenetic features. We integrated these features with supervised machine-learning algorithms to predict enhancers. We further demonstrated that our model could be transferred to predict enhancers in mammals. We comprehensively validated the predictions using a combination of in vivo and in vitro approaches, involving transgenic assays in mice and transduction-based reporter assays in human cell lines (153 enhancers in total). The results confirmed that our model can accurately predict enhancers in different species without re-parameterization. Finally, we examined the transcription factor binding patterns at predicted enhancers versus promoters. We demonstrated that these patterns enable the construction of a secondary model that effectively distinguishes enhancers and promoters.

Subject(s)

Epigenesis, Genetic/physiology , Pattern Recognition, Automated/methods , Animals , Cell Line , Drosophila , Histones/genetics , Histones/metabolism , Humans , Mice , Mice, Transgenic , Reproducibility of Results

3.

Comprehensive Molecular Characterization of Papillary Renal-Cell Carcinoma.

Linehan, W Marston; Spellman, Paul T; Ricketts, Christopher J; Creighton, Chad J; Fei, Suzanne S; Davis, Caleb; Wheeler, David A; Murray, Bradley A; Schmidt, Laura; Vocke, Cathy D; Peto, Myron; Al Mamun, Abu Amar M; Shinbrot, Eve; Sethi, Anurag; Brooks, Samira; Rathmell, W Kimryn; Brooks, Angela N; Hoadley, Katherine A; Robertson, A Gordon; Brooks, Denise; Bowlby, Reanne; Sadeghi, Sara; Shen, Hui; Weisenberger, Daniel J; Bootwalla, Moiz; Baylin, Stephen B; Laird, Peter W; Cherniack, Andrew D; Saksena, Gordon; Haake, Scott; Li, Jun; Liang, Han; Lu, Yiling; Mills, Gordon B; Akbani, Rehan; Leiserson, Mark D M; Raphael, Benjamin J; Anur, Pavana; Bottaro, Donald; Albiges, Laurence; Barnabas, Nandita; Choueiri, Toni K; Czerniak, Bogdan; Godwin, Andrew K; Hakimi, A Ari; Ho, Thai H; Hsieh, James; Ittmann, Michael; Kim, William Y; Krishnan, Bhavani.

N Engl J Med ; 374(2): 135-45, 2016 Jan 14.

Article in English | MEDLINE | ID: mdl-26536169

ABSTRACT

BACKGROUND: Papillary renal-cell carcinoma, which accounts for 15 to 20% of renal-cell carcinomas, is a heterogeneous disease that consists of various types of renal cancer, including tumors with indolent, multifocal presentation and solitary tumors with an aggressive, highly lethal phenotype. Little is known about the genetic basis of sporadic papillary renal-cell carcinoma, and no effective forms of therapy for advanced disease exist. METHODS: We performed comprehensive molecular characterization of 161 primary papillary renal-cell carcinomas, using whole-exome sequencing, copy-number analysis, messenger RNA and microRNA sequencing, DNA-methylation analysis, and proteomic analysis. RESULTS: Type 1 and type 2 papillary renal-cell carcinomas were shown to be different types of renal cancer characterized by specific genetic alterations, with type 2 further classified into three individual subgroups on the basis of molecular differences associated with patient survival. Type 1 tumors were associated with MET alterations, whereas type 2 tumors were characterized by CDKN2A silencing, SETD2 mutations, TFE3 fusions, and increased expression of the NRF2-antioxidant response element (ARE) pathway. A CpG island methylator phenotype (CIMP) was observed in a distinct subgroup of type 2 papillary renal-cell carcinomas that was characterized by poor survival and mutation of the gene encoding fumarate hydratase (FH). CONCLUSIONS: Type 1 and type 2 papillary renal-cell carcinomas were shown to be clinically and biologically distinct. Alterations in the MET pathway were associated with type 1, and activation of the NRF2-ARE pathway was associated with type 2; CDKN2A loss and CIMP in type 2 conveyed a poor prognosis. Furthermore, type 2 papillary renal-cell carcinoma consisted of at least three subtypes based on molecular and phenotypic features. (Funded by the National Institutes of Health.).

Subject(s)

Carcinoma, Papillary/metabolism , Kidney Neoplasms/metabolism , Mutation , NF-E2-Related Factor 2/metabolism , Proto-Oncogene Proteins c-met/metabolism , Carcinoma, Papillary/genetics , CpG Islands/physiology , DNA Methylation , Humans , Kidney Neoplasms/genetics , MicroRNAs/chemistry , NF-E2-Related Factor 2/genetics , Phenotype , Proto-Oncogene Proteins c-met/chemistry , Proto-Oncogene Proteins c-met/genetics , RNA, Messenger/chemistry , RNA, Neoplasm/chemistry , Sequence Analysis, RNA , Signal Transduction/physiology

4.

Increased enzyme binding to substrate is not necessary for more efficient cellulose hydrolysis.

Gao, Dahai; Chundawat, Shishir P S; Sethi, Anurag; Balan, Venkatesh; Gnanakaran, S; Dale, Bruce E.

Proc Natl Acad Sci U S A ; 110(27): 10922-7, 2013 Jul 02.

Article in English | MEDLINE | ID: mdl-23784776

ABSTRACT

Substrate binding is typically one of the rate-limiting steps preceding enzyme catalytic action during homogeneous reactions. However, interfacial-based enzyme catalysis on insoluble crystalline substrates, like cellulose, has additional bottlenecks of individual biopolymer chain decrystallization from the substrate interface followed by its processive depolymerization to soluble sugars. This additional decrystallization step has ramifications on the role of enzyme-substrate binding and its relationship to overall catalytic efficiency. We found that altering the crystalline structure of cellulose from its native allomorph I(ß) to III(I) results in 40-50% lower binding partition coefficient for fungal cellulases, but surprisingly, it enhanced hydrolytic activity on the latter allomorph. We developed a comprehensive kinetic model for processive cellulases acting on insoluble substrates to explain this anomalous finding. Our model predicts that a reduction in the effective binding affinity to the substrate coupled with an increase in the decrystallization procession rate of individual cellulose chains from the substrate surface into the enzyme active site can reproduce our anomalous experimental findings.

Subject(s)

Cellulose/metabolism , Biofuels , Cellulase/metabolism , Cellulose/chemistry , Fungal Proteins/metabolism , Hydrolysis , Kinetics , Lignin/chemistry , Lignin/metabolism , Protein Binding , Substrate Specificity , Trichoderma/enzymology

5.

Membrane-mediated regulation of the intrinsically disordered CD3Ïµ cytoplasmic tail of the TCR.

López, Cesar A; Sethi, Anurag; Goldstein, Byron; Wilson, Bridget S; Gnanakaran, S.

Biophys J ; 108(10): 2481-2491, 2015 May 19.

Article in English | MEDLINE | ID: mdl-25992726

ABSTRACT

The regulation of T-cell-mediated immune responses depends on the phosphorylation of immunoreceptor tyrosine-based activation motifs (ITAMs) on T-cell receptors. Although many details of the signaling cascades are well understood, the initial mechanism and regulation of ITAM phosphorylation remains unknown. We used molecular dynamics simulations to study the influence of different compositions of lipid bilayers on the membrane association of the CD3Ïµ cytoplasmic tails of the T-cell receptors. Our results show that binding of CD3Ïµ to membranes is modulated by both the presence of negatively charged lipids and the lipid order of the membrane. Free-energy calculations reveal that the protein-membrane interaction is favored by the presence of nearby basic residues and the ITAM tyrosines. Phosphorylation minimizes membrane association, rendering the ITAM motif more accessible to binding partners. In systems mimicking biological membranes, the CD3Ïµ chain localization is modulated by different facilitator lipids (e.g., gangliosides or phosphoinositols), revealing a plausible regulatory effect on activation through the regulation of lipid composition in cell membranes.

Subject(s)

Intrinsically Disordered Proteins/chemistry , Lipid Bilayers/metabolism , Receptors, Antigen, T-Cell/chemistry , Amino Acid Motifs , Amino Acid Sequence , Gangliosides/metabolism , Humans , Intrinsically Disordered Proteins/metabolism , Lipid Bilayers/chemistry , Molecular Dynamics Simulation , Molecular Sequence Data , Phosphatidylinositols/metabolism , Protein Binding , Protein Structure, Tertiary , Receptors, Antigen, T-Cell/metabolism

6.

Viral escape from neutralizing antibodies in early subtype A HIV-1 infection drives an increase in autologous neutralization breadth.

Murphy, Megan K; Yue, Ling; Pan, Ruimin; Boliar, Saikat; Sethi, Anurag; Tian, Jianhui; Pfafferot, Katja; Karita, Etienne; Allen, Susan A; Cormier, Emmanuel; Goepfert, Paul A; Borrow, Persephone; Robinson, James E; Gnanakaran, S; Hunter, Eric; Kong, Xiang-Peng; Derdeyn, Cynthia A.

PLoS Pathog ; 9(2): e1003173, 2013 Feb.

Article in English | MEDLINE | ID: mdl-23468623

ABSTRACT

Antibodies that neutralize (nAbs) genetically diverse HIV-1 strains have been recovered from a subset of HIV-1 infected subjects during chronic infection. Exact mechanisms that expand the otherwise narrow neutralization capacity observed during early infection are, however, currently undefined. Here we characterized the earliest nAb responses in a subtype A HIV-1 infected Rwandan seroconverter who later developed moderate cross-clade nAb breadth, using (i) envelope (Env) glycoproteins from the transmitted/founder virus and twenty longitudinal nAb escape variants, (ii) longitudinal autologous plasma, and (iii) autologous monoclonal antibodies (mAbs). Initially, nAbs targeted a single region of gp120, which flanked the V3 domain and involved the alpha2 helix. A single amino acid change at one of three positions in this region conferred early escape. One immunoglobulin heavy chain and two light chains recovered from autologous B cells comprised two mAbs, 19.3H-L1 and 19.3H-L3, which neutralized the founder Env along with one or three of the early escape variants carrying these mutations, respectively. Neither mAb neutralized later nAb escape or heterologous Envs. Crystal structures of the antigen-binding fragments (Fabs) revealed flat epitope contact surfaces, where minimal light chain mutation in 19.3H-L3 allowed for additional antigenic interactions. Resistance to mAb neutralization arose in later Envs through alteration of two glycans spatially adjacent to the initial escape signatures. The cross-neutralizing nAbs that ultimately developed failed to target any of the defined V3-proximal changes generated during the first year of infection in this subject. Our data demonstrate that this subject's first recognized nAb epitope elicited strain-specific mAbs, which incrementally acquired autologous breadth, and directed later B cell responses to target distinct portions of Env. This immune re-focusing could have triggered the evolution of cross-clade antibodies and suggests that exposure to a specific sequence of immune escape variants might promote broad humoral responses during HIV-1 infection.

Subject(s)

Antibodies, Neutralizing/immunology , Antibodies, Viral/immunology , HIV Infections/immunology , HIV-1/immunology , Immune Evasion/immunology , Amino Acid Sequence , Antibodies, Neutralizing/chemistry , Cross Reactions/immunology , Female , HIV Infections/virology , HIV Seropositivity/immunology , HIV Seropositivity/virology , Humans , Immunoglobulin Heavy Chains/immunology , Immunoglobulin Light Chains/immunology , Male , Molecular Sequence Data , Protein Structure, Tertiary , Sequence Alignment

7.

Binding and movement of individual Cel7A cellobiohydrolases on crystalline cellulose surfaces revealed by single-molecule fluorescence imaging.

Jung, Jaemyeong; Sethi, Anurag; Gaiotto, Tiziano; Han, Jason J; Jeoh, Tina; Gnanakaran, Sandrasegaram; Goodwin, Peter M.

J Biol Chem ; 288(33): 24164-72, 2013 Aug 16.

Article in English | MEDLINE | ID: mdl-23818525

ABSTRACT

The efficient catalytic conversion of biomass to bioenergy would meet a large portion of energy requirements in the near future. A crucial step in this process is the enzyme-catalyzed hydrolysis of cellulose to glucose that is then converted into fuel such as ethanol by fermentation. Here we use single-molecule fluorescence imaging to directly monitor the movement of individual Cel7A cellobiohydrolases from Trichoderma reesei (TrCel7A) on the surface of insoluble cellulose fibrils to elucidate molecular level details of cellulase activity. The motion of multiple, individual TrCel7A cellobiohydrolases was simultaneously recorded with â¼15-nm spatial resolution. Time-resolved localization microscopy provides insights on the activity of TrCel7A on cellulose and informs on nonproductive binding and diffusion. We measured single-molecule residency time distributions of TrCel7A bound to cellulose both in the presence of and absence of cellobiose the major product and a potent inhibitor of Cel7A activity. Combining these results with a kinetic model of TrCel7A binding provides microscopic insight into interactions between TrCel7A and the cellulose substrate.

Subject(s)

Cellulose 1,4-beta-Cellobiosidase/metabolism , Cellulose/metabolism , Optical Imaging/methods , Trichoderma/enzymology , Adsorption/drug effects , Cellulose 1,4-beta-Cellobiosidase/antagonists & inhibitors , Electrophoresis, Polyacrylamide Gel , Enzyme Inhibitors/pharmacology , Fluorescence , Hydrogen-Ion Concentration/drug effects , Microscopy, Atomic Force , Models, Biological , Protein Binding/drug effects , Protein Transport/drug effects , Solubility , Substrate Specificity/drug effects , Surface Properties , Time Factors

8.

A mechanistic understanding of allosteric immune escape pathways in the HIV-1 envelope glycoprotein.

Sethi, Anurag; Tian, Jianhui; Derdeyn, Cynthia A; Korber, Bette; Gnanakaran, S.

PLoS Comput Biol ; 9(5): e1003046, 2013.

Article in English | MEDLINE | ID: mdl-23696718

ABSTRACT

The HIV-1 envelope (Env) spike, which consists of a compact, heterodimeric trimer of the glycoproteins gp120 and gp41, is the target of neutralizing antibodies. However, the high mutation rate of HIV-1 and plasticity of Env facilitates viral evasion from neutralizing antibodies through various mechanisms. Mutations that are distant from the antibody binding site can lead to escape, probably by changing the conformation or dynamics of Env; however, these changes are difficult to identify and define mechanistically. Here we describe a network analysis-based approach to identify potential allosteric immune evasion mechanisms using three known HIV-1 Env gp120 protein structures from two different clades, B and C. First, correlation and principal component analyses of molecular dynamics (MD) simulations identified a high degree of long-distance coupled motions that exist between functionally distant regions within the intrinsic dynamics of the gp120 core, supporting the presence of long-distance communication in the protein. Then, by integrating MD simulations with network theory, we identified the optimal and suboptimal communication pathways and modules within the gp120 core. The results unveil both strain-dependent and -independent characteristics of the communication pathways in gp120. We show that within the context of three structurally homologous gp120 cores, the optimal pathway for communication is sequence sensitive, i.e. a suboptimal pathway in one strain becomes the optimal pathway in another strain. Yet the identification of conserved elements within these communication pathways, termed inter-modular hotspots, could present a new opportunity for immunogen design, as this could be an additional mechanism that HIV-1 uses to shield vulnerable antibody targets in Env that induce neutralizing antibody breadth.

Subject(s)

HIV Envelope Protein gp120 , HIV-1 , Immune Evasion/physiology , Molecular Dynamics Simulation , Binding Sites , CD4 Antigens/chemistry , CD4 Antigens/metabolism , Computational Biology , HIV Envelope Protein gp120/chemistry , HIV Envelope Protein gp120/metabolism , HIV Envelope Protein gp120/physiology , HIV Infections/virology , HIV-1/chemistry , HIV-1/metabolism , HIV-1/physiology , Humans , Principal Component Analysis , Protein Conformation , Reproducibility of Results , Structural Homology, Protein

9.

Taste of sugar at the membrane: thermodynamics and kinetics of the interaction of a disaccharide with lipid bilayers.

Tian, Jianhui; Sethi, Anurag; Swanson, Basil I; Goldstein, Byron; Gnanakaran, S.

Biophys J ; 104(3): 622-32, 2013 Feb 05.

Article in English | MEDLINE | ID: mdl-23442913

ABSTRACT

Sugar recognition at the membrane is critical in various physiological processes. Many aspects of sugar-membrane interaction are still unknown. We take an integrated approach by combining conventional molecular-dynamics simulations with enhanced sampling methods and analytical models to understand the thermodynamics and kinetics of a di-mannose molecule in a phospholipid bilayer system. We observe that di-mannose has a slight preference to localize at the water-phospholipid interface. Using umbrella sampling, we show the free energy bias for this preferred location to be just -0.42 kcal/mol, which explains the coexistence of attraction and exclusion mechanisms of sugar-membrane interaction. Accurate estimation of absolute entropy change of water molecules with a two-phase model indicates that the small energy bias is the result of a favorable entropy change of water molecules. Then, we incorporate results from molecular-dynamics simulation in two different ways to an analytical diffusion-reaction model to obtain association and dissociation constants for di-mannose interaction with membrane. Finally, we verify our approach by predicting concentration dependence of di-mannose recognition at the membrane that is consistent with experiment. In conclusion, we provide a combined approach for the thermodynamics and kinetics of a weak ligand-binding system, which has broad implications across many different fields.

Subject(s)

Disaccharides/chemistry , Lipid Bilayers/chemistry , Thermodynamics , Kinetics , Lipids/chemistry , Mannose/chemistry , Water/chemistry

10.

Deducing conformational variability of intrinsically disordered proteins from infrared spectroscopy with Bayesian statistics.

Sethi, Anurag; Anunciado, Divina; Tian, Jianhui; Vu, Dung M; Gnanakaran, S.

Chem Phys ; 4222013 Aug 30.

Article in English | MEDLINE | ID: mdl-24187427

ABSTRACT

As it remains practically impossible to generate ergodic ensembles for large intrinsically disordered proteins (IDP) with molecular dynamics (MD) simulations, it becomes critical to compare spectroscopic characteristics of the theoretically generated ensembles to corresponding measurements. We develop a Bayesian framework to infer the ensemble properties of an IDP using a combination of conformations generated by MD simulations and its measured infrared spectrum. We performed 100 different MD simulations totaling more than 10 µs to characterize the conformational ensemble of αsynuclein, a prototypical IDP, in water. These conformations are clustered based on solvent accessibility and helical content. We compute the amide-I band for these clusters and predict the thermodynamic weights of each cluster given the measured amide-I band. Bayesian analysis produces a reproducible and non-redundant set of thermodynamic weights for each cluster, which can then be used to calculate the ensemble properties. In a rigorous validation, these weights reproduce measured chemical shifts.

11.

Genetics implicates overactive osteogenesis in the development of diffuse idiopathic skeletal hyperostosis.

Sethi, Anurag; Ruby, J Graham; Veras, Matthew A; Telis, Natalie; Melamud, Eugene.

Nat Commun ; 14(1): 2644, 2023 05 08.

Article in English | MEDLINE | ID: mdl-37156767

ABSTRACT

Diffuse idiopathic skeletal hyperostosis (DISH) is a condition where adjacent vertebrae become fused through formation of osteophytes. The genetic and epidemiological etiology of this condition is not well understood. Here, we implemented a machine learning algorithm to assess the prevalence and severity of the pathology in ~40,000 lateral DXA scans in the UK Biobank Imaging cohort. We find that DISH is highly prevalent, above the age of 45, ~20% of men and ~8% of women having multiple osteophytes. Surprisingly, we find strong phenotypic and genetic association of DISH with increased bone mineral density and content throughout the entire skeletal system. Genetic association analysis identified ten loci associated with DISH, including multiple genes involved in bone remodeling (RUNX2, IL11, GDF5, CCDC91, NOG, and ROR2). Overall, this study describes genetics of DISH and implicates the role of overactive osteogenesis as a key driver of the pathology.

Subject(s)

Hyperostosis, Diffuse Idiopathic Skeletal , Osteophyte , Male , Humans , Female , Hyperostosis, Diffuse Idiopathic Skeletal/diagnostic imaging , Hyperostosis, Diffuse Idiopathic Skeletal/genetics , Hyperostosis, Diffuse Idiopathic Skeletal/complications , Osteogenesis/genetics , Osteophyte/complications , Osteophyte/pathology , Spine/pathology , Absorptiometry, Photon

12.

Identification of minimally interacting modules in an intrinsically disordered protein.

Sethi, Anurag; Tian, Jianhui; Vu, Dung M; Gnanakaran, S.

Biophys J ; 103(4): 748-57, 2012 Aug 22.

Article in English | MEDLINE | ID: mdl-22947936

ABSTRACT

The conformational characterization of intrinsically disordered proteins (IDPs) is complicated by their conformational heterogeneity and flexibility. If an IDP could somehow be divided into smaller fragments and reconstructed later, theoretical and spectroscopic studies could probe its conformational variability in detail. Here, we used replica molecular-dynamics simulations and network theory to explore whether such a divide-and-conquer strategy is feasible for α-synuclein, a prototypical IDP. We characterized the conformational variability of α-synuclein by conducting >100 unbiased all-atom molecular-dynamics simulations, for a total of >10 µs of trajectories. In these simulations, α-synuclein formed a heterogeneous ensemble of collapsed coil states in an aqueous environment. These states were stabilized by heterogeneous contacts between sequentially distant regions. We find that α-synuclein contains residual secondary structures in the collapsed states, and the heterogeneity in the collapsed state makes it feasible to split α-synuclein into sequentially contiguous minimally interacting fragments. This study reveals previously unknown characteristics of α-synuclein and provides a new (to our knowledge) approach for studying other IDPs.

Subject(s)

Molecular Dynamics Simulation , alpha-Synuclein/chemistry , Peptide Fragments/chemistry , Protein Structure, Secondary , Vibration , Water/chemistry , alpha-Synuclein/metabolism

13.

The B cell response is redundant and highly focused on V1V2 during early subtype C infection in a Zambian seroconverter.

Lynch, Rebecca M; Rong, Rong; Boliar, Saikat; Sethi, Anurag; Li, Bing; Mulenga, Joseph; Allen, Susan; Robinson, James E; Gnanakaran, S; Derdeyn, Cynthia A.

J Virol ; 85(2): 905-15, 2011 Jan.

Article in English | MEDLINE | ID: mdl-20980495

ABSTRACT

High-titer autologous neutralizing antibody responses have been demonstrated during early subtype C human immunodeficiency virus type 1 (HIV-1) infection. However, characterization of this response against autologous virus at the monoclonal antibody (MAb) level has only recently begun to be elucidated. Here we describe five monoclonal antibodies derived from a subtype C-infected seroconverter and their neutralizing activities against pseudoviruses that carry envelope glycoproteins from 48 days (0 month), 2 months, and 8 months after the estimated time of infection. Sequence analysis indicated that the MAbs arose from three distinct B cell clones, and their pattern of neutralization compared to that in patient plasma suggested that they circulated between 2 and 8 months after infection. Neutralization by MAbs representative of each B cell clone was mapped to two residues: position 134 in V1 and position 189 in V2. Mutational analysis revealed cooperative effects between glycans and residues at these two positions, arguing that they contribute to a single epitope. Analysis of the cognate gp120 sequence through homology modeling places this potential epitope near the interface between the V1 and V2 loops. Additionally, the escape mutation R189S in V2, which conferred resistance against all three MAbs, had no detrimental effect on virus replication in vitro. Taken together, our data demonstrate that independent B cells repeatedly targeted a single structure in V1V2 during early infection. Despite this assault, a single amino acid change was sufficient to confer complete escape with minimal impact on replication fitness.

Subject(s)

Antibodies, Monoclonal/immunology , Antibodies, Neutralizing/immunology , B-Lymphocytes/immunology , HIV Antibodies/immunology , HIV Envelope Protein gp120/immunology , HIV Infections/immunology , HIV-1/immunology , DNA Mutational Analysis , Epitopes/genetics , Epitopes/immunology , Genotype , Glycosylation , HIV-1/classification , Humans , RNA, Viral/genetics , Sequence Analysis, DNA , Zambia

14.

Quantifying intramolecular binding in multivalent interactions: a structure-based synergistic study on Grb2-Sos1 complex.

Sethi, Anurag; Goldstein, Byron; Gnanakaran, S.

PLoS Comput Biol ; 7(10): e1002192, 2011 Oct.

Article in English | MEDLINE | ID: mdl-22022247

ABSTRACT

Numerous signaling proteins use multivalent binding to increase the specificity and affinity of their interactions within the cell. Enhancement arises because the effective binding constant for multivalent binding is larger than the binding constants for each individual interaction. We seek to gain both qualitative and quantitative understanding of the multivalent interactions of an adaptor protein, growth factor receptor bound protein-2 (Grb2), containing two SH3 domains interacting with the nucleotide exchange factor son-of-sevenless 1 (Sos1) containing multiple polyproline motifs separated by flexible unstructured regions. Grb2 mediates the recruitment of Sos1 from the cytosol to the plasma membrane where it activates Ras by inducing the exchange of GDP for GTP. First, using a combination of evolutionary information and binding energy calculations, we predict an additional polyproline motif in Sos1 that binds to the SH3 domains of Grb2. This gives rise to a total of five polyproline motifs in Sos1 that are capable of binding to the two SH3 domains of Grb2. Then, using a hybrid method combining molecular dynamics simulations and polymer models, we estimate the enhancement in local concentration of a polyproline motif on Sos1 near an unbound SH3 domain of Grb2 when its other SH3 domain is bound to a different polyproline motif on Sos1. We show that the local concentration of the Sos1 motifs that a Grb2 SH3 domain experiences is approximately 1000 times greater than the cellular concentration of Sos1. Finally, we calculate the intramolecular equilibrium constants for the crosslinking of Grb2 on Sos1 and use thermodynamic modeling to calculate the stoichiometry. With these equilibrium constants, we are able to predict the distribution of complexes that form at physiological concentrations. We believe this is the first systematic analysis that combines sequence, structure, and thermodynamic analyses to determine the stoichiometry of the complexes that are dominant in the cellular environment.

Subject(s)

GRB2 Adaptor Protein/metabolism , SOS1 Protein/metabolism , Amino Acid Sequence , Animals , GRB2 Adaptor Protein/chemistry , Humans , Models, Molecular , Molecular Dynamics Simulation , Molecular Sequence Data , Molecular Structure , Probability , Protein Binding , SOS1 Protein/chemistry , Sequence Homology, Amino Acid

15.

Dynamical networks in tRNA:protein complexes.

Sethi, Anurag; Eargle, John; Black, Alexis A; Luthey-Schulten, Zaida.

Proc Natl Acad Sci U S A ; 106(16): 6620-5, 2009 Apr 21.

Article in English | MEDLINE | ID: mdl-19351898

ABSTRACT

Community network analysis derived from molecular dynamics simulations is used to identify and compare the signaling pathways in a bacterial glutamyl-tRNA synthetase (GluRS):tRNA(Glu) and an archaeal leucyl-tRNA synthetase (LeuRS):tRNA(Leu) complex. Although the 2 class I synthetases have remarkably different interactions with their cognate tRNAs, the allosteric networks for charging tRNA with the correct amino acid display considerable similarities. A dynamic contact map defines the edges connecting nodes (amino acids and nucleotides) in the physical network whose overall topology is presented as a network of communities, local substructures that are highly intraconnected, but loosely interconnected. Whereas nodes within a single community can communicate through many alternate pathways, the communication between monomers in different communities has to take place through a smaller number of critical edges or interactions. Consistent with this analysis, there are a large number of suboptimal paths that can be used for communication between the identity elements on the tRNAs and the catalytic site in the aaRS:tRNA complexes. Residues and nucleotides in the majority of pathways for intercommunity signal transmission are evolutionarily conserved and are predicted to be important for allosteric signaling. The same monomers are also found in a majority of the suboptimal paths. Modifying these residues or nucleotides has a large effect on the communication pathways in the protein:RNA complex consistent with kinetic data.

Subject(s)

Archaea/enzymology , Bacteria/enzymology , Glutamate-tRNA Ligase/metabolism , Leucine-tRNA Ligase/metabolism , RNA, Transfer, Amino Acyl/metabolism , Computer Simulation , Protein Structure, Secondary , Transfer RNA Aminoacylation

16.

Joint inference of physiological network and survival analysis identifies factors associated with aging rate.

Sethi, Anurag; Melamud, Eugene.

Cell Rep Methods ; 2(12): 100356, 2022 12 19.

Article in English | MEDLINE | ID: mdl-36590696

ABSTRACT

We describe methodology for joint reconstruction of physiological-survival networks from observational data capable of identifying key survival-associated variables, inferring a minimal physiological network structure, and bridging this network to the Gompertzian survival layer. Using synthetic network structures, we show that the method is capable of identifying aging variables in cohorts as small as 5,000 participants. Applying the methodology to the observational human cohort, we find that interleukin-6, vascular calcification, and red-blood distribution width are strong predictors of baseline fitness. More important, we find that red blood cell counts, kidney function, and phosphate level are directly linked to the Gompertzian aging rate. Our model therefore enables discovery of processes directly linked to the aging rate of our species. We further show that this epidemiological framework can be applied as a causal inference engine to simulate the effects of interventions on physiology and longevity.

Subject(s)

Aging , Gene Regulatory Networks , Humans , Causality , Survival Analysis

17.

Calcification of the abdominal aorta is an under-appreciated cardiovascular disease risk factor in the general population.

Sethi, Anurag; Taylor, D Leland; Ruby, J Graham; Venkataraman, Jagadish; Sorokin, Elena; Cule, Madeleine; Melamud, Eugene.

Front Cardiovasc Med ; 9: 1003246, 2022.

Article in English | MEDLINE | ID: mdl-36277789

ABSTRACT

Calcification of large arteries is a high-risk factor in the development of cardiovascular diseases, however, due to the lack of routine monitoring, the pathology remains severely under-diagnosed and prevalence in the general population is not known. We have developed a set of machine learning methods to quantitate levels of abdominal aortic calcification (AAC) in the UK Biobank imaging cohort and carried out the largest to-date analysis of genetic, biochemical, and epidemiological risk factors associated with the pathology. In a genetic association study, we identified three novel loci associated with AAC (FGF9, NAV9, and APOE), and replicated a previously reported association at the TWIST1/HDAC9 locus. We find that AAC is a highly prevalent pathology, with ~ 1 in 10 adults above the age of 40 showing significant levels of hydroxyapatite build-up (Kauppila score > 3). Presentation of AAC was strongly predictive of future cardiovascular events including stenosis of precerebral arteries (HR~1.5), myocardial infarction (HR~1.3), ischemic heart disease (HR~1.3), as well as other diseases such as chronic obstructive pulmonary disease (HR~1.3). Significantly, we find that the risk for myocardial infarction from elevated AAC (HR ~1.4) was comparable to the risk of hypercholesterolemia (HR~1.4), yet most people who develop AAC are not hypercholesterolemic. Furthermore, the overwhelming majority (98%) of individuals who develop pathology do so in the absence of known pre-existing risk conditions such as chronic kidney disease and diabetes (0.6% and 2.7% respectively). Our findings indicate that despite the high cardiovascular risk, calcification of large arteries remains a largely under-diagnosed lethal condition, and there is a clear need for increased awareness and monitoring of the pathology in the general population.

18.

Genetic signatures in the envelope glycoproteins of HIV-1 that associate with broadly neutralizing antibodies.

Gnanakaran, S; Daniels, Marcus G; Bhattacharya, Tanmoy; Lapedes, Alan S; Sethi, Anurag; Li, Ming; Tang, Haili; Greene, Kelli; Gao, Hongmei; Haynes, Barton F; Cohen, Myron S; Shaw, George M; Seaman, Michael S; Kumar, Amit; Gao, Feng; Montefiori, David C; Korber, Bette.

PLoS Comput Biol ; 6(10): e1000955, 2010 Oct 07.

Article in English | MEDLINE | ID: mdl-20949103

ABSTRACT

A steady increase in knowledge of the molecular and antigenic structure of the gp120 and gp41 HIV-1 envelope glycoproteins (Env) is yielding important new insights for vaccine design, but it has been difficult to translate this information to an immunogen that elicits broadly neutralizing antibodies. To help bridge this gap, we used phylogenetically corrected statistical methods to identify amino acid signature patterns in Envs derived from people who have made potently neutralizing antibodies, with the hypothesis that these Envs may share common features that would be useful for incorporation in a vaccine immunogen. Before attempting this, essentially as a control, we explored the utility of our computational methods for defining signatures of complex neutralization phenotypes by analyzing Env sequences from 251 clonal viruses that were differentially sensitive to neutralization by the well-characterized gp120-specific monoclonal antibody, b12. We identified ten b12-neutralization signatures, including seven either in the b12-binding surface of gp120 or in the V2 region of gp120 that have been previously shown to impact b12 sensitivity. A simple algorithm based on the b12 signature pattern was predictive of b12 sensitivity/resistance in an additional blinded panel of 57 viruses. Upon obtaining these reassuring outcomes, we went on to apply these same computational methods to define signature patterns in Env from HIV-1 infected individuals who had potent, broadly neutralizing responses. We analyzed a checkerboard-style neutralization dataset with sera from 69 HIV-1-infected individuals tested against a panel of 25 different Envs. Distinct clusters of sera with high and low neutralization potencies were identified. Six signature positions in Env sequences obtained from the 69 samples were found to be strongly associated with either the high or low potency responses. Five sites were in the CD4-induced coreceptor binding site of gp120, suggesting an important role for this region in the elicitation of broadly neutralizing antibody responses against HIV-1.

Subject(s)

Antibodies, Neutralizing/metabolism , Computational Biology/methods , HIV Envelope Protein gp120/genetics , HIV Envelope Protein gp41/genetics , HIV-1/genetics , Algorithms , Amino Acid Sequence , Antibodies, Neutralizing/blood , Artificial Intelligence , Cluster Analysis , DNA Mutational Analysis/methods , Epitope Mapping , Epitopes, T-Lymphocyte , HIV Envelope Protein gp120/metabolism , HIV Envelope Protein gp41/metabolism , Humans , Logistic Models , Models, Molecular , Mutation/genetics , Neutralization Tests , Phylogeny , Sequence Alignment

19.

Molecular signatures of ribosomal evolution.

Roberts, Elijah; Sethi, Anurag; Montoya, Jonathan; Woese, Carl R; Luthey-Schulten, Zaida.

Proc Natl Acad Sci U S A ; 105(37): 13953-8, 2008 Sep 16.

Article in English | MEDLINE | ID: mdl-18768810

ABSTRACT

Ribosomal signatures, idiosyncrasies in the ribosomal RNA (rRNA) and/or proteins, are characteristic of the individual domains of life. As such, insight into the early evolution of the domains can be gained from a comparative analysis of their respective signatures in the translational apparatus. In this work, we identify signatures in both the sequence and structure of the rRNA and analyze their contributions to the universal phylogenetic tree using both sequence- and structure-based methods. Domain-specific ribosomal proteins can be considered signatures in their own right. Although it is commonly assumed that they developed after the universal ribosomal proteins, we present evidence that at least one may have been present before the divergence of the organismal lineages. We find correlations between the rRNA signatures and signatures in the ribosomal proteins showing that the rRNA signatures coevolved with both domain-specific and universal ribosomal proteins. Finally, we show that the genomic organization of the universal ribosomal components contains these signatures as well. From these studies, we propose the ribosomal signatures are remnants of an evolutionary-phase transition that occurred as the cell lineages began to coalesce and so should be reflected in corresponding signatures throughout the fabric of the cell and its genome.

Subject(s)

Evolution, Molecular , Phylogeny , Protein Biosynthesis , Ribosomes/genetics , Ribosomes/metabolism , Animals , Base Sequence , Genome/genetics , Humans , Models, Molecular , Molecular Sequence Data , Nucleic Acid Conformation , Protein Structure, Quaternary , Protein Structure, Tertiary , RNA, Ribosomal/chemistry , RNA, Ribosomal/genetics , RNA, Ribosomal/metabolism , Ribosomal Proteins/chemistry , Ribosomal Proteins/metabolism , Ribosomes/chemistry , Structural Homology, Protein

20.

The promise and reality of therapeutic discovery from large cohorts.

Melamud, Eugene; Taylor, D Leland; Sethi, Anurag; Cule, Madeleine; Baryshnikova, Anastasia; Saleheen, Danish; van Bruggen, Nick; FitzGerald, Garret A.

J Clin Invest ; 130(2): 575-581, 2020 02 03.

Article in English | MEDLINE | ID: mdl-31929188

ABSTRACT

Technological advances in rapid data acquisition have transformed medical biology into a data mining field, where new data sets are routinely dissected and analyzed by statistical models of ever-increasing complexity. Many hypotheses can be generated and tested within a single large data set, and even small effects can be statistically discriminated from a sea of noise. On the other hand, the development of therapeutic interventions moves at a much slower pace. They are determined from carefully randomized and well-controlled experiments with explicitly stated outcomes as the principal mechanism by which a single hypothesis is tested. In this paradigm, only a small fraction of interventions can be tested, and an even smaller fraction are ultimately deemed therapeutically successful. In this Review, we propose strategies to leverage large-cohort data to inform the selection of targets and the design of randomized trials of novel therapeutics. Ultimately, the incorporation of big data and experimental medicine approaches should aim to reduce the failure rate of clinical trials as well as expedite and lower the cost of drug development.

Subject(s)

Big Data , Biomedical Research , Cohort Studies , Models, Statistical , Randomized Controlled Trials as Topic , Humans

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL