Pesquisa | Portal Regional da BVS

Blended Genome Exome (BGE) as a Cost Efficient Alternative to Deep Whole Genomes or Arrays.

DeFelice, Matthew; Grimsby, Jonna L; Howrigan, Daniel; Yuan, Kai; Chapman, Sinéad B; Stevens, Christine; DeLuca, Samuel; Townsend, Megan; Buxbaum, Joseph; Pericak-Vance, Margaret; Qin, Shengying; Stein, Dan J; Teferra, Solomon; Xavier, Ramnik J; Huang, Hailiang; Martin, Alicia R; Neale, Benjamin M.

bioRxiv ; 2024 Apr 09.

Artigo em Inglês | MEDLINE | ID: mdl-38645052

RESUMO

Genomic scientists have long been promised cheaper DNA sequencing, but deep whole genomes are still costly, especially when considered for large cohorts in population-level studies. More affordable options include microarrays + imputation, whole exome sequencing (WES), or low-pass whole genome sequencing (WGS) + imputation. WES + array + imputation has recently been shown to yield 99% of association signals detected by WGS. However, a method free from ascertainment biases of arrays or the need for merging different data types that still benefits from deeper exome coverage to enhance novel coding variant detection does not exist. We developed a new, combined, "Blended Genome Exome" (BGE) in which a whole genome library is generated, an aliquot of that genome is amplified by PCR, the exome regions are selected and enriched, and the genome and exome libraries are combined back into a single tube for sequencing (33% exome, 67% genome). This creates a single CRAM with a low-coverage whole genome (2-3x) combined with a higher coverage exome (30-40x). This BGE can be used for imputing common variants throughout the genome as well as for calling rare coding variants. We tested this new method and observed >99% r 2 concordance between imputed BGE data and existing 30x WGS data for exome and genome variants. BGE can serve as a useful and cost-efficient alternative sequencing product for genomic researchers, requiring ten-fold less sequencing compared to 30x WGS without the need for complicated harmonization of array and sequencing data.

Fully Flexible Docking of Medium Sized Ligand Libraries with RosettaLigand.

DeLuca, Samuel; Khar, Karen; Meiler, Jens.

PLoS One ; 10(7): e0132508, 2015.

Artigo em Inglês | MEDLINE | ID: mdl-26207742

RESUMO

RosettaLigand has been successfully used to predict binding poses in protein-small molecule complexes. However, the RosettaLigand docking protocol is comparatively slow in identifying an initial starting pose for the small molecule (ligand) making it unfeasible for use in virtual High Throughput Screening (vHTS). To overcome this limitation, we developed a new sampling approach for placing the ligand in the protein binding site during the initial 'low-resolution' docking step. It combines the translational and rotational adjustments to the ligand pose in a single transformation step. The new algorithm is both more accurate and more time-efficient. The docking success rate is improved by 10-15% in a benchmark set of 43 protein/ligand complexes, reducing the number of models that typically need to be generated from 1000 to 150. The average time to generate a model is reduced from 50 seconds to 10 seconds. As a result we observe an effective 30-fold speed increase, making RosettaLigand appropriate for docking medium sized ligand libraries. We demonstrate that this improved initial placement of the ligand is critical for successful prediction of an accurate binding position in the 'high-resolution' full atom refinement step.

Assuntos

Simulação de Acoplamento Molecular/métodos , Proteínas/química , Bibliotecas de Moléculas Pequenas/farmacologia , Algoritmos , Sítios de Ligação , Modelos Moleculares , Proteínas/metabolismo

In silico analysis and experimental verification of OSR1 kinase - Peptide interaction.

Austin, Thomas M; Nannemann, David P; Deluca, Samuel L; Meiler, Jens; Delpire, Eric.

J Struct Biol ; 187(1): 58-65, 2014 Jul.

Artigo em Inglês | MEDLINE | ID: mdl-24821279

RESUMO

The oxidative-stress-responsive kinase 1 (OSR1) and the STE20/SPS1-related proline/alanine-rich kinase (SPAK) are key enzymes in a signaling cascade regulating the activity of Na(+)-K(+)-2Cl(-) cotransporters (NKCC1-2) and Na(+)-Cl(-) cotransporter (NCC). Both kinases have a conserved carboxyl-terminal (CCT) domain, which recognizes a unique peptide motif present in OSR1- and SPAK-activating kinases (with-no-lysine kinase 1 (WNK1) and WNK4) as well as their substrates (NKCC1, NKCC2, and NCC). Utilizing various modalities of the Rosetta Molecular Modeling Software Suite including flexible peptide docking and protein design, we comprehensively explored the sequence space recognized by the CCT domain. Specifically, we studied single residue mutations as well as complete unbiased designs of a hexapeptide substrate. The computational study started from a crystal structure of the CCT domain of OSR1 in complex with a hexapeptide derived from WNK4. Point mutations predicted to be favorable include Arg to His or Trp substitutions at position 2 and a Phe to Tyr substitution at position 3 of the hexapeptide. In addition, de novo design yielded two peptides predicted to bind to the CCT domain: FRFQVT and TRFDVT. These results, which indicate a little bit more freedom in the composition of the peptide, were confirmed through the use of yeast two-hybrid screening.

Assuntos

Modelos Moleculares , Proteínas Serina-Treonina Quinases/química , Proteínas Recombinantes de Fusão/química , Sequência de Aminoácidos , Animais , Simulação por Computador , Cristalografia por Raios X , Expressão Gênica , Humanos , Ligação de Hidrogênio , Interações Hidrofóbicas e Hidrofílicas , Camundongos , Dados de Sequência Molecular , Mutação , Ligação Proteica , Domínios e Motivos de Interação entre Proteínas , Proteínas Serina-Treonina Quinases/genética , Estrutura Secundária de Proteína , Proteínas Recombinantes de Fusão/genética , Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/metabolismo , Técnicas do Sistema de Duplo-Híbrido

Small-molecule ligand docking into comparative models with Rosetta.

Combs, Steven A; Deluca, Samuel L; Deluca, Stephanie H; Lemmon, Gordon H; Nannemann, David P; Nguyen, Elizabeth D; Willis, Jordan R; Sheehan, Jonathan H; Meiler, Jens.

Nat Protoc ; 8(7): 1277-98, 2013.

Artigo em Inglês | MEDLINE | ID: mdl-23744289

RESUMO

Structure-based drug design is frequently used to accelerate the development of small-molecule therapeutics. Although substantial progress has been made in X-ray crystallography and nuclear magnetic resonance (NMR) spectroscopy, the availability of high-resolution structures is limited owing to the frequent inability to crystallize or obtain sufficient NMR restraints for large or flexible proteins. Computational methods can be used to both predict unknown protein structures and model ligand interactions when experimental data are unavailable. This paper describes a comprehensive and detailed protocol using the Rosetta modeling suite to dock small-molecule ligands into comparative models. In the protocol presented here, we review the comparative modeling process, including sequence alignment, threading and loop building. Next, we cover docking a small-molecule ligand into the protein comparative model. In addition, we discuss criteria that can improve ligand docking into comparative models. Finally, and importantly, we present a strategy for assessing model quality. The entire protocol is presented on a single example selected solely for didactic purposes. The results are therefore not representative and do not replace benchmarks published elsewhere. We also provide an additional tutorial so that the user can gain hands-on experience in using Rosetta. The protocol should take 5-7 h, with additional time allocated for computer generation of models.

Assuntos

Modelos Moleculares , Simulação de Acoplamento Molecular , Conformação Proteica , Desenho de Fármacos , Ligantes , Alinhamento de Sequência/métodos , Software , Interface Usuário-Computador

Human germline antibody gene segments encode polyspecific antibodies.

Willis, Jordan R; Briney, Bryan S; DeLuca, Samuel L; Crowe, James E; Meiler, Jens.

PLoS Comput Biol ; 9(4): e1003045, 2013 Apr.

Artigo em Inglês | MEDLINE | ID: mdl-23637590

RESUMO

Structural flexibility in germline gene-encoded antibodies allows promiscuous binding to diverse antigens. The binding affinity and specificity for a particular epitope typically increase as antibody genes acquire somatic mutations in antigen-stimulated B cells. In this work, we investigated whether germline gene-encoded antibodies are optimal for polyspecificity by determining the basis for recognition of diverse antigens by antibodies encoded by three VH gene segments. Panels of somatically mutated antibodies encoded by a common VH gene, but each binding to a different antigen, were computationally redesigned to predict antibodies that could engage multiple antigens at once. The Rosetta multi-state design process predicted antibody sequences for the entire heavy chain variable region, including framework, CDR1, and CDR2 mutations. The predicted sequences matched the germline gene sequences to a remarkable degree, revealing by computational design the residues that are predicted to enable polyspecificity, i.e., binding of many unrelated antigens with a common sequence. The process thereby reverses antibody maturation in silico. In contrast, when designing antibodies to bind a single antigen, a sequence similar to that of the mature antibody sequence was returned, mimicking natural antibody maturation in silico. We demonstrated that the Rosetta computational design algorithm captures important aspects of antibody/antigen recognition. While the hypervariable region CDR3 often mediates much of the specificity of mature antibodies, we identified key positions in the VH gene encoding CDR1, CDR2, and the immunoglobulin framework that are critical contributors for polyspecificity in germline antibodies. Computational design of antibodies capable of binding multiple antigens may allow the rational design of antibodies that retain polyspecificity for diverse epitope binding.

Assuntos

Anticorpos/química , Complexo Antígeno-Anticorpo/química , Algoritmos , Aminoácidos/química , Antígenos/química , Biologia Computacional/métodos , Simulação por Computador , Epitopos/química , Genes de Imunoglobulinas , Humanos , Mutação , Linguagens de Programação , Ligação Proteica , Conformação Proteica , Software

Design of native-like proteins through an exposure-dependent environment potential.

DeLuca, Samuel; Dorr, Brent; Meiler, Jens.

Biochemistry ; 50(40): 8521-8, 2011 Oct 11.

Artigo em Inglês | MEDLINE | ID: mdl-21905701

RESUMO

We hypothesize that the degree of surface exposure of amino acid side chains within a globular, soluble protein has been optimized in evolution, not only to minimize the solvation free energy of the monomeric protein but also to prevent protein aggregation. This effect needs to be taken into account when engineering proteins de novo. We test this hypothesis through addition of a knowledge-based, exposure-dependent energy term to the RosettaDesign solvation potential [Lazaridis, T., and Karplus, M. (1999) Proteins 35, 133-152]. Correlation between amino acid type and surface exposure is determined from a representative set of experimental protein structures. The amino acid solvent accessible surface area (SASA) is estimated with a neighbor vector measure that increases in accuracy compared to the neighbor count measure while remaining pairwise decomposable [Durham, E., et al. (2009) J. Mol. Model. 15, 1093-1108]. Benchmarking of this potential in protein design displays a 3.2% improvement in the overall sequence recovery and an 8.5% improvement in recovery of amino acid types tolerated in evolution.

Assuntos

Biologia Computacional , Engenharia de Proteínas , Proteínas/química , Sequência de Aminoácidos , Modelos Moleculares , Dados de Sequência Molecular , Proteínas/genética , Proteínas/metabolismo

Practically useful: what the Rosetta protein modeling suite can do for you.

Kaufmann, Kristian W; Lemmon, Gordon H; Deluca, Samuel L; Sheehan, Jonathan H; Meiler, Jens.

Biochemistry ; 49(14): 2987-98, 2010 Apr 13.

Artigo em Inglês | MEDLINE | ID: mdl-20235548

RESUMO

The objective of this review is to enable researchers to use the software package Rosetta for biochemical and biomedicinal studies. We provide a brief review of the six most frequent research problems tackled with Rosetta. For each of these six tasks, we provide a tutorial that illustrates a basic Rosetta protocol. The Rosetta method was originally developed for de novo protein structure prediction and is regularly one of the best performers in the community-wide biennial Critical Assessment of Structure Prediction. Predictions for protein domains with fewer than 125 amino acids regularly have a backbone root-mean-square deviation of better than 5.0 A. More impressively, there are several cases in which Rosetta has been used to predict structures with atomic level accuracy better than 2.5 A. In addition to de novo structure prediction, Rosetta also has methods for molecular docking, homology modeling, determining protein structures from sparse experimental NMR or EPR data, and protein design. Rosetta has been used to accurately design a novel protein structure, predict the structure of protein-protein complexes, design altered specificity protein-protein and protein-DNA interactions, and stabilize proteins and protein complexes. Most recently, Rosetta has been used to solve the X-ray crystallographic phase problem.

Assuntos

Simulação por Computador , Modelos Moleculares , Proteínas/química , Software , Pesquisa Biomédica , Cristalografia por Raios X , DNA/química , Bases de Conhecimento , Complexos Multiproteicos , Conformação Proteica

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA