Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 10 de 10
Filtrar
1.
Cell ; 165(4): 963-75, 2016 May 05.
Artigo em Inglês | MEDLINE | ID: mdl-27087444

RESUMO

Non-coding RNAs are ubiquitous, but the discovery of new RNA gene sequences far outpaces the research on the structure and functional interactions of these RNA gene sequences. We mine the evolutionary sequence record to derive precise information about the function and structure of RNAs and RNA-protein complexes. As in protein structure prediction, we use maximum entropy global probability models of sequence co-variation to infer evolutionarily constrained nucleotide-nucleotide interactions within RNA molecules and nucleotide-amino acid interactions in RNA-protein complexes. The predicted contacts allow all-atom blinded 3D structure prediction at good accuracy for several known RNA structures and RNA-protein complexes. For unknown structures, we predict contacts in 160 non-coding RNA families. Beyond 3D structure prediction, evolutionary couplings help identify important functional interactions-e.g., at switch points in riboswitches and at a complex nucleation site in HIV. Aided by increasing sequence accumulation, evolutionary coupling analysis can accelerate the discovery of functional interactions and 3D structures involving RNA.


Assuntos
Conformação de Ácido Nucleico , RNA não Traduzido/química , Entropia , Evolução Molecular , Modelos Moleculares , Dobramento de RNA , RNA não Traduzido/genética , RNA não Traduzido/metabolismo , Proteínas de Ligação a RNA/química , Proteínas de Ligação a RNA/metabolismo , Ribossomos/metabolismo
2.
Semin Cell Dev Biol ; 111: 67-73, 2021 03.
Artigo em Inglês | MEDLINE | ID: mdl-32654970

RESUMO

Until the discovery of human embryonic stem cells and human induced pluripotent stem cells, biotechnology companies were severely limited in the number of human tissues that they could model in large-scale in vitro studies. Until this point, companies have been limited to immortalized cancer lines or a small number of primary cell types that could be extracted and expanded. Nowadays, protocols continue to be developed in the stem cell field, enabling researchers to model an ever-growing library of cell types in controlled, large-scale screens. One differentiation method in particular- cerebral organoids- shows substantial potential in the field of neuroscience and developmental neurobiology. Cerebral organoid technology is still in an early phase of development, and there are several challenges that are currently being addressed by academic and industrial researchers alike. Here we briefly describe some of the early adopters of cerebral organoids, several of the challenges that they are likely facing, and various technologies that are currently being implemented to overcome them.


Assuntos
Descoberta de Drogas/métodos , Drogas em Investigação/farmacologia , Modelos Biológicos , Doenças Neurodegenerativas/tratamento farmacológico , Fármacos Neuroprotetores/farmacologia , Organoides/efeitos dos fármacos , Sistemas CRISPR-Cas , Diferenciação Celular , Córtex Cerebral/efeitos dos fármacos , Córtex Cerebral/metabolismo , Córtex Cerebral/patologia , Drogas em Investigação/química , Humanos , Células-Tronco Pluripotentes Induzidas/citologia , Células-Tronco Pluripotentes Induzidas/efeitos dos fármacos , Células-Tronco Pluripotentes Induzidas/metabolismo , Aprendizado de Máquina , Mutação , Proteínas do Tecido Nervoso/genética , Proteínas do Tecido Nervoso/metabolismo , Doenças Neurodegenerativas/genética , Doenças Neurodegenerativas/metabolismo , Doenças Neurodegenerativas/patologia , Neurônios/citologia , Neurônios/efeitos dos fármacos , Neurônios/metabolismo , Fármacos Neuroprotetores/química , Organoides/metabolismo , Organoides/patologia , Análise de Sequência de RNA/métodos , Análise de Célula Única/métodos
3.
Nat Methods ; 15(10): 816-822, 2018 10.
Artigo em Inglês | MEDLINE | ID: mdl-30250057

RESUMO

The functions of proteins and RNAs are defined by the collective interactions of many residues, and yet most statistical models of biological sequences consider sites nearly independently. Recent approaches have demonstrated benefits of including interactions to capture pairwise covariation, but leave higher-order dependencies out of reach. Here we show how it is possible to capture higher-order, context-dependent constraints in biological sequences via latent variable models with nonlinear dependencies. We found that DeepSequence ( https://github.com/debbiemarkslab/DeepSequence ), a probabilistic model for sequence families, predicted the effects of mutations across a variety of deep mutational scanning experiments substantially better than existing methods based on the same evolutionary data. The model, learned in an unsupervised manner solely on the basis of sequence information, is grounded with biologically motivated priors, reveals the latent organization of sequence families, and can be used to explore new parts of sequence space.


Assuntos
Biologia Computacional/métodos , Evolução Molecular , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Modelos Teóricos , Mutação , Algoritmos , Humanos
4.
Bioinformatics ; 35(9): 1582-1584, 2019 05 01.
Artigo em Inglês | MEDLINE | ID: mdl-30304492

RESUMO

SUMMARY: Coevolutionary sequence analysis has become a commonly used technique for de novo prediction of the structure and function of proteins, RNA, and protein complexes. We present the EVcouplings framework, a fully integrated open-source application and Python package for coevolutionary analysis. The framework enables generation of sequence alignments, calculation and evaluation of evolutionary couplings (ECs), and de novo prediction of structure and mutation effects. The combination of an easy to use, flexible command line interface and an underlying modular Python package makes the full power of coevolutionary analyses available to entry-level and advanced users. AVAILABILITY AND IMPLEMENTATION: https://github.com/debbiemarkslab/evcouplings.


Assuntos
Análise de Sequência , Software , Proteínas , RNA , Alinhamento de Sequência
5.
Nucleic Acids Res ; 45(11): 6971-6980, 2017 Jun 20.
Artigo em Inglês | MEDLINE | ID: mdl-28499033

RESUMO

The ability to rewrite large stretches of genomic DNA enables the creation of new organisms with customized functions. However, few methods currently exist for accumulating such widespread genomic changes in a single organism. In this study, we demonstrate a rapid approach for rewriting bacterial genomes with modified synthetic DNA. We recode 200 kb of the Salmonella typhimurium LT2 genome through a process we term SIRCAS (stepwise integration of rolling circle amplified segments), towards constructing an attenuated and genetically isolated bacterial chassis. The SIRCAS process involves direct iterative recombineering of 10-25 kb synthetic DNA constructs which are assembled in yeast and amplified by rolling circle amplification. Using SIRCAS, we create a Salmonella with 1557 synonymous leucine codon replacements across 176 genes, the largest number of cumulative recoding changes in a single bacterial strain to date. We demonstrate reproducibility over sixteen two-day cycles of integration and parallelization for hierarchical construction of a synthetic genome by conjugation. The resulting recoded strain grows at a similar rate to the wild-type strain and does not exhibit any major growth defects. This work is the first instance of synthetic bacterial recoding beyond the Escherichia coli genome, and reveals that Salmonella is remarkably amenable to genome-scale modification.


Assuntos
DNA Bacteriano/genética , Engenharia Genética/métodos , Salmonella typhimurium/genética , Códon , Genes Bacterianos , Genes Sintéticos , Genoma Bacteriano , Leucina/genética , Viabilidade Microbiana , Reprodutibilidade dos Testes
6.
Plant Biotechnol J ; 12(1): 49-59, 2014 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-24102738

RESUMO

Heterotrimeric G-proteins consisting of Gα, Gß and Gγ subunits play an integral role in mediating multiple signalling pathways in plants. A novel, recently identified plant-specific Gγ protein, AGG3, has been proposed to be an important regulator of organ size and mediator of stress responses in Arabidopsis, whereas its potential homologs in rice are major quantitative trait loci for seed size and panicle branching. To evaluate the role of AGG3 towards seed and oil yield improvement, the gene was overexpressed in Camelina sativa, an oilseed crop of the Brassicaceae family. Analysis of multiple homozygous T4 transgenic Camelina lines showed that constitutive overexpression of AGG3 resulted in faster vegetative as well as reproductive growth accompanied by an increase in photosynthetic efficiency. Moreover, when expressed constitutively or specifically in seed tissue, AGG3 was found to increase seed size, seed mass and seed number per plant by 15%-40%, effectively resulting in significantly higher oil yield per plant. AGG3 overexpressing Camelina plants also exhibited improved stress tolerance. These observations draw a strong link between the roles of AGG3 in regulating two critical yield parameters, seed traits and plant stress responses, and reveal an effective biotechnological tool to dramatically increase yield in agricultural crops.


Assuntos
Proteínas de Arabidopsis/metabolismo , Brassicaceae/metabolismo , Subunidades gama da Proteína de Ligação ao GTP/metabolismo , Plantas Geneticamente Modificadas/metabolismo , Sementes/metabolismo , Proteínas de Arabidopsis/genética , Brassicaceae/genética , Subunidades gama da Proteína de Ligação ao GTP/genética , Regulação da Expressão Gênica de Plantas/genética , Regulação da Expressão Gênica de Plantas/fisiologia , Plantas Geneticamente Modificadas/genética , Sementes/genética
7.
Nat Commun ; 15(1): 5141, 2024 Jun 20.
Artigo em Inglês | MEDLINE | ID: mdl-38902262

RESUMO

A major challenge in protein design is to augment existing functional proteins with multiple property enhancements. Altering several properties likely necessitates numerous primary sequence changes, and novel methods are needed to accurately predict combinations of mutations that maintain or enhance function. Models of sequence co-variation (e.g., EVcouplings), which leverage extensive information about various protein properties and activities from homologous protein sequences, have proven effective for many applications including structure determination and mutation effect prediction. We apply EVcouplings to computationally design variants of the model protein TEM-1 ß-lactamase. Nearly all the 14 experimentally characterized designs were functional, including one with 84 mutations from the nearest natural homolog. The designs also had large increases in thermostability, increased activity on multiple substrates, and nearly identical structure to the wild type enzyme. This study highlights the efficacy of evolutionary models in guiding large sequence alterations to generate functional diversity for protein design applications.


Assuntos
Evolução Molecular , Mutação , Engenharia de Proteínas , beta-Lactamases , beta-Lactamases/genética , beta-Lactamases/metabolismo , beta-Lactamases/química , Engenharia de Proteínas/métodos , Modelos Moleculares , Sequência de Aminoácidos , Estabilidade Enzimática , Conformação Proteica
8.
bioRxiv ; 2023 May 09.
Artigo em Inglês | MEDLINE | ID: mdl-37214973

RESUMO

Designing optimized proteins is important for a range of practical applications. Protein design is a rapidly developing field that would benefit from approaches that enable many changes in the amino acid primary sequence, rather than a small number of mutations, while maintaining structure and enhancing function. Homologous protein sequences contain extensive information about various protein properties and activities that have emerged over billions of years of evolution. Evolutionary models of sequence co-variation, derived from a set of homologous sequences, have proven effective in a range of applications including structure determination and mutation effect prediction. In this work we apply one of these models (EVcouplings) to computationally design highly divergent variants of the model protein TEM-1 ß-lactamase, and characterize these designs experimentally using multiple biochemical and biophysical assays. Nearly all designed variants were functional, including one with 84 mutations from the nearest natural homolog. Surprisingly, all functional designs had large increases in thermostability and most had a broadening of available substrates. These property enhancements occurred while maintaining a nearly identical structure to the wild type enzyme. Collectively, this work demonstrates that evolutionary models of sequence co-variation (1) are able to capture complex epistatic interactions that successfully guide large sequence departures from natural contexts, and (2) can be applied to generate functional diversity useful for many applications in protein design.

9.
Nat Commun ; 12(1): 2403, 2021 04 23.
Artigo em Inglês | MEDLINE | ID: mdl-33893299

RESUMO

The ability to design functional sequences and predict effects of variation is central to protein engineering and biotherapeutics. State-of-art computational methods rely on models that leverage evolutionary information but are inadequate for important applications where multiple sequence alignments are not robust. Such applications include the prediction of variant effects of indels, disordered proteins, and the design of proteins such as antibodies due to the highly variable complementarity determining regions. We introduce a deep generative model adapted from natural language processing for prediction and design of diverse functional sequences without the need for alignments. The model performs state-of-art prediction of missense and indel effects and we successfully design and test a diverse 105-nanobody library that shows better expression than a 1000-fold larger synthetic library. Our results demonstrate the power of the alignment-free autoregressive model in generalizing to regions of sequence space traditionally considered beyond the reach of prediction and design.


Assuntos
Algoritmos , Biologia Computacional/métodos , Redes Neurais de Computação , Engenharia de Proteínas/métodos , Sequência de Aminoácidos , Anticorpos/genética , Anticorpos/imunologia , Anticorpos/metabolismo , Antígenos/imunologia , Genótipo , Humanos , Mutação , Fenótipo , Proteínas/genética , Proteínas/imunologia , Proteínas/metabolismo
10.
Cell Syst ; 10(1): 15-24.e5, 2020 01 22.
Artigo em Inglês | MEDLINE | ID: mdl-31838147

RESUMO

Natural evolution encodes rich information about the structure and function of biomolecules in the genetic record. Previously, statistical analysis of co-variation patterns in natural protein families has enabled the accurate computation of 3D structures. Here, we explored generating similar information by experimental evolution, starting from a single gene and performing multiple cycles of in vitro mutagenesis and functional selection in Escherichia coli. We evolved two antibiotic resistance proteins, ß-lactamase PSE1 and acetyltransferase AAC6, and obtained hundreds of thousands of diverse functional sequences. Using evolutionary coupling analysis, we inferred residue interaction constraints that were in agreement with contacts in known 3D structures, confirming genetic encoding of structural constraints in the selected sequences. Computational protein folding with interaction constraints then yielded 3D structures with the same fold as natural relatives. This work lays the foundation for a new experimental method (3Dseq) for protein structure determination, combining evolution experiments with inference of residue interactions from sequence information. A record of this paper's Transparent Peer Review process is included in the Supplemental Information.


Assuntos
Evolução Molecular , Proteínas/química , Humanos , Conformação Proteica
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA