Búsqueda | Portal Regional de la BVS

Evolutionary-scale prediction of atomic-level protein structure with a language model.

Lin, Zeming; Akin, Halil; Rao, Roshan; Hie, Brian; Zhu, Zhongkai; Lu, Wenting; Smetanin, Nikita; Verkuil, Robert; Kabeli, Ori; Shmueli, Yaniv; Dos Santos Costa, Allan; Fazel-Zarandi, Maryam; Sercu, Tom; Candido, Salvatore; Rives, Alexander.

Science ; 379(6637): 1123-1130, 2023 03 17.

Artículo en Inglés | MEDLINE | ID: mdl-36927031

RESUMEN

Recent advances in machine learning have leveraged evolutionary information in multiple sequence alignments to predict protein structure. We demonstrate direct inference of full atomic-level protein structure from primary sequence using a large language model. As language models of protein sequences are scaled up to 15 billion parameters, an atomic-resolution picture of protein structure emerges in the learned representations. This results in an order-of-magnitude acceleration of high-resolution structure prediction, which enables large-scale structural characterization of metagenomic proteins. We apply this capability to construct the ESM Metagenomic Atlas by predicting structures for >617 million metagenomic protein sequences, including >225 million that are predicted with high confidence, which gives a view into the vast breadth and diversity of natural proteins.

Asunto(s)

Evolución Molecular , Aprendizaje Automático , Proteínas , Análisis de Secuencia de Proteína , Secuencia de Aminoácidos , Proteínas/química , Conformación Proteica

Author Correction: Accelerated antimicrobial discovery via deep generative models and molecular dynamics simulations.

Das, Payel; Sercu, Tom; Wadhawan, Kahini; Padhi, Inkit; Gehrmann, Sebastian; Cipcigan, Flaviu; Chenthamarakshan, Vijil; Strobelt, Hendrik; Dos Santos, Cicero; Chen, Pin-Yu; Yang, Yi Yan; Tan, Jeremy P K; Hedrick, James; Crain, Jason; Mojsilovic, Aleksandra.

Nat Biomed Eng ; 5(8): 942, 2021 Aug.

Artículo en Inglés | MEDLINE | ID: mdl-34183803

Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences.

Rives, Alexander; Meier, Joshua; Sercu, Tom; Goyal, Siddharth; Lin, Zeming; Liu, Jason; Guo, Demi; Ott, Myle; Zitnick, C Lawrence; Ma, Jerry; Fergus, Rob.

Proc Natl Acad Sci U S A ; 118(15)2021 04 13.

Artículo en Inglés | MEDLINE | ID: mdl-33876751

RESUMEN

In the field of artificial intelligence, a combination of scale in data and model capacity enabled by unsupervised learning has led to major advances in representation learning and statistical generation. In the life sciences, the anticipated growth of sequencing promises unprecedented data on natural sequence diversity. Protein language modeling at the scale of evolution is a logical step toward predictive and generative artificial intelligence for biology. To this end, we use unsupervised learning to train a deep contextual language model on 86 billion amino acids across 250 million protein sequences spanning evolutionary diversity. The resulting model contains information about biological properties in its representations. The representations are learned from sequence data alone. The learned representation space has a multiscale organization reflecting structure from the level of biochemical properties of amino acids to remote homology of proteins. Information about secondary and tertiary structure is encoded in the representations and can be identified by linear projections. Representation learning produces features that generalize across a range of applications, enabling state-of-the-art supervised prediction of mutational effect and secondary structure and improving state-of-the-art features for long-range contact prediction.

Asunto(s)

Análisis de Secuencia de Proteína/métodos , Aprendizaje Automático no Supervisado , Aminoácidos/química , Conformación Proteica , Homología de Secuencia de Aminoácido

Accelerated antimicrobial discovery via deep generative models and molecular dynamics simulations.

Nat Biomed Eng ; 5(6): 613-623, 2021 06.

Artículo en Inglés | MEDLINE | ID: mdl-33707779

RESUMEN

The de novo design of antimicrobial therapeutics involves the exploration of a vast chemical repertoire to find compounds with broad-spectrum potency and low toxicity. Here, we report an efficient computational method for the generation of antimicrobials with desired attributes. The method leverages guidance from classifiers trained on an informative latent space of molecules modelled using a deep generative autoencoder, and screens the generated molecules using deep-learning classifiers as well as physicochemical features derived from high-throughput molecular dynamics simulations. Within 48 days, we identified, synthesized and experimentally tested 20 candidate antimicrobial peptides, of which two displayed high potency against diverse Gram-positive and Gram-negative pathogens (including multidrug-resistant Klebsiella pneumoniae) and a low propensity to induce drug resistance in Escherichia coli. Both peptides have low toxicity, as validated in vitro and in mice. We also show using live-cell confocal imaging that the bactericidal mode of action of the peptides involves the formation of membrane pores. The combination of deep learning and molecular dynamics may accelerate the discovery of potent and selective broad-spectrum antimicrobials.

Asunto(s)

Antibacterianos/farmacología , Péptidos Catiónicos Antimicrobianos/farmacología , Aprendizaje Profundo , Diseño de Fármacos , Descubrimiento de Drogas/métodos , Farmacorresistencia Bacteriana/efectos de los fármacos , Acinetobacter baumannii/efectos de los fármacos , Acinetobacter baumannii/crecimiento & desarrollo , Acinetobacter baumannii/ultraestructura , Secuencia de Aminoácidos , Animales , Antibacterianos/síntesis química , Péptidos Catiónicos Antimicrobianos/síntesis química , Escherichia coli/efectos de los fármacos , Escherichia coli/crecimiento & desarrollo , Escherichia coli/ultraestructura , Femenino , Infecciones por Klebsiella/tratamiento farmacológico , Klebsiella pneumoniae/efectos de los fármacos , Klebsiella pneumoniae/crecimiento & desarrollo , Klebsiella pneumoniae/ultraestructura , Ratones , Ratones Endogámicos BALB C , Pruebas de Sensibilidad Microbiana , Simulación de Dinámica Molecular , Pseudomonas aeruginosa/efectos de los fármacos , Pseudomonas aeruginosa/crecimiento & desarrollo , Pseudomonas aeruginosa/ultraestructura , Staphylococcus aureus/efectos de los fármacos , Staphylococcus aureus/crecimiento & desarrollo , Staphylococcus aureus/ultraestructura , Relación Estructura-Actividad

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

ENVIAR RESULTADO:

SELECCIÓN DE REFERENCIAS

DETALLE DE LA BÚSQUEDA