Búsqueda | BVS CLAP/SMR-OPS/OMS

A guide to machine learning for biologists.

Greener, Joe G; Kandathil, Shaun M; Moffat, Lewis; Jones, David T.

Nat Rev Mol Cell Biol ; 23(1): 40-55, 2022 01.

Artículo en Inglés | MEDLINE | ID: mdl-34518686

RESUMEN

The expanding scale and inherent complexity of biological data have encouraged a growing use of machine learning in biology to build informative and predictive models of the underlying biological processes. All machine learning techniques fit models to data; however, the specific methods are quite varied and can at first glance seem bewildering. In this Review, we aim to provide readers with a gentle introduction to a few key machine learning techniques, including the most recently developed and widely used techniques involving deep neural networks. We describe how different techniques may be suited to specific types of biological data, and also discuss some best practices and points to consider when one is embarking on experiments involving machine learning. Some emerging directions in machine learning methodology are also discussed.

Asunto(s)

Biología , Aprendizaje Automático , Animales , Aprendizaje Profundo , Humanos , Redes Neurales de la Computación

Deep learning for the PSIPRED Protein Analysis Workbench.

Buchan, Daniel W A; Moffat, Lewis; Lau, Andy; Kandathil, Shaun M; Jones, David T.

Nucleic Acids Res ; 52(W1): W287-W293, 2024 Jul 05.

Artículo en Inglés | MEDLINE | ID: mdl-38747351

RESUMEN

The PSIRED Workbench is a long established and popular bioinformatics web service offering a wide range of machine learning based analyses for characterizing protein structure and function. In this paper we provide an update of the recent additions and developments to the webserver, with a focus on new Deep Learning based methods. We briefly discuss some trends in server usage since the publication of AlphaFold2 and we give an overview of some upcoming developments for the service. The PSIPRED Workbench is available at http://bioinf.cs.ucl.ac.uk/psipred.

Asunto(s)

Aprendizaje Profundo , Proteínas , Programas Informáticos , Proteínas/química , Proteínas/genética , Internet , Conformación Proteica , Biología Computacional/métodos , Análisis de Secuencia de Proteína/métodos

Increasing the accuracy of single sequence prediction methods using a deep semi-supervised learning framework.

Moffat, Lewis; Jones, David T.

Bioinformatics ; 37(21): 3744-3751, 2021 11 05.

Artículo en Inglés | MEDLINE | ID: mdl-34213528

RESUMEN

MOTIVATION: Over the past 50 years, our ability to model protein sequences with evolutionary information has progressed in leaps and bounds. However, even with the latest deep learning methods, the modelling of a critically important class of proteins, single orphan sequences, remains unsolved. RESULTS: By taking a bioinformatics approach to semi-supervised machine learning, we develop Profile Augmentation of Single Sequences (PASS), a simple but powerful framework for building accurate single-sequence methods. To demonstrate the effectiveness of PASS we apply it to the mature field of secondary structure prediction. In doing so we develop S4PRED, the successor to the open-source PSIPRED-Single method, which achieves an unprecedented Q3 score of 75.3% on the standard CB513 test. PASS provides a blueprint for the development of a new generation of predictive methods, advancing our ability to model individual protein sequences. AVAILABILITY AND IMPLEMENTATION: The S4PRED model is available as open source software on the PSIPRED GitHub repository (https://github.com/psipred/s4pred), along with documentation. It will also be provided as a part of the PSIPRED web service (http://bioinf.cs.ucl.ac.uk/psipred/). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Asunto(s)

Análisis de Secuencia de Proteína , Aprendizaje Automático Supervisado , Secuencia de Aminoácidos , Biología Computacional , Conjuntos de Datos como Asunto , Análisis de Secuencia de Proteína/métodos , Exactitud de los Datos

Design of metalloproteins and novel protein folds using variational autoencoders.

Greener, Joe G; Moffat, Lewis; Jones, David T.

Sci Rep ; 8(1): 16189, 2018 11 01.

Artículo en Inglés | MEDLINE | ID: mdl-30385875

RESUMEN

The design of novel proteins has many applications but remains an attritional process with success in isolated cases. Meanwhile, deep learning technologies have exploded in popularity in recent years and are increasingly applicable to biology due to the rise in available data. We attempt to link protein design and deep learning by using variational autoencoders to generate protein sequences conditioned on desired properties. Potential copper and calcium binding sites are added to non-metal binding proteins without human intervention and compared to a hidden Markov model. In another use case, a grammar of protein structures is developed and used to produce sequences for a novel protein topology. One candidate structure is found to be stable by molecular dynamics simulation. The ability of our model to confine the vast search space of protein sequences and to scale easily has the potential to assist in a variety of protein design tasks.

Asunto(s)

Biología Computacional , Metaloproteínas/química , Pliegue de Proteína , Secuencia de Aminoácidos/genética , Sitios de Unión , Aprendizaje Profundo , Humanos , Simulación de Dinámica Molecular

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

ENVIAR RESULTADO:

SELECCIÓN DE REFERENCIAS

DETALLE DE LA BÚSQUEDA