Your browser doesn't support javascript.
loading
Improving lab-of-origin prediction of genetically engineered plasmids via deep metric learning.
Soares, Igor M; Camargo, Fernando H F; Marques, Adriano; Crook, Oliver M.
Afiliação
  • Soares IM; Amalgam, Chicago, IL, USA. igor.muniz.ims@gmail.com.
  • Camargo FHF; Amalgam, Chicago, IL, USA. fernando.camargo.ai@gmail.com.
  • Marques A; Amalgam, Chicago, IL, USA. adriano@amalgam.ai.
  • Crook OM; Oxford Protein Informatics Group, University of Oxford, Oxford, UK. oliver.crook@stats.ox.ac.uk.
Nat Comput Sci ; 2(4): 253-264, 2022 Apr.
Article em En | MEDLINE | ID: mdl-38177551
ABSTRACT
Genome engineering is undergoing unprecedented development and is now becoming widely available. Genetic engineering attribution can make sequence-lab associations and assist forensic experts in ensuring responsible biotechnology innovation and reducing misuse of engineered DNA sequences. Here we propose a method based on metric learning to rank the most likely labs of origin while simultaneously generating embeddings for plasmid sequences and labs. These embeddings can be used to perform various downstream tasks, such as clustering DNA sequences and labs, as well as using them as features in machine learning models. Our approach employs a circular shift augmentation method and can correctly rank the lab of origin 90% of the time within its top-10 predictions. We also demonstrate that we can perform few-shot learning and obtain 76% top-10 accuracy using only 10% of the sequences. Finally, our approach can also extract key signatures in plasmid sequences for particular labs, allowing for an interpretable examination of the model's outputs.

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Tipo de estudo: Prognostic_studies / Risk_factors_studies Idioma: En Ano de publicação: 2022 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Tipo de estudo: Prognostic_studies / Risk_factors_studies Idioma: En Ano de publicação: 2022 Tipo de documento: Article