Machine learning-based protein crystal detection for monitoring of crystallization processes enabled with large-scale synthetic data sets of photorealistic images.
Bischoff, Daniel; Walla, Brigitte; Weuster-Botz, Dirk.
Affiliation
  • Bischoff D; Technical University of Munich, Institute of Biochemical Engineering, Boltzmannstr. 15, Bavaria, 85748, Garching, Germany.
  • Walla B; Technical University of Munich, Institute of Biochemical Engineering, Boltzmannstr. 15, Bavaria, 85748, Garching, Germany.
  • Weuster-Botz D; Technical University of Munich, Institute of Biochemical Engineering, Boltzmannstr. 15, Bavaria, 85748, Garching, Germany. dirk.weuster-botz@tum.de.
Anal Bioanal Chem; 414(21): 6379-6391, 2022 Sep.
Article in English | MEDLINE | ID: mdl-35661232
ABSTRACT
Since preparative chromatography is a sustainability challenge due to large amounts of consumables used in downstream processing of biomolecules, protein crystallization offers a promising alternative as a purification method. While the limited crystallizability of proteins often restricts a broad application of crystallization as a purification method, advances in molecular biology, as well as computational methods, are pushing the applicability towards integration in biotechnological downstream processes. However, in industrial and academic settings, monitoring protein crystallization processes non-invasively by microscopic photography and automated image evaluation remains a challenging problem. Recently, the identification of single crystal objects using deep learning has been the subject of increased attention for various model systems. However, the advancement of crystal detection using deep learning for biotechnological applications is limited: robust models obtained through supervised machine learning tasks require large-scale and high-quality data sets usually obtained in large projects through extensive manual labeling, an approach that is highly error-prone for dense systems of transparent crystals. For the first time, recent trends involving the use of synthetic data sets for supervised learning are transferred, thus generating photorealistic images of virtual protein crystals in suspension (PCS) through the use of ray tracing algorithms, accompanied by specialized data augmentations modelling experimental noise. Further, it is demonstrated that state-of-the-art models trained with the large-scale synthetic PCS data set outperform similar fine-tuned models based on the average precision metric on a validation data set, followed by experimental validation using high-resolution photomicrographs from stirred tank protein crystallization processes.

Full text: 1 Database: MEDLINE Main subject: Neural Networks, Computer / Machine Learning Language: En Year of publication: 2022 Document type: Article