Your browser doesn't support javascript.
loading
Classification of helical polymers with deep-learning language models.
Li, Daoyi; Jiang, Wen.
Afiliação
  • Li D; Department of Biological Sciences, Purdue University.
  • Jiang W; Department of Biological Sciences, Purdue University. Electronic address: jiang12@purdue.edu.
J Struct Biol ; 215(4): 108041, 2023 12.
Article em En | MEDLINE | ID: mdl-37939748
ABSTRACT
Many macromolecules in biological systems exist in the form of helical polymers. However, the inherent polymorphism and heterogeneity of samples complicate the reconstruction of helical polymers from cryo-EM images. Currently, available 2D classification methods are effective at separating particles of interest from contaminants, but they do not effectively differentiate between polymorphs, resulting in heterogeneity in the 2D classes. As such, it is crucial to develop a method that can computationally divide a dataset of polymorphic helical structures into homogenous subsets. In this work, we utilized deep-learning language models to embed the filaments as vectors in hyperspace and group them into clusters. Tests with both simulated and experimental datasets have demonstrated that our method - HLM (Helical classification with Language Model) can effectively distinguish different types of filaments, in the presence of many contaminants and low signal-to-noise ratios. We also demonstrate that HLM can isolate homogeneous subsets of particles from a publicly available dataset, resulting in the discovery of a previously unreported filament variant with an extra density around the tau filaments.
Assuntos

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Polímeros / Aprendizado Profundo Idioma: En Ano de publicação: 2023 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Polímeros / Aprendizado Profundo Idioma: En Ano de publicação: 2023 Tipo de documento: Article