Your browser doesn't support javascript.
loading
CodonBERT large language model for mRNA vaccines.
Li, Sizhen; Moayedpour, Saeed; Li, Ruijiang; Bailey, Michael; Riahi, Saleh; Kogler-Anele, Lorenzo; Miladi, Milad; Miner, Jacob; Pertuy, Fabien; Zheng, Dinghai; Wang, Jun; Balsubramani, Akshay; Tran, Khang; Zacharia, Minnie; Wu, Monica; Gu, Xiaobo; Clinton, Ryan; Asquith, Carla; Skaleski, Joseph; Boeglin, Lianne; Chivukula, Sudha; Dias, Anusha; Strugnell, Tod; Montoya, Fernando Ulloa; Agarwal, Vikram; Bar-Joseph, Ziv; Jager, Sven.
Afiliación
  • Li S; Digital R&D, Sanofi, Cambridge, Massachusetts 02141, USA.
  • Moayedpour S; Digital R&D, Sanofi, Cambridge, Massachusetts 02141, USA.
  • Li R; Digital R&D, Sanofi, Cambridge, Massachusetts 02141, USA.
  • Bailey M; Digital R&D, Sanofi, Cambridge, Massachusetts 02141, USA.
  • Riahi S; Digital R&D, Sanofi, Cambridge, Massachusetts 02141, USA.
  • Kogler-Anele L; Digital R&D, Sanofi, Cambridge, Massachusetts 02141, USA.
  • Miladi M; mRNA Center of Excellence, Sanofi, Waltham, Massachusetts 02451, USA.
  • Miner J; mRNA Center of Excellence, Sanofi, Waltham, Massachusetts 02451, USA.
  • Pertuy F; mRNA Center of Excellence, Sanofi, Waltham, Massachusetts 02451, USA.
  • Zheng D; mRNA Center of Excellence, Sanofi, Waltham, Massachusetts 02451, USA.
  • Wang J; mRNA Center of Excellence, Sanofi, Waltham, Massachusetts 02451, USA.
  • Balsubramani A; mRNA Center of Excellence, Sanofi, Waltham, Massachusetts 02451, USA.
  • Tran K; mRNA Center of Excellence, Sanofi, Waltham, Massachusetts 02451, USA.
  • Zacharia M; mRNA Center of Excellence, Sanofi, Waltham, Massachusetts 02451, USA.
  • Wu M; mRNA Center of Excellence, Sanofi, Waltham, Massachusetts 02451, USA.
  • Gu X; mRNA Center of Excellence, Sanofi, Waltham, Massachusetts 02451, USA.
  • Clinton R; mRNA Center of Excellence, Sanofi, Waltham, Massachusetts 02451, USA.
  • Asquith C; mRNA Center of Excellence, Sanofi, Waltham, Massachusetts 02451, USA.
  • Skaleski J; mRNA Center of Excellence, Sanofi, Waltham, Massachusetts 02451, USA.
  • Boeglin L; mRNA Center of Excellence, Sanofi, Waltham, Massachusetts 02451, USA.
  • Chivukula S; mRNA Center of Excellence, Sanofi, Waltham, Massachusetts 02451, USA.
  • Dias A; mRNA Center of Excellence, Sanofi, Waltham, Massachusetts 02451, USA.
  • Strugnell T; mRNA Center of Excellence, Sanofi, Waltham, Massachusetts 02451, USA.
  • Montoya FU; mRNA Center of Excellence, Sanofi, 69280 Marcy L'Etoile, France.
  • Agarwal V; mRNA Center of Excellence, Sanofi, Waltham, Massachusetts 02451, USA.
  • Bar-Joseph Z; Digital R&D, Sanofi, Cambridge, Massachusetts 02141, USA; zivbj@cs.cmu.edu sven.jager@sanofi.com.
  • Jager S; Digital R&D, Sanofi, Cambridge, Massachusetts 02141, USA.
Genome Res ; 34(7): 1027-1035, 2024 Aug 20.
Article en En | MEDLINE | ID: mdl-38951026
ABSTRACT
mRNA-based vaccines and therapeutics are gaining popularity and usage across a wide range of conditions. One of the critical issues when designing such mRNAs is sequence optimization. Even small proteins or peptides can be encoded by an enormously large number of mRNAs. The actual mRNA sequence can have a large impact on several properties, including expression, stability, immunogenicity, and more. To enable the selection of an optimal sequence, we developed CodonBERT, a large language model (LLM) for mRNAs. Unlike prior models, CodonBERT uses codons as inputs, which enables it to learn better representations. CodonBERT was trained using more than 10 million mRNA sequences from a diverse set of organisms. The resulting model captures important biological concepts. CodonBERT can also be extended to perform prediction tasks for various mRNA properties. CodonBERT outperforms previous mRNA prediction methods, including on a new flu vaccine data set.
Asunto(s)

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: ARN Mensajero / Vacunas de ARNm Límite: Humans Idioma: En Revista: Genome Res Asunto de la revista: BIOLOGIA MOLECULAR / GENETICA Año: 2024 Tipo del documento: Article País de afiliación: Estados Unidos

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: ARN Mensajero / Vacunas de ARNm Límite: Humans Idioma: En Revista: Genome Res Asunto de la revista: BIOLOGIA MOLECULAR / GENETICA Año: 2024 Tipo del documento: Article País de afiliación: Estados Unidos