Your browser doesn't support javascript.
loading
Large-scale data mining of four billion human antibody variable regions reveals convergence between therapeutic and natural antibodies that constrains search space for biologics drug discovery.
Dudzic, Pawel; Chomicz, Dawid; Konczak, Jaroslaw; Satlawa, Tadeusz; Janusz, Bartosz; Wrobel, Sonia; Gawlowski, Tomasz; Jaszczyszyn, Igor; Bielska, Weronika; Demharter, Samuel; Spreafico, Roberto; Schulte, Lukas; Martin, Kyle; Comeau, Stephen R; Krawczyk, Konrad.
Afiliación
  • Dudzic P; NaturalAntibody, Szczecin, Poland.
  • Chomicz D; NaturalAntibody, Szczecin, Poland.
  • Konczak J; NaturalAntibody, Szczecin, Poland.
  • Satlawa T; NaturalAntibody, Szczecin, Poland.
  • Janusz B; NaturalAntibody, Szczecin, Poland.
  • Wrobel S; NaturalAntibody, Szczecin, Poland.
  • Gawlowski T; NaturalAntibody, Szczecin, Poland.
  • Jaszczyszyn I; NaturalAntibody, Szczecin, Poland.
  • Bielska W; NaturalAntibody, Szczecin, Poland.
  • Demharter S; Discovery Data Science, Genmab, Copenhagen, Denmark.
  • Spreafico R; Discovery Data Science, Genmab, Utrecht, The Netherlands.
  • Schulte L; Global Computational Biology & Digital Sciences, Boehringer Ingelheim Pharma GmbH & Co. KG, Biberach an der Riß, Germany.
  • Martin K; Biotherapeutics Discovery, Boehringer Ingelheim, Ridgefield, CT, USA.
  • Comeau SR; Biotherapeutics Discovery, Boehringer Ingelheim, Ridgefield, CT, USA.
  • Krawczyk K; NaturalAntibody, Szczecin, Poland.
MAbs ; 16(1): 2361928, 2024.
Article en En | MEDLINE | ID: mdl-38844871
ABSTRACT
The naïve human antibody repertoire has theoretical access to an estimated > 1015 antibodies. Identifying subsets of this prohibitively large space where therapeutically relevant antibodies may be found is useful for development of these agents. It was previously demonstrated that, despite the immense sequence space, different individuals can produce the same antibodies. It was also shown that therapeutic antibodies, which typically follow seemingly unnatural development processes, can arise independently naturally. To check for biases in how the sequence space is explored, we data mined public repositories to identify 220 bioprojects with a combined seven billion reads. Of these, we created a subset of human bioprojects that we make available as the AbNGS database (https//naturalantibody.com/ngs/). AbNGS contains 135 bioprojects with four billion productive human heavy variable region sequences and 385 million unique complementarity-determining region (CDR)-H3s. We find that 270,000 (0.07% of 385 million) unique CDR-H3s are highly public in that they occur in at least five of 135 bioprojects. Of 700 unique therapeutic CDR-H3, a total of 6% has direct matches in the small set of 270,000. This observation extends to a match between CDR-H3 and V-gene call as well. Thus, the subspace of shared ('public') CDR-H3s shows utility for serving as a starting point for therapeutic antibody design.
Asunto(s)
Palabras clave

Texto completo: 1 Base de datos: MEDLINE Asunto principal: Productos Biológicos / Regiones Determinantes de Complementariedad / Descubrimiento de Drogas / Minería de Datos Idioma: En Revista: MAbs Asunto de la revista: ALERGIA E IMUNOLOGIA Año: 2024 Tipo del documento: Article

Texto completo: 1 Base de datos: MEDLINE Asunto principal: Productos Biológicos / Regiones Determinantes de Complementariedad / Descubrimiento de Drogas / Minería de Datos Idioma: En Revista: MAbs Asunto de la revista: ALERGIA E IMUNOLOGIA Año: 2024 Tipo del documento: Article