Protein language models enable prediction of polyreactivity of monospecific, bispecific, and heavy-chain-only antibodies

Abstract Background Early assessment of antibody off-target binding is essential for mitigating developability risks such as fast clearance, reduced efficacy, toxicity, and immunogenicity. The baculovirus particle (BVP) binding assay has been widely utilized to evaluate polyreactivity of antibodies....

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Antibody therapeutics 2024-07, Vol.7 (3), p.199-208
Hauptverfasser: Yu, Xin, Vangjeli, Kostika, Prakash, Anusha, Chhaya, Meha, Stanley, Samantha J, Cohen, Noah, Huang, Lili
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Abstract Background Early assessment of antibody off-target binding is essential for mitigating developability risks such as fast clearance, reduced efficacy, toxicity, and immunogenicity. The baculovirus particle (BVP) binding assay has been widely utilized to evaluate polyreactivity of antibodies. As a complementary approach, computational prediction of polyreactivity is desirable for counter-screening antibodies from in silico discovery campaigns. However, there is a lack of such models. Methods Herein, we present the development of an ensemble of three deep learning models based on two pan-protein foundational protein language models (ESM2 and ProtT5) and an antibody-specific protein language model (PLM) (Antiberty). These models were trained in a transfer learning network to predict the outcomes in the BVP assay and the bovine serum albumin binding assay, which was developed as a complement to the BVP assay. The training was conducted on a large dataset of antibody sequences augmented with experimental conditions, which were collected through a highly efficient application system. Results The resulting models demonstrated robust performance on canonical mAbs (monospecific with heavy and light chain), bispecific Abs, and single-domain Fc (VHH-Fc). PLMs outperformed a model built using molecular descriptors calculated from AlphaFold 2 predicted structures. Embeddings from the antibody-specific and foundational PLMs resulted in similar performance. Conclusion To our knowledge, this represents the first application of PLMs to predict assay data on bispecifics and VHH-Fcs. Statement of Significance: Our study yields insights on building infrastructures to support machine learning activities and training models for critical assays in antibody discovery.
ISSN:2516-4236
2516-4236
DOI:10.1093/abt/tbae012