Cambridge Healthtech Institute’s 4th Annual

Machine Learning for Protein Engineering Part 2

Demonstrating Value and Putting Theory into Practice

13 November 2025 ALL TIMES WET (GMT/UTC)


Cambridge Healthtech Institute's Machine Learning for Protein Engineering Part 2 track at PEGS Europe examines and measures the real impact of using ML/AI techniques by developing benchmarks, experimental validation, data curation, standards, and controls to guide the process for implementing these tools throughout the drug discovery and development pipeline. Case studies comparing experimental approaches with computational and machine learning approaches will shed light on how to adapt and utilize them for discovery, prediction, developability, simulation, and optimization of biologics.

Scientific Advisory Board
M. Frank Erasmus, PhD, Head, Bioinformatics, Specifica, Inc.
Victor Greiff, PhD, Associate Professor, Oslo University Hospital and Director, Computational Immunology, IMPRINT
Maria Wendt, PhD, Global Head of Preclinical Computational Innovation Strategy, Research Platforms, R&D, Sanofi

Recommended Short Course*
Monday, 4 November, 14:00 – 17:00
SC4: In silico and Machine Learning Tools for Antibody Design and Developability Predictions
*Separate registration required. See short courses page for details. All short courses take place in-person only.





Thursday, 13 November

Registration and Morning Coffee

ML APPROACHES TO OPTIMISATION AND DEVELOPABILITY OF ANTIBODIES

Chairperson's Remarks

Photo of M. Frank Erasmus, PhD, Head, Bioinformatics, Specifica, an IQVIA business , Director/Head , Bioinformatics , Specifica, Inc.
M. Frank Erasmus, PhD, Head, Bioinformatics, Specifica, an IQVIA business , Director/Head , Bioinformatics , Specifica, Inc.

Predicting Nonspecificity in Therapeutic Antibody Formats Using Structure-Informed Machine Learning Models

Photo of Paolo Marcatili, PhD, Head, Antibody Design, Novo Nordisk , Head , Antibody Design , Novo Nordisk
Paolo Marcatili, PhD, Head, Antibody Design, Novo Nordisk , Head , Antibody Design , Novo Nordisk

This presentation examines how AI-driven computational frameworks—combining sequence, structural, and biophysical data—can predict nonspecific binding and developability risks in therapeutic antibodies and related formats. By integrating protein language models, inverse folding approaches, and dynamic structural features (simulated or ML-derived), we demonstrate how these tools identify molecular liabilities, and in turn how these models can impact the DMTA cycle by enhancing hit selection, guide optimisation, and de-risk development.

Humanness and VHH-Nativeness Assessment with AbNatiV2 & SpyBLI for Rapid Quantification of Binding Kinetics

Photo of Pietro Sormanni, PhD, Group Leader, Royal Society University Research Fellow, Chemistry of Health, Yusuf Hamied Department of Chemistry, University of Cambridge , Royal Society Univ Research Fellow , Yusuf Hamied Department of Chemistry , University of Cambridge
Pietro Sormanni, PhD, Group Leader, Royal Society University Research Fellow, Chemistry of Health, Yusuf Hamied Department of Chemistry, University of Cambridge , Royal Society Univ Research Fellow , Yusuf Hamied Department of Chemistry , University of Cambridge

Designing antibodies for pre-selected epitopes is hampered by scarce structural data and slow, costly experimental validation or noisy high-throughput screens. I will present a data-efficient workflow that fuses sequence and structural embeddings with data augmentation to design antibodies predicted to engage user-defined epitopes, addressing practical questions on minimal data requirements and useful in silico metrics. To close the Design-Make-Test loop, I will introduce SpyBLI—a kinetic screen that reports kon, koff, and KD bypassing purification and concentration-determination steps, giving DNA-to-data in less than 24 hours. Benchmarking across affinities spanning six orders of magnitude confirms its reliability, enabling rapid, quantitative validation of epitope-specific antibodies.

Coffee Break in the Exhibit Hall with Poster Viewing

Generative Design of Antibodies with Programmable Fc Functional Profiles

Photo of Edward B. Irvine, PhD, Postdoctoral Scientist, Sai Reddy Group, Laboratory for Systems and Synthetic Immunology, ETH Zürich , Postdoc Scientist , Biosystems Science & Engineering , ETH Zurich
Edward B. Irvine, PhD, Postdoctoral Scientist, Sai Reddy Group, Laboratory for Systems and Synthetic Immunology, ETH Zürich , Postdoc Scientist , Biosystems Science & Engineering , ETH Zurich

Antibodies bridge adaptive and innate immunity through their constant (Fc) domains, yet most of Fc sequence space remains unexplored due to experimental constraints. To address this, we developed a machine learning-guided platform for Fc-engineering. By integrating the screening of synthetic Fc-libraries with next-generation sequencing and deep learning, we can accurately predict antibody functional activity from sequence, and computationally design antibodies with bespoke functional profiles, unlocking new possibilities for precision immunotherapy.

A Machine Learning Approach to Improving Antibody Developability

Photo of Paul MacDonald, PhD, Data Scientist, Protein Design Informatics, GSK , Senior Principal Scientist , Protein Design Informatics , GSK
Paul MacDonald, PhD, Data Scientist, Protein Design Informatics, GSK , Senior Principal Scientist , Protein Design Informatics , GSK

Machine learning optimizes biotherapeutics by evaluating antibody developability, focusing on stability, functionality, and safety. In silico assessments streamline discovery by deselecting problematic antibodies early. Predictive models can optimise libraries toward designs with fewer liabilities. Evaluating these models hinges on new, representative data, with an emphasis on generalisation to novel paratopes. Deliberate data partitioning and appropriate evaluation metrics are critical to achieving this and are the focus of this talk.

Antibody DomainBed: Out-of-Distribution Generalisation in Therapeutic Protein Design

Photo of Ji Won Park, PhD, Principal ML Scientist, Prescient Design, Genentech , Principal ML Scientist , Prescient Design (AI for Drug Discovery) , Genentech
Ji Won Park, PhD, Principal ML Scientist, Prescient Design, Genentech , Principal ML Scientist , Prescient Design (AI for Drug Discovery) , Genentech

Machine Learning accelerates drug design using surrogate models to predict key molecular properties. An iterative experimental feedback introduces distribution shifts in design cycles. Domain generalisation classifies antibody–antigen stability across five design cycles. Foundational models and ensembling, improve performance on out-of-distribution data. A public codebase and dataset extend the DomainBed benchmark for realistic shift simulation.

Luncheon in the Exhibit Hall with Last Chance for Poster Viewing

BENCHMARKING AND DATA CURATION

Chairperson's Remarks

Photo of Monica L. Fernandez-Quintero, PhD, Staff Scientist, Integrative Structural and Computational Biology Department, Scripps Research Institute , Staff Scientist , Integrative structural and computational biology , Scripps Research Institute
Monica L. Fernandez-Quintero, PhD, Staff Scientist, Integrative Structural and Computational Biology Department, Scripps Research Institute , Staff Scientist , Integrative structural and computational biology , Scripps Research Institute

KEYNOTE PRESENTATION: AI for Antibody Design - Going Beyond the Static

Photo of Charlotte M. Deane, PhD, Professor, Structural Bioinformatics, Statistics, University of Oxford; Executive Chair, Engineering and Physical Sciences Research Council (EPSRC) , Prof Structural Bioinformatics , Statistics , Oxford University
Charlotte M. Deane, PhD, Professor, Structural Bioinformatics, Statistics, University of Oxford; Executive Chair, Engineering and Physical Sciences Research Council (EPSRC) , Prof Structural Bioinformatics , Statistics , Oxford University

We can now computationally predict a single, static protein structure with high accuracy. However, we are not yet able to reliably predict structural flexibility. This ability to adapt their shape can be fundamental to their functional properties. A major factor limiting such predictions is the scarcity of suitable training data. I will show novel tools and databases that help to overcome this.

Scaling Foundation Models for Protein Generation

Photo of Ali Madani, PhD, Founder and CEO, Profluent Bio , CEO, Founder , Profluent Bio
Ali Madani, PhD, Founder and CEO, Profluent Bio , CEO, Founder , Profluent Bio

Language models learn powerful representations of protein biology. We introduce a new foundation model suite that directly investigates scaling effects for protein generation. We then apply this for applications in antibody and gene editor design.

The AIntibody Challenge: An Update on the Use of AI/ML in Antibody Discovery

Photo of Andrew R.M. Bradbury, MD, PhD, CSO, Specifica, an IQVIA business , CSO , Specifica, Inc.
Andrew R.M. Bradbury, MD, PhD, CSO, Specifica, an IQVIA business , CSO , Specifica, Inc.
Photo of M. Frank Erasmus, PhD, Head, Bioinformatics, Specifica, an IQVIA business , Director/Head , Bioinformatics , Specifica, Inc.
M. Frank Erasmus, PhD, Head, Bioinformatics, Specifica, an IQVIA business , Director/Head , Bioinformatics , Specifica, Inc.

The AIntibody competition was launched to benchmark real-world performance, and potential value, of artificial intelligence (AI) models in antibody discovery through a blinded, prospective experimental design. In the inaugural challenge, 33 organizations submitted 527 antibody sequences responding to three tasks focused on RBD, the most studied protein in history: (1) in silico affinity maturation from early round NGS selection outputs, (2) affinity ranking from NGS selection outputs, and (3) de novo generative design of CDRs based on selection outputs. All sequences were expressed as full-length IgGs and experimentally tested for binding affinity using surface plasmon resonance (SPR) and developability. The affinities of the highest affinity antibodies were further validated by KinExA. This talk will provide an overview of the NGS data used in the competition and final results of the competition.

Session Break

NEW METHODS TO UNCOVER NEW BIOLOGY AND DRUG TARGETS: SHIFTING FROM DISCOVERY TO DESIGN

Chairperson's Remarks

Photo of Monica L. Fernandez-Quintero, PhD, Staff Scientist, Integrative Structural and Computational Biology Department, Scripps Research Institute , Staff Scientist , Integrative structural and computational biology , Scripps Research Institute
Monica L. Fernandez-Quintero, PhD, Staff Scientist, Integrative Structural and Computational Biology Department, Scripps Research Institute , Staff Scientist , Integrative structural and computational biology , Scripps Research Institute

Unraveling Structure-Function Relationships of Entire Protein Families Using Alphafold

Photo of Luigi Vitagliano, PhD, Research Director, Institute of Biostructure and Bioimaging, Department of Biomedical Science, National Research Council Italy , Research Director , Institute of Biostructures and Bioimaging , National Research Council Italy
Luigi Vitagliano, PhD, Research Director, Institute of Biostructure and Bioimaging, Department of Biomedical Science, National Research Council Italy , Research Director , Institute of Biostructures and Bioimaging , National Research Council Italy

In traditional structural biology, the intrinsic technical difficulties associated with the experimental structural characterisation of biological macromolecules have frequently imposed reductionist approaches limiting the investigations of individual proteins. The rapid determination of protein structures starting from their sequences, assured by computational approaches based on machine learning, allows now the simultaneous elucidation of structure-function relationships in entire families. Illustrative examples will be provided for proteins (KCTDs/CHCHD4) involved in key physio-pathological processes.

Artificial Intelligence in the Creation of Precision Therapeutic Enzymes that Target Pathogenic Immunoglobulins

Photo of Ryan Peckner, PhD, Director, Machine Learning, Seismic Therapeutic , Director , Machine Learning and Computational Biology , Seismic Therapeutic
Ryan Peckner, PhD, Director, Machine Learning, Seismic Therapeutic , Director , Machine Learning and Computational Biology , Seismic Therapeutic

Considerable unmet need exists in autoimmune, inflammatory, and allergic indications with underlying etiology related to immunoglobulins. IgG, IgE, IgM, and IgA can each play a role in disparate disease processes, and an ability to precisely target only the immunoglobulin isotype involved is crucial in striking the desired balance between efficacy and safety. Seismic has achieved this using its structure-augmented AI/ML IMPACT platform creating a Swiss Army knife of Ig degrading therapeutics.

Engineering Modular Binders Combining Machine Learning, Structural Biology, and Experimental Evolution

Photo of Erich Michel, PhD, Postdoctoral Researcher, Department of Biochemistry, University of Zurich , PostDoc Researcher , Department of Biochemistry , University of Zurich
Erich Michel, PhD, Postdoctoral Researcher, Department of Biochemistry, University of Zurich , PostDoc Researcher , Department of Biochemistry , University of Zurich

We will challenge the paradigm of selection from large universal libraries to obtain binding proteins rapidly and efficiently. When it comes to linear epitopes, we can exploit the periodicity of peptide bonds and create a completely modular system, based on a binding protein design that shares the same periodicity. Here, we present our progress on a binding protein system that is modular and complementary to a given peptide sequence.

Structure-Guided Antibody and Immunogen Design

Photo of Monica L. Fernandez-Quintero, PhD, Staff Scientist, Integrative Structural and Computational Biology Department, Scripps Research Institute , Staff Scientist , Integrative structural and computational biology , Scripps Research Institute
Monica L. Fernandez-Quintero, PhD, Staff Scientist, Integrative Structural and Computational Biology Department, Scripps Research Institute , Staff Scientist , Integrative structural and computational biology , Scripps Research Institute

Advances in protein design have enhanced our ability to engineer proteins with defined properties, functions, and structures. Here, we integrate computational protein design with structural biology to develop targeted vaccines and therapeutics for two major global health threats: influenza and malaria. For malaria, we present computationally designed multi-epitope scaffolds that combine epitopes from the circumsporozoite protein (CSP) and the blood-stage antigen PfEMP1. Informed by cryo-EM structures, we designed 10 scaffolds that bind with low nanomolar affinity to broadly neutralizing antibodies (bnAbs) targeting both major/minor CSP repeats and PfEMP1. These scaffolds also engage inferred germline antibodies, underscoring their potential as germline-targeting vaccine candidates.For influenza, we designed hemagglutinin (HA) lower stem scaffolds that bind with low nanomolar to picomolar affinity to stem-directed bnAbs such as 1D03 and 1C06. Structural analysis of 1D03-HA complexes (H1, H2) and related mAbs revealed that the HCDR1 loop is a key determinant of cross-group breadth. Guided by these insights, we engineered antibodies capable of binding H1, H2, and H5 subtypes with high affinity.Additionally, we optimized the central stem bnAb CR9114 for enhanced breadth across group 1 and group 2 Has, including H3, using a combination of language models and physics-based interface design.Together, our work highlights the power of integrating structural data, machine learning, and physics-based modeling to inform rational antibody and immunogen design for infectious disease targets.

Designing Novel Protein Interactions with Therapeutic Potential Using Learned Surface Fingerprints

Photo of Anthony Marchand, PhD, R&D Scientist, bNovate Technologies , R&D Scientist , bNovate Technologies SA
Anthony Marchand, PhD, R&D Scientist, bNovate Technologies , R&D Scientist , bNovate Technologies SA

Protein-protein interactions (PPI) are essential for most biological processes governing life. Using a geometric deep-learning framework on protein surfaces, we generated fingerprints capturing key interaction features. As a proof of concept, we designed de novo protein binders targeting proteins and protein-ligand complexes. These novel interactions could act as protein therapeutics, enhance biosensing, and enable the construction of new synthetic pathways in engineered cells.

Close of Summit


For more details on the conference, please contact:

Christina Lingham
Executive Director, Conferences and Fellow
Cambridge Healthtech Institute
Phone: (+1) 508-813-7570
Email: clingham@healthtech.com

For sponsorship information, please contact:

Companies A-K
Jason Gerardi
Sr. Manager, Business Development
Cambridge Healthtech Institute
Phone: (+1) 781-972-5452
Email: jgerardi@healthtech.com

Companies L-Z
Ashley Parsons
Manager, Business Development
Cambridge Healthtech Institute
Phone: (+1) 781-972-1340
Email: ashleyparsons@healthtech.com