Surface-based Molecular Design with Multi-modal Flow Matching

Knowledge Discovery and Data Mining (KDD), 2025
Fang Wu
Zhengyuan Zhou
Shuting Jin
Xiangxiang Zeng
Jure Leskovec
Jinbo Xu
Main: 8 pages
7 figures
Bibliography: 3 pages
5 tables
Appendix: 1 page
Abstract

Therapeutic peptides show promise for targeting previously undruggable binding sites, and recent advances in deep generative models have enabled full-atom peptide co-design for specific protein receptors. However, the critical role of molecular surfaces in protein-protein interactions (PPIs) remains underexplored. To bridge this gap, we propose an omni-design peptide generation paradigm called SurfFlow, a novel surface-based generative algorithm that enables comprehensive co-design of peptide sequence, structure, and surface. SurfFlow employs a multi-modality conditional flow matching (CFM) architecture to learn distributions of surface geometries and biochemical properties, enhancing peptide binding accuracy. Evaluated on the comprehensive PepMerge benchmark, SurfFlow consistently outperforms full-atom baselines across all metrics. These results highlight the advantages of considering molecular surfaces in de novo peptide discovery and demonstrate the potential of integrating multiple protein modalities for more effective therapeutic peptide discovery.
