Bridging Gaps in Ligand Binding Affinity Prediction: Empirical Machine Learning Analyses
Journal
2025 MIPRO 48th ICT and Electronics Convention
Date Issued
2025-06-02
Author(s)
Fetaji, Fjolla
DOI
10.1109/mipro65660.2025.11132020
Abstract
Predicting ligand binding affinity remains a critical challenge in computational drug discovery, as existing techniques often require extensive computational resources and are not readily generalizable to diverse protein-ligand systems. This study addresses three key gaps in current research: (1) the lack of versatile, data-efficient predictive models; (2) insufficient strategies for integrating protein and ligand structural information; and (3) limited methods for simultaneously improving accuracy and generalizability. By systematically reviewing recent advances in machine learning-including approaches derived from deep learning, graph-based methodologies, and hybrid frameworks-we show how emerging techniques enable higher predictive power and reduced computational cost. We leverage two large-scale, public datasets (PDBBind and BindingDB) to empirically evaluate a novel dual-model framework that integrates graph-based feature extraction and neural network regression. Comparative analyses illustrate how spatial and sequence representations contribute to model performance, achieving robust improvements in binding affinity prediction. The theory-based advantages of this approach demonstrate how it reveals both small-scale and wide-ranging relationships between proteins and their ligands and the operational benefits result in quicker medication development through decreased processing needs. The research confirms the necessity of building adaptable frameworks which unite structural data with sequence data for therapeutic advancement along clear research paths.
