We are excited to announce that our paper has been accepted by Patterns and is featured as the cover article for the October 2025 issue!
This ocean symbolizes the human proteome—the complete set of proteins that carry out essential functions in our bodies. For medicine to work, it often needs to interact with a specific protein. For an estimated 90% of these proteins, however, they lack known small-molecule ligands with high activity. In the image, these proteins are represented as sailboats drifting in the dark.
At the center, stands a lighthouse symbolizing the AI method LigUnity. Its beam illuminates several sailboats, guiding them toward glowing buoys, which symbolize ligands with high activity found by LigUnity. The work by Feng et al. highlights the power of AI-driven computational methods to efficiently find active ligands and optimize their activity, opening up new therapeutic avenues for various diseases.
Structure-based drug discovery involves two critical, sequential tasks: virtual screening to identify active compounds and hit-to-lead optimization to refine their potency. Existing computational methods often treat these tasks separately due to their conflicting speed-accuracy requirements. This separation prevents the synergy that could arise from a unified approach. We introduce LigUnity, a protein-ligand affinity foundation model that jointly addresses both tasks. LigUnity learns a shared embedding space for protein pockets and ligands by capturing both coarse-grained active/inactive distinctions (scaffold discrimination) and fine-grained affinity rankings (pharmacophore ranking). To enable this, we curated PocketAffDB, the largest structure-aware affinity database to date, containing 0.8 million data points. Our evaluations show that LigUnity sets a new state-of-the-art, outperforming 24 methods in virtual screening and approaching the accuracy of costly physics-based methods like FEP+ in hit-to-lead optimization, all while being 106 times faster than traditional docking.
LigUnity's core innovation is its hierarchical approach to learning a shared pocket-ligand embedding space. This allows it to understand both global structure-activity relationships and subtle, affinity-determining chemical features. The entire pipeline is illustrated below.
On the DUD-E, Dekois 2.0, and LIT-PCBA benchmarks, LigUnity consistently and significantly outperforms 24 competing methods, including docking programs and other ML models. It achieves over a 50% improvement in Enrichment Factor (EF 1%) compared to the next-best structure-based methods and shows strong generalization to novel protein targets.
On two FEP benchmarks (Merck and JACS), LigUnity shows state-of-the-art performance in predicting binding free energies. When fine-tuned on just a few data points, LigUnity's accuracy becomes comparable to FEP+, a computationally intensive industry standard, positioning it as a powerful, cost-effective alternative. Importance scores calculated by the model also correctly identify key atoms and residues responsible for binding.
@article{Feng2025LigUnity,
title = {Hierarchical affinity landscape navigation through learning a shared pocket-ligand space},
author = {Feng, Bin and Liu, Zijing and Li, Hao and Yang, Mingjun and Zou, Junjie and Cao, He and Li, Yu and Zhang, Lei and Wang, Sheng},
journal = {Patterns},
volume = {6},
pages = {101371},
year = {2025},
publisher = {Elsevier},
doi = {10.1016/j.patter.2025.101371}
}