Multimodal Contrast-Enhanced Molecular Representation Learning and Property Prediction
Abstract
1. Introduction
2. Materials and Methods
2.1. 2D Encoder Module
2.2. 3D Encoder Module
2.3. Molecular Fingerprints Encoder Module
2.4. Fusion Module
2.5. Contrastive Learning Module
3. Results
3.1. Overview of MCMRL
3.2. Molecular Property Prediction Performance
3.3. Evaluation of Contrastive Learning Pre-Training and Multimodal Fusion
3.4. Investigation on MCMRL’s Graph Representation
3.5. Case Study: Potential Drug for DRD2
4. Discussion
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Appendix A. Details of Molecular Datasets
| Dataset | Molecules | Tasks | Task Type | Metric | Split |
|---|---|---|---|---|---|
| BBBP | 2039 | 1 | Classification | ROC-AUC | Scaffold |
| Tox21 | 7831 | 12 | Classification | ROC-AUC | Scaffold |
| ClinTox | 1478 | 2 | Classification | ROC-AUC | Scaffold |
| HIV | 41,127 | 1 | Classification | ROC-AUC | Scaffold |
| BACE | 1513 | 1 | Classification | ROC-AUC | Scaffold |
| SIDER | 1427 | 27 | Classification | ROC-AUC | Scaffold |
| MUV | 93,087 | 17 | Classification | ROC-AUC | Scaffold |
| FreeSolv | 642 | 1 | Regression | RMSE | Scaffold |
| ESOL | 1128 | 1 | Regression | RMSE | Scaffold |
| Lipo | 4200 | 1 | Regression | RMSE | Scaffold |
| QM7 | 6830 | 1 | Regression | MAE | Scaffold |
| QM8 | 21,786 | 12 | Regression | MAE | Scaffold |
| QM9 | 130,829 | 8 | Regression | MAE | Random |
| Embedding Method | Feature Name | Description | Size |
|---|---|---|---|
| 2D graph | Atomic number | Type of atom, by atomic number (one-hot) | 119 |
| Chirality | CW, CCW, unspecified or other (one-hot) | 4 | |
| Bond type | Single, double, triple or aromatic (one-hot) | 4 | |
| Bond direction | Begin dash, begin wedge, etc. (one-hot) | 3 | |
| 3D graph | Atomic number | Type of atom, by atomic number (one-hot) | 119 |
| coordinate | Node coordinates (float) | - | |
| Fingerprint | MACCS | A fingerprint based on a substructure key using SMARTS mode | 166 |
| PubChem | A substructure-based fingerprint offering broad coverage of chemical structures | 881 | |
| Pharmacophore ErG | Encoding of Extended Reduced Graph (ErG) and pharmacodynamic node descriptions | 442 |
Appendix B. Visualization of MCMRL Representations

Appendix C. Comparison of Single-Molecule Fingerprint Performance

Appendix D. Details of Molecular Docking Procedure

References
- Wieder, O.; Kohlbacher, S.; Kuenemann, M.; Garon, A.; Ducrot, P.; Seidel, T.; Langer, T. A Compact Review of Molecular Property Prediction with Graph Neural Networks. Drug Discov. Today Technol. 2020, 37, 1–12. [Google Scholar] [CrossRef]
- Shen, J.; Nicolaou, C.A. Molecular Property Prediction: Recent Trends in the Era of Artificial Intelligence. Drug Discov. Today Technol. 2019, 32–33, 29–36. [Google Scholar] [CrossRef]
- Weininger, D. SMILES, a Chemical Language and Information System. 1. Introduction to Methodology and Encoding Rules. J. Chem. Inf. Comput. Sci. 1988, 28, 31–36. [Google Scholar] [CrossRef]
- Rogers, D.; Hahn, M. Extended-Connectivity Fingerprints. J. Chem. Inf. Model. 2010, 50, 742–754. [Google Scholar] [CrossRef]
- RDKit. Available online: https://www.rdkit.org/ (accessed on 10 November 2025).
- Bohacek, R.S.; McMartin, C.; Guida, W.C. The Art and Practice of Structure-Based Drug Design: A Molecular Modeling Perspective. Med. Res. Rev. 1996, 16, 3–50. [Google Scholar] [CrossRef]
- Fang, X.; Liu, L.; Lei, J.; He, D.; Zhang, S.; Zhou, J.; Wang, F.; Wu, H.; Wang, H. Geometry-Enhanced Molecular Representation Learning for Property Prediction. Nat. Mach. Intell. 2022, 4, 127–134. [Google Scholar] [CrossRef]
- LeCun, Y.; Bengio, Y.; Hinton, G. Deep Learning. Nature 2015, 521, 436–444. [Google Scholar] [CrossRef]
- Kipf, T.N.; Welling, M. Semi-Supervised Classification with Graph Convolutional Networks. arXiv 2016, arXiv:1609.02907. [Google Scholar]
- Xu, K.; Hu, W.; Leskovec, J.; Jegelka, S. How Powerful Are Graph Neural Networks? In Proceedings of the International Conference on Learning Representations, New Orleans, LA, USA, 6–9 May 2019. [Google Scholar]
- Huang, K.; Fu, T.; Glass, L.M.; Zitnik, M.; Xiao, C.; Sun, J. DeepPurpose: A Deep Learning Library for Drug-Target Interaction Prediction. Bioinformatics 2021, 36, 5545–5547. [Google Scholar] [CrossRef]
- Rong, Y.; Bian, Y.; Xu, T.; Xie, W.; Wei, Y.; Huang, W.; Huang, J. Self-Supervised Graph Transformer on Large-Scale Molecular Data. In Proceedings of the 34th International Conference on Neural Information Processing Systems; Curran Associates Inc.: Red Hook, NY, USA, 2020; pp. 12559–12571. [Google Scholar]
- Shindo, H.; Matsumoto, Y. Gated Graph Recursive Neural Networks for Molecular Property Prediction. arXiv 2019, arXiv:1909.00259. [Google Scholar] [CrossRef]
- Xiong, Z.; Wang, D.; Liu, X.; Zhong, F.; Wan, X.; Li, X.; Li, Z.; Luo, X.; Chen, K.; Jiang, H.; et al. Pushing the Boundaries of Molecular Representation for Drug Discovery with the Graph Attention Mechanism. J. Med. Chem. 2020, 63, 8749–8760. [Google Scholar] [CrossRef]
- Yang, K.; Swanson, K.; Jin, W.; Coley, C.; Eiden, P.; Gao, H.; Guzman-Perez, A.; Hopper, T.; Kelley, B.; Mathea, M.; et al. Analyzing Learned Molecular Representations for Property Prediction. J. Chem. Inf. Model. 2019, 59, 3370–3388, Erratum in J. Chem. Inf. Model. 2019, 59, 5304–5305. [Google Scholar] [CrossRef] [PubMed]
- Lu, C.; Liu, Q.; Wang, C.; Huang, Z.; Lin, P.; He, L. Molecular Property Prediction: A Multilevel Quantum Interactions Modeling Perspective. AAAI 2019, 33, 1052–1060. [Google Scholar] [CrossRef]
- Wang, Y.; Wang, J.; Cao, Z.; Barati Farimani, A. Molecular Contrastive Learning of Representations via Graph Neural Networks. Nat. Mach. Intell. 2022, 4, 279–287. [Google Scholar] [CrossRef]
- Zhang, Z.; Liu, Q.; Wang, H.; Lu, C.; Lee, C.-K. Motif-Based Graph Self-Supervised Learning for Molecular Property Prediction. In Proceedings of the Advances in Neural Information Processing Systems; Curran Associates, Inc.: Red Hook, NY, USA, 2021; Volume 34, pp. 15870–15882. [Google Scholar]
- Chen, J.; Zheng, S.; Song, Y.; Rao, J.; Yang, Y. Learning Attributed Graph Representation with Communicative Message Passing Transformer. In Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence; International Joint Conferences on Artificial Intelligence Organization: Montreal, QC, Canada, 2021; pp. 2242–2248. [Google Scholar]
- Devlin, J.; Chang, M.-W.; Lee, K.; Toutanova, K. BERT: Pre-Training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Minneapolis, MN, USA, 2–7 June 2019. [Google Scholar]
- He, P.; Liu, X.; Gao, J.; Chen, W. Deberta: Decoding-Enhanced Bert with Disentangled Attention. In Proceedings of the International Conference on Learning Representations, Virtual Event, Austria, 3–7 May 2021. [Google Scholar]
- Doersch, C.; Gupta, A.; Efros, A.A. Unsupervised Visual Representation Learning by Context Prediction. In Proceedings of the 2015 IEEE International Conference on Computer Vision (ICCV); IEEE: Santiago, Chile, 2015; pp. 1422–1430. [Google Scholar]
- Shanker, V.R.; Bruun, T.U.J.; Hie, B.L.; Kim, P.S. Unsupervised Evolution of Protein and Antibody Complexes with a Structure-Informed Language Model. Science 2024, 385, 46–53. [Google Scholar] [CrossRef]
- Hu, W.; Liu, B.; Gomes, J.; Zitnik, M.; Liang, P.; Pande, V.; Leskovec, J. Strategies for Pre-Training Graph Neural Networks. In Proceedings of the International Conference on Learning Representations 2020, Addis Ababa, Ethiopia, 26 April–1 May 2020. [Google Scholar]
- Liu, S.; Wang, H.; Liu, W.; Lasenby, J.; Guo, H.; Tang, J. Pre-Training Molecular Graph Representation with 3d Geometry. In Proceedings of the International Conference on Learning Representations 2022, Online, 25–29 April 2022. [Google Scholar]
- Li, B.; Lin, M.; Chen, T.; Wang, L. FG-BERT: A Generalized and Self-Supervised Functional Group-Based Molecular Representation Learning Framework for Properties Prediction. Brief. Bioinform. 2023, 24, bbad398. [Google Scholar] [CrossRef]
- Dwivedi, V.P.; Rampášek, L.; Galkin, M.; Parviz, A.; Wolf, G.; Luu, A.T.; Beaini, D. Long Range Graph Benchmark. arXiv 2022, arXiv:2206.08164. [Google Scholar]
- Wu, Z.; Jain, P.; Wright, M.; Mirhoseini, A.; Gonzalez, J.E.; Stoica, I. Representing Long-Range Context for Graph Neural Networks with Global Attention. In Proceedings of the Advances in Neural Information Processing Systems; Curran Associates, Inc.: Red Hook, NY, USA, 2021; Volume 34, pp. 13266–13279. [Google Scholar]
- Chen, B.; Barzilay, R.; S Jaakkola, T. Path-Augmented Graph Transformer Network. arXiv 2019, arXiv:1905.12712. [Google Scholar] [CrossRef]
- Maziarka, Ł.; Danel, T.; Mucha, S.; Rataj, K.; Tabor, J.; Jastrzębski, S. Molecule Attention Transformer. arXiv 2020, arXiv:2002.08264. [Google Scholar] [CrossRef]
- Kreuzer, D.; Beaini, D.; Hamilton, W.; Létourneau, V.; Tossou, P. Rethinking Graph Transformers with Spectral Attention. In Proceedings of the Advances in Neural Information Processing Systems; Curran Associates, Inc.: Red Hook, NY, USA, 2021; Volume 34, pp. 21618–21629. [Google Scholar]
- Ying, C.; Cai, T.; Luo, S.; Zheng, S.; Ke, G.; He, D.; Shen, Y.; Liu, T.-Y. Do Transformers Really Perform Badly for Graph Representation? In Proceedings of the Advances in Neural Information Processing Systems; Curran Associates, Inc.: Red Hook, NY, USA, 2021; Volume 34, pp. 28877–28888. [Google Scholar]
- Li, S.; Zhou, J.; Xu, T.; Dou, D.; Xiong, H. GeomGCL: Geometric Graph Contrastive Learning for Molecular Property Prediction. AAAI 2022, 36, 4541–4549. [Google Scholar] [CrossRef]
- Li, C.; Wang, J.; Niu, Z.; Yao, J.; Zeng, X. A Spatial-Temporal Gated Attention Module for Molecular Property Prediction Based on Molecular Geometry. Brief. Bioinform. 2021, 22, bbab078. [Google Scholar] [CrossRef]
- Klicpera, J.; Groß, J.; Günnemann, S. Directional Message Passing for Molecular Graphs. In Proceedings of the International Conference on Learning Representations 2020, Addis Ababa, Ethiopia, 26 April–1 May 2020. [Google Scholar]
- Liu, Y.; Wang, L.; Liu, M.; Zhang, X.; Oztekin, B.; Ji, S. Spherical Message Passing for 3D Graph Networks. arXiv 2021, arXiv:2102.05013. [Google Scholar]
- Chen, T.; Kornblith, S.; Norouzi, M.; Hinton, G. A Simple Framework for Contrastive Learning of Visual Representations. arXiv 2020, arXiv:2002.05709. [Google Scholar] [CrossRef]
- Schütt, K.T.; Kindermans, P.-J.; Sauceda, H.E.; Chmiela, S.; Tkatchenko, A.; Müller, K.-R. SchNet: A Continuous-Filter Convolutional Neural Network for Modeling Quantum Interactions. arXiv 2017, arXiv:1706.08566. [Google Scholar]
- Sun, R.; Dai, H.; Yu, A.W. Does GNN Pretraining Help Molecular Representation? arXiv 2022, arXiv:2207.06010. [Google Scholar] [CrossRef]
- van den Oord, A.; Li, Y.; Vinyals, O. Representation Learning with Contrastive Predictive Coding. arXiv 2018, arXiv:1807.03748. [Google Scholar]
- Schneider, N.; Sayle, R.A.; Landrum, G.A. Get Your Atoms in Order An Open-Source Implementation of a Novel and Robust Molecular Canonicalization Algorithm. J. Chem. Inf. Model. 2015, 55, 2111–2120. [Google Scholar] [CrossRef]
- Durant, J.L.; Leland, B.A.; Henry, D.R.; Nourse, J.G. Reoptimization of MDL Keys for Use in Drug Discovery. J. Chem. Inf. Comput. Sci. 2002, 42, 1273–1280. [Google Scholar] [CrossRef]
- Stiefl, N.; Watson, I.A.; Baumann, K.; Zaliani, A. ErG: 2D Pharmacophore Descriptions for Scaffold Hopping. J. Chem. Inf. Model. 2006, 46, 208–220. [Google Scholar] [CrossRef]
- Bolton, E.E.; Wang, Y.; Thiessen, P.A.; Bryant, S.H. PubChem: Integrated Platform of Small Molecules and Biological Activities. In Annual Reports in Computational Chemistry; Elsevier: Amsterdam, The Netherlands, 2008; Volume 4, pp. 217–241. [Google Scholar]
- Cai, H.; Zhang, H.; Zhao, D.; Wu, J.; Wang, L. FP-GNN: A Versatile Deep Learning Architecture for Enhanced Molecular Property Prediction. Brief. Bioinform. 2022, 23, bbac408. [Google Scholar] [CrossRef]
- Hao, Y.; Chen, X.; Fei, A.; Jia, Q.; Chen, Y.; Shao, J.; Pandiyan, S.; Wang, L. SG-ATT: A Sequence Graph Cross-Attention Representation Architecture for Molecular Property Prediction. Molecules 2024, 29, 492. [Google Scholar] [CrossRef]
- Kim, S.; Chen, J.; Cheng, T.; Gindulyte, A.; He, J.; He, S.; Li, Q.; Shoemaker, B.A.; Thiessen, P.A.; Yu, B.; et al. PubChem 2019 Update: Improved Access to Chemical Data. Nucleic Acids Res. 2019, 47, D1102–D1109. [Google Scholar] [CrossRef]
- Wu, Z.; Ramsundar, B.; Feinberg, E.N.; Gomes, J.; Geniesse, C.; Pappu, A.S.; Leswing, K.; Pande, V. MoleculeNet: A Benchmark for Molecular Machine Learning. Chem. Sci. 2018, 9, 513–530. [Google Scholar] [CrossRef]
- Shen, W.X.; Zeng, X.; Zhu, F.; Wang, Y.L.; Qin, C.; Tan, Y.; Jiang, Y.Y.; Chen, Y.Z. Out-of-the-Box Deep Learning Prediction of Pharmaceutical Properties by Broadly Learned Knowledge-Based Molecular Representations. Nat. Mach. Intell. 2021, 3, 334–343. [Google Scholar] [CrossRef]
- van der Maaten, L.; Hinton, G. Visualizing Data Using T-SNE. J. Mach. Learn. Res. 2008, 9, 2579–2605. [Google Scholar]
- Rose, P.W.; Prlić, A.; Altunkaya, A.; Bi, C.; Bradley, A.R.; Christie, C.H.; Costanzo, L.D.; Duarte, J.M.; Dutta, S.; Feng, Z.; et al. The RCSB Protein Data Bank: Integrative View of Protein, Gene and 3D Structural Information. Nucleic Acids Res. 2017, 45, D271–D281. [Google Scholar] [PubMed]
- Eberhardt, J.; Santos-Martins, D.; Tillack, A.F.; Forli, S. AutoDock Vina 1.2.0: New Docking Methods, Expanded Force Field, and Python Bindings. J. Chem. Inf. Model. 2021, 61, 3891–3898. [Google Scholar] [CrossRef]
- Wang, S.; Che, T.; Levit, A.; Shoichet, B.K.; Wacker, D.; Roth, B.L. Structure of the D2 Dopamine Receptor Bound to the Atypical Antipsychotic Drug Risperidone. Nature 2018, 555, 269–273. [Google Scholar] [CrossRef]
- DeLano, W.L.; Scientific, D.; Carlos, S. PyMOL: An Open-Source Molecular Graphics Tool. CCP4 Newsl Protein Crystallogr 2002, 40, 82–92. [Google Scholar]
- Ramakrishnan, R.; Hartmann, M.; Tapavicza, E.; Von Lilienfeld, O.A. Electronic Spectra from TDDFT and Machine Learning in Chemical Space. J. Chem. Phys. 2015, 143, 084111. [Google Scholar] [CrossRef]
- Ruddigkeit, L.; Van Deursen, R.; Blum, L.C.; Reymond, J.-L. Enumeration of 166 Billion Organic Small Molecules in the Chemical Universe Database GDB-17. J. Chem. Inf. Model. 2012, 52, 2864–2875. [Google Scholar] [CrossRef]
- Ramakrishnan, R.; Dral, P.O.; Rupp, M.; Von Lilienfeld, O.A. Quantum Chemistry Structures and Properties of 134 Kilo Molecules. Sci. Data 2014, 1, 140022. [Google Scholar] [CrossRef]







| Dataset | BBBP | Tox21 | ClinTox | HIV | BACE | SIDER | MUV |
|---|---|---|---|---|---|---|---|
| GCN [9] | 71.8 ± 0.9 | 70.9 ± 2.6 | 62.5 ± 2.8 | 74.0 ± 3.0 | 71.6 ± 2.0 | 53.6 ± 3.2 | 71.6 ± 4.0 |
| GIN [10] | 65.8 ± 4.5 | 74.0 ± 0.8 | 58.0 ± 4.4 | 75.3 ± 1.9 | 70.1 ± 5.4 | 57.3 ± 1.6 | 71.8 ± 2.5 |
| PretrainGNN [24] | 68.7 ± 1.3 | 78.1 ± 0.6 | 87.6 ± 1.5 | 71.1 ± 0.5 | 84.5 ± 0.7 | 62.7 ± 0.8 | 80.1 ± 2.1 |
| Attentive FP [14] | 64.3 ± 1.8 | 76.1 ± 0.5 | 84.7 ± 0.3 | 75.7 ± 1.4 | 78.4 ± 0.0 | 60.6 ± 3.2 | 76.6 ± 1.5 |
| D-MPNN [15] | 71.2 ± 3.8 | 68.9 ± 1.3 | 90.5 ± 5.3 | 75.0 ± 2.1 | 85.3 ± 5.3 | 63.2 ± 2.3 | 76.2 ± 2.8 |
| MGCN [16] | 85.0 ± 6.4 | 70.7 ± 1.6 | 63.4 ± 4.2 | 73.8 ± 1.6 | 73.4 ± 3.0 | 55.2 ± 1.8 | 70.2 ± 3.4 |
| GraphMVP [25] | 72.4 ± 1.6 | 75.9 ± 0.5 | 79.1 ± 2.8 | 77.0 ± 1.2 | 81.2 ± 0.9 | 63.9 ± 1.2 | 77.7 ± 1.9 |
| FG-BERT [26] | 70.2 ± 0.9 | 78.4 ± 0.8 | 83.2 ± 1.6 | 77.4 ± 1.0 | 84.5 ± 1.5 | 64.0 ± 0.7 | 75.3 ± 2.4 |
| MCMRL | 74.1 ± 0.6 | 79.7 ± 1.2 | 91.3 ± 1.8 | 80.3 ± 2.1 | 84.6 ± 0.4 | 67.6 ± 0.7 | 81.8 ± 1.0 |
| Dataset | FreeSolv | ESOL | Lipo | QM7 | QM8 | QM9 |
|---|---|---|---|---|---|---|
| GCN [9] | 2.87 ± 0.14 | 1.43 ± 0.05 | 1.43 ± 0.05 | 122.9 ± 2.2 | 0.037 ± 0.001 | 5.796 ± 1.969 |
| GIN [10] | 2.76 ± 0.18 | 1.45 ± 0.02 | 0.85 ± 0.07 | 124.8 ± 0.7 | 0.037 ± 0.001 | 4.741 ± 0.912 |
| PretrainGNN [24] | 2.76 ± 0.02 | 1.10 ± 0.06 | 0.74 ± 0.10 | 113.2 ± 6.0 | 0.020 ± 0.002 | 4.081 ± 0.001 |
| Attentive FP [14] | 2.07 ± 0.18 | 0.98 ± 0.02 | 0.72 ± 0.10 | 72.0 ± 2.7 | 0.018 ± 0.001 | 2.156 ± 0.001 |
| D-MPNN [15] | 2.18 ± 0.91 | 0.98 ± 0.26 | 0.65 ± 0.05 | 105.8 ± 13.2 | 0.018 ± 0.002 | 3.241 ± 0.119 |
| MGCN [16] | 3.35 ± 0.01 | 1.27 ± 0.15 | 1.11 ± 0.04 | 77.6 ± 4.7 | 0.022 ± 0.002 | 0.050 ± 0.002 |
| GraphMVP [25] | — | 1.029 | 0.681 | — | — | — |
| FG-BERT [26] | — | 0.94 ± 0.03 | 0.66 ± 0.01 | — | — | — |
| MCMRL | 1.79 ± 0.25 | 0.91 ± 0.01 | 0.65 ± 0.05 | 89.0 ± 3.2 | 0.017 ± 0.001 | 2.095 ± 0.216 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Luo, H.; He, J.; Liu, Z.; Zeng, C. Multimodal Contrast-Enhanced Molecular Representation Learning and Property Prediction. Biophysica 2026, 6, 24. https://doi.org/10.3390/biophysica6020024
Luo H, He J, Liu Z, Zeng C. Multimodal Contrast-Enhanced Molecular Representation Learning and Property Prediction. Biophysica. 2026; 6(2):24. https://doi.org/10.3390/biophysica6020024
Chicago/Turabian StyleLuo, Hong, Jie He, Zhichao Liu, and Chen Zeng. 2026. "Multimodal Contrast-Enhanced Molecular Representation Learning and Property Prediction" Biophysica 6, no. 2: 24. https://doi.org/10.3390/biophysica6020024
APA StyleLuo, H., He, J., Liu, Z., & Zeng, C. (2026). Multimodal Contrast-Enhanced Molecular Representation Learning and Property Prediction. Biophysica, 6(2), 24. https://doi.org/10.3390/biophysica6020024

