Leveraging Bag Dissimilarity Regularized Multi-Instance Learning for Analyzing Infrared Spectra of Heterogeneous Objects
Abstract
1. Introduction
2. Results
2.1. Implementation Details
2.2. Performance on PDMS Block Assessment
2.2.1. Overall Comparison
2.2.2. Hyper Parameter Analysis
2.2.3. Analysis of Spectral Regions
2.3. PET Fiber Inspection
2.3.1. Overall Comparison
2.3.2. Hyper Parameter Analysis
2.3.3. Analysis of Spectral Regions
3. Discussion
4. Materials and Methods
4.1. Bag Dissimilarity Regularized MIL
4.2. BDR-SVM
4.3. PDMS Block Assessment
4.4. PET Fiber Data Acquisition
4.5. Generative AI Disclosure
5. Conclusions
Supplementary Materials
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
| MIL | Multiple instance learning |
| IR | Infrared |
| IS | Instance space |
| BS | Bag space |
| ES | Embedded space |
| BDR | Bag dissimilarity regularization/regularized |
References
- Liu, H.; Tian, L.; Wang, L.; Zhang, Z.; Li, J.; Liu, X.; Zheng, B.; Ma, H.; Wang, Y.; Li, J. Real-time grading of roasted tobacco using near infrared spectroscopy technology. Microchem. J. 2024, 204, 110963. [Google Scholar] [CrossRef]
- Chrimatopoulos, C.; Tummino, M.L.; Iliadis, E.; Sakkas, C.T. Attenuated Total Reflection Fourier Transform Infrared Spectroscopy and Chemometrics for the Discrimination of Animal Hair Fibers for the Textile Sector. Appl. Spectrosc. 2025, 79, 1173–1184. [Google Scholar] [CrossRef]
- Rish, A.J.; Kurt, C.; Assis, J.M.; Rehrauer, O.; Rangel-Gil, R.S.; Taylor, E. Evaluation of Calibration Burden for Monitoring of a Pharmaceutical Continuous Manufacturing Line using Near-Infrared Spectroscopy. Int. J. Pharm. 2025, 673, 125419. [Google Scholar] [CrossRef] [PubMed]
- Tarapoulouzi, M.; Theocharis, C.R. Discrimination of Anari cheese samples in comparison with Halloumi cheese samples regarding the origin of the species by FTIR measurements and chemometrics. Analytica 2023, 4, 374–384. [Google Scholar] [CrossRef]
- Dietterich, T.G.; Lathrop, R.H.; Lozano-Pérez, T. Solving the multiple instance problem with axis-parallel rectangles. Artif. Intell. 1997, 89, 31–71. [Google Scholar] [CrossRef]
- Jang, J.; Kwon, H.Y. Are multiple instance learning algorithms learnable for instances? In Proceedings of the NIPS ’24: 38th International Conference on Neural Information Processing Systems, Vancouver, BC, Canada, 10–15 December 2024; Curran Associates Inc.: Red Hook, NY, USA, 2024. [Google Scholar]
- Eksi, R.; Li, H.D.; Menon, R.; Wen, Y.; Omenn, G.S.; Kretzler, M.; Guan, Y. Systematically differentiating functions for alternatively spliced isoforms through integrating RNA-seq data. PLoS Comput. Biol. 2013, 9, e1003314. [Google Scholar] [CrossRef]
- Shen, L.C.; Zhang, Y.; Wang, Z.; Littler, D.R.; Liu, Y.; Tang, J.; Rossjohn, J.; Yu, D.J.; Song, J. Self-iterative multiple-instance learning enables the prediction of CD4+ T cell immunogenic epitopes. Nat. Mach. Intell. 2025, 7, 1250–1265. [Google Scholar] [CrossRef]
- Shi, J.; Li, C.; Gong, T.; Zheng, Y.; Fu, H. ViLa-MIL: Dual-scale Vision-Language Multiple Instance Learning for Whole Slide Image Classification. In Proceedings of the 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA, 16–22 June 2024; pp. 11248–11258. [Google Scholar]
- Makadiya, S.; Pavan Kumar Reddy, K.; Gubbi, J. SS-MIL: Attention-Based Selective Correlated Multiple Instance Learning for Whole Slide Image Classification. In Proceedings of the ECCV 2024 Workshops, Milan, Italy, 29 September–4 October 2024; Springer: Cham, Switzerland, 2025; pp. 239–251. [Google Scholar] [CrossRef]
- Liu, J.; Kong, D.; Huang, L.; Mao, D.; Xue, H. Multiple Instance Learning for Offensive Language Detection. In Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, Abu Dhabi, United Arab Emirates, 7–11 December 2022; Association for Computational Linguistics: Abu Dhabi, United Arab Emirates, 2022; pp. 7387–7396. [Google Scholar]
- Yang, R.; Ma, J.; Lin, H.; Gao, W. A weakly supervised propagation model for rumor verification and stance detection with multiple instance learning. In Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, 11–15 July 2022; Association for Computing Machinery: New York, NY, USA, 2022; pp. 1761–1772. [Google Scholar]
- Amores, J. Multiple instance classification: Review, taxonomy and comparative study. Artif. Intell. 2013, 201, 81–105. [Google Scholar] [CrossRef]
- Andrews, S.; Tsochantaridis, I.; Hofmann, T. Support vector machines for multiple-instance learning. Adv. Neural Inf. Process. Syst. 2002, 15, 577–584. [Google Scholar]
- Bunescu, R.C.; Mooney, R.J. Multiple Instance Learning for Sparse Positive Bags. In Proceedings of the 24th International Conference on Machine Learning (ICML), Corvalis, OR, USA, 20–24 June 2007; Association for Computing Machinery: New York, NY, USA, 2007; pp. 105–112. [Google Scholar] [CrossRef]
- Wu, J.; Yu, Y.; Huang, C.; Yu, K. Deep multiple instance learning for image classification and auto-annotation. In Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA, 7–12 June 2015; IEEE: New York, NY, USA, 2015; pp. 3460–3469. [Google Scholar] [CrossRef]
- Zhou, Z.H.; Sun, Y.Y.; Li, Y.F. Multi-Instance Learning by Treating Instances as Non-I.I.D. Samples. In Proceedings of the 26th Annual International Conference on Machine Learning (ICML), Montréal, QC, Canada, 14–18 June 2009; Association for Computing Machinery: New York, NY, USA, 2009; pp. 1249–1256. [Google Scholar] [CrossRef]
- Cheplygina, V.; Tax, D.M.; Loog, M. Multiple instance learning with bag dissimilarities. Pattern Recognit. 2015, 48, 264–275. [Google Scholar] [CrossRef]
- Chen, Y.; Bi, J.; Wang, J.Z. MILES: Multiple-instance learning via embedded instance selection. IEEE Trans. Pattern Anal. Mach. Intell. 2006, 28, 1931–1947. [Google Scholar] [CrossRef] [PubMed]
- Wei, X.S.; Wu, J.; Zhou, Z.H. Scalable algorithms for multi-instance learning. IEEE Trans. Neural Netw. Learn. Syst. 2016, 28, 975–987. [Google Scholar] [CrossRef] [PubMed]
- Wang, X.; Yan, Y.; Tang, P.; Bai, X.; Liu, W. Revisiting multiple instance neural networks. Pattern Recognit. 2018, 74, 15–24. [Google Scholar] [CrossRef]
- Wang, H.; Han, L.; Cai, W.; Shao, X. Chemometrics: A Vital Implement for Understanding the Water Structures by Near-Infrared Spectroscopy. J. Chemom. 2024, 38, e3631. [Google Scholar] [CrossRef]
- Song, W.; Wang, H.; Power, U.F.; Rahman, E.; Barabas, J.; Huang, J.; McLaughlin, J.; Nugent, C.; Maguire, P. Classification of Respiratory Syncytial Virus and Sendai Virus Using Portable Near-Infrared Spectroscopy and Chemometrics. IEEE Sens. J. 2023, 23, 9981–9989. [Google Scholar] [CrossRef]
- Wang, Y.; Ou, X.; He, H.J.; Kamruzzaman, M. Advancements, limitations and challenges in hyperspectral imaging for comprehensive assessment of wheat quality: An up-to-date review. Food Chem. X 2024, 21, 101235. [Google Scholar] [CrossRef]
- Lin, Z.; Jia, S.; Luo, G.; Dai, X.; Xu, B.; Wu, Z.; Shi, X.; Qiao, Y. Dealing with heterogeneous classification problem in the framework of multi-instance learning. Talanta 2015, 132, 175–181. [Google Scholar] [CrossRef]
- Caetano, V.F.; e Brito, L.R.; Rohwedder, J.J.R.; Pasquini, C.; Pimentel, M.F.; Vinhas, G.M. Determination of diethyleneglycol content and number of carboxylic end groups in poly (ethylene terephthalate) fibers using imaging and conventional near infrared spectroscopy. Polym. Test. 2016, 49, 15–21. [Google Scholar] [CrossRef]
- Liu, Y.; Zhou, S.; Han, W.; Liu, W.; Qiu, Z.; Li, C. Convolutional neural network for hyperspectral data analysis and effective wavelengths selection. Anal. Chim. Acta 2019, 1086, 46–54. [Google Scholar] [CrossRef]
- Qi, Y.; Zhang, Y.; Tang, S.; Zeng, Z. Synergizing Wood Science and Interpretable Artificial Intelligence: Detection and Classification of Wood Species Through Hyperspectral Imaging. Forests 2025, 16, 186. [Google Scholar] [CrossRef]
- Bhargava, A.; Sachdeva, A.; Sharma, K.; Alsharif, M.H.; Uthansakul, P.; Uthansakul, M. Hyperspectral imaging and its applications: A review. Heliyon 2024, 10, e33208. [Google Scholar] [CrossRef]
- Huang, S.; Liu, Z.; Jin, W.; Mu, Y. Bag dissimilarity regularized multi-instance learning. Pattern Recognit. 2022, 126, 108583. [Google Scholar] [CrossRef]
- Chapelle, O.; Haffner, P.; Vapnik, V.N. Support vector machines for histogram-based image classification. IEEE Trans. Neural Netw. 1999, 10, 1055–1064. [Google Scholar] [CrossRef] [PubMed]
- Foulds, J.R. Learning Instance Weights in Multi-Instance Learning. Ph.D. Thesis, The University of Waikato, Hamilton, New Zealand, 2008. [Google Scholar]
- Ilse, M.; Tomczak, J.; Welling, M. Attention-based deep multiple instance learning. In Proceedings of the International Conference on Machine Learning (ICML), Stockholm, Sweden, 10–15 July 2018; Proceedings of Machine Learning Research (PMLR): Cambridge, MA, USA, 2018; pp. 2127–2136. [Google Scholar]
- Traoré, M.; Kaal, J.; Martínez Cortizas, A. Application of FTIR spectroscopy to the characterization of archeological wood. Spectrochim. Acta Part A Mol. Biomol. Spectrosc. 2016, 153, 63–70. [Google Scholar] [CrossRef]
- Belkin, M.; Niyogi, P.; Sindhwani, V. Manifold regularization: A geometric framework for learning from labeled and unlabeled examples. J. Mach. Learn. Res. 2006, 7, 2399–2434. [Google Scholar]
- Belkin, M.; Niyogi, P. Laplacian Eigenmaps and Spectral Techniques for Embedding and Clustering. In Proceedings of the NIPS’01: 15th International Conference on Neural Information Processing Systems (NIPS), Vancouver, BC, Canada, 3–8 December 2001; MIT Press: Cambridge, MA, USA, 2001; pp. 585–591. [Google Scholar]
- Chung, F.R. Spectral Graph Theory; American Mathematical Soc.: Providence, RI, USA, 1997; Volume 92. [Google Scholar]
- Cai, D.; He, X.; Han, J.; Huang, T.S. Graph regularized nonnegative matrix factorization for data representation. IEEE Trans. Pattern Anal. Mach. Intell. 2010, 33, 1548–1560. [Google Scholar] [CrossRef]
- Melki, G.; Cano, A.; Ventura, S. MIRSVM: Multi-instance support vector machine with bag representatives. Pattern Recognit. 2018, 79, 228–241. [Google Scholar] [CrossRef]
- Johnston, I.; McCluskey, D.; Tan, C.; Tracey, M. Mechanical characterization of bulk Sylgard 184 for microfluidics and microengineering. J. Micromech. Microeng. 2014, 24, 035017. [Google Scholar]
- Raj M, K.; Chakraborty, S. PDMS microfluidics: A mini review. J. Appl. Polym. Sci. 2020, 137, 48958. [Google Scholar] [CrossRef]
- Borók, A.; Laboda, K.; Bonyár, A. PDMS bonding technologies for microfluidic applications: A review. Biosensors 2021, 11, 292. [Google Scholar] [CrossRef]
- Günay, M. Determination of dyeing levelness using surface irregularity function. Color Res. Appl. 2009, 34, 285–290. [Google Scholar] [CrossRef]











Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Huang, S.; Zou, Z. Leveraging Bag Dissimilarity Regularized Multi-Instance Learning for Analyzing Infrared Spectra of Heterogeneous Objects. AI Chem. 2026, 1, 6. https://doi.org/10.3390/aichem1020006
Huang S, Zou Z. Leveraging Bag Dissimilarity Regularized Multi-Instance Learning for Analyzing Infrared Spectra of Heterogeneous Objects. AI Chemistry. 2026; 1(2):6. https://doi.org/10.3390/aichem1020006
Chicago/Turabian StyleHuang, Shiluo, and Zheyu Zou. 2026. "Leveraging Bag Dissimilarity Regularized Multi-Instance Learning for Analyzing Infrared Spectra of Heterogeneous Objects" AI Chemistry 1, no. 2: 6. https://doi.org/10.3390/aichem1020006
APA StyleHuang, S., & Zou, Z. (2026). Leveraging Bag Dissimilarity Regularized Multi-Instance Learning for Analyzing Infrared Spectra of Heterogeneous Objects. AI Chemistry, 1(2), 6. https://doi.org/10.3390/aichem1020006

