Food Image Segmentation Using Multi-Modal Imaging Sensors with Color and Thermal Data
Abstract
1. Introduction
- Two or more foods placed in the same container.
- Foods that are of the same texture and/or color.
- Food combinations that mimic real-world scenarios.
- (1) Lack of variability in image attributes, such as color, intensity, and texture, might reduce the performance of these methods.
- (2) The number of clusters is not known in advance, so an unsupervised clustering scheme may not produce an optimal number of segments.
2. Methods
2.1. Equipment
2.2. Data
2.3. Camera Calibration
2.4. Image Registration
2.5. K-Means Clustering-Based Image Segmentation
1. Randomly choose k initial centers C = {c1, …, ck}.
2. For each i ∈ {1, …, k}, set the cluster Ci to be the set of points in X that are closer to ci than to cj for all j ≠ i.
3. For each i ∈ {1, …, k}, set ci to be the center of mass of all points in Ci.
4. Repeat Steps 2 and 3 until C no longer changes.
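
The four steps above can be sketched in a few lines of NumPy. This is a minimal illustration, not the authors' implementation: the function name `kmeans`, the `max_iter` cap, the fixed `seed`, and the handling of empty clusters (the old center is kept) are all assumptions made for the example.

```python
import numpy as np

def kmeans(X, k, max_iter=100, seed=0):
    """Lloyd-style k-means on an (n_samples, n_features) array (illustrative sketch)."""
    rng = np.random.default_rng(seed)
    X = np.asarray(X, dtype=float)
    # Step 1: pick k distinct data points as the initial centers.
    centers = X[rng.choice(len(X), size=k, replace=False)]
    for _ in range(max_iter):
        # Step 2: assign each point to its nearest center.
        dists = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        # Step 3: move each center to the mean of its assigned points
        # (assumption: an empty cluster keeps its previous center).
        new_centers = np.array([
            X[labels == i].mean(axis=0) if np.any(labels == i) else centers[i]
            for i in range(k)
        ])
        # Step 4: stop once the centers no longer change.
        if np.allclose(new_centers, centers):
            break
        centers = new_centers
    return centers, labels
```

For image segmentation, `X` would typically hold one row per pixel (for example, an H×W×3 color image reshaped to (H·W, 3)), and `labels.reshape(H, W)` would give the segment map.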
- Run the clustering algorithm (e.g., k-means clustering) for different values of k. Here, we vary k from 2 to 10 clusters.
- For each value of k, calculate the total within-cluster sum of squares (WSS).
- Plot the curve of WSS according to the number of clusters k.
- The location of the bend (knee) in the plot is generally taken as an indicator of the appropriate number of clusters; this criterion was used for all cases in the thermal data. When the plot had no distinct bend, a tangent was drawn to the curve starting from the last point (k = 10), and the point of tangency with the lowest k value was chosen automatically, because the WSS tends to saturate at this point.
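
The elbow procedure above can be illustrated as follows. This is a hedged sketch: `within_cluster_ss` computes the WSS for one clustering, and `knee_by_chord` locates the bend with a common heuristic (the point farthest from the straight line joining the first and last points of the curve). That heuristic is swapped in for illustration and is not the tangent-based rule described in the text; both function names are hypothetical.

```python
import numpy as np

def within_cluster_ss(X, labels, centers):
    """Total within-cluster sum of squares (WSS) for one clustering."""
    X = np.asarray(X, dtype=float)
    labels = np.asarray(labels)
    centers = np.asarray(centers, dtype=float)
    # Squared distance from every point to the center of its own cluster.
    return float(((X - centers[labels]) ** 2).sum())

def knee_by_chord(ks, wss):
    """Return the k whose (k, WSS) point lies farthest from the chord
    joining the first and last points of the curve (assumed heuristic)."""
    ks = np.asarray(ks, dtype=float)
    wss = np.asarray(wss, dtype=float)
    # Rescale both axes to [0, 1] so neither dominates the distance.
    x = (ks - ks[0]) / (ks[-1] - ks[0])
    y = (wss - wss.min()) / (wss.max() - wss.min())
    # Perpendicular distance to the line through the two end points.
    dx, dy = x[-1] - x[0], y[-1] - y[0]
    dist = np.abs(dy * (x - x[0]) - dx * (y - y[0])) / np.hypot(dx, dy)
    return int(ks[dist.argmax()])
```

In use, one would cluster the data once per k from 2 to 10 with any k-means implementation, collect the nine WSS values, and pass them to `knee_by_chord`. A flat WSS curve (all values equal) is not handled by this sketch.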
2.6. Segmentation Evaluation
3. Results
4. Discussion
- RGB data may perform poorly when the foods have the same color, texture, and/or intensities.
- Thermal data may perform poorly when foods are at room temperature as the food blends into the background.
- Thermal data perform better when hot and cold combinations are present.
5. Conclusions
Supplementary Materials
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Appendix A



| Combination | Hot Food (>30 °C) | Warm Food (15–29 °C) | Cold Food (<15 °C) |
|---|---|---|---|
| C1 (Initial, 0 s) | French Fries | Potato Chips | - |
| C1 (Final, 150 s) | - | French Fries and Potato Chips | - |
| C2 | Steamed Veggies | - | Fruits (Tropical) |
| C3 | French Fries | Corn Chips | - |
| C4 | Steamed Broccoli | Salad | - |
| C5 | Steamed Brussels Sprouts | - | Fruits (Grapes) |
| C6 | Steamed Brussels Sprouts | - | Salad |

| Combination | Food Item | F1 Score (Dice Coefficient): RGB-T | F1 Score (Dice Coefficient): RGB | F1 Score (Dice Coefficient): Thermal |
|---|---|---|---|---|
| C1 (Mean) | French Fries | 0.859 | 0.754 | 0.743 |
| | Potato Chips | 0.549 | 0.544 | ~ |
| C2 | Veggies | 0.922 | 0.785 | 0.886 |
| | Fruits | 0.936 | 0.945 | 0.879 |
| C3 | French Fries | 0.924 | 0.479 | 0.882 |
| | Corn Chips | 0.92 | 0.531 | ~ |
| C4 | Broccoli | 0.925 | 0.659 | 0.886 |
| | Salad | 0.766 | 0.659 | ~ |
| C5 | Brussels Sprouts | 0.904 | 0.551 | 0.857 |
| | Grapes | 0.893 | 0.619 | 0.809 |
| C6 | Brussels Sprouts | 0.934 | 0.732 | 0.928 |
| | Salad | 0.864 | 0.693 | 0.845 |
| IOU (Mean ± Std. Dev.) | | 0.87 ± 0.11 | 0.66 ± 0.13 | 0.64 ± 0.39 |

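
The two metrics reported in the table above, the F1 score (Dice coefficient) and the intersection over union (IoU), can be computed from a predicted mask and a ground-truth mask as sketched below. This is an illustrative example under stated assumptions: masks are binary arrays of equal shape, the function name `dice_and_iou` is hypothetical, and the convention of returning 1.0 when both masks are empty is a choice made here, not something taken from the paper.

```python
import numpy as np

def dice_and_iou(pred_mask, true_mask):
    """Dice coefficient (F1 score) and IoU for two binary masks of equal shape."""
    pred = np.asarray(pred_mask, dtype=bool)
    true = np.asarray(true_mask, dtype=bool)
    inter = np.logical_and(pred, true).sum()   # pixels in both masks
    union = np.logical_or(pred, true).sum()    # pixels in either mask
    total = pred.sum() + true.sum()            # |pred| + |true|
    # Assumption: two empty masks count as a perfect match.
    dice = 2.0 * inter / total if total else 1.0
    iou = inter / union if union else 1.0
    return float(dice), float(iou)
```

For a single pair of masks the two quantities are related by Dice = 2·IoU / (1 + IoU), so a Dice score is never lower than the corresponding IoU.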
| Time (s) | Food Item | Thermal + RGB | RGB | Thermal |
|---|---|---|---|---|
| 0 | French Fries | 0.89 | 0.752 | 0.848 |
| | Potato Chips | 0.552 | 0.553 | ~ |
| 15 | French Fries | 0.867 | 0.717 | 0.791 |
| | Potato Chips | 0.554 | 0.557 | ~ |
| 30 | French Fries | 0.875 | 0.719 | 0.818 |
| | Potato Chips | 0.543 | 0.522 | ~ |
| 45 | French Fries | 0.878 | 0.75 | 0.783 |
| | Potato Chips | 0.526 | 0.56 | ~ |
| 60 | French Fries | 0.833 | 0.737 | 0.748 |
| | Potato Chips | 0.55 | 0.542 | ~ |
| 75 | French Fries | 0.858 | 0.788 | 0.711 |
| | Potato Chips | 0.555 | 0.522 | ~ |
| 90 | French Fries | 0.863 | 0.764 | 0.733 |
| | Potato Chips | 0.567 | 0.563 | ~ |
| 105 | French Fries | 0.843 | 0.771 | 0.697 |
| | Potato Chips | 0.568 | 0.567 | ~ |
| 120 | French Fries | 0.85 | 0.771 | 0.716 |
| | Potato Chips | 0.529 | 0.53 | ~ |
| 135 | French Fries | 0.847 | 0.77 | 0.724 |
| | Potato Chips | 0.54 | 0.517 | ~ |
| 150 | French Fries | 0.844 | 0.758 | 0.603 |
| | Potato Chips | 0.552 | 0.546 | ~ |

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Raju, V.B.; Imtiaz, M.H.; Sazonov, E. Food Image Segmentation Using Multi-Modal Imaging Sensors with Color and Thermal Data. Sensors 2023, 23, 560. https://doi.org/10.3390/s23020560