- Article
Multimodal Food Image Classification with Large Language Models
- Jun-Hwa Kim,
- Nam-Ho Kim,
- Donghyeok Jo and
- Chee Sun Won
In this study, we leverage advancements in large language models (LLMs) for fine-grained food image classification. We achieve this by integrating textual features extracted from images using an LLM into a multimodal learning framework. Specifically,...