You are currently viewing a new version of our website. To view the old version click .
Engineering Proceedings
  • Abstract
  • Open Access

24 August 2022

Tracking Multiple Instances of Retail Consumers from RGB and Thermal Images †

,
and
Department of Computer Science and Engineering, University of Ioannina, 45110 Ioannina, Greece
*
Author to whom correspondence should be addressed.
Presented at the 9th International Symposium on Sensor Science, Warsaw, Poland, 20–22 June 2022.
This article belongs to the Proceedings The 9th International Symposium on Sensor Science

Abstract

Visual tracking of multiple objects is a challenging task that has gained considerable attention recently. The process involves several steps: detection, data association, and re-identification, each with its own set of challenges. Trending methods usually combine some of these steps and report impressive results in tracking benchmarks while running at practical speeds. In this work, we develop an online, real-time multi-object tracking approach that is also able to perform consumer tracking in indoor retail stores, without further tuning its design philosophy. Our model can utilize multi-modal inputs, such as RGB and thermal images when available, to improve its performance while maintaining acceptable speeds at 20FPS. Our key contribution to motion forecasting replaces the standard Kalman filters with an LSTM network to properly model long-term dependencies and target tracking. Numerical experiments of our method on the widely used MOT16 benchmark demonstrate its effectiveness. Additionally, qualitative results in in-house retail image data confirm the method’s capacity for practical multi-consumer tracking. Our proposed system is part of a research project aiming to capture retail consumers’ preferences, as a function of their relative position with respect to particular areas of a store that have been mapped according to the shelves, which are visible for each view area of the installed RGB and thermal cameras. Additionally, the system relies on accurate estimation of consumer age and gender using only body images in a multi-task deep learning framework in order to further taxonomize consumer preferences into age and gender groups. By these means, useful consumer meta-data can be extracted, ready to be employed for marketing campaign customization, which might improve sales and the consumer experience altogether.

Author Contributions

Conceptualization, A.P.G. and C.N.; methodology, I.-I.P.; software, I.-I.P.; validation, A.P.G., I.-I.P. and C.N.; formal analysis, A.P.G. and I.-I.P.; investigation, A.P.G. and I.-I.P.; resources, I.-I.P.; data curation, I.-I.P.; writing—original draft preparation, A.P.G. and I.-I.P.; writing—review and editing, A.P.G.; visualization, A.P.G. and I.-I.P.; supervision, A.P.G.; project administration, C.N.; funding acquisition, C.N. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the [European Regional Development Fund of the European Union and Greek national funds—Operational Program Competitiveness, Entrepreneurship and Innovation—call RESEARCH–CREATE–INNOVATE] grant number [T1EDK-04517].

Institutional Review Board Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Article Metrics

Citations

Article Access Statistics

Multiple requests from the same IP address are counted as one view.