Next Article in Journal
Trunk Range of Motion Is Related to Axial Rigidity, Functional Mobility and Quality of Life in Parkinson’s Disease: An Exploratory Study
Next Article in Special Issue
Monocular Visual SLAM Based on a Cooperative UAV–Target System
Previous Article in Journal
Personal Guides: Heterogeneous Robots Sharing Personal Tours in Multi-Floor Environments
Open AccessArticle

Sorting Objects from a Conveyor Belt Using POMDPs with Multiple-Object Observations and Information-Gain Rewards

Department of Automation, Technical University of Cluj-Napoca, Str. George Bariţiu, Nr. 26-28, 400027 Cluj-Napoca, Romania
*
Author to whom correspondence should be addressed.
This paper is an extended version of our paper published in Mezei, A.; Tamás, L.; Busoniu, L. Sorting objects from a conveyor belt using active perception with a POMDP model. In Proceedings of the 18th European Control Conference, Naples, Italy, 25–28 June 2019; pp. 2466–2471.
Sensors 2020, 20(9), 2481; https://doi.org/10.3390/s20092481
Received: 4 March 2020 / Revised: 14 April 2020 / Accepted: 22 April 2020 / Published: 27 April 2020
(This article belongs to the Special Issue Smart Sensors for Robotic Systems)
We consider a robot that must sort objects transported by a conveyor belt into different classes. Multiple observations must be performed before taking a decision on the class of each object, because the imperfect sensing sometimes detects the incorrect object class. The objective is to sort the sequence of objects in a minimal number of observation and decision steps. We describe this task in the framework of partially observable Markov decision processes, and we propose a reward function that explicitly takes into account the information gain of the viewpoint selection actions applied. The DESPOT algorithm is applied to solve the problem, automatically obtaining a sequence of observation viewpoints and class decision actions. Observations are made either only for the object on the first position of the conveyor belt or for multiple adjacent positions at once. The performance of the single- and multiple-position variants is compared, and the impact of including the information gain is analyzed. Real-life experiments with a Baxter robot and an industrial conveyor belt are provided. View Full-Text
Keywords: robotics; active perception; POMDP; information-gain rewards robotics; active perception; POMDP; information-gain rewards
Show Figures

Figure 1

MDPI and ACS Style

Mezei, A.-D.; Tamás, L.; Buşoniu, L. Sorting Objects from a Conveyor Belt Using POMDPs with Multiple-Object Observations and Information-Gain Rewards. Sensors 2020, 20, 2481.

Show more citation formats Show less citations formats
Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.

Article Access Map by Country/Region

1
Search more from Scilit
 
Search
Back to TopTop