Problem Formulation

The goal of the proposed tracking system is to track the 3D poses of a set of rigid bodies
from RGB-D data streams, as obtained by Kinect, as accurate as possible:
where represents an affine 3D transformation which maps the 3D pose of an object between two frames and
is the likelihood of
and
given
and
.
and
are the RGB and the depth information, respectively, delivered by the sensor. The tracked objects are defined as 3D point clusters, or Point Distribution Models (PDMs),
, where
is the total number of tracked clusters. The pose of each cluster model
is given at every frame by its corresponding relative affine transformation
. The position of the clusters is related to their centroids.
Methodology
The tracking is initialized through tabletop object segmentation, which calculates the 3D reference model, or cluster, of the object of interest. Once
is known, its shape is projected into the 2D image, where a classifier for tracking is trained on the resulted projection. Inside the tracking loop, the classifier establishes 2D correspondences between the consecutive frames
and
, where
represents the discrete time. These 2D image matches are used to select the 3D point correspondences
between the corresponding point clouds
and
in the Kinect data. From
, an initial coarse transform
for the reference cluster is calculated. Although, depending on the quality of the
matches,
can, to some extent, provide good tracking results, it fails to precisely map
onto the current cloud
. For this reason, a second transform
is determined using an Iterative Closest Point (ICP) algorithm applied on non-occluded object points. Thus, the final object model transform
, which tracks the pose of a 3D object between consecutive frames, can be written as:
References
S.M. Grigorescu, D. Pangercic and M. Beetz "2D-3D Collaborative Tracking (23CT): Towards Stable Robotic Manipulation", Proceedings of the 2012 IEEE-RSJ International Conference on Intelligent RObots and Systems IROS, Workshop on Active Semantic Perception, Vilamoura, Algarve, Portugal, October 7-12, 2012.
S.M. Grigorescu and C. Pozna "Towards a Stable Robotic Object Manipulation through 2D-3D Features Tracking", International Journal of Advanced Robotic Systems, InTech, vol. 10, no. 200, pp. 1-8, 2013.
Latex Bibtex Citation
@inproceedings{grigorescu2d3d,
author = {Grigorescu, Sorin M and Pangercic, Dejan and Beetz, Michael},
title = {2D--3D Collaborative tracking (23CT): towards stable robotic manipulation},
booktitle = {IEEE-RSJ International Conference on Intelligent Robots and Systems (IROS), Workshop on Active Semantic Perception},
year = {2019},
organization = {Citeseer},
}