Abstract
Suppose we take a set of images obtained from videophotography of a moving object in three dimensions, such as an aeroplane, and that we compute moments or fourier descriptors, or some other set of smooth features of the resulting image, to get a vector in a feature space F-n which describes the image. Then the different orientations and positions of the object in space are generated by a local Lie group, a neighbourhood of the identity in the group SE(3), and provided we compute enough moments or other descriptors, i.e. provided n is big enough, and provided the object is not symmetric, the result is to give a smooth injection of this group of transformations into the feature space. The value of n which is 'big enough' is almost always 2d + 1, where d is the dimension of the group, which for rigid objects under translation and rotation is six. This result has applications to object recognition in video images and to the problem of interpolating between different views of an object, as naive interpolations in the Feature space give erroneous results. It extends to moving objects with more degrees of freedom such as robots. In this paper we state and prove the above result formally and illustrate it with synthetic images of a 'robot'.
Original language | English |
---|---|
Pages (from-to) | 73-84 |
Journal | Journal of Mathematical Imaging and Vision |
Volume | 26 |
Issue number | 1-2 |
DOIs | |
Publication status | Published - 2006 |