Direct computation of shape cues using scale-adapted spatial derivative operators

Jonas Gårding and Tony Lindeberg

International Journal of Computer Vision, vol. 17(2), pp. 163--191, February 1996.


This paper addresses the problem of computing cues to the three-dimensional structure of surfaces in the world directly from the local structure of the brightness pattern of either a single monocular image or a binocular image pair.

It is shown that starting from Gaussian derivatives of order up to two at a range of scales in scale-space, local estimates of (i) surface orientatnadafrom monocular texture foreshortening, (ii) surface orientatnadafrom monocular texture gradients, and (iii) surface orientation from the binocular disparity gradient can be computed without iteration or search, and by using essentially the same basic mechanism.

The methodology is based on a multi-scale descriptor of image structure called the windowed second moment matrix, which is computed with adaptive selection of both scale levels and spatial positions. Notably, this descriptor comprises two scale parameters; a local scale parameter describing the amount of smoothing used in derivative computations, and an integration scale parameter determining over how large a region in space the statistics of regional descriptors is accumulated.

Experimental results for both synthetic and natural images are presented, and the relation with models of biological vision is briefly discussed.

Full paper: (PDF 959kb)

Responsible for this page: Jonas Gårding Tony Lindeberg