In image analysis, the generalized structure tensor (GST) is an extension of the Cartesian structure tensor to curvilinear coordinates. It is mainly used to detect and to represent the "direction" parameters of curves, just as the Cartesian structure tensor detects and represents the direction in Cartesian coordinates. Curve families generated by pairs of locally orthogonal functions have been the best studied.
Contents
- GST in 2D and locally orthogonal bases
- Basic concept for its use in image processing and computer vision
- Physical and mathemical interpretation
- Miscelenous
- References
It is a widely known method in applications of image and video processing including computer vision, such as biometric identification by fingerprints, and studies of human tissue sections.
GST in 2D and locally orthogonal bases
Let the term image represent a function
1. The "lines" are ordinary lines in the curvilinear coordinate basis
which are curves in Cartesian coordinates as depicted by the equation above. The error is measured in the
2. The functions
Accordingly, such curvilinear coordinates
Then GST consists in
where
Thereby, Cartesian Structure tensor is a special case of GST where
Basic concept for its use in image processing and computer vision
Efficient detection of
Logarithmic spirals, including circles, can for instance be detected by (complex) convolutions and non-linear mappings. The spirals can be in gray (valued) images or in a binary image, i.e. locations of edge elements of the concerned patterns, such as contours of circles or spirals, must not be known or marked otherwise.
Generalized structure tensor can be used as an alternative to Hough transform in image processing and computer vision to detect patterns whose local orientations can be modelled, for example junction points. The main differences comprise:
Physical and mathemical interpretation
The curvilinear coordinates of GST can explain physical processes applied to images. A well known pair of processes consist in rotation, and zooming. These are related to the coordinate transformation
If an image
Zooming (comprising unzooming) operation is modeled similarly. If the image has iso-curves that look like a "star" or bicycle spokes, i.e.
In combination,
is invariant to a certain amount of rotation combined with scaling, where the amount is precised by the parameter
Analogously, the Cartesian structure tensor is a representation of a translation too. Here the physical process consists in an ordinary translation of a certain amount along
where the amount is specified by the parameter
Generally, the estimated
Miscelenous
Image in the context of the GST means both an ordinary image and an image neighborhood therein (local image), the context determining. For example, a photograph as well as any neighborhood of it are images.