Citizendia
Your Ad Here

Feature detection

Output of a typical corner detection algorithm
Edge detection
Canny
Canny-Deriche
Differential
Sobel
Interest point detection
Corner detection
Harris operator
Shi and Tomasi
Level curve curvature
SUSAN
FAST
Blob detection
Laplacian of Gaussian (LoG)
Difference of Gaussians (DoG)
Determinant of Hessian (DoH)
Maximally stable extremal regions
Ridge detection
Affine invariant feature detection
Affine shape adaptation
Harris affine
Hessian affine
Feature description
SIFT
SURF
GLOH
LESH
Scale-space
Scale-space axioms
Implementation details
Pyramids
This box: view  talk  edit

In the area of computer vision, 'blob detection' refers to visual modules that are aimed at detecting points and/or regions in the image that are either brighter or darker than the surrounding. In Computer vision and Image processing the concept of feature detection refers to methods that aim at computing abstractions of image information and making Edge detection is a terminology in Image processing and Computer vision, particularly in the areas of feature detection and Feature extraction The Canny Edge detection operator was developed by John F Canny in 1986 and uses a multi-stage Algorithm to detect a wide range of edges in images The Canny Edge detection operator was developed by John F Canny in 1986 and uses a multi-stage Algorithm to detect a wide range of edges in images Edge detection is a terminology in Image processing and Computer vision, particularly in the areas of feature detection and Feature extraction The Sobel operator is used in Image processing, particularly within Edge detection algorithms Interest point detection is a recent terminology in Computer vision that refers to the detection of interest points for subsequent processing Corner detection or the more general terminology Interest point detection is an approach used Corner detection or the more general terminology Interest point detection is an approach used Corner detection or the more general terminology Interest point detection is an approach used Corner detection or the more general terminology Interest point detection is an approach used Corner detection or the more general terminology Interest point detection is an approach used Corner detection or the more general terminology Interest point detection is an approach used In the area of Computer vision, ' blob detection' refers to visual modules that are aimed at detecting points and/or regions in the image that are either brighter or darker In Computer vision, Difference of Gaussians is a Grayscale image enhancement algorithm that involves the subtraction of one blurred version of an original grayscale In the area of Computer vision, ' blob detection' refers to visual modules that are aimed at detecting points and/or regions in the image that are either brighter or darker In Computer vision, maximally stable extremal regions ( MSER) are used as a method of Blob detection in images The ridges (or the ridge set) of a smooth function of two variables is a set of curves whose points are loosely speaking local maxima in at least one dimension Affine shape adaptation is a methodology for iteratively adapting the shape of the smoothing kernels in an affine group of smoothing kernels to the local image structure in neighbourhood In the fields of Computer vision and Image analysis, the Harris-affine region detector belongs to the category of Feature detection. The Hessian-Affine region detector is a feature detector used in the fields of Computer vision and Image analysis. Scale-invariant feature transform (or SIFT) is an algorithm in Computer vision to detect and describe local features in images SURF (Speeded Up Robust Features is a robust image descriptor that can be used in computer vision tasks GLOH ( Gradient Location and Orientation Histogram) is a robust image descriptor that can be used in Computer vision tasks LESH (Local Energy based Shape Histogram is a recently proposed image descriptor in computer vision Scale-space theory is a framework for multi-scale signal representation developed by the Computer vision, Image processing and In Image processing and Computer vision, a Scale-space framework can be used to represent an image as a family of gradually smoothed images The linear Scale space representation of an N-dimensional continuous signal f_C(x_1 x_2 \dots x_N t is obtained by convolving f_C Pyramid or ' pyramid representation' is a type of multi-scale signal representation developed by the Computer vision, Image processing Computer vision is the science and technology of machines that see There are two main classes of blob detectors (i) differential methods based on derivative expressions and (ii) methods based on local extrema in the intensity landscape. With the more recent terminology used in the field, these operators can also be referred to as interest point operators, or alternatively interest region operators (see also interest point detection and corner detection). Interest point detection is a recent terminology in Computer vision that refers to the detection of interest points for subsequent processing Corner detection or the more general terminology Interest point detection is an approach used

There are several motivations for studying and developing blob detectors. One main reason is to provide complementary information about regions, which is not obtained from edge detectors or corner detectors. Edge detection is a terminology in Image processing and Computer vision, particularly in the areas of feature detection and Feature extraction Corner detection or the more general terminology Interest point detection is an approach used In early work in the area, blob detection was used to obtain regions of interest for further processing. These regions could signal the presence of objects or parts of objects in the image domain with application to object recognition and/or object tracking. Object recognition in Computer vision is a task of finding given object in an image or video sequence Video tracking is the process of locating a moving object (or several ones in time using a camera In other domains, such as histogram analysis, blob descriptors can also be used for peak detection with application to segmentation. In Computer vision, segmentation refers to the process of partitioning a Digital image into multiple Regions ( sets of Pixels. Another common use of blob descriptors is as main primitives for texture analysis and texture recognition. Texture refers to the properties held and sensations caused by the external surface of objects received through the sense of touch. In more recent work, blob descriptors have found increasingly popular use as interest points for wide baseline stereo matching and to signal the presence of informative image features for appearance-based object recognition based on local image statistics. Interest point detection is a recent terminology in Computer vision that refers to the detection of interest points for subsequent processing In Computer vision, sets of Data acquired by sampling the same scene or object at different times or from different perspectives will be in different coordinate systems There is also the related notion of ridge detection to signal the presence of elongated objects. The ridges (or the ridge set) of a smooth function of two variables is a set of curves whose points are loosely speaking local maxima in at least one dimension

Contents

The Laplacian of Gaussian

One of the first and also most common blob detectors is based on the Laplacian of the Gaussian (LoG). In Mathematics and Physics, the Laplace operator or Laplacian, denoted by \Delta\  or \nabla^2  and named after Given an input image f(x,y), this image is convolved by a Gaussian kernel

g(x, y, t) = \frac {1}{2{\pi t}}e^{-(x^2+y^2)/(2t)}

at a certain scale t to give a scale-space representation L(x, y, t)\ = g(x, y, t) * f(x, y). Scale-space theory is a framework for multi-scale signal representation developed by the Computer vision, Image processing and Then, the Laplacian operator

\nabla^2 L = L_{xx} + L_{yy}

is computed, which usually results in strong positive responses for dark blobs of extent \sqrt{t} and strong negative responses for bright blobs of similar size. In Mathematics and Physics, the Laplace operator or Laplacian, denoted by \Delta\  or \nabla^2  and named after A main problem when applying this operator at a single scale, however, is that the operator response is strongly dependent on the relationship between the size of the blob structures in the image domain and the size of the Gaussian kernel used for pre-smoothing. In order to automatically capture blobs of different (unknown) size in the image domain, a multi-scale approach is therefore necessary.

A straightforward way to obtain a multi-scale blob detector with automatic scale selection is to consider the scale-normalized Laplacian operator

\nabla^2_{norm} L(x, y; t) = t(L_{xx} + L_{yy})

and to detect scale-space maxima/minima, that are points that are simultaneously local maxima/minima of \nabla^2_{norm} L with respect to both space and scale (Lindeberg 1998). Thus, given a discrete two-dimensional input image f(x,y) a three-dimensional discrete scale-space volume L(x,y,t) is computed and a point is regarded as a bright (dark) blob if the value at this point is greater (smaller) than the value in all its 26 neighbours. Thus, simultaneous selection of interest points (\hat{x}, \hat{y}) and scales \hat{t} is performed according to

(\hat{x}, \hat{y}; \hat{t}) = \operatorname{argmaxminlocal}_{(x, y; t)}(\nabla^2_{norm} L(x, y; t)).

Note that this notion of blob provides a concise and mathematically precise operational definition of the notion of "blob", which directly leads to an efficient and robust algorithm for blob detection. Some basic properties of blobs defined from scale-space maxima of the normalized Laplacian operator are that the responses are covariant with translations, rotations and rescalings in the image domain. Thus, if a scale-space maximum is assumed at a point (x0,y0;t0) then under a rescaling of the image by a scale factor s, there will be a scale-space maximum at (sx0,sy0;s2t0) in the rescaled image (Lindeberg 1998). This in practice highly useful property implies that besides the specific topic of Laplacian blob detection, local maxima/minima of the scale-normalized Laplacian are also used for scale selection in other contexts, such as in corner detection, scale-adaptive feature tracking (Bretzner and Lindeberg 1998), in the scale-invariant feature transform (Lowe 2004) as well as other image descriptors for image matching and object recognition. Corner detection or the more general terminology Interest point detection is an approach used Scale-invariant feature transform (or SIFT) is an algorithm in Computer vision to detect and describe local features in images Object recognition in Computer vision is a task of finding given object in an image or video sequence

The difference of Gaussians approach

From the fact that the scale-space representation L(x,y,t) satisfies the diffusion equation

\partial_t L = \frac{1}{2} \nabla^2 L

it follows that the Laplacian of the Gaussian operator \nabla^2 L(x, y, t) can also be computed as the limit case of the difference between two Gaussian smoothed images (scale-space representations)

\nabla^2 L(x, y; t) = \frac{1}{2 \Delta t} \left( L(x, y; t+\Delta t) - L(x, y; t-\Delta t) \right). Scale-space theory is a framework for multi-scale signal representation developed by the Computer vision, Image processing and The diffusion equation is a Partial differential equation which describes density fluctuations in a material undergoing Diffusion.

In the computer vision literature, this approach is referred to as the Difference of Gaussians (DoG) approach. In Computer vision, Difference of Gaussians is a Grayscale image enhancement algorithm that involves the subtraction of one blurred version of an original grayscale Besides minor technicalities, however, this operator is in essence similar to the Laplacian and can be seen as an approximation of the Laplacian operator. In Mathematics and Physics, the Laplace operator or Laplacian, denoted by \Delta\  or \nabla^2  and named after In a similar fashion as for the Laplacian blob detector, blobs can be detected from scale-space extrema of differences of Gaussians.

The determinant of the Hessian

By considering the scale-normalized determinant of the Hessian, also referred to as the Monge–Ampère operator,

\operatorname{det} H L(x, y; t) = t^2 (L_{xx} L_{yy} - L_{xy}^2)

where HL denotes the Hessian matrix of L and then detecting scale-space maxima/minima of this operator one obtains another straightforward differential blob detector with automatic scale selection which also responds to saddles (Lindeberg 1998)

(\hat{x}, \hat{y}; \hat{t}) = \operatorname{argmaxminlocal}_{(x, y; t)}(\operatorname{det} H L(x, y; t)). In Mathematics, a (real Monge–Ampère equation is a nonlinear second order Partial differential equation of special kind In Mathematics, the Hessian matrix is the Square matrix of second-order Partial derivatives of a function.

The blob points (\hat{x}, \hat{y}) and scales \hat{t} are also defined from an operational differential geometric definitions that leads to blob descriptors that are covariant with translations, rotations and rescalings in the image domain. In terms of scale selection, blobs defined from scale-space extrema of the determinant of the Hessian (DoH) also have slightly better scale selection properties under non-Euclidean affine transformations than the more commonly used Laplacian operator (Lindeberg 1998). In simplified form, the determinant of the Hessian computed from Haar wavelets is used as the basic interest point operator in the SURF descriptor (Bay et al 2006) for image matching and object recognition. The Haar wavelet is the first known Wavelet and was proposed in 1909 by Alfréd Haar. SURF (Speeded Up Robust Features is a robust image descriptor that can be used in computer vision tasks

The hybrid Laplacian and determinant of the Hessian operator (Hessian-Laplace)

A hybrid operator between the Laplacian and the determinant of the Hessian blob detectors has also been proposed, where spatial selection is done by the determinant of the Hessian and scale selection is performed with the scale-normalized Laplacian (Mikolajczyk and Schmid 2004)

(\hat{x}, \hat{y}) = \operatorname{argmaxminlocal}_{(x, y)}(\operatorname{det} H L(x, y; t)),
\hat{t} = \operatorname{argmaxminlocal}_{t}(\nabla^2_{norm} L(\hat{x}, \hat{y}; t)). This operator has been used for image matching, object recognition as well as texture analysis.

Affine-adapted differential blob detectors

The blob descriptors obtained from these blob detectors with automatic scale selection are invariant to translations, rotations and uniform rescalings in the spatial domain. The images that constitute the input to a computer vision system are, however, also subject to perspective distortions. To obtain blob descriptors that are more robust to perspective transformations, a natural approach is to devise a blob detector that is invariant to affine transformations. In practice, affine invariant interest points can be obtained by applying affine shape adaptation to a blob descriptor, where the shape of the smoothing kernel is iteratively warped to match the local image structure around the blob, or equivalently a local image patch is iteratively warped while the shape of the smoothing kernel remains rotationally symmetric (Lindeberg and Garding 1997; Baumberg 2000; Mikolajczyk and Schmid 2004). Affine shape adaptation is a methodology for iteratively adapting the shape of the smoothing kernels in an affine group of smoothing kernels to the local image structure in neighbourhood In this way, we can define affine-adapted versions of the Laplacian/Difference of Gaussian operator, the determinant of the Hessian and the Hessian-Laplace operator (see also Harris-Affine and Hessian-Affine). In the fields of Computer vision and Image analysis, the Harris-affine region detector belongs to the category of Feature detection. The Hessian-Affine region detector is a feature detector used in the fields of Computer vision and Image analysis.

Grey-level blobs, grey-level blob trees and scale-space blobs

A natural approach to detect blobs is to associate a bright (dark) blob with each local maximum (minimum) in the intensity landscape. A main problem with such an approach, however, is that local extrema are very sensitive to noise. To address this problem, Lindeberg (1993) studied the problem of detecting local maxima with extent at multiple scales in scale-space. Scale-space theory is a framework for multi-scale signal representation developed by the Computer vision, Image processing and A region with spatial extent defined from a watershed analogy was associated with each local maximum, as well a local contrast defined from a so-called delimiting saddle point. A local extremum with extent defined in this way was referred to as a grey-level blob. Moreover, by proceeding with the watershed analogy beyond the delimiting saddle point, a grey-level blob tree was defined to capture the nested topological structure of level sets in the intensity landscape, in a way that is invariant to affine deformations in the image domain and monotone intensity transformations. By studying how these structures evolve with increasing scales, the notion of scale-space blobs was introduced. Beyond local contrast and extent, these scale-space blobs also measured how stable image structures are in scale-space, by measuring their scale-space lifetime.

It was proposed that regions of interest and scale descriptors obtained in this way, with associated scale levels defined from the scales at which normalized measures of blob strength assumed their maxima over scales could be used for guiding other early visual processing. An early prototype of simplified vision systems was developed where such regions of interest and scale descriptors were used for directing the focus-of-attention of an active vision system. While the specific technique that was used in these prototypes can be substantially improved with the current knowledge in computer vision, the overall general approach is still valid, for example in the way that that local extrema over scales of the scale-normalized Laplacian operator are nowadays used for providing scale information to other visual processes.

Maximally stable extremum regions (MSER)

Matas et al (2002) were interested in defining image descriptors that are robust under perspective transformations. In Computer vision, maximally stable extremal regions ( MSER) are used as a method of Blob detection in images They studied level sets in the intensity landscape and measured how stable these were along the intensity dimension. Based on this idea, they defined a notion of maximally stable extremum regions and showed how these image descriptors can be used as image features for stereo matching.

Interestingly, there are close relations between this notion and the abovementioned notion of grey-level blob tree. The maximally stable extremum regions can be seen as making a specific subset of the grey-level blob tree explicit for further processing.

See also

References


© 2009 citizendia.org; parts available under the terms of GNU Free Documentation License, from http://en.wikipedia.org
Dapyx Software network: MP3 Explorer | Ebook Manager | Zenithic