Citizendia
Your Ad Here

A diagram of a pinhole camera.
A diagram of a pinhole camera. A' pinhole camera' is a very simple Camera with no lens and a single very small Aperture.

The pinhole camera model describes the mathematical relationship between the coordinates of a 3D point and its projection onto the image plane of an ideal pinhole camera, where the camera aperture is described as a point and no lenses are used to focus light. A' pinhole camera' is a very simple Camera with no lens and a single very small Aperture. The model does not include, for example, geometric distortions or blurring of unfocused objects caused by lenses and finite sized apertures. It also does not take into account that most practical cameras have only discrete image coordinates. This means that the pinhole camera model can only be used as a first order approximation of the mapping from a 3D scene to a 2D image. Its validity depends on the quality of the camera and, in general, decreases from the center of the image to the edges as lens distortion effects increase.

Some of the effects that the pinhole camera model does not take into account can be compensated for, for example by applying suitable coordinate transformations on the image coordinates, and others effects are sufficiently small to be neglected if a high quality camera is used. This means that the pinhole camera model often can be used as a reasonable description of how a camera depicts a 3D scene, for example in computer vision and computer graphics. Computer vision is the science and technology of machines that see Computer graphics are Graphics created by Computers and more generally the Representation and Manipulation of Pictorial Data

Contents

The geometry and mathematics of the pinhole camera

The geometry of a pinhole camera
The geometry of a pinhole camera

The geometry related to the mapping of a pinhole camera is illustrated in the figure. Geometry ( Greek γεωμετρία; geo = earth metria = measure is a part of Mathematics concerned with questions of size shape and relative position The figure contains the following basic objects

The pinhole aperture of the camera, through which all projection lines must pass, is assumed to be infinitely small, a point. In the literature this point in 3D space is referred to as the optical (or lens or camera) center, focus, or camera focal point.

Next we want to understand how the coordinates (y1,y2) of point Q depend on the coordinates (x1,x2,x3) of point P. This can be done with the help of the following figure which shows the same scene as the previous figure but now from above, looking down in the negative direction of the X2 axis.

The geometry of a pinhole camera as seen from the X2 axis
The geometry of a pinhole camera as seen from the X2 axis

In this figure we see two similar triangles, both having parts of the projection line (green) as their hypotenuses. Geometry Two geometrical objects are called similar if one is congruent to the result of a uniform scaling (enlarging or shrinking of the other A hypotenuse is the longest side of a Right triangle, the side opposite of the Right angle. The catheti of the left triangle are y1 and f and the catheti of the right triangle are x1 and x3. This page is about the geometric meaning For the plant see Phyllanthus. Since the two triangles are similar it follows that

 \frac{-y_1}{f} = \frac{x_1}{x_3} or  y_1 = -\frac{f \, x_1}{x_3}

A similar investigation, looking in the negative direction of the X1 axis gives

 \frac{-y_2}{f} = \frac{x_2}{x_3} or  y_2 = -\frac{f \, x_2}{x_3}

This can be summarized as

 \begin{pmatrix} y_1 \\ y_2 \end{pmatrix} = -\frac{f}{x_3} \begin{pmatrix} x_1 \\ x_2 \end{pmatrix}

which is an expression that describes the relation between the 3D coordinates (x1,x2,x3) of point P and its image coordinates (y1,y2) given by point Q in the image plane.

Before continuing, it should be noted that the mapping from 3D to 2D coordinates described by a pinhole camera is a perspective projection followed by a 180^{\circ} rotation in the image plane. Perspective (from Latin perspicere to see through in the graphic arts such as drawing is an approximate representation on a flat surface (such as paper of an image as it is perceived This corresponds to how a real pinhole camera operates; the resulting image is rotated 180^{\circ} and the relative size of projected objects depends on their distance to the focal point and the overall size of the image depends on the distance f between the image plane and the focal point. In order to produce an unrotated image, which is what we expect from a camera, there are two possibilities:

In both cases the resulting mapping from 3D coordinates to 2D image coordinates is given by

 \begin{pmatrix} y_1 \\ y_2 \end{pmatrix} = \frac{f}{x_3} \begin{pmatrix} x_1 \\ x_2 \end{pmatrix}

(same as before except no minus sign)

Projective representation

Main article: camera matrix

The mapping from 3D coordinates of points in space to 2D image coordinates can also be represented in homogeneous coordinates. In Computer vision a camera matrix or (camera projection matrix is a 3 \times 4 Matrix which describes the mapping of a Pinhole camera In Mathematics, homogeneous coordinates, introduced by August Ferdinand Möbius in his 1827 work Der barycentrische Calcul, allow Affine transformations Let  \mathbf{x} be a representation of a 3D point in homogeneous coordinates (a 4-dimensional vector), and let  \mathbf{y} be a representation of the image of this point in the pinhole camera (a 3-dimensional vector). In Mathematics, homogeneous coordinates, introduced by August Ferdinand Möbius in his 1827 work Der barycentrische Calcul, allow Affine transformations Then the following relation holds

 \mathbf{y} \sim \mathbf{C} \, \mathbf{x}

where  \mathbf{C} is the  3 \times 4 camera matrix and the \, \sim sign implies that the left and right hand sides are equal up to a non-zero scalar multiplication. In Computer vision a camera matrix or (camera projection matrix is a 3 \times 4 Matrix which describes the mapping of a Pinhole camera A consequence of this relation is that  \mathbf{C} can be seen as an element of a projective space; two camera matrices are equivalent if they are equal up to a scalar multiplication. In Mathematics a projective space is a set of elements constructed from a vector space such that a distinct element of the projective space consists of all non-zero vectors which This description of the pinhole camera mapping, as a linear transformation  \mathbf{C} instead of as a fraction of two linear expressions, makes it possible to simplifiy many derivations of relations between 3D and 2D coordinates.

See also

References


© 2009 citizendia.org; parts available under the terms of GNU Free Documentation License, from http://en.wikipedia.org
Dapyx Software network: MP3 Explorer | Ebook Manager | Zenithic