Computing a homography matrix using arbitrary known geometric relationships

Question

I am using OpenCV for an optical measurement system. I need to carry out a perspective transformation between two images captured by a digital camera. In the field of view of the camera, I placed a set of markers (which lie in a common plane), which I use as the corresponding points in both images. Using the positions of the markers, I can calculate the homography matrix. The problem is that the measured object, whose images I actually want to transform, is located at a small distance from the markers and parallel to the plane of the markers. I can measure this distance.

My question is how to take this distance into account when calculating the homography matrix needed to complete the perspective transformation.

In my solution, a hard requirement is that the measured points of the object must not be used to calculate the homography (which is why I need separate markers in the field of view).

Please let me know if the description is inaccurate.

An exemplary image is shown in the figure.

The red rectangle is the measured object. It is physically located a short distance from the circular markers.

I capture images of an object from different camera positions. The measured object can be deformed between each acquisition. Using circular markers, I want to convert the image of the object to the same coordinates. I can measure the distance between the object and the markers, but I do not know how to change the homography matrix to work with the measured object (instead of markers).

opencv computer-vision homography

Marcin Oct 10 '11 at 10:01

1 answer

AldurDisciple · Answer 1 · 2014-04-04T09:29:03+0000

This question is quite old, but it is interesting and may be useful to someone.

Firstly, here is how I understood the problem presented in the question:

You have two images, I₁ and I₂, taken with the same digital camera from two different positions. Both images show a set of markers that all lie in a common plane p_m. There is also a measured object, whose visible surface lies in a plane p_o parallel to the marker plane, but at a slight offset. You have computed the homography H^m₁₂, which maps the marker positions in I₁ to the corresponding marker positions in I₂, and you have measured the offset d_mo between the planes p_o and p_m. From this, you would like to compute the homography H^o₁₂ that maps points of the measured object in I₁ to the corresponding points in I₂.

A few notes on this issue:

First, note that a homography is a relation between image points, whereas the distance between the marker plane and the object plane is a distance in world coordinates. Using the latter to infer something about the former requires a metric estimate of the camera poses, i.e. you need to determine the Euclidean, correctly scaled position and orientation of the camera for each of the two images. The Euclidean requirement implies that the digital camera must be calibrated, which should not be a problem for an “optical measurement system”. The scale requirement implies that the true 3D distance between two given 3D points must be known. For example, you need to know the true distance l₀ between two arbitrary markers.

Since we only need the relative camera pose between the two images, we can choose a 3D coordinate system centered on and aligned with the camera coordinate system of I₁. Hence, we denote the projection matrix for I₁ by P₁ = K₁ * [I | 0], where I is here the 3x3 identity matrix. We then denote the projection matrix for I₂ (in the same 3D coordinate system) by P₂ = K₂ * [R₂ | t₂]. We also denote by D₁ and D₂ the lens distortion coefficients for I₁ and I₂ respectively.

Since a single digital camera acquired both I₁ and I₂, you can assume that K₁ = K₂ = K and D₁ = D₂ = D. However, if I₁ and I₂ were acquired with a long delay between the acquisitions (or with a different zoom, etc.), it is more accurate to consider that two different camera matrices and two sets of distortion coefficients are involved.
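As a small illustration of this notation, the two projection matrices can be assembled with NumPy as below. This is only a sketch: the function names `projection_matrices` and `project` are made up for this example, and it assumes the intrinsics and the pose (R₂, t₂) are already known.

```python
import numpy as np

def projection_matrices(K1, K2, R2, t2):
    """Return P1 = K1 [I | 0] and P2 = K2 [R2 | t2] as 3x4 arrays."""
    # Camera 1 defines the 3D coordinate system: identity rotation, zero translation.
    P1 = K1 @ np.hstack([np.eye(3), np.zeros((3, 1))])
    # Camera 2 is expressed relative to camera 1.
    P2 = K2 @ np.hstack([R2, np.reshape(t2, (3, 1))])
    return P1, P2

def project(P, X):
    """Project a 3D point X (length 3) to undistorted pixel coordinates."""
    x = P @ np.append(X, 1.0)   # homogeneous image point
    return x[:2] / x[2]
```

Note that `project` ignores lens distortion, so it corresponds to the corrected (undistorted) images used in the rest of the answer.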

Here's how you might approach this problem:

The steps for estimating P₁ and P₂ are as follows:

1. Estimate K₁, K₂ and D₁, D₂ by calibrating the digital camera.
2. Use D₁ and D₂ to correct images I₁ and I₂ for lens distortion, then determine the marker positions in the corrected images.
3. Compute the fundamental matrix F₁₂ (mapping points in I₁ to epipolar lines in I₂) from the corresponding marker positions and infer the essential matrix E₁₂ = K₂^T * F₁₂ * K₁.
4. Infer R₂ and t₂ from E₁₂ and one point correspondence (see this answer to a related question). At this point, you have an estimate of the camera pose that is correct only up to an unknown scale, since t₂ has unit norm.
5. Use the measured distance l₀ between two arbitrary markers to infer the correct norm for t₂.
6. For better accuracy, you can refine P₁ and P₂ using a bundle adjustment, with K₁ and ||t₂|| fixed, based on the corresponding marker positions in I₁ and I₂.
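The essential-matrix conversion and the scale recovery from the known marker distance l₀ can be sketched in NumPy as follows. This is one possible way to do it, not necessarily what the answer's author had in mind: it triangulates two markers with the unit-norm translation using a basic linear (DLT) triangulation, then rescales so that their 3D distance equals l₀. All function names are hypothetical, and the marker positions are assumed to be undistorted pixel coordinates.

```python
import numpy as np

def essential_from_fundamental(F12, K1, K2):
    """E12 = K2^T * F12 * K1."""
    return K2.T @ F12 @ K1

def triangulate(P1, P2, x1, x2):
    """Linear (DLT) triangulation of one point seen at x1 in I1 and x2 in I2."""
    A = np.stack([
        x1[0] * P1[2] - P1[0],
        x1[1] * P1[2] - P1[1],
        x2[0] * P2[2] - P2[0],
        x2[1] * P2[2] - P2[1],
    ])
    _, _, Vt = np.linalg.svd(A)
    X = Vt[-1]                      # null vector of A (homogeneous 3D point)
    return X[:3] / X[3]

def metric_translation(K1, K2, R2, t2_unit, marker_a, marker_b, l0):
    """Rescale the unit-norm translation so two markers are l0 apart in 3D.

    marker_a and marker_b are (x1, x2) pairs: the pixel position of the
    marker in I1 and in I2.
    """
    P1 = K1 @ np.hstack([np.eye(3), np.zeros((3, 1))])
    P2 = K2 @ np.hstack([R2, np.reshape(t2_unit, (3, 1))])
    Xa = triangulate(P1, P2, *marker_a)
    Xb = triangulate(P1, P2, *marker_b)
    # With a unit baseline the whole reconstruction is off by one global factor.
    scale = l0 / np.linalg.norm(Xa - Xb)
    return scale * np.asarray(t2_unit)
```

With noisy detections it would probably be more robust to average the scale over several marker pairs rather than rely on a single pair.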

At this point, you have an accurate metric estimate of the camera poses P₁ = K₁ * [I | 0] and P₂ = K₂ * [R₂ | t₂]. Now the steps to estimate H^o₁₂:

1. Use D₁ and D₂ to correct images I₁ and I₂ for lens distortion, then determine the marker positions in the corrected images (this is the same as the second step of the pose estimation above, no need to redo it) and estimate H^m₁₂ from these corresponding positions.
2. Compute the 3x1 vector v describing the marker plane p_m by solving this linear equation: Z * H^m₁₂ = K₂ * (R₂ - t₂ * v^T) * K₁^-1 (see chapter 13 of HZ00, result 13.5 and equation 13.2 for reference), where Z is a scaling factor. Since v = n / d_m, infer the distance to the origin d_m = 1 / ||v|| and the normal n = v / ||v||, which describe the marker plane p_m in 3D.
3. Since the object plane p_o is parallel to p_m, they have the same normal n. Therefore, you can infer the distance to the origin d_o of p_o from the distance to the origin d_m of p_m and from the measured plane offset d_mo, as follows: d_o = d_m ± d_mo (the sign depends on the relative position of the planes: positive if p_m is closer to the camera for I₁ than p_o, negative otherwise).
4. From n and d_o, which describe the object plane in 3D, infer the homography H^o₁₂ = K₂ * (R₂ - t₂ * n^T / d_o) * K₁^-1 (see chapter 13 of HZ00, equation 13.2).

The homography H^o₁₂ maps points on the measured object in I₁ to the corresponding points in I₂, where both I₁ and I₂ are assumed to be corrected for lens distortion. If you need to map points from and to the original distorted images, be sure to use the distortion coefficients D₁ and D₂ to transform the input and output points of H^o₁₂.
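The plane recovery and the final homography can be sketched in NumPy as follows. This is an illustrative sketch under a few assumptions: the function names are invented for this example, R₂ and the metric t₂ are already available, and the plane vector is taken as v = n / d_m (which is what makes the equation for H^m₁₂ agree with the formula for H^o₁₂, so that d_m = 1 / ||v||). The equation Z * K₂^-1 * H^m₁₂ * K₁ + t₂ * v^T = R₂ is linear in the four unknowns (Z, v) and is solved here by least squares.

```python
import numpy as np

def plane_from_homography(H_m, K1, K2, R2, t2):
    """Solve Z * H_m = K2 (R2 - t2 v^T) K1^-1 for the scale Z and the vector v."""
    G = np.linalg.inv(K2) @ H_m @ K1
    # Unknowns x = (Z, v0, v1, v2); one equation per matrix entry (i, j):
    #   Z * G[i, j] + t2[i] * v[j] = R2[i, j]
    A = np.zeros((9, 4))
    A[:, 0] = G.reshape(9)
    for i in range(3):
        for j in range(3):
            A[3 * i + j, 1 + j] = t2[i]
    x, *_ = np.linalg.lstsq(A, R2.reshape(9), rcond=None)
    return x[0], x[1:]

def object_homography(v, d_mo, K1, K2, R2, t2, marker_plane_closer=True):
    """Homography induced by the object plane, parallel to the marker plane."""
    d_m = 1.0 / np.linalg.norm(v)   # distance of the marker plane to camera 1
    n = v * d_m                     # unit normal shared by both planes
    # d_o = d_m + d_mo if the markers are closer to camera 1 than the object.
    d_o = d_m + d_mo if marker_plane_closer else d_m - d_mo
    return K2 @ (R2 - np.outer(t2, n) / d_o) @ np.linalg.inv(K1)
```

A typical use would be `Z, v = plane_from_homography(H_m, K1, K2, R2, t2)` followed by `H_o = object_homography(v, d_mo, K1, K2, R2, t2)`. The result applies to undistorted pixel coordinates only, and the least-squares solve requires a non-zero t₂, i.e. the camera must actually have moved between the two images.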

Reference used:

[HZ00] "Multiple View Geometry in Computer Vision", R. Hartley and A. Zisserman, 2000.
