STEREO CAMERA CALIBRATION

WHY NOT CALIBRATE BOTH CAMERAS WITH ZHANG

Calibrating a stereo camera system requires more than the $PPM$ of each camera: the rigid motion between the two cameras is also needed. A first approach could be to calibrate each camera independently with Zhang's method and derive the rigid motion from the two sets of extrinsic parameters, but this approach has one major flaw: it is not robust to noise.

BETTER SOLUTION: GUESSING

A more robust solution is to make an initial guess of $R$ and $T$ from pictures of the same planar pattern, taken with both cameras while the pattern is in the same position, and then refine the guess by a non-linear minimization of the reprojection error.

The initial guess is obtained as the median of the $R_{i}$, $T_{i}$ computed for each image pair by chaining the transformations estimated for the two cameras, $G_{i} (G_{j}^{-1})$.
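The chaining step can be sketched as follows. This is a minimal sketch under stated assumptions, not the method of any specific library: each camera's extrinsics are given per view as a pair `(R, T)` mapping pattern coordinates to camera coordinates, the median is taken element-wise, and the median rotation is projected back onto a valid rotation matrix with an SVD. The function names `relative_motion` and `initial_guess` are hypothetical.

```python
import numpy as np

def relative_motion(R_L, T_L, R_R, T_R):
    """Chain G_R * G_L^{-1}: maps left-camera coordinates to right-camera coordinates."""
    R = R_R @ R_L.T
    T = T_R - R @ T_L
    return R, T

def initial_guess(extrinsics_L, extrinsics_R):
    """Element-wise median of the per-view motions (assumption: one (R, T) pair per view)."""
    motions = [relative_motion(R_L, T_L, R_R, T_R)
               for (R_L, T_L), (R_R, T_R) in zip(extrinsics_L, extrinsics_R)]
    R_med = np.median([R for R, _ in motions], axis=0)
    T_med = np.median([T for _, T in motions], axis=0)
    # The element-wise median of rotations is not a rotation in general:
    # project it onto the closest rotation matrix via SVD.
    U, _, Vt = np.linalg.svd(R_med)
    R0 = U @ np.diag([1.0, 1.0, np.linalg.det(U @ Vt)]) @ Vt
    return R0, T_med
```

With noise-free extrinsics every view yields the same motion, so the median simply returns it; with noisy views the median discards outlying estimates before the refinement step.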

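Then the guess is refined with the Levenberg-Marquardt algorithm, minimizing the reprojection error summed over both cameras ($k$), all $n$ images ($i$) and all $m$ pattern points ($w_{j}$):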

$$ \sum_{k \in \{L, R\}} \sum_{i = 1}^{n} \sum_{j = 1}^{m} \| m_{i, j}^{k} - \tilde{m} (A_{L}, A_{R}, K_{L}, K_{R}, R, T, w_{j}) \|^{2} $$
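The cost being minimized can be sketched as below. This is an illustrative sketch only: the $K_{L}$, $K_{R}$ terms are left out (they are assumed here to be lens-distortion parameters), the per-view pose of the pattern with respect to the left camera is passed explicitly as `poses_L`, and the names `project` and `stereo_reprojection_error` are hypothetical. A Levenberg-Marquardt solver would minimize this quantity over its parameters.

```python
import numpy as np

def project(A, R, T, w):
    """Pinhole projection of 3D points w (N x 3) to pixels (N x 2), no lens distortion."""
    cam = w @ R.T + T            # points expressed in the camera frame
    hom = cam @ A.T              # homogeneous pixel coordinates
    return hom[:, :2] / hom[:, 2:3]

def stereo_reprojection_error(A_L, A_R, R, T, poses_L, w, m_L, m_R):
    """Sum of squared pixel residuals over both cameras, all views, all pattern points."""
    total = 0.0
    for (R_i, T_i), obs_L, obs_R in zip(poses_L, m_L, m_R):
        # The left camera sees the pattern through its own extrinsics for view i.
        total += np.sum((obs_L - project(A_L, R_i, T_i, w)) ** 2)
        # The right extrinsics are the left ones chained with the stereo motion (R, T).
        total += np.sum((obs_R - project(A_R, R @ R_i, R @ T_i + T, w)) ** 2)
    return total
```

Expressing the right camera's pose through $R$, $T$ is what ties the two cameras together: a single rigid motion has to explain every image pair at once.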

Then, for convenience, one of the two camera reference frames (CRF) is chosen as the stereo reference frame (SRF); the $PPM$ of the other camera is then obtained from the rigid motion $R, T$ between the two cameras. Taking the left camera as the reference:

$$ \tilde{P}_{L} = A_{L} [I \mid 0] \Rightarrow \tilde{P}_{R} = A_{R} [R \mid T] $$

RECTIFICATION

To make the search for corresponding points easier, the two images need to be perfectly aligned. This is impossible to achieve with mechanical alignment, so the images are rectified instead. This is done by virtually rotating the calibrated cameras (i.e. redefining the $PPMs$) about their optical centers, which amounts to warping each image through a homography.

CONSTRUCTING THE $A$ MATRIX

To define the new $PPMs$, a matrix $A_{new}$ is chosen arbitrarily (e.g. the mean of $A_{R}$ and $A_{L}$); the same $A_{new}$ is used for both cameras.

CONSTRUCTING THE $R$ MATRIX

Then a new rotation matrix $R_{new}$ needs to be defined. Its first vector is chosen parallel to the baseline vector $B = C_{R} - C_{L}$, which in the stereo reference frame becomes

$$ B = - R^{T} T = [B_{x}, B_{y}, B_{z}]^{T} $$

so the new $X$ axis is $r_{1} = \frac{B}{\| B \|}$.

The new $Y$ axis is taken orthogonal to the new $X$ axis and to an arbitrary vector $k$, which can be the old $Z$ axis:

$$ k = [0, 0, 1]^{T} \Rightarrow r_{2} = \frac{k \land r_{1}}{\| k \land r_{1} \|} = \frac{[- B_{y}, B_{x}, 0]^{T}}{\sqrt{B_{x}^{2} + B_{y}^{2}}} $$

Finally, the new $Z$ axis is perpendicular to the first two vectors:

$$ r_{3} = r_{1} \land r_{2} $$
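The construction of the three axes can be sketched in a few lines. This is a minimal sketch assuming the convention $X_{R} = R X_{L} + T$ with the left camera as stereo reference frame; the cross product is taken in the order $k \land r_{1}$ so that the new $Y$ axis points along $[-B_{y}, B_{x}, 0]^{T}$. The function name `rectifying_rotation` is hypothetical.

```python
import numpy as np

def rectifying_rotation(R, T):
    """Build R_new (rows r1, r2, r3) from the stereo motion X_R = R X_L + T."""
    B = -R.T @ T                     # baseline C_R - C_L in the stereo reference frame
    r1 = B / np.linalg.norm(B)       # new X axis, parallel to the baseline
    k = np.array([0.0, 0.0, 1.0])    # old Z axis
    r2 = np.cross(k, r1)             # new Y axis, orthogonal to r1 and k
    r2 /= np.linalg.norm(r2)
    r3 = np.cross(r1, r2)            # new Z axis completes the right-handed frame
    return np.vstack([r1, r2, r3])
```

For a purely horizontal displacement, e.g. `R = np.eye(3)` and `T = [-0.1, 0, 0]`, the baseline already lies on the old $X$ axis and the function returns the identity, i.e. no rotation is needed.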

With $R_{new} = [r_{1}, r_{2}, r_{3}]^{T}$, the new $PPMs$ become:

$$ \tilde{P}_{L}' = A_{new} [R_{new} \mid 0] $$

$$ \tilde{P}_{R}' = A_{new} [R_{new} \mid - R_{new} C_{R}] $$
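As a sanity check on these two matrices, the sketch below builds them and projects a 3D point with each. It assumes the left optical center is the origin of the stereo reference frame and that `R_new` has been built with its first row parallel to `C_R`; the helper names `rectified_ppms` and `project` are hypothetical.

```python
import numpy as np

def rectified_ppms(A_new, R_new, C_R):
    """Return the 3x4 rectified PPMs; the left optical center is the origin."""
    P_L = A_new @ np.hstack([R_new, np.zeros((3, 1))])
    P_R = A_new @ np.hstack([R_new, (-R_new @ C_R).reshape(3, 1)])
    return P_L, P_R

def project(P, M):
    """Project a 3D point M with a 3x4 PPM and return pixel coordinates (u, v)."""
    m = P @ np.append(M, 1.0)
    return m[:2] / m[2]
```

Because both cameras share $A_{new}$ and $R_{new}$ and differ only by a shift along the new $X$ axis, the two projections of any 3D point should land on the same image row, which is the alignment that rectification is meant to provide.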

RECTIFICATION HOMOGRAPHIES

Both images go through a rotation and a change of intrinsic parameters, so each rectified image is related to the original one through a homography.

So for the left camera:

$$ \begin{cases} \tilde{m}_{L} = A_{L} [I \mid 0] \tilde{M} \\ \tilde{m}_{L}' = A_{new} [R_{new} \mid 0] \tilde{M} \end{cases} \Rightarrow \tilde{m}_{L} = A_{L} R_{new}^{-1} A_{new}^{-1} \tilde{m}_{L}' $$

$$ H_{L} = A_{L} R_{new}^{-1} A_{new}^{-1} $$

For the right image it is convenient to move the origin of the WRF to the optical center of the right camera:

$$ \begin{cases} \tilde{m}_{R} = A_{R} [R \mid 0] \tilde{M}_{R} \\ \tilde{m}_{R}' = A_{new} [R_{new} \mid 0] \tilde{M}_{R} \end{cases} \Rightarrow \tilde{m}_{R} = A_{R} R R_{new}^{-1} A_{new}^{-1} \tilde{m}_{R}' $$

$$ H_{R} = A_{R} R R_{new}^{-1} A_{new}^{-1} $$
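The two homographies can be computed directly from the calibration results. The sketch below is illustrative, with hypothetical function names; it uses the fact that the inverse of a rotation matrix is its transpose, and it treats the equalities above as holding up to scale, so the result is divided by its third component.

```python
import numpy as np

def rectification_homographies(A_L, A_R, R, A_new, R_new):
    """H_L and H_R map rectified pixels to original pixels (up to scale)."""
    back = R_new.T @ np.linalg.inv(A_new)   # R_new^{-1} A_new^{-1}
    H_L = A_L @ back
    H_R = A_R @ R @ back
    return H_L, H_R

def apply_homography(H, uv):
    """Apply H to a pixel (u, v) and return the dehomogenized result."""
    m = H @ np.array([uv[0], uv[1], 1.0])
    return m[:2] / m[2]
```

Since $H_{L}$ and $H_{R}$ go from the rectified image to the original one, a rectified image can be filled by looping over its pixels and sampling the original image at `apply_homography(H, (u, v))`, typically with interpolation.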

GETTING BACK TO 3D COORDINATES

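With a calibrated stereo system, the depth and therefore the 3D coordinates of a point can be estimated. Start from the relation between a 3D point $P = [x, y, z]^{T}$ and its homogeneous image point $p^{*}$:

$$ p^{*} = \begin{bmatrix} a_{u} x + u_{0} z \\ a_{v} y + v_{0} z \\ z \end{bmatrix} = A P = \begin{bmatrix} a_{u} & 0 & u_{0} \\ 0 & a_{v} & v_{0} \\ 0 & 0 & 1 \end{bmatrix} \begin{bmatrix} x \\ y \\ z \end{bmatrix} $$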

The coordinates can be retrieved by inverting this relation:

$$ P = A^{-1} p^{*} $$

Multiplying and dividing by $z$:

$$ P = z A^{-1} \frac{p^{*}}{z} $$

But $\frac{p^{*}}{z}$ is the vector of the image coordinates:

$$ \frac{p^{*}}{z} = \begin{bmatrix} a_{u} \frac{x}{z} + u_{0} \\ a_{v} \frac{y}{z} + v_{0} \\ 1 \end{bmatrix} = \begin{bmatrix} u \\ v \\ 1 \end{bmatrix} $$

and given the $A^{- 1}$ matrix

$$ A^{-1} = \begin{bmatrix} \frac{1}{a_{u}} & 0 & - \frac{u_{0}}{a_{u}} \\ 0 & \frac{1}{a_{v}} & - \frac{v_{0}}{a_{v}} \\ 0 & 0 & 1 \end{bmatrix} $$

The 3D coordinates can then be computed from the image point $p = [u, v, 1]^{T}$ and the depth $z$ as follows:

$$ P = z A^{-1} p $$
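Written out component by component, $z A^{-1} p$ gives $x = z (u - u_{0}) / a_{u}$ and $y = z (v - v_{0}) / a_{v}$. A minimal sketch, assuming the depth $z$ of the pixel is already known (e.g. from the stereo match) and using a hypothetical function name:

```python
import numpy as np

def back_project(a_u, a_v, u_0, v_0, u, v, z):
    """Recover P = z * A^{-1} * [u, v, 1]^T for a pixel (u, v) with known depth z."""
    x = z * (u - u_0) / a_u
    y = z * (v - v_0) / a_v
    return np.array([x, y, z])
```

A pixel at the principal point, `(u, v) = (u_0, v_0)`, back-projects to a point on the optical axis, `[0, 0, z]`.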

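It is now also possible to compute the image point $p_{2}$ of the same 3D point as seen by another camera: recover the 3D coordinates, then apply the rotation and translation that map the first camera frame to the second. The result is homogeneous, so it must be divided by its third component to obtain pixel coordinates.

$$ p_{2} = A \, T_{1 \to 2} (z A^{-1} p_{1}) \quad \text{with} \quad T_{1 \to 2} (P_{1}) = R P_{1} + T $$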

The same holds between two cameras with different intrinsic parameters $A_{1}$ and $A_{2}$:

$$ p_{2} = A_{2} \, T_{1 \to 2} (z A_{1}^{-1} p_{1}) \quad \text{with} \quad T_{1 \to 2} (P_{1}) = R P_{1} + T $$
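The whole chain (back-project, move to the second camera frame, re-project) can be sketched as a single function. This is an illustrative sketch with a hypothetical name, assuming the depth $z$ of the pixel in the first camera is known; the final division turns the homogeneous result into pixel coordinates.

```python
import numpy as np

def transfer_point(A_1, A_2, R, T, p_1, z):
    """Map pixel p_1 = (u, v) at depth z in camera 1 to its pixel in camera 2."""
    p_hom = np.array([p_1[0], p_1[1], 1.0])
    P_1 = z * np.linalg.solve(A_1, p_hom)   # 3D point in the camera-1 frame
    P_2 = R @ P_1 + T                       # rigid motion T_{1->2}
    p_2 = A_2 @ P_2                         # homogeneous pixel in camera 2
    return p_2[:2] / p_2[2]
```

Passing the same matrix for `A_1` and `A_2` covers the case of two cameras with identical intrinsic parameters.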
