Point to Point Iterative Closest Point¶
Given two point clouds, \(P\) and \(Q\):
The goal is to find the optimal transformation \((R, t)\) that aligns the source point cloud \(P\) to the reference point cloud \(Q\) by minimizing the following error function :
where \(E\) is the loss function, \(P\) is the set of source points, and \(Q\) is the set of reference points. The objective of ICP is to determine the best transformation \((R, t)\) that aligns point cloud \(P\) to point cloud \(Q\).
Closed-form Solution¶
Point to point ICP can solved by using Singular Value Decomposition. First, the centroids (center of mass) of point clouds \(P\) and \(Q\) are defined as:
Let \(Q'\) and \(P'\) be the sets of points in \(Q\) and \(P\), respectively, with their centroids subtracted. This step removes any translation effects by shifting both point clouds to a common origin:
Define the cross-covariance matrix \(W\) as follows:
The cross-covariance matrix \(W\) provides information about how changes in coordinates in the \(p\) points correlate with changes in the \(q\) points. An ideal cross-covariance matrix would be an identity matrix, indicating no correlation between the x- and y-axes.
Translation¶
The translation vector \(t\) aligns the centroids of \(P\) and \(Q\) and can be derived by minimizing \(E\) with respect to \(t\):
Rotation¶
The optimal rotation matrix \(R\) can be derived by focusing on minimizing the rotational alignment error when \(t = 0\). This minimization yields:
Expanding this norm gives:
Where \(p_i^TR^Tq_i = \left ( p_i^TR^Tq_i\right)^T= q_i^TRp_i\), then
Since \(p_i^T p_i\) and \(q_i^T q_i\) do not depend on \(R\), we simplify the objective to:
Using the matrix trace property \(tr(AB)=tr(BA)\), and \(PQ^T=W\), we can rewrite the objective as:
Where \(W\) is a cross-covariance matrix, then perform the Singular Value Decomposition (SVD) of \(W\):
$$ SVD(W) = U \Sigma V^T $$ Subtitute \(U \Sigma V^T\) to \(tr(RW)\)
Define an orthogonal matrix \(M =V^TRU\). Therefore,
By applying the Cauchy-Schwarz inequality :
Since \(M\) is an orthogonal matrix, \(m_{ii}^2 = |m_{ii}|\leq1\). Therefore, to obtain the maximum value, \(m_{ii}\) bernilai 1. should be 1. As explained earlier \(M\) is an orthogonal matrix, so \(m_{ii}\) equals 1 if \(M = I\)
References¶
- Sorkine-Hornung, O., & Rabinovich, M. (2017). Least-squares rigid motion using svd. Computing, 1(1), 1-5. https://igl.ethz.ch/projects/ARAP/svd_rot.pdf
- Arun, K. S., Huang, T. S., & Blostein, S. D. (1987). Least-Squares fitting of two 3-D point sets. IEEE Transactions on Pattern Analysis and Machine Intelligence, PAMI-9(5), 698–700. https://doi.org/10.1109/tpami.1987.4767965