Polar decomposition

In mathematics, particularly in linear algebra and functional analysis, the polar decomposition of a matrix or linear operator is a factorization analogous to the polar form of a nonzero complex number z as $z=re^{i\theta }\,$

where r is the absolute value of z (a positive real number), and $e^{i\theta }$ is an element of the circle group.

Matrix polar decomposition[edit]

The polar decomposition of a square complex matrix A is a matrix decomposition of the form

A=UP

where U is a unitary matrix and P is a positive-semidefinite Hermitian matrix.^[1] Intuitively, the polar decomposition separates A into a component that stretches the space along a set of orthogonal axes, represented by P, and a rotation (with possible reflection) represented by U. The decomposition of the complex conjugate of $A$ is given by ${\overline {A}}={\overline {U}}{\overline {P}}.$

This decomposition always exists; and so long as A is invertible, it is unique, with P positive-definite. Note that

\det A=\det U\det P=re^{i\theta }

gives the corresponding polar decomposition of the determinant of A, since $\det P=r=|\det A|$ and $\det U=e^{i\theta }$ . In particular, if $A$ has determinant 1 then both $U$ and $P$ have determinant 1.

The matrix P is always unique, even if A is singular, and given by

P=\left(A^{*}A\right)^{\frac {1}{2}},

where A^* denotes the conjugate transpose of A. This expression is ensured to be well-defined, since $A^{*}A$ is a positive-semidefinite Hermitian matrix, and therefore has a unique positive-semidefinite Hermitian square root.^[2] If A is invertible, then the matrix U is uniquely determined by

U=AP^{-1}.

Moreover, if $A$ is invertible, then $P$ is strictly positive definite, and thus has a unique self-adjoint logarithm. Every invertible matrix $A$ can therefore be written uniquely in the form

A=Ue^{X},

where $U$ is unitary and $X$ is self-adjoint.^[3] This decomposition is useful in computing the fundamental group of (matrix) Lie groups.^[4]

In terms of the singular value decomposition of A, A = WΣV^*, one has

{\begin{aligned}P&=V\Sigma V^{*}\\U&=WV^{*}\end{aligned}}

confirming that P is positive-definite and U is unitary. Thus, the existence of the SVD is equivalent to the existence of polar decomposition.

One can also decompose A in the form

A=P'U

Here U is the same as before and P′ is given by

P'=UPU^{-1}=\left(AA^{*}\right)^{\frac {1}{2}}=W\Sigma W^{*}.

This is known as the left polar decomposition, whereas the previous decomposition is known as the right polar decomposition. Left polar decomposition is also known as reverse polar decomposition.

The matrix A is normal if and only if P′ = P. Then UΣ = ΣU, and it is possible to diagonalise U with a unitary similarity matrix S that commutes with Σ, giving SUS^* = Φ⁻¹, where Φ is a diagonal unitary matrix of phases e^iφ. Putting Q = VS^*, one can then re-write the polar decomposition as

A=\left(Q\Phi Q^{*}\right)\left(Q\Sigma Q^{*}\right),\,

so A then thus also has a spectral decomposition

A=Q\Lambda Q^{*}

with complex eigenvalues such that $\Lambda \Lambda ^{*}=\Sigma ^{2}$ and a unitary matrix of complex eigenvectors Q.

The polar decomposition of a square invertible real matrix A is of the form

A=|A|R

where $|A|=\left(AA^{\textsf {T}}\right)^{\frac {1}{2}}$ is a positive-definite matrix and $R=|A|^{-1}A$ is an orthogonal matrix.

Construction and proofs of existence[edit]

The core idea behind the construction of the polar decomposition is similar to that used to compute the singular-value decomposition.

For any matrix $A$ , the matrix $A^{*}A$ is hermitian and positive semi-definite, and therefore unitarily equivalent to a positive semi-definite diagonal matrix. Let then $V$ be the unitary matrix such that $A^{*}A=VDV^{*}$ , with $D$ diagonal and positive semi-definite.

Case of $A$ normal[edit]

If $A$ is normal, then it is unitarily equivalent to a diagonal matrix: $A=V\Lambda V^{*}$ for some unitary $V$ and some diagonal matrix $\Lambda$ .

The polar decomposition is in this case obtained by writing

V^{*}AV=\Phi _{\Lambda }|\Lambda |,

where $|\Lambda |$ is the diagonal matrix with the absolute values of the elements of $\Lambda$ , and $\Phi _{\Lambda }$ is a diagonal matrix with containing the phases of the elements of $\Lambda$ . In other words,

\left(\Phi _{\Lambda }|\Lambda |\right)_{jk}=\delta _{jk}e^{i\phi _{k}}\left|\lambda _{k}\right|=\delta _{jk}\lambda _{k}.

When $\lambda _{k}=0$ , the corresponding phase can be chosen arbitrarily.

Going back into the original basis, we obtain the polar decomposition of $A$ :

A=V\Phi _{\Lambda }|\Lambda |V^{*}=\underbrace {\left(V\Phi _{\Lambda }V^{*}\right)} _{U}\underbrace {\left(V|\Lambda |V^{*}\right)} _{P}.

Case of $A$ invertible[edit]

Using for example the singular-value decomposition, it can be readily shown that a matrix $A$ is invertible if and only if $A^{*}A$ (equivalently, $AA^{*}$ ) is. Moreover, this is true if and only if the eigenvalues of $A^{*}A$ are all not zero^[5].

In this case, the polar decomposition is directly obtained by writing

A=A\left(A^{*}A\right)^{-{\frac {1}{2}}}\left(A^{*}A\right)^{\frac {1}{2}},

and observing that $A\left(A^{*}A\right)^{-{\frac {1}{2}}}$ is unitary. To see this, we can exploit the spectral decomposition of $A^{*}A$ to write $A\left(A^{*}A\right)^{-{\frac {1}{2}}}=AVD^{-{\frac {1}{2}}}V^{*}$ .

In this expression, $V^{*}$ is unitary because $V$ is. To show that also $AVD^{-{\frac {1}{2}}}$ is unitary, we can use the SVD to write $A=WD^{\frac {1}{2}}V^{*}$ , so that

AVD^{-{\frac {1}{2}}}=WD^{\frac {1}{2}}V^{*}VD^{-{\frac {1}{2}}}=W,

where again $W$ is unitary by construction.

Yet another way to directly show the unitarity of $A\left(A^{*}A\right)^{-{\frac {1}{2}}}$ is to note that, writing the SVD of $A$ in terms of rank-1 matrices as $A=\sum _{k}s_{k}v_{k}w_{k}^{*}$ , where $s_{k}$ are the singular values of $A$ , we have

A\left(A^{*}A\right)^{-{\frac {1}{2}}}=\left(\sum _{j}\lambda _{j}v_{j}w_{j}^{*}\right)\left(\sum _{k}|\lambda _{k}|^{-1}w_{k}w_{k}^{*}\right)=\sum _{k}{\frac {\lambda _{k}}{|\lambda _{k}|}}v_{k}w_{k}^{*},

which directly implies the unitarity of

A\left(A^{*}A\right)^{-{\frac {1}{2}}}

because a matrix is unitary if and only if its singular values have unitary absolute value.

Note how, from the above construction, it follows that the unitary matrix in the polar decomposition of an invertible matrix is uniquely defined.

General case[edit]

The above argument crucially relies on the existence of $\left(A^{*}A\right)^{-{\frac {1}{2}}}$ , and therefore on $A^{*}A$ being invertible. Indeed, in the general case, $AVD^{-{\frac {1}{2}}}$ is not generally well-defined, due to the possibility of $D$ having vanishing eigenvalues.

Let us denote with $V_{1}$ the (in general not square) matrix whose columns are the eigenvectors of $A^{*}A$ corresponding to non-vanishing eigenvalues, with $D_{1}$ the diagonal matrix containing the associated non-zero eigenvalues, and with $V_{2}$ the matrix with the remaining eigenvectors of $A^{*}A$ . We can then write the spectral decomposition of $A^{*}A$ as:

A^{*}A={\begin{bmatrix}V_{1}&V_{2}\end{bmatrix}}{\begin{bmatrix}D_{1}&0\\0&0\end{bmatrix}}{\begin{bmatrix}V_{1}^{*}\\V_{2}^{*}\end{bmatrix}}=V_{1}D_{1}V_{1}^{*}.

Note that, similarly to the invertible case, $AV_{1}D_{1}^{-{\frac {1}{2}}}$ is well-defined and its columns are orthonormal, although it is not in general square and therefore unitary.

We now define

U'=\left[AV_{1}D_{1}^{-{\frac {1}{2}}},\Phi \right],

where $\Phi$ is a matrix whose columns are chosen so that $U'$ is unitary. This is done by finding a set of orthonormal vectors which, together with the columns of $AV_{1}D_{1}^{-{\frac {1}{2}}}$ , form a complete base for the space, and using these vectors as the columns of $\Phi$ . Note how the definition of $U'$ is not unique, unless $A^{*}A$ (and therefore $A$ ) is invertible, in which case $AV_{1}D_{1}^{-{\frac {1}{2}}}$ is already unitary and uniquely defined.

The argument now proceeds similarly to the invertible case, with the only difference of using $U'$ in place of $AVD^{-{\frac {1}{2}}}$ . Indeed, we have:

U\left(A^{*}A\right)^{\frac {1}{2}}\equiv U'{\begin{bmatrix}V_{1}^{*}\\V_{2}^{*}\end{bmatrix}}\left(A^{*}A\right)^{\frac {1}{2}}=\left(AV_{1}D_{1}^{-{\frac {1}{2}}}V_{1}^{*}+\Phi V_{2}^{*}\right)V_{1}D_{1}^{\frac {1}{2}}V_{1}=A,

where we used the orthogonality of the columns of $V_{1}$ and $V_{2}$ , which is equivalent to $V_{2}^{*}V_{1}=0$ , and $U$ is the product of two unitaries, and is therefore also unitary.

General case, alternative proof[edit]

Making use of the SVD of $A$ , a more direct proof can be found.

The SVD of $A$ reads $A=WD^{\frac {1}{2}}V^{*}$ , with $W,V$ unitary matrices, and $D$ a diagonal, positive semi-definite matrix. By simply inserting an additional pair of $W$ s or $V$ s, we obtain the two forms of the polar decomposition of $A$ :

A=WD^{\frac {1}{2}}V^{*}=\underbrace {\left(WD^{\frac {1}{2}}W^{*}\right)} _{P}\underbrace {\left(WV^{*}\right)} _{U}=\underbrace {\left(WV^{*}\right)} _{U}\underbrace {\left(VD^{\frac {1}{2}}V^{*}\right)} _{P'}.

Bounded operators on Hilbert space[edit]

The polar decomposition of any bounded linear operator A between complex Hilbert spaces is a canonical factorization as the product of a partial isometry and a non-negative operator.

The polar decomposition for matrices generalizes as follows: if A is a bounded linear operator then there is a unique factorization of A as a product A = UP where U is a partial isometry, P is a non-negative self-adjoint operator and the initial space of U is the closure of the range of P.

The operator U must be weakened to a partial isometry, rather than unitary, because of the following issues. If A is the one-sided shift on l²(N), then |A| = {A^*A}^½ = I. So if A = U |A|, U must be A, which is not unitary.

The existence of a polar decomposition is a consequence of Douglas' lemma:

Lemma If A, B are bounded operators on a Hilbert space H, and A^*A ≤ B^*B, then there exists a contraction C such that A = CB. Furthermore, C is unique if Ker(B^*) ⊂ Ker(C).

The operator C can be defined by C(Bh) := Ah for all h in H, extended by continuity to the closure of Ran(B), and by zero on the orthogonal complement to all of H. The lemma then follows since A^*A ≤ B^*B implies Ker(B) ⊂ Ker(A).

In particular. If A^*A = B^*B, then C is a partial isometry, which is unique if Ker(B^*) ⊂ Ker(C). In general, for any bounded operator A,

A^{*}A=\left(A^{*}A\right)^{\frac {1}{2}}\left(A^{*}A\right)^{\frac {1}{2}},

where (A^*A)^½ is the unique positive square root of A^*A given by the usual functional calculus. So by the lemma, we have

A=U\left(A^{*}A\right)^{\frac {1}{2}}

for some partial isometry U, which is unique if Ker(A^*) ⊂ Ker(U). Take P to be (A^*A)^½ and one obtains the polar decomposition A = UP. Notice that an analogous argument can be used to show A = P'U', where P' is positive and U' a partial isometry.

When H is finite-dimensional, U can be extended to a unitary operator; this is not true in general (see example above). Alternatively, the polar decomposition can be shown using the operator version of singular value decomposition.

By property of the continuous functional calculus, |A| is in the C*-algebra generated by A. A similar but weaker statement holds for the partial isometry: U is in the von Neumann algebra generated by A. If A is invertible, the polar part U will be in the C*-algebra as well.

Unbounded operators[edit]

If A is a closed, densely defined unbounded operator between complex Hilbert spaces then it still has a (unique) polar decomposition

A=U|A|\,

where |A| is a (possibly unbounded) non-negative self adjoint operator with the same domain as A, and U is a partial isometry vanishing on the orthogonal complement of the range Ran(|A|).

The proof uses the same lemma as above, which goes through for unbounded operators in general. If Dom(A^*A) = Dom(B^*B) and A^*Ah = B^*Bh for all h ∈ Dom(A^*A), then there exists a partial isometry U such that A = UB. U is unique if Ran(B)^⊥ ⊂ Ker(U). The operator A being closed and densely defined ensures that the operator A^*A is self-adjoint (with dense domain) and therefore allows one to define (A^*A)^½. Applying the lemma gives polar decomposition.

If an unbounded operator A is affiliated to a von Neumann algebra M, and A = UP is its polar decomposition, then U is in M and so is the spectral projection of P, 1_B(P), for any Borel set B in [0, ∞).

Quaternion polar decomposition[edit]

The polar decomposition of quaternions H depends on the sphere $\lbrace xi+yj+zk\in H:x^{2}+y^{2}+z^{2}=1\rbrace$ of square roots of minus one. Given any r on this sphere, and an angle −π < a ≤ π, the versor $e^{ar}=\cos(a)+r\ \sin(a)$ is on the 3-sphere of H. For a = 0 and a = π, the versor is 1 or −1 regardless of which r is selected. The norm t of a quaternion q is the Euclidean distance from the origin to q. When a quaternion is not just a real number, then there is a unique polar decomposition $q=te^{ar}.$

Alternative planar decompositions[edit]

In the Cartesian plane, alternative planar ring decompositions arise as follows:

If x ≠ 0, z = x(1 + ε(y/x)) is a polar decomposition of a dual number z = x + yε, where ε² = 0; i.e., ε is nilpotent. In this polar decomposition, the unit circle has been replaced by the line x = 1, the polar angle by the slope y/x, and the radius x is negative in the left half-plane.
If x² ≠ y², then the unit hyperbola x² − y² = 1 and its conjugate x² − y² = −1 can be used to form a polar decomposition based on the branch of the unit hyperbola through (1, 0). This branch is parametrized by the hyperbolic angle a and is written
$\cosh(a)+j\ \sinh(a)=\exp(aj)=e^{aj}$

where j² = +1 and the arithmetic^[6] of split-complex numbers is used. The branch through (−1, 0) is traced by −e^aj. Since the operation of multiplying by j reflects a point across the line y = x, the second hyperbola has branches traced by je^aj or −je^aj. Therefore a point in one of the quadrants has a polar decomposition in one of the forms:

$re^{aj},-re^{aj},rje^{aj},-rje^{aj},\quad r>0$
The set { 1, −1, j, −j } has products that make it isomorphic to the Klein four-group. Evidently polar decomposition in this case involves an element from that group.

Numerical determination of the matrix polar decomposition[edit]

To compute an approximation of the polar decomposition A = UP, usually the unitary factor U is approximated.^[7]^[8] The iteration is based on Heron's method for the square root of 1 and computes, starting from $U_{0}=A$ , the sequence

U_{k+1}={\frac {1}{2}}\left(U_{k}+\left(U_{k}^{*}\right)^{-1}\right),\qquad k=0,1,2,\ldots

The combination of inversion and Hermite conjugation is chosen so that in the singular value decomposition, the unitary factors remain the same and the iteration reduces to Heron's method on the singular values.

This basic iteration may be refined to speed up the process:

Every step or in regular intervals, the range of the singular values of $U_{k}$ is estimated and then the matrix is rescaled to $\gamma _{k}U_{k}$ to center the singular values around 1. The scaling factor $\gamma _{k}$ is computed using matrix norms of the matrix and its inverse. Examples of such scale estimates are:
$\gamma _{k}={\sqrt[{4}]{\frac {\left\|U_{k}^{-1}\right\|_{1}\left\|U_{k}^{-1}\right\|_{\infty }}{\left\|U_{k}\right\|_{1}\left\|U_{k}\right\|_{\infty }}}}$

using the row-sum and column-sum matrix norms or

$\gamma _{k}={\sqrt {\frac {\left\|U_{k}^{-1}\right\|_{F}}{\left\|U_{k}\right\|_{F}}}}$

using the Frobenius norm. Including the scale factor, the iteration is now

$U_{k+1}={\frac {1}{2}}\left(\gamma _{k}U_{k}+{\frac {1}{\gamma _{k}}}\left(U_{k}^{*}\right)^{-1}\right),\qquad k=0,1,2,\ldots$
The QR decomposition can be used in a preparation step to reduce a singular matrix A to a smaller regular matrix, and inside every step to speed up the computation of the inverse.
Heron' method for computing roots of $x^{2}-1=0$ can be replaced by higher order methods, for instance based on Halley's method of third order, resulting in
$U_{k+1}=U_{k}\left(I+3U_{k}^{*}U_{k}\right)^{-1}\left(3I+U_{k}^{*}U_{k}\right),\qquad k=0,1,2,\ldots$
This iteration can again be combined with rescaling. This particular formula has the benefit that it is also applicable to singular or rectangular matrices A.

References[edit]

^ Hall 2015 Section 2.5
^ Hall 2015 Lemma 2.18
^ Hall 2015 Theorem 2.17
^ Hall 2015 Section 13.3
^ Note how this implies, by the positivity of $A^{*}A$ , that the eigenvalues are all real and strictly positive.
^ Sobczyk, G.(1995) "Hyperbolic Number Plane", College Mathematics Journal 26:268–80
^ Higham, Nicholas J. (1986). "Computing the polar decomposition with applications". SIAM J. Sci. Stat. Comput. Philadelphia, PA, USA: Society for Industrial and Applied Mathematics. 7 (4): 1160–1174. doi:10.1137/0907079. ISSN 0196-5204. Archived from the original on 2013-05-08.
^ Byers, Ralph; Hongguo Xu (2008). "A New Scaling for Newton's Iteration for the Polar Decomposition and its Backward Stability". SIAM J. Matrix Anal. Appl. Philadelphia, PA, USA: Society for Industrial and Applied Mathematics. 30 (2): 822–843. CiteSeerX 10.1.1.378.6737. doi:10.1137/070699895. ISSN 0895-4798.

Conway, J.B.: A Course in Functional Analysis. Graduate Texts in Mathematics. New York: Springer 1990
Douglas, R.G.: On Majorization, Factorization, and Range Inclusion of Operators on Hilbert Space. Proc. Amer. Math. Soc. 17, 413-415 (1966)
Hall, Brian C. (2015), Lie Groups, Lie Algebras, and Representations: An Elementary Introduction, Graduate Texts in Mathematics, 222 (2nd ed.), Springer, ISBN 978-3319134666.
Helgason, Sigurdur (1978), Differential geometry, Lie groups, and symmetric spaces, Academic Press, ISBN 0-8218-2848-7

[1] Hall 2015 Section 2.5

[2] Hall 2015 Lemma 2.18

[3] Hall 2015 Theorem 2.17

[4] Hall 2015 Section 13.3

[5] Note how this implies, by the positivity of $A^{*}A$ , that the eigenvalues are all real and strictly positive.

[6] Sobczyk, G.(1995) "Hyperbolic Number Plane", College Mathematics Journal 26:268–80

[higham1986-7] Higham, Nicholas J. (1986). "Computing the polar decomposition with applications". SIAM J. Sci. Stat. Comput. Philadelphia, PA, USA: Society for Industrial and Applied Mathematics. 7 (4): 1160–1174. doi:10.1137/0907079. ISSN 0196-5204. Archived from the original on 2013-05-08.

[byers2008-8] Byers, Ralph; Hongguo Xu (2008). "A New Scaling for Newton's Iteration for the Polar Decomposition and its Backward Stability". SIAM J. Matrix Anal. Appl. Philadelphia, PA, USA: Society for Industrial and Applied Mathematics. 30 (2): 822–843. CiteSeerX 10.1.1.378.6737. doi:10.1137/070699895. ISSN 0895-4798.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

v t e Functional analysis (topics)
TVS types	Banach Banach Lattice Barrelled Bornological Brauner F-space Finite-dimensional Fréchet (tame) Hilbert (pre-Hilbert Polarization identity) LF-space Locally convex (Seminorms/Minkowski functionals) Mackey Montel Nuclear Normed (norm) Quasinormed Reflexive Riesz Smith Stereotype Strictly convex Webbed Topological tensor product (of Hilbert spaces)
Mapping topologies	Dual Dual space (Dual norm) Operator Ultraweak Weak (polar operator) Mackey Strong (polar operator) Ultrastrong Uniform convergence
Linear operators	Adjoint Bilinear (form operator sesquilinear) (Un)Bounded Closed Compact (Dis)Continuous Densely defined Fredholm Hilbert–Schmidt Functionals (positive) Normal Nuclear Self-adjoint Strictly singular Trace class Transpose Unitary
Operator theory	Banach algebras C-algebras Spectrum (C-algebra radius) Spectral theory (of ODEs Spectral theorem) Polar decomposition Singular value decomposition
Theorems	Banach–Alaoglu Banach–Mazur Banach–Saks Bessel's inequality Cauchy–Schwarz inequality Closed graph Closed range Eberlein–Šmulian Freudenthal spectral Gelfand–Mazur Gelfand–Naimark Goldstine Hahn–Banach (hyperplane separation) Kakutani fixed-point Krein–Milman Lomonosov's invariant subspace Mackey–Arens Mazur's lemma M. Riesz extension Riesz representation Open mapping Parseval's identity Schauder fixed-point
Analysis	Abstract Wiener space Bochner space Differentiation in Fréchet spaces Derivatives (Fréchet Gateaux functional holomorphic) Integrals (Bochner Dunford Gelfand–Pettis regulated Paley–Wiener weak) Functional calculus (Borel continuous holomorphic) Inverse function theorem (Nash–Moser theorem) Measures (Lebesgue Projection-valued Vector) Weakly measurable function
Types of sets	Absolutely convex Absorbing Balanced Bounded Convex Convex cone (subset) Linear cone (subset) Radial Star-shaped Symmetric Zonotope
Subsets / set operations	Algebraic interior (core) Bounding points Convex hull Extreme point Interior Minkowski addition Polar

Polar decomposition

Contents

Matrix polar decomposition[edit]