Covariant derivative

In mathematics, the covariant derivative is a way of specifying a derivative along tangent vectors of a manifold. Alternatively, the covariant derivative is a way of introducing and working with a connection on a manifold by means of a differential operator, to be contrasted with the approach given by a principal connection on the frame bundle – see affine connection. In the special case of a manifold isometrically embedded into a higher-dimensional Euclidean space, the covariant derivative can be viewed as the orthogonal projection of the Euclidean derivative along a tangent vector onto the manifold's tangent space. In this case the Euclidean derivative is broken into two parts, the extrinsic normal component and the intrinsic covariant derivative component.

In physics, the covariant derivative is the derivative that under a general coordinate transformation transforms covariantly, that is, linearly via the Jacobian matrix of the coordinate transformation.^[1]

This article presents an introduction to the covariant derivative of a vector field with respect to a vector field, both in a coordinate free language and using a local coordinate system and the traditional index notation. The covariant derivative of a tensor field is presented as an extension of the same concept. The covariant derivative generalizes straightforwardly to a notion of differentiation associated to a connection on a vector bundle, also known as a Koszul connection.

History

Historically, at the turn of the 20th century, the covariant derivative was introduced by Gregorio Ricci-Curbastro and Tullio Levi-Civita in the theory of Riemannian and pseudo-Riemannian geometry.^[2] Ricci and Levi-Civita (following ideas of Elwin Bruno Christoffel) observed that the Christoffel symbols used to define the curvature could also provide a notion of differentiation which generalized the classical directional derivative of vector fields on a manifold.^[3]^[4] This new derivative – the Levi-Civita connection – was covariant in the sense that it satisfied Riemann's requirement that objects in geometry should be independent of their description in a particular coordinate system.

It was soon noted by other mathematicians, prominent among these being Hermann Weyl, Jan Arnoldus Schouten, and Élie Cartan,^[5] that a covariant derivative could be defined abstractly without the presence of a metric. The crucial feature was not a particular dependence on the metric, but that the Christoffel symbols satisfied a certain precise second order transformation law. This transformation law could serve as a starting point for defining the derivative in a covariant manner. Thus the theory of covariant differentiation forked off from the strictly Riemannian context to include a wider range of possible geometries.

In the 1940s, practitioners of differential geometry began introducing other notions of covariant differentiation in general vector bundles which were, in contrast to the classical bundles of interest to geometers, not part of the tensor analysis of the manifold. By and large, these generalized covariant derivatives had to be specified ad hoc by some version of the connection concept. In 1950, Jean-Louis Koszul unified these new ideas of covariant differentiation in a vector bundle by means of what is known today as a Koszul connection or a connection on a vector bundle.^[6] Using ideas from Lie algebra cohomology, Koszul successfully converted many of the analytic features of covariant differentiation into algebraic ones. In particular, Koszul connections eliminated the need for awkward manipulations of Christoffel symbols (and other analogous non-tensorial objects) in differential geometry. Thus they quickly supplanted the classical notion of covariant derivative in many post-1950 treatments of the subject.

Motivation

The covariant derivative is a generalization of the directional derivative from vector calculus. As with the directional derivative, the covariant derivative is a rule, $\nabla _{\mathbf {u} }{\mathbf {v} }$ , which takes as its inputs: (1) a vector, u, defined at a point P, and (2) a vector field, v, defined in a neighborhood of P.^[7] The output is the vector $\nabla _{\mathbf {u} }{\mathbf {v} }(P)$ , also at the point P. The primary difference from the usual directional derivative is that $\nabla _{\mathbf {u} }{\mathbf {v} }$ must, in a certain precise sense, be independent of the manner in which it is expressed in a coordinate system.

A vector may be described as a list of numbers in terms of a basis, but as a geometrical object a vector retains its own identity regardless of how one chooses to describe it in a basis. This persistence of identity is reflected in the fact that when a vector is written in one basis, and then the basis is changed, the components of the vector transform according to a change of basis formula. The covariant derivative is required to transform, under a change in coordinates, by a covariant transformation, that is, in the same way as a basis does (hence the name).

In the case of Euclidean space, one tends to define the derivative of a vector field in terms of the difference between two vectors at two nearby points. In such a system one translates one of the vectors to the origin of the other, keeping it parallel. With a Cartesian (fixed orthonormal) coordinate system "keeping it parallel" amounts to keeping the components constant. Thus is obtained the simplest example: a covariant derivative which is obtained by taking the ordinary directional derivative of the components in the direction of the displacement vector between the two nearby points.

In the general case, however, one must take into account the change of the coordinate system. For example, if the same covariant derivative is written in polar coordinates in a two dimensional Euclidean plane, then it contains extra terms that describe how the coordinate grid itself "rotates". In other cases the extra terms describe how the coordinate grid expands, contracts, twists, interweaves, etc. In this case "keeping it parallel" does not amount to keeping components constant under translation.

Consider the example of moving along a curve γ(t) in the Euclidean plane. In polar coordinates, γ may be written in terms of its radial and angular coordinates by γ(t) = (r(t), θ(t)). A vector at a particular time t^[8] (for instance, the acceleration of the curve) is expressed in terms of $(\mathbf {e} _{r},\mathbf {e} _{\theta })$, where $\mathbf {e} _{r}$ and $\mathbf {e} _{\theta }$ are unit tangent vectors for the polar coordinates, serving as a basis to decompose a vector in terms of radial and tangential components. At a slightly later time, the new basis in polar coordinates appears slightly rotated with respect to the first set. The covariant derivative of the basis vectors (the Christoffel symbols) serves to express this change.

In a curved space, such as the surface of the Earth (regarded as a sphere), the translation is not well defined and its analog, parallel transport, depends on the path along which the vector is translated.

A vector e on a globe on the equator at point Q is directed to the north. Suppose we parallel transport the vector first along the equator to a point P, then (keeping it parallel to itself) drag it along a meridian to the pole N, and (keeping the direction there) subsequently transport it along another meridian back to Q. Then we notice that the vector parallel-transported along a closed circuit does not return as the same vector; instead, it has another orientation. This would not happen in Euclidean space and is caused by the curvature of the surface of the globe. The same effect can be noticed if we drag the vector around an infinitesimally small closed loop, first along two directions and then back. The infinitesimal change of the vector is a measure of the curvature.

Remarks

The definition of the covariant derivative does not use the metric in space. However, for each metric there is a unique torsion-free covariant derivative called the Levi-Civita connection such that the covariant derivative of the metric is zero.
The properties of a derivative imply that $\nabla _{\mathbf {v} }\mathbf {u}$ depends on an arbitrarily small neighborhood of a point p in the same way as e.g. the derivative of a scalar function along a curve at a given point p depends on an arbitrarily small neighborhood of p.
The information on the neighborhood of a point p in the covariant derivative can be used to define parallel transport of a vector. Also the curvature, torsion, and geodesics may be defined only in terms of the covariant derivative or other related variation on the idea of a linear connection.

Informal definition using an embedding into Euclidean space

Suppose a (pseudo-)Riemannian manifold $M$ is embedded into Euclidean space $(\mathbb {R} ^{n},\langle \cdot ;\cdot \rangle )$ via a twice continuously differentiable (C²) mapping ${\vec {\Psi }}:\mathbb {R} ^{d}\supset U\rightarrow \mathbb {R} ^{n}$ such that the tangent space at ${\vec {\Psi }}(p)\in M$ is spanned by the vectors

\left\lbrace \left.{\frac {\partial {\vec {\Psi }}}{\partial x^{i}}}\right|_{p}:i\in \lbrace 1,\dots ,d\rbrace \right\rbrace

and the scalar product on $\mathbb {R} ^{n}$ is compatible with the metric on M:

g_{ij}=\left\langle {\frac {\partial {\vec {\Psi }}}{\partial x^{i}}};{\frac {\partial {\vec {\Psi }}}{\partial x^{j}}}\right\rangle .

(Since the manifold metric is always assumed to be regular, the compatibility condition implies linear independence of the partial derivative tangent vectors.)

For a tangent vector field, ${\vec {V}}=v^{j}{\frac {\partial {\vec {\Psi }}}{\partial x^{j}}}\,$ , one has

{\frac {\partial {\vec {V}}}{\partial x^{i}}}={\frac {\partial v^{j}}{\partial x^{i}}}{\frac {\partial {\vec {\Psi }}}{\partial x^{j}}}+v^{j}{\frac {\partial ^{2}{\vec {\Psi }}}{\partial x^{i}\,\partial x^{j}}}.

The last term is in general not tangential to M, but it can be expressed as a linear combination of the tangent space basis vectors, with the Christoffel symbols as coefficients, plus a vector orthogonal to the tangent space:

{\frac {\partial ^{2}{\vec {\Psi }}}{\partial x^{i}\,\partial x^{j}}}={\Gamma ^{k}}_{ij}{\frac {\partial {\vec {\Psi }}}{\partial x^{k}}}+{\vec {n}}_{ij}.

In the case of the Levi-Civita connection, the covariant derivative $\nabla _{\mathbf {e} _{i}}{\vec {V}}$, also written $\nabla _{i}{\vec {V}}$, is defined as the orthogonal projection of the usual derivative onto the tangent space:

\nabla _{\mathbf {e} _{i}}{\vec {V}}:={\frac {\partial {\vec {V}}}{\partial x^{i}}}-v^{j}{\vec {n}}_{ij}=\left({\frac {\partial v^{k}}{\partial x^{i}}}+v^{j}{\Gamma ^{k}}_{ij}\right){\frac {\partial {\vec {\Psi }}}{\partial x^{k}}}.

Since each ${\vec {n}}_{ij}$ is orthogonal to the tangent space, one can solve the normal equations:

\left\langle {\frac {\partial ^{2}{\vec {\Psi }}}{\partial x^{i}\,\partial x^{j}}};{\frac {\partial {\vec {\Psi }}}{\partial x^{l}}}\right\rangle ={\Gamma ^{k}}_{ij}\left\langle {\frac {\partial {\vec {\Psi }}}{\partial x^{k}}};{\frac {\partial {\vec {\Psi }}}{\partial x^{l}}}\right\rangle ={\Gamma ^{k}}_{ij}\,g_{kl}.

On the other hand,

{\frac {\partial g_{ab}}{\partial x^{c}}}=\left\langle {\frac {\partial ^{2}{\vec {\Psi }}}{\partial x^{c}\,\partial x^{a}}};{\frac {\partial {\vec {\Psi }}}{\partial x^{b}}}\right\rangle +\left\langle {\frac {\partial {\vec {\Psi }}}{\partial x^{a}}};{\frac {\partial ^{2}{\vec {\Psi }}}{\partial x^{c}\,\partial x^{b}}}\right\rangle

implies

{\begin{pmatrix}{\frac {\partial g_{jk}}{\partial x^{i}}}\\{\frac {\partial g_{ki}}{\partial x^{j}}}\\{\frac {\partial g_{ij}}{\partial x^{k}}}\end{pmatrix}}={\begin{pmatrix}0&1&1\\1&0&1\\1&1&0\end{pmatrix}}{\begin{pmatrix}\left\langle {\frac {\partial {\vec {\Psi }}}{\partial x^{i}}};{\frac {\partial ^{2}{\vec {\Psi }}}{\partial x^{j}\,\partial x^{k}}}\right\rangle \\\left\langle {\frac {\partial {\vec {\Psi }}}{\partial x^{j}}};{\frac {\partial ^{2}{\vec {\Psi }}}{\partial x^{k}\,\partial x^{i}}}\right\rangle \\\left\langle {\frac {\partial {\vec {\Psi }}}{\partial x^{k}}};{\frac {\partial ^{2}{\vec {\Psi }}}{\partial x^{i}\,\partial x^{j}}}\right\rangle \end{pmatrix}}

(using the symmetry of the scalar product and swapping the order of partial differentiations). Adding the first two rows and subtracting the third gives

{\frac {\partial g_{jk}}{\partial x^{i}}}+{\frac {\partial g_{ki}}{\partial x^{j}}}-{\frac {\partial g_{ij}}{\partial x^{k}}}=2\left\langle {\frac {\partial {\vec {\Psi }}}{\partial x^{k}}};{\frac {\partial ^{2}{\vec {\Psi }}}{\partial x^{i}\,\partial x^{j}}}\right\rangle

which, combined with the normal equations, yields the Christoffel symbols for the Levi-Civita connection in terms of the metric:

g_{kl}{\Gamma ^{k}}_{ij}={\frac {1}{2}}\left({\frac {\partial g_{jl}}{\partial x^{i}}}+{\frac {\partial g_{li}}{\partial x^{j}}}-{\frac {\partial g_{ij}}{\partial x^{l}}}\right).
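As a rough numerical illustration of this formula, the following sketch evaluates the Christoffel symbols of the Euclidean plane in polar coordinates $(r,\theta )$, where the metric is $\mathrm {diag} (1,r^{2})$. The function names, the finite-difference step and the choice of polar coordinates are assumptions made for this example and are not part of the derivation.

```python
def polar_metric(x):
    """Metric g_ab of the Euclidean plane in polar coordinates x = (r, theta)."""
    r, _theta = x
    return [[1.0, 0.0], [0.0, r * r]]

def metric_derivative(metric, x, c, h=1e-5):
    """Central-difference estimate of d g_ab / d x^c, returned as a 2x2 list."""
    xp, xm = list(x), list(x)
    xp[c] += h
    xm[c] -= h
    gp, gm = metric(xp), metric(xm)
    return [[(gp[a][b] - gm[a][b]) / (2 * h) for b in range(2)] for a in range(2)]

def inverse_2x2(g):
    """Inverse of a 2x2 matrix given as nested lists."""
    det = g[0][0] * g[1][1] - g[0][1] * g[1][0]
    return [[g[1][1] / det, -g[0][1] / det], [-g[1][0] / det, g[0][0] / det]]

def christoffel(metric, x):
    """Return gamma[k][i][j] = Gamma^k_{ij}, solving
    g_kl Gamma^k_ij = (d_i g_jl + d_j g_li - d_l g_ij) / 2 for Gamma."""
    n = 2
    dg = [metric_derivative(metric, x, c) for c in range(n)]  # dg[c][a][b] = d_c g_ab
    ginv = inverse_2x2(metric(x))
    gamma = [[[0.0] * n for _ in range(n)] for _ in range(n)]
    for k in range(n):
        for i in range(n):
            for j in range(n):
                gamma[k][i][j] = sum(
                    0.5 * ginv[k][l] * (dg[i][j][l] + dg[j][l][i] - dg[l][i][j])
                    for l in range(n)
                )
    return gamma
```

At $(r,\theta )=(2,0.7)$, `christoffel(polar_metric, (2.0, 0.7))` should give ${\Gamma ^{r}}_{\theta \theta }\approx -2$ and ${\Gamma ^{\theta }}_{r\theta }\approx 0.5$, matching the closed forms $-r$ and $1/r$; all other components are zero up to rounding.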

For a very simple example that captures the essence of the description above, draw a circle on a flat sheet of paper. Travel around the circle at a constant speed. The derivative of your velocity, your acceleration vector, always points radially inward. Roll this sheet of paper into a cylinder. Now the (Euclidean) derivative of your velocity has a component that sometimes points inward toward the axis of the cylinder depending on whether you're near a solstice or an equinox. (At the point of the circle when you are moving parallel to the axis, there is no inward acceleration. Conversely, at a point (1/4 of a circle later) when the velocity is along the cylinder's bend, the inward acceleration is maximum.) This is the (Euclidean) normal component. The covariant derivative component is the component parallel to the cylinder's surface, and is the same as that before you rolled the sheet into a cylinder.
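The rolled-sheet picture can be checked numerically. The sketch below makes several assumptions that are not in the text: the sheet has coordinates $(s,z)$, the cylinder has radius `R = 2`, the circle has radius `A = 1` and is centred at the origin of the sheet, and derivatives are estimated by central differences. It splits the Euclidean acceleration of the rolled-up curve into its tangential and normal parts.

```python
import math

R = 2.0  # cylinder radius (assumed value)
A = 1.0  # radius of the circle drawn on the flat sheet (assumed value)

def sheet_curve(t):
    """Circle traversed at constant speed on the flat sheet, coordinates (s, z)."""
    return A * math.cos(t), A * math.sin(t)

def roll(s, z):
    """Isometric embedding of the sheet as a cylinder of radius R in R^3."""
    return [R * math.cos(s / R), R * math.sin(s / R), z]

def euclidean_acceleration(t, h=1e-4):
    """Second derivative in R^3 of the rolled-up curve, by central differences."""
    pm, p0, pp = (roll(*sheet_curve(t + k * h)) for k in (-1, 0, 1))
    return [(a - 2 * b + c) / (h * h) for a, b, c in zip(pp, p0, pm)]

def split_acceleration(t):
    """Return ((tangential s-part, tangential z-part), normal part)."""
    s, _z = sheet_curve(t)
    e_s = [-math.sin(s / R), math.cos(s / R), 0.0]  # dPsi/ds, unit length
    e_z = [0.0, 0.0, 1.0]                           # dPsi/dz, unit length
    n = [math.cos(s / R), math.sin(s / R), 0.0]     # outward unit normal
    acc = euclidean_acceleration(t)
    dot = lambda u, v: sum(a * b for a, b in zip(u, v))
    return (dot(acc, e_s), dot(acc, e_z)), dot(acc, n)
```

Under these assumptions the tangential part agrees with the flat acceleration $(-A\cos t,-A\sin t)$ of the circle on the unrolled sheet, while the normal part is $-A^{2}\sin ^{2}t/R$: it vanishes where the motion is parallel to the axis and is largest in magnitude a quarter of a circle later.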

Formal definition

A covariant derivative is a (Koszul) connection on the tangent bundle and other tensor bundles. Thus it has a certain behavior on vector fields that extends that of the usual differential on functions. It also extends in a unique way to the duals of vector fields (i.e., covector fields), and to arbitrary tensor fields, that ensures compatibility with the tensor product and trace operations (tensor contraction).

Functions

Given a point p of the manifold, a real function f on the manifold, and a tangent vector v at p, the covariant derivative of f at p along v is the scalar at p, denoted $\left(\nabla _{\mathbf {v} }f\right)_{p}$ , that represents the principal part of the change in the value of f when the argument of f is changed by the infinitesimal displacement vector v. (This is the differential of f evaluated against the vector v.) Formally, there is a differentiable curve $\phi :[-1,1]\to M$ such that $\phi (0)=p$ and $\phi '(0)=\mathbf {v}$ , and the covariant derivative of f at p is defined by

\left(\nabla _{\mathbf {v} }f\right)_{p}=\left(f\circ \phi \right)'\left(0\right)=\lim _{t\to 0}t^{-1}\left(f\left[\phi \left(t\right)\right]-f\left[p\right]\right).
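This definition can be tried out numerically. The sketch below is an illustration under assumptions not found in the text: the manifold is taken to be the plane $\mathbb {R} ^{2}$, and the function `f`, the point p = (1, 2), the vector v = (3, −1) and both curves are made-up examples. It estimates $(f\circ \phi )'(0)$ for two different curves through p with the same velocity v.

```python
def f(point):
    """An example scalar function on R^2 (hypothetical choice)."""
    x, y = point
    return x * x * y

def straight_curve(t):
    """phi(t) = p + t v with p = (1, 2) and v = (3, -1)."""
    return (1.0 + 3.0 * t, 2.0 - 1.0 * t)

def bent_curve(t):
    """A different curve with the same phi(0) = p and phi'(0) = v."""
    return (1.0 + 3.0 * t + t * t, 2.0 - 1.0 * t + 5.0 * t * t)

def derivative_along(func, phi, h=1e-6):
    """Central-difference estimate of (func o phi)'(0)."""
    return (func(phi(h)) - func(phi(-h))) / (2.0 * h)
```

Both `derivative_along(f, straight_curve)` and `derivative_along(f, bent_curve)` come out close to 11, the value of the differential of f at p applied to v. This illustrates that the result depends on the curve only through $\phi (0)$ and $\phi '(0)$.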

When v is a vector field, the covariant derivative $\nabla _{\mathbf {v} }f$ is the function that associates with each point p in the common domain of f and v the scalar $\left(\nabla _{\mathbf {v} }f\right)_{p}$ . This coincides with the usual Lie derivative of f along the vector field v.

Vector fields

A covariant derivative $\nabla$ at a point p in a smooth manifold assigns a tangent vector $(\nabla _{\mathbf {v} }\mathbf {u} )_{p}$ to each pair $(\mathbf {v} ,\mathbf {u} )$ consisting of a tangent vector v at p and a vector field u defined in a neighborhood of p, such that the following properties hold (for any vectors v, x and y at p, vector fields u and w defined in a neighborhood of p, scalar values g and h at p, and scalar function f defined in a neighborhood of p):

$\left(\nabla _{\mathbf {v} }\mathbf {u} \right)_{p}$ is linear in $\mathbf {v}$, so:
$\left(\nabla _{g\mathbf {x} +h\mathbf {y} }\mathbf {u} \right)_{p}=\left(\nabla _{\mathbf {x} }\mathbf {u} \right)_{p}g+\left(\nabla _{\mathbf {y} }\mathbf {u} \right)_{p}h$
$\left(\nabla _{\mathbf {v} }\mathbf {u} \right)_{p}$ is additive in $\mathbf {u}$, so:
$\left(\nabla _{\mathbf {v} }\left[\mathbf {u} +\mathbf {w} \right]\right)_{p}=\left(\nabla _{\mathbf {v} }\mathbf {u} \right)_{p}+\left(\nabla _{\mathbf {v} }\mathbf {w} \right)_{p}$
$\left(\nabla _{\mathbf {v} }\mathbf {u} \right)_{p}$ obeys the product rule; i.e., where $\nabla _{\mathbf {v} }f$ is defined above,
$\left(\nabla _{\mathbf {v} }\left[f\mathbf {u} \right]\right)_{p}=f(p)\left(\nabla _{\mathbf {v} }\mathbf {u} \right)_{p}+\left(\nabla _{\mathbf {v} }f\right)_{p}\mathbf {u} _{p}$.

If u and v are both vector fields defined over a common domain, then $\nabla _{\mathbf {v} }\mathbf {u}$ denotes the vector field whose value at each point p of the domain is the tangent vector $\left(\nabla _{\mathbf {v} }\mathbf {u} \right)_{p}$ . Note that $\left(\nabla _{\mathbf {v} }\mathbf {u} \right)_{p}$ depends not only on the value of v at p but also on values of u in an infinitesimal neighbourhood of p because of the last property, the product rule.

Covector fields

Given a field of covectors (or one-form) $\alpha$ defined in a neighborhood of p, its covariant derivative $(\nabla _{\mathbf {v} }\alpha )_{p}$ is defined in a way to make the resulting operation compatible with tensor contraction and the product rule. That is, $(\nabla _{\mathbf {v} }\alpha )_{p}$ is defined as the unique one-form at p such that the following identity is satisfied for all vector fields u in a neighborhood of p

\left(\nabla _{\mathbf {v} }\alpha \right)_{p}\left(\mathbf {u} _{p}\right)=\nabla _{\mathbf {v} }\left[\alpha \left(\mathbf {u} \right)\right]_{p}-\alpha _{p}\left[\left(\nabla _{\mathbf {v} }\mathbf {u} \right)_{p}\right].

The covariant derivative of a covector field along a vector field v is again a covector field.

Tensor fields

Once the covariant derivative is defined for fields of vectors and covectors it can be defined for arbitrary tensor fields by imposing the following identities for every pair of tensor fields $\varphi$ and $\psi \,$ in a neighborhood of the point p:

\nabla _{\mathbf {v} }\left(\varphi \otimes \psi \right)_{p}=\left(\nabla _{\mathbf {v} }\varphi \right)_{p}\otimes \psi (p)+\varphi (p)\otimes \left(\nabla _{\mathbf {v} }\psi \right)_{p},

and for $\varphi$ and $\psi$ of the same valence

\nabla _{\mathbf {v} }(\varphi +\psi )_{p}=(\nabla _{\mathbf {v} }\varphi )_{p}+(\nabla _{\mathbf {v} }\psi )_{p}.

The covariant derivative of a tensor field along a vector field v is again a tensor field of the same type.

Explicitly, let T be a tensor field of type (p, q). Consider T to be a differentiable multilinear map of smooth sections α¹, α², ..., α^p of the cotangent bundle T^∗M and of sections X₁, X₂, ..., X_q of the tangent bundle TM, written T(α¹, α², ..., X₁, X₂, ...), into R. The covariant derivative of T along Y is given by the formula

{\begin{aligned}(\nabla _{Y}T)&\left(\alpha ^{1},\alpha ^{2},\ldots ,X_{1},X_{2},\ldots \right)=Y\left(T\left(\alpha ^{1},\alpha ^{2},\ldots ,X_{1},X_{2},\ldots \right)\right)\\&-T\left(\nabla _{Y}\alpha ^{1},\alpha ^{2},\ldots ,X_{1},X_{2},\ldots \right)-T\left(\alpha ^{1},\nabla _{Y}\alpha ^{2},\ldots ,X_{1},X_{2},\ldots \right)-\ldots \\&-T\left(\alpha ^{1},\alpha ^{2},\ldots ,\nabla _{Y}X_{1},X_{2},\ldots \right)-T\left(\alpha ^{1},\alpha ^{2},\ldots ,X_{1},\nabla _{Y}X_{2},\ldots \right)-\ldots \end{aligned}}
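For instance, for a tensor field T of type (1, 1), which takes one covector field $\alpha$ and one vector field $X$, the formula reduces to the following special case, written out here for illustration:

(\nabla _{Y}T)(\alpha ,X)=Y\left(T(\alpha ,X)\right)-T\left(\nabla _{Y}\alpha ,X\right)-T\left(\alpha ,\nabla _{Y}X\right).

Taking T to be the natural pairing $T(\alpha ,X)=\alpha (X)$, the defining identity for the covariant derivative of a covector field shows that the right-hand side vanishes, so this pairing is covariantly constant.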

Coordinate description

Given coordinate functions

x^{i},\ i=0,1,2,\dots ,

any tangent vector can be described by its components in the basis

\mathbf {e} _{i}={\partial  \over \partial x^{i}}.

The covariant derivative of a basis vector along a basis vector is again a vector and so can be expressed as a linear combination $\Gamma ^{k}\mathbf {e} _{k}$. To specify the covariant derivative it is enough to specify the covariant derivative of each basis vector field $\mathbf {e} _{i}$ along $\mathbf {e} _{j}$:

\nabla _{\mathbf {e} _{j}}\mathbf {e} _{i}={\Gamma ^{k}}_{ij}\mathbf {e} _{k}.

The coefficients ${\Gamma ^{k}}_{ij}$ are the components of the connection with respect to a system of local coordinates. In the theory of Riemannian and pseudo-Riemannian manifolds, the components of the Levi-Civita connection with respect to a system of local coordinates are called Christoffel symbols.

Then using the rules in the definition, we find that for general vector fields $\mathbf {v} =v^{j}\mathbf {e} _{j}$ and $\mathbf {u} =u^{i}\mathbf {e} _{i}$ we get

{\begin{aligned}\nabla _{\mathbf {v} }\mathbf {u} &=\nabla _{v^{j}\mathbf {e} _{j}}u^{i}\mathbf {e} _{i}\\&=v^{j}\nabla _{\mathbf {e} _{j}}u^{i}\mathbf {e} _{i}\\&=v^{j}u^{i}\nabla _{\mathbf {e} _{j}}\mathbf {e} _{i}+v^{j}\mathbf {e} _{i}\nabla _{\mathbf {e} _{j}}u^{i}\\&=v^{j}u^{i}{\Gamma ^{k}}_{ij}\mathbf {e} _{k}+v^{j}{\partial u^{i} \over \partial x^{j}}\mathbf {e} _{i}\end{aligned}}

so

\nabla _{\mathbf {v} }\mathbf {u} =\left(v^{j}u^{i}{\Gamma ^{k}}_{ij}+v^{j}{\partial u^{k} \over \partial x^{j}}\right)\mathbf {e} _{k}

The first term in this formula is responsible for "twisting" the coordinate system with respect to the covariant derivative and the second for changes of components of the vector field u. In particular

\nabla _{\mathbf {e} _{j}}\mathbf {u} =\nabla _{j}\mathbf {u} =\left({\frac {\partial u^{i}}{\partial x^{j}}}+u^{k}{\Gamma ^{i}}_{kj}\right)\mathbf {e} _{i}

In words: the covariant derivative is the usual derivative along the coordinates with correction terms which tell how the coordinates change.
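A small numerical sketch may make the role of the correction terms concrete. It assumes the Euclidean plane in polar coordinates $(r,\theta )$ with its Levi-Civita connection, whose only non-zero symbols are ${\Gamma ^{r}}_{\theta \theta }=-r$ and ${\Gamma ^{\theta }}_{r\theta }={\Gamma ^{\theta }}_{\theta r}=1/r$. The function names and the test field (the constant Cartesian field $\partial /\partial x$ written in polar components) are choices made for this example.

```python
import math

def gamma_polar(x):
    """gamma[i][k][j] = Gamma^i_{kj} for polar coordinates x = (r, theta)."""
    r, _theta = x
    gam = [[[0.0, 0.0], [0.0, 0.0]] for _ in range(2)]
    gam[0][1][1] = -r        # Gamma^r_{theta theta}
    gam[1][0][1] = 1.0 / r   # Gamma^theta_{r theta}
    gam[1][1][0] = 1.0 / r   # Gamma^theta_{theta r}
    return gam

def constant_x_field(x):
    """Polar components (u^r, u^theta) of the constant Cartesian field d/dx."""
    r, th = x
    return [math.cos(th), -math.sin(th) / r]

def covariant_derivative(u, gamma, x, j, h=1e-6):
    """Components of nabla_{e_j} u: du^i/dx^j + u^k Gamma^i_{kj}."""
    xp, xm = list(x), list(x)
    xp[j] += h
    xm[j] -= h
    up, um, u0, gam = u(xp), u(xm), u(x), gamma(x)
    n = len(u0)
    return [
        (up[i] - um[i]) / (2 * h) + sum(u0[k] * gam[i][k][j] for k in range(n))
        for i in range(n)
    ]
```

The polar components of this field vary from point to point, so their partial derivatives are non-zero, yet `covariant_derivative(constant_x_field, gamma_polar, x, j)` returns values that vanish up to finite-difference error for both `j = 0` and `j = 1`: the connection terms cancel exactly the part of the change that is due to the rotating coordinate basis.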

For covectors similarly we have

\nabla _{\mathbf {e} _{j}}{\mathbf {\theta } }=\left({\frac {\partial \theta _{i}}{\partial x^{j}}}-\theta _{k}{\Gamma ^{k}}_{ij}\right){\mathbf {e} ^{*}}^{i}

where ${\mathbf {e} ^{*}}^{i}(\mathbf {e} _{j})={\delta ^{i}}_{j}$ .
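The covector formula can be sketched in the same way. The example again assumes polar coordinates on the Euclidean plane with the Levi-Civita connection; the helper names and the choice of the one-form $dx=\cos \theta \,dr-r\sin \theta \,d\theta$ are illustrative assumptions.

```python
import math

def gamma_polar(x):
    """gamma[k][i][j] = Gamma^k_{ij} for polar coordinates x = (r, theta)."""
    r, _theta = x
    gam = [[[0.0, 0.0], [0.0, 0.0]] for _ in range(2)]
    gam[0][1][1] = -r        # Gamma^r_{theta theta}
    gam[1][0][1] = 1.0 / r   # Gamma^theta_{r theta}
    gam[1][1][0] = 1.0 / r   # Gamma^theta_{theta r}
    return gam

def dx_covector(x):
    """Polar components (alpha_r, alpha_theta) of the Cartesian one-form dx."""
    r, th = x
    return [math.cos(th), -r * math.sin(th)]

def covector_derivative(alpha, gamma, x, j, h=1e-6):
    """Components of nabla_{e_j} alpha: d alpha_i/dx^j - alpha_k Gamma^k_{ij}."""
    xp, xm = list(x), list(x)
    xp[j] += h
    xm[j] -= h
    ap, am, a0, gam = alpha(xp), alpha(xm), alpha(x), gamma(x)
    n = len(a0)
    return [
        (ap[i] - am[i]) / (2 * h) - sum(a0[k] * gam[k][i][j] for k in range(n))
        for i in range(n)
    ]
```

Since $dx$ is constant in Cartesian coordinates, its covariant derivative should vanish; the function returns components that are zero up to finite-difference error, with the minus sign in front of the connection term being what makes the cancellation work.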

The covariant derivative of a type (r, s) tensor field along $e_{c}$ is given by the expression:

{\begin{aligned}{(\nabla _{e_{c}}T)^{a_{1}\ldots a_{r}}}_{b_{1}\ldots b_{s}}={}&{\frac {\partial }{\partial x^{c}}}{T^{a_{1}\ldots a_{r}}}_{b_{1}\ldots b_{s}}\\&+\,{\Gamma ^{a_{1}}}_{dc}{T^{da_{2}\ldots a_{r}}}_{b_{1}\ldots b_{s}}+\cdots +{\Gamma ^{a_{r}}}_{dc}{T^{a_{1}\ldots a_{r-1}d}}_{b_{1}\ldots b_{s}}\\&-\,{\Gamma ^{d}}_{b_{1}c}{T^{a_{1}\ldots a_{r}}}_{db_{2}\ldots b_{s}}-\cdots -{\Gamma ^{d}}_{b_{s}c}{T^{a_{1}\ldots a_{r}}}_{b_{1}\ldots b_{s-1}d}.\end{aligned}}

Or, in words: take the partial derivative of the tensor and add: $+{\Gamma ^{a_{i}}}_{dc}$ for every upper index $a_{i}$ , and $-{\Gamma ^{d}}_{b_{i}c}$ for every lower index $b_{i}$ .
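As an illustration of the rule for lower indices, the sketch below applies it to a type (0, 2) tensor field. It assumes polar coordinates on the Euclidean plane with the Levi-Civita connection, and uses the metric itself as the test tensor; all function names are hypothetical.

```python
def gamma_polar(x):
    """gamma[d][a][c] = Gamma^d_{ac} for polar coordinates x = (r, theta)."""
    r, _theta = x
    gam = [[[0.0, 0.0], [0.0, 0.0]] for _ in range(2)]
    gam[0][1][1] = -r        # Gamma^r_{theta theta}
    gam[1][0][1] = 1.0 / r   # Gamma^theta_{r theta}
    gam[1][1][0] = 1.0 / r   # Gamma^theta_{theta r}
    return gam

def metric_polar(x):
    """Components g_ab of the Euclidean metric in polar coordinates."""
    r, _theta = x
    return [[1.0, 0.0], [0.0, r * r]]

def covariant_derivative_02(tensor, gamma, x, c, h=1e-6):
    """(nabla_c T)_ab = d_c T_ab - Gamma^d_{ac} T_db - Gamma^d_{bc} T_ad."""
    xp, xm = list(x), list(x)
    xp[c] += h
    xm[c] -= h
    tp, tm, t0, gam = tensor(xp), tensor(xm), tensor(x), gamma(x)
    n = len(t0)
    return [
        [
            (tp[a][b] - tm[a][b]) / (2 * h)
            - sum(gam[d][a][c] * t0[d][b] for d in range(n))
            - sum(gam[d][b][c] * t0[a][d] for d in range(n))
            for b in range(n)
        ]
        for a in range(n)
    ]
```

For the metric, every component of the result is zero up to finite-difference error, which is the metric-compatibility property of the Levi-Civita connection: for example, the partial derivative $\partial _{r}g_{\theta \theta }=2r$ is cancelled by the two correction terms $-2\,{\Gamma ^{\theta }}_{\theta r}\,g_{\theta \theta }$.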

If, instead of a tensor, one is differentiating a tensor density (of weight +1), then one also adds the term

-{\Gamma ^{d}}_{dc}{T^{a_{1}\ldots a_{r}}}_{b_{1}\ldots b_{s}}.

If it is a tensor density of weight W, then multiply that term by W. For example, ${\sqrt {-g}}$ is a scalar density (of weight +1), so we get:

\left({\sqrt {-g}}\right)_{;c}=\left({\sqrt {-g}}\right)_{,c}-{\sqrt {-g}}\,{\Gamma ^{d}}_{dc}

where the semicolon ";" indicates covariant differentiation and the comma "," indicates partial differentiation. Incidentally, for the Levi-Civita connection this particular expression is equal to zero, because the covariant derivative of a function solely of the metric is always zero.
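As a quick check of this vanishing, take the Euclidean plane in polar coordinates $(r,\theta )$ with its Levi-Civita connection. This example is an assumption brought in for illustration: the metric $\mathrm {diag} (1,r^{2})$ is positive-definite, so the scalar density is ${\sqrt {g}}=r$ rather than ${\sqrt {-g}}$, and the only non-zero contracted symbol is ${\Gamma ^{d}}_{dr}={\Gamma ^{\theta }}_{\theta r}=1/r$. Then

\left({\sqrt {g}}\right)_{;r}=\partial _{r}r-r\,{\Gamma ^{d}}_{dr}=1-r\cdot {\frac {1}{r}}=0,\qquad \left({\sqrt {g}}\right)_{;\theta }=\partial _{\theta }r-r\,{\Gamma ^{d}}_{d\theta }=0-r\cdot 0=0.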

Examples

For a scalar field $\displaystyle \phi \,$ , covariant differentiation is simply partial differentiation:

\displaystyle \phi _{;a}\equiv \partial _{a}\phi

For a contravariant vector field $\lambda ^{a}\,$ , we have:

{\lambda ^{a}}_{;b}\equiv \partial _{b}\lambda ^{a}+{\Gamma ^{a}}_{bc}\lambda ^{c}

For a covariant vector field $\lambda _{a}\,$ , we have:

\lambda _{a;c}\equiv \partial _{c}\lambda _{a}-{\Gamma ^{b}}_{ca}\lambda _{b}

For a type (2,0) tensor field $\tau ^{ab}\,$ , we have:

{\tau ^{ab}}_{;c}\equiv \partial _{c}\tau ^{ab}+{\Gamma ^{a}}_{cd}\tau ^{db}+{\Gamma ^{b}}_{cd}\tau ^{ad}

For a type (0,2) tensor field $\tau _{ab}\,$ , we have:

\tau _{ab;c}\equiv \partial _{c}\tau _{ab}-{\Gamma ^{d}}_{ca}\tau _{db}-{\Gamma ^{d}}_{cb}\tau _{ad}

For a type (1,1) tensor field ${\tau ^{a}}_{b}\,$ , we have:

{\tau ^{a}}_{b;c}\equiv \partial _{c}{\tau ^{a}}_{b}+{\Gamma ^{a}}_{cd}{\tau ^{d}}_{b}-{\Gamma ^{d}}_{cb}{\tau ^{a}}_{d}

The notation above is meant in the sense

{\tau ^{ab}}_{;c}\equiv \left(\nabla _{\mathbf {e} _{c}}\tau \right)^{ab}

Covariant derivatives do not commute in general; i.e. $\lambda _{a;bc}\neq \lambda _{a;cb}$. It can be shown that:

\lambda _{a;bc}-\lambda _{a;cb}={R^{d}}_{abc}\lambda _{d}

where ${R^{d}}_{abc}\,$ is the Riemann tensor. Similarly,

{\lambda ^{a}}_{;bc}-{\lambda ^{a}}_{;cb}=-{R^{a}}_{dbc}\lambda ^{d}

and

{\tau ^{ab}}_{;cd}-{\tau ^{ab}}_{;dc}=-{R^{a}}_{ecd}\tau ^{eb}-{R^{b}}_{ecd}\tau ^{ae}

The latter can be shown by taking (without loss of generality) that $\tau ^{ab}=\lambda ^{a}\mu ^{b}\,$ .

Notation

In textbooks on physics, the covariant derivative is sometimes simply stated in terms of its components, as in the coordinate description above.

Often a notation is used in which the covariant derivative is given with a semicolon, while a normal partial derivative is indicated by a comma. In this notation we write the same as:

\nabla _{e_{j}}\mathbf {v} \ {\stackrel {\mathrm {def} }{=}}\ {v^{s}}_{;j}e_{s}\;\;\;\;\;\;{v^{i}}_{;j}={v^{i}}_{,j}+v^{k}{\Gamma ^{i}}_{kj}

Once again this shows that the covariant derivative of a vector field is not obtained simply by differentiating the components with respect to the coordinates, ${v^{i}}_{,j}$, but also depends on the vector v itself through $v^{k}{\Gamma ^{i}}_{kj}$.

In some older texts (notably Adler, Bazin & Schiffer, Introduction to General Relativity), the covariant derivative is denoted by a double pipe and the partial derivative by single pipe:

\nabla _{e_{j}}\mathbf {v} \ {\stackrel {\mathrm {def} }{=}}\ {v^{i}}_{||j}e_{i}\;\;\;\;\;\;{v^{i}}_{||j}={v^{i}}_{|j}+v^{k}{\Gamma ^{i}}_{kj}

Derivative along curve

Since the covariant derivative $\nabla _{X}T$ of a tensor field $T$ at a point $p$ depends only on the value of the vector field $X$ at $p$, one can define the covariant derivative along a smooth curve $\gamma (t)$ in a manifold:

D_{t}T=\nabla _{{\dot {\gamma }}(t)}T.

Note that the tensor field $T$ only needs to be defined on the curve $\gamma (t)$ for this definition to make sense.

In particular, ${\dot {\gamma }}(t)$ is a vector field along the curve $\gamma$ itself. If $\nabla _{{\dot {\gamma }}(t)}{\dot {\gamma }}(t)$ vanishes then the curve is called a geodesic of the covariant derivative. If the covariant derivative is the Levi-Civita connection of a certain metric, then the geodesics for the connection are precisely those geodesics of the metric that are parametrised proportionally to arc length.
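In coordinates, the geodesic condition reads ${\ddot {x}}^{k}+{\Gamma ^{k}}_{ij}{\dot {x}}^{i}{\dot {x}}^{j}=0$. The sketch below evaluates the left-hand side numerically for curves in the Euclidean plane written in polar coordinates. The choice of coordinates, the two test curves and the function names are assumptions made for this illustration.

```python
import math

def line_in_polar(t):
    """The straight line (x, y) = (1, t), written in polar coordinates (r, theta)."""
    return [math.hypot(1.0, t), math.atan2(t, 1.0)]

def unit_circle_in_polar(t):
    """The unit circle traversed at unit speed: r = 1, theta = t."""
    return [1.0, t]

def geodesic_residual(curve, t, h=1e-4):
    """Components of x''^k + Gamma^k_{ij} x'^i x'^j in polar coordinates,
    with the derivatives of the curve estimated by central differences."""
    xm, x0, xp = curve(t - h), curve(t), curve(t + h)
    vel = [(p - m) / (2 * h) for p, m in zip(xp, xm)]
    acc = [(p - 2 * c + m) / (h * h) for p, c, m in zip(xp, x0, xm)]
    r = x0[0]
    # Gamma^r_{theta theta} = -r
    res_r = acc[0] - r * vel[1] ** 2
    # Gamma^theta_{r theta} = Gamma^theta_{theta r} = 1/r, hence the factor 2
    res_theta = acc[1] + (2.0 / r) * vel[0] * vel[1]
    return [res_r, res_theta]
```

For the straight line both residuals vanish up to finite-difference error, as expected for a geodesic of the flat metric. For the unit circle the radial residual is close to −1, which is the centripetal acceleration, so the circle is not a geodesic.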

The derivative along a curve is also used to define the parallel transport along the curve.

Sometimes the covariant derivative along a curve is called absolute or intrinsic derivative.

Relation to Lie derivative

A covariant derivative introduces an extra geometric structure on a manifold that allows vectors in neighboring tangent spaces to be compared. This extra structure is necessary because there is no canonical way to compare vectors from different vector spaces, as is necessary for this generalization of the directional derivative. There is however another generalization of directional derivatives which is canonical: the Lie derivative. The Lie derivative evaluates the change of one vector field along the flow of another vector field. Thus, one must know both vector fields in an open neighborhood. The covariant derivative on the other hand introduces its own change for vectors in a given direction, and it only depends on the vector direction at a single point, rather than a vector field in an open neighborhood of a point. In other words, the covariant derivative is linear (over C^∞(M)) in the direction argument, while the Lie derivative is linear in neither argument.

Note that the antisymmetrized covariant derivative $\nabla _{\mathbf {u} }\mathbf {v} -\nabla _{\mathbf {v} }\mathbf {u}$ and the Lie derivative $L_{\mathbf {u} }\mathbf {v}$ differ by the torsion of the connection, so that if a connection is torsion-free, then its antisymmetrization is the Lie derivative.
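The torsion-free case can be checked numerically. The sketch below assumes the Levi-Civita connection of the Euclidean plane in polar coordinates, which is torsion-free, and two arbitrary test fields; the function names and the fields are made up for this example. It compares $\nabla _{\mathbf {u} }\mathbf {v} -\nabla _{\mathbf {v} }\mathbf {u}$ with the Lie bracket $[\mathbf {u} ,\mathbf {v} ]=L_{\mathbf {u} }\mathbf {v}$.

```python
import math

def gamma_polar(x):
    """gamma[k][i][j] = Gamma^k_{ij} for polar coordinates x = (r, theta)."""
    r, _theta = x
    gam = [[[0.0, 0.0], [0.0, 0.0]] for _ in range(2)]
    gam[0][1][1] = -r        # Gamma^r_{theta theta}
    gam[1][0][1] = 1.0 / r   # Gamma^theta_{r theta}
    gam[1][1][0] = 1.0 / r   # Gamma^theta_{theta r}
    return gam

def partial(field, x, j, h=1e-6):
    """Central-difference estimate of the partial derivatives d field^k / d x^j."""
    xp, xm = list(x), list(x)
    xp[j] += h
    xm[j] -= h
    return [(p - m) / (2 * h) for p, m in zip(field(xp), field(xm))]

def nabla(u, v, x):
    """Components of nabla_u v = u^j (d_j v^k + v^i Gamma^k_{ij})."""
    gam, u0, v0 = gamma_polar(x), u(x), v(x)
    dv = [partial(v, x, j) for j in range(2)]  # dv[j][k] = d v^k / d x^j
    return [
        sum(
            u0[j] * (dv[j][k] + sum(v0[i] * gam[k][i][j] for i in range(2)))
            for j in range(2)
        )
        for k in range(2)
    ]

def lie_bracket(u, v, x):
    """Components of [u, v]^k = u^j d_j v^k - v^j d_j u^k."""
    u0, v0 = u(x), v(x)
    du = [partial(u, x, j) for j in range(2)]
    dv = [partial(v, x, j) for j in range(2)]
    return [
        sum(u0[j] * dv[j][k] - v0[j] * du[j][k] for j in range(2))
        for k in range(2)
    ]

def u_field(x):
    """Hypothetical test field with components (u^r, u^theta)."""
    r, th = x
    return [r * math.cos(th), 1.0]

def v_field(x):
    """Hypothetical test field with components (v^r, v^theta)."""
    r, th = x
    return [math.sin(th), r * r]
```

Because the symbols are symmetric in their lower indices, the connection terms cancel in the antisymmetrization, and `nabla(u_field, v_field, x)` minus `nabla(v_field, u_field, x)` agrees with `lie_bracket(u_field, v_field, x)` up to finite-difference error, although the bracket itself is non-zero.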

Notes

[1] Einstein, Albert (1922). "The General Theory of Relativity". The Meaning of Relativity.
[2] Ricci, G.; Levi-Civita, T. (1901). "Méthodes de calcul différentiel absolu et leurs applications". Mathematische Annalen. 54: 125–201. doi:10.1007/bf01454201.
[3] Riemann, G. F. B. (1866). "Über die Hypothesen, welche der Geometrie zu Grunde liegen". Gesammelte Mathematische Werke; reprint, ed. Weber, H. (1953), New York: Dover.
[4] Christoffel, E. B. (1869). "Über die Transformation der homogenen Differentialausdrücke zweiten Grades". Journal für die reine und angewandte Mathematik. 70: 46–70.
[5] cf. Cartan, É. (1923). "Sur les variétés à connexion affine et la théorie de la relativité généralisée". Annales, École Normale. 40: 325–412.
[6] Koszul, J. L. (1950). "Homologie et cohomologie des algèbres de Lie". Bulletin de la Société Mathématique. 78: 65–127.
[7] The covariant derivative is also denoted variously by $\partial _{\mathbf {v} }\mathbf {u}$, $D_{\mathbf {v} }\mathbf {u}$, or other notations.
[8] In many applications, it may be better not to think of t as corresponding to time, at least for applications in general relativity. It is simply regarded as an abstract parameter varying smoothly and monotonically along the path.

References

Kobayashi, Shoshichi; Nomizu, Katsumi (1996). Foundations of Differential Geometry, Vol. 1 (New ed.). Wiley Interscience. ISBN 0-471-15733-3.
I.Kh. Sabitov (2001) [1994], "Covariant differentiation", in Hazewinkel, Michiel, Encyclopedia of Mathematics, Springer Science+Business Media B.V. / Kluwer Academic Publishers, ISBN 978-1-55608-010-4
Sternberg, Shlomo (1964). Lectures on Differential Geometry. Prentice-Hall.
Spivak, Michael (1999). A Comprehensive Introduction to Differential Geometry (Volume Two). Publish or Perish, Inc.

