Introduction to the mathematics of general relativity
This article is on the minimal body of mathematics necessary to understand general relativity. For a more complete overview see Mathematics of general relativity.
An understanding of calculus and differential equations is necessary for the understanding of nonrelativistic physics. In order to understand special relativity one also needs an understanding of tensor calculus. To understand the general theory of relativity, one needs a basic introduction to the mathematics of curved spacetime that includes a treatment of curvilinear coordinates, nontensors, curved space, parallel displacement, Christoffel symbols, geodesics, covariant differentiation, the curvature tensor, Bianchi relations, and the Ricci tensor. This article follows the basic treatment in the lecture series on the topic, intended for advanced undergraduates, given by Paul Dirac at Florida State University.[Ref. 1]
For an introduction based on the specific physical example of particles orbiting a large mass in circular orbits, see Newtonian motivations for general relativity for a nonrelativistic treatment and Theoretical motivation for general relativity for a fully relativistic treatment.
Mathematics of special relativity
Vectors
Interval between two points
Spacetime physics requires four coordinates for the description of a point in spacetime:
where c is the speed of light and x, y, and z are spatial coordinates.
A point very close to our original point is
- .
The square of the distance, or interval, between the two points is
and is invariant under coordinate transformations. Here we are using the Minkowski metric.
Coordinate transformations
Transformation of dx
If one defines a new coordinate system such that
then
where repeated indices are summed according to the Einstein summation convention.
The comma in the subscript of the last term indicates differentiation.
Transformation of a scalar
A scalar quantity transforms as
- .
Contravariant vectors
Quantities that transform in the same way as under a change of coordinates,
- ,
form a contravariant vector. The squared length of the vector is the invariant quantity
- .
The term on the left is the notation for the inner product of A with itself.
Covariant vectors
The covariant vector is defined as
- .
It transforms as a scalar
- .
Inner product
The inner product of two vectors is written
- .
This quantity is also invariant under coordinate transformations.
Tensors
Definition
A rank 2 contravariant tensor can be constructed from the outer product of vectors as
- .
Contravariant tensor
The components of a rank 2 contravariant tensor transform in the same way as the quantities ,
- .
Coavariant and mixed tensors
Higher rank tensors are constructed similarly as are covarariant and mixed tensors. For a rank 2 covariant tensor, the transformation is
- .
Oblique axes
The interval and the metric tensor
An oblique coordinate system is one in which the axis are not necessarily orthogonal to each other. For oblique axes, the interval is
where the coefficients , called the metric tensor depend on the system of oblique axes.
Determinant of the metric tensor
The determinant of is denoted and is always negative for any real coordinate axes.
Inner product
The inner product of any two vectors
is invariant.
Relation between covariant and contravariant tensors
Covariant tensors are related to contravariant tensors by
and
where is the cofactor of the corresponding
and
- .
Nontensors
A nontensor is a tensor-like quantity that behaves like a tensor in the raising and lowering of indices,
and
- ,
but that does not transform like a tensor under a coordinate transformation.
Mathematics of general relativity
Curvilinear coordinates and curved spacetime
Curvilinear coordinates are coordinates in which the angles between axes can change from point to point. In other words, the metric tensor in curvilinear coordinates is no longer a constant, but depends on the spacetime location of the metric tensor. It is therefore a field quantity.
Like the surface of ball embedded in three-dimensional space, we can imagine four dimensional spacetime as embedded in a flat space of a higher dimension. The coordinates on the surface of the ball are curvilinear, while the coordinates in three dimensional space can be rectilinear. The coordinates of four dimensional curved spacetime are curvilinear, while the four space is embedded in a larger dimensional space of rectilinear coordinates.
Parallel displacement
The interval in a high dimensional space
Imagine our four dimensional curved spacetime is embeded in a larger N dimensional flat space. Any true physical vector lies entirely in the curved physical space. In other words, the vector is tangent to the curved physical spacetime. It has no component normal to the four dimensional curved spacetime.
In the N dimensional flat space with coordinates the interval between neighboring points is
where is the metric for the flat space. We do not assume the coordinates are orthogonal, only rectilinear.
The interval between two point in physical spacetime
To quote Dirac:
Physical spacetime forms a four dimensional "surface" in the flat N-dimensional space. Each point determines a definite point in the N-dimensional space. Each coordinate is a function of the four x's; say . There are N-4 such equations.
The relation between neighboring contravariant vectors: Christoffel symbols
The difference in y for two neighboring points in the surface differing by is
where
.
The interval between two neighboring points in physical spacetime becomes
where
- .
A contravariant vector at a point x in physical spacetime is related to the same contravariant vector at the same point y(x) in N-dimensional space by the relation
- .
The vector lies in the surface of physical spacetime.
Now shift the vector to the point y^n(x+dx) keeping it parallel to itself. In other words, we hold the comonents of the vector constant during the shift. The vector no longer lies in the surface because of curvature of the surface.
The shifted vector can be split into two parts, one tangent to the surface and one normal to surface, as
- .
The vector as a function of y tangent to the surface can be written in terms of the vector K in terms of x as
- .
The normal vector is normal to every vector in the surface including the unit vectors that define the comonents of . Therefore
- .
This allows us to write
or
where
is a nontensor called the Christoffel symbol of the first kind. It can be shown to be related to the metric tensor through the relation
- .
Since the Christoffel symbol can be written entirely in terms of the metric in physical spacetime, all reference to ther N-dimensional space has disappeared.
Christoffel symbol of the second kind
The Christoffel symbol of the second kind is defined as
- .
This operation is allowed for nontensors.
This allows us to write
and
- .
The minus sign in the second expression can be seen from the invariance of an inner product of two vectors
- .
The constancy of the length of the parallel displaced vector
From Dirac:
The constancy of the length of the vector follows from geometrical arguments. When we split up the vector into tangential and normal parts ... the normal part is infinitesimal and is orthogonal to the tangential part. It follows that, to the first order, the length of the whole vector equals that of its tangential part.
The covariant derivative
The partial derivative of a vector with respect to a spacetime coordinate is composed of two parts, the normal partial derivative minus the the change in the vector due to parallel transport
- .
Geodesics
Suppose we have a point that moves along a track in physical spacetime. Suppose the track is parameterized with the quantity . The a "velocity" vector that points in the direction of motion in spacetime is
- .
The variation of the velocity upon parallel displacement along the track is then
- .
If there are no "forces" acting on the point, then the velocity is unchanged along the track and we have
- ,
which is called the geodesic equation.
References
- [1] P. A. M. Dirac (1996). General Theory of Relativity. Princeton University Press. ISBN 0-691-01146-X.
- [2] Misner, Charles; Thorne, Kip S. & Wheeler, John Archibald (1973). Gravitation. San Francisco: W. H. Freeman. ISBN 0-7167-0344-0.
{{cite book}}
: CS1 maint: multiple names: authors list (link)
- [3] Landau, L. D. and Lifshitz, E. M. (1975). Classical Theory of Fields (Fourth Revised English Edition). Oxford: Pergamon. ISBN 0-08-018176-7.
{{cite book}}
: CS1 maint: multiple names: authors list (link)
- [4] R. P. Feynman, F. B. Moringo, and W. G. Wagner (1995). Feynman Lectures on Gravitation. Addison-Wesley. ISBN 0-201-62734-5.
{{cite book}}
: CS1 maint: multiple names: authors list (link)
- [5] Einstein, A. (1961). Relativity: The Special and General Theory. New York: Crown. ISBN 0-517-029618.