Wolfram Alpha:
Search by keyword:
Astronomy
Chemistry
Classical Mechanics
Classical Physics
Climate Change
Cosmology
Finance and Accounting
Game Theory
General Relativity
Group Theory
Lagrangian and Hamiltonian Mechanics
Macroeconomics
Mathematics
Microeconomics
Nuclear Physics
Particle Physics
Probability and Statistics
Programming and Computer Science
Quantum Computing
Quantum Field Theory
Quantum Mechanics
Semiconductor Reliability
Solid State Electronics
Special Relativity
Statistical Mechanics
String Theory
Superconductivity
Supersymmetry (SUSY) and Grand Unified Theory (GUT)
The Standard Model
Topology
Units, Constants and Useful Formulas
The Essential Mathematics of General Relativity

We will begin our discussion with a review of vectors
and dual vectors in flat spacetime and then transition
the discussion to curved spacetime.
Coordinate Transformations in Flat Spacetime

Vectors exist independently of coordinate systems.
For example, a velocity is a velocity regardless
of the coordinate system we choose. If we are
moving in a car at 50 MPH in a westerly direction
the speedometer reads 50 MPH and a compass will
read W without any notion of an coordinate system.
Therefore, it is a real physical quantity. Note:
A position vector is different. Its direction will
depend on the coordinate system because it is defined
with respect to one coordinate system. However, a
displacement vector (the difference between 2 points)
is coordinate independent. If a coordinate basis is
chosen, then the components of vector with respect to
that basis can be written. We can see this easily.
V = v_{1}e_{1} + v_{2}e_{2} + v_{3}e_{3}
Where,
v_{1}, v_{2}, v_{3} are the components (numbers).
e_{1}, e_{2}, e_{3} is the basis (i.e. i, j, k).
In Einstein notation this can be written as V = V^{μ}e_{μ}
It should be obvious that as the coordinate basis
changes, the components of the vector must change
accordingly to maintain the invariance. We will now
look at this in more detail.
Vectors

Vectors, V, live in vector space, V.
x^{2}
 e_{2}
 ^
 
 
 > e_{1}
 x^{1}
Consider a vector, V:
V = V^{μ}e_{μ} > V' = V^{μ'}e_{μ'}
_{ } = Λ^{μ'}_{μ}V^{μ}e_{μ'}
_{ } = (Λ^{μ'}_{μ}V^{μ})(Λ^{σ}_{μ'}e_{σ})
_{ } = (component)(basis)
Now, Λ^{μ'}_{μ}Λ^{σ}_{μ'} = δ^{σ}_{μ} so the implication is that Λ^{σ}_{μ'}
has to be the inverse of Λ^{μ'}_{μ}. So if the components
change in one way, the basis has to change in the
opposite way for ds to be invariant. Thus,
V^{μ} > V^{μ'} = Λ^{μ'}_{ν}dx^{ν}
and,
e_{μ} > e_{μ'} = Λ^{σ}_{μ'}e_{σ}
Mathematically, components are said to transform
contravariantly and bases are said to transform
covariantly.
Dual Vectors

Dual vectors, ω, live in dual vector space, V*.
Like vectors They are invariant and can be
expressed in terms of components and dual basis
vectors. They linearly 'eat' vectors to produce
scalars (more on this later).
x^{2}
 θ^{2}
 ^
 
 
 > θ^{1}
 x^{1}
We will define 'eating' in terms of basis vectors and
dual basis vectors as follows:
θ^{μ}e_{ν} = δ^{μ}_{ν}
^{ }
This is interpreted as θ^{μ} acting on e_{ν} to produce a
scalar.
The bases transform as:
θ^{μ}e_{ν} = δ^{μ}_{ν} > θ^{μ'}e_{ν'} = δ^{μ'}_{ν'}
Now e_{ν'} = Λ^{ν}_{ν'}e_{ν} and θ^{μ'} = Λ^{μ'}_{μ}θ^{μ}
Therefore,
δ^{μ'}_{ν'} = (Λ^{μ'}_{μ}θ^{μ})(Λ^{ν}_{ν'}e_{ν})
= Λ^{μ'}_{μ}Λ^{ν}_{ν'}θ^{μ}e_{ν}
So, in order for this to hold true:
Λ^{μ'}_{μ} ≡ Λ^{1}
and,
Λ^{ν}_{ν'} ≡ Λ
In spacetime the simplest example of a dual
vector is the gradient of a scalar function,
φ, which is given by:
ω = (∂φ/∂x^{μ})θ^{μ}
Proof:
We can rewrite ∂φ/∂x^{μ} using the Chain Rule as:
∂φ/∂x^{μ} > ∂φ/∂x^{μ'} = (∂x^{μ}/∂x^{μ'})(∂φ/∂x^{μ})
= λ^{μ}_{μ'}(∂φ/∂x^{μ})
λ^{μ}_{μ'} is the transformation law for dual vector
components so we can conclude that ∂φ/∂x^{μ} is a
dual vector.
The Dot Product

Consider a dual vector, ω, eating a vector, V.
ωV = ω_{μ}θ^{μ}v^{ν}e_{ν}
= ω_{μ}v^{ν}δ^{μ}_{ν}
= ω_{0}v^{0} + w_{1}v^{1} + w_{2}v^{2} + w_{3}v^{3}
This looks like the dot product. Therefore, the dot
product is 'secretely' the act of taking one of the
vectors and turning it into a dual vector by the
application of the metric tensor i.e.,
A.B = η_{μν}A^{μ}e_{μ}B^{ν}e_{ν}
= A_{ν}θ^{μ}B^{ν}e_{ν}
= A_{ν}B^{ν}δ^{μ}_{ν}
The Lorentz Transformation

It is conventional to write the inverse Lorentz
transformation as:
(Λ^{α}_{β})^{1} ≡ Λ_{α}^{β}
Consider a boost in the x direction. The partial
boost matrix is:
 
B =  γ βγ  = B^{μ}_{ν}
 βγ γ 
 
With inverse,
 
B^{1} =  γ βγ  = B_{μ}^{σ}
^{ }  βγ γ 
 

Digression:
Consider a boost in the x direction:
     
 γ βγ  ct  =  γt  βγx 
 βγ γ  x   βγt + γx 
     
And the inverse transformation:
     
 γ βγ  ct'  =  γt' + βγx' 
 βγ γ  x'   βγt' + γx' 
     
From these we get:
x' = γx  βγct
x = γx' + βγct'
t' = γct  γβx
t = γct' + βγx'

The components transform as:
v^{0'} = B^{μ}ν_{}v^{ν} = B^{0}_{0}v^{0} + B^{0}_{1}v^{1}
= γv^{0}  βγv^{1}
v^{1'} = B^{μ}ν_{}v^{ν} = B^{1}_{0}v^{0} + B^{1}_{1}v^{1}
= βγv^{0} + γv^{1}
The basis transforms as:
e_{0'} = B_{μ}^{σ}e_{σ} = B_{0}^{0}e_{0} + B_{0}^{1}e_{1}
= γe_{0} + βγe_{1}
e_{1} = B_{μ}^{σ}e_{σ} = B_{1}^{0}e_{0} + B_{1}^{1}e_{1}
= βγe_{0} + γe_{1}
The transformed vector is identical to the vector
before the transformation.
Proof:
v^{0'}e_{0}' + v^{1'}e_{1'} = (γv^{0}  βγv^{1})(γe_{0} + βγe_{1})
+ (βγv^{0} + γv^{1})(βγe_{0} + γe_{1})
= (γ^{2}v^{0}e_{0} + βγ^{2}v^{0}e_{1}  βγ^{2}v^{1}e_{0}  β^{2}γ^{2}v^{1}e_{1})
+ (β^{2}γ^{2}v^{0}e_{0}  βγ^{2}v^{0}e_{1} + βγ^{2}v^{1}e_{0} + γ^{2}v^{1}e_{1})
= (γ^{2}v^{0}e_{0}  β^{2}γ^{2}v^{1}e_{1}) + (β^{2}γ^{2}v^{0}e_{0} + γ^{2}v^{1}e_{1})
= γ^{2}(1  β^{2})v^{0}e_{0} + γ^{2}(1  β^{2})v^{1}e_{1}
= v^{0}e_{0} + v^{1}e_{1}
Now consider a rotation about the z axis. The
partial rotation matrix is:
 
R(90^{°}) =  0 1  = R^{μ}ν_{}
^{ }  1 0 
 
With inverse,
 
R(90^{°}) =  0 1  = R_{μ}^{σ}
^{ }  1 0 
 
The components transform as:
v^{1'} = R^{μ}ν_{}v^{ν} = R^{1}1_{}v^{1} + R^{1}_{2}v^{2}
= v^{2}
v^{2'} = R^{μ}ν_{}v^{ν} = R^{2}1_{}v^{1} + R^{2}_{2}v^{2}
= v^{1}
The basis transforms as:
e_{1'} = R_{μ}^{σ}e_{σ} = R_{1}^{1}e_{1} + R_{1}^{2}e_{2}
= e_{2}
e_{2'} = R_{μ}^{σ}e_{σ} = R_{2}^{1}e_{1} + R_{2}^{2}e_{2}
= e_{1}
e_{2} e_{1'} (= e_{2})
 
 
 > 
 
 e_{1} e_{2'} 
(= e_{1})
The transformed vector is identical to the vector
before the transformation.
Proof:
v^{1'}e_{1'} + v^{2'}e_{2'} = v^{2}e_{2} + (v^{1})(e_{1})
= v^{1}e_{1} + v^{2}e_{2}
Lorentz Transform as a Coordinate Transformation

The Lorentz transformation is just a special case
of a coordinate transformation. Again, consider a
boost in the x direction:
Λ^{μ'}_{μ}

v
     
 coshζ sinhζ 0 0  t  =  coshζt  sinhζx 
 sinhζ coshζ 0 0  x   sinhζt + coshζx 
 0 0 1 0  y   y 
 0 0 0 1  z   z 
     
 
 t' 
=  x' 
 y' 
 z' 
 
We can write this as a coordinate transformation
as a JACOBEAN MATRIX:
_{ }  
_{ }  ∂t'/∂t ∂t'/∂x ∂t'/∂y ∂t'/∂z 
Λ^{μ'}_{μ} =  ∂x'/∂t ∂x'/∂x ∂x'/∂y ∂x'/∂z 
_{ }  ∂y'/∂t ∂y'/∂x ∂y'/∂y ∂y'/∂z 
_{ }  ∂z'/∂t ∂z'/∂x ∂z'/∂y ∂z'/∂z 
_{ }  
_{ }  
_{ }  coshζ sinhζ 0 0 
_{ } =  sinhζ coshζ 0 0 
_{ }  0 0 1 0 
_{ }  0 0 0 1 
_{ }  
Alternatively, we can see this in another way:
t' = γct  γβx
x' = γx  βγct
y' = y
z' = z
Therefore,
∂t'/∂t = γc ∂t'/∂x = γβ ∂t'/∂y = ∂t'/∂z = 0
And,
∂x'/∂t = γβc ∂x'/∂x = γ
In general we can write:
Λ^{μ'}_{μ} = ∂x^{μ'}/∂x^{μ}
This produces the matrix:
 
_{ }  γ γβ 0 0 
Λ^{μ'}_{μ} =  γβ γ 0 0 
_{ }  0 0 1 0 
_{ }  0 0 0 1 
 
Now consider the inverse transform given by:
t = γct' + βγx'
x = γx' + βγct'
y = y'
z = z'
In this case we get:
∂t/∂t' = γc ∂t/∂x' = βγ ∂t/∂y' = ∂t/∂z' = 0
And,
∂x/∂t' = βcγ ∂x/∂x' = γ
In general we can write:
Λ_{μ'}^{μ} = ∂x^{μ}/∂x^{μ'}
This produces the inverse transformation matrix we
had before:
 
_{ }  γ γβ 0 0 
Λ_{μ'}^{μ} =  γβ γ 0 0 
_{ }  0 0 1 0 
_{ }  0 0 0 1 
 
Summary of Transformation Laws

Vector Components: V^{μ} > V^{μ'} = Λ^{μ'}_{μ}V^{μ}
Basis vectors: e_{μ} > e_{μ'} = Λ^{σ}_{μ'}e_{σ}
Dual Vector Components: ω_{μ} > ω_{μ'} = Λ^{μ}_{μ'}ω_{μ}
Dual Basis vectors: θ^{μ} > θ^{μ'} = Λ^{μ'}_{σ}θ^{σ}
Vectors: V > V' = Λ^{μ'}_{μ}V^{μ}Λ^{σ}_{μ'}e_{σ}
= V
Dual Vectors: ω > ω' = Λ^{μ}_{μ'}ω_{μ}Λ^{μ'}_{σ}θ^{σ}
= ω
Λ^{σ}_{μ'}Λ^{μ'}_{μ} = δ^{σ}_{μ}
The direction of the transformation is always from
the bottom index to the top index.
Coordinate Transformations in Curved Spacetime

Manifolds

A C^{P} manifold, M, is a collection of points in
spacetime with a maximal atlas.
We can break up M into PATCHES. All of M needs to be
covered. We have only shown a few patches to illustrate
the concept.
A COORDINATE CHART (aka chart) is a subset of M with a
1to1 map, φ:
φ: U_{α} > ℝ^{n} where ℝ^{n} is ndimensional Euclidean space.
(Note: R ≡ ℝ in diagrams).
Such that the image of φ is open. Open means the
interior of an n  1 dimensonal closed surface (i.e.
the edges of the image are not included). Note that
charts must have the same dimensionality as the
manifold.
The fact that manifolds look locally like ℝ^{n}, which
is manifested by the construction of coordinate charts,
introduces the possibility of analysis on manifolds,
including operations such as differentiation and
integration. In other words φ(p) converts a point,
p, in U on M to a point with coordinates x^{μ} in ℝ^{n}.
Thus,
φ(p) = (x^{1}(p), x^{2}(p) ... x^{n}(p))
or, in general:
φ^{μ}(p) = x^{μ}(p)
Where x^{μ}(p) are COORDINATE FUNCTIONS. They are scalar
functions that return the μth coordinate of P as a
real number. In a fixed coordinate system, each of
the 4 coordinates, x^{μ}, can be thought of as a function
on spacetime. Therefore,
x^{μ} = x^{μ}(p)
For example:
. . 
. u_{α} .  ℝ^{2}
. p. φ(p) >x^{μ} 
. .  

This takes a point, p, in U_{α} on a 2D manifold, M^{2} and
sends it to (x,y) ∈ ℝ^{2}.
An important thing to recognize is that if a manifold
can be covered by a single chart, we can have a global
coordinate system on M. If it cannot be covered by a
single chart then it is not possible to have a system
of global coordinates.
Atlases

An ATLAS on M is a collection of charts {(U_{α},φ)} such
that:
1. The union of all charts U_{α} is equal to M
2. The charts are sewn together with C^{P} TRANSITION
FUNCTIONS.
C^{P} Transition Functions

Suppose that 2 charts overlap.
Now,
P_{3} = φ_{3}(p)
= φ_{3} o φ_{41}(P_{4}): ℝ^{n} > ℝ^{n}
Now, the interesting thing is that P_{3} and P_{4} are both
in ℝ^{n}. Therefore, we are taking the point P_{4} in one
set of coordinates and expressing it as the point P_{3}
in another set of coordinates. In other words, this
is just a ccoordinate transformation.
P_{4} with coordinates (x_{4}, y_{4}, z_{4} ...)
> P_{3} with coordinates (x_{3}, y_{3}, z_{3} ...)
The transition function, φ_{43} is defined as:
φ_{43} = φ_{3} o φ_{4}^{1}
Where φ_{3} and φ_{4}^{1} are coordinate and inverse coordinate
functions respectively.
Now that we are dealing with functions from ℝ^{n} to ℝ^{n}
we can talk about using calculus, in particular we
can consider the derivatives of φ_{αβ}. If the pth
derivative of all φ_{αβ} exists and is continuous then
φ_{αβ} is C^{p}. If P = ∞ (i.e. C^{∞}) then we call φ_{αβ}
smooth.
Maximal Atlas

It is possible that a manifold may be covered by
different choices of atlases. In this case each
atlas is an equally good representation of the
manifold so they belong to the maximal atlas.
Therefore, a C^{p} ndimensional manifold is a set,
M, with a maximal atlas.
In summary, we can construct an atlas of charts
and use the maps φ_{α}: u_{α} > ℝ^{n} to coordinatize M
on each chart. We can then sew them together
with transition functions which are equivalent
to coordinate transformations.
Manifolds in SR and GR

With curved manifolds it is important to remove
the idea that a vector stretches from one point to
another, but instead is just an object associated
with a single point (the tangent space  see next)
that facilitates the use of multivariable calculus.
Manifolds can then be equipped with a metric at each
point that allows the existence of the inner product.
Consider the infinitesimally small displacement
vector, ds:
ds^{2} = g_{αβ}dx^{α}dx^{β}
= g_{00}dx^{0}dx^{0} + g_{01}dx^{0}dx^{1} + g_{10}dx^{1}dx^{0} + g_{11}dx^{1}dx^{1}
If ds^{2} is always positive the manifold is RIEMANNIAN
If ds^{2} can take any value it is PSEUDORIEMANNIAN.
If ds^{2} takes the form:
ds^{2} = dt^{2} + dx^{2} + dy^{2} + dz^{2}
it is LORENTZIAN.
Because ds^{2} can have different signs in GR, the
manifolds are pseudoRiemannian.
Armed with this knowledge we can now proceed with
a discussion of tangent and cotangent spaces on
manifolds.
Tangent Space

Consider an ndimensional mamifold, M.
Consider a SCALAR field, ψ(p), that lives in the manifold.
γ(λ) maps λ_{p} in ℝ^{1} to the 1 dimensional curve, γ(λ_{p}), that
passes through a particular point, p, on the manifold. So
as λ_{p} changes, P will move along the curve γ. f is a scalar
function that maps any point, P, on the manifold to α_{p} in
ℝ^{1}. Therefore, f takes any point on the manifold and
assigns to it a real number (i.e. a scalar).
Note, λ and α are not charts since they do not have the
same dimensions as the manifold.
We want to find how the arbitrary function, f, changes
as we change λ_{p}. Since f, gives us α_{p} this is equivalent
to seeing how α_{p} changes with λ_{p}. Thus,
df/dλ = d(f o γ)/dλ
This is called the DIRECTIONAL DERIVATIVE OF f ALONG THE
CURVE γ
Note: F o G ≡ F(G(x)) so the order of execution is from
right to left.
This is equivalent to the tangent at P along the curve
γ. If we repeat this for other curves through P we wlll
construct set of directional derivatives that forms a
vector space of tangents at P called the TANGENT SPACE, T_{p},
at p. This tangent space is an example of a vector spaces.
The set of all the tangent spaces of a manifold is called
the TANGENT BUNDLE.
Now that we have a tangent space we can now make use of
charts to setup a basis and, therefore, coordinate
representations of vectors in that tangent space.
λ_{p} > P > x^{μ} > P > α_{p}
f o γ = f o φ^{1} o φ o γ: λ_{p} > α_{p}
df/dλ = d[(f o φ^{1}) o (φ o γ)]/dλ
Now φ o γ: ℝ^{1} > ℝ^{n} so we can call it (φ o γ)^{μ}
Using the chain rule:
df/dλ = [∂(f o φ^{1})/∂(φ o γ)^{μ}][d(φ o γ)^{μ})/dλ]
Since (φ o γ)^{μ}: ℝ > ℝ^{n} we can call it x^{μ}(λ)
Since (f o φ^{1}): ℝ^{n} > ℝ we can call it f(x^{μ})
Therefore;
df/dλ = (∂f/∂x^{μ})(dx^{μ}/dλ)
Or,
d/dλ = (∂/∂x^{μ})(dx^{μ}/dλ)
Where,
(dx^{μ}/dλ) are the components of the TV.
and,
(∂/∂x^{μ}) = ∂_{μ} are the bases of the tangent space ≡ e_{μ}
d/dλ is a tangent vector in tangent space. Therefore,
we should be able to generalize this for any vector, V,
in that space. Therefore, we can write:
V = V^{μ}∂_{μ}
The basis vectors in the primed frame are given by
the chain rule. Therefore,
∂_{μ} > ∂_{μ'} = (∂x^{μ}/∂x^{μ'})∂_{μ}
Since V = V^{μ}∂_{μ} must be invariant under a transformation
this implies:
V^{μ}∂_{μ} = V^{μ'}∂_{μ'}
= V^{μ'}(∂x^{μ}/∂x^{μ'})∂_{μ}
Summarizing:
Directional Derivative:
d/dλ = (∂/∂x^{μ})(dx^{μ}/dλ)
Transformation law for basis vectors:
∂_{μ} > ∂_{μ'} = (∂x^{μ}/∂x^{μ'})∂_{μ}
Transformation law for vector components:
V^{μ} > V^{μ'} = (∂x^{μ'}/∂x^{μ})V^{μ}
So, as in the case of flat spacetime, the basis
transforms in the opposite direction to the components
to preserve the invariance of the vector.
Basis vectors are usually not written explicitly with
the understanding that a change of coordinates induces
a change of basis.

A slightly different view:
Consider a curve on a manifold. We can define a
displacement (separation) vector Δp along the curve.
Δp has to short so that we can assume that the surface
local to P is flat and we can use basic calculus to
figure out the tangent vector, A, to the curve at p.
The tangent vector to the curve, A, is given by:
A = dp/dε ≡ lim [P(ε + Δε)  P(ε)]/Δε
^{Δε>0}
Any point on the curve can be written in terms of
coordinates, x^{α}.
Coordinate basis: e_{α} ≡ ∂p/∂x^{α}
Components of vector: A^{α} = dp/dε ∂x^{α}/∂ε
We can write A = dp/dε as:
A = (dx^{α}/dε)(∂p/∂x^{α}) .. 1.
≡ A^{α}e_{α}
Therefore, we can identify the components of the
vector as:
A^{α} = dx^{α}/dε
Directional Derivative

Consider a scalar field, φ(p), that lives in the
manifold at position, p, on the curve. The
derivative of φ along A, ∂_{A}φ, is given by:
∂_{A}φ(P(ε)) = A^{α}∂[φ(P(ε))]/∂x^{α}
Or,
∂_{A} = A^{α}∂/∂x^{α}
= (dx^{α}/dε)(∂/∂x^{α}) ... 2.
= (d/dε) along the curve.
If we compare 1. with 2. we see that there is a
direct 1to1 correspondence between the tangent
vector and the directional derivative. Thus,
A ≡ ∂_{A} = ∂p/∂ε = ∂/∂ε
Cotangent (Dual) Space

In addition to tangent space it is also possible to
construct COTANGENT SPACES, T_{p}* at each point on the
manifold. Just as vectors are defined as elements of
the tangent space, dual vectors are defined as elements
of the cotangent space. The set of all cotangent
spaces over a manifold is called the COTANGENT BUNDLE.
A dual space, V*, is a vector space of linear maps
from a vector space, V, to its field of scalars. The
maps are also referred to as linear functionals,
linear forms, oneforms, covectors or dual vectors.
In layman's terms a linear functionals at P 'eat' the
tangent vector at P producing a scalar. Therefore,
ω: V > ℝ
Where ω ∈ V* (aka T_{p}*) and V ∈ V (aka T_{p})
Note that dim V = dim V*.
In tangent space we used charts to set up coordinates
that enabled us to define the basis for the tangent
vector along λ. With cotangent space we use the
gradients of the scalar coordinate functions, φ^{μ}(p)
(≡ x^{μ}(p)), denoted by (df), to determine the basis.
Note that dx^{μ} is not the same as dx^{μ} which is an
infinitesimal displacement vector. Therefore, we
can write:
(df): T_{p}V > ℝ
Where (df) = (∂f/∂x^{μ})
Dual spaces have to satisfy the operation of addition
and scalar multiplication.
(df)(V + V') = (df)(V) + (df)(V')
(df)(αV) = α(df)(V)
We now let (df) 'act' on the tangent vector d/dλ.
Note 'act' in this case does NOT mean differentiate.
Therefore,
(∂f/∂x^{μ})(d/dλ)
Now by the definition of linearity:
(df)V = V o f V ∈ V, f ∈ V*
Therefore, we get:
(df)(d/dλ) = (d/dλ) o f
= df/dλ
We now write the tangent vector in terms of its
components and bases.
df(dx^{μ}/dλ)(∂_{μ}) = df/dλ
We are free to write this as:
(dx^{μ}/dλ)(df)(∂_{μ}) = df/dλ
Now the RHS can be rewritten using the chain rule.
(dx^{μ}/dλ)(df)(∂_{μ}) = (dx^{μ}/dλ)(∂f/∂x^{μ})
By comparison:
(df)(∂_{μ}) ≡ ∂f/∂x^{μ}
But f = x^{ν}(P), the coordinate functions, so:
(dx^{ν})(∂_{μ}) = ∂x^{ν}/∂x^{μ}
= δ^{μ}_{ν}
In conclusion,
θ^{μ} = (dx^{μ})
Now, we had said previously that dual vectors
'eat' vectors to produce scalars, i.e.
θ^{μ}e_{ν} = δ^{μ}_{ν}
Or,
dx^{μ}∂_{ν} = δ^{μ}_{ν}
Therefore, for invariance,
dx^{μ}∂_{ν} = dx^{μ'}∂_{ν'}
So,
dx^{μ'} = (∂x^{μ'}/∂x^{μ})dx^{μ}
If this is the transformation law for dual basis
vectors then the transformation law for dual
vector components has to be:
ω_{μ} > ω_{μ'} = (∂x^{μ}/∂x^{μ'})ω_{μ}
Comparison with Quantum Mechanics

Dual spaces play an important role in Quantum Mechanics.
QM uses complex functions to represent vectors and dual
vectors. Since vectors live in complex Hilbert space,
the dual vectors have to live in a conjugate space to
ensure that the resulting wavefunctions are scalars.
The state α> does not represent a wave function until
it is projected onto a basis in position or momentum
space. For example,
ψ(p) = <pα> or ψ(x) = <xα>
ψ is a scalar and <pα> just specifies the 'components'
of α> in that basis.
We can also project <α onto a basis:
ψ*(p) = <αp>* or ψ*(x) = <αx>
= <pα>* and ψ*(x) = <xα>*
Likewise, the overlap of 2 states <φψ> is a scalar
since probabilities have to be postive numbers.
Dual Space Vector Space
 
Dual vector Vector
Oneform Tangent vector
Covariant vector Contravariant vector
Row vector Column vector
Complex conjugate function Complex function (QM)
Summary of Transformation Laws

SR: V^{μ} > V^{μ'} = Λ^{μ'}_{μ}V^{μ} and e_{μ} > e_{μ'} = Λ^{μ}_{μ'}e_{μ}
ω_{μ} > ω_{μ'} = Λ^{μ}_{μ'}ω_{μ} and θ^{μ} > θ^{μ'} = Λ^{μ'}_{μ}θ^{μ}
Where Λ^{μ'}_{μ} and Λ^{μ}_{μ'} allows global transformations
of (t,x,y,z).
GR: V^{μ} > V^{μ'} = (∂x^{μ'}/∂x^{μ})V^{μ} and ∂_{μ} > ∂_{μ'} = (∂x^{μ}/∂x^{μ'})∂_{μ}
ω_{μ} > ω_{μ'} = (∂x^{μ}/∂x^{μ'})ω_{μ} and dx^{μ} > dx^{μ'} = (∂x^{μ'}/∂x^{μ})dx^{μ}
Where ∂x^{μ'}/∂x^{μ} and ∂x^{μ}/∂x^{μ'} allows local or
coordinate dependent transformations of
(t,x,y,z).
Vectors: V > V' = Λ^{μ'}_{μ}V^{μ}Λ^{σ}_{μ'}e_{σ}
= V
Dual Vectors: ω > ω' = Λ^{μ}_{μ'}ω_{μ}Λ^{μ'}_{σ}θ^{σ}
= ω
Λ^{σ}_{μ'}Λ^{μ'}_{μ} = δ^{σ}_{μ}
Pullbacks and Pushforwards

Consider 2 manifolds M and N.
The map:
φ: M > N
φ is called a DIFFEOMORPHISM if it is an invertible
function that maps one differentiable manifold to
another such that both the function and its inverse
are smooth.
The PULLBACK of the function, f, from N to M is:
f o φ (≡ f(φ))
Which is written as:
φ_{*}f
While we can pull functions back there is no
construction that allows them to be pushed forward.
However, a tangent vector is defined as A^{μ}∂_{μ}.
This allows us to define the PUSHFORWARD of a
tangent vector, V, from M to N, as:
φ o f (≡ φ(f))
Which is written as:
φ^{*}V
The action of the pushed forward vector on f is
defined to be:
(φ^{*}V)f = V(φ_{*}f)
What this means is that pushing V forward to N
and acting on f is equivalent to pulling f back
to M and acting on it with V.
To see how this works in practice, apply the
pushed forward vector V^{α}∂_{α} to an arbitrary
function, f. Therefore,
(φ^{*}V)^{α}∂_{α}f = V^{μ}∂_{μ}(φ_{*}f)
= V^{μ}∂_{μ}(f o φ)
= V^{μ}(∂φ/∂x^{μ})(∂f/∂y^{α})
= V^{μ}(∂y^{α}/∂x^{μ})(∂f/∂y^{α})
= V^{μ}(∂y^{α}/∂x^{μ})∂_{α}f
We can write this as:
(φ^{*}V)^{α}∂_{α} = (φ^{*})^{α}_{μ}V^{μ}∂_{α}
Where (φ^{*})^{α}_{μ} = ∂y^{α}/∂x^{μ} is a matrix that represents
the vector transformation law under a change of
coordinates. When M and N are the same manifold
the constructions are identical and ∂y^{α}/∂x^{μ} = 1.
Example:
Consider the map:
 
φ(x,y) =  xy  or (xy,x + y)
 x + y 
 
And an arbitrary curve, γ(t), on the manifold,
M, defined as:
γ(t) = (1 + 3t,2  2t)
Here we have replaced the parameter, λ, with the
parameter, t.
If we plot γ(t) for different values of t we get
a straight line with slope = 2/3. Taking the
differential of γ(t) w.r.t. t gives the vector
(3,2) which we can define at the point (1,2),
i.e.
The image curve of γ(t), β(t), is:
β(t) = φ o γ = ((1 + 3t)(2  2t),(1 + 3t)
+ (2  2t))
= ((2 + 4t  6t^{2}),(3 + t))
β(0) = (2,3)
dβ(t)/dt = ((4  12t),1)
The tangent vector to the image curve at (2,3)
when t = 0 is (4,1). Again, we define dβ(t)/dt
at (2,3). i.e.
Let us now compute this in a different way. We
let:
x = a(t), y = b(t)
where a(t) and b(t) are coordinate functions.
Therefore,
 
β(t) = φ o γ =  a(t)b(t) 
 a(t) + b(t) 
 
and,
 
dβ(t)/dt =  a(t)db(t)/dt + b(t)da(t)/dt 
 da(t)/dt + db(t)/dt 
 
   
=  a(t) b(t)  db(t)/dt 
 1 1  da(t)/dt 
   
The first matrix is just the Jacobean:
 
J =  ∂f_{x}/dx ∂f_{x}/dy 
 ∂f_{y}/dx ∂f_{y}/dy 
 
 
=  x y 
 1 1 
 
The vertical sum of each column gives the
coordinates of the pushed forward vector.
x: a(t) + 1 = 1 + 1 = 2
y: b(t) + 1 = 2 + 1 = 3
To get the vector at (2,3) we apply the Jacobean
to the vector at (1,2):
     
 1 2  2  =  4 
 1 1  3   1 
     
This can also be found as:
(3∂_{x}  2∂_{y})(xy) = 3y  2x = 6  2 = 4
(3∂_{x}  2∂_{y})(x + y) = 3  2 = 1
Vectors can be pushed forward from M to N but
there is no construction that allows them to
be pulled back. On the other hand, cotangent
vectors, ω, behave in just the opposite manner.
They can be pulled back from N to M but there is
no construction to allow them to push forward
from N to M.
The action of the pulled back dual vector on V
is defined as:
(φ_{*}ω)V = ω(φ^{*}V)
What this means is that pulling ω back to M and
acting on V to produce a scalar is equivalent to
pushing forward V to N and acting on it with ω.
We can repeat the above process and write:
(φ_{*}ω)_{μ} = (φ_{*})^{α}_{μ}ω_{α}
Where (φ_{*})^{α}_{μ} = ∂y^{α}/∂x^{μ} is the same matrix as before
except that we now contract on a different index.
We can extend the idea of pulling back dual vectors
to tensors with an arbitrary number of lower indices.
One common occurrence of a map between two manifolds
is when M is actually a smooth embedded submanifold
of N. Under these circumstances there is a map from
M to N which just takes an element of M to the same
element of N. Therefore, every embedded smooth
submanifold inherits a metric from being embedded in
a Riemannian manifold. For example, in String Theory
there are 2 manifolds that are involved in string
propagation.
 The spacetime in which the string propagates.
 The worldsheet of the string itself.
The 2 dimensional world sheet is considered to be
embedded in the spacetime manifold such that points
(σ^{1},σ^{1}) on the worldsheet correspond to points in
the manifold x^{μ}(σ^{1},σ^{1}). The worldheet then inherits
its metric from the metric on the ambient spacetime
manifold.
Orthonormal and Coordinate Bases

Up until now we have defined the bases of tangent
and cotangent space as:
e_{μ} = ∂_{μ} and θ^{ν} = dx^{ν}
Where we set up the basis vectors to point along
the coordinate axes.
In 2D this corresponds to the following diagram.
We can define the commutator [∂_{μ}e_{μ},∂_{ν}e_{ν}] = 0
There is nothing to stop us, however, from setting
up any bases we like. We will choose these basis
vectors to be orthonormal (orthogonal and of unit
length) and denote them as e_{a} and e_{b}.
In 2D this corresponds to the following diagram.
The left is a flat space and the commutator equals
0. However, when we move to a curved surface the
commutator no longer equals 0 but represents a
difference vector (A).
In general we can write:
[e_{σ},e_{ρ}] = c_{σρ}^{λ}e_{λ}
Where c_{σρ}^{λ} is the commutation coefficient.
There is a theorem that says a basis is a coordinate
basis if and only if c_{σρ}^{λ} = 0.
Calculations are easier in the coordinate basis.
On the other hand, the orthonormal basis is the
most familiar to people and is better for physical
interpretation. For example, Quantum Mechanics,
SR and classical physics generally are developed
in terms of orthonormal bases. From the above
diagram it should be clear that we cannot just
impose an orthonormal basis on curved spacetime.
What we need, therefore, is a mechanism that
enables us to move from one coordinate system to
the other. The approach in GR that replaces the
coordinate basis with an orthonormal basis is
referred to as the TETRAD FORMALISM. This was
proposed by by Albert Einstein in 1928.
Tetrads

In 4 dimensions, the set of vector fields in an
orthonomal basis is called a TETRAD (aka VIERBEIN
or FRAME FIELD).
We can express the coordinate basis in terms of
the orthonormal (tetrad) basis as:
Vectors: e_{μ} = e^{a}_{μ}e_{a} or e_{a} = e^{μ}_{a}e_{μ}
Oneforms: θ^{μ} = e^{μ}_{a}θ^{a} or θ^{a} = e^{a}_{μ}θ^{μ}
Components of a vector written in a coordinate
basis can be written in an orthonormal basis as:
V^{a} = e^{a}_{μ}V^{μ}
alternatively,
V^{μ} = e^{μ}_{a}V^{a}
The components of e^{a}_{μ} etc. form n x n invertible
matrices where the Latin indeces represent the
orthonormal basis and the Greek indeces represent
the coordinate basis. If the distinction between
a vector and its components is dropped, e^{a}_{μ} etc.
are regarded as the vierbeins. Naturally, they
are functions of position on the manifold. They
have the following properties:
e^{μ}_{a} = (e^{a}_{μ})^{1}
e^{μ}_{a}e^{a}_{ν} = δ^{μ}_{ν} e^{a}_{μ}e^{μ}_{b} = δ^{a}_{b}
g(e_{a},e_{b}) = e_{a}.e_{b} = η_{ab}
g(e_{μ},e_{ν}) = e_{μ}.e_{ν} = g_{μν}
Now,
e_{a}.e_{b} = e^{μ}_{a}e^{ν}_{b}e_{μ}.e_{ν}
= e^{μ}_{a}e^{ν}_{b}g_{μν}
Therefore,
η_{ab} = e^{μ}_{a}e^{ν}_{b}g_{μν}
or equivalently,
g_{μν} = e^{a}_{μ}e^{b}_{ν}η_{ab}
This last result expresses the fact that e^{a}_{μ} and
e^{b}_{ν} are very much like √g_{μν}.
Tetrads can be looked at as linear maps from
tangent space to Minkowski space that preserves
the inner product. The fact that we can write
the Minkowski metric in terms of g_{μν} gives us the
freedom to perform LOCAL LORENTZ TRANSFORMATIONS
at every point in space. Therefore,
Λ_{a'}^{a}Λ_{b}^{b}η_{ab} = η_{a'b'}
e_{a} > e_{a'} = Λ_{a'}^{a}e_{a}
θ^{a} > θ^{a'} = Λ^{a'}_{a}θ^{a}
Where (Λ^{1})^{ν'}_{μ} = Λ_{ν'}^{μ}
We also retain the freedom to make general
coordinates transformations at the same time.
Therefore, we can define a mixed tensor
transformation law as follows:
T^{a'μ'}_{b'ν'} = Λ^{a'}_{a}(∂x^{μ'}/∂x^{μ})Λ_{b'}^{b}(∂x^{ν}/∂x^{ν'})T^{aμ}_{bν}
Spin Connection

The introduction of tetrads requires changing
the covariant derivative. ∇_{μ}e^{a}_{ν} must transform
like a (1,1) tensor and a 4vector. In order
to account for this we need to modify the idea
of a connection.
Now ∇_{μ}η_{ab} ≡ ∂_{μ}η_{ab} = 0 so we can impose the same
condition on the tetrad. Therefore,
∇_{μ}e^{a}_{ν} = 0
∇_{μ}e^{a}_{ν} = ∂_{μ}e^{a}_{ν}  Γ^{ρ}_{μν}e^{a}_{ρ} + ω^{a}_{bμ}e^{b}_{ν} = 0
Multiply through by e^{ν}_{b} to get:
ω^{a}_{bμ} = e^{ν}_{b}Γ^{ρ}_{μν}e^{a}_{ρ}  e^{ν}_{b}∂_{μ}e^{a}_{ν}
ω^{a}_{bμ}is called the SPIN CONNECTION.
We can define a curvature associated with this
connection:
R_{μνab} = ∂_{μ}ω_{νab}  ∂_{ν}ω_{μab} + ω^{e}_{μa}ω_{νeb}  ω^{e}_{νa}ω_{μeb}
Note that:
e^{a}_{ρ}e^{b}_{α}R_{μνab} = R_{μνρα}
Which is the Riemann tensor.
Why Tetrads?

Tetrads provide the ability to describe spinor
fields on spacetime and take their covariant
derivatives. They are the most natural way to
represent a relativistic quantum field theory
in curved space. It is also gauge field theory
for gravity with the tetrad playing the role of
a gauge field but not exactly like the vector
potential field does in YangMills theory.
Covariant Derivative

We have established the transformation laws for
vectors and dual vectors as well as scalars and
partial derivatives. In summary,
Scalars: φ > φ
Gradients of scalars: ∂φ/∂x^{μ} > ∂φ/∂x^{μ'}
= (∂φ/∂x^{μ})(∂x^{μ}/∂x^{μ'})
Partial derivatives: ∂_{μ} > ∂_{μ'} = (∂/∂x^{μ'})∂_{μ}
Vectors: V^{μ} > V^{μ'} = (∂x^{μ'}/∂x^{μ})V^{μ}
Dual vectors: V_{μ} > V_{μ'} = (∂x^{μ}/∂x^{μ'})V_{μ}
Tensors: T_{μν} > T_{μ'ν'} = (∂x^{μ}/∂x^{μ'})(∂x^{ν}/∂x^{ν'})T_{μν}
THe question is how do the derivatives of vectors
and tensors transform?. Consider, a vector:
∂_{μ}V^{ν} = lim [V^{ν}(x^{μ} + ε^{μ})  V^{ν}(x^{μ})]/ε^{μ}
^{ε>0}
The problem is that V^{ν}(x^{μ} + ε^{μ}) and V^{ν}(x^{μ}) live in
different tangent spaces so the coordinate system
is not constant. This problem manifests itself
as follows:
∂_{μ}V^{ν} > ∂_{μ'}V^{ν'} = (∂x^{μ}/∂x^{μ'})∂_{μ}(∂x^{ν'}/∂x^{ν})V^{ν}
= (∂x^{μ}/∂x^{μ'})(∂x^{ν'}/∂x^{ν})∂_{μ}V^{ν}
+ (∂x^{μ}/∂x^{μ'})V^{ν}∂_{μ}(∂x^{ν'}/∂x^{ν})
The first term is a tensor but the second is not.
Therefore, since ∂_{μ'}V^{ν'} does not ransform like a
tensor it is not a tensor. We can write the above
equation as follows:
∇_{μ}V^{ν} = ∂_{μ}V^{ν} + Γ^{ν}_{μλ}V^{λ}
Where ∇_{μ} is the COVARIANT DERIVATIVE and Γ^{ν}_{μλ}
are the CHRISTOFFEL SYMBOLS.
The Christoffel symbols solves the problem by
taking V^{ν}(x^{μ} + ε^{μ}) and PARALLEL TRANSPORTING it
back to x^{μ} before performing the subtraction.
Therefore,
∇_{μ}V^{ν} = lim [V_{//}^{ν}(x^{μ} + ε^{μ})  V^{ν}(x^{μ})]/ε^{μ}
^{ε>0}
Note. ∇_{μ} of a scalar is the same as ∂_{μ}. Thus,
∇_{μ}φ = ∂_{μ}φ
How do the symbols transform?
∇_{μ}V^{ν} > ∇_{μ'}V^{ν'} = (∂x^{μ}/∂x^{μ'})(∂x^{ν'}/∂x^{ν})∇_{μ}V^{ν}
Γ^{ν'}_{μ'λ'} = (∂x^{μ}/∂x^{μ'})(∂x^{ν'}/∂x^{ν})(∂x^{λ}/∂x^{λ'})Γ^{ν}_{μλ}
+ (∂x^{μ}/∂x^{μ'})(∂x^{λ}/∂x^{λ'})(∂^{2}x^{ν'}/∂x^{μ}x^{λ})
The first term is a tensor but the second term
is not. It is this second term that counters
the second term in the covariant derivative to
produce a result that is a tensor.
Computing the Christoffel Symbols

Before we do this we need to consider 2 things 
torsion and metric compatibility.
Torsion Tensor

The torsion tensor is defined as:
T_{μν}^{λ} = Γ^{λ}_{μν}  Γ^{λ}_{νμ}
Torsion is a difficult concept to explain. Loosely
speaking it characterizes how tangent spaces twist
about a curve when they are parallel transported
whereas curvature describes how the tangent spaces
roll along the curve.
Suffice to say that in GR we assume the torsion to
be 0 meaning that the Christoffel symbols are
symmetric under the interchange of their lower
indeces. Likewise the metric is also symmetric
under the interchange its indeces.
Metric Compatibility

A connection is metric compatible if the covariant
derivative of the metric with respect to that
connection is everywhere zero, i.e.,
∇_{ρ}g^{μν} = 0
Now we're ready to compute the Christoffel symbols.
Lets expand out the equation of metric compatibility
for three different permutations of the indices:
1. ∇_{ρ}g_{μν} = ∂_{ρ}g_{μν}  Γ^{λ}_{ρμ}g_{λν}  Γ^{λ}_{ρν}g_{μλ} = 0
2. ∇_{μ}g_{νρ} = ∂_{μ}g_{νρ}  Γ^{λ}_{μν}g_{λρ}  Γ^{λ}_{μρ}g_{νλ} = 0
^

≡ Γ^{λ}_{ρμ}g_{λν}
3. ∇_{ν}g_{ρμ} = ∂_{ν}g_{ρμ}  Γ^{λ}_{νρ}g_{λμ}  Γ^{λ}_{νμ}g_{ρλ} = 0
^

≡ Γ^{λ}_{ρν}g_{μλ}
1.  2.  3. gives:
∂_{ρ}g_{μν}  ∂_{μ}g_{νρ}  ∂_{ν}g_{ρμ} + 2Γ^{λ}_{μν}g_{λρ}
Γ^{λ}_{μν} = (1/2)(∂_{ρ}g_{μν} + ∂_{μ}g_{νρ} + ∂_{ν}g_{ρμ})/g_{λρ}
Noting that 1/g_{λρ} = g^{σρ} (i.e. g_{λρ}g^{σρ} = δ^{σ}_{λ})
Γ^{σ}_{μν} = (1/2)g^{σρ}(∂_{μ}g_{νρ} + ∂_{ν}g_{ρμ}  ∂_{ρ}g_{μν})
Note that Γ^{σ}_{μν} > 0 as we get closer and closer to
the local coordinate frame where the derivative of
the metric = 0 and the space can be considered to
be flat (tangent space). Γ^{σ}_{μν} = 0 does not by
itself imply that the overall space is flat. This
determination is made by examining the 2 second
derivative of the metric as discussed above.
Covariant Directional Derivative

Recall, for a scalar the directional derivative is:
d/dλ = (dx^{μ}/dλ)∂_{μ}
For a vector/tensor this becomes we can define the
DIRECTIONAL COVARIANT DERIVATIVE as:
D/dλ = (dx^{μ}/dλ)∇_{μ}
Therefore, for a vector, V^{ν},
DV^{ν}/dλ = (dx^{μ}/dλ)∇_{μ}V^{ν}
= lim [V_{//}^{ν}(x^{μ}(λ + δλ))  V^{ν}(x^{μ}(λ))]/Δx^{μ}
^{Δxμ>0}
∇_{μ}V^{ν} picks a coordinate, μ, and shifts V^{ν} along x^{μ}.
DV^{ν}/dλ shifts V^{ν} along the curve x^{μ}(λ).
Parallel Transport

From the above we have the CDD:
DV^{ν}/dλ = (dx^{μ}/dλ)∇_{μ}V^{ν}
To transport V^{ν}(x^{μ} + ε^{μ}) along x^{μ} to P requires
keeping the vector as PARALLEL TO ITSELF AS POSSIBLE
as it is transported. This condition implies that:
DV^{ν}/dλ = (dx^{μ}/dλ)∇_{μ}V^{ν} = 0
= (dx^{μ}/dλ)∂_{μ}V^{ν} + Γ^{ν}_{μρ}(dx^{μ}/dλ)V^{ρ} = 0
In other words, the components of the tangent vector
dx^{μ}/dλ don't change as it is transported.
Now, (dx^{μ}/dλ)∂_{μ} is just the ordinary DD so we can
write:
dV^{ν}/dλ + Γ^{ν}_{μρ}V^{ρ} = 0
This is a differential equation that can be solved
for any point on x^{μ}(λ).
Geodesics

Geodesics are paths taken by objects under the
influence of gravity. They are curves x^{μ}(λ) that
parallel transport their tangent vectors. In
other words, if x^{μ}(λ) is a geodesic then dx^{μ}/∂λ
(the components of the tangent vector) should be
constant along the path (i.e. should be parallel
to each other.
The 4velocity, u^{μ}, along the curve is given by the
directional derivative:
u^{μ} = dx^{μ}/dλ
The covariant directional derivative of u^{μ} along
the curve is given by:
Du^{μ}/dλ = D(dx^{μ}/dλ)/dλ
The fact that the components of the tangent vector
should be constant implies D(dx^{μ}/dλ)/dλ = 0.
Therefore,
D(dx^{μ}/dλ)/dλ ≡ (dx^{ν}/dλ)∇_{ν}(dx^{μ}/dλ)
= (dx^{ν}/dλ)[∂_{ν}(dx^{μ}/dλ) + Γ^{μ}_{να}(dx^{α}/dλ)]
= (dx^{ν}/dλ)∂_{ν}(dx^{μ}/dλ) + Γ^{μ}_{να}(dx^{ν}/dλ)(dx^{α}/dλ)
= (dx^{ν}/dλ)(∂(dx^{μ}/dλ)/∂x^{ν})
+ Γ^{μ}_{να}(dx^{ν}/dλ)(dx^{α}/dλ)
= d^{2}x^{μ}/dλ^{2} + Γ^{μ}_{να}(dx^{ν}/dλ)(dx^{α}/dλ)
= 0
Therefore,
d^{2}x^{μ}/dλ^{2} = Γ^{μ}_{να}(dx^{ν}/dλ)(dx^{α}/dλ)
If we now substitute the proper time, τ, for
λ to create a world line we get:
d^{2}x^{μ}/dτ^{2} = Γ^{μ}_{να}(dx^{ν}/dτ)(dx^{α}/dτ)
Flatness and Curvature

Consider the 2 sphere which represents the surface
of a sphere (not the interior).
The 2 sphere is not flat and has the metric:
 
g_{μν} =  R^{2} 0 ^{ } 
_{ }  0 R^{2}sin^{2}θ 
 
Where R is the radius.
Which can be written as a line element as:
ds^{2} = R^{2}dθ^{2} + R^{2}sinθ^{2}dφ^{2}
This is derived using spherical coordinates as
follows:
x = Rsinθcosφ
y = Rsinθsinφ
z = Rcosθ
ds^{2} = dx^{2} + dy^{2} + dz^{2}
Which leads to:
ds^{2} = dR^{2} + R^{2}dθ^{2} + R^{2}sin^{2}θdφ^{2}
Consider a point, P, at the North pole where θ = 0.
If we do this we get a horrible metric.
 
g_{μν} =  R^{2} 0 
_{ }  0^{ } 0 
 
Which is degenerate. We need to find a 'good' set
of coordinates that gets rid of this degeneracy.
Such coordinates are referred to as RIEMANN NORMAL
COORDINATES.
The basic idea behind Riemann normal coordinates is
to use the geodesics through a given point to define
the coordinates at a nearby point that lies on the
same geodesic.
We choose a point, P, to be at x^{μ}(0) in the tangent
space. The components of the tangent vector at P
are given by:
V^{μ} = dx^{μ}/dλ_{λ= 0}
We now push V^{μ} along the geodesic in n tiny increments
to the nearby point, Q. We can write this as:
λ_{Q} = lim (1 + λ/n)^{n}
^{n>∞}
~ 1
So we have increased λ by 1 unit of distance. The
change from x^{μ}(0) to x^{μ}(1) gives a map from the set
of tangent vectors at P to points in the manifold,
and locally this map is onetoone. In this way we
can use V^{μ} to define a local coordinate system on the
manifold near P. Now consider the geodesic equation:
d^{2}x^{μ}/dλ^{2} = Γ^{μ}_{να}(dx^{ν}/dλ)(dx^{α}/dλ)
If the space at P is flat Γ^{μ}_{να} = 0 and d^{2}x^{μ}/dλ^{2} = 0
Geodesics that satisfy this differential equation
have the form:
x^{μ}(λ) = λV^{μ}
Therefore, any geodesic that passes through P allows
us to define a local coordinate system on the manifold.
Now lim (1 + λ/n)^{n} is also the definition of the
^{n>∞}
exponential function so the map from vectors at P
to points in the manifold is called the EXPONENTIAL
MAP.
exp: V in T_{p} > M
In other words,
exp_{P}(V^{μ}) = x^{μ}(1)
A 'better' set of coordinates can be found as
follows. Consider a point very close to the
North pole where sinθ ~ θ and the z coordinate
can be regarded as constant. By doing this we
are effectively emulating the exponential map
discussed above. Therefore, we can approximate x
and y as:
x = Rθcosφ
y = Rθsinφ
So that:
x^{2} + y^{2} = R^{2}θ^{2}(sin^{2}φ + cos^{2}φ)
Therefore,
θ = √(x^{2} + y^{2})/R
and,
dθ = (1/2)(x^{2} + y^{2})^{1/2}[2xdx + 2ydy]/R
= (xdx + ydy)/R√(x^{2} + y^{2})
y/x = tanφ
Therefore,
φ = arctan(y/x)
d(arctan(y/x))= ydx/(x^{2} + y^{2})
and
d(arctan(y/x)) = xdy/(x^{2} + y^{2})
Therefore,
dφ = (xdy  ydx)/(x^{2} + y^{2})
The line element for the 2 Sphere is:
ds^{2} = R^{2} + R^{2}dθ^{2} + R^{2}sin^{2}θdφ^{2}
For constant R we get:
ds^{2} = R^{2}dθ^{2} + R^{2}sin^{2}θdφ^{2}
Taking this term by term and substituting gives:
R^{2}dθ^{2} = R^{2}(xdx + ydy)^{2}/R^{2}(x^{2} + y^{2})
= (x^{2}dx^{2} + y^{2}dy^{2} + 2xydydx)/(x^{2} + y^{2})
and,
R^{2}sin^{2}θdφ^{2} = R^{2}sin^{2}(√(x^{2} + y^{2})/R)(x^{2}dy^{2} + y^{2}dx^{2}
 2xdydx)/(x^{2} + y^{2})^{2}
Therefore, R^{2}dθ^{2} + R^{2}sin^{2}θdφ^{2} equals:
[x^{2}/(x^{2} + y^{2}) + R^{2}sin^{2}(√(x^{2} + y^{2})/R)(y^{2}/(x^{2} + y^{2})^{2}]dx^{2}
+
[y^{2}/(x^{2} + y^{2}) + R^{2}sin^{2}(√(x^{2} + y^{2})/R)(x^{2}/(x^{2} + y^{2})^{2}]dy^{2}

2[xy/(x^{2} + y^{2})  R^{2}sin^{2}(√(x^{2} + y^{2})/R)(xy/(x^{2} + y^{2})^{2}]dydx
Now sin^{2}a = a^{2}  a^{4}/3 + .... (Taylor series)
Therefore,
sin^{2}(√(x^{2} + y^{2})/R) = (√(x^{2} + y^{2}))^{2}/R^{2}  (√(x^{2} + y^{2}))^{4}/3R^{4} ...
For the dx^{2} term we get:
[y^{2}/(x^{2} + y^{2}) + R^{2}{(√(x^{2} + y^{2}))^{2}/R^{2}
 (√(x^{2} + y^{2}))^{4}/3R^{4})}(y^{2}/(x^{2} + y^{2})^{2}]dx^{2}
= (1  y^{2}/3R^{2})dx^{2}
Similarly, for the dy^{2} we get:
(1  x^{2}/3R^{2})dy^{2}
and for the dxdy term we get:
(2xy/3R^{2})dxdy
Therefore, the line elements becomes:
ds^{2} = (1  y^{2}/3R^{2})dx^{2} + (1  x^{2}/3R^{2})dy^{2} + (2xy/3R^{2})dxdy
and the metric looks like:
 
g_{μν} =  1  y^{2}/3R^{2} 2xy/3R^{2} 
_{ }  2xy/3R^{2} 1  x^{2}/3R^{2} 
 
At P, x = y = 0 the metric becomes:
 
g_{μν} =  1 0 
_{ }  0 1 
 
and the first derivative vanishes.
What this tells us is that just looking at the metric
and its first derivative to see it vanishes, doesn't
tell us if the overall space we are working in is flat.
It is possible to be working in a local coordinate
system without knowing it ahead of time, and that the
space is in fact curved. However, in general the second
derivative of the metric will not vanish if the space is
curved even if you happen to be in a local coordinate
system. For example, ∂^{2}(1  x^{2}/3R^{2})/∂x^{2} = 2/R^{3} ≠ 0 for
any x. If the second derivative does vanish then it
means the the space is truly flat. This is the reason
why the second derivative of the metric plays a key
role in the structure of the Riemann curvature tensor
that we will discuss next.
Riemann normal coordinates, with the associated basis
vectors constitute a local Lorentz frame. They are
the best approximation to flat space that is available.
The fact that the metric at P looks like that of flat
space to first order support the idea that small enough
regions of spacetime look like Minkowski space. This
implies that locally all physics looks the same, which
is the famous equivalence principle.
The Riemann Curvature Tensor

Parallel transport can detect curvature. Previously
we have talked about curvature how this might be
measured using the second derivative of the metric.
We now look at this in more detail. Consider the
following schematic representation of the parallel
transport of a tangent vector, V_{A}^{λ}, around a square
in flat space.
V_{A}^{λ}, V_{A'}^{λ} Δx^{ν}
A <
^{ }  ^
^{ }  
Δx^{μ}   Δx^{μ}
^{ }  
^{ } v 
> B V_{B}^{λ}, V_{B'}^{λ}
Δx^{ν}
Let Δx^{μ}, Δx^{ν}, Δx^{μ} and Δx^{ν} act as infinitessimal
operators that do the shifting. Moving counter
clockwise we get:
(Δx^{ν})(Δx^{μ})(Δx^{ν})(Δx^{μ})V_{A}^{λ} = V_{A'}^{λ}
Therefore,
V_{A'}^{λ}  V_{A}^{λ} = δV^{λ}
Alternatively, we can split this into 2 steps
Counterclockwise from A to B: (Δx^{ν})(Δx^{μ})V_{A}^{λ} = V_{B}^{λ}
Clockwise from A to B: (Δx^{μ})(Δx^{ν})V_{A}^{λ} = V_{B'}^{λ}
Therefore,
V_{B'}^{λ}  V_{B}^{λ} = [(Δx^{μ}),(Δx^{ν})]V^{λ} = δV^{λ}
We can do the same thing with the 2Sphere which is
a curved surface.
So again the idea is to transport the tangent vector
in suvh a way that it remains as parallel as possible
from point to point (the vector has to lie in the
tangent plane and cannot protrude above it.). If
this operation is perform on a manifold that is not
flat, the original vector and the transported vector
will not coincide if brought back to the same point
along a closed trajectory.
Now, we have previously seen that the covariant
derivative parallel transports a vector along a
curve. Therefore, we can replace the Δs with ∇s.
[∇_{μ},∇_{ν}]V^{λ} = δV^{λ}
If we expand this out we get:
[∇_{μ},∇_{ν}]V^{λ} = (∂_{μ}Γ^{λ}_{νρ}  ∂_{ν}Γ^{λ}_{μρ} + Γ^{λ}_{μα}Γ^{α}_{νρ}  Γ^{λ}_{να}Γ^{α}_{μρ})V^{ρ}
= R^{λ}_{ρμν}V^{ρ}
R^{λ}_{ρμν} is the RIEMANN CURVATURE TENSOR. It is comprised
of Christoffel symbols and their partial derivatives.
In turn, the Christoffel symbols are comprised of the
metric tensor and its derivatives. Therefore, the
Riemann tensor consists of 2nd derivativeS of metric
which, from the previous discussion, is a measure of
the curvature. The Riemann tensor can be shown to
transform in the following way proving indeed that it
is a tensor.
R^{λ}_{ρμν} > R^{λ'}_{ρ'μ'ν'} = (∂x^{λ'}/∂x^{λ})(∂x^{ρ}/∂x^{ρ'})(∂x^{μ}/∂x^{μ'})(∂x^{ν}/∂x^{ν'})R^{λ}_{ρμν}
R^{λ}_{ρμν}
ρ corresponds to the transported vector
μ and μ correspond to the paths taken.
λ corresponds to the components of the difference vector.
If one is interested only in certain aspects of curvature,
it is possible to form other tensors from the Riemann
tensor. The simplest of those is the Ricci tensor,
which is formed by contracting the upper index with
the lower middle index. This the trace of the Riemann
tensor.
Symmetries

The Riemann tensor has the following symmetries:
R_{αρμν} = g_{αλ}R^{λ}_{ρμν}
R_{αρμν} = R_{αρνμ}
R_{αρμν} = R_{ραμν}
R_{αρμν} = R_{μνρα}
R_{αρμν} + R_{ανρμ} + R_{αμνρ} = 0
The original Riemann tensor has 256 components but
after these symmetries have been applied, the number
of independent components reduces to 20..
Geodesic Deviation Equation

We start by looking at the geodesic deviation under
Newtonian gravity. Consider the following diagram.
At the P the geodesics are parallel and the separation
vector, ξ is, orthogonal to both geodesics.
Therefore,
dξ/dt = 0 and ξ.t = 0
The acceleration can be written as:
d^{2}ξ/dt^{2} = 𝕋(_,ξ)
𝕋(_,ξ) is referred to as the TIDAL TENSOR.
Alternatively, we can write this in component (index)
form as:
d^{2}ξ^{j}/dt^{2} = T_{jk}ξ^{k}
The equation of motions of particles moving along
A and B are given by:
(d^{2}x^{j}/dt^{2})_{A} = (∂φ/∂x^{j})_{A} ... 1.
and,
(d^{2}x^{j}/dt^{2})_{B} = (∂φ/∂x^{j})_{B}
Where φ is the gravitational potential. Note that
φ = mgh ∴ F = ma = φ/d so the RHS is equivalent to
a force as one would expect.
We can expand (φ)_{A} in terms of ξ using a Taylor
series as follows:
(φ)_{A} = (φ)_{B} + ξ(∂φ/∂x^{j})_{B}
If we substitute this into 1 we get:.
(d^{2}x^{j}/dt^{2})_{A} = (∂φ/∂x^{j} + ξ∂^{2}φ/∂x^{j}∂x^{j})_{B}
Now, d^{2}ξ/dt^{2} = (d^{2}x^{j}/dt^{2})_{A}  (d^{2}x^{j}/dt^{2})_{B} so
Therefore,
d^{2}ξ/dt^{2} = (∂φ/∂x^{j} + ξ∂^{2}φ/∂x^{j}∂x^{j})_{B} + (∂φ/∂x^{j})_{B}
= ξ∂^{2}φ/∂x^{j}∂x^{j}
Restoring the indeces leads to:
d^{2}ξ^{j}/dt^{2} = (∂^{2}φ/∂x^{j}∂x^{k})ξ^{k}
This is the geodesic deviation in 3D space. Lets
now see how this looks in General Relativity. We
replace d with D which is the covariant directional
derivative of ξ along τ to get:
D^{2}ξ/dτ^{2} = R(_,u,ξ,u)
Where u is the 4velocity = dx/dτ and the geodesic
is treated as a world line parametized by τ instead
of λ.
Note that R(_,ξ,u,u) doesn't work because of the
asymmetry in these indeces.
In component form this is:
D^{2}ξ^{α}/dτ^{2} = R^{α}_{βγδ}(dx^{β}/dτ)ξ^{γ}(dx^{δ}/dτ)
The Riemann tensor produces the rate of change of
separation between neighbouring geodesics over time.
This is a vector since it has both magnitude and
direction. It is the link between curvature and
a force (≡ accelerration) that causes the geodesics
to converge. These are the tidal forces of GRAVITY.