Documentos de Académico
Documentos de Profesional
Documentos de Cultura
OF
T08G8TO
.
:i-"-;, .ii:
MATHEMATICS OF RELATIVITY
LECTURE NOTES
BY
G.*Y?RAINICH
in
EDWARDS BROTHERS,
Lithoprinters and Publishers
INC.
COHTBHT8
Introduction
Page
.
Chapter I. OLD PHYSICS
1. Motion of a Particle.
The Inverse Square Law
2. Two Pictures of Matter
3. Vectors, Tensors, Operations
4. Maxwell's Equations
5. The Stress-Energy Tensor
6. General Equations of Motion.
The Complete Tensor
10
12
12
14
16
20
22
24
.
Chapter III. SPECIAL RELATIVITY
13. Equations of Motion
,
14. Lorentz Transformations
15. Addition of Velocities
16 . Light Corpuscles , or Photons
17. Electricity and Magnetism in Special Relativity
26
26
1
1
3:
5
7
9
28
29
31
33
35
35
37
39
41
44
46
49
57
57
58
60
61
50
53
54
55
62
63
65
66
INTRODOCTIOS
Since we are going to deal with applied
Mathematics, or Mathematics applied to Physics
we have to state In the beginning the general
point of view we take on that subject. A mathematical theory consists of statements or propositions, some of which are written as formulas.
Some of these propositions are proved,
that
means deduced from others, and some are not;
the latter are called definitions and axioms.
Furthermore most mathematical theories, in particular those in which we are interested, deal
with quantities, so that the propositions take
the form of relations between quantities.
Physical experiments also deal with quantities which are measured in definite prescribed
ways, and then empirical relations are established between these measured quantities.
In an application of Mathematics to Physics a correspondence is established between some
mathematical quantities and some physical quantities in such a way that the same relationship
exists (as a result of the mathematical theory)
between mathematical quantities as the experimentally established relation between the corresponding physical quantities. This view is
not new, it was emphatically formulated by H.
Hertz in the introduction to his Mechanics, and
then emphasized again by A. S. Eddington in application to Relativity. The process of establishing the correspondence between the physical
and the mathematical quantities we shall, following Eddington, call identification. An identification is successful, if the condition mentioned above is fulfilled, viz., if the relations deduced for the mathematical quantities
are experimentally proved to exist between the
Physical quantities with which they have been
identified. From this point of view we do not
speak of true or false theories, still less of
absolute truth, etc.; truth for us is nothing
but a successful identification, and it is necessary to say expressly that there may exist at
the same time two successful identifications,
two theories, each of which may be applied within experimental errors to the known experimental
results; and that there may be times when no
such theory has been found; and also that an
identification which is successful at one time
may cease to be so later, when the experimental
precision will be increased.
Very often it happens that the quantities
of a theory are compared not with quantities which
are direct results of experiment but with quantities of another, less comprehensive, theory
Chapter I.
OLD PHYSICS
The purpose of this chapter is to reformulate some of the fundamental equations of mechanics and electrodynamics, and to write them in
a new form which constitutes an appropriate basThe conis for the discussion that follows.
tents of the chapter is classical, the modifications which are characteristic of the Relativity theory have not been introduced, but, as
was mentioned the form is decidedly new.
1.
Motion of a Particle.
of Mechanics of
1.1
8
X, m-d*y/dt = Y,
= Z.
in
dmu/dt = X,
dmv/dt = Y,
dmw/dt = Z,
u = dx/dt,
v = dy/dt,
w = dz/dt
where
1.2
The quantities
mv,
mw
1.12
dm/dt = 0.
In the equations 1.1, x,y,z are usually unknown functions of the time and X,Y,Z are given
functions of the coordinates. The situation is
then this:
first the field has to be described
by giving the forces X,Y,Z, and then the motion
in the given field is determined by solving
equations 1.1 (with some additional initial conditions) .
We shall first discuss fields of a certain
of
simple type. One of the simplest fields
force is the so-called inverse square field.
The field has a center, which is a singularity
of the field; in it the field is not determined; in every other point of the field the force
is directed toward the center (or away from it)
and the magnitude of the force Is Inversely
proportional to the distance from the center.
As the most common realization of such a field
of force we may consider the gravitational field
of a mass particle or of a sphere. If cartesian coordinates with the origin at the center
are introduced the force components are
1.3
X = cx/r 3 ,
Y = cy/r 3 ,
Z = cz/r",
1.4
1.51
1.52
bZ/bx - bX/bz = 0,
bX/oy - bY/bX = 0.
The fact that the functions X,Y,Z given by
the formulas 1.5 satisfy these equations may be
proved by direct substitution; to facilitate
r-br/bx = x,
rbr/by = y,
rbr/bz
= z.
3cx s /r*.
1.53
X = b<p/bx,
Y = bq>/dy,
Z = b<p/bz
1.54
5 <p/dx
+ b
9/by
b^/bz*
= 0.
btp/bx = <p''br/bx, etc.
1.56
The last equation is known as the equation
of Laplace.
In the particular case where X,Y,Z are given by the formulas 1.3 a function <p of which X,
Y,Z are partial derivatives is (as it is easy
to verify)
q>
and
1
x a = <p"-(dr/bx)*
1.57
etc.
taking
= -c/r.
We may say now that the field of force given by l.b satisfies the differential equations
1.51, 1.52 or 1.5&, 1.54 which express the same
It is easy to show that these equations
thing.
are satisfied not only by the field produced by
one particle at the origin, but also by that due
to any number of particles:
first we notice
that if a particle is not at the origin this
results only in additive constants in the coordinates, and so does not affect partial derivatives which appear in the equations 1.51-1.52
which therefore remain true in this case;
secondly, these equations are linear and homogeneous, as a consequence of which the sum of two
solutions of these equations necessarily is a
solution, or if two fields satisfy these equations their sum also satisfies them; if then, as
is generally assumed, the field produced by several particles is the sum of the fields due to
the individual particles, such a field also satisfies equations 1.51, 1.52.
Conversely, it can be proved that any field
satisfying the differential equations 1.51-1.52
may be produced by a - finite or infinite - set
of particles each of which acts according to the
inverse square law. We shall not prove here
this fact (the proof is given in Potential Theequations
ory), but we shall show that these
furnish us back the inverse square law, if we
add the condition that the field must be symmetric with respect to one point.
or
1, etc.;
\5
by
bz
2.
=-
=
which substituted in 1.56 to>
1.6
with
gives 1.?. We see thus that
gether
the general equations 1.51-1.52 give us all the
general information we need about the fields of
force in question; we shall call this system
the system of equations of a Newtonian field or
simply the Newtonian system, although Newton
never considered, the differential equations
that make it up.
We may comment briefly on the mathematical
character of the magnitudes and equations we
whence
'
<p
have been dealing with in this section. At every point X,Y,Z may be considered as the components of a vector, the force vector; we have
thus a vector at every point of space, and this
constitutes what is called a vector field. The
function <p is an example of a scalar field.
These two fields are in the particular relation
that the first is derived from the second by
differentiation. The vector field satisfies
equation 1.51, the left hand side of which is
called divergence of the vector field. To find
a divergence of a vector field we. take the sum
of the derivatives of the components of the vector with respect to the corresponding coordinates, or we differentiate each component with
respect to the corresponding coordinate and add
up the results. The formula becomes more expressive if we number the coordinates by writing
1.71
x = x^,
y = xa ,
z =
xa
1.78
X = Xx ,
Y = X2 ,
Z = X3
then
be
1.8
1=1
2.
Our fundamental equations 1.1 connect matter as represented by the left hand sides with
forces as represented by the right hand sides.
There seems to be a fundamental difference in
the mathematical aspects of matter and force.
The quantities characterizing matter, the momentum components, for example, are functions
of one variable t, and are subjected to ordinary differential equations, whereas quantities
characterizing force, X,Y,Z are functions of
three variables x,y,z and, as a consequence, are
subjected to partial differential equations;
they are field quantities, whereas the matter
components are not; another way of saying this
is to say that force seems to be distributed
continuously through space but matter seems to
be connected with discrete points. This distinction is, however, not as essential as it
looks; it is merely the result of the point of
view we take. Vt'e could very well consider matter to be distributed continuously
through
space; each of the two theories, the discrete
theory, according to which matter consists of
as of time; by
u(x,y,z,t),
v(x,y,z,t),
w(x,y,z,t)
we understand the velocity components of a particle which at the time t occupies the position
x,y,z. The fundamental quantity in this theory
corresponding to mass of the discrete theory is
density. A particle does not possess any finite
mass, a mass corresponds only to a finite volume (at a given time). To a point (at a given
time) we assign a density which may be explained as the limit of the mass of a sphere with
the center at the given point divided by the
volume of that sphere as the radius of the sphere
tends toward zero. A better way of putting it
is to say that we consider a point function
p(x,y,z,t) called the density and that the mass
of matter occupying a given volume at a given
time is the integral
Now set dx = x, -
Xj.;
dx x /dt
Uj.
dp/dt
= op/ox- dx/dt +
J"p(x,y,z,t)-dxdydz
extended over the given volume. This integral
the
will, in general, be a function of time;
mass in a given volume changes with time because
new matter may be coming in and old matter going
out, and they do not exactly balance each other.
But if we consider a certain volume at a given
time, and then consider at other moments the
volume which is occupied by the same matter,
then the mass of matter in that new volume must
be the same. That means that if we consider x,
y,z, as functions of t, namely, as the coordinates of the same particle of matter at different times, and if we consider the region of integration as a variable volume but one that is
occupied by the same particles of matter at all
times, then the integral must be independent of
time, or
by
the
A rigorous proof would be based on expressing the integral for the moment t 1 which may be
written as '^(x* ,y',z f jt^dx'dy'dz
using as
variables of integration the coordinates of the
The
corresponding particles at the moment t.
formulas of transformation would be
1
2.3
= f(x,y,z,t'),
x'
z'
= g(x,y,r,t'),
= h(x,y,z,t)
.1
-jT/J
p(x,y,z,t)dxdydz =
0.
/P(f,g,h,t')-J.dxdydz
as
2.2
.3
where J is the Jacobian of the functions
and the integration is over the volume occupied
at the moment t. Setting the derivative of
this integral with respect to t equal to zero,
and then making t = t and noticing that bf/bt
for t = t is the velocity component u, etc.,
we would find the same equation 2.2.
This equation is called the "continuity
equation of matter" or the "equation of conservation of matter." The corresponding equation
In the discrete theory is the equation (1.12)
which is not usually included among
dm/dt =
The
the fundamental equations of mechanics.
a
in
written
be
very
continuity equation may
simple form if we use the index notations for
the coordinates introduced before (1.71), introduce analogous notations for the velocity components, viz.,
1
dxdydz
dV/dt
*
ddx/dtdydz
+ dx-ddy/dt-dz +
dx-dyddz/dt.
2.4
u =
u-
v = ut ,
w =
u,
2.5
t = x*
to write
u4
2.41
1 = u4
equa-
= 0.
1,2,8
2.7
Noticing the analogy of this equation with equation 1.8 we are tempted to say that the continuity equation expresses the fact that the
"divergence" of the "vector" of components pu^
is zero.
This involves, of course, a generalization of the conceptions divergence and vector,
because the summation here goes from one to 4
instead of to 3 as in the above formula.
This
generalization will be of extreme importance in
what follows. In the meantime we may notice
that the divergence of the vector pu^ plays the
same role in the continuous theory as the time
derivative of the number m played in the discrete theory.
We now continue the translation of the
equations of the discrete theory into the continuous language. The equations of motion express the fact that the time derivatives of the
momentum components are equal to the force components; limiting our consideration to the left
hand sides of the equations we have therefore
to consider the time derivatives of the momentum components; in the first place the time derivative of mu; without repeating the reasoning
which led us to the continuity equation of matter, and noticing that the only change consists
in replacing of m by mu we find that the time
rate of change of the first component of the momentum vector will be here
2.61
2.62
2.63
iJ
I OMJ./C-X!
2.65
J.
= 1,2,3,4
3.
We shall later treat the fundamental concepts of vector and tensor analysis in a systematic way. At present we shall show how the
language of this theory which for ordinary
space has been partly introduced in section 1
can be applied to the case of four independent
variables and extended so as to furnish a simple way of describing the relations introduced
in the preceding section.
A quantity like p which depends on the independent variables x,y,z,t, we shall call a
scalar field. The four quantities u if u a , U 3 ,
u4 , we shall consider as the components of a
if
vector (or of a vector field; the latter,
we want to emphasize the dependence on the independent variables). The sixteen quantities
furnish an example of a tensor (or tensor
pujiij
field) . A convenient way to arrange the components of a tensor is in a square array; for
instance,
3.1
pu a u t
PU 3 U 8
pu4 u 2
pUjU 3
pu a u,
PU 3 U 3
pu 4 u s
pu4 u4
3.2
OU a /bx
"bu s /bx
OU4/&X
"bu 3 /by
ou a /bz
bu 3 /dz
bu a /bt
ou,/bt
bu4 /oy
3.4
1
1
1>pu a/bx a
=0
and
3.6
u^
determinants
and
a ia
a 33
and It Is seen that the elements of this determinant are the components of the tensor of rank
two which arises from the tensor
and by by
a^j
first multiplying them and then contracting with
respect to the two inside indices.
We could also speak of the contracted square
of a tensor, meaning by this the contracted
product of a tensor with itself.
0,
4.11
of
Maxwell's Equations.
4.2
4.21
bM/bz
oN/bx
5L/by
dN/oy = -6X/bt
bL/bz
-bY/bt
bM/bx = -bZ/bt,
=
Several remarks must be made here concerning these equations, which will be referred to
as Maxwell's equations.
In the first place, these equations cannot
be proved; they have to be regarded as the fundamental equations of a mathematical theory, whose
Justification lies in the fact that its quantities have been successfully identified with measured quantities of Physics, in the sense that
for physical quantities the same relationships
have been established experimentally as those
deduced for the corresponding theoretical quantities from the fundamental equations. In the
second place, the equations as they appear above
present a simplified and Idealized fora of the
fundamental equations, namely the fundamental
equations ft>r the case of free space, i.e., complete absence of matter.
4.3
N - Mz
y
Lz ~ N x
Mx - L y
*x + *
Xt =
Yt =
- Z
=
t
+ Z =
z
Yx
=
=
+ Nt =
= 0.
+
N,
4.31
Ny - M s
Mz - Ly
Xx +
Yy
- It =
eu
- ** " 6V
- Zt = ew
+ Z = e
f
Zy
- Y
4.4
We come thus across a new scalar quantity - electric density. However, in most cases this density is proportional to mass density p we have
considered before, the factor of proportionality
being capable of only two numerical values - one
negative for negative electricity, and the other
positive for positive electricity.
Even these equations are not sufficient for
the description of electromagnetic phenomena;
they correspond to a certain idealization in
which the dielectric constant and magnetic permeability are neglected, but we shall not go
beyond this idealization.
In the above equations we have four independent variables x,y,z,t, as in the discussion
X = F 4i ,
Y = F 4t ,
Z = F 4 ,,
L = F ts ,
M = F 31 ,
4.5
^F 14 /6x4 =
OF 18 /OX 8 + OF 13 /OX 3
oF 8 3/0*3 + "oF ai /Ox x + oF 14 A>x4 =
oF 31 /OXj. + "oFst/OX. + OF34/OX* =
"oF^/oXi + oF 4t /Ox t + oF 43 /ox\
4.6
Lt^
0;
in fact, the elements in the main diagonal correspond to equal indices; if we set J = i the
above formula becomes FJJ_ +
0,
as asserted. We try now to Idenwhence FJH =
tify the components of this tensor with our
electromagnetic force components in tha following way:
=
Xy + Nt
L x + My + N a = 0,
Yx
or
= 0.
We can
letter by Hamilton as early as 1845.
overcome the difficulty if we allow ourselves
to use Imaginary quantities side by side with
real quantities - this ought to cause no worry
provided we know the formal rules of operations,
since our new notations are of an entirely formal nature anyway; we set now Instead of .5
4.7
xx ,
It
x,,
1Y = F 4t ,
L = F.,,
M = F, x ,
IX = F 41
4.72
iZ = F 4 ,,
N = Fj.,
JJ34
4.61
ox,
0*41
ox 3
4.9
114
* i
ox a
or
5.
As mentioned before we consider the components Fj, as the components of a tensor. We may
say that we have sixteen of them; the six which
appear in the relations 4.78, six more which result from them by interchanging the indices and
whose values differ from those given in 4.72 only in sign, and four more with equal indices.
According to the formula (4.4) they are zero.
We may arrange them in a square array as follows:
4.8
F ia
F8 a
j.3
Fas
Fas
F 43
F 14
F a4
F 34
F 44
-NO
M
IX
-L
1Y
-M
L
-IX
-H
-iZ
iZ
= -F
We may compare the property F
ij
which our new tensor has with the property
=
of matter (2.7)
MJJ
Mji possessed by the tensor
ana which is simply the result of commutativity
of multiplication. These two properties are
manifested in the square arrays (3.1 and 4.8) in
that the components of M^i which are symmetric
with respect to the main diagonal are equal, and
those of FJLJ which are symmetric with respect to
the main diagonal are opposite.
Tensors of the
first type are called symmetric, those of the
second - antisymmetric .
We want to see now whether the notations
which permitted us to write in a nice form Maxwell 's equations would not spoil the nice form
which we previously gave to the hydrodynamical
equations. But since here we have at our dis-
whose array is
J(X
- Y* - Z*)
XY
XZ
i(Y
12
-X*-Z t )
TZ
IZ
-X 1 -Y 1 ).
Of this tensor we fora the divergence (threedimensional), and find as its components
10
Z,) + Y(X 7 - Y x ) + Z(X, - Zz),
Y(X X + Yy + Z.) +- Z(Y. - Zy),
Y(Z y - Y.) * Z(X X -f Yy + Z.);
X(X X + Yy
X(Y X - Xy )
X(Z X - X,)
XY+LM
H+LM
i(I
8 +!!l 8
XZ+LN
xz+iJi
YZ-Hffl
TZ4MN
is called the
"electromagnetic stress tensor";
it has been introduced by Maxwell and plays some
part in electromagnetic theory, for instance, in
the discussion of light pressure; but its main
applications and Importance seem to be in the
study of the fundamental questions, as part of
a more general four-dimensional tensor.
We saw how nicely the system of Indices
worked in the case of Maxwell's equations; it
is natural to express in index notation this
tensor also. We assert that the required expression is given by
*4
= F.
= F tt F l4 * F.,F, 4
p Fp 4
iJ
F F
ij op po
where i,j take on values 1,8,3, and the summations indicated by p and o are extended from 1
to 4.
In fact,
+Fi 4 F4i
= E(-L 8 - a -N 8 +X 8
FipFpx
+I 8 +
Z 8 ),
- fc 8 - *T 8 - |Z 8 =
i(LZ - IX),
+ X"
M8
I" + Z)
).
These quantities happen also to have physical meaning. The first three constitute (except for the factor i) the components of the
so-called Poynting vector, and the last on* is
the so-called electromagnetic energy (or, enerWe are thus led by the notations
gy density) .
we have introduced in a purely mathematical way
to some physical quantities;
we may say that
the entire tensor with its sixteen components it is called "the electromagnetic stress-energy
tensor" - unifies in a single expression all
the second degree quantities appearing in the
electromagnetic theory; the stress components,
the Poynting components, and the energy.
The stress-energy tensor may be written
out in the form of the following square array:
Xa + L 8 - h
5.2
XY + LM
XZ + LN
i(NY-MZ)
5.3
XY + LM
Y 8 + M8- h
YZ + MN
i(LZ-NX)
XZ + LN
YZ + MN
Z8
+N a -h
i(BY - MZ)
i(LZ - II)
i(MX - LY)
i(MX-LY)
where h = J(X 8 + Y 8 + Z 8 + L 8 + M 8 + I 8 ).
6.
5.1
= F ,F, 4 + F ,F, 4 X
X
1(HI - HZ),
FipFp*
L8 - I 8 - Z 8 - M 8 - H8 )
and similarly for the other components corresponding to different combinations of the indices
1,2,3 for 1 and J. There seems to be an inconsistency here; the summation indices p and o we
let run from 1 to 4, but we consider only the
^F
i*olc^
If
'.
6.1
"oE,
val-
(in
12
Chapter II.
NEW GEOMETRY
In the preceding chapter we achieved by Introducing appropriate notations a great simplicThe noity and uniformity in our formulas.
tations in which indices take the values from 1
to 4 are modeled after those previously
introduced in ordinary geometry, the two points of
distinction being first that we have four independent variables instead of the three coordinates, and second, that the fourth variable la
assigned imaginary values. In spite of these
distinctions the analogy with ordinary geometry
is very great, and we shall profit very much by
pushing this analogy as far as possible, and
using geometrical language, as well as notations modeled after those of geometry.
Physics seems to require then, a mathematical theory analogous to geometry and differing
from it only in that it must contain four coordinates, one of which is imaginary. The first
purpose of this chapter will be to build a theory to these specifications. The remaining part
of the chapter will be devoted to a more systematic treatment of tensor analysis.
7.
given a., v x ) the coordinate* of all the different points of the line. The variable p if
called the parameter, and this whole way of describing a line is called "parametric representation". Parametric representation is by no
means peculiar to four dimensions, it may be,
and is, of ten used in plane and solid analytic
geometry. We present it here because we shall
need it later, and it is not always sufficiently emphasized.
A straight line is determined by two
points; the equations of the line through the
points ai and bi is given by the above equations (7.1) in which
7.2
= b x
v a = bt - a a ,
etc.
7.3
is
7.1
vi/ vi = v a /V 8 = v a /V 3 = v4 A* or Vj = avx
as
7.4
x 2 = a a +pv 8 ,
7.11
*a = a 3 +pv 3 ,
x 4 = a 4 +pv4 ;
YJ.VX
+ v 8 V a + v s V, + v4 V4
18
7.41
-b a ) + (a s -b,)
(a 4 -b 4 )j
7.5
cos
<p
a
a
(x.y)* ^ x .y
or
(x.y)*-
x 8 y 8 - 0.
X 8 x2
and the above inequality, which expresses the
fact that the discriminant of this expression is
negative is seen to be a consequence of the assumption that a square of a vector is never negative.
A plane we would expect to be determined by
three points not in a line, or by two vectors
with the same initial point or by two lines
through a point. Instead of characterizing a
plane by equations we prefer to give it in parametric form; limiting ourselves to a plane
through the origin we have
where x, a, b stand for vectors whose components are x if a^, b^ and where we use Greek
letters for parameters in order to avoid confusion with vectors which we denote now by Roman letters.
We always can choose two mutually perpendicular unit vectors as the two vectors determining a plane; if we call these vectors 1 and
J the preceding formula becomes
7.6
xa
or
i.J = 0,
-1.
= pa a + qb g ,
7.18
x s = pa 3 + qb a ,
oi + pj.
F,
qbi,
aa + pb
x 4 = pa4 + qb 4 ,
14
first four are considered as the four projections of a part of a straight line on the coordinate axes, the remaining six, as the projections of a part of a plane on the coordinate
planes.
A little against our expectations, however,
these six quantities FIJ are not independent;
the reader will easily verify, using the above
expressions, that
7.7
l.J
J.k
k.l
J" - k* - 1.
0;
0.
7.62
We have here a relation that exists in the mathematical theory; at once the question arises:
does a corresponding relation hold for the corresponding quantities in the physical theory;
according to the formulas 4.5 this would mean
0,
i.e., perpendicularity of the electric and magnetic force vectors; these vectors are, however,
known not to be necessarily perpendicular to
each other; our Identification is therefore
faulty; a slight modification would, however,
help to overcome the difficulty; if instead of
considering the areas of projections of a triangular contour, we consider the areas of projections of an arbitrary contour, not necessarily a flat one, then the six quantities are independent and the formal analogy holds perfectly.
al +
6f;
this formula gives the expression of every vector with initial point at the origin in terms
of four mutually perpendicular unit vectors. We
have, of course,
7.8
i.J
k.
8.
0,
= k
Until now we have been listing some propositions that we may expect to have in four-dimensional geometry. But what is. four-dimensional geometry? As an abstract mathematical theory it is Just a collection of statements of
which some may be taken without proof and considered as axioms and definitions, and the others are deduced from them as theorems.
It is
not difficult then to pass from our expectations
of a four-dimensional geometry to a realization
of such a geometry; all we would have to do
would be to pick out certain of the propositions
listed above and consider them as axioms and to
show that the others can be deduced from them.
But in so doing we do not want to include
among our axioms propositions involving coordinates. In two and three-dimensional geometry
we are accustomed to see analytic geometry based
on the study of elementary geometry, that precedes It. We have an idea of what a straight
line is before we come to coordinate axes, and
we choose three of these pre-existing straight
lines to play the part of coordinate axes.
We
may choose these axes to a certain extent arbitrarily (this arbitrariness being restricted
only by our desire to have a rectangular system) ; the coordinate axes play only an auxiliary
role, and it would be awkward to Include any
reference to a particular coordinate system in
the axioms. We look, therefore, among the propositions mentioned in the preceding section for
some that are Independent of a coordinate system
and from which we can reconstruct the whole system.
7.61
x = oi +
where i,J,k have again been chosen as perpendicular unit vectors so that
15
Axiom III. To every two vectors a and b corresponds a number a.b or ab called their (scalScalar multiplication is commutaar) product.
=
a.b
it obeys together with multib.a;
tive,
plication of vectors by numbers the associative
law, a(a.b) = (aa.b); and, together with addition - the distributive law a.(b + c) = a.b +
a.c.
8.1
aa
pb
+ YC + ...
x =
J(jx) + k(kx)
=0;
no
ex
+ PJ +
=0;
(-f)i
(-f)J
-,
16
- (X - 0) + (0 - 0), the old coordiX nates will be equal to the new coordinates plus
the old coordinates of the new origin, and thus
a connection is established with ordinary
analytic geometry.
Formula 8.1 may be compared with the formula 7.62. It may also be written as
x =
8.2
Xj.1
9.
+ x a j + x a k + x4 /
Tensor Analysis.
We want to substitute now for the preliminary definitions of tensor analysis that were
suggested by the formal developments in Chapter
At
I, a definition that is more satisfactory.
that time our point of view was simply that we
shall consider symbols with two (or more)
indices as the tensor components in a way similar
to that of using symbols with one index as vector components. But in case of vectors (in ordinary space) we know what vectors are, and we
consider the components as a method of representing that known thing. In the case of tensors we seem to have to take representation as
the starting point of our study.
The situation
seems complicated since the fact that there is
not one but that there are many different representations of the same vector, depending on
the coordinate system we use leads us to think
that the same general situation will obtain in
the case of tensors,, and the question arises nahow shall we be able to find out, givturally:
en two representations of a tensor, whether it
is the same tensor that is represented in the
two cases; or, given a representation of a tensor in one system of coordinates how to find
the representation of the same tensor in a given other system.
In order to be able to answer
such questions intelligently we want to introduce the idea of the tensor itself, to put it in
the foreground and to consider the components as
something secondary. In the beginning we shall
limit ourselves to the consideration of two dimensions.
We look then for some entity, of which the
components will be constituent parts. The first
thing that occurs to our minds in connection
with tensors of rank two is, of course, a determinant. It is a single number determined by its
elements, or components. However, it cannot be
used for our purposes because the determinant
does not, in turn, determine its components.
Another instance where two index symbols
occur in mathematics is the case of quadratic
forms.
The equation of a central conic may, for
a
example, be written in the form ax + 2bxy + cy*
= 1; or
introducing the notations x x for x, x a
for y, a
for a, a 12 and a 2 i for b, and a 22 for
c, in the form
+ a t aXy a ;
9 =
9.11
- X' a S,
a c,
where
a al x 8 Xj.
a ai x a yx
9.3
c = cos
s =
sin 9;
yj. ,
y a will
17
dinate systems yet. We want therefore to define linearity independently of coordinate representation and we shall see that the following
definition, entirely Independent of coordinates
leads to the same results.
We say, In general, that ? (x) depends on
its argument x linearly if
9.6
+ a ia c a - a^. s 8 + a aa sc
ia
9.21
a1
8i
= -a lx sc - a ia s 8 + a ai c
a sg cs
a
8
a'as 3 ajj.8 - a lg sc - a ai cs + a 88 c .
J7i;
+ s8 = 1
9.4
a'n
of a
e(J,J)x a
the relation
9.5
<p(Xx
+ a' 8a = a 1]L + a aa ;
an
9.51
a al
7.
f(x,y).g(z,u,v);
18
X8 V8
O^x^
of
components
9.71
and
this
vector
T, -'of/ox,;
given another coordinate system the relation between X and x'i being given by formulas 9.,
we can form the derivatives
9.72
"of/ox 1 !
and
"of /ox 1 8
-f
"of/ox^.s
= -vx s + vt c = -
^f/OX
= ^f/OXi.^Xi/bX 1 ! +
and since
^Xi/ox'i = c,
W/ox 8 .h 8 ;
df =
"of/ox a .dx a ,
which leads to the interpretation of the differential as a tensor of rank one whose components
are the derivatives of the given function.
19
ofa/ox a ;
can we consider these as the components of some
tensor of rank two? In other words, if we define a tensor by saying that in the old system
it has these components will its components in
the new system be obtained by differentiating
with respect to the new coordinates of the new
components f 'i of the given tensor? A calculation analogous to the one preceding will convince us that this is so.
We have thus introduced an operation which
leads from a tensor of rank zero to one of rank
one, and from a tensor of rank one to one of
rank two, and we could, continuing in the same
way pass from any tensor to one of the next
In introducing this operation we
higher rank.
used components of tensors and coordinates of
points, but we proved that the result is the
same no matter what particular coordinate system we might have used; the operation of differ
entiation is thus independent of a coordinate
system.
After this detailed treatment of tensors
and operations on them in plane geometry it
will not be difficult to generalize to higher
dimensions. We consider first three-dimensional space - solid geometry - and begin with an
equation of a central quadric surface.
Using
notations similar to those introduced at the be
ginning of this section it may be written as
9.8
+ a23.Xj.Xa + a 88 x 8 x 2
+ a 3 iX 3 Xj. + 832X3X2 +
33X3X3 = 1,
x x
po p o*
9.81
a,
coefficient a 23 .
In coordinateless notation we were
pendent of the number of dimensions to
indebegin
*(x p ip,
The notation
pp
a 22 + a 33
20
and the result of contraction will be
PP
'11
22
SJ
in order to prove that contraction has an intrinsic meaning, independent of the system of
coordinates we have to prove that the last two
expressions are equal. If the transformation
involves only the x x and x a coordinates,
but
does not involve x a , then a' 33 which is the coefficient of x'sy's will be a 33 which is the
coefficient of x 3 y 3 , because x 3 = x 3 , y'3=y 3 ,
and the other coordinates do not depend on x
and y 3 ; the coefficients a'n, a' 22 , on the
other hand will be transformed by the same formulas (9.21) as in the two-dimensional case because xi, x a , yi> Yz are transformed by the
same formulas (9.?) as before.
Therefore, formula 9.5 is applicable, and this together with
the fact that a'sa = 33 establishes the invariance of contraction under a transformation of
coordinates involving x x and x a only. But the
same reasoning would apply to transformation involving x 2 and x 3 only, or x x and x 3 only, and
since we have proved that a general transformation of coordinates may be replaced by a succession of transformations involving each only
two coordinates we have proved the invariance of
the operation of contraction of a tensor of rank
two under a general coordinate transformation.
Following the same principle we could prove
the invariance of contraction for tensors of any
rank and also, using the fact that the invariance of the operation of differentiation has
been proved for two dimensions, prove that it
has an intrinsic meaning in three dimensions.
We come now to four dimensions. Here it is
easy to prove that a general transformation can
be effectuated by a succession of six single rotations, i.e., rotations involving only two axes
each; in fact, a rotation in the xt-plane will
bring the x-axis into the new xyz-solid; a rotation in the yt-plane will bring the y-axis into the new xyz-solid, a rotation in the zt-plane
will bring there the z-axis; now the t-axis coincides with the new t-axis and the x,y,z axes
are all in the new xyz-solid and can be brought
into coincidence with the new x,y,z axes by
three more rotations as we saw before.
The reasoning indicated for the three-dimensional case will, therefore, prove the invariance of the fundamental operations of tensor
analysis also for four dimensions.
1
10.
the fact that the fourth coordinate is imaginThe introduction of Imaginarles helped us
ary.
to obtain a symmetrical form of Maxwell B equations, and seems to be beneficial from this
point of view. The formal part of the theory'
runs now smoothly; but there Is a disadvantage
in this smoothness, It conceals very important
peculiarities, and the present section will be
devoted to the consideration of some of these
1
peculiarities.
The discussion may be conveniently attached to the consideration of the expression (pomp.
7.41)
which defines the square of the distance between the two points whose coordinates appear
In it; or the square of the vector joining
these two points.
Since in the above formula the quantities
x 4 and x*4 and, therefore, their difference is
imaginary, the fourth square is negative, and
,the expression may, according to the
relative
magnitudes of the terms, be positive, negative,
or zero. There are thus three types of relative positions of two points, or directions, or
of vectors; there are vectors of positive square,
those of negative square, and those of zero
square. Our geometry is thus more complicated
than what we would expect it to be if it would
differ from the ordinary geometry in the number of dimensions only. This complication, or
this richness of our geometry, far from being
an undesirable feature, is, as we shall see, an
advantage, because it corresponds to certain
features of the outside world, e.g., the existence of both matter and light which are going
to be identified with two kinds of vectors. At
present we only mention that the momentum vector of a material particle, or the vector of
components u,v,w,i are vectors of negative
square; in fact, the square of the latter is
u 2 +v a +w 8 -!; the first three terms representing
the square of the velocity of the particle
which, according to the third remark preceding
formula 4.3 is very small compared with unity,
the expression is negative.
In some cases it may be desirable to sacrifice the formal advantages accruing from the
use of the imaginary coordinate in order to put
in evidence the peculiarities we are discussing; it is permissible to go back then to the
old notations x,y,z,t, but it becomes necessary
shall
to modify the formulas accordingly.
(We
see later how it is possible to use index notaIf we detions and still avoid imaginaries.)
note two points by x,y,z, it and x' , y , z , it ,
instead of by xi and x't the formula for the
square of the distance will be
f
10.1
(x-x')
f
- (t-t')',
(y-y')* + (z-z )*
ee 7.4)
10.2
aa'
+ bb'
+ cc
dd
xs =
cos
<p
x 4 sin
* 4 = x 3 sin 9 + x 4 cos
10.
sin t =
COS f
10.4
1.
10.5
sec
tan
10.6
z'
zo + tt,
t1
ZT
tO.
Axiom
(a
l-c
H
1
sign. Dividing each of the vectors a and b' by
root
of
the
absolute
value
of Its
the square
square we obtain two mutually perpendicular vectors whose squares are +1 and -1, which we may
It is easy now to
call k and / respectively.
pick out two more vectors 1 and J which together with k and Jf constitute a set of four mutually perpendicular vectors; none of them can have
should be
a zero square, because if, say, i a
zero all the vectors of the plane determined by
a and i would have zero squares, contrary to
axiom V.
Using these four vectors we can, as in the
other case, express every vector in the form
(comp. 8.2)
ai + pj
10.7
O/.
will be considered as
The numbers a, p , Y
In order to
the components of this vector.
show that we have what we wanted we shall express the square of x in terms of its components.
Squaring 10.7 and taking into account that f* =
-1 we obtain
+ p
= a,
x3
p,
10,
we
11.
^pua/oxg
0,
^Fy
/DXfc +
"oFjfc
/QXj[ + "dFfci
^Fia/ox a = eui
11.3
/ox* = 0,
11.4
'oT
ia /ox a
=0
11.5
= F
iP PJ
(i = 1,2,3)
ployed .
An analogous reasoning applied to 11.3
would show the invariant character of that sysThe question of invariance of the first
tem.
axiom V .
Using these four vectors we can, as in the
other case, express every vector in the form
(comp. 8.2)
will be considered as
The numbers a, p , Y >
In order to
the components of this vector.
show that we have what we wanted we shall express the square of x in terms of its components.
Squaring 10.7 and taking into account that jf* =
-1 we obtain
= a2
= Y,
X 4 = 10,
we
11.
11.1
SpUa/oxa
0,
11.8
"oF^j
/ox k +
-oFjfc
/oxi +
e-Ffci
^>Fia/x a = eui
11.3
/OXj
= 0,
11.4
-oT
ia /ox a
=0
iP PJ
(i = 1,2,3)
ployed .
An analogous reasoning applied to 11.3
would show the invariant character of that sysThe question of invariance of the first
tem.
23
system of Maxwell's equations requires a special discussion; it can best be treated by introducing a new an ti- symmetric tensor DJ_J connected with F4
by the following relations:
F 8 , = D 14 , F 31 = D, 4 ,
11.6
F 14 = D 13 , F 84 = D 81
etc.,
12
F 2l
F 82 sc;
is anti-symmetric (4.4)
F' a = F,
12
11.7
yi =
0,
c,
x3
=0,
x 4 = 0,
y 2 = 0,
y3
=1,
y 4 = 0;
fl
so that
11.71
FjJ
= -sF 13 + cF 23 ;
11.72
D 14 c
the
that set
11.2'
and its invariance follows from general considerations as in the case of 11.3.
Formula 11.5 contains only multiplications,
contractions and additions, so that there la no
doubt concerning its invariance, but the situation changes when we come to the set 11.4. The
vector *> T
ia/ox a has been obtained by invariant
operations but 11.4 states that only three of
its components are zero, a statement which obviously depends on the choice of coordinate axes and is not invariant.
We have now two courses open before us:
one is that of resignation, we can say: we see
that physics is not like geometry in this respect, that we can only use four-dimensional
notations, a four-dimensional diagram but not
four-dimensional geometry; the other course is
that of adventure, we may try to play the game
of geometry; let us pretend that we can apply
the formulas of transformation of coordinates
in this case; we know that there will be a difference between the theory we obtain and the
physics which we undertook to translate into
our language; but it may be that the difference
will amount numerically to very little. Consider the fourth component of the vector oTj^oXgj
we found (comp. remark following 6.3) that one
of the terms of this expression, ^Mio/oxg vanishes, and the other 0^3/0X3 gives Xu+Yv+Zw,
where u,v,w are the components of velocity, but
in order to present the Maxwell equations in a
simple form we had to choose our units in such
ora way that the velocity of light is unity;
dinary velocities are of the order of magnitude
of one ten-millionth of the velocity of light,
so that we see that by setting the fourth component of oTi a /ox o equal to zero we would commit an error that is numerically very small.
This encourages us to go on with our adventure
and try to force the geometrical character on
In order to do that let us go beyond
physics.
the formal structure of our formulas and recall
what the meaning of our fundamental quantities
The components of the vector U A were givwas.
en (see 1.2, 2.4, 4.9) as
- D 84 s;
11.3
FJJ
becomes
u x = dx/dt,
u 2 = dy/dt,
u a = dz/dt,
u 4 = 1.
But this identification is obviously not independent of the coordinate system, it gives prefWe may think
erence to the fourth coordinate.
and
that this is the source of our difficulty,
that this difficulty may be overcome if we find
an invariant identification to take the place
The next section will prepare the way
of 11.8.
for this.
4
12.
The root of the difficulty is that our description of motion was not invariant; motion
was described by giving the dependence of the
coordinates x,y,z on time, that means by giving
three of our coordinates Xj., x a , x 3 , as functions of the fourth x 4 which thus is given prefThe situation is analogous to that In
erence.
plane analytic geometry where we give y as a
function of x, or that in solid analytic geometin
ry when we give y and z as functions of x;
both cases we represent curves; from our fourdimensional point of view we should then consider motion of a particle as a curve in four dimensions (using the word curve in a general
sense so that straight line is a special case) .
What we want then is a representation of
curves in four dimensions which would not give
preference to the fourth coordinate. We begin
by considering representations of curves in two
and three dimensions which give no preference to
one coordinate.
In the plane a line may be represented by
x = ap + b,
y = cp + d,
x = r cos p,
y = r sin p;
a circle by
in space a line by
x = ap + b,
y = cp + d,
z =
12.2
(dx/dp)
+ (dy/dp)
+ (dz/dp)
ter
1.
ep + f ,
(dx a /dq)
+ (dx a /dq) a
a helix by
x = r cos p,
y = r sin p,
z = kp, etc.
y = g(p),
z = h(p)
18.1
s =
/ V( dx/dp) a +
(dy/dp)*+ (dz/dp)'dp;
dp/dq
that
the
12.3
w,
i.e.,
for
26
Chapter III.
SPECIAL RELATIVITY
Guided by the point of view that the formulas of physics ought to be interpreted in fourdimensional geometry we were led to the interpretation of the motion of a particle as a curve
in space-time. Following the analogy with a
curve in ordinary space where arc length, s, is
often used as a parameter, we have introduced a
standard parameter, which we may also. call arc
length and denote by s, for curves in spaceThe partial derivatives dxj/ds of the cotime.
ordinates of a point on the curve with respect
to s may be considered as the components of a
vector tangent to the curve (the square of this
vector is -1 in every point - we shall refer to
such a vector also as a unit vector) . We have
then at every point Xi, x a > x 3 , x 4 of such a
curve a .unit vector dxi/ds, and we have agreed
to identify this vector with the vector u^ which
appears in our fundamental laws of physics (11. 1
to 11.5) so that
ui
dxi/ds.
15.
Equations of Motion.
u = dx/dt,
v = dy/dt,
13.1
Ul
U2
U4
w = dz/dt.
described by giving
of s; and we have
dxi/ds which, due
parameter satisfies
u = dx/dt =
dx/ds
dxi/ds
13.8
w =
mv
13.3
mu4
These are what we call the mathematical components of the momentum vector; its "physical components" are
mu
1
~ -L
Of course, we have merely two representations of the same thing. Given the Xi(s) we can
express s as a function of t from
x 4 (s) = it
13.51
c
87
We obtain a relation between the momentum
components and mass if we take the sum of the
squares of the components 13.3 or 13.31 and use
13.1, viz.,
a a + b" + c
- d*
= -m";
13.4
d =
13.41
imV /c
the first factor of the first term vanishes according to the continuity equation 11.1;
the
second term may be written, recalling the definition of uj, as
= p.ou /ox .dx /ds
1
a
p.duj/ds;
our
so
p.duj/ds
or, if we use the discrete picture, considering
both mass and electric density to be concentrated
in one point, and denoting mass by m, electric
charge by e,
13.51
m.duj/ds =
13.52
m.dUj/dt
= eF
la .dxo/dt
13.53
m.du 3 /dt = e(Z + Mu - Lv),
m.du 4 /dt = ie(Xu + Yv + Zw)
m'.d*x/dt=e(X+Nv-Mw)
13.6
where
m
13.7
= m/-
14.
Lorentz Transformations.
about our coordinate system* IB the greater arbitrariness in their choice. Before we were
free to pass from one system to another with
the same time axis; now we may change the time
axis also (formulas 10.6), and we want to see
what it means.
In general, in one system of coordinates A
geometrical configuration is described by certain numbers (e.g., coordinates of its points)
and certain equations (e.g., equations of its
straight lines); In another system of coordinates the same configuration will be characterized by other numbers and other equations, but
It will be another description of the same configuration; or, we may say, another identification which is theoretically just as good as the
first, but may be more, or less, convenient for
practical purposes. In general we make our
choice of a coordinate system guided by the properties of the object we are studying and our owi
position in space. If we study an ellipsoid we
would choose for coordinate axes its principal
axes; or in another case we would choose the direction away from us as the y-axis, the direction to the right as the x-axis, the vertical
direction as the z-axis; but in principle all
axes are permitted. The same general situation
obtains in physics, considered as four-dimensional geometry. We have many systems of coordinate axes at our disposal, and we want to investigate now what use we can make of this arbitrariness, how we can adjust the choice of axes to the requirements of a particular situaIn particular, we are interested in the
tion.
choice of the t-axis.
The object we want to study in the first
We repreplace is the motion of a particle.
sent such a motion by a curve in four- space (and
a straight line we consider as a special case of
a curve) . At every point of that curve we have
a unit tangent vector, the four-dimensional velocity vector of components a,b,c,d; or we may
characterize it by the three-dimensional velocity components u,v,w; if we pass to another coordinate system the components a,b,c,d will be
changed, and so will the components u,v,w. If
the coordinate transformation affects only the
space coordinates x,y,z then the component d will
not be affected, and therefore p will not change;
in other words, u,v,w will be changed but not
u a + v 8 + w; the velocity vector will have different components, but its absolute value, the
speed, will be the same. This is essentially as
in old physics; the new feature is in the existence of transformations affecting t, and the
most striking result of it is expressed in the
following theorem.
Theorem. For every motion it is possible
for every moment to choose a system of spacetime coordinates in such a way that the speed
be zero.
Proof. Begin with any axes'; then, without
changing the, time axis, change the space axes
so that the motion, at the moment considered,
takes place
= 0,
if we want to make c
so that
a'
=0,
angle
b'
co + dT,
= cr +
do;
\J>
-o
= sin
<r
= -c/d;
therefore more general coordinate transformations; we shall replace our equations by more
general ones which will be invariant under these
more general coordinate transformations; in comparison with this situation we may say that we
consider now only special coordinate transformations and invariance under them; therefore the
present theory is called "Special Relativity
Theory".
It may be mentioned that the historical
order of appearance of the ideas of our subject - as it happens so often - has been quite
different from the order which seems natural and
in which we have presented them. First the
formulas of transformation involving space coordinates and time have been introduced by Lorentz without, however, giving to them the meaning they have now; in Lorentz's theory there
exists one universal time t, and other times t 1
play only an auxiliary part. The merit of making the decisive step and recognizing the fact
that all these variables are on the same footing - belongs to Einstein (1905). The four-dimensional point of view, after some preliminary
work had been done by Poincare and Marco longo,
was most emphatically introduced by Minkowski
in 1908.
15.
Addition of Velocities.
V A = V B>
XA ~
ZA =
15.11
(L
= Z T
B BA
15.12
=Z C O CA
+ t cT CA ,
tA
=Z CJTCA + t c o cl ,
Z=ZO
B
C(B
=ZT
CCB
and also
15.21
VGA =
VBA
VCB
OCA'
TCB
OCB*
Express now Z A , t A in terms of z c , t c by substituting the values for ZB, t B given by the
transformation formulas 15.12 into the transformation formulas 15. Ij comparing the result
with the transformation formulas 15.11 we get
CA
GB'BA +TCB'TBA
TCA = TCB*BA
+
CB*
TBA
15.3
VCB
'
"CA
l + VB A .V CB
15.4
ZA
ZA
and
15.1
ZA ~
BA
V =
V'
+ V".
16.
+ (dy/dt)
or
+ (dz/dt)
u2 + v2 + wa
-1=0
= 1,
choose the origins of the two coordinate systems on the respective straight lines; next we
may consider the two lines In the respective
z-t planes.
The momentum vectors of the corresponding photons will have now In their respective coordinate systems the components
0,0,q a ,q 4 and 0,0,p,,p 4 (contrary to our general agreement we use here subscripts with physical coordinates) and since both vectors are of
zero square we'll have
/-
- 0,
Q4*
- P4*
P*
Of
by choosing appropriately the sense on each coordinate axis we can reduce these conditions to
16.1
" P4<
ZO + tT,
t'
ZT
tO,
= P* =
condition
that o 2 - ** = 1, so that we can choose o + t
arbitrarily; if we make the choice
16.3
+ T
= q a /p 3 ,
M
and energy is given by the formula
E = hv,
16.4
where h is the so-called Planck's quantum constant, which for us appears simply as coefficient of proportionality establishing the relation between the values of two quantities which
measure the same thing in different units, much
in the same way that c, the velocity of light,
appears in the formula connecting mass and energy (compare 15.41) . Of the two quantities, S
E
and v, which can be used to measure color,
will be the one that is more convenient for our
purposes because we use the corpuscular theory
of light.
The question now is with what quantity in
our theory are we going to identify E. In order
to have a suggestion we notice that E is of the
character of kinetic energy; it plays for light
particles the same role that kinetic energy
plays for material particles. There (Section
sec13) we identified kinetic energy with the
ond term in the development
m
= m
a+i
this gives
16.4
v /v
T* = 1
T *
V.
or, in first approximation 1
Let us try to figure out the predicted
change of frequency on the classical (ware) theIf we have a wave of frequency v that
ory.
means that there are v vibrations per unit of
there
time, and since the velocity is unity,
will be v waves per unit of length, low, if we
move toward the source with a velocity V we
shall travel in a unit of time the distance V
17.
o*
F 48 - F 38 s
F 43 = F 43
F 48 c
F 83 = F 8 ,c - F 84 s
Fix = F 31 c - F 41 s
IB
ox + TM
= oL - TY
yi
= oy - TL
= oM + TX
Z'
= Z
L.
M
Chapter IT.
CURVED SPACE
18.
=*
f(x),
which can be expanded into a power series converging in the neighborhood of the origin. The
constant term of this expansion vanishes because
the curve passes through the origin so that the
equation must be satisfied for x = 0, y = 0; the
coefficient of the first power of x also vanishes, since the slope of the tangent, which is
the x-axis, must be zero; if we write the next
term in the form
18.1
Y = *a,x'
18. tt
Z =
a* +
2*18*1*8 +
18.3
iX
18.4
s(x,u)
s(x,v)
s(y,u)
s(y,v)
18.5
a a2 x 3 y 2 = s(x,y)
x.u
18.6
X U
x.v
=
K.
y.u
P P
7ou o
y.v
u
& 21
multiplying by the law of multiplication of determinants the second and the third factors and
writing K for the first, we get 18.6; applying
the law of multiplication of determinants first
to the first two factors, then to the resulting
determinant and the third factor we get 18.5;
we may thus write
s(x,u)
s(x,v)
x.u
x.v
y.u
y.y
K.
18.7
s(y,u)
s(y,v)
Before we leave the topic of ordinary surfaces we want to establish a relation between
curvature of surfaces and curvature of curves.
The points common to our surface and the xzplane constitute a plane curve whose equation
in the xz-plane may be obtained from z = f (x,y)
It is clear that the x-axis
by setting y = 0.
is a tangent to this curve and that the first
term of the expansion of z as a function f (x,0)
into a power series will be obtained by setting
in 18.2. We have thus
x2 =
18.11
This expression plays the part of the expressions 18.1 and 18.2 which have occurred in the
two preceding situations.
ll*l
18.8
If we change the y- and z-axes which fall into
the normal plane to the curve, to other axes in
18.9
19 .
General! za t ions .
In the preceding section we discussed configurations in the ordinary space, and we could
rely on our intuition; everybody can conceive a
plane curve, a surface, a twisted curve; we
have at our disposal physical objects (drawings,
graphs, models) with measurements on which quantities of our theories may be identified successfully. In the investigation we undertake
and
now, we cannot use our intuition any more,
the identifications, when they come, will be of
a much less immediate character.
We have than
to rely on analogy with the configurations studied in the preceding section and on mathematical
reasoning supported by formulas.
We begin with what seems the next simplest
it
case, a surface in four-dimensional space;
may be considered as a generalization both of a
surface and of a curve in ordinary space. Such
one is given, in general, by two equations on
the four coordinates; in other words, we daf^nf
as a surface in four-space the totality of
points whose coordinates satisfy two equations
=
where F,G are
P(x,y,z,t) = 0, G(x,y,z,t)
two functions subjected to certain restrictions
We define
to be imposed presently.
plane
In four dimensions as a surface which may be
given by two linear equations (this definition,
although given in terms of coordinates, is invariant, because it can be proved that if the
equations are linear In one system of coordinates they remain linear after a transformation; the equivalence of this definition of the
plane with that given in Section 7 is easily
recognized). In the general case we choose a
point on the surface as the origin of coordinates; we solve the two equations for two of the
coordinates, and we define as the tangent plane
at that point the plane whose equations result
from omitting all but linear terms in the expansions. We next choose that plane as one of
our coordinate planes; the lowest terms in the
expansions are then the quadratic ones; denoting the coordinates for which the equation* apand the two other
pear as solved by x, x,
H
coordinates by x x , x a , we may write the groups
of quadratic terms in the two expansions as
a ta x,),
19.1
2b 18 x x x a + b aa x a ").
For every vector in the tangent plane of components x x , x a this gives us two numbers which
may be considered as the components of a vector
in the normal x, x 4 plane; or, we may consider
this vector as given by a vector form
19.2
'11*1
2v xa x 1 x a + v aa x 8
the coefficients v^j being vectors of the normal plane whose components are a ij and bjj (along
This expresthe x 3 and x 4 axes respectively) .
sion assigns to every vector of the tangent
plane a vector of the normal plane; we may substitute for it, as we did in an analogous case
before (compare 18.3), a more general expression
19.3
s(x,y)
19.41
v ai
but although this is linear in each of the vectors x and y it is_ not a tensor, because the
values of this expression are not numbers (they
are vectors in the normal plane) . We shall not
introduce a special name for such expressions
because we shall not have to deal with them much;
the expression 19.5 we have denoted, as before,
by s(x,y), but we must keep in mind that the
values of s(x,y) are not numbers but vectors of
the normal plane.
We may in this case form the expression
18.5 where it is understood that in the expansion of the determinant scalar products have to
be used where ordinary products were used before; this change is made necessary by the more
than once mentioned fact that the values of the
elements are vectors. In all other respects we
can apply to the expression the same reasoning
as before and we come to the conclusion that the
relation 18.7 remains true, where K is a number,
independent of the coordinate systems in the tangent and normal planes, but which after such coordinate systems have been chosen can be calculated in terms of the coefficients of the vector
form 19.3 by means of the formula
xx
N-n
V 82 x 8 y a ,
where
N-2byr
19.5
k=l
19.6
r
(k)
krl
19.4
xx
r
19.41
k=l
.00
s(x,u)
20.
=
19.43
k=l
where the Greek letter subscripts imply summation fron one to two corresponding to the tangent plane, and the summation with respect to k
corresponding to the r coordinates of the normal flat is indicated in the usual fashion. Each
of the determinants corresponding to the differrent values of k is exactly of the same nature
as 18.5 so that the reasoning which led from it
to 18.7 applies to each of the determinants
without change, and it is easy to see that formula 18.7 continues to hold. We may use this
formula to define K which we continue to call
total curvature.
And now we come to the last generalization.
We consider, in a space of an arbitrary number
of dimensions N a curved space of n dimensions
with n also an arbitrary number N, which by
definition is the totality of points whose coordinates satisfy N - n = r equations
19.7
Fk
(xi.,
x 2 ,....xN )
=0
(k = l,2,...r).
V.'e
assume that for every point in the curved space these equations can be solved for r
of the coordinates and that these solutions can
be expanded into power series in the remaining
n coordinates, converging in the neighborhood of
the point selected. By a transformation of coordinates we may arrange it so that these expansions begin with second degree terms so that
we may write
19.71
x n+j
where the summation indicated by the Greek indices now goes from 1 to n. The sub-space defined by the first n coordinate axes we call the
tangent flat space at the point considered, and
the sub-space corresponding to the remaining r
coordinate axes - the normal flat space at that
point.
/jjN
As before, we use the coefficients aj
to
form the expressions
19.8
s(x,y)
p-J-k
20.1
s(x,u)
s(x,v)
s(y,u)
s(y,v)
R(x,y;u,v) =
10
its points
a flat space of 2 + r dimensions;
will be those points of the N-space whose coordinates x a , x 4 ,....x n vanish; the intersection
of this flat space with the given curved space
will be a surface, i.e., a two-dimensional curved space, because the coordinates of its points
must satisfy the r equations of the curved space
(19.7) and n - 2 equations
20.2
0,
= 0,
x4 =
isN-n+n-2=N-2
which together
equaThis surface we may consider as a surtions.
face of the r + 2 dimensional flat space 20.2;
its equations in that space will be obtained by
in the equations
setting x 3 = x 4 = ...x n =
19.71 of the curved space (just as the equation
of the normal section of a surface in the xzplane was obtained (preceding formula 18.11) by
in the equation of the surface);
setting y =
these equations will then become
20.3
+ terms of higher degree
20.42
20.43
20.5
,1
v ia
where
'28
R ab;cd
20.6
with
(k)
v ai =
20.71
20.72
20.73
abjcd
= ~R
Rab;cd
Rab;cd
+ R
'
ba;od
= R
~R
abjdc
od;ab
ac;db
+ R ad;bc
= 0.
41
20 ' 8
r
Z
k=l
(k)
A
because the I's are unit vectors and I^.Ifc = 1>
or
r
2
k=l
PY
21.
=
abjcd
In the last section we learned how to associate with every point of a curved space a tensor of rank fourj for our physical interpretation
we need one of rank two; but we know how to obtain from a tensor of rank four one of rank two;
we have to apply the operation of contraction.
The result we shall call the "contracted Riemann
tensor" and we shall expect to identify it with
The first question we have to ask
the tensor T.
ourselves in this connection is whether the contracted Riemann tensor satisfies the equation
11.4, viz., oTia/axg = o. But before we do that
we have to go through quite a lengthy development
because at the present stage we do not know how
to differentiate tensors on a curved space. In
flat space we could consider the differential of
a vector, or, more exactly, of a vector field,
by (roughly) considering the difference of two
X 8 = X(Ui,U 8 ,U 3 )
*1 = Xl(u X ,U 8 ,U 3 )
x, = X 3 (u!,u 8 ,u 3 )
u8
21.2
= U 3 (xi,x 8 ,x 3 ),
21.3
y(p),
21.4
dx/dp,
dy/dp,
dz/dp.
du a /dp,
21.5
duj/dp.
We have then the required system of representation; but it is not necessary, every time
when we want to represent a vector to introduce
a curve to which it is tangent; we shall show
how to find the components 21.5 when we are given the cartesian components 21.4 without actually considering the curve.
We have, considering that Uj depends on
xx , x 8 , x 3 which in turn depend on p,
Uj.,
21.6
u8 , u a ,
duj/dp
oui/3xa.dx<i/dp,
21.7
V 1 * OUi/OXo.Ya
21.8
21.15
21.9
21.10
where
21.11
V 1 = a la v a .
v l = b ia a v
op p>
ap
=0
p
1Q v fl
so
10V
21.16
21.16'
21.31
21.13
ia*aj=lj-
21.14
a iab
=
aj
C,
n b rj
Ui(p).
21.17
=/|/(dx/dp)"
+ (dy/dp)*
-i-
we
(dz/dp)dp
ds
= dx 8 + dy*
dz 1
dx =
dz = b 3 a.du;
for dx 2 we may write bi adu.bijjduP; using similar expressions for the other terms of 21.17' we
get
ds a
21.18
22.
We come now to the representation of tenWe know that a tensor is a function which
assigns numbers to vectors, and the question of
representation will be simply this: given the
general components of the vector arguments how
to find the corresponding value of the
tensor.
We have already solved this problem for one particular tensor, namely for the tensor of the
scalar product which we expressed in the preceding section by means of the g's, and we shell
use the same method in the general case. Given
the cartesian components of a tensor
fj* and the
1
general components V* and W of two vector arguments, to find the corresponding value of the
tensor. We pass from general components to cartesian components by formulas 21.10 and substitute these expressions into the expression
f
Y6 v Yw C for tne value of tne tensor; the result
sors.
is
Tji
T7^wP ~
f*
f
Z
if we set
PP P
.Vi
D
Y6 Yi Oj
1?
*lj
we call FIJ the general components of the tensor, whose cartesian components are fji , and we
see that the values of a tensor are expressed by
formula 22.1 in its general components and the
general components of the vector arguments in
the same way as in terms of cartesian components
of the tensor and the vector arguments.
Formula 21.15 for the scalar product of two vectors
may be considered as a special case of 22.1. The
cartesian components of the tensor gj j are, of
22.11
F,
22.5
ljk
We know now how to write tensors in general components, and we want to find out how to
perform operations on them. Of course, we could
always pass from general components to cartesian components, perform the required operations
and then, if the result is a tensor, pass back
to general components; but instead of following
this program in every special case as it presents itself we shall do it once for all and
derive general formulas whose application in
special cases is much more convenient than ad
hoc calculations.
We begin with the operation of contraction.
Given again a tensor of rank two by its general
components FJJ we pass to its cartesian components by formula 22.3 and now we contract by
taking the sum of components with equal indices
according to the original definition,
22.4
fyy
aiY a
jY
= gi ^'
22.8
22.41
22.81
student.
irith
ijkl
iokp
j
g
aj
To prove it suffices to substitute the expressions 21.16 and 22.5 for the two kinds of g's
also
and to apply formula 21.14 twice; we may
notice that, as it follows from the definition,
op 7(
66
"
tfij
g
"
=- g
^Ji
be written as
22.61
Next comes the operation of differentiation.
The result of differentiating a tensor is always
a tensor of rank higher by one than the
given
tensor; its components will have one more index than those of the given tensor; we shall
denote them by simply adding a new index preceded by a comma, to the symbol of the given tensor.
Because the situation is slightly more complicated, let us start in translating differentiation into general components with the simplest case of a tensor of rank one given in general components, F^. We follow the same program:
as a first step we pass to cartesian components
we next
by formula 22.5 and get f^ = Ffja^:
find the cartesian components of the differential
by simply differentiating with respect to cartesian coordinates with the result
As a third step we pass back to general components using the formula 22.2 and arriving at the
result
but b
where we set
22.82
ij,k
22.66
ijk,a
r
r
<*
v
F
iak
ru
r
r
9
r
i F ajk
TT
k*ija>
there
the general rule ought to be clear now;
are as many correction terms as there are indices; each correction term corresponds to one
index, the other indices being disregarded in
its formation.
In order to be able to perform the operation of contraction (and the operation of scalin
ar multiplication is a special case of it)
general coordinates we have to know, as we saw,
the values of the g's; in order to be able to
perform the operation of differentiation we
have to know the value of the T's (the correction coefficients); if we know those we can perform all the necessary operations in general coordinates without going back to cartesian coordinates. We shall show now the values of the
T 's can be derived from those of the g's.
The correction coefficients were given originally by the formula 22.81; we can give to
this another form by using the relation 21.14;
=ft kl and differentiating
writing it as a^b
it with respect to uj we get, since the as are
constants,
&
so that we have
22.91
22.84
k
r, 4
d*Xv
=
ij
r i*
j
22.85
o,
o,
22.92
23.
components
to the same cartesian components v^ there correspond thus two different sets of general components depending on whether we consider the v^
as vector or as tensor components; it was in anicipation of this situation that we have been
using the index for general vector components
as a superscript.
Essentially, a vector and a
tensor of rank one are one and the same thing;
and so we have two different systems of components for every vector (in a given general coordinate system) ; the components with subscripts
are called covariant components, those with the
superscript - contravariant components. It is
clear from what precedes, but it may be worthwhile to repeat that we have here two different
representations of one and the same thing.
It was mentioned in Section 21 that there
are two ways in which a vector results from differentiation; one, a vector considered as a tangent vector to a curve, was discussed before,
and is the basis of what we have been doing all
this time; it is interesting to consider now
briefly the other. If we start with a scalar
field f = f(x,y,z) we may derive a vector field
by differentiation, and the cartesian components
47
of tals vector field will be
23.1
this vector la known as the gradient of the field
f ; now, we may give the same scalar field in general coordinates
f = f
will
if we differentiate f with respect to u lf
we obtain general components of the gradient
vector field? The question is easily answered
by computing these components; we have
23.2
of
Pf
oxg
23.4
3f
comparing this with 22.11 we see that the partial derivatives of/oui are the components with
subscripts - the covariant components in general coordinates of the gradient. The two representations, the covariant, and the contravariant, may be thus considered as corresponding to
two ways in which a vector can be arrived at by
differentiation; if we consider a vector as a
gradient we arrive naturally at Its representation by covariant components, if we consider it
as a tangent vector we arrive at its representation by contravariant components.
(The name coto
Indicate
is
Intended
the
way,
variant, by
that these components change in the same way, as,
or have similar formulas of transformation with,
partial derivatives.) In the case of cartesian
coordinates, of course, covariant and contravariant components of a given vector coincide: in
this case it is not necessary to make any distinction.
We shall have to use covariant as well as
contravariant components, and it is important
to be able to pass from one to the other representation; the necessary formulas can be found,
of course, by passing through a cartesian representation. Let covariant components F A of a
tensor of rank one (or vector) be given; formulas 22.3 show us that the corresponding carteterms of these
sian components are fA = Fa a ai ;
the contravariant components are obtained by
formula 21.9 which gives here
23.3
F 1 = a F aa a p =
lp
23.42
23.31
It may seem that there is a wasteful redundancy in this double system of notations, that
one representation is enough, and that to have
1
We call F ^ the contravariant components of the
tensor FJJ and the components with lower Indices
(subscripts) which we have been using for tensors heretofore we call covariant components.
We have thus two representations not only for
tensors of the first rank (vectors) but also for
tensors of all ranks. In one case we have been
using already a symbol with two superscripts,
1
we shall
viz., the g ^ (introduced by 22.5);
in
is
agreement
notation
this
that
show now
48
with the general notation we are introducing now
by proving that these g's with upper indices are
the contravariant components of the tensor of
In order to prove that,
scalar multiplication.
the
we notice that, according to formula 23.42
contravariant components of a tensor of covariant components g^ are
23.48
O a so that
we
but according to 22.61
j
1
get 6ag al - g^ * and the assertion is proved.
text we want to learn how to differentiate
a tensor given in contravariant components, but
before we do that it seems necessary to Introduce what we call mixed components. Suppose we
are given one vector argument of a tensor of
rank two in contravariant, and the other In covariant components, and we want to find the value of the tensor; if the components of the two
given vectors are V 1 and W^, and the cartesian
components of the tensor are f^j , we pass to
cartesian components of the vector arguments
23.5
Vi
= b
VY
1Y ,
wj
= W a ,
ol
23.51
v
ap a wp
components :
we differentiate this:
"ft
O~Vj.
&
23.6
^J
"
lu~
r?
r*
23.7
We are in
possession now of all the formal rules of operations on tensors in general
coordinates. Although these rules were deduced
by means of cartesian coordinates these coordinates and components together with all formulas
involving the a's and the b's form only a kind
of scaffolding that can be removed after the
building has been completed. All ire have to
know in order to operate on tensors are the g's.
Using the g's we can lower and raise Indices and
contract and, as a special case, find the scalar
product of two vectors; also find the angle between two vectors (using formula 7.5) and the
length of a curve (using 81.18). Given the g's
we can calculate the r" s (end of Section 22) and
with the aid of the P s we can differentiate
tensors (formulas
2.8). We see thus that the
g's play a fundamental part in all operations the tensor of which they are components is often called the fundamental tensor.
Before we conclude we might state explicitly that all the formulas we have obtained are
entirely independent of the number of dimensions.
24.21
4.3
x x = x,
x t = y,
x a = z,
x4 = it
x =
x.
y = xs ,
z = x,,
t = -1x4
24.2
* -
+ 7* +
t.
IX,
31 = M,
24.11
f,, - L,
f 4 , = 1Z,
24.1
-g 44
41
24.
1.16
aa = ass = 1
Using formulas 22.2 and 24.2 we obtain the covariant components in physical coordinates - and
we use here capital letters - as follows:
F 4l = X,
F 4t
F 83 = L,
F J8 = M,
Y,
= Z,
P4
24.4
= M.
= ga
V*F al
24.6
F\
'
a 44 = i
b 44 = -ij
24.7
b sl = b, 3 = 1,
f lt = i.
jk>l
0,
50
and in the presence of matter the
becomes
second
set
eu j
24.71
We notice here that no Imaginary quantities appear and In spite of this our formulas
are symmetric. The raising of the index 4 Is
equivalent to changing the sign of a component,
and this is how the minus sign is taken care of,
We shall now write out the expressions for
the stress energy tensor and the equations of
motion; it is clear that the formulas 11.4 and
11.5 become
24.6
T la
24.9
=
<*
*o,jFF a
or
24.91
T iJ =
- ig 1J F
FP
= F FP - i
F FP
p
gij
lp
j
puSx
or
24.92
24.10
25.
(pu
a
),
0.
We want next to apply the general coordinate system that we have introduced for flat
space also to curved spaces. In flat space we
introduce the language of general coordinates
and components by translating from the language
of cartesian coordinates and components; in
curved space we have no cartesian components; we
shall have, therefore, to begin by introducing
something that will play the role of cartesian
coordinates; we shall introduce quasi-cartesian
coordinates, which will take that place; but
whereas cartesian coordinates are universal in
that the same system of coordinates works for
the whole plane, or flat space - the neighborhood of every point in curved space has its own
system of quasi-cartesian coordinates. They are
defined in the following way: Consider at a
given point P the tangent flat; there will be In
general a neighborhood of P such that no two
points of that neighborhood have the same projection on the tangent flat (for a sphere, e.g.,
we may obtain such a neighborhood by drawing any
small circle around the point of contact) for
such a neighborhood there exists a one-to-one
correspondence between the points of it and the
points of the tangent flat which are their projections. We introduce now on the tangent flat
a cartesian system of coordinates with origin
at the point of contact, and we use the coordi-
nates of a point of the flat as the quasi-cartesian coordinates of the point of the curved
space whose projection it is; If, for Instance,
a surface is given by equation
5.1
z = J(ax
2bxy + cy)+t.h.d.,
25.11
t.h.d.
51
or it may be considered as a vector of the containing space. Its (cartesian) components will
be given by x^ (l = !,...,) where the x are
functions of t which are obtained fro* 25.2 by
substituting for the u's the expressions
characterizing our curve; we hare thus
25.2
1
and
oxi/ou a .u
= r equations,
1=1
= u ,
x
x n = Ua,
x a = ua ,
i=n+l
25.4
= z
t.h.d
k=l
t.h.d.
*n
(g. n )
- o
25.42
25.5
t.h.d.
the
x's
25.6
25.7
0.
25.8
65
It may seem that we lost from view the
original purpose of Introducing curved space,
which was that of obtaining a tensor which we
could Identify with T^ . We Introduced the
Riemann tensor having this In mind, but now we
seem to be Immersed In an entirely formal theory and far removed from the Riemann tensor; as
a matter of fact, It Is just around the corner;
differentiation, although performed according
to formulas that are formally the same
as
In
flat space, has, as we shall see, a new content;
in trying to discover the difference we will be
led to the Riemann tensor from a new point of
26.21
view.
26.
^^71
7K^*
- F3
in flat space.
or
26.4
- F
where
26.5
-
26.2
26.3
O3C-J
and g
r ^ respectively,
and these are equal because the result of ordinary differentiation does
not depend on the order; two tensors having equal
components in one system of coordinates would be
equal in all systems of coordinates and so we
have
26.1
+I* P;
aJ
rj
The left hand side is a scalar that has been obtained by legitimate operations and is, therefore, independent from the coordinate system
used; so is therefore the right hand side, and
this proves that the B's are the components of
a tensor.
(In order to see that this is an essential point and that not every symbol with
indices may be considered as a tensor, the read<
er might consider the expression P krX e^Z*';
this expression is, obviously, not independent
from the choice of coordinates since in cartesian coordinates the r's vanish, and in other
systems they do not; the r's furnish thus an
54
tainly can measure lengths; it follows that curvature, as expressed by the Riemann tensor if
accessible to the Inhabitants, It Is an internal
property of the space. In particular, for I
5,
n
2, i.e., for the ordinary surface we obtain
the fact that the total curvature can be calculated from the expression for the line element;
this is Gauss's Theorema Egreglun.
27.
27.1
The proof is very simple if we use quasicartesian coordinates. In these coordinates the
r's at the point of contact vanish and although
the first derivatives of the r's do not, the
components of the tensor obtained by differentiating the B's (formula 26.5) which we have
identified with the R's will contain the second
derivatives only, because the first derivatives
will be multiplied by the r's themselves that
do vanish. With this remark in mind the proof
of the Bianchi relations does not present any
difficulty; we simply substitute for each of
the three terms in 27.1 the difference of the
two second order derivatives and find that the
result vanishes identically.
Now, in order to deduce froa 27.1 the relations for the contracted tensor we raise in
27.1 the second index so that we have
n,p
with n. We
obtain
55
p p,a
But R Pjp are the mixed components of the contracted Riemann tensor which we denote by R*j
so that, dividing by 2 and changing the sign we
have
a
Finally, R a is the result of contraction of the
contracted Riemann tensor; we denote this scalar by R (it is called the twi^e contracted Riemann tensor); then we can write for R a a p simply
R - or (O aijR) a and our relation becomes
28.
Multiplying by
to i, writing i
0.
a^
we
28.2
ttJ
r^pU^
0.
Geodesies.
28.1
dil/dp = b 1(
28.3
= 0.
0.
dxVdp
= bi u,
a
Since arc
57
Chapter V.
GENERAL RELATIVITY
In Chapter I we introduced certain fundamental quantities, and we combined them into the
We found
symmetric tensor of rank two, TJJ .
that this tensor satisfies the differential
equation
^T la /'ox a =
0,
D1
n
XK
t>
304X Jt n.
It might seem strange that we find a physical interpretation only for the contracted
Riemann tensor, only for ten combinations of
its twenty components. But this is quite In
order. Should all the components of the Riemann tensor be used up in interpreting matter
and electricity that would mean that where there
is no matter (and electricity) space-time is
flat (as far as internal properties are concerned) ; that would mean that matter acts only
where it is; but we know that matter make* itself felt, for instance, by the gravitational
field that it produces, also outside the region
which it occupies, and this is in accord with
our identification as a result of which only
part of the components of the Riemann tensor
vanish where there is no matter, so that the
remaining components may be Interpreted as corresponding to gravitational forces.
29.
a
=
1>a
u p,iuP
29.3
29.4
op/oxa.^Uj
0.
9.2 we get
R}
= 0.
+ pua
auj + Prf*Uj a
iijR
0.
0, so that R
A = o log p/ox a .u
ua a
uSij a
= 0.
Substi-
u a .u.
Solar System.
Symmetry Conditions.
Our equations 29.1 and 29.2 describe relations existing between matter and field.
We
proved that the motion of matter is characterized by the geodesies of the curved space, but
the curvature is in turn determined by matter.
Theoretically, we may have a complete description of the situation, but in practice we do not
know how to handle it, we do not know where to
begin. In such cases we often resort to the
method of successive approximations. Let us try
to apply this method here.
In investigating the
motion of a planet around the sun we neglect in
the first place the motion of the sun.
Then, in
Rj
0.
These equations are known as Einstein's equations. We see that the statement that the corrected contracted Riemann tensor vanishes is
equivalent to the statement that the contracted
Riemann tensor vanishes.
As a result of our first approximation we
derived thus the field equations 30.1. In the
next approximation we introduce the planet n4
assume that its action on the field is negligible but that the field acts on It, i.e., that
the motion of the planet is given by the geodesies of the field which has been determined
in the preceding step; the motions will then be
given by the equations 28.?
30.2
'
- x) 8 + B(TJ
= 1
30.3
8
Adr 8 + B(d9 + sin a e.d9 8 ) = 1.
Adr
+ B(d6 2 +
Cdt
60
SO. 5
-ds a =
8
+ r 8 (d9 8 + sin 8 0.d<p 8 ) - T)(r).dt
31.2
g
33 =
Our task is now accomplished, we have imposed on our space the conditions of symmetry
and we have next to impose on it the general
equations 30.1.
31.
32 =
g aa ? =
44 =
-i/ n (r)
In what follows
Xi
= r,
x a = 8,
x, = f, x
= -r'slnO. cos 8,
-iV*
= t.
>
31.3
r
r
s,33
= r>
t, i
= r sin*0
r ,i
= r a sin 8. cos 8,
F 4 , 14 = -Jtf
^.
r
31.31
r sin 8 6
r;,
li
r; 4 = J
31.1
a
2
g 33 = r sin e,
e,
1,44
to r.
1/r",
g*
7,
rJ =
33
8
,
= -sine .cos e,
;
-cote,
r,: =
81
Ol * const.
or
Using 31.6 and 21.7 in the second of the equations 31.5 we obtain
4T1
g'r
1 -
which gives
31.8
TJ
= 1 -
31.9
where
-ds
TI
r d0
.d*
- r)dt*
is given by 31.8.
Equations of Geodesies.
32.
^Ti-
31.4
32.1
=0
Subtracting the last one from the first we have
= -1;
32.21
r -
-.r
rn
rrj
sin a 9.f8
2T]
32. 22
9 +
32.23
32.24
2.8
2p.f
sinO
+ 2
cosO f a = 0,
cote .9* * 0,
5-.rt = 0.
coordinates is at our
and
The choice of the
such a way that the
in
them
choose
We
disposal.
initial position of the particle be on the equator and that the tangent be tangent to the equator.
In this case 8 =^ and e = zero at the
shows
initial moment and the second equation
that
always. Now the last two equations
may be integrated once each and they furnish
0=4
32.3
32.4
32.5
1
lit
A.
32.6
co.,
.in,
We calculate the first and the second derivatives of x and y with reipect to t, substitute
them into 33.1 and combining the terns with COM
and
those with sin f we get
(d)
TJ
from 31.8,
equation
dt
33.1
211
-a du d9 ' -i ~d*9 _
"
'dt 1
'dt'dt
"dt
9/dt
gdu/dt
u
d9/dt
whence
1
d9/dt * Hu
33.3
We next
where H is a constant of integration.
want to eliminate t from 33. 2 with the help of
the last formula. Differentiating it we have
dt
and then
du = du d9 = du
de
dt
d9*dt
d!u m
dt*
dV*v"
dfPMV
+ _du.d!9 . d!^j
1
d9 dtT
de
**-
dx
COS9/U,
y =
sin9/u.
re-
by
8u~"(-n:)
33.
- 0.
-u~ 1 H u
+ Mu* =
33.4
du
entiation from
=
33.5
EMu
H
33.7
where a is a constant.
This corresponds to equation 38.7 obtained from Relativity theory in the
preceding section; in that last equation we
have, of course, to take A = -1 if we consider
the motion of a planet so that it becomes
du
- u,
ua
sln'x;
increase
u - u,
(ux - u t ) sin*x
33.8
ux - u =
and we see that the difference is essentially
in
only one term. But before we come to the com-
u8 )
33.53
and we see that a real solution is only possible
when u is between u 2 and ux . The motion will
manifest itself in an oscillation of u between
u a and Ui and the sign of du/d? will change at
these points. The particular question we want
to investigate is, as was mentioned at the beginning of the section, to what change of q> corresponds one oscillation of u, between u a and u^
In order to find this we solve the equasay.
tion for dp, obtaining
whence
33.6
33.51
33.52
U)(U
U 8 )'
Uj.
(u^
(u x
|u a
-
cos'z,
2dx
34.
64
Next we must identify the constants of the
new theory with those of the old. It would seem
as though we must, in order to reach an agreeso as to get rid of the last
ment, make Y =
term of the equation 33.51 by which it differs
essentially from 33.5. But this would annihilate also the preceding term in the new formula
and so spoil the correspondence altogether. We
must, therefore, ascribe to Y a finite value,
but we will expect that it will be small; more
precisely, it will be small in such a way that
3
the term yu will not affect the equation 33.51
essentially or will be small in comparison with
u a . Next, let us compare 32.3 with 33.3.
Of
differ
the
the
left
hand
sides
facby
course,
to
tor dt/ds, but this is equal (Section 13)
* which
1//1 - p
is, even for motions of planets,
very close to one, so that, in the first approximation we may identify h with H. Comparing now
33.5 and 33.51 we come to the conclusion that
34.1
the roots denoted in the saae way in the preceding case, the third root will be
ar
ur
ut ,
y
I
.
/(u x -u)(u - u t ) [1 -
u,
u)E*J
the same substitution 55.7 as before will be applied. We only have to calculate
u x + us + u
-i x
+ u,
+ (u x -
ut
u t )sin*z
= ux + u a + u x sin'x + u, cos*x,
so that the integral becomes
= 2M
2M(ui + u r
u x sin'x
u,cos x)"
34.2
ua
ux
sinx
cosx)] dx,
or
The new theory predicts then that the angle e
will have changed by this amount while the distance from the sun changes from its miniaum to
its maximum; i.e., that the perihelion and aphelion are not in a straight line with the sun but
that the planet moves through an additional angle
of -^f (u x + u a ) after reaching the position opposite the one where it was during the perhelion,
before reaching the aphelion. Since the same
situation applies to the motion between an aphelion and the next perihelion we see that between
two consecutive perihelia the planet will have
moved through an angle * * 3xM(u! + u,), or
that the perihelion will have moved through an
angle 3M(u 1 + u t ) during one revolution of the
planet. This is a very small amount, and it nay
be considered as a correction to the classical
result according to which the planet moves on an
elliptic orbit with the sun in one of the foci.
If a is the major semi-axis and e the eccentricity, the distance at perihelion is a- ae and the
distance at aphelion a + ae: we have then
ux
-i-
e),
65
and the final formula for the advance
perihelion comes out
34.5
of
the
35.
Deflection of Light.
+ u2 = o
2Mu
Etfu
a'
(^)
Mu
and
2Mu.
d9 =
-^_
/2M(u* -
$U
3
- (u* - u
)
)"
u = UQ sin x
and after the substitution develop according to
powers of M and keep only two terms; we thus
get for d9 approximately
on the old theory which corresponds to the absence of the term with M in the equation 35.1
we will have to omit the term with M in this
integral and the result is *. The approximate
result according to the new theory will differ
from that by
M
apparent position of the star, this latter position must appear shifted away from the sun apActual measureproximately by the angle 4M/r
ments are possible only during an eclipse of the
sun, because otherwise the light from the sun
and
drowns out the fainter light from the star,
are beset with difficulties but the results seem
to be in favor of the prediction.
.
* - 0.
ap * b
so that
d/dp
36.
36.1
a.d/dr.
. nt a = o.
a
a.
dr
dr
-r
dr
dr
dr
the last according to 36.2. What about the value of a? The answer is that it is not and cannot be determined by the foregoing discussion.
There are different beams of light which satisfy
all the conditions imposed so far; they differ
in color, and different colors correspond to
different values of a.
Color, according to our definition, is the
time components of the momentum vector; i.e.,
the scalar product of the momentum vector of
light and the unit vector in the time direction.
If we denote the (contravariant) components of
the latter by T 1 , the condition that it has tlae
direction will be given by
rpl
3 q3 = Q
ip8
This gives
dr =
rpGCmp
...
cc(3
dr =
36.2
equa-
tions
dq>
4
T)(T )"
= 1.
The scalar product of the vectors ui and T 1 calculated according to the formula g^JiJT^ becomes
ryit.
de = 0,
= 0,
dr =
rjdt.
e(l + M/r)
two
The color of a beam of light is then not constant along the beam. We shall compare the col
or as it appears near the surface of the sun,
where r is equal to the radius of the sun r s ,
and near the surface of the earth, where we may
The frequencies in these two
assume r = &.
cases will be for a given beam of light proportional to
1 + M/r
and
1;
equation 36.1 shows that the last two terms cancel leaving us with
M/r,
67
If now we consider some source of light
near the surface of the sun, whose frequency we
know, the light emitted by it when it is received at the surface of the earth will have a
frequency that is less, the amount of the relative change being given by
M/r.
If then we compare light coming from a terrestrial source, for Instance, emitted by an
BINDING LIST
University of Torontp
o
Oi
o>
CD
00
Library
CVJ
H 0)
s ^
JH
0)
tiO
to
0)
-P
h o
O -H
O
\
CJ
o3
e
o
-P
0}
-S
DO NOT
REMOVE
THE
CARD
FROM
THIS
&
Acme Library Card Pocket
Under Pat. "Ref. Index Flte"
PS