In non-technical terms, M-theory presents an idea about the basic substance of the universe.
Although a complete mathematical formulation of M-theory is not known,
the general approach is the leading contender for a universal "Theory of Everything" that unifies gravity with other forces such as electromagnetism. M-theory aims to unify quantum mechanics with general relativity's gravitational force in a mathematically consistent way. In comparison, other theories such as loop quantum gravity are considered by physicists and researchers to be less elegant, because they posit gravity to be completely different from forces such as the electromagnetic force.
Background
In the early years of the 20th century, the atom – long believed to be the smallest building-block of matter – was proven to consist of even smaller components called protons, neutrons and electrons, which are known as subatomic particles.
Other subatomic particles began being discovered in the 1960s. In the
1970s, it was discovered that protons and neutrons (and other hadrons) are themselves made up of smaller particles called quarks. The Standard Model is the set of rules that describes the interactions of these particles.
In the 1980s, a new mathematical model of theoretical physics, called string theory,
emerged. It showed how all the different subatomic particles known to
science could be constructed by hypothetical one-dimensional "strings",
infinitesimal building-blocks that have only the dimension of length,
but not height or width. These strings vibrate in multiple dimensions
and, depending on how they vibrate, they might be seen in
three-dimensional space as matter, light or gravity. In string theory,
every form of matter is said to be the result of the vibration of
strings.
However, for string theory to be mathematically consistent, the strings must live in a universe with ten dimensions.
String theory explains our perception of the universe to have four
dimensions (three space dimensions and one time dimension) by imagining
that the extra six dimensions are "curled up", to be so small that they
can't be observed day-to-day. The technical term for this is compactification. These dimensions are usually made to take the shape of mathematical objects called Calabi–Yau manifolds.
Five major string theories were developed and found to be
mathematically consistent with the principle of all matter being made of
strings. Having five different versions of string theory was seen as a
puzzle.
Speaking at the Strings '95 conference at the University of Southern California, Edward Witten of the Institute for Advanced Studysuggested that the five different versions of string theory might be describing the same thing seen from different perspectives. He proposed a unifying theory called "M-theory",
which brought all of the string theories together. It did this by
asserting that strings are an approximation of curled-up two-dimensional
membranes vibrating in an 11-dimensional spacetime. According to Witten, the M
could stand for "magic", "mystery", or "membrane" according to taste,
and the true meaning of the title should be decided when a better
understanding of the theory is discovered.
Status
M-theory
is not complete, and the mathematics of the approach are not yet well
understood. M-theory is a theory of quantum gravity; and as all others
it has not gained experimental evidence that would confirm its validity. It also does not single out our observable universe as being special,
and so does not aim to predict from first principles everything we can
measure about it.
Nevertheless, some physicists are drawn to M-theory because of
its degree of uniqueness and rich set of mathematical properties,
triggering the hope that it may describe our world within a single
framework.
Given the components of the four-vectors or tensors in some
frame, the "transformation rule" allows one to determine the altered
components of the same four-vectors or tensors in another frame, which
could be boosted or accelerated, relative to the original frame. A
"boost" should not be conflated with spatial translation, rather it's characterized by the relative velocity between frames. The transformation rule itself depends on the relative motion of the frames. In the simplest case of two inertial frames the relative velocity between enters the transformation rule. For rotating reference frames or general non-inertial reference frames,
more parameters are needed, including the relative velocity (magnitude
and direction), the rotation axis and angle turned through.
The usual treatment (e.g., Albert Einstein's
original work) is based on the invariance of the speed of light.
However, this is not necessarily the starting point: indeed (as is
described, for example, in the second volume of the Course of Theoretical Physics by Landau and Lifshitz), what is really at stake is the locality
of interactions: one supposes that the influence that one particle,
say, exerts on another can not be transmitted instantaneously. Hence,
there exists a theoretical maximal speed of information transmission
which must be invariant, and it turns out that this speed coincides with
the speed of light in vacuum. Newton
had himself called the idea of action at a distance philosophically
"absurd", and held that gravity had to be transmitted by some agent
according to certain laws.
Michelson and Morley in 1887 designed an experiment, employing an interferometer and a half-silvered mirror, that was accurate enough to detect aether flow. The mirror system reflected the light back into the interferometer. If there were an aether drift,
it would produce a phase shift and a change in the interference that
would be detected. However, no phase shift was ever found. The negative
outcome of the Michelson–Morley experiment
left the concept of aether (or its drift) undermined. There was
consequent perplexity as to why light evidently behaves like a wave,
without any detectable medium through which wave activity might
propagate.
In a 1964 paper, Erik Christopher Zeeman showed that the causality-preserving
property, a condition that is weaker in a mathematical sense than the
invariance of the speed of light, is enough to assure that the
coordinate transformations are the Lorentz transformations. Norman
Goldstein's paper shows a similar result using inertiality (the preservation of time-like lines) rather than causality.
Physical principles
Einstein
based his theory of special relativity on two fundamental postulates.
First, all physical laws are the same for all inertial frames of
reference, regardless of their relative state of motion; and second, the
speed of light in free space is the same in all inertial frames of
reference, again, regardless of the relative velocity of each reference
frame. The Lorentz transformation is fundamentally a direct consequence
of this second postulate.
The second postulate
Assume the second postulate
of special relativity stating the constancy of the speed of light,
independent of reference frame, and consider a collection of reference
systems moving with respect to each other with constant velocity, i.e. inertial systems, each endowed with its own set of Cartesian coordinates labeling the points, i.e. events
of spacetime. To express the invariance of the speed of light in
mathematical form, fix two events in spacetime, to be recorded in each
reference frame. Let the first event be the emission of a light signal,
and the second event be it being absorbed.
Pick any reference frame in the collection. In its coordinates, the first event will be assigned coordinates , and the second . The spatial distance between emission and absorption is , but this is also the distance traveled by the signal. One may therefore set up the equation
Every other coordinate system will record, in its own
coordinates, the same equation. This is the immediate mathematical
consequence of the invariance of the speed of light. The quantity on the
left is called the spacetime interval. The interval is, for events separated by light signals, the same (zero) in all reference frames, and is therefore called invariant.
Invariance of interval
For
the Lorentz transformation to have the physical significance realized
by nature, it is crucial that the interval is an invariant measure for any two events, not just for those separated by light signals. To establish this, one considers an infinitesimal interval,
as recorded in a system . Let be another system assigning the interval to the same two infinitesimally separated events. Since if , then the interval will also be zero in any other system (second postulate), and since and are infinitesimals of the same order, they must be proportional to each other,
On what may depend? It may not depend on the positions of the two events in spacetime, because that would violate the postulated homogeneity of spacetime. It might depend on the relative velocity between and , but only on the speed, not on the direction, because the latter would violate the isotropy of space.
Now bring in systems and ,
From these it follows,
Now, one observes that on the right-hand side that depend on both and ; as well as on the angle between the vectors and .
However, one also observes that the left-hand side does not depend on
this angle. Thus, the only way for the equation to hold true is if the
function is a constant. Further, by the same equation this constant is unity. Thus,
for all systems . Since this holds for all infinitesimal intervals, it holds for all intervals.
Most, if not all, derivations of the Lorentz transformations take this for granted. In those derivations, they use the constancy of the speed of light
(invariance of light-like separated events) only. This result ensures
that the Lorentz transformation is the correct transformation.
Rigorous Statement and Proof of Proportionality of ds2 and ds′2
Theorem:
Let be integers, and a vector space over of dimension. Let be an indefinite-inner product on with signature type . Suppose is a symmetric bilinear form on such that the null set of the associated quadratic form of is contained in that of (i.e. suppose that for every , if then ). Then, there exists a constant such that . Furthermore, if we assume and that also has signature type , then we have .
Standard configuration
The
spacetime coordinates of an event, as measured by each observer in
their inertial reference frame (in standard configuration) are shown in
the speech bubbles. Top: frame F′ moves at velocity v along the x-axis of frame F. Bottom: frame F moves at velocity −v along the x′-axis of frame F′.
The invariant interval can be seen as a non-positive definite
distance function on spacetime. The set of transformations sought must
leave this distance invariant. Due to the reference frame's coordinate
system's cartesian nature, one concludes that, as in the Euclidean case,
the possible transformations are made up of translations and rotations,
where a slightly broader meaning should be allowed for the term
rotation.
The interval is quite trivially invariant under translation. For
rotations, there are four coordinates. Hence there are six planes of
rotation. Three of those are rotations in spatial planes. The interval
is invariant under ordinary rotations too.
It remains to find a "rotation" in the three remaining coordinate
planes that leaves the interval invariant. Equivalently, to find a way
to assign coordinates so that they coincide with the coordinates
corresponding to a moving frame.
The general problem is to find a transformation such that
To solve the general problem, one may use the knowledge about
invariance of the interval of translations and ordinary rotations to
assume, without loss of generality, that the frames F and F′ are aligned in such a way that their coordinate axes all meet at t = t′ = 0 and that the x and x′ axes are permanently aligned and system F′ has speed V along the positive x-axis. Call this the standard configuration. It reduces the general problem to finding a transformation such that
The standard configuration is used in most examples below. A linear solution of the simpler problem
solves the more general problem since coordinate differences
then transform the same way. Linearity is often assumed or argued
somehow in the literature when this simpler problem is considered. If
the solution to the simpler problem is not linear, then it doesn't solve the original problem because of the cross terms appearing when expanding the squares.
The solutions
As
mentioned, the general problem is solved by translations in spacetime.
These do not appear as a solution to the simpler problem posed, while
the boosts do (and sometimes rotations depending on angle of attack).
Even more solutions exist if one only insist on invariance of the
interval for lightlike separated events. These are nonlinear conformal
("angle preserving") transformations. One has
Some equations of physics are conformal invariant, e.g. the Maxwell's equations in source-free space, but not all. The relevance of the conformal transformations in
spacetime is not known at present, but the conformal group in two
dimensions is highly relevant in conformal field theory and statistical mechanics. It is thus the Poincaré group that is singled out by the postulates of
special relativity. It is the presence of Lorentz boosts (for which velocity addition
is different from mere vector addition that would allow for speeds
greater than the speed of light) as opposed to ordinary boosts that
separates it from the Galilean group of Galilean relativity.
Spatial rotations, spatial and temporal inversions and translations are
present in both groups and have the same consequences in both theories
(conservation laws of momentum, energy, and angular momentum). Not all
accepted theories respect symmetry under the inversions.
Using the geometry of spacetime
Landau & Lifshitz solution
These three hyperbolic function formulae (H1–H3) are referenced below:
The problem posed in standard configuration for a boost in the x-direction, where the primed coordinates refer to the moving system is solved by finding a linear solution to the simpler problem
The most general solution is, as can be verified by direct substitution using (H1),
1
To find the role of Ψ in the physical setting, record the progression of the origin of F′, i.e. x′ = 0, x = vt. The equations become (using first x′ = 0),
Now divide:
where x = vt was used in the first step, (H2) and (H3) in the second, which, when plugged back in (1), gives
or, with the usual abbreviations,
This calculation is repeated with more detail in section hyperbolic rotation.
Start from the equations of the spherical wave front of a light pulse, centred at the origin:
which take the same form in both frames because of the special relativity postulates. Next, consider relative motion along the x-axes of each frame, in standard configuration above, so that y = y′, z = z′, which simplifies to
Linearity
Now assume that the transformations take the linear form:
where A, B, C, D are to be found. If they were non-linear, they would not take the same form for all observers, since fictitious forces
(hence accelerations) would occur in one frame even if the velocity was
constant in another, which is inconsistent with inertial frame
transformations.
Substituting into the previous result:
and comparing coefficients of x2, t2, xt:
Hyperbolic rotation
The equations suggest the hyperbolic identity
Introducing the rapidity parameter ϕ as a hyperbolic angle allows the consistent identifications
where the signs after the square roots are chosen so that x' and t' increase if x and t increase, respectively. The hyperbolic transformations have been solved for:
If the signs were chosen differently the position and time coordinates would need to be replaced by −x and/or −t so that x and t increase not decrease.
To find how ϕ relates to the relative velocity, from the standard configuration the origin of the primed frame x′ = 0 is measured in the unprimed frame to be x = vt (or the equivalent and opposite way round; the origin of the unprimed frame is x = 0 and in the primed frame it is at x′ = −vt):
and hyperbolic identities leads to the relations between β, γ, and ϕ,
From physical principles
The problem is usually restricted to two dimensions by using a velocity along the x axis such that the y and z coordinates do not intervene, as described in standard configuration above.
Time dilation and length contraction
The transformation equations can be derived from time dilation and length contraction, which in turn can be derived from first principles. With O and O′ representing the spatial origins of the frames F and F′, and some event M, the relation between the position vectors (which here reduce to oriented segments OM, OO′ and O′M) in both frames is given by:
OM = OO′ + O′M.
Using coordinates (x,t) in F and (x′,t′) in F′ for event M, in frame F the segments are OM = x, OO′ = vt and O′M = x′/γ (since x′ is O′M as measured in F′):
Likewise, in frame F′, the segments are OM = x/γ (since x is OMas measured inF), OO′ = vt′ and O′M = x′:
By rearranging the first equation, we get
which is the space part of the Lorentz transformation. The second relation gives
which is the inverse of the space part. Eliminating x′ between the two space part equations gives
that, if , simplifies to:
which is the time part of the transformation, the inverse of which is found by a similar elimination of x:
Spherical wavefronts of light
The following is similar to that of Einstein. As in the Galilean transformation, the Lorentz transformation is linear since the relative velocity of the reference frames is constant as a vector; otherwise, inertial forces
would appear. They are called inertial or Galilean reference frames.
According to relativity no Galilean reference frame is privileged.
Another condition is that the speed of light must be independent of the
reference frame, in practice of the velocity of the light source.
Consider two inertial frames of referenceO and O′, assuming O to be at rest while O′ is moving with a velocity v with respect to O in the positive x-direction. The origins of O and O′
initially coincide with each other. A light signal is emitted from the
common origin and travels as a spherical wave front. Consider a point P on a spherical wavefront at a distance r and r′ from the origins of O and O′ respectively. According to the second postulate of the special theory of relativity the speed of light is the same in both frames, so for the point P:
The equation of a sphere in frame O is given by
For the spherical wavefront that becomes
Similarly, the equation of a sphere in frame O′ is given by
so the spherical wavefront satisfies
The origin O′ is moving along x-axis. Therefore,
x′ must vary linearly with x and t. Therefore, the transformation has the form
For the origin of O′ x′ and x are given by
so, for all t,
and thus
This simplifies the transformation to
where γ is to be determined. At this point γ is not necessarily a constant, but is required to reduce to 1 for v ≪ c.
The inverse transformation is the same except that the sign of v is reversed:
The above two equations give the relation between t and t′ as:
or
Replacing x′, y′, z′ and t′ in the spherical wavefront equation in the O′ frame,
with their expressions in terms of x, y, z and t produces:
and therefore,
which implies,
or
Comparing the coefficient of t2 in the above equation with the coefficient of t2 in the spherical wavefront equation for frame O produces:
Equivalent expressions for γ can be obtained by matching the x2 coefficients or setting the tx coefficient to zero. Rearranging:
or, choosing the positive root to ensure that the x and x' axes and the time axes point in the same direction,
which is called the Lorentz factor. This produces the Lorentz transformation from the above expression. It is given by
The Lorentz transformation is not the only transformation leaving
invariant the shape of spherical waves, as there is a wider set of spherical wave transformations in the context of conformal geometry, leaving invariant the expression . However, scale changing conformal transformations cannot be used to symmetrically describe all laws of nature including mechanics, whereas the Lorentz transformations (the only one implying ) represent a symmetry of all laws of nature and reduce to Galilean transformations at .
Galilean and Einstein's relativity
Galilean reference frames
In classical kinematics, the total displacement x in the R frame is the sum of the relative displacement x′ in frame R′ and of the distance between the two origins x − x′. If v is the relative velocity of R′ relative to R, the transformation is: x = x′ + vt, or x′ = x − vt. This relationship is linear for a constant v, that is when R and R′ are Galilean frames of reference.
In Einstein's relativity, the main difference from Galilean
relativity is that space and time coordinates are intertwined, and in
different inertial frames t ≠ t′.
Since space is assumed to be homogeneous, the transformation must be linear. The most general linear relationship is obtained with four constant coefficients, A, B, γ, and b:
The linear transformation becomes the Galilean transformation when γ = B = 1, b = −v and A = 0.
An object at rest in the R′ frame at position x′ = 0 moves with constant velocity v in the R frame. Hence the transformation must yield x′ = 0 if x = vt. Therefore, b = −γv and the first equation is written as
Using the principle of relativity
According
to the principle of relativity, there is no privileged Galilean frame
of reference: therefore the inverse transformation for the position from
frame R′ to frame R should have the same form as the original but with the velocity in the opposite direction, i.o.w. replacing v with -v:
and thus
Determining the constants of the first equation
Since
the speed of light is the same in all frames of reference, for the case
of a light signal, the transformation must guarantee that t = x/c when t′ = x′/c.
Substituting for t and t′ in the preceding equations gives:
Multiplying these two equations together gives,
At any time after t = t′ = 0, xx′ is not zero, so dividing both sides of the equation by xx′ results in
which is called the "Lorentz factor".
When the transformation equations are required to satisfy the light signal equations in the form x = ct and x′ = ct′, by substituting the x and x'-values, the same technique produces the same expression for the Lorentz factor.
Determining the constants of the second equation
The transformation equation for time can be easily obtained by considering the special case of a light signal, again satisfying x = ct and x′ = ct′, by substituting term by term into the earlier obtained equation for the spatial coordinate
giving
so that
which, when identified with
determines the transformation coefficients A and B as
So A and B are the unique constant coefficients necessary
to preserve the constancy of the speed of light in the primed system of
coordinates.
Einstein's popular derivation
In his popular book Einstein derived the Lorentz transformation by arguing that there must be two non-zero coupling constants λ and μ such that
that correspond to light traveling along the positive and negative x-axis, respectively.
For light x = ct if and only if x′ = ct′. Adding and subtracting the two equations and defining
gives
Substituting x′ = 0 corresponding to x = vt and noting that the relative velocity is v = bc/γ, this gives
The constant γ can be evaluated by demanding c2t2 − x2 = c2t′2 − x′2 as per standard configuration.
Using group theory
From group postulates
Following is a classical derivation based on group postulates and isotropy of the space.
Coordinate transformations as a group
The coordinate transformations between inertial frames form a group
(called the proper Lorentz group) with the group operation being the
composition of transformations (performing one transformation after
another). Indeed, the four group axioms are satisfied:
Closure: the composition of two transformations is a transformation: consider a composition of transformations from the inertial frame K to inertial frame K′, (denoted as K → K′), and then from K′ to inertial frame K′′, [K′ → K′′], there exists a transformation, [K → K′] [K′ → K′′], directly from an inertial frame K to inertial frame K′′.
Associativity: the transformations ( [K → K′] [K′ → K′′] ) [K′′ → K′′′] and [K → K′] ( [K′ → K′′] [K′′ → K′′′] ) are identical.
Identity element: there is an identity element, a transformation K → K.
Inverse element: for any transformation K → K′ there exists an inverse transformation K′ → K.
Transformation matrices consistent with group axioms
Consider two inertial frames, K and K′, the latter moving with velocity v with respect to the former. By rotations and shifts we can choose the x and x′ axes along the relative velocity vector and also that the events (t, x) = (0,0) and (t′, x′) = (0,0) coincide. Since the velocity boost is along the x (and x′)
axes nothing happens to the perpendicular coordinates and we can just
omit them for brevity. Now since the transformation we are looking after
connects two inertial frames, it has to transform a linear motion in (t, x) into a linear motion in (t′, x′) coordinates. Therefore, it must be a linear transformation. The general form of a linear transformation is
where α, β, γ and δ are some yet unknown functions of the relative velocity v.
Let us now consider the motion of the origin of the frame K′. In the K′ frame it has coordinates (t′, x′ = 0), while in the K frame it has coordinates (t, x = vt). These two points are connected by the transformation
from which we get
Analogously, considering the motion of the origin of the frame K, we get
from which we get
Combining these two gives α = γ and the transformation matrix has simplified,
Now consider the group postulate inverse element. There are two ways we can go from the K′ coordinate system to the K coordinate system. The first is to apply the inverse of the transform matrix to the K′ coordinates:
The second is, considering that the K′ coordinate system is moving at a velocity v relative to the K coordinate system, the K coordinate system must be moving at a velocity −v relative to the K′ coordinate system. Replacing v with −v in the transformation matrix gives:
Now the function γ can not depend upon the direction of v
because it is apparently the factor which defines the relativistic
contraction and time dilation. These two (in an isotropic world of ours)
cannot depend upon the direction of v. Thus, γ(−v) = γ(v) and comparing the two matrices, we get
According to the closure group postulate a composition of
two coordinate transformations is also a coordinate transformation, thus
the product of two of our matrices should also be a matrix of the same
form. Transforming K to K′ and from K′ to K′′ gives the following transformation matrix to go from K to K′′:
In the original transform matrix, the main diagonal elements are both equal to γ,
hence, for the combined transform matrix above to be of the same form
as the original transform matrix, the main diagonal elements must also
be equal. Equating these elements and rearranging gives:
The denominator will be nonzero for nonzero v, because γ(v) is always nonzero;
If v = 0 we have the identity matrix which coincides with putting v = 0 in the matrix we get at the end of this derivation for the other values of v, making the final matrix valid for all nonnegative v.
For the nonzero v, this combination of function must be a universal constant, one and the same for all inertial frames. Define this constant as δ(v)/vγ(v) = κ, where κ has the dimension of 1/v2. Solving
we finally get
and thus the transformation matrix, consistent with the group axioms, is given by
If κ > 0, then there would be transformations (with κv2 ≫ 1)
which transform time into a spatial coordinate and vice versa. We
exclude this on physical grounds, because time can only run in the
positive direction. Thus two types of transformation matrices are
consistent with group postulates:
with the universal constant κ = 0, and
with κ < 0.
Galilean transformations
If κ = 0 then we get the Galilean-Newtonian kinematics with the Galilean transformation,
where time is absolute, t′ = t, and the relative velocity v of two inertial frames is not limited.
Lorentz transformations
If κ < 0, then we set which becomes the invariant speed, the speed of light in vacuum. This yields κ = −1/c2 and thus we get special relativity with Lorentz transformation
where the speed of light is a finite universal constant determining the
highest possible relative velocity between inertial frames.
If v ≪ c the Galilean transformation is a good approximation to the Lorentz transformation.
Only experiment can answer the question which of the two possibilities, κ = 0 or κ < 0, is realized in our world. The experiments measuring the speed of light, first performed by a Danish physicist Ole Rømer, show that it is finite, and the Michelson–Morley experiment showed that it is an absolute speed, and thus that κ < 0.
Boost from generators
Using rapidity ϕ to parametrize the Lorentz transformation, the boost in the x direction is
likewise for a boost in the y-direction
and the z-direction
where ex, ey, ez are the Cartesian basis vectors, a set of mutually perpendicular unit vectors along their indicated directions. If one frame is boosted with velocity v relative to another, it is convenient to introduce a unit vectorn = v/v = β/β in the direction of relative motion. The general boost is
Notice the matrix depends on the direction of the relative motion
as well as the rapidity, in all three numbers (two for direction, one
for rapidity).
We can cast each of the boost matrices in another form as follows. First consider the boost in the x direction. The Taylor expansion of the boost matrix about ϕ = 0 is
where the derivatives of the matrix with respect to ϕ are given by differentiating each entry of the matrix separately, and the notation |ϕ = 0 indicates ϕ is set to zero after the derivatives are evaluated. Expanding to first order gives the infinitesimal transformation
which is valid if ϕ is small (hence ϕ2 and higher powers are negligible), and can be interpreted as no boost (the first term I is the 4×4 identity matrix), followed by a small boost. The matrix
is the generator of the boost in the x direction, so the infinitesimal boost is
Now, ϕ is small, so dividing by a positive integer N gives an even smaller increment of rapidity ϕ/N, and N of these infinitesimal boosts will give the original infinitesimal boost with rapidity ϕ,
In the limit of an infinite number of infinitely small steps, we obtain the finite boost transformation
Repeating the process for the boosts in the y and z directions obtains the other generators
and the boosts are
For any direction, the infinitesimal transformation is (small ϕ and expansion to first order)
where
is the generator of the boost in direction n. It is the full boost generator, a vector of matrices K = (Kx, Ky, Kz), projected into the direction of the boost n. The infinitesimal boost is
Then in the limit of an infinite number of infinitely small steps, we obtain the finite boost transformation
which is now true for any ϕ. Expanding the matrix exponential of −ϕ(n ⋅ K) in its power series
we now need the powers of the generator. The square is
but the cube (n ⋅ K)3 returns to (n ⋅ K), and as always the zeroth power is the 4×4 identity, (n ⋅ K)0 = I. In general the odd powers n = 1, 3, 5, ... are
while the even powers n = 2, 4, 6, ... are
therefore the explicit form of the boost matrix depends only the
generator and its square. Splitting the power series into an odd power
series and an even power series, using the odd and even powers of the
generator, and the Taylor series of sinh ϕ and cosh ϕ about ϕ = 0 obtains a more compact but detailed form of the boost matrix
where 0 = −1 + 1 is introduced for the even power series to complete the Taylor series for cosh ϕ. The boost is similar to Rodrigues' rotation formula,
Negating the rapidity in the exponential gives the inverse transformation matrix,
Howard Percy Robertson and others showed that the Lorentz transformation can also be derived empirically. In order to achieve this, it's necessary to write down coordinate
transformations that include experimentally testable parameters. For
instance, let there be given a single "preferred" inertial frame in which the speed of light is constant, isotropic, and independent of the velocity of the source. It is also assumed that Einstein synchronization and synchronization by slow clock transport are equivalent in this frame. Then assume another frame
in relative motion, in which clocks and rods have the same internal
constitution as in the preferred frame. The following relations,
however, are left undefined:
differences in time measurements,
differences in measured longitudinal lengths,
differences in measured transverse lengths,
depends on the clock synchronization procedure in the moving frame,
then the transformation formulas (assumed to be linear) between those frames are given by:
depends on the synchronization convention and is not determined experimentally, it obtains the value by using Einstein synchronization in both frames. The ratio between and is determined by the Michelson–Morley experiment, the ratio between and is determined by the Kennedy–Thorndike experiment, and alone is determined by the Ives–Stilwell experiment. In this way, they have been determined with great precision to and , which converts the above transformation into the Lorentz transformation.