Lorentz transformation


A Lorentz transformation is a Coordinate transformation between inertial frames in spacetime. A transformation between a frame of reference (x,y,z,t)(x,y,z,t) to (x,y,z,t)(x',y',z',t') states

{x=γ(xβct)y=yz=zt=γ(tβcx)\left\{\begin{align} &x'=\gamma(x-\beta ct) \\ &y'=y \\ &z'=z \\ &t'=\gamma\left( t- \frac{\beta}{c}x \right) \end{align}\right.

with parameters

γ11β2,βvc\gamma\equiv \frac{1}{\sqrt{ 1-\beta ^{2} }},\qquad \beta\equiv\frac{v}{c}

In the language of proper velocity, vv would be the ordinary velocity. Conventionally, an object is said to be relativistic when β\beta is not almost-zero or γ>1\gamma>1. In other words, the object's speed must be a significant fraction of the speed of light. Alternatively, in common four-vector notation (ct,x,y,z)(x0,x1,x2,x3)(ct,x,y,z)\equiv(x^{0},x^{1},x^{2},x^{3}):

{xˉ0=γ(x0βx1)xˉ1=γ(x1βx0)xˉ2=x2xˉ3=x3\left\{\begin{align} &\bar{x}^{0}=\gamma(x^{0}-\beta x^{1}) \\ &\bar{x}^{1}=\gamma(x^{1}- \beta x^{0} ) \\ &\bar{x}^{2}=x^{2} \\ &\bar{x}^{3}=x^{3} \end{align}\right.

where the overbar now denotes the new frame. Here, time is expressed in units of ctct instead of tt, meaning that it is measured in meters and interpreted as the time it takes for light to travel that many meters.

Matrix form

Lorentz transformations can also be written in matrix notation:

xˉ=Λx\bar{x}=\Lambda x

where

Λ(β)=(γβγ00βγγ0000100001),x=(x0x1x2x3),xˉ=(xˉ0xˉ1xˉ2xˉ3)\Lambda(\beta)=\begin{pmatrix}\gamma & -\beta \gamma & 0 & 0 \\ -\beta \gamma & \gamma & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1\end{pmatrix},\qquad x=\begin{pmatrix} x^{0} \\ x^{1} \\ x^{2} \\ x^{3} \end{pmatrix},\qquad \bar{x}=\begin{pmatrix} \bar{x}^{0} \\ \bar{x}^{1} \\ \bar{x}^{2} \\ \bar{x}^{3} \end{pmatrix}

Λ\Lambda is known as the Lorentz transformation matrix. The convenience of this form lies in that, besides being more compact, it also makes it easier to handle transformations that are not on a common x,xˉx,\bar{x} axis, as you'd just change the matrix accordingly. Each individual component is then written

xˉμ=ν=03ΛνμxνΛνμxν\bar{x}^{\mu}=\sum_{\nu=0}^{3} \Lambda_{\nu}^{\mu}x^{\nu}\equiv \Lambda_{\nu}^{\mu}x^{\nu}

where the second equality is using Einstein notation for implicit summation.

The matrix has a few properties:

  • γ2β2γ2=1\gamma ^{2}-\beta ^{2}\gamma ^{2}=1.
  • Its determinant is 1.
  • Its inverse is equal to itself with an inverted argument: Λ1(β)=Λ(β)\Lambda^{-1}(\beta)=\Lambda(-\beta).

Derivation

Say we have an event EE at coordinates (x,y,z,t)(x,y,z,t) in some inertial system S\mathcal{S}. We want to express this event in another inertial frame S\mathcal{S}', where it'll have coordinates (x,y,z,t)(x',y',z',t'). We are looking for the coordinate transformation that will convert between the two. This space is four dimensional, so a complete visualization is not possible. However, think of it like this: since we have inertial frames, the origins can move at some constant velocity. Let's say that at time t=0t=0, both S\mathcal{S} and S\mathcal{S}' coincide. As time goes on, the two systems will in general move away from each other, at some constant velocity v\mathbf{v}. Say this velocity is on the xx axis (if it's not, rotate everything until it is). Then, over some time tt, the origin of S\mathcal{S}' will move a distance vtvt away from the origin of S\mathcal{S} on the xx axis (the other coordinates don't change). The event coordinates will of course be connected by

x=x+vtx=x'+vt

and so the coordinate transformation is

{x=xvty=yz=zt=t\begin{cases} x'=x-vt \\ y'=y \\ z'=z \\ t'=t \end{cases}

These are the old Galilean transformations that applied to Galilean relativity. As you may guess from this nomenclature, these start to collapse when introducing special relativity; the cause is the postulates of special relativity and more specifically setting a maximum speed in the universe. Relativity of simultaneity and time dilation can't realistically permit something as naive as t=tt'=t and length contraction certainly invalidates the snarky "of course" regarding x=x+vtx=x'+vt. On the other hand, y=yy'=y and z=zz'=z are safe since length contraction occurs only on the axis of motion, which we claimed to be on xx.

The issue in the previous derivation is in being a little too sure that x=x+vtx=x'+vt. More generally, say we have x=d+vtx=d+vt, where dd is the distance measured between the origin of S\mathcal{S}' and the xx-component of EE. If length contraction weren't a thing, dd would certainly be xx' and our results above would hold. But length contraction is indeed a thing, so we must take it into account:

d=xγx=d+vt=xγ+vtx=γ(xvt)d=\frac{x'}{\gamma}\quad\Rightarrow \quad x=d+vt=\frac{x'}{\gamma}+vt\quad\Rightarrow \quad x'=\gamma(x-vt)

The same argument could be done in reverse to go from S\mathcal{S}' to S\mathcal{S}, just this time we'll have x=dvtx'=d'-vt', where dd' is the distance between the origin of S\mathcal{S} and the xx component of EE. In classical physics, this is again d=xd'=x, but in relativity, we must consider contraction, so

d=xγx=dvt=xγvtx=γ(x+vt)d'=\frac{x}{\gamma}\quad\Rightarrow \quad x'=d'-vt'=\frac{x}{\gamma}-vt'\quad\Rightarrow \quad x=\gamma(x'+vt')

which shouldn't be surprising, since relativistic effects are symmetrical between observer. We can now substitute these results into the Galilean transformations and solve for tt' to obtain

{x=γ(xvt)y=yz=zt=γ(tvc2x)\boxed{\left\{\begin{align} &x'=\gamma(x-vt) \\ &y'=y \\ &z'=z \\ &t'=\gamma\left( t- \frac{v}{c^{2}}x \right) \end{align}\right.}

These are the Lorentz transformations for spacetime coordinates (or a singular Lorentz transformation if you are talking the change as a whole). The reverse transformation is just a matter of changing the primes and inverting the minuses into pluses:

{x=γ(x+vt)y=yz=zt=γ(t+vc2x)\left\{\begin{align} &x=\gamma(x'+vt') \\ &y=y' \\ &z=z' \\ &t=\gamma\left( t'+ \frac{v}{c^{2}}x' \right) \end{align}\right.

Defining βv/c\beta\equiv v/c, they can also be written as

{x=γ(xβct)y=yz=zt=γ(tβcx)\left\{\begin{align} &x'=\gamma(x-\beta ct) \\ &y'=y \\ &z'=z \\ &t'=\gamma\left( t- \frac{\beta}{c}x \right) \end{align}\right.

Note how when vcv\ll c, then γ1\gamma\to 1 and β0\beta\to 0 and these go back to being the usual Galilean transformations.

As a hyperbolic rotation

A Lorentz transformation is a hyperbolic rotation in spacetime, the angle of which is called the rapidity yy:

\pmatrix{ct' \\ x' \\ y' \\ z'}=\pmatrix{\cosh y & -\sinh y & 0 & 0 \\ -\sinh y & \cosh y & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1}\pmatrix{ct \\ x \\ y \\ z}

Experimental frames

In experimental physics that deals with special relativity, it's common to reference two well-known frames of reference.

  1. The laboratory frame is fixed to the observer (and detectors). It shows what the detector or scientist sees when observing the phenomenon. It's particularly useful when the one of the elements of the system is at rest with respect to us, like a stationary target in a particle accelerator. In that case, its momentum vanishes and its relativistic energy is purely rest energy.
  2. The center-of-momentum frame is the frame in which the total momentum (but not four-momentum!) of the system is zero. This provides easy access to the invariant mass of the system. The center-of-mass frame is a special case in which the center of mass is in the origin.