Prerequisites: Part I — Lie Groups and Lie Algebras, familiarity with index notation for vectors and matrices, and special relativity at the level of knowing what a Lorentz boost is.
In the previous post, we built the general machinery of Lie groups: generators, structure constants, representations, and the exponential map. Now we put that machinery to work on the group that underpins all of relativistic physics — the Lorentz group.
The Lorentz group is, in a sense, the answer to the question: what are all the linear transformations that preserve the structure of spacetime? Everything that follows in quantum field theory — spinors, the Dirac equation, gauge theories — is built on the representation theory of this group. So it’s worth understanding it carefully.
Definition: The Orthogonal Group
Before jumping to spacetime, let’s set up the general framework.
Consider a space with coordinates . The group of transformations that leaves invariant the quadratic form
is called the orthogonal group .
When all signs are the same (), this is just the familiar rotation group preserving . The interesting physics happens when the signs are mixed — when some coordinates enter with a plus and others with a minus. This indefinite signature is exactly what spacetime has.
Definition: The Lorentz Group
The Lorentz group is defined as the group of linear coordinate transformations
that leave the following quantity invariant:
This is the spacetime interval. We’re working in the mostly-minus convention , and we’ll stick with this throughout.
Hence:
The "" counts the spatial dimensions (entering with a minus sign in the quadratic form), and the "" counts the time dimension (entering with a plus). The ordering vs. is a convention — some authors write it the other way. What matters is the signature: one plus and three minuses.
Lorentz Invariance of the Minkowski Metric
What condition must the matrix satisfy to be a valid Lorentz transformation? We require that the spacetime interval is invariant:
Substituting :
where in the last step we used the fact that are just dummy variables — the equality must hold for the coefficients. Since this must be true for any , we can strip off the ‘s:
In matrix notation, this is:
This is the defining equation of the Lorentz group. Compare this to the orthogonal group , whose defining equation is — same structure, but with the Minkowski metric replacing the Euclidean metric .
A note on index gymnastics: The metric is the object that raises and lowers indices — it converts contravariant (upper) indices to covariant (lower) indices and vice versa. In the equation above, can also change the indices on itself, as demonstrated by the contraction on the right-hand side.
Segregation of the Lorentz Group
The defining equation places constraints on that split the Lorentz group into four disconnected components. Let’s see how.
By Determinant: Proper vs. Improper
Take the determinant of both sides of :
Since and , we can divide through:
This gives us two classes:
-
: Proper Lorentz transformations. These form the subgroup . The "" stands for special, meaning unit determinant — the same convention as for rotations.
-
: Improper Lorentz transformations. These include parity, time reversal, and combinations thereof. They cannot be continuously connected to the identity (you can’t smoothly go from to ), so they don’t have a Lie algebra description — they are discrete.
By the 00-Component: Orthochronous vs. Non-Orthochronous
Now consider the component of :
This gives us:
Since , we must have either or . There is no middle ground — this is a discrete split:
- : Orthochronous — the transformation preserves the direction of time.
- : Non-orthochronous — the transformation reverses the direction of time.
The Four Components
Combining these two binary choices, the Lorentz group splits into four disconnected components:
| Component | Contains | Example | ||
|---|---|---|---|---|
| Identity | Rotations, boosts | |||
| Combined parity + time reversal | ||||
| Parity (spatial inversion) | ||||
| Time reversal |
Only — the proper orthochronous Lorentz group — is connected to the identity. This is the component that has a Lie algebra, and when physicists say “the Lorentz group” without qualification, they almost always mean this component:
The other three components are obtained by applying the discrete transformations , , or to elements of .
Non-Orthochronous Transformations
When , the transformation reverses the direction of time. Any non-orthochronous transformation can be written as an orthochronous transformation composed with a discrete inversion. The relevant discrete operations are:
- Time reversal : , which has and
- Combined : , which has and
Improper Lorentz Transformations
Transformations with are called improper. Any improper transformation can be written as a proper transformation () composed with a discrete transformation. Examples include:
- Parity: — flips all spatial coordinates, , orthochronous
- Single-axis reflection: — flips one spatial axis, , orthochronous
- Time reversal: — flips time, , non-orthochronous
Notice that parity is improper but orthochronous (), while time reversal is both improper and non-orthochronous. These are genuinely different — they live in different disconnected components of the Lorentz group.
Lorentz Group Representations
Now we apply the representation theory from Part I. Recall: a set of objects (where ) transforms in a representation of dimension of the Lorentz group if, under a Lorentz transformation:
Here, is a matrix representation of the abstract Lorentz group element. The are the Lorentz generators in the representation , and they are matrices. The parameters are antisymmetric () — an antisymmetric matrix has independent components, corresponding to three rotations and three boosts.
For infinitesimal transformations ( small), we expand the exponential to first order:
The pair labels which generator (which rotation or boost), while are the matrix indices of that generator in the representation . The explicit form of as an matrix depends on which representation we are considering.
Let’s now work through the representations one by one.
Scalar Representation
For a scalar , the index takes only one value (), so this is a 1-dimensional representation. The generator is a matrix — a single number for each pair .
A scalar field is invariant under Lorentz transformations — it does not change:
(Invariant means the value doesn’t change. Contrast with covariant, which means it transforms in a well-defined way under certain rules — all representations are covariant, but only the scalar is invariant.)
Since in this representation:
A representation in which all generators are zero is a valid solution of the Lie algebra (both sides are trivially zero), and is called the trivial representation.
A typical Lorentz scalar in particle physics is the rest mass of a particle — all observers agree on its value regardless of their reference frame.
Vector Representation
A contravariant 4-vector transforms as:
and a covariant 4-vector transforms as:
with satisfying the Lorentz invariance condition . The spacetime coordinates and the four-momentum are the most important examples of contravariant 4-vectors.
This is a 4-dimensional representation: each generator is a matrix, denoted . The explicit form of the generator is:
This formula is antisymmetric in (as it must be, since ), and it’s the unique generator consistent with the infinitesimal form of .
To see this, consider an infinitesimal Lorentz transformation :
Comparing with the general formula and substituting the explicit generator, one can verify that the two expressions agree: . The circle closes.
Lorentz Transformations of 4-Vectors
Let’s now write down the explicit Lorentz transformations. A general Lorentz transformation depends on six parameters:
where are the three rapidity (boost) parameters and are the three rotation angles.
Boosts
A boost along the -axis can be written in terms of velocity or rapidity. In terms of velocity, where (in natural units with ) and :
Since , we can write where is the rapidity. The same transformation becomes:
The rapidity parameterization is nicer for several reasons: rapidities add under composition of collinear boosts (unlike velocities), and the hyperbolic functions make the analogy with rotations transparent — for rotations become for boosts.
Boosts in the and directions follow identically, with the and appearing in the appropriate row/column.
Boost Matrices
The explicit matrices for boosts along each axis are:
Boost along (-form and -form):
Boost along :
Boost along :
The pattern: the boost mixes the time component (row/column 0) with the spatial component in the boost direction, leaving the other two spatial components untouched. The sits on the diagonal and the on the off-diagonal — compare with rotations, where and play the same role.
Rotation Matrices
Rotations don’t touch the time component at all — they act purely in the spatial block:
Rotation about -axis:
Rotation about -axis:
Rotation about -axis:
These are just the familiar rotation matrices embedded in the lower-right block of a matrix, with the time-time component equal to 1 and all time-space components equal to 0.
Rotation and Boost Generators
Now we extract the generators by differentiating the finite transformations and evaluating at the identity (all parameters equal to zero).
Boost Generators
Using and :
Notice that the are symmetric matrices (the inner part, before the ). This reflects the fact that boosts are not unitary transformations — the Lorentz group is non-compact, and boosts push you along a hyperbola rather than around a circle.
Rotation Generators
Using and :
The are antisymmetric (the inner part, before the ). This reflects the fact that rotations are unitary — the rotation group is compact.
The Lorentz Algebra
These six generators satisfy the following commutation relations:
Each of these relations has a clear physical meaning:
-
: The rotation generators close among themselves and satisfy the algebra. This is simply the statement that are angular momenta — rotations form a subgroup.
-
: The boosts transform as a vector under rotations. If you rotate your coordinate system, the boost generators rotate accordingly. This is expected on physical grounds — a boost “in the -direction” should become a boost “in the -direction” under a rotation about .
-
: This is the crucial relation. The commutator of two boosts gives a rotation, not another boost. Boosts do not form a subgroup. The minus sign (compared to ) is physically significant: it’s the reason the Lorentz group is non-compact, the reason finite-dimensional unitary representations don’t exist, and ultimately the reason we need spinors and the Dirac equation.
If this third relation had a plus sign instead — — then would generate , a compact group with perfectly well-behaved finite-dimensional unitary representations. The minus sign makes all the difference.
Connecting to the Tensor Notation
The six generators and can be packaged into the antisymmetric tensor via:
This is useful because the Lorentz transformation takes the compact form , where the six independent components of are the three rotation angles and three boost rapidities.
Connection to What Comes Next
We now have the complete Lorentz algebra — six generators, their explicit matrix forms, and their commutation relations. But the (vector) representation is only one possibility. The Lie algebra admits infinitely many representations, and the physically most important ones are not the vector representation.
In the next post, we’ll complexify the Lorentz algebra by defining , which decomposes it into . This will reveal the spinorial representations — the and representations that describe left- and right-handed Weyl fermions. From there, we’ll construct Dirac and Majorana spinors, and finally arrive at the Poincaré group and the Wigner classification of particles by mass and spin.
The minus sign in will be the engine that drives everything.
Next post: Part III: Spinors, Fields, and the Representations That Matter
This post is based on my own self-study notes that I created in 2022 in order to get a deeper understanding of all of this.