2647 words
13 minutes
Dark Energy Beyond Scalars, Part II: Gauge Symmetry, Mass, and Degrees of Freedom

This is the second post in a series on vector and 2-form dark energy. In the previous post, we motivated why cosmologists look beyond scalar fields and introduced the two theories we’ll study: a massive 1-form and a massless 2-form. Now we get into the physics of these theories — what their symmetries are, how many degrees of freedom they propagate, and what their equations of motion and stress-energy tensors look like.


What Gauge Symmetry Really Means#

Before computing anything, it’s worth being precise about what gauge symmetry is and why it matters for counting degrees of freedom.

A gauge symmetry is a transformation you can apply to the fields that changes their mathematical description but leaves all physical observables unchanged. It means your description has redundancy — multiple field configurations correspond to the same physical state.

The most familiar example is electromagnetism. The 4-potential AμA_\mu has four components, but the physics — electric and magnetic fields, forces on charges, radiation — is unchanged if you shift

AμAμ+μθ,A_\mu \to A_\mu + \partial_\mu \theta,

for any scalar function θ(x)\theta(x). This means that out of the four components of AμA_\mu, one is “pure gauge” — you can always set it to whatever you like by choosing θ\theta appropriately. It carries no physical information.

But gauge symmetry alone doesn’t determine the full count of physical degrees of freedom. You also have to check whether the equations of motion contain constraints — equations with no second-order time derivatives. A constraint doesn’t tell you how a field evolves; it tells you what the field must be, given the current state of everything else. Gauss’s law, E=ρ\nabla \cdot \vec{E} = \rho, is the prototype: it’s not an evolution equation for E\vec{E}, it’s an instantaneous relationship between the electric field and the charge distribution.

The general counting formula is:

propagating DOFs=field componentsgauge freedomsconstraints.\text{propagating DOFs} = \text{field components} - \text{gauge freedoms} - \text{constraints}.

Each gauge freedom removes one component (you can set it to zero by a gauge choice), and each independent constraint removes another (it’s determined by the remaining fields). Let’s now apply this to our two theories.


Theory 1: The Massive Vector#

Gauge Symmetry (or Lack Thereof)#

Recall the action:

S[A,g]=d4xg[14FμνFμνV(AμAμ)].S[A, g] = \int d^4x \sqrt{-g} \left[ -\frac{1}{4} F_{\mu\nu} F^{\mu\nu} - V(A_\mu A^\mu) \right].

Under the gauge transformation AμAμ+μθA_\mu \to A_\mu + \nabla_\mu \theta, the kinetic term FμνFμνF_{\mu\nu}F^{\mu\nu} is invariant — the field strength Fμν=μAννAμF_{\mu\nu} = \nabla_\mu A_\nu - \nabla_\nu A_\mu doesn’t change, for the same reason as in electromagnetism. But the potential term V(AμAμ)V(A_\mu A^\mu) is not invariant. The contraction AμAμA_\mu A^\mu changes when you shift AμA_\mu, and since VV depends on this contraction, the action changes.

This is not a technicality — it’s the defining feature of a massive vector theory. In electromagnetism, the gauge symmetry is what keeps the photon massless. Breaking it is what gives the field a mass. The simplest case, V(X)=12m2XV(X) = \frac{1}{2}m^2 X, adds the mass term 12m2AμAμ\frac{1}{2}m^2 A_\mu A^\mu to the Lagrangian, which is exactly the Proca mass term.

There is a close analogy with the Higgs mechanism in particle physics. The W and Z bosons are massive vector fields. They start life as massless gauge fields (with 2 DOFs each, like the photon), but the Higgs field breaks the gauge symmetry spontaneously, giving them mass and a third — longitudinal — polarization. In our theory, the potential V(AμAμ)V(A_\mu A^\mu) breaks the gauge symmetry explicitly rather than spontaneously, but the physical consequence is the same: an extra degree of freedom appears.

Equations of Motion#

To find the equations of motion, we vary the action with respect to AμA_\mu while holding the metric fixed. Writing X=AμAμX = A_\mu A^\mu and VX=dV/dXV_X = dV/dX, the variation gives:

δAS=d4xg[αFαβ2VXAβ]δAβ.\delta_A S = \int d^4x \sqrt{-g} \left[ \nabla_\alpha F^{\alpha\beta} - 2 V_X A^\beta \right] \delta A_\beta.

The key steps are: (i) varying the kinetic term produces Fμν[μδAν]F^{\mu\nu} \nabla_{[\mu} \delta A_{\nu]}, (ii) integration by parts moves the derivative onto FμνF^{\mu\nu}, and (iii) the metric compatibility αgμν=0\nabla_\alpha g_{\mu\nu} = 0 lets us raise indices freely. Setting the variation to zero for arbitrary δAβ\delta A_\beta:

αFαβ=2VXAβ.\boxed{\nabla_\alpha F^{\alpha\beta} = 2 V_X A^\beta.}

This is the generalized Proca equation. For V(X)=12m2XV(X) = \frac{1}{2}m^2 X, it reduces to αFαβ=m2Aβ\nabla_\alpha F^{\alpha\beta} = m^2 A^\beta, the standard Proca equation.

Compare this with the Maxwell equation αFαβ=0\nabla_\alpha F^{\alpha\beta} = 0 (in vacuum). The right-hand side is what’s new — it acts as a source term generated by the field itself, proportional to AβA^\beta. This self-sourcing is the hallmark of a massive field.

The Constraint: Where the Missing DOF Goes#

Now we count degrees of freedom. Set β=0\beta = 0 in the equation of motion and expand:

αFα0=2VXA0.\nabla_\alpha F^{\alpha 0} = 2 V_X A^0.

The α=0\alpha = 0 term vanishes by antisymmetry (F00=0F^{00} = 0), leaving only spatial contributions:

iFi0=2VXA0.\nabla_i F^{i0} = 2 V_X A^0.

Now here is the critical point. Expand Fi0F^{i0}: it involves iA00Ai\nabla^i A^0 - \nabla^0 A^i. When you take i\nabla_i of this, you get terms involving spatial derivatives of A0A^0 and first-order time derivatives of AiA^i. There are no second-order time derivatives anywhere in this equation.

This means the β=0\beta = 0 equation is not a dynamical equation — it’s a constraint. It determines A0A^0 in terms of the spatial components AiA^i and their first time derivatives, just like Gauss’s law determines the longitudinal part of the electric field in terms of the charge distribution. A0A^0 is not an independent dynamical variable; it’s enslaved to the spatial components.

The counting:

4components of Aμ0gauge freedoms1constraint=3 propagating DOFs.\underbrace{4}_{\text{components of } A_\mu} - \underbrace{0}_{\text{gauge freedoms}} - \underbrace{1}_{\text{constraint}} = 3 \text{ propagating DOFs.}

Physical Meaning of the Three Modes#

What are these three degrees of freedom? To build intuition, consider a massive vector field in flat spacetime with a plane wave solution AμϵμeikxA_\mu \propto \epsilon_\mu \, e^{ik \cdot x}, where ϵμ\epsilon_\mu is the polarization vector. For a massless photon, gauge symmetry and the constraint together restrict you to two transverse polarizations — the electric field oscillates perpendicular to the direction of propagation.

For a massive field, the gauge symmetry is gone. You still have the two transverse polarizations, but now there’s a third: the longitudinal polarization, where the field oscillates along the direction of propagation. This mode exists because the field has a rest frame — you can Lorentz-boost to a frame where the massive particle is at rest, and there the three polarizations correspond to the three spatial directions. A massless particle has no rest frame (it travels at the speed of light), which is why it can’t support a longitudinal mode.

In our cosmological context, we’ll see that all three modes — two transverse vector perturbations δAiT\delta A_i^T and one longitudinal scalar perturbation AA — propagate on top of the cosmological background.

Stress-Energy Tensor#

The stress-energy tensor comes from varying the action with respect to the metric:

Tμν=2gδSδgμν.T_{\mu\nu} = -\frac{2}{\sqrt{-g}} \frac{\delta S}{\delta g^{\mu\nu}}.

The computation has two pieces. The variation of g\sqrt{-g} produces the familiar 12gμνL-\frac{1}{2}g_{\mu\nu} \mathcal{L} term. The variation of the kinetic term, after careful contraction of indices, gives terms involving FμαFανF_{\mu\alpha}F^{\alpha}{}_{\nu}. The result is:

Tμν=FμαFαν14gμνFαβFαβ+2VXAμAνgμνV(X).\boxed{T_{\mu\nu} = F_{\mu\alpha} F^{\alpha}{}_{\nu} - \frac{1}{4}g_{\mu\nu} F_{\alpha\beta}F^{\alpha\beta} + 2V_X A_\mu A_\nu - g_{\mu\nu} V(X).}

The first two terms are identical to the electromagnetic stress-energy tensor. The last two terms are the contribution from the potential — they vanish in Maxwell’s theory, where V=0V = 0.

A useful sanity check: for V=0V = 0, the trace Tμμ=0T^\mu{}_\mu = 0 — the electromagnetic stress-energy tensor is traceless, as it must be for a conformal (massless) theory. With V0V \neq 0, the trace is nonzero, reflecting the fact that a massive theory has an intrinsic scale and breaks conformal symmetry.


Theory 2: The Massless 2-Form#

Gauge Symmetry#

The action:

S[B,g]=d4xg  f(X),X=112HμνρHμνρ.S[B, g] = \int d^4x \sqrt{-g} \; f(X), \qquad X = -\frac{1}{12} H_{\mu\nu\rho} H^{\mu\nu\rho}.

Under the transformation

BμνBμν+μθννθμ,B_{\mu\nu} \to B_{\mu\nu} + \partial_\mu \theta_\nu - \partial_\nu \theta_\mu,

the field strength HμνρH_{\mu\nu\rho} is invariant. The mechanism is exactly the same as for electromagnetism, one level up. The added term μθννθμ\partial_\mu \theta_\nu - \partial_\nu \theta_\mu contributes terms like ρ(μθννθμ)\partial_\rho(\partial_\mu \theta_\nu - \partial_\nu \theta_\mu) to HρμνH_{\rho\mu\nu}. Since HH is totally antisymmetric while ρμ\partial_\rho \partial_\mu is symmetric, these contributions vanish identically.

But there’s a subtlety: the gauge parameter θμ\theta_\mu has its own redundancy. Two different gauge parameters can produce the same transformation of BμνB_{\mu\nu}. Specifically, shifting

θμθμ+μφ\theta_\mu \to \theta_\mu + \partial_\mu \varphi

doesn’t change μθννθμ\partial_\mu \theta_\nu - \partial_\nu \theta_\mu at all (again because μνφ\partial_\mu \partial_\nu \varphi is symmetric). This is a gauge symmetry of the gauge symmetry — sometimes called a reducible gauge symmetry in the literature.

Why does this matter? Because it affects the DOF counting. Naively, the gauge parameter θμ\theta_\mu has 4 components, so you might think you can gauge away 4 of the 6 components of BμνB_{\mu\nu}. But the residual freedom φ\varphi means that one of those 4 gauge transformations is redundant — it doesn’t actually change anything. So the effective gauge freedom is 41=34 - 1 = 3.

This structure generalizes. For pp-form gauge fields in DD dimensions, there is a tower of gauge symmetries: the (p1)(p-1)-form gauge parameter has its own (p2)(p-2)-form gauge parameter, and so on, all the way down. This tower is the de Rham complex of differential forms, and its structure determines the physical content of the theory. For our 2-form, the tower has just two levels: φθμBμν\varphi \to \theta_\mu \to B_{\mu\nu}.

Equations of Motion#

The variation with respect to BμνB_{\mu\nu} proceeds similarly to the vector case. Writing fX=df/dXf_X = df/dX:

δBS=d4xg[16HαβρfX(αδBβρ+βδBρα+ρδBαβ)].\delta_B S = \int d^4x \sqrt{-g} \left[ -\frac{1}{6} H^{\alpha\beta\rho} f_X \left( \nabla_\alpha \delta B_{\beta\rho} + \nabla_\beta \delta B_{\rho\alpha} + \nabla_\rho \delta B_{\alpha\beta} \right) \right].

The total antisymmetry of HαβρH^{\alpha\beta\rho} means the three terms in the bracket are all equal (they just relabel dummy indices), giving a factor of 3:

δBS=12d4xg  HαβρfXαδBβρ.\delta_B S = -\frac{1}{2} \int d^4x \sqrt{-g} \; H^{\alpha\beta\rho} f_X \, \nabla_\alpha \delta B_{\beta\rho}.

Integration by parts moves the derivative off δBβρ\delta B_{\beta\rho}, using the metric-compatibility identity αg=0\nabla_\alpha \sqrt{-g} = 0:

δBS=12d4xg  α(HαβρfX)δBβρ.\delta_B S = \frac{1}{2} \int d^4x \sqrt{-g} \; \nabla_\alpha \left( H^{\alpha\beta\rho} f_X \right) \delta B_{\beta\rho}.

Setting this to zero:

α(HαμνfX)=0.\boxed{\nabla_\alpha \left( H^{\alpha\mu\nu} f_X \right) = 0.}

This is the generalized equation of motion for the massless 2-form. For the free theory f(X)=Xf(X) = X, it reduces to αHαμν=0\nabla_\alpha H^{\alpha\mu\nu} = 0, which is the direct analogue of the vacuum Maxwell equation αFαμ=0\nabla_\alpha F^{\alpha\mu} = 0.

Notice the structural parallel: Maxwell’s equation says the divergence of FF vanishes; the 2-form equation says the divergence of HfXHf_X vanishes. The fXf_X factor encodes the nonlinearity from the general f(X)f(X) action — it’s the analogue of the nonlinear dielectric function in Born-Infeld electrodynamics.

Counting Degrees of Freedom#

This is where the 2-form theory gets interesting. Let’s go step by step.

Step 1: Raw components. The antisymmetric BμνB_{\mu\nu} has 4×32=6\frac{4 \times 3}{2} = 6 independent components.

Step 2: Gauge freedom. The gauge parameter θμ\theta_\mu has 4 components, but the residual symmetry θμθμ+μφ\theta_\mu \to \theta_\mu + \partial_\mu \varphi means one of those is redundant. Effective gauge freedom: 41=34 - 1 = 3. So we can gauge-fix 3 of the 6 components, leaving 3 potentially physical components.

Step 3: Constraints. The equations of motion α(HαμνfX)=0\nabla_\alpha(H^{\alpha\mu\nu} f_X) = 0 contain constraint equations — equations with at most first-order time derivatives. Due to the antisymmetry of HαμνH^{\alpha\mu\nu}, only the components BxyB_{xy}, BxzB_{xz}, ByzB_{yz} (the purely spatial ones) can appear with second-order time derivatives. The remaining equations are constraints.

How many independent constraints are there? There are three constraint equations, but only two are independent — the third is automatically satisfied if the first two hold. (This is analogous to how, in electromagnetism, the time derivative of Gauss’s law is automatically satisfied by the other Maxwell equations — it’s a consequence of the Bianchi identity.)

Step 4: Final count.

632=1 propagating degree of freedom.6 - 3 - 2 = 1 \text{ propagating degree of freedom.}

This is exactly what Hodge duality predicts: a massless 2-form in 4D is dual to a scalar, and a scalar has 1 DOF.

The Constraint Structure in Physical Terms#

It’s worth pausing on what the constraints mean physically. In electromagnetism, the 4-potential AμA_\mu has 4 components. Gauge symmetry removes 1, the constraint (Gauss’s law) removes 1, leaving 2 propagating DOFs — the two photon polarizations. The constraint tells you that the longitudinal electric field isn’t free to do what it wants; it’s fixed by the charge distribution.

For the 2-form, the situation is more dramatic. Of the 6 components, gauge freedom eats 3 and constraints eat 2, leaving just 1. The constraints are telling you that almost all of the apparent richness of the 2-form field — six components, surfaces in every orientation — is either gauge redundancy or instantaneously determined. Only one degree of freedom actually propagates as a wave.

This is why Hodge duality is so powerful as a consistency check. If you go through the constraint analysis and get anything other than 1, you’ve made a mistake. The mathematics of differential forms guarantees the answer before you compute a single equation of motion.

Stress-Energy Tensor#

Varying the action with respect to gμνg^{\mu\nu} requires the identity δgg=12ggμνδgμν\delta_g \sqrt{-g} = -\frac{1}{2}\sqrt{-g}\, g_{\mu\nu}\, \delta g^{\mu\nu} and careful handling of the three inverse metrics needed to form X=112gακgβσgρλHαβρHκσλX = -\frac{1}{12}g^{\alpha\kappa}g^{\beta\sigma}g^{\rho\lambda}H_{\alpha\beta\rho}H_{\kappa\sigma\lambda}. Each metric contributes a term under variation, but by relabeling dummy indices all three terms turn out to be identical (a factor of 3 that cancels the 1/121/12 normalization, leaving 1/41/4). The result:

Tμν=gμνf(X)+12fX(X)gβσgρλHμβρHνσλ.\boxed{T_{\mu\nu} = g_{\mu\nu}\, f(X) + \frac{1}{2} f_X(X)\, g^{\beta\sigma} g^{\rho\lambda} H_{\mu\beta\rho}\, H_{\nu\sigma\lambda}.}

With one index raised:

Tμν=δνμf(X)+12fX(X)gβσgρλgκνHμβρHκσλ.T^{\mu}{}_{\nu} = \delta^{\mu}_{\nu}\, f(X) + \frac{1}{2} f_X(X)\, g^{\beta\sigma} g^{\rho\lambda} g^{\kappa\nu} H_{\mu\beta\rho}\, H_{\kappa\sigma\lambda}.

Compare this with the vector stress-energy tensor. Both have a “potential” piece proportional to gμνg_{\mu\nu} and a “kinetic” piece built from contractions of field strengths. But there’s an important structural difference: the vector has the Maxwell stress-energy tensor (which is traceless) plus potential corrections, while the 2-form has a single f(X)f(X) term that plays both roles — since ff is a general function, the split between kinetic and potential energy is not sharp.


Side by Side#

Let’s summarize the two theories:

Massive 1-FormMassless 2-Form
FieldAμA_\mu (4 components)BμνB_{\mu\nu} (6 components)
Field strengthFμν=μAννAμF_{\mu\nu} = \nabla_\mu A_\nu - \nabla_\nu A_\muHμνρ=μBνρ+cyclicH_{\mu\nu\rho} = \partial_\mu B_{\nu\rho} + \text{cyclic}
Gauge symmetryNone (broken by VV)BμνBμν+2[μθν]B_{\mu\nu} \to B_{\mu\nu} + 2\partial_{[\mu}\theta_{\nu]}
Residual gaugeθμθμ+μφ\theta_\mu \to \theta_\mu + \partial_\mu \varphi
Gauge freedoms03 (= 4 − 1)
Constraints12
Propagating DOFs31
EOMαFαβ=2VXAβ\nabla_\alpha F^{\alpha\beta} = 2V_X A^\betaα(HαμνfX)=0\nabla_\alpha(H^{\alpha\mu\nu} f_X) = 0
Dual descriptionNo simple dualEquivalent to a scalar

The massive vector has more dynamical content than the massless 2-form, despite the 2-form having more raw components. This is entirely due to the gauge symmetry. The 2-form’s gauge freedom, together with its constraints, strips away five of the six components, leaving behind a single propagating scalar hidden inside an antisymmetric tensor. The massive vector, with no gauge symmetry to protect it, keeps three of its four components as dynamical fields.


A Note on the Bianchi Identity#

Both theories have a Bianchi identity, and it’s worth mentioning because it plays a crucial role in the next post.

For the 1-form, the identity is

[μFνρ]=0,\nabla_{[\mu} F_{\nu\rho]} = 0,

which follows from F=dAF = dA and d2=0d^2 = 0. In 4D, this is equivalent to μF~μ=0\nabla_\mu \tilde{F}^{\mu} = 0, where F~μ=12ϵμνρσFρσ\tilde{F}^\mu = \frac{1}{2}\epsilon^{\mu\nu\rho\sigma}F_{\rho\sigma} is the Hodge dual. In electromagnetism, this encodes the two homogeneous Maxwell equations: B=0\nabla \cdot \vec{B} = 0 and ×E+tB=0\nabla \times \vec{E} + \partial_t \vec{B} = 0.

For the 2-form, the identity is

[βHαμν]=0,\partial_{[\beta} H_{\alpha\mu\nu]} = 0,

which follows from H=dBH = dB and d2=0d^2 = 0. In 4D this is a single equation (since a totally antisymmetric 4-index object has only one independent component in 4 dimensions). When we specialize to the cosmological background, this identity will force the only surviving component of HH to be constant in time — a constraint that profoundly shapes the background dynamics.


Coming Up#

We now have the full machinery: gauge symmetry, DOF counting, equations of motion, and stress-energy tensors for both theories. In the next post, we put these theories to work in cosmology. We’ll impose the cosmological principle — homogeneity and isotropy — and discover that the massive vector and massless 2-form respond to it in fundamentally different ways. The vector’s spatial components are killed outright, leaving behind only a (negative) cosmological constant. The 2-form finds a more creative solution, threading a constant flux through space while keeping its stress-energy tensor perfectly isotropic. Whether either theory can actually drive accelerated expansion is the question Post 3 will answer.


Conventions: metric signature (+,,,)(+,-,-,-), natural units c==1c = \hbar = 1. We write VX=dV/dXV_X = dV/dX and fX=df/dXf_X = df/dX for derivatives of the potential and the function ff, respectively.

Dark Energy Beyond Scalars, Part II: Gauge Symmetry, Mass, and Degrees of Freedom
https://rohankulkarni.me/posts/notes/dark-energy-p-forms-part2/
Author
Rohan Kulkarni
Published at
2025-12-20
License
CC BY-NC-SA 4.0