1507 words
8 minutes
Faddeev–Popov Quantization (Abelian), Part 1: The Overcounting Problem

When you try to quantize electromagnetism using the path integral, you immediately hit a wall: the kinetic operator has no inverse. This isn’t some exotic UV divergence — it’s a basic linear algebra problem caused by gauge redundancy. The Faddeev–Popov procedure is how you fix it. In this post (Part 1 of 2), we’ll understand why the problem exists and what gauge fixing needs to accomplish. In Part 2, we’ll actually do it.

We’ll work with the abelian case (U(1)U(1) gauge theory) throughout. The non-abelian generalization follows the same logic but is technically harder — we’ll get to that in a separate series.


Moving to Euclidean space#

Before doing anything with the path integral, we Wick-rotate to Euclidean space. This means sending x0=tiτx^0 = t \to -i\tau, which turns the oscillatory eiSe^{iS} into a damped eSEe^{-S_E} — much better behaved for path integral manipulations.

The gauge field rotates too. Since A0A_0 sits in the same slot as x0x^0, it picks up a factor of ii:

A0iA0EA_0 \to iA_0^E

while the spatial components AiA_i are unchanged. This means the Minkowski contraction AμAμ=A02Ai2A_\mu A^\mu = A_0^2 - A_i^2 becomes (A0E)2Ai2=AμEAμE-(A_0^E)^2 - A_i^2 = -A_\mu^E A_\mu^E in Euclidean space. The relative sign between time and space components disappears — Euclidean space has no distinction between “upper” and “lower” indices, and everything just contracts with δμν\delta_{\mu\nu}.

The field strength transforms similarly. Take F0i=0AiiA0F_{0i} = \partial_0 A_i - \partial_i A_0. After the rotation, each time derivative 0\partial_0 picks up an ii, and A0iA0EA_0 \to iA_0^E, giving F0iiF0iEF_{0i} \to iF_{0i}^E. The Euclidean action for a massless U(1)U(1) gauge field becomes:

SE[A]=14d4x  FμνEFμνES_E[A] = \frac{1}{4}\int d^4x \; F_{\mu\nu}^E F_{\mu\nu}^E

From here on, we’ll drop the EE superscripts — everything is Euclidean until further notice.


The operator you can’t invert#

Now let’s massage the action into a form that reveals the problem. The Lagrangian density is:

L=14FμνFμν=14(F0i2+Fij2)\mathcal{L} = \frac{1}{4}F_{\mu\nu}F_{\mu\nu} = \frac{1}{4}(F_{0i}^2 + F_{ij}^2)

We can write Fμν=μAννAμF_{\mu\nu} = \partial_\mu A_\nu - \partial_\nu A_\mu and expand. After integrating by parts (throwing away boundary terms), the action takes the form:

S[A]=12d4x  Aμ ⁣(δμνμν)AνS[A] = \frac{1}{2}\int d^4x \; A_\mu\!\left(\Box\,\delta_{\mu\nu} - \partial_\mu\partial_\nu\right)A_\nu

This is a quadratic action in AμA_\mu, which means the path integral is Gaussian. For a Gaussian integral to make sense, you need to invert the operator in the exponent — that inverse is the propagator. But the operator

Oμν=δμνμν\mathcal{O}_{\mu\nu} = \Box\,\delta_{\mu\nu} - \partial_\mu\partial_\nu

has a zero mode. To see this, hit it with μχ\partial_\mu\chi for any function χ\chi:

Oμννχ=μχμννχ=μχμχ=0\mathcal{O}_{\mu\nu}\,\partial_\nu\chi = \Box\,\partial_\mu\chi - \partial_\mu\partial_\nu\partial_\nu\chi = \Box\,\partial_\mu\chi - \partial_\mu\Box\chi = 0

The operator annihilates any pure gauge configuration μχ\partial_\mu\chi. A linear operator with zero modes has no inverse. No inverse means no propagator, and no propagator means the path integral

Z=DAμ  eS[A]Z = \int \mathcal{D}A_\mu \; e^{-S[A]}

is not well-defined. We’re stuck.

But why does this zero mode exist? It’s not an accident — it’s a direct consequence of gauge invariance. The action doesn’t change under AμAμ+μχA_\mu \to A_\mu + \partial_\mu\chi, so the operator encoding the action must kill the pure-gauge directions. The kernel of Oμν\mathcal{O}_{\mu\nu} is precisely the space of gauge transformations. This is the mathematical statement of the overcounting problem.


Two types of field configurations#

Let’s think about what the path integral DAμ\int \mathcal{D}A_\mu is actually summing over. Every field configuration Aμ(x)A_\mu(x) contributes to this integral. But there are really two types of contributions:

  1. Physically inequivalent configurations — these are the ones we want. They represent genuinely different states of the electromagnetic field, and summing over them gives the quantum behavior of the theory.

  2. Gauge copies — for each physical configuration AμA_\mu, there is an entire family {Aμ+μχ}\{A_\mu + \partial_\mu\chi\} of configurations related by gauge transformations. These all describe the same physics but the path integral counts each one separately.

The second type is the problem. We don’t just have one extra copy — we have one for every possible function χ(x)\chi(x), which is an infinite-dimensional family. The path integral includes a factor equal to the “volume” of this gauge orbit, and this volume is what makes the integral diverge.

What we’d like to do is split the measure:

DAμ=DAphys×DAgauge\int \mathcal{D}A_\mu = \int \mathcal{D}A_\text{phys} \times \int \mathcal{D}A_\text{gauge}

and just throw away the gauge part. The challenge is doing this correctly.


Gauge orbits: the geometric picture#

The cleanest way to think about this is geometrical. Consider the (infinite-dimensional) space of all gauge field configurations Aμ(x)A_\mu(x).

Pick a particular configuration AμA_\mu. Now act on it with every possible gauge transformation:

Aμχ=Aμ+μχA_\mu^\chi = A_\mu + \partial_\mu\chi

The set of all such AμχA_\mu^\chi as χ\chi varies is called the gauge orbit of AμA_\mu, denoted O(A)\mathcal{O}(A). Every point on this orbit is physically equivalent to AμA_\mu.

Now pick a different, physically inequivalent configuration AμA_\mu' — meaning there is no gauge transformation taking AμA_\mu to AμA_\mu'. It has its own orbit O(A)\mathcal{O}(A'), which doesn’t intersect O(A)\mathcal{O}(A).

The full field space is foliated into these non-intersecting orbits. Physics lives on the space of orbits, not on individual configurations.

Gauge orbits and gauge-fixing surfaces

The horizontal curves are gauge orbits — each one represents a family of gauge-equivalent configurations. The pink dots (AμA_\mu, AμA_\mu', AμA_\mu'') are physically inequivalent starting points, and the gauge transformations GT1,GT2,GT_1, GT_2, \ldots slide you along an orbit without changing the physics. The vertical surfaces GF1,GF2GF_1, GF_2 are gauge-fixing conditions — they slice through the orbits, picking out one representative from each.

A good gauge-fixing surface intersects each orbit exactly once. This is what we need: one configuration per orbit, no overcounting, no missing configurations. For U(1)U(1) gauge theory in the perturbative regime, we can assume such a unique intersection exists. (For non-abelian theories, this assumption fails — a problem known as Gribov copies, which we’ll address in the non-abelian series.)


The Lorenz condition as a gauge-fixing surface#

The most common choice of gauge-fixing surface is the Lorenz condition:

μAμ=0\partial_\mu A_\mu = 0

Why does this pick one point per orbit? Start with any configuration AμA_\mu and ask: can I find a χ\chi such that the transformed field satisfies the condition?

μAμχ=μAμ+χ=0\partial_\mu A_\mu^\chi = \partial_\mu A_\mu + \Box\chi = 0

This requires:

χ=μAμ\Box\chi = -\partial_\mu A_\mu

which is just Poisson’s equation for χ\chi. With appropriate boundary conditions, it has a unique solution. So for every orbit, there is exactly one configuration satisfying μAμ=0\partial_\mu A_\mu = 0 — the Lorenz condition is a good gauge-fixing surface.

More generally, we can impose

μAμ=G(x)\partial_\mu A_\mu = G(x)

for any fixed function G(x)G(x). The required gauge transformation is then χ=G(x)μAμ\Box\chi = G(x) - \partial_\mu A_\mu, which again has a unique solution. Different choices of GG give different gauge-fixing surfaces, but they all cut each orbit once, and physical results will be independent of this choice. This freedom in choosing GG will be important when we derive the gauge-fixed action in Part 2.


What we need to do (and why it’s subtle)#

So the goal is clear: restrict the path integral to one representative per gauge orbit. The naive approach would be to insert a delta function:

Z=?DAμ  δ(μAμG)  eS[A]Z \stackrel{?}{=} \int \mathcal{D}A_\mu \; \delta(\partial_\mu A_\mu - G) \; e^{-S[A]}

This forces the integral onto the gauge-fixing surface μAμ=G(x)\partial_\mu A_\mu = G(x), which is what we want. But there’s a catch.

When you restrict a multi-dimensional integral to a surface using a delta function, you pick up a Jacobian — the determinant of how fast the constraint changes as you move off the surface. In our case, this means: how fast does μAμχ\partial_\mu A_\mu^\chi change as you vary χ\chi? From the relation μAμχ=μAμ+χ\partial_\mu A_\mu^\chi = \partial_\mu A_\mu + \Box\chi, the relevant derivative is:

δ(μAμχ)δχ=\frac{\delta(\partial_\mu A_\mu^\chi)}{\delta\chi} = \Box

So the Jacobian factor is det()\det(\Box). Missing this factor gives the wrong answer.

Here’s the punchline for Part 1, and the key fact that makes the abelian case tractable: this determinant doesn’t depend on AμA_\mu. The operator \Box is just the d’Alembertian — it knows nothing about the gauge field. So det()\det(\Box) is a constant that can be pulled out of the path integral. This is a massive simplification.

In non-abelian theories, the analogous operator does depend on AμA_\mu, and det(M[A])\det(M[A]) cannot be pulled out. Dealing with this AA-dependent determinant is what introduces ghost fields — but that’s a story for the non-abelian series.


Conceptual summary#

Here’s what we’ve established:

  • The kinetic operator δμνμν\Box\,\delta_{\mu\nu} - \partial_\mu\partial_\nu has zero modes along gauge directions, making it non-invertible and the path integral ill-defined.
  • This is because the path integral overcounts: every physical configuration has an infinite family of gauge copies, and we integrate over all of them.
  • The space of field configurations is foliated into gauge orbits. Gauge fixing means choosing a surface that cuts each orbit exactly once.
  • The Lorenz condition μAμ=0\partial_\mu A_\mu = 0 is such a surface — existence and uniqueness of the required gauge transformation follows from invertibility of \Box.
  • Naively inserting a delta function to enforce the gauge condition misses a Jacobian factor det()\det(\Box), which in the abelian case is a harmless constant.

Looking ahead#

In Part 2, we’ll derive the Faddeev–Popov identity that correctly accounts for this Jacobian, insert it into the path integral, and use a beautiful Gaussian integral trick to convert the delta function into the familiar gauge-fixing term 12ξ(μAμ)2-\frac{1}{2\xi}(\partial_\mu A_\mu)^2 in the Lagrangian. We’ll then immediately read off the photon propagator and see how it depends on the gauge parameter ξ\xi.


Next up — Part 2: The Faddeev–Popov identity and the gauge-fixed photon propagator.


Source Material#

This post was inspired by course material from Heidelberg University QFT Tutorial 6. You can download the original notes:

📄 Tutorial 6: Faddeev-Popov Quantization (PDF)

Faddeev–Popov Quantization (Abelian), Part 1: The Overcounting Problem
https://rohankulkarni.me/posts/notes/faddeev-popov-abelian-part-1/
Author
Rohan Kulkarni
Published at
2026-04-03
License
CC BY-NC-SA 4.0