603 words
3 minutes
The Density Matrix: Six Essential Properties

The density operator is the most general way to describe a quantum system. It handles both pure states (a definite quantum state) and mixtures (statistical ensembles of states) within a single framework — essential for open quantum systems, quantum statistical mechanics, and quantum information.

The density operator is defined as:

ρ=iwiα(i) ⁣α(i),iwi=1\rho = \sum_i w_i \left|\alpha^{(i)}\right\rangle\!\left\langle\alpha^{(i)}\right|, \qquad \sum_i w_i = 1

where the states α(i)|\alpha^{(i)}\rangle need not be orthogonal, and the wi0w_i \geq 0 are classical probabilities. A pure state has exactly one wi0w_i \neq 0. Otherwise it’s a mixture. Crucially, the wiw_i are not quantum amplitudes — the α(i)|\alpha^{(i)}\rangle themselves can already be superpositions.


1. Tr(ρ)=1\text{Tr}(\rho) = 1#

The matrix elements of ρ\rho in some basis i|i\rangle are:

ρij=iρj=kwkiα(k)α(k)j\rho_{ij} = \langle i|\rho|j\rangle = \sum_k w_k \langle i|\alpha^{(k)}\rangle\langle\alpha^{(k)}|j\rangle

Taking the trace (sum of diagonal elements):

Tr(ρ)=iρii=ikwkα(k)iiα(k)\text{Tr}(\rho) = \sum_i \rho_{ii} = \sum_i \sum_k w_k \langle\alpha^{(k)}|i\rangle\langle i|\alpha^{(k)}\rangle

Using completeness iii=1\sum_i |i\rangle\langle i| = \mathbf{1}:

Tr(ρ)=kwkα(k)α(k)=kwk=1\text{Tr}(\rho) = \sum_k w_k \langle\alpha^{(k)}|\alpha^{(k)}\rangle = \sum_k w_k = 1 \quad \blacksquare


2. Pure states: ρ2=ρ\rho^2 = \rho and Tr(ρ2)=1\text{Tr}(\rho^2) = 1#

For a pure state there is a single normalized state i|i\rangle with wi=1w_i = 1, so ρ=ii\rho = |i\rangle\langle i|. Then:

ρ2=iiii=ii=ρ\rho^2 = |i\rangle\langle i|i\rangle\langle i| = |i\rangle\langle i| = \rho \quad \blacksquare

And immediately:

Tr(ρ2)=Tr(ρ)=1\text{Tr}(\rho^2) = \text{Tr}(\rho) = 1

For a mixture, Tr(ρ2)<1\text{Tr}(\rho^2) < 1 — this is actually used as a measure of purity. The closer Tr(ρ2)\text{Tr}(\rho^2) is to 1, the more “pure-like” the state is.


3. Ensemble averages: [A]=Tr(ρA)[A] = \text{Tr}(\rho A)#

The ensemble average of an operator AA is defined as:

[A]=iwiα(i)Aα(i)[A] = \sum_i w_i \langle\alpha^{(i)}|A|\alpha^{(i)}\rangle

Working from the right-hand side Tr(ρA)\text{Tr}(\rho A), using the matrix element definition Aji=jAiA_{ji} = \langle j|A|i\rangle:

Tr(ρA)=ijρijAji=i,j,kwkα(k)jjAiiα(k)\text{Tr}(\rho A) = \sum_i \sum_j \rho_{ij} A_{ji} = \sum_{i,j,k} w_k \langle\alpha^{(k)}|j\rangle\langle j|A|i\rangle\langle i|\alpha^{(k)}\rangle

Summing over ii and jj using completeness twice:

Tr(ρA)=kwkα(k)Aα(k)=[A]\text{Tr}(\rho A) = \sum_k w_k \langle\alpha^{(k)}|A|\alpha^{(k)}\rangle = [A] \quad \blacksquare

This is powerful: any expectation value in any ensemble is just a single trace.


4. Cyclic property of trace, and basis independence#

Cyclic property: Tr(AB)=Tr(BA)\text{Tr}(AB) = \text{Tr}(BA).

Insert a complete set jϕjϕj=1\sum_j |\phi_j\rangle\langle\phi_j| = \mathbf{1} between AA and BB:

Tr(AB)=iψiABψi=i,jψiAϕjϕjBψi\text{Tr}(AB) = \sum_i \langle\psi_i|AB|\psi_i\rangle = \sum_{i,j}\langle\psi_i|A|\phi_j\rangle\langle\phi_j|B|\psi_i\rangle

Swap the scalar factors (they’re just numbers):

=i,jϕjBψiψiAϕj=jϕjBAϕj=Tr(BA)= \sum_{i,j}\langle\phi_j|B|\psi_i\rangle\langle\psi_i|A|\phi_j\rangle = \sum_j \langle\phi_j|BA|\phi_j\rangle = \text{Tr}(BA) \quad \blacksquare

For longer products this extends by induction: Tr(ABC)=Tr(BCA)=\text{Tr}(ABC\cdots) = \text{Tr}(BC\cdots A) = \cdots

Basis independence follows immediately. If UU is a unitary change of basis, then the trace in the new basis is Tr(UρU)=Tr(UUρ)=Tr(ρ)\text{Tr}(U\rho U^\dagger) = \text{Tr}(U^\dagger U \rho) = \text{Tr}(\rho), using the cyclic property.


5. Superposition \neq mixture: an explicit example#

This is arguably the most important conceptual point about the density matrix. Consider a spin-1/2 particle with ±Sz=±2|\pm\rangle \equiv |S_z = \pm\tfrac{\hbar}{2}\rangle.

The mixture ρmix=12(+++)\rho_\text{mix} = \tfrac{1}{2}(|+\rangle\langle+| + |-\rangle\langle-|):

ρmix=12(1000)+12(0001)=12(1001)=12\rho_\text{mix} = \frac{1}{2}\begin{pmatrix}1&0\\0&0\end{pmatrix} + \frac{1}{2}\begin{pmatrix}0&0\\0&1\end{pmatrix} = \frac{1}{2}\begin{pmatrix}1&0\\0&1\end{pmatrix} = \frac{\mathbf{1}}{2}

With Sx=2(0110)S_x = \frac{\hbar}{2}\begin{pmatrix}0&1\\1&0\end{pmatrix}:

[Sx]mix=Tr(ρmixSx)=4Tr(0110)=0[S_x]_\text{mix} = \text{Tr}(\rho_\text{mix} S_x) = \frac{\hbar}{4}\text{Tr}\begin{pmatrix}0&1\\1&0\end{pmatrix} = 0

The superposition ψsup=12(++)|\psi_\text{sup}\rangle = \frac{1}{\sqrt{2}}(|+\rangle + |-\rangle):

ρsup=ψsupψsup=12(11)(11)=12(1111)\rho_\text{sup} = |\psi_\text{sup}\rangle\langle\psi_\text{sup}| = \frac{1}{2}\begin{pmatrix}1\\1\end{pmatrix}\begin{pmatrix}1&1\end{pmatrix} = \frac{1}{2}\begin{pmatrix}1&1\\1&1\end{pmatrix}

[Sx]sup=Tr(ρsupSx)=4Tr(1111)(0110)=4Tr(1111)=2[S_x]_\text{sup} = \text{Tr}(\rho_\text{sup} S_x) = \frac{\hbar}{4}\text{Tr}\begin{pmatrix}1&1\\1&1\end{pmatrix}\begin{pmatrix}0&1\\1&0\end{pmatrix} = \frac{\hbar}{4}\text{Tr}\begin{pmatrix}1&1\\1&1\end{pmatrix} = \frac{\hbar}{2}

So [Sx]mix=0[S_x]_\text{mix} = 0 but [Sx]sup=/2[S_x]_\text{sup} = \hbar/2 — the two states give different measurable predictions. The superposition has definite spin along xx; the mixture does not. The off-diagonal elements of ρ\rho (the coherences) encode quantum interference and are entirely absent in the mixture.


6. Equation of motion: the von Neumann equation#

Differentiating ρ=iwi(t)α(i)(t)α(i)(t)\rho = \sum_i w_i(t)|\alpha^{(i)}(t)\rangle\langle\alpha^{(i)}(t)| with respect to time using the product rule:

dρdt=idwidtα(i)α(i)+iwi(ddtα(i)α(i)+α(i)ddtα(i))\frac{d\rho}{dt} = \sum_i \frac{dw_i}{dt}|\alpha^{(i)}\rangle\langle\alpha^{(i)}| + \sum_i w_i\left(\frac{d}{dt}|\alpha^{(i)}\rangle\langle\alpha^{(i)}| + |\alpha^{(i)}\rangle\frac{d}{dt}\langle\alpha^{(i)}|\right)

Using the time-dependent Schrödinger equation iddtα(i)=Hα(i)i\hbar\frac{d}{dt}|\alpha^{(i)}\rangle = H|\alpha^{(i)}\rangle and its conjugate:

ddtα(i)=iHα(i),ddtα(i)=+iα(i)H\frac{d}{dt}|\alpha^{(i)}\rangle = -\frac{i}{\hbar}H|\alpha^{(i)}\rangle, \qquad \frac{d}{dt}\langle\alpha^{(i)}| = +\frac{i}{\hbar}\langle\alpha^{(i)}|H

Substituting:

dρdt=idwidtα(i)α(i)+(iHρ+iρH)\frac{d\rho}{dt} = \sum_i \frac{dw_i}{dt}|\alpha^{(i)}\rangle\langle\alpha^{(i)}| + \left(-\frac{i}{\hbar}H\rho + \frac{i}{\hbar}\rho H\right)

dρdt=idwidtα(i)α(i)i[H,ρ]\boxed{\frac{d\rho}{dt} = \sum_i \frac{dw_i}{dt}|\alpha^{(i)}\rangle\langle\alpha^{(i)}| - \frac{i}{\hbar}[H,\,\rho]}

If the weights wiw_i are time-independent (a closed system), the first term vanishes and we recover the von Neumann equation:

dρdt=i[H,ρ]\frac{d\rho}{dt} = -\frac{i}{\hbar}[H, \rho]

This is the quantum analogue of Liouville’s theorem in classical statistical mechanics. Note the sign: it’s i[H,ρ]-\frac{i}{\hbar}[H,\rho], opposite to the Heisenberg equation for operators dAdt=+i[H,A]\frac{dA}{dt} = +\frac{i}{\hbar}[H,A].

The Density Matrix: Six Essential Properties
https://rohankulkarni.me/posts/notes/density-matrix/
Author
Rohan Kulkarni
Published at
2023-07-20
License
CC BY-NC-SA 4.0