The density operator is the most general way to describe a quantum system. It handles both pure states (a definite quantum state) and mixtures (statistical ensembles of states) within a single framework — essential for open quantum systems, quantum statistical mechanics, and quantum information.
The density operator is defined as:
ρ = ∑ i w i ∣ α ( i ) ⟩ ⟨ α ( i ) ∣ , ∑ i w i = 1 \rho = \sum_i w_i \left|\alpha^{(i)}\right\rangle\!\left\langle\alpha^{(i)}\right|, \qquad \sum_i w_i = 1 ρ = ∑ i w i α ( i ) ⟩ ⟨ α ( i ) , ∑ i w i = 1
where the states ∣ α ( i ) ⟩ |\alpha^{(i)}\rangle ∣ α ( i ) ⟩ need not be orthogonal, and the w i ≥ 0 w_i \geq 0 w i ≥ 0 are classical probabilities. A pure state has exactly one w i ≠ 0 w_i \neq 0 w i = 0 . Otherwise it’s a mixture . Crucially, the w i w_i w i are not quantum amplitudes — the ∣ α ( i ) ⟩ |\alpha^{(i)}\rangle ∣ α ( i ) ⟩ themselves can already be superpositions.
1. Tr ( ρ ) = 1 \text{Tr}(\rho) = 1 Tr ( ρ ) = 1 # The matrix elements of ρ \rho ρ in some basis ∣ i ⟩ |i\rangle ∣ i ⟩ are:
ρ i j = ⟨ i ∣ ρ ∣ j ⟩ = ∑ k w k ⟨ i ∣ α ( k ) ⟩ ⟨ α ( k ) ∣ j ⟩ \rho_{ij} = \langle i|\rho|j\rangle = \sum_k w_k \langle i|\alpha^{(k)}\rangle\langle\alpha^{(k)}|j\rangle ρ ij = ⟨ i ∣ ρ ∣ j ⟩ = ∑ k w k ⟨ i ∣ α ( k ) ⟩ ⟨ α ( k ) ∣ j ⟩
Taking the trace (sum of diagonal elements):
Tr ( ρ ) = ∑ i ρ i i = ∑ i ∑ k w k ⟨ α ( k ) ∣ i ⟩ ⟨ i ∣ α ( k ) ⟩ \text{Tr}(\rho) = \sum_i \rho_{ii} = \sum_i \sum_k w_k \langle\alpha^{(k)}|i\rangle\langle i|\alpha^{(k)}\rangle Tr ( ρ ) = ∑ i ρ ii = ∑ i ∑ k w k ⟨ α ( k ) ∣ i ⟩ ⟨ i ∣ α ( k ) ⟩
Using completeness ∑ i ∣ i ⟩ ⟨ i ∣ = 1 \sum_i |i\rangle\langle i| = \mathbf{1} ∑ i ∣ i ⟩ ⟨ i ∣ = 1 :
Tr ( ρ ) = ∑ k w k ⟨ α ( k ) ∣ α ( k ) ⟩ = ∑ k w k = 1 ■ \text{Tr}(\rho) = \sum_k w_k \langle\alpha^{(k)}|\alpha^{(k)}\rangle = \sum_k w_k = 1 \quad \blacksquare Tr ( ρ ) = ∑ k w k ⟨ α ( k ) ∣ α ( k ) ⟩ = ∑ k w k = 1 ■
2. Pure states: ρ 2 = ρ \rho^2 = \rho ρ 2 = ρ and Tr ( ρ 2 ) = 1 \text{Tr}(\rho^2) = 1 Tr ( ρ 2 ) = 1 # For a pure state there is a single normalized state ∣ i ⟩ |i\rangle ∣ i ⟩ with w i = 1 w_i = 1 w i = 1 , so ρ = ∣ i ⟩ ⟨ i ∣ \rho = |i\rangle\langle i| ρ = ∣ i ⟩ ⟨ i ∣ . Then:
ρ 2 = ∣ i ⟩ ⟨ i ∣ i ⟩ ⟨ i ∣ = ∣ i ⟩ ⟨ i ∣ = ρ ■ \rho^2 = |i\rangle\langle i|i\rangle\langle i| = |i\rangle\langle i| = \rho \quad \blacksquare ρ 2 = ∣ i ⟩ ⟨ i ∣ i ⟩ ⟨ i ∣ = ∣ i ⟩ ⟨ i ∣ = ρ ■
And immediately:
Tr ( ρ 2 ) = Tr ( ρ ) = 1 \text{Tr}(\rho^2) = \text{Tr}(\rho) = 1 Tr ( ρ 2 ) = Tr ( ρ ) = 1
For a mixture , Tr ( ρ 2 ) < 1 \text{Tr}(\rho^2) < 1 Tr ( ρ 2 ) < 1 — this is actually used as a measure of purity . The closer Tr ( ρ 2 ) \text{Tr}(\rho^2) Tr ( ρ 2 ) is to 1, the more “pure-like” the state is.
3. Ensemble averages: [ A ] = Tr ( ρ A ) [A] = \text{Tr}(\rho A) [ A ] = Tr ( ρ A ) # The ensemble average of an operator A A A is defined as:
[ A ] = ∑ i w i ⟨ α ( i ) ∣ A ∣ α ( i ) ⟩ [A] = \sum_i w_i \langle\alpha^{(i)}|A|\alpha^{(i)}\rangle [ A ] = ∑ i w i ⟨ α ( i ) ∣ A ∣ α ( i ) ⟩
Working from the right-hand side Tr ( ρ A ) \text{Tr}(\rho A) Tr ( ρ A ) , using the matrix element definition A j i = ⟨ j ∣ A ∣ i ⟩ A_{ji} = \langle j|A|i\rangle A ji = ⟨ j ∣ A ∣ i ⟩ :
Tr ( ρ A ) = ∑ i ∑ j ρ i j A j i = ∑ i , j , k w k ⟨ α ( k ) ∣ j ⟩ ⟨ j ∣ A ∣ i ⟩ ⟨ i ∣ α ( k ) ⟩ \text{Tr}(\rho A) = \sum_i \sum_j \rho_{ij} A_{ji} = \sum_{i,j,k} w_k \langle\alpha^{(k)}|j\rangle\langle j|A|i\rangle\langle i|\alpha^{(k)}\rangle Tr ( ρ A ) = ∑ i ∑ j ρ ij A ji = ∑ i , j , k w k ⟨ α ( k ) ∣ j ⟩ ⟨ j ∣ A ∣ i ⟩ ⟨ i ∣ α ( k ) ⟩
Summing over i i i and j j j using completeness twice:
Tr ( ρ A ) = ∑ k w k ⟨ α ( k ) ∣ A ∣ α ( k ) ⟩ = [ A ] ■ \text{Tr}(\rho A) = \sum_k w_k \langle\alpha^{(k)}|A|\alpha^{(k)}\rangle = [A] \quad \blacksquare Tr ( ρ A ) = ∑ k w k ⟨ α ( k ) ∣ A ∣ α ( k ) ⟩ = [ A ] ■
This is powerful: any expectation value in any ensemble is just a single trace.
4. Cyclic property of trace, and basis independence# Cyclic property: Tr ( A B ) = Tr ( B A ) \text{Tr}(AB) = \text{Tr}(BA) Tr ( A B ) = Tr ( B A ) .
Insert a complete set ∑ j ∣ ϕ j ⟩ ⟨ ϕ j ∣ = 1 \sum_j |\phi_j\rangle\langle\phi_j| = \mathbf{1} ∑ j ∣ ϕ j ⟩ ⟨ ϕ j ∣ = 1 between A A A and B B B :
Tr ( A B ) = ∑ i ⟨ ψ i ∣ A B ∣ ψ i ⟩ = ∑ i , j ⟨ ψ i ∣ A ∣ ϕ j ⟩ ⟨ ϕ j ∣ B ∣ ψ i ⟩ \text{Tr}(AB) = \sum_i \langle\psi_i|AB|\psi_i\rangle = \sum_{i,j}\langle\psi_i|A|\phi_j\rangle\langle\phi_j|B|\psi_i\rangle Tr ( A B ) = ∑ i ⟨ ψ i ∣ A B ∣ ψ i ⟩ = ∑ i , j ⟨ ψ i ∣ A ∣ ϕ j ⟩ ⟨ ϕ j ∣ B ∣ ψ i ⟩
Swap the scalar factors (they’re just numbers):
= ∑ i , j ⟨ ϕ j ∣ B ∣ ψ i ⟩ ⟨ ψ i ∣ A ∣ ϕ j ⟩ = ∑ j ⟨ ϕ j ∣ B A ∣ ϕ j ⟩ = Tr ( B A ) ■ = \sum_{i,j}\langle\phi_j|B|\psi_i\rangle\langle\psi_i|A|\phi_j\rangle = \sum_j \langle\phi_j|BA|\phi_j\rangle = \text{Tr}(BA) \quad \blacksquare = ∑ i , j ⟨ ϕ j ∣ B ∣ ψ i ⟩ ⟨ ψ i ∣ A ∣ ϕ j ⟩ = ∑ j ⟨ ϕ j ∣ B A ∣ ϕ j ⟩ = Tr ( B A ) ■
For longer products this extends by induction: Tr ( A B C ⋯ ) = Tr ( B C ⋯ A ) = ⋯ \text{Tr}(ABC\cdots) = \text{Tr}(BC\cdots A) = \cdots Tr ( A BC ⋯ ) = Tr ( BC ⋯ A ) = ⋯
Basis independence follows immediately. If U U U is a unitary change of basis, then the trace in the new basis is Tr ( U ρ U † ) = Tr ( U † U ρ ) = Tr ( ρ ) \text{Tr}(U\rho U^\dagger) = \text{Tr}(U^\dagger U \rho) = \text{Tr}(\rho) Tr ( U ρ U † ) = Tr ( U † U ρ ) = Tr ( ρ ) , using the cyclic property.
5. Superposition ≠ \neq = mixture: an explicit example# This is arguably the most important conceptual point about the density matrix. Consider a spin-1/2 particle with ∣ ± ⟩ ≡ ∣ S z = ± ℏ 2 ⟩ |\pm\rangle \equiv |S_z = \pm\tfrac{\hbar}{2}\rangle ∣ ± ⟩ ≡ ∣ S z = ± 2 ℏ ⟩ .
The mixture ρ mix = 1 2 ( ∣ + ⟩ ⟨ + ∣ + ∣ − ⟩ ⟨ − ∣ ) \rho_\text{mix} = \tfrac{1}{2}(|+\rangle\langle+| + |-\rangle\langle-|) ρ mix = 2 1 ( ∣ + ⟩ ⟨ + ∣ + ∣ − ⟩ ⟨ − ∣ ) :
ρ mix = 1 2 ( 1 0 0 0 ) + 1 2 ( 0 0 0 1 ) = 1 2 ( 1 0 0 1 ) = 1 2 \rho_\text{mix} = \frac{1}{2}\begin{pmatrix}1&0\\0&0\end{pmatrix} + \frac{1}{2}\begin{pmatrix}0&0\\0&1\end{pmatrix} = \frac{1}{2}\begin{pmatrix}1&0\\0&1\end{pmatrix} = \frac{\mathbf{1}}{2} ρ mix = 2 1 ( 1 0 0 0 ) + 2 1 ( 0 0 0 1 ) = 2 1 ( 1 0 0 1 ) = 2 1
With S x = ℏ 2 ( 0 1 1 0 ) S_x = \frac{\hbar}{2}\begin{pmatrix}0&1\\1&0\end{pmatrix} S x = 2 ℏ ( 0 1 1 0 ) :
[ S x ] mix = Tr ( ρ mix S x ) = ℏ 4 Tr ( 0 1 1 0 ) = 0 [S_x]_\text{mix} = \text{Tr}(\rho_\text{mix} S_x) = \frac{\hbar}{4}\text{Tr}\begin{pmatrix}0&1\\1&0\end{pmatrix} = 0 [ S x ] mix = Tr ( ρ mix S x ) = 4 ℏ Tr ( 0 1 1 0 ) = 0
The superposition ∣ ψ sup ⟩ = 1 2 ( ∣ + ⟩ + ∣ − ⟩ ) |\psi_\text{sup}\rangle = \frac{1}{\sqrt{2}}(|+\rangle + |-\rangle) ∣ ψ sup ⟩ = 2 1 ( ∣ + ⟩ + ∣ − ⟩) :
ρ sup = ∣ ψ sup ⟩ ⟨ ψ sup ∣ = 1 2 ( 1 1 ) ( 1 1 ) = 1 2 ( 1 1 1 1 ) \rho_\text{sup} = |\psi_\text{sup}\rangle\langle\psi_\text{sup}| = \frac{1}{2}\begin{pmatrix}1\\1\end{pmatrix}\begin{pmatrix}1&1\end{pmatrix} = \frac{1}{2}\begin{pmatrix}1&1\\1&1\end{pmatrix} ρ sup = ∣ ψ sup ⟩ ⟨ ψ sup ∣ = 2 1 ( 1 1 ) ( 1 1 ) = 2 1 ( 1 1 1 1 )
[ S x ] sup = Tr ( ρ sup S x ) = ℏ 4 Tr ( 1 1 1 1 ) ( 0 1 1 0 ) = ℏ 4 Tr ( 1 1 1 1 ) = ℏ 2 [S_x]_\text{sup} = \text{Tr}(\rho_\text{sup} S_x) = \frac{\hbar}{4}\text{Tr}\begin{pmatrix}1&1\\1&1\end{pmatrix}\begin{pmatrix}0&1\\1&0\end{pmatrix} = \frac{\hbar}{4}\text{Tr}\begin{pmatrix}1&1\\1&1\end{pmatrix} = \frac{\hbar}{2} [ S x ] sup = Tr ( ρ sup S x ) = 4 ℏ Tr ( 1 1 1 1 ) ( 0 1 1 0 ) = 4 ℏ Tr ( 1 1 1 1 ) = 2 ℏ
So [ S x ] mix = 0 [S_x]_\text{mix} = 0 [ S x ] mix = 0 but [ S x ] sup = ℏ / 2 [S_x]_\text{sup} = \hbar/2 [ S x ] sup = ℏ/2 — the two states give different measurable predictions . The superposition has definite spin along x x x ; the mixture does not. The off-diagonal elements of ρ \rho ρ (the coherences ) encode quantum interference and are entirely absent in the mixture.
6. Equation of motion: the von Neumann equation# Differentiating ρ = ∑ i w i ( t ) ∣ α ( i ) ( t ) ⟩ ⟨ α ( i ) ( t ) ∣ \rho = \sum_i w_i(t)|\alpha^{(i)}(t)\rangle\langle\alpha^{(i)}(t)| ρ = ∑ i w i ( t ) ∣ α ( i ) ( t )⟩ ⟨ α ( i ) ( t ) ∣ with respect to time using the product rule:
d ρ d t = ∑ i d w i d t ∣ α ( i ) ⟩ ⟨ α ( i ) ∣ + ∑ i w i ( d d t ∣ α ( i ) ⟩ ⟨ α ( i ) ∣ + ∣ α ( i ) ⟩ d d t ⟨ α ( i ) ∣ ) \frac{d\rho}{dt} = \sum_i \frac{dw_i}{dt}|\alpha^{(i)}\rangle\langle\alpha^{(i)}| + \sum_i w_i\left(\frac{d}{dt}|\alpha^{(i)}\rangle\langle\alpha^{(i)}| + |\alpha^{(i)}\rangle\frac{d}{dt}\langle\alpha^{(i)}|\right) d t d ρ = ∑ i d t d w i ∣ α ( i ) ⟩ ⟨ α ( i ) ∣ + ∑ i w i ( d t d ∣ α ( i ) ⟩ ⟨ α ( i ) ∣ + ∣ α ( i ) ⟩ d t d ⟨ α ( i ) ∣ )
Using the time-dependent Schrödinger equation i ℏ d d t ∣ α ( i ) ⟩ = H ∣ α ( i ) ⟩ i\hbar\frac{d}{dt}|\alpha^{(i)}\rangle = H|\alpha^{(i)}\rangle i ℏ d t d ∣ α ( i ) ⟩ = H ∣ α ( i ) ⟩ and its conjugate:
d d t ∣ α ( i ) ⟩ = − i ℏ H ∣ α ( i ) ⟩ , d d t ⟨ α ( i ) ∣ = + i ℏ ⟨ α ( i ) ∣ H \frac{d}{dt}|\alpha^{(i)}\rangle = -\frac{i}{\hbar}H|\alpha^{(i)}\rangle, \qquad \frac{d}{dt}\langle\alpha^{(i)}| = +\frac{i}{\hbar}\langle\alpha^{(i)}|H d t d ∣ α ( i ) ⟩ = − ℏ i H ∣ α ( i ) ⟩ , d t d ⟨ α ( i ) ∣ = + ℏ i ⟨ α ( i ) ∣ H
Substituting:
d ρ d t = ∑ i d w i d t ∣ α ( i ) ⟩ ⟨ α ( i ) ∣ + ( − i ℏ H ρ + i ℏ ρ H ) \frac{d\rho}{dt} = \sum_i \frac{dw_i}{dt}|\alpha^{(i)}\rangle\langle\alpha^{(i)}| + \left(-\frac{i}{\hbar}H\rho + \frac{i}{\hbar}\rho H\right) d t d ρ = ∑ i d t d w i ∣ α ( i ) ⟩ ⟨ α ( i ) ∣ + ( − ℏ i H ρ + ℏ i ρ H )
d ρ d t = ∑ i d w i d t ∣ α ( i ) ⟩ ⟨ α ( i ) ∣ − i ℏ [ H , ρ ] \boxed{\frac{d\rho}{dt} = \sum_i \frac{dw_i}{dt}|\alpha^{(i)}\rangle\langle\alpha^{(i)}| - \frac{i}{\hbar}[H,\,\rho]} d t d ρ = i ∑ d t d w i ∣ α ( i ) ⟩ ⟨ α ( i ) ∣ − ℏ i [ H , ρ ]
If the weights w i w_i w i are time-independent (a closed system), the first term vanishes and we recover the von Neumann equation :
d ρ d t = − i ℏ [ H , ρ ] \frac{d\rho}{dt} = -\frac{i}{\hbar}[H, \rho] d t d ρ = − ℏ i [ H , ρ ]
This is the quantum analogue of Liouville’s theorem in classical statistical mechanics. Note the sign: it’s − i ℏ [ H , ρ ] -\frac{i}{\hbar}[H,\rho] − ℏ i [ H , ρ ] , opposite to the Heisenberg equation for operators d A d t = + i ℏ [ H , A ] \frac{dA}{dt} = +\frac{i}{\hbar}[H,A] d t d A = + ℏ i [ H , A ] .