Derivation of Statistical Mechanics

Classical Hamiltonian Mechanics

Start with the definition of action S across some path x(t) where (x,x˙) is the Lagrangian density.

S(x)=tatb(x(t),dx(t)dt)dt

From the principle of least action the path should be a minimum, so the variational derivative should be zero given an perturbation δx(t) that preserves the same start and end points so δx(ta)=0,δx(tb)=0

limϵ0S(x+ϵδx)S(x)ϵ=0
limϵ01ϵtatb(x(t)+ϵδx,dx(t)dt+ϵdδxdt)(x(t),dx(t)dt)dt=0

Assume we can swap the derivative and integral.

tatblimϵ0(x(t)+ϵδx(t),dx(t)dt+ϵdδx(t)dt)(x(t),dx(t)dt)ϵdt=0

Create a dummy value.

tatblimϵ0(x(t)+ϵδx(t),dx(t)dt+ϵdδx(t)dt)(x(t)+ϵδx(t),dx(t)dt)+(x(t)+ϵδx(t),dx(t)dt)(x(t),dx(t)dt)ϵdt=0
tatblimϵ0(x(t)+ϵδx(t),dx(t)dt+ϵdδx(t)dt)(x(t)+ϵδx(t),dx(t)dt)ϵ+(x(t)+ϵδx(t),dx(t)dt)(x(t),dx(t)dt)ϵdt=0

See these are just definitions for partial derivaitves (gradients).

tatb(x(t),x(t)dt)x˙dδx(t)dt+(x(t),x(t)dt)xδx(t)dt=0

Integrate by parts on left side.

(x(tb),x(tb)dt)x˙δx(tb)(x(ta),x(ta)dt)dx˙δx(ta)+tatb(ddt(x(t),x(t)dt)x˙)δx(t)+(x(t),x(t)dt)xδx(t)dt=0

Remember δx(ta),δx(tb)=0.

tatb((x(t),x(t)dt)x(ddt(x(t),x(t)dt)dx˙))δx(t)dt=0

Since this must be true for arbitrary δx

(x(t),x(t)dt)x(ddt(x(t),x(t)dt)dx˙)=0

which is the Euler Lagrange equation. Next we define a Hamiltonian (x,q)=x˙q(x,x˙) where q=(x,x˙)x˙, usually interpreted as momentum.

(x,q)x=(x,x˙)x=ddt(x,x˙)x˙=q˙

So we get Hamiltons equations.

x˙=(x,q)q
q˙=(x,q)x

Classical Microcanonical Ensemble

We start with some probability distribution in the phase space of the Hamiltonian (x,q).

p(x,q,t)=???

The probability distribution evolves by the continuity equation.

p(x,q,t)t=(pv)

Here v=x˙1,x˙2,,x˙N,q˙1,q˙2,,q˙N

In equilibrium the probability distribution at every point in phase space shouldn't change.

(pv)=0
i=12Np(x,q)viai=0
i=1Np(x,q)x˙ixi+p(x,q)q˙iqi=0

Simple chain rule.

i=1Np(x,q)x˙ixi+x˙ip(x,q)xi+p(x,q)q˙iqi+q˙ip(x,q)qi=0

Substitute in Hamiltons equations.

i=1Np(x,q)(x,q)xiqi+x˙ip(x,q)xip(x,q)(x,q)xiqi+q˙ip(x,q)qi=0
i=1Nx˙ip(x,q)xi+q˙ip(x,q)qi=0
i=1N(x,q)qip(x,q)xi(x,q)xip(x,q)qi=0

Next, we say that the probability distribution is a function only of the total energy, so p(x,q,t0)=f((x,q)). This means that p(x,q,t0)ai=f((x,q))(x,q)ai

i=1Nf()((x,q)xi(x,q)qi(x,q)xi(x,q)qi)=0

That assumption creates a solution to the continuity equation. Now in the microcanonical ensemble (NVE) we say the total energy is known, so the only possibility for f is a uniform distribution over the selected energy level.

p(x,q)=δ((x,q)E)Vδ((x,q)E)dxdq

This equation is the principle of equal a priori for the equilibrium in a closed system.

Classical Canonical Ensemble

Define the number of states accessable by a system with N degrees of freedom at energy E to be the below.

Ω(E)=1hNVδ((x,q)E)dxdq

Next split the degrees of freedom into the "bath" and "system". Let the degrees of freedom for the bath be rbath=xM+1,,xN,qM+1,,qN and the degrees of freedom for the system rsys=x1,,xM,,q1,,qM.

In the Canonical ensemble the combined energy of the bath and system should be constant. The combination of the two makes a microcanonical ensemble.

total(rbath,rsys)=bath(rbath)+sys(rsys)+coupling(rbath,rsys)

The total accessable states with an energy should be the total number combined number of states the bath and system can be in that sum to that energy.

Ωtotal(Etotal)=1hMVΩbath(Etotalsys(rsys))drsys

And in the microcanonical ensemble all microstates (rbath,rsys) with the correct energy have an equal probability density.

p(rbath,rsys)=1Ωtotal(Etotal)1hNδ(Etotal(sys(rsys)+bath(rbath)))

We can calculate the probability density of just the microstate for the system.

p(rsys)=Vp(rbath,rsys)drbath
=V1Ωtotal(Etotal)1hNδ(Etotal(sys(rsys)+bath(rbath)))drbath
=Ωbath(Etotalsys(rsys))hMΩtotal(Etotal)

Take the natural log of both sides

ln(p(rsys))=ln(Ωbath(Etotalsys(rsys))hMΩtotal(Etotal))
ln(p(rsys))=ln(Ωbath(Etotalsys(rsys)))ln(hMΩtotal(Etotal))

Then we expand the logarithm to its Taylor series.

f(x+c)=f(x)+df(x)dxc+12d2f(x)dx2c2+16d3f(x)dx3c3+

We set f(x)=ln(Ωbath(x)), x=Etotal, and c=sys(rsys) we get the following expression.

ln(p(rsys))=ln(Ωbath(Etotal))dln(Ωbath(Etotal))dEtotalsys(rsys)
+12d2ln(Ωbath(Etotal))dEtotal2sys(rsys)216d3ln(Ωbath(Etotal))dEtotal3sys(rsys)3+
ln(hMΩtotal(Etotal))

In the limit that Etotal approaches Ebath so that Esystem becomes small in comparison, the bath becomes a microcanonical ensemble and the entropy of the bath Sbath(Etotal)=Sbath(Ebath)=kBln(Ωbath(Ebath))

=ln(Ωbath(Ebath))1kBdSbath(Ebath)dEbathsys(rsys)+12kBd2Sbath(Ebath)dEbath2sys(rsys)2
16kBd3Sbath(Ebath)dEbath3sys(rsys)3+ln(hMΩtotal(Etotal))

And by definition 1Tbath=dSbath(E)dE.

=ln(Ωbath(Ebath))1kBTbathsys(rsys)+12kBTbathdTbathdEbath2sys(rsys)2
16kBTbathd2TbathdEbath2sys(rsys)3+ln(hMΩtotal(Etotal))

Then if we assume the heat capacity of the bath is infinite so that dTbathdEbath=0 we can remove the extra terms.

ln(p(rsys))=1kBTbathsys(rsys)+ln(Ωbath(Ebath)hMΩtotal(Etotal))

Then exponentiate both sides.

p(rsys)=esys(rsys)kBTbath(hMΩtotal(Etotal)Ωbath(Ebath))

And the denominator is just a normalizing factor so the final expression is below.

p(rsys)=esys(rsys)kBTbathVesys(rsys)kBTbathdrsys

Quantum Microcanonical Ensemble

We assume a time evolution of a system of the Schrodinger equation for some Hamiltonian operator.

Ψ(x,t)t=iH^Ψ(x,t)

To measure an observable when the stat is known is

A=Ψ|A^|Ψ

but if there is a probability distrubition over some states then it is below.

=A=ipiΨi|A^|Ψi

If we have an orthanormal basis n so that k|nknk|=I we can insert it into the equation.

=ipiΨi|A^(k|nknk|)|Ψi
=kipiΨi|A^|nknk|Ψi
=kipink|ΨiΨi|A^|nk
=knk|(ipi|ΨiΨi|A^)|nk
=Tr(ipi|ΨiΨi|A^)

If we define an operator

ρ=ipi|ΨiΨi|

then the expected value of any observable is

A=Tr(ρA^)

Next we find the time evolution of ρ.

dρdt=ipid(|ΨiΨi|)dt=ipi(d|ΨidtΨi|+|ΨidΨi|dt)

Then substitute the Schrodinger equation

=ipi(iH^|ΨiΨi|+|ΨiΨi|iH^)
=iipi(H^|ΨiΨi||ΨiΨi|H^)
=i(H^ρρH^)

So it must be that for equilibrium they commute

H^ρ=ρH^

Similar to the classical case, if we have the pi be only a function of the energy for each eigenvector of the Hamiltonian, the system will be in equilibrium. Since observables are Hermetian, an orthogonal eigenbases must exist. Each operator will be defined in the eigenbasis.

H^=aEa|nana|
ρ=ap(Ea)|nana|

We can show they commute

H^ρ=aEa|nana|bp(Eb)|nbnb|
=abEap(Eb)|nana||nbnb|
=abp(Eb)|nana|Ea|nbnb|
=ap(Ea)|nana|bEb|nbnb|
ρH^

So the quantum microcanonical ensemble must have a density matrix of the form

ρ=ap(Ea)|nana|

where the n's are eigenvectors of the Hamiltonian. Since the energy is known, it must be a uniform distribution over all eigenstates with that energy

ρ=a|nana|

Quantum Cannonical Ensemble

TODO