The microcanonical ensemble is an ensemble whose Probability density function is
where is the Hamiltonian of the system and is its total energy. It represents an isolated system in thermal equilibrium whose energy is precisely defined and conserved. Given this definition, the equal a priori probability hypothesis applies to it directly, hence the constant density function.
In practice, we instead consider the system energy to be in an interval to account for practical uncertainties in measurement, since our understanding of energy is never going to be without error. Nevertheless, is to be seen as a tiny constant with respect to the energy () and likewise is a tiny interval.
It differs from the canonical and grand canonical ensembles, both of which are not isolated and have energy fluctuations.
Expectation values#
Given some dynamical variable , its most likely value (the mode) is the one that takes in the largest number of systems. The mean on the other hand is the usual ensemble average:
The mean and mode tend to coincide if the Mean squared error of tends to zero. Empirically, the MSE is inversely proportional to the number of particles in the system : . So for large , the mean and the mode coincide.
Entropy#
To find the entropy function for the ensemble, let's introduce a function which counts the number of states whose energy is between and :
This function explicitly depends on , but also on the number of particles (the integration variables) and the volume (in the integration limits due to ). returns a volume in phase space, which is a finite spherical shell bounded by two hypersurfaces of energies and .
Let's introduce another function that counts all states with energy below :
This is the volume of a sphere of radius . Of course, a spherical shell is just the difference between two spheres of different radii, and so the two functions are related by
Since the thickness of the shell is small compared to the radius (), the volume of the shell can be well approximated as
Since is a number of states, we can reasonably assume the entropy comes from Boltzmann's definition of entropy:
where is some constant (presumably the Boltzmann constant). However, in order to definitively prove this is entropy we need to prove that
- is extensive;
- obeys the second law of thermodynamics.
It can be proven that the entropy can be calculated from any of , and , and that they are all equal up to an additive constant:
Extensiveness#
Let's start from the first. Consider some chamber divided in two volumes and , respectively containing and particles.
If entropy is extensive, then the total entropy must be given as the combination of the entropy of each subsystem1. Entropy measures disorder, so if the two systems heavily interact with each other, that'll increase entropy by a lot. For this proof, let's assume that the energy of interaction between 1 and 2 is much smaller than the energy of each system, and . This allows us to consider the two as non-interacting and leaves us with just the internal entropies. For the interaction to be negligible we need two things:
- the range of interaction between particles must be finite (we'll send the volume to infinity later so any finite value is fine). This excludes long-range interactions like gravity from this proof;
- the contact-surface-to-volume ratio must be negligible as the volume goes infinite. This excludes weird, possibly nonphysical contact surfaces from this proof.
If these are true, then the Hamiltonian is separable: . To continue, let's define the entropies separately as and . and are volumes in the respective phase spaces and . We'd like to find the total entropy. Our simplest option is to just sum the two:
The product is the number of total states whose energy is between and 2. However, we'd be wrong. The reason is that and are, as it stands, undefined. They are just one possible subdivision of the total energy, which leads to only possible set of states. What we need to do is, on top of merging the states with this product, we also need to consider every possible pair of and that we split the system into (there's nothing preventing any combination, so they are all valid states that need to be counted)3. To count the pairs, we divide and into intervals of width , starting from zero. This division requires there to be an energy minimum. Now, by way of energy conservation, an energy pair can be written as and , where is the label for a given interval. With this, we can find the total number of states by summing over every interval:
If we plug this back in the Boltzmann entropy we get
Let with be the largest term in this sum. We must have
The first inequality is true because is just one term in the sum for . The second is true because is arbitrarily large (since is arbitrarily small). If we make the switch to entropies, the inequalities become
See how the only difference in between the outer inequalities is the term. If it were to vanish, the entropy would exactly be . To proceed, note how 4, but also how and therefore . Clearly, the sum goes something like
is something massive, like type massive, so this sum looks like (here using , but the point stands for any basis). Evidently, the second term is completely insignificant and we might as well set it to zero. But if we do this, we are left with
And so we proved that is (approximately) additive over subsystems, i.e. it is extensive. Furthermore, the only term that we are left with is the largest contribution alone, which means that the only term that matters in the sum is the one with highest entropy.
Equilibrium temperature#
To prove that obeys the second law of thermodynamics, we'll need to check if the state with maximum entropy is also the most likely. In other words, the state we found above, with energies , must be so overwhelmingly likely as to make every other state look impossibly rare. Since the state density is uniform (because we said so in the definition of the microcanonical ensemble), being the most likely macrostate is the same as having the largest count of equivalent microstates. As such, and must be such that they maximize under the constraint . We can use Lagrange multipliers to find the maximum under this bounded set. is a stationary point, for which
From this we have
and so
If we introduce the temperature through the Maxwell relations like
we can define the inverse by inverting the above relation (and calling )
and substituting it above we get
which is exactly what we should expect as we are studying the system in the context of thermal equilibrium. But the definition of temperature we used comes from absolute temperature, which inherently relies on the Boltzmann constant. Therefore, has to be precisely the Boltzmann constant , else the definition of would no longer hold.
As a side note, holds for any two subsystems, which means that the global temperature of a isolated system described by the microcanonical ensemble also regulates the temperature (and therefore thermal equilibrium) of all of its components.
Second law#
The only thing left to prove is that obeys the second law of thermodynamics. Thankfully this is straight-forward: the second law says that when a thermodynamic systems goes from one state to another through a transformation, the entropy does not decrease. In a microcanonical ensemble, entropy is a function of , and . But and are constant, so the only thing we can change to vary the state is . But if is increased, then must also increase as we are integrating over more states (recall that the integration bounds are affected by ). But since , and can only increase, then can also only increase. Thus, the second law is preserved, and with that we proved that is formally entropy.
Connection to information theory#
Consider the definition of entropy in function of information-theoretical entropy, . Note that this has the same form as the above three, just with instead of the of , or . This suggests that all three of these are formally types of information-theoretical entropy, which then gets converted into physical entropy by the Boltzmann constant.
Examples#
> With some combinatorics, the number of state combinations due to $n_{1}$ is > $$W=\begin{pmatrix}N \\ n_{1}\end{pmatrix}=\frac{N!}{n_{1}!(N-n_{1})!}> Dividing entropy by the Boltzmann constant we get > $$\frac{S}{k_{B}}=\log N!-\log n_{1}!-\log[(N-n_{1})!]\simeq N\log N- \frac{U}{E}\log \frac{U}{E}- \left( N- \frac{U}{E} \right)\log\left( N- \frac{U}{E} \right)using the binomial coefficient. The entropy therefore is
> We use the inverse just because it's easier, as we already have $S$ anyway. So > $$\frac{1}{T}=\frac{k_{B}}{E} \log\left( \frac{NE-U}{U} \right)using Stirling's approximation and .
As for the temperature, we have from the Maxwell relations:
> Note that this temperature could very well be negative. This, of course, is non-physical, but it's happening here regardless. The reason why it happens is that the system is being bounded in energy from above, that is, there is a maximum energy that the system cannot cross. The unphysical nature of negative temperature implies that systems with an energy maximum cannot exist, or at most exist as approximation of unbounded systems. This is the case for [[Two-level system|two-level systems]], which in practice do exist, but only within a short period of time. For instance, a [[Qubit]] is a two-level system during the short period of its measurement, but if we were to observe one over long periods of time, it would not remain bounded to two levels, as that would imply the possibility of negative temperature. > > We can also invert the last equation to find the internal energy of the system: > $$U=\frac{NE}{e^{\beta E}+1}So
> This is the [[Fermi-Dirac distribution]] for a system with no [[chemical potential]]. It is interesting to see how we managed to find a strictly quantum property from a purely classical treatment. The reason is that a two state system is inherently a quantum idea as it requires the discretization of energy. > > We can also derive the [[heat capacity]] from its definition: > $$C(T)=\frac{\partial U}{\partial T}=\frac{NE^{2}}{k_{B}T^{2}} \frac{e^{\beta E}}{e^{\beta E}+1}or alternatively, using , the fraction of particles in the state:
Footnotes#
-
It doesn't necessarily need to be a simple sum (e.g. due to interactions), but it needs to combine so that the total has terms for each subsystem. ↩
-
It's a product and not a sum because we want all possible combinations of states of 1 and 2. For example, if 1 and 2 both have 3 states each, there are combinations, not . If we call the states for 1 and for 2, the combinations are . ↩
-
At this point it's useful to remember that our original division into two systems is completely arbitrary. Systems 1 and 2 don't exists, they're fictitious, they're just a tool to count possible states, so any pair of these is eligible. ↩
-
This is because counts combinations, which are given by , so . Stirling's approximation then yields . ↩