Ideal crystal


An ideal crystal is a solid-state material with a periodic, perfectly geometric Bravais lattice. It is the simplest mathematical description of a crystal, much like how an ideal gas is the simplest description of a gas. The Hamiltonian is

H=ipi22mi+jPj22MjKinetic terms+12j,jZjZje2RjRjIon-ion interaction+12i,ie2ririElectron-electron interactioni,jZje2riRjIon-electron interactionH=\underbrace{ \sum_{i} \frac{p_{i}^{2}}{2m_{i}}+ \sum_{j} \frac{P_{j}^{2}}{2M_{j}} }_{ \text{Kinetic terms} }+ \underbrace{ \frac{1}{2}\sum_{j,j'} \frac{Z_{j}Z_{j'}e^{2}}{\lvert \mathbf{R}_{j}-\mathbf{R}_{j'} \rvert } }_{ \text{Ion-ion interaction} }+ \underbrace{ \frac{1}{2}\sum_{i,i'} \frac{e^{2}}{\lvert \mathbf{r}_{i}-\mathbf{r}_{i'} \rvert } }_{ \text{Electron-electron interaction} }-\underbrace{ \sum_{i,j} \frac{Z_{j}e^{2}}{\lvert \mathbf{r}_{i}-\mathbf{R}_{j} \rvert } }_{ \text{Ion-electron interaction} }

This is an NN-body Hamiltonian that's certainly not analytically solvable. It can be solved in an approximate manner by combining two assumptions:

  1. Core and valence electrons. We split electrons in two categories: core electrons (inner electron shells) and valence electrons (outermost electron shell). Only the valence electrons are significant when it comes to interactions.
  2. Born-Oppenheimer approximation. The Born-Oppenheimer approximation is noticing that the nuclei move at much lower speeds than the electrons, which allows us to state that ions are subject to a mean Potential and that the electrons see the ions as basically stationary.

In this sense, the Hamiltonian can be written as

HHions(Ri)+He(ri,Rj)+He-ion(ri,δRj)H\equiv H_\text{ions}(R_{i})+H_{e}(r_{i},R_{j})+H_\text{e-ion}(r_{i},\delta R_{j})

where HionsH_\text{ions} is the lattice energy (the energy of each individual ion in isolation), HeH_{e} is the interaction energy between electrons with "frozen" ions (without motion) and He-ionH_\text{e-ion} is the interaction energy between electrons with ion vibrations.

Vibrations

In order to realistically describe a crystal's behavior, we cannot ignore the fact that the ions do move inside the lattice. The great importance of this is that if they didn't, no energy could be transferred through the material through vibrations, a fact that's obviously a glaring issue as any object in a nonzero temperature environment will have some collisions from the outside transferring energy to it. Thus, we need some way to detach the actual position of an ion from the lattice of the crystal. In mathematical terms, the positions R\mathbf{R} are no longer constant, but a function of time, R(t)\mathbf{R}(t).

The vibrations of a crystal are determined by the ions alone, so their study only affects the ionic part of the Hamiltonian HionsH_\text{ions}. By observing the real world behavior of a crystal, we can (and should) make an assumption:

  1. The mean equilibrium position R(t)t\langle \mathbf{R}(t) \rangle_{t} of each ion correspond to a point R\mathbf{R} in the Bravais lattice.

This accounts for the fact that a real-world crystal manifestly abides by some sort of lattice structure, whilst still allowing for ionic motion. This assumption casts a very wide net: any ionic motion is fine so long that on average, it makes the ion stay on the lattice point. The only kind of motion that this assumption openly rejects is ion scattering, where the ions fully break off the lattice and either leave the material entirely, to take a free spot somewhere else in the lattice. These are pretty intuitive scenarios, but is stable conditions, they don't particularly matter.1

The harmonic approximation

Soon enough however, we hit a wall. To vibrate, the ions must affect each other somehow. Of course, they do so electromagnetically (electrostatically even), but we don't know what the electric potential looks like. In order to get a solution that's can be handled by hand, we take a second, more stringent assumption: 2. The displacements from the mean equilibrium position are small compared to the distance between ions.

It might be more illuminate to read it this way: ion oscillations are small. And the answer to small oscillations is always the same: the harmonic oscillator. Thus, the electric potential varies harmonically in time: this is called the harmonic approximation. You might argue whether this assumption is even valid: fair point. But recall that the mass of an ion is essentially that of the nucleus (a "blob" of protons and neutrons) and that is enormous compared to that of the few electrons surrounding it, entire orders of magnitude higher. So large in fact, the Born-Oppenheimer approximation in molecules just does away with nuclear motion outright and still gets good results. Thus, it's perfectly legitimate to assume that the ions move very little around their resting spot.

So, we have a periodic shape in which we slot in our ions and assume that any change from this configuration is small. The generic position of each ion will then be given by r(R)=R+u(R)\mathbf{r}(\mathbf{R})=\mathbf{R}+\mathbf{u}(\mathbf{R}), where u(R)\mathbf{u}(\mathbf{R}) is the (small) displacement from the lattice point. The generic interaction potential between ions UU is then UU(r)=U(R+u)U\equiv U(\mathbf{r})=U(\mathbf{R}+\mathbf{u}). Here's the mathematical statement of the approximation: we take this potential and expand it in a Taylor series around the lattice point:

U(R+u)=U(R)+(u)U(R)+12(u)2U(R)+13!(u)3U(R)+U(\mathbf{R}+\mathbf{u})=U(\mathbf{R})+(\mathbf{u}\cdot \nabla )U(\mathbf{R})+ \frac{1}{2}(\mathbf{u}\cdot \nabla)^{2}U(\mathbf{R})+ \frac{1}{3!}(\mathbf{u}\cdot \nabla)^{3}U(\mathbf{R})+\ldots

If we express UU in terms of the individual potential energy ϕ\phi of each ion we get

U=N2R0ϕ(R)+12RR(u(R)u(R))ϕ(RR)+14RR[u(R)u(R)]2ϕ(RR)+O(u3)\begin{align} U&=\frac{N}{2}\sum_{\mathbf{R}\neq 0}\phi(\mathbf{R})\\ &+ \frac{1}{2}\sum_{\mathbf{R}\mathbf{R}'}(\mathbf{u}(\mathbf{R})-\mathbf{u}(\mathbf{R}'))\cdot \nabla \phi(\mathbf{R}-\mathbf{R}') \\ &+ \frac{1}{4}\sum_{\mathbf{R}\mathbf{R}'}[\mathbf{u}(\mathbf{R})-\mathbf{u}(\mathbf{R}')\cdot \nabla]^{2}\phi(\mathbf{R}-\mathbf{R}') \\ &+O(\mathbf{u}^{3}) \end{align}

The first-order term contains

Rϕ(RR)\sum_{\mathbf{R}'}\nabla \phi(\mathbf{R}-\mathbf{R}')

which is (minus) the total force exerted on the ion at equilibrium in R\mathbf{R} by all other atoms R\mathbf{R}'. But there is no total force at equilibrium, that's the whole point of equilibrium. Thus, this entire term must vanish.

The harmonic approximation consists of truncating at the second order, where we're only left with zeroth- and second-order terms, which we can denote as

U=Uequilbrium+Uharmonic\boxed{U=U_\text{equilbrium}+U_\text{harmonic}}

The former term is the energy of a perfectly immobile lattice. It is purely electrostatic potential. The second term is due to the oscillations of the ions about their resting spots, and can be expanded to read

Uharmonic=14RRμν[uμ(R)uμ(R)]ϕμν(RR)[uν(R)uν(R)]U_\text{harmonic}=\frac{1}{4}\sum_{\mathbf{R}\mathbf{R}'\mu \nu}[u_{\mu}(\mathbf{R})-u_{\mu}(\mathbf{R}')]\phi_{\mu \nu}(\mathbf{R}-\mathbf{R}')[u_{\nu}(\mathbf{R})-u_{\nu}(\mathbf{R}')]

where μ,ν=x,y,z\mu,\nu=x,y,z and

ϕμν=2ϕ(r)rμrν\phi_{\mu \nu}=\frac{ \partial ^{2}\phi(\mathbf{r}) }{ \partial r_{\mu}r_{\nu} }

For convenience, an alternative format is used

Uharmonic=12RRμνuμ(R)Dμν(RR)uν(R)\boxed{U_\text{harmonic}=\frac{1}{2}\sum_{\mathbf{R}\mathbf{R}'\mu \nu} u_{\mu}(\mathbf{R})D_{\mu \nu}(\mathbf{R}-\mathbf{R}')u_{\nu}(\mathbf{R}')}

where

Dμν(RR)=δRRRϕμν(RR)ϕμν(RR)D_{\mu \nu}(\mathbf{R}-\mathbf{R}')=\delta_{\mathbf{R}\mathbf{R}'}\sum_{\mathbf{R}''}\phi_{\mu \nu}(\mathbf{R}-\mathbf{R}'')-\phi_{\mu \nu}(\mathbf{R}-\mathbf{R}')

are the elements of the dynamical matrix. δRR\delta_{\mathbf{R}\mathbf{R}'} is the three-dimensional Dirac delta.

The spring model

Since we're using harmonic potentials for the interaction between ions, how do we interpret this? Well, harmonic potentials are springs. They have some elastic constant and the strength of the pullback increases quadratically with distance. This justifies a somewhat toyish but otherwise not unreasonable model of claiming that ions are bound to each other by (invisible) springs. In other words, a chain of harmonic oscillators.

Say that all of the oscillators have the same elastic constant γ\gamma. Then their single-oscillator Hamiltonian is

Hosc=12Mv2+12γx2H_\text{osc}=\frac{1}{2}Mv^{2}+ \frac{1}{2}\gamma x^{2}

where MM is the mass of the ion.

Let's start from a 1D crystal, so an (effectively) infinite chain of oscillators all bound to their two closest neighbors. We'll assume each primitive cell has just one ion in it. The oscillations are longitudinal, that is, on the axis of the springs2. The lattice parameter is aa, so each ion's resting position is nana, for nZn\in \mathbb{Z}. We assume that the ions only interact with their nearest neighbors (short range approximation), so the potential is

Uharmonic=12ϕ(a)n[u(na)u([n+1]a)]2U_\text{harmonic}=\frac{1}{2}\phi''(a)\sum_{n}[u(na)-u([n+1]a)]^{2}

The differential equation of motion then is

Mu¨(na)=Uharmonicu(na)=ϕ(a)[2u(na)u([n1]a)u([n+1]a)](1)M\ddot{u}(na)=-\frac{ \partial U_\text{harmonic} }{ \partial u(na) }=-\phi''(a)[2u(na)-u([n-1]a)-u([n+1]a)] \tag{1}

This is the equation of motion of a point mass MM attached with two massless springs of elastic constant γ=ϕ(a)\gamma=\phi''(a). It is a wave equation, so we'll choose the solution for u(na,t)un(t)u(na,t)\equiv u_{n}(t) to be a Plane wave

un(t)=uei(knaωt)(2)u_{n}(t)=ue^{i(kna-\omega t)}\tag{2}

for some constant uu.

The complete solution also requires knowing the initial position and velocity/momentum of each ion in the chain, but we don't need those to gather more information from this system. When substituting (2)(2) into (1)(1) we get

Mω2ei(kanωt)=γ[2eikaeika]ei(kan)ωtMω2=γ[2eikaeika]\begin{align} -M\omega ^{2}\cancel{ e^{i(kan-\omega t)} }&=-\gamma[2-e^{ika}-e^{ika}]\cancel{ e^{i(kan)-\omega t} } \\ M\omega ^{2}&=\gamma[2-e^{-ika}-e^{ika}] \end{align}

Extracting ω\omega, using eix+eix=2cosxe^{ix}+e^{-ix}=2\cos x and (1cos2x)/2=sin2x(1-\cos ^{2}x)/2=\sin ^{2}x gives

ω(k)=2γ(1cos(ka))M=2γMsin(ka2)\omega(k)=\sqrt{ \frac{2\gamma(1-\cos(ka))}{M} }=2\sqrt{ \frac{\gamma}{M} }\left\lvert \sin\left( \frac{ka}{2} \right) \right\rvert

This is the dispersion relation of the 1D, spring-modeled, ideal crystal, which tells us the angular frequency of the Wave that carries a perturbation through the chain of ions. The wave, as usual, moves with two different velocities: the phase velocity c=ω/kc=\omega/k and the group velocity vg=ω/kv_{g}=\partial \omega/\partial k.

Something interesting happens when the vibration wavelength is very large compared to the lattice spacing aa. In the kπ/ak\ll \pi/a case, the wavelength of the vibration is much larger than the lattice distance aa, so ions next to each other largely move with the same phase. This is called a continuous approximation. Moreover, since kaπka\ll \pi, the dispersion becomes linear

sinka2ka2ω(k)2γMka2=ckwith c=γMa\sin \frac{ka}{2}\simeq \frac{ka}{2}\quad\to \quad \omega(k)\simeq 2\sqrt{ \frac{\gamma}{M} } \frac{ka}{2}=ck\quad\text{with }c=\sqrt{ \frac{\gamma}{M} }a

In this specific case, the phase velocity is equal to group velocity, as

vgω(k)k=kγMka=γMa=cv_{g}\equiv\frac{ \partial \omega(k) }{ \partial k } =\frac{ \partial }{ \partial k } \sqrt{ \frac{\gamma}{M} }ka=\sqrt{ \frac{\gamma}{M} }a=c

They're also constant and independent of frequency. cc is the speed of sound of the material.

Another interesting case is when k=π/ak=\pi/a. In this case, the wavelength of the vibration is exactly λ=2a\lambda=2a and the direction of motion alternates between ions. For instance, all even ions will move to the right and all odd ions to the left.

sinka2=sinπ2=1ω=2γM\sin \frac{ka}{2}=\sin \frac{\pi}{2}=1\quad\to \quad \omega=2\sqrt{ \frac{\gamma}{M} }

Clearly then, the group velocity is zero. There can be no propagation in this regime: the wave is a standing wave.

Something useful to note is that the dispersion relation is periodic. This is important because it's specifically periodic in the range of the first Brillouin zone. This means that, if we know the Bravais lattice of the crystal, we can find the reciprocal lattice, then the Brillouin zone and finally all of the above.

Two-point basis

For now we've talked about a lattice whose points correspond with actual ions. Let's expand the concept to a lattice with a basis. In the same 1D model, we'll have twice as many ions, but they're still all connected with the same springs as above. We'll identify with unu_{n} the location of one type of ion and with vnv_{n} the location of the other. We'll use bb for the lattice parameter.

We'll now have two equations of motion, one for each ion:

M1d2undt2=γ[2unvn1vn],M2d2vndt2=γ[2vnunun+1]ω2M1u=γv(1+eikb)2γuω2M2v=γu(1+eikb)2γv\begin{align} M_{1} \frac{d^{2}u_{n}}{dt^{2}}=-\gamma[2u_{n}-v_{n-1}-v_{n}],\qquad &M_{2} \frac{d^{2}v_{n}}{dt^{2}}=-\gamma[2v_{n}-u_{n}-u_{n+1}] \\ -\omega ^{2}M_{1}u=\gamma v(1+e^{-ikb})-2\gamma u\qquad&-\omega ^{2}M_{2}v=\gamma u(1+e^{ikb})-2\gamma v \end{align}

This yields the system

{(2γω2M1)uγ(1+eikb)v=0γ(1+eikb)u(2γω2M2)v=0\begin{cases} (2\gamma-\omega ^{2}M_{1})u-\gamma(1+e^{-ikb})v=0 \\ -\gamma(1+e^{ikb})u-(2\gamma-\omega ^{2}M_{2})v=0 \end{cases}

This has nontrivial solution only if its determinant is zero:

2γω2M1γ(1+eikb)γ(1+eikb)2γω2M2=0\begin{vmatrix} 2\gamma-\omega ^{2}M_{1} & -\gamma(1+e^{-ikb}) \\ -\gamma(1+e^{ikb}) & 2\gamma-\omega ^{2}M_{2} \end{vmatrix}=0

There will only be two possible nontrivial solutions, both dependent on kk. The dispersion relation therefore splits in two:

ω2=γ(1M1+1M2)±γ(1M1+1M2)24M1M2sin2kb2\omega ^{2}=\gamma\left( \frac{1}{M_{1}}+ \frac{1}{M_{2}} \right)\pm \gamma\sqrt{ \left( \frac{1}{M_{1}}+ \frac{1}{M_{2}} \right)^{2}- \frac{4}{M_{1}M_{2}}\sin ^{2} \frac{kb}{2} }

On further analysis, these solutions have very different behaviors. Collectively, they're known as the branches of the dispersion relation.

The lower branch behaves like before, with vanishing frequency at the edges and a dispersion relation of the form ω=ck\omega=ck. This is called the acoustic branch of dispersion. The upper branch however has much higher frequencies and never vanishes. This is called the optical branch of dispersion because the higher frequencies can actually interact and couple with commonly encountered Electromagnetic radiation, thus defining the crystal's optical behavior.

Energy spectrum

The above discussion is adequate for the general solution to the wave equation, but in practice, we need some boundary conditions to fully solve it. If there are NN total ions in the crystal, then we can use periodic boundary conditions3 to state that the crystal is actually a looping ring, so u(0)=u(Na)u(0)=u(Na).

The periodic condition enforces

eikNa=1k=2πanNe^{ikNa}=1\quad\Rightarrow \quad \boxed{k=\frac{2\pi}{a} \frac{n}{N}}

Evidently, there's only a discrete set of acceptable wavenumbers for the oscillation of a 1D crystal. These wavenumbers, or rather their associated frequencies, are known as the normal modes of oscillation of the system. We take these to lie between π/a-\pi/a and π/a\pi/a, which is essentially saying that the must be contained in the first Brillouin zone of the reciprocal lattice.

Everything that we've discovered up to now is actually fairly classical. There's nothing obligatorily quantum anywhere in here. But now that we're working with specific modes of vibration, we need to deal with the kind of energy that they carry. In a macroscopic system, energy quantization doesn't matter. Energy scales are way too large for that to be significant. But we're not in a macroscopic setting, we're dealing with individual ions. So the energy has to be quantized and the quantization must matter because the energy scales are just that small. But the fix is simple: just change the harmonic oscillators to quantum harmonic oscillators. This quantizes the energy of the kk-th mode of oscillation to

El(k)=ω(k)(l+12)E_{l}(k)=\hbar \omega(k)\left( l+ \frac{1}{2} \right)

where ll is the "principal quantum number" of the vibration. Clearly then, not only are the modes of oscillation discrete (not quantum), each individual mode has its possible energies quantized (definitely quantum). Taking inspiration from electromagnetic radiation, where the Photon is the quantum of energy, we call phonon the quantum of energy of acoustic radiation.

A phonon behaves in many ways like a Particle. In fact, it is the particle aspect of the sound wave, just like how the photon is the particle aspect of the light wave. However, phonons aren't "real" particles: they're a special class of phenomenon called a quasiparticle. Basically, they are external phenomena that display particle-like behavior. The big difference is that quasiparticles don't have "physicality", they can't exist on their own. A photon is its own thing and is capable of traversing the vacuum on its own, such as when it's shot out from a star into outer space. A phonon isn't "its own thing", it is just that the vibration of a chain of oscillator produces behavior that is consistent with a particle formulation, so we describe it as a particle. It's a purely mathematical invention. This is why phonons are part of a class of quasiparticles generally called collective excitations. They aren't actual particles, they are a collective vibration of several other objects that just happens to behave like a particle4.

Phonons are useful because they are technically bosons, so if we put a lot of them together (i.e. cause some vibrations in the crystal), we get a Bose gas that obeys the Bose-Einstein distribution. We can then use the whole machinery of statistical mechanics to describe the crystal with a quantum ensemble and derive macroscopically relevant thermodynamic properties about it by knowing the microscopic behavior. See Harmonic oscillator > Oscillator chains for more.

Electronic structure

The electronic structure of a crystalline solid is a rather complicated topic, but it serves to explain a considerable portion of the nature of the material itself. Electrons are mobile and somewhat prone to being dislodged from individual ions, so they tend to be the cause of most behavior of the crystal when interacting with other materials or Electromagnetic radiation.

The complication arises from trying to understand how the electrons interact with the lattice of ions they are in. There's two extremes to this interaction: on one side, the atoms are so far away that all electrons exclusively interact with their nucleus (free atom limit); on the other, the electrons are so weakly bound to any one nucleus in particular that they end up roaming around freely, disregarding the ions completely (free electron limit). The truth is, as usual, somewhere in the middle, and the real question is: how do we get there?

There's two paths we could take, each starting from one edge of the potential limit. We could start with free atoms and then progressively make them close and closer, analyzing what effect this has on the energy levels of the electrons. Alternatively, we could start with free electrons and add an ionic potential, making more and more intense and seeing what that does to the electrons.

Both approaches are valid and have their merits. What they have in common is that crystal are, in principle at least, constructed from a perfectly periodic lattice of atoms, each with the same properties. Thus, the potential profile of each lattice cell is identical to that of every other cell. This makes the foundation of how electrons are handled in solid crystalline matter: a periodic potential.

Of course, a real crystal is not perfectly periodic. They aren't perfectly pure, leading to occasional differences in the contents of each cell and therefore the potential in proximity of those cells. Thermal effects always provide a chance that an atom is not where it's supposed to be, leading to a hole in the lattice or a misplaced atom. Ionic vibrations, as seen in the section above, also contribute to breaking perfect periodicity. Finally, electron-electron interactions are incredibly complicated. That said, the study of a periodic potential is paramount, because it sets the foundation of the quantum description of electronic structure. Just like with the fine structure of the hydrogen atom or nuclear motion after the Born-Oppenheimer approximation, it's much easier to develop a theory for an ideal system, then fix it with subsequent corrective terms for each individual problem. This is what we'll be done here too, starting with the perfect periodicity of an ideal crystal.

The periodic potential

The basic idea is simple. We have a lattice that repeats every R\mathbf{R}. So, our potential, which we'll call U(r)U(\mathbf{r}) and is due to the ions in that lattice, repeats in the same way:

U(r)=U(r+R)(3)U(\mathbf{r})=U(\mathbf{r}+\mathbf{R})\tag{3}

Now, we need to choose a U(r)U(\mathbf{r}). That's a hard task. An even harder task is trying to solve the wavefunction of the whole crystal; we can't just put this in a Hamiltonian containing every single ion and electron in the crystal and expect it to work. It's unthinkable to even start solving such an equation. What we'll do is exploit the fact that, regardless of what the potential looks like, it must satisfy (3)(3). This statement already contains a lot of useful information that we can discover.

Let's start by solving for a periodic potential on a single, lone electron. The Schrödinger equation then is

222meψ(r)+U(r)ψ(r)=Eψ(r)(4)- \frac{\hbar^{2}\nabla^{2}}{2m_{e}}\psi(\mathbf{r})+U(\mathbf{r})\psi(\mathbf{r})=E\psi(\mathbf{r})\tag{4}

From here, we can make a first, quite trivial choice of periodic potential: U(r)=0U(\mathbf{r})=0. A constant is technically periodic, and if we make this choice, we get the right edge of the potential limits: the free electron limit. In other words, we went back to the Sommerfeld model of electronic structure. Of course, we won't use the zero potential, but it does prove that the older Sommerfeld model is just a specific case of the periodic potential model, and by extension that we're proving something more general here.

Some terminology: an independent electron (that is, independent of other electrons, not of ions) that obeys a one-electron Schrödinger equation (4)(4) is said to be a Bloch electron. The Sommerfeld free electron is a subtype of the Bloch electron that occurs when the potential is zero. A Bloch electron is interesting because it satisfies Bloch's theorem, which has a whole host of consequences regarding both the electron and the structure of the crystal itself; go read that page before continuing here.

One key way Bloch electrons differ from free ones is how their energy levels are distributed. In the Sommerfeld model (a Fermi gas; assume T=0T=0 for simplicity) the electrons all pile up from the lowest energy state to the highest. Each energy level is determined by ε(k)=2k2/2mεF\varepsilon(\mathbf{k})=\hbar^{2}k^{2}/2m\leq\varepsilon_{F}, where εF\varepsilon_{F} is the Fermi energy. Meanwhile, Bloch electrons have their energy levels identified by εn(k)\varepsilon_{n}(\mathbf{k}), which includes the additional band index nn on top of the wavevector k\mathbf{k} and does not have a simple analytical form. If each level is to only be counted once, k\mathbf{k} must be confined to a single value per primitive cell, so that if there are NN cells, there are NN values of k\mathbf{k}. As the energy levels fill up from lowest to highest, we recognize two possible cases:

  1. Some bands are completely filled. Others are completely empty. The difference in energy between the highest occupied level (top of the highest occupied band) and the lowest occupied level (bottom of lowest unoccupied band) in known as the band gap. A significant band gap (kBT\gg k_{B}T where TT is room temperature) is indicative of an insulator. A smaller, but still relevant band gap (kBT\sim k_{B}T) is indicative of an intrinsic semiconductor.
  2. Some bands are partially filled. There's no hard line between filled and empty bands. In this case, the Fermi energy (energy of highest occupied state) is somewhere in the range of one or more bands. While this means that there is no global Fermi surface, each band will have its own independent Fermi surface to divide its occupied levels from the unoccupied ones. For practical reasons, different terminology is used: the set of all band-specific Fermi surfaces is considered as the whole crystal's Fermi surface. Meanwhile, the band-specific surfaces are called branches of the (total) Fermi surface. So a crystal of this kind has one singular Fermi surface, and each energy band has a branch of that surface. Crystals that possess a Fermi surface are metals.

Electronic density of states5

In any quantum statistical mechanical scenario dealing with a very large number of components, it's often convenient to assume that such a huge number of possibilities can turn a discrete set of levels into a continuous set of levels by sheer virtue of the fact that the difference between each level will be minuscule. Thus, one can turn sums into integrals weighed by a density of states function. This is the gist of the Thomas-Fermi approximation and it works quite well with energy band levels too.

It's pretty common in a many-electron scenario to need to calculate a quantity QQ of the form

Q=2n,kQn(k)Q=2\sum_{n,\mathbf{k}}Q_{n}(\mathbf{k})

where the factor 2 takes Spin degeneracy into account and the sum is over all distinct levels k\mathbf{k} in a single primitive cell of the reciprocal lattice. If the volume is very large then we can define the volume density qq of QQ as

q=limVQV=2nQn(k)(2π)3d3kq=\lim_{ V \to \infty } \frac{Q}{V}=2\sum_{n} \frac{Q_{n}(\mathbf{k})}{(2\pi)^{3}}d^{3}k

If Qn(k)Q_{n}(\mathbf{k}) depends only on εn(k)\varepsilon_{n}(\mathbf{k}), then we can define a density of states (or density of levels in this context) function g(ε)g(\varepsilon) per unit volume such that

q=g(ε)Q(ε)dεwhere g(ε)=ngn(ε)and gn(ε)=cellδ(εεn(k))4π3d3kq=\int g(\varepsilon)Q(\varepsilon)d\varepsilon \quad\text{where }g(\varepsilon)=\sum_{n}g_{n}(\varepsilon)\quad\text{and }g_{n}(\varepsilon)=\int _\text{cell} \frac{\delta(\varepsilon-\varepsilon_{n}(\mathbf{k}))}{4\pi ^{3}}d^{3}k

using a Dirac delta. The density of levels in the band nn can also be defined as the function for which

gn(ε)dε=2V×number of wavevectors allowed in [ε,ε+dε]g_{n}(\varepsilon)d\varepsilon=\frac{2}{V}\times\text{number of wavevectors allowed in }[\varepsilon,\varepsilon+d\varepsilon]

In the continuous limit this becomes

gn(ε)dε=celld3k4π3×{1if εn(k)[ε,ε+dε]0otherwiseg_{n}(\varepsilon)d\varepsilon=\int _\text{cell} \frac{d^{3}k}{4\pi ^{3}}\times \begin{cases} 1 & \text{if }\varepsilon_{n}(\mathbf{k})\in[\varepsilon,\varepsilon+d\varepsilon] \\ 0 & \text{otherwise} \end{cases}

Since dεd\varepsilon is infinitesimal, the density of states can be calculated as an integral on a constant energy surface Sn(ε)S_{n}(\varepsilon) of energy εn(k)=ε\varepsilon_{n}(\mathbf{k})=\varepsilon:

gn(ε)dε=Sn(ε)δk(k)4π3dSg_{n}(\varepsilon)d\varepsilon=\int_{S_{n}(\varepsilon)} \frac{\delta k(\mathbf{k})}{4\pi ^{3}}dS

δk(k)\delta k(\mathbf{k}) is the distance (in reciprocal space) between the surfaces at ε\varepsilon and ε+dε\varepsilon+d\varepsilon. The Gradient kεn(k)\nabla_{\mathbf{k}}\varepsilon_{n}(\mathbf{k}) is normal to the constant-energy surface, so

ε+dε=ε+kεnδk(k)δk(k)=dεkεn\varepsilon+d\varepsilon=\varepsilon+\lvert \nabla_{\mathbf{k}}\varepsilon_{n} \rvert \cdot \delta k(\mathbf{k})\quad\Rightarrow \quad \delta k(\mathbf{k})=\frac{d\varepsilon}{\lvert \nabla_{\mathbf{k}}\varepsilon_{n} \rvert }

Plugging this into the integral yields

gn(ε)=Sn(ε)14π3dSkεng_{n}(\varepsilon)=\int_{S_{n}(\varepsilon)} \frac{1}{4\pi ^{3}} \frac{dS}{\lvert \nabla_{\mathbf{k}}\varepsilon_{n} \rvert }

This goes to show that the density of states in an energy band is dependent on the geometry of the constant energy surface Sn(ε)S_{n}(\varepsilon). It increases when the gradient is small (the states are tightly packed and with little separation) and decreases when it's large (the levels are far apart).

Weak potential

Intuitively, given the large nuclear charges of the atoms that make up matter, you'll probably assume that the periodic potential that applies onto the electrons would be rather intense. But for many metals, you'd be wrong. As it turns out, modern theoretical and experimental studies on metals of the I, II, III and IV group of the periodic table indicate that their electrons are hardly affected by this potential which, while nonzero, is almost constant. These metals are referred to as nearly free electron metals and their study begins from the Sommerfeld model of free electrons to which we add a weak periodic potential.

Firstly, though, we should talk about why the potential is so weak. After all, it's quite counter-intuitive. There are two reasons why:

  1. Ion-electron interaction falls off with distance due to being electromagnetic. Only the outermost electrons are conduction electrons and they are forbidden by the Pauli exclusion principle from going near the ion since they cannot cross the already completed inner shells. Thus, conduction electrons are consistently far from ions.
  2. The potential felt by the conduction electron is weaker than imagined. This is because core electron shells screen the ion potential. Moreover, the high mobility of conduction electrons further screens and diminishes the net potential that is felt by them.

These are qualitative reasons of course, but the point remains. The potential in most metals is quite weak and almost constant. We can use this to our advantage to get some results about metallic electronic structure.

We start from the single-electron Schrödinger equation that can be found in Bloch's theorem > Fourier series proof6:

(2k22mε)ck+KUKckK=0\left( \frac{\hbar^{2}k^{2}}{2m}-\varepsilon \right)c_{\mathbf{k}}+\sum_{\mathbf{K}}U_{\mathbf{K}}c_{\mathbf{k}-\mathbf{K}}=0

We need to solve these equations. To do so, we need the coefficients ck,ckK,ckK,c_{\mathbf{k}},c_{\mathbf{k}-\mathbf{K}},c_{\mathbf{k}-\mathbf{K}'},\ldots, assuming we know the potential UU. We will start in the free electron limit. Then, we'll add a potential later. We are legitimized in doing this because, since the potential is so weak, we can treat it as a perturbation using perturbation theory.

To start, we'll use a 1D crystal like we did for vibrations. We'll express the potential in a Fourier series

U(x)=nUneinKxU(x)=\sum_{n}U_{n}e^{inKx}

We will use the arbitrary constant of the potential to set U0=0U_{0}=0. Assuming parity symmetry for the crystal, we also have Un=UnU_{n}=U_{-n}. We require that only ck,ckK,ck+Kc_{k},c_{k-K},c_{k+K} are nonzero, with all other terms vanishing. Consequently, we also only retain U0U_{0} (which is zero anyway) and U1=U1=UU_{1}=U_{-1}=U. This decision cannot be justified at this stage, but varying the number of coefficients that are kept has consequences on the predicted behavior. We'll return to this in a moment when it's easier to explain. Keeping only three coefficients leads to a system of three equations, with means three solutions (energy bands) for each kk. The equations are

(2(kK)22mε)ckK+Uck=0(2k22mε)ck+UckK+Uck+K=0(2(k+K)22mε)ck+K+Uck=0\begin{align} \left( \frac{\hbar^{2}(k-K)^{2}}{2m} -\varepsilon\right)c_{k-K}+Uc_{k}&=0 \\ \left( \frac{\hbar^{2}k^{2}}{2m} -\varepsilon\right)c_{k}+Uc_{k-K}+Uc_{k+K}&=0 \\ \left( \frac{\hbar^{2}(k+K)^{2}}{2m} -\varepsilon\right)c_{k+K}+Uc_{k}&=0 \end{align}

We start in the free electron limit, so U(x)=0U(x)=0 for all xx. That's like saying Un=0U_{n}=0 for all nn, so in the equations above U=0U=0. This leaves us with

(2(kK)22mε)ckK=0(2k22mε)ck=0(2(k+K)22mε)ck+K+=0\begin{align} \left( \frac{\hbar^{2}(k-K)^{2}}{2m} -\varepsilon\right)c_{k-K}&=0 \\ \left( \frac{\hbar^{2}k^{2}}{2m} -\varepsilon\right)c_{k}&=0 \\ \left( \frac{\hbar^{2}(k+K)^{2}}{2m} -\varepsilon\right)c_{k+K}+&=0 \end{align}

These equations are effectively just squares of kk multiplied and shifted by a constant and then set to zero. In other words, these are all parabolas (convex/concave up specifically).

Here we can see what the solution looks like in arbitrary units. Due to periodicity arguments, only the part inside the first Brillouin zone (1BZ) really matters, since we just copy and paste that solution throughout the entire solid. We can see that the each coefficient that we retain adds one more parabola to represent the potential outside of the 1BZ. These are not insignificant though, as the outer potentials cross into the 1BZ and therefore affect it too. This is the mathematical statement that each electron is not only affected by its closest ion, but also by nearby ions too, just to a lesser extent. The more coefficients we keep, the more ions we consider in the interaction. As we can see by the right graph in the figure, the further the ions are, the higher the energy is when they cross over to the 1BZ. This means that outer ions only really matter in high-energy scenarios.

This is what happens for free electrons in a periodic lattice. Now let's take another step and turn on the potential. We start from a free electron Hamiltonian H0=p2/2mH_{0}=p^{2}/2m. These are free particles so their (continuous, delocalized, not physically valid) eigenstates are plane waves k\ket{\mathbf{k}} with eigenvalues ε0(k)=2k2/2m\varepsilon_{0}(\mathbf{k})=\hbar^{2}\lvert \mathbf{k} \rvert^{2}/2m. Now consider a weak perturbation to this Hamiltonian — our potential — so that we get H=H0+U(r)H=H_{0}+U(\mathbf{r}).

Adding a perturbation to an otherwise stable system always adds a form of quantum dynamics, that is the possibility of a State transition. The matrix element of a transition from the state k\ket{\mathbf{k}} to k\ket{\mathbf{k}'} is

kUk=1VR3ei(kk)rU(r)drUkk\braket{ \mathbf{k}' | U | \mathbf{k} } =\frac{1}{V}\int_{\mathbb{R}^{3}} e^{i(\mathbf{k}-\mathbf{k}')\cdot \mathbf{r}}U(\mathbf{r})d\mathbf{r}\equiv U_{\mathbf{k}'-\mathbf{k}}

This is zero unless kk\mathbf{k}-\mathbf{k}' is a reciprocal lattice vector K\mathbf{K}. Thus, a plane wave k\ket{\mathbf{k}} can be transitioned to another one k\ket{\mathbf{k}'} only if they differ by a reciprocal lattice vector.

Let's apply non-degenerate perturbation theory at the first order:

ε(k)=ε0(k)+kUk=ε0(k)+U0\varepsilon(\mathbf{k})=\varepsilon_{0}(\mathbf{k})+\braket{ \mathbf{k} | U | \mathbf{k} } =\varepsilon_{0}(\mathbf{k})+U_{0}

The first order correction is a constant, so we'll set U0=0U_{0}=0 since potentials are always defined up to as constant. With the second order correction we get

ε(k)=ε0(k)+k=k+KK0kUk2ε0(k)ε0(k)\varepsilon(\mathbf{k})=\varepsilon_{0}(\mathbf{k})+\sum_{\mathbf{k}'=\mathbf{k}+\mathbf{K}}^{\mathbf{K}\neq0} \frac{\lvert \braket{ \mathbf{k}' | U | \mathbf{k} } \rvert^{2}}{\varepsilon_{0}(\mathbf{k})-\varepsilon_{0}(\mathbf{k}')}

The sum occurs only over K0\mathbf{K}\neq 0, which means over transitions between plane waves with a different wavevector than the unperturbed case (basically, only between states permitted by the perturbation). Of course, we're assuming non-degeneracy here. If for any reason any two states that are summed over have the same energy level (ε0(k)=ε0(k)\varepsilon_{0}(\mathbf{k})=\varepsilon_{0}(\mathbf{k}')) then the correction diverges. So, when does degeneracy occur?

Of course the condition above, ε0(k)=ε0(k)\varepsilon_{0}(\mathbf{k})=\varepsilon_{0}(\mathbf{k}'), must apply, but also k=k+K\mathbf{k}'=\mathbf{k}+\mathbf{K}. In the 1D case we know that K=2πn/aK=2\pi n/a and ε0(k)k2\varepsilon_{0}(k)\sim k^{2}. If we impose these two conditions we get

ε0(k)=ε0(k+K)k2=(k+K)2k2=k2+2kK+K22kK+K2=0\begin{align} \varepsilon_{0}(k)&=\varepsilon_{0}(k+K) \\ k^{2}&=(k+K)^{2} \\ k^{2}&=k^{2}+2kK+K^{2} \\ 2kK+K^{2}&=0 \end{align}

Dividing by K0K\neq0 we get 2k+K=02k+K=0 which means

k=K2=nπak=k+K=nπak=- \frac{K}{2}=- \frac{n\pi}{a}\quad\Rightarrow \quad k'=k+K = \frac{n\pi}{a}

Degeneracy happens when the wavevectors are precisely at the edges of the 1BZ (of course, it's periodic, that's the entire point). If this happens, we can't rely on non-degenerate perturbation theory and we need to upgrade to the degenerate version. The matrix elements are

kHk=ε0(k)kHk=ε0(k)=ε0(k+K)kHk=Ukk=UKkHk=Ukk=UK\begin{align} \braket{ \mathbf{k} | H | \mathbf{k} } &=\varepsilon_{0}(\mathbf{k}) \\ \braket{ \mathbf{k}' | H | \mathbf{k}' } &= \varepsilon_{0}(\mathbf{k}')=\varepsilon_{0}(\mathbf{k}+\mathbf{K}) \\ \braket{ \mathbf{k} | H | \mathbf{k}' } &= U_{\mathbf{k}-\mathbf{k}'}=U_{\mathbf{K}}^{*} \\ \braket{ \mathbf{k}' | H | \mathbf{k} } &= U_{\mathbf{k}'-\mathbf{k}}=U_{\mathbf{K}} \end{align}

Since the potential is a real function, UK=UKU_{-\mathbf{K}}=U_{\mathbf{K}}^{*}. The degenerate states k\ket{\mathbf{k}} and k+K\ket{\mathbf{k}+\mathbf{K}} form a two-dimensional Vector space where each state is a Linear combination of the two:

ψ=αk+βk+K\ket{\psi} =\alpha \ket{\mathbf{k}} +\beta \ket{\mathbf{k}+\mathbf{K}}

Applying degenerate perturbation theory gives the eigenvalue equation

(ε0(k)UKUKε0(k+K))(αβ)=E(αβ)\begin{pmatrix} \varepsilon_{0}(\mathbf{k}) & U_{\mathbf{K}}^{*} \\ U_{\mathbf{K}} & \varepsilon_{0}(\mathbf{k}+\mathbf{K}) \end{pmatrix}\begin{pmatrix} \alpha \\ \beta \end{pmatrix}=E\begin{pmatrix} \alpha \\ \beta \end{pmatrix}

EE is determined by

(ε0(k)E)(ε0(k+K)E)UK2=0\boxed{(\varepsilon_{0}(\mathbf{k})-E)(\varepsilon_{0}(\mathbf{k}+\mathbf{K})-E)-\lvert U_{\mathbf{K}} \rvert ^{2}=0}

As an example, let's see what happens if both k\mathbf{k} and k=k+K\mathbf{k}'=\mathbf{k}+\mathbf{K} are on the edges of the Brillouin zone. Since ε0(k)=ε0(k+K)\varepsilon_{0}(\mathbf{k})=\varepsilon_{0}(\mathbf{k}+\mathbf{K}) the above equation becomes

(ε0(k)E)2=UK2E±=ε0(k)±UK(\varepsilon_{0}(\mathbf{k})-E)^{2}=\lvert U_{\mathbf{K}} \rvert ^{2}\quad\Rightarrow \quad E_{\pm}=\varepsilon_{0}(\mathbf{k})\pm \lvert U_{\mathbf{K}} \rvert

This creates an energy interval of space between the top of a lower energy band (EE_{-}) and the bottom of higher one (E+E_{+}) that is exactly the size of 2UK2\lvert U_{\mathbf{K}} \rvert. This interval is called a gap. There are no valid electron states in the gap: an electron must either stay in the lower band or jump to the higher one, requiring a state transition of at least 2UK2\lvert U_{\mathbf{K}} \rvert in distance.

Let's see what this entails in 1D. Consider a periodic potential like

U(x)=U~cos(2πxa)where U~>0U(x)=\tilde{U}\cos\left( \frac{2\pi x}{a} \right)\quad\text{where }\tilde{U}>0

The edges of the Brillouin zone are at k=±π/ak=\pm \pi/a. To cause the above degeneracy, we pick wavevectors at the edges, so k=π/ak=\pi/a and k=k=π/ak'=-k=-\pi/a. From the eigenvalue equation we see that the degenerate eigenstates are

ψ±=k±k2\ket{\psi_{\pm}} =\frac{\ket{k} \pm \ket{k'} }{\sqrt{ 2 }}

Using position representation ψ(x)=xk\psi(x)=\braket{ x | k }, we get two plane waves like keiπx/a\ket{k}\to e^{i\pi x/a} and keiπx/a\ket{k'}\to e^{-i\pi x/a}. The position representation of the degenerate states is

ψ+(x)eiπx/a+eiπx/acos(πxa),ψ(x)eiπx/aeiπx/asin(πxa)\psi_{+}(x)\simeq e^{i\pi x/a}+e^{-i\pi x/a}\propto \cos\left( \frac{\pi x}{a} \right),\quad \psi_{-}(x)\simeq e^{i\pi x/a}-e^{-i\pi x/a}\propto \sin\left( \frac{\pi x}{a} \right)

ψ+2\lvert \psi_{+} \rvert^{2} has maxima at the potential maxima. Similarly, ψ2\lvert \psi_{-} \rvert^{2} has maxima at the potential minima. As such, ψ+\psi_{+} is higher energy than ψ\psi_{-}. The energy gap can be found by solving for the coefficients of the potential UnU_{n} of the potential U(x)=nUneinKxU(x)=\sum_{n}U_{n}e^{inKx} and then squaring them.

The result is something like this. The left plot shows three coefficients, U1,U0,U1U_{-1},U_{0},U_{1} and the right plot shows five, U2,U1,U0,U1,U2U_{-2},U_{-1},U_{0},U_{1},U_{2}. Where the lines here would have been parabola in the zero-potential case, we can see that they now split at the edges of the 1BZ, creating energy gaps. Also, we can see another effect of solving the equations with more or less coefficients: the three-coefficient solution missed the band higher band gap since it had U±22=02=0\lvert U_{\pm2} \rvert^{2}=\lvert 0 \rvert^{2}=0, whereas the five-coefficient shows it. This is another reason why more coefficients provide better results.

Another interesting detail is that, since we're dealing with quantum waves, we can use the Planck formula for energy, E=ωE=\hbar \omega. That might sound irrelevant, but the group velocity of a wave (read: electron) is

vg=dω(k)dk=1dE(k)dkv_{g}=\frac{d\omega(k)}{dk}=\frac{1}{\hbar} \frac{dE(k)}{dk}

But we do have the energy E(k)E(k), it's the energy band itself. Thus, the group velocity of an electron is determined by the slope of the energy band. Since bands with a gap flatten out at the edges of the 1BZ, waves with wavevectors on the edges have zero group velocity: they are standing waves.

Also due to the Planck formula, energy and angular frequency are interchangeable. Since energy bands are functions of kk to EE, they can also be seen as functions of kk to ω\omega. But a function ω(k)\omega(k) of a wave is a dispersion relation. Thus, the energy bands are the dispersion relations of the Bloch electrons.

Ultimately, the takeaways here are these.

Tight-binding

The tight-binding model attempts a different kind of description compared to the weak periodic potential. Whereas the weak potential started with free electrons and added a small but significant potential to the mix, the tight-binding model works the other way around. It starts with free atoms and then pushes them closer and closer until the effects of vicinity start to become relevant, namely the superposition of atomic orbitals. The actual superposition might be small or large, and it really depends on the energy level of the electrons themselves. The tight-binding model attempts to explain cases in which the superposition of atomic orbitals is small but still significant.

Whereas the weak potential described electrons that were largely free to move, tight-binding describes electrons that are largely unable to move and tightly bound to their ion, hence the name. This makes it well-positioned to explain the behavior of insulators instead of conductors, where electron movement is scarce.

We start from the single-electron Hamiltonian for an isolated atom, under the assumption that the solid is entirely made of one type of atom (an pure elemental solid):

H^atom=222m+Uatom(r)\hat{H}_\text{atom}=- \frac{\hbar^{2}\nabla ^{2}}{2m}+U_\text{atom}(\mathbf{r})

Uatom(r)U_\text{atom}(\mathbf{r}) is the potential that each electron feels with respect to the atom that it is bound to. Now, let's combine all of the atoms to make a solid in a manner similar to LCAO for molecules. For example, imagine sodium, which only has one valence electron in 3s3s, with energy E3sE_{3s} and eigenstate 3s\ket{3s}. If each sodium atom is exactly on a Bravais lattice point R\mathbf{R} then we get

H^solid=222m+RUatom(rR)=222m+Uatom(r)+R0Uatom(rR)\begin{align} \hat{H}_\text{solid}&=- \frac{\hbar^{2}\nabla^{2}}{2m}+\sum_{\mathbf{R}}U_\text{atom}(\mathbf{r}-\mathbf{R}) \\ &=- \frac{\hbar^{2}\nabla^{2}}{2m}+U_\text{atom}(\mathbf{r})+\sum_{\mathbf{R}\neq 0}U_\text{atom}(\mathbf{r}-\mathbf{R}) \end{align}

The three terms are the kinetic energy of the electron, the potential due to the atom the electron is bound to and the potential due to every other atom. We'll group the first two in the single-atom Hamiltonian H^atom\hat{H}_\text{atom} and call the third U(r)\mathcal{U}(\mathbf{r}) to distinguish it:

H^solid=H^atom+U(r)\hat{H}_\text{solid}=\hat{H}_\text{atom}+\mathcal{U}(\mathbf{r})

We now have the Hamiltonian of the solid as seen from an electron of an atom, with a corrective term U\mathcal{U} added onto it. This same idea an be applied to every electron.

Now imagine that the atoms farther away from each other. Each single-atom eigenfunction ϕn(r)\phi_{n}(r) has eigenvalue EnE_{n}. We can write the energy eigenvalues of the solid as

Hϕn=ϕn(r)Hsolidϕn(r)dr=En+ϕn(r)U(r)ϕn(r)dr=Enβ\langle H \rangle _{\phi_{n}}=\int \phi_{n}^{*}(\mathbf{r})H_\text{solid}\phi_{n}(\mathbf{r})d\mathbf{r}=E_{n}+\int \phi_{n}^{*}(\mathbf{r})\mathcal{U}(\mathbf{r})\phi_{n}(\mathbf{r})d\mathbf{r}=E_{n}-\beta

The β\beta term depends highly on how superimposed the various wavefunctions are and is zero if they do not superimpose. The eigenfunctions ϕn\phi_{n} go to zero before the potential does. For a solid made of NN atoms, we get NN degenerate solutions for each of the Hamiltonian eigenvalues. We expect this because there is no atom-atom interaction.

Let's see what happens with we include the interaction. We'll use an ansatz to declare the form of the wavefunction of each electron as

ψk(r)=1NRck,Rϕn(rR)\psi_{\mathbf{k}}(\mathbf{r})=\frac{1}{\sqrt{ N }}\sum_{\mathbf{R}}c_{\mathbf{k},\mathbf{R}}\phi_{n}(\mathbf{r}-\mathbf{R})

This is a linear combination of atomic wavefunctions ϕn\phi_{n} on the Bravais lattice. The coefficients are unknown and the thing we need to determine. For this to make sense in a lattice context, it needs to be a Bloch wave, so the coefficients will have to be exponentials:

ψk(r)=1NReikrϕn(rR)\psi_{\mathbf{k}}(\mathbf{r})=\frac{1}{\sqrt{ N }}\sum_{\mathbf{R}}e^{i\mathbf{k}\cdot \mathbf{r}}\phi_{n}(\mathbf{r}-\mathbf{R})

The values k\mathbf{k} are those allowed by the boundary conditions. These wavefunctions satisfy Bloch's theorem:

ψk(r+R)=1NReikRϕn(rR+R)=1NeikRReik(RR)ϕn(r(RR))(RR=RR)=1NeikRReikRϕn(rR)=eikRψk(r)\begin{align} \psi_{\mathbf{k}}(\mathbf{r}+\mathbf{R}')&=\frac{1}{\sqrt{ N }}\sum_{\mathbf{R}}e^{i\mathbf{k}\cdot \mathbf{R}}\phi_{n}(\mathbf{r}-\mathbf{R}+\mathbf{R}') \\ &=\frac{1}{\sqrt{ N }}e^{i\mathbf{k}\cdot \mathbf{R}'}\sum_{\mathbf{R}}e^{i\mathbf{k}\cdot(\mathbf{R}-\mathbf{R}')}\phi_{n}(\mathbf{r}-(\mathbf{R}-\mathbf{R}')) \\ (\mathbf{R}\to \mathbf{R}''=\mathbf{R}-\mathbf{R}')&=\frac{1}{\sqrt{ N }}e^{i\mathbf{k}\cdot \mathbf{R}'}\sum_{\mathbf{R}''}e^{i\mathbf{k}\cdot \mathbf{R}''}\phi_{n}(\mathbf{r}-\mathbf{R}'') \\ &=e^{i\mathbf{k}\cdot \mathbf{R}'}\psi_{\mathbf{k}}(\mathbf{r}) \end{align}

The energy eigenvalues are calculated similarly to the hydrogen molecule.

E(k)=ψk(r)H^solidψk(r)dr=1NR,Reik(RR)ϕn(rR)H^solidϕn(rR)dr=\begin{align} E(\mathbf{k})&=\int \psi_{\mathbf{k}}^{*}(\mathbf{r})\hat{H}_\text{solid}\psi_{\mathbf{k}}(\mathbf{r})d\mathbf{r} \\ &=\frac{1}{N}\sum_{\mathbf{R},\mathbf{R}'}e^{i\mathbf{k}\cdot(\mathbf{R}-\mathbf{R}')}\int \phi_{n}^{*}(\mathbf{r}-\mathbf{R}')\hat{H}_\text{solid}\phi_{n}(\mathbf{r}-\mathbf{R})d\mathbf{r} \\ &=\ldots \end{align}

Considering periodic boundary conditions, we can remove the dependency on R\mathbf{R} and merge the sums

=ReikRϕn(r)H^solidϕn(rR)dr\ldots=\sum_{\mathbf{R}}e^{i\mathbf{k}\cdot \mathbf{R}}\int \phi_{n}^{*}(\mathbf{r})\hat{H}_\text{solid}\phi_{n}(\mathbf{r}-\mathbf{R})d\mathbf{r}

The total energy eigenvalue must be the sum of this piece and the one we found before for the individual atoms:

E(k)=Enβ+R0eikRϕn(r)H^solidϕn(rR)drE(\mathbf{k})=E_{n}-\beta+\sum_{\mathbf{R}\neq 0}e^{i\mathbf{k}\cdot \mathbf{R}}\int \phi_{n}^{*}(\mathbf{r})\hat{H}_\text{solid}\phi_{n}(\mathbf{r}-\mathbf{R})d\mathbf{r}

The integral is the real problem here. Let's see what we can do with it:

ϕn(r)H^solidϕn(rR)dr=Enϕn(r)ϕn(rR)dr+ϕn(r)U(r)ϕn(rR)dr\int \phi_{n}^{*}(\mathbf{r})\hat{H}_\text{solid}\phi_{n}(\mathbf{r}-\mathbf{R})d\mathbf{r}=E_{n}\int \phi_{n}^{*}(\mathbf{r})\phi_{n}(\mathbf{r}-\mathbf{R})d\mathbf{r}+\int \phi_{n}^{*}(\mathbf{r})\mathcal{U}(\mathbf{r})\phi_{n}(\mathbf{r}-\mathbf{R})d\mathbf{r}

The first integral on the right is largely inconsequential: it measures the overlap of wavefunctions evaluated at different sites on the lattice, so it's never going to be significant. The second integral however cannot be ignored: the presence of U(r)\mathcal{U}(\mathbf{r}) makes U(r)ϕn(rR)\mathcal{U}(\mathbf{r})\phi_{n}(\mathbf{r}-\mathbf{R}) nontrivial even in the region it overlaps ϕn(r)\phi_{n}^{*}(\mathbf{r}) so we can't drop it. Instead, we'll add a minus sign and call it t=ϕn(r)U(r)ϕn(rR)drt=-\int \phi_{n}^{*}(\mathbf{r})\mathcal{U}(\mathbf{r})\phi_{n}(\mathbf{r}-\mathbf{R})d\mathbf{r}. This term is known as the hopping term and measures the State transition Probability of an electron from its own orbital to an orbital of nearby atom.

The energy band function, or the electron dispersion relation depending on how you want to see it, then is

E(k)=EnβR0t(R)eikR\boxed{E(\mathbf{k})=E_{n}-\beta-\sum_{\mathbf{R}\neq 0}t(\mathbf{R})e^{i\mathbf{k}\cdot \mathbf{R}}}

This shows the change of an energy level from an independent atom one EnE_{n} to a lattice one E(k)E(\mathbf{k}). β\beta measures transition probabilities from an orbital to an orbital of the same atom. tt measures the same, but to orbitals of other atoms.

Let's a run a simple calculation in a 1D crystal of lattice parameter aa for ss-orbital valence electrons. Each electron has energy eigenvalue EsE_{s} in the free atom limit. Normally, we'd need to sum over all other atoms, but ss wavefunctions decay fast, so in practice only the nearest neighbor matters, which are at ±a\pm a. Moreover, ss orbitals are spherically symmetric, so t(a)=t(a)=tst(-a)=t(a)=t_{s}. All in all, we get

Es(k)=Esβsts(eika+eika)=Esβs2tscos(ka)E_{s}(k)=E_{s}-\beta_{s}-t_{s}(e^{ika}+e^{-ika})=E_{s}-\beta_{s}-2t_{s}\cos (ka)

βs\beta_{s} is the value of β\beta calculated in the ss orbital. Both EsE_{s} and βs\beta_{s} are constant, so maxima and minima are chosen by the cosine. The minimum energy is for k=0k=0 and the maximum is at the Brillouin zone edge at k=π/ak=\pi/a. The center of the energy band is shifted by βs-\beta_{s} compared to EsE_{s} (typically a small shift). These are the edges of the band. The width of the band is 4ts4t_{s}, as seen by the fact that the cosine term goes from 2ts-2t_{s} to 2ts2t_{s}. This shows that it's the hopping term that determines how wide a band is in the tight-binding model. Hopping depends on the distance, increasing for small distances, so the closer the atoms are, the wider the bands are; this is logical, as closer atoms make transferring electrons easier and therefore conductivity higher.

You could then solve this same problem using pp orbitals instead. Then dd, then ff, and so on. Each orbital will give a brand new, separate bands, and because of that, the bands inherit the names of the orbitals: ss-band, pp-band, dd-band... The bottom of each band, the lowest energy level, corresponds to the energy of the bonding molecular orbitals of that type (σ\sigma for ss-band, π\pi for pp-band...), whereas the top corresponds to the energy of the antibonding one.

One thing that is very interesting to look at is that the bands look quite similar to the ones in the weak potential limit. They're not identical of course, but we can see that at the bottom of each period of the wave, the band looks more or less parabolic. In fact, for small kk around the minimum, the dispersion is, to first order, E(k)=E0+ta2k2E(k)=E_{0}+ta^{2}k^{2} for some constant E0E_{0}. A shifted parabola basically, just like for free electrons. This lends itself to describing electrons with kk near the minimum as free electrons with an effective mass mm^{*} such that their energy satisfies

2k22m=ta2k2m=22ta2\frac{\hbar^{2}k^{2}}{2m^{*}}=ta^{2}k^{2}\quad\Rightarrow \quad \boxed{m^{*}=\frac{\hbar^{2}}{2ta^{2}}}

So, if for whatever reason you only need to consider electrons near the band minimum, you can actually just dispense with the entire tight-binding model (except for tt, you need tt) and treat them as free electrons with a mass given the formula above. This is a weird result, because tight-binding is useful for insulators of all things. We somehow managed to describe an insulator, a material defined by its lack of freedom on electrons, with a free electron model. To quote A. Wilson:

We have the rather curious result that not only is it possible to obtain conduction with bound electrons, but it is also possible to obtain non-conduction with free electrons.

Footnotes

  1. They start to matter in high-energy conditions when the crystal starts to break down, such as a phase transition (e.g. melting).

  2. In 1D space it can't be any other way, but it does matter for a 1D object in 3D space.

  3. In this context, they're known as Born-von Karman boundary conditions and more generally they require that the wavefunction be periodic over the Bravais lattice.

  4. For more discussion on this, it's fascinating to notice a historical parallel here. Back in the late 1800s, physicists were hunting for the "luminiferous ether", this assumed substance that permeated the vacuum (thus not making it really vacuum) that allowed for electromagnetic radiation to occur. At the time, the idea of a wave was inconceivable without a medium. The wave is a distortion of the medium! And yet try as they might, they would be proved wrong by Millikan & Morley's experiment. Physicists needed to accept that some wave just exist without a medium. Well, this is it. This is the difference. We're now accustomed to photons being their own thing, so light traveling wherever it wants is perfectly normal. It's its own object that at most passes through other media. Meanwhile, the phonon discussion is just going through history in reverse. Now it's weird to think of a particle that can't exist on its own, even though historically that was always the "obviously correct" one. It's just an odd little consequence of the wave-particle duality. In fact, any wave of any kind could have this kind of treatment applied to it, regardless of whether it's medium-locked or not. The key takeaway here is that quasiparticles are "fake" and are really just a useful mathematical trick.

  5. This section is not particularly well-developed or well-explained. See Solid State Physics, Ashcroft & Mermin, Chapter 8, § Density of levels for a proper treatment.

  6. We are using qk\mathbf{q}\to \mathbf{k} and KK\mathbf{K}'\to \mathbf{K}. It's a purely aesthetic change.